The problem, as I say, is that most comp.bio problems need validation, and if you validate against a finite evaluation set you end up with a machine that just regurgitates the test data. I guess you could instead try to derive a (relatively) small number of rules that do the job.
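To make that concrete, here is a rough toy sketch, not anything from a real comp.bio pipeline. The sequences, the GC-majority labelling rule, and the names `memorizer` and `rule` are all made up for illustration. It only shows how a lookup table can score perfectly on a fixed evaluation set and still fall apart on sequences it has not seen, while one small rule carries over.

```python
# Hypothetical toy task: a sequence is labelled 1 if more than half
# of its bases are G or C. This labelling rule is an assumption made
# purely for the illustration.
def true_label(seq):
    gc = sum(base in "GC" for base in seq)
    return int(gc * 2 > len(seq))

# A fixed, finite evaluation set and some fresh sequences (made up).
eval_set = ["GGCA", "ATTA", "GCGC", "ATGA"]
fresh_set = ["CCGT", "TTAA", "GGGC", "TATC"]

# "Model" A: memorizes the evaluation set verbatim.
memorized = {seq: true_label(seq) for seq in eval_set}

def memorizer(seq):
    # Falls back to guessing 0 for anything it has not seen.
    return memorized.get(seq, 0)

# "Model" B: a single small rule instead of a lookup table.
def rule(seq):
    return int(sum(base in "GC" for base in seq) * 2 > len(seq))

def accuracy(model, seqs):
    # Fraction of sequences where the model matches the true label.
    return sum(model(s) == true_label(s) for s in seqs) / len(seqs)

if __name__ == "__main__":
    print("memorizer on eval set: ", accuracy(memorizer, eval_set))
    print("memorizer on fresh set:", accuracy(memorizer, fresh_set))
    print("rule on fresh set:     ", accuracy(rule, fresh_set))
```

The rule here matches the labelling by construction, so it is not evidence that small rule sets work in practice; it only shows why a score on the fixed set alone cannot tell the two apart.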