Table 1. Sequence validation rates of predictions.
Gene prediction set | Total no. of predictions* | No. of predictions tested | No. sequence validated (%)† |
---|---|---|---|
sjc set | 197 | 171 | 64 (37.4) |
Heidelberg set‡ | 1,266 | 160 | 18 (11.3) |
Homol-2 set | 362 | 209 | 28 (13.4) |
Homol-0 set§ | 9,577 | 204 | 12 (5.9) |
Total predictions | 11,402 | 744 | 122 (16.4) |
Controls | 159 | 159 | 154 (96.9) |
See Table 2 for specific numbers from genscan vs. fgenesh predictions
Gene predictions were considered validated if the aligned sequence of the PCR product was consistent with a spliced gene model in the region of the prediction
Only 1,266 multiexon predictions from the 2,636 predictions described by Hild et al. (2) were considered for analysis, and, of these, we tested only the 160 with the highest priority scores that did not overlap any genscan or fgenesh predictions tested in the other sets
The homol-0 set was selected to be representative of the full range of priority scores