Table 2.
Cross-Validation for the Discrimination Between Genes and Pseudogenes from KA/KS Benchmark Distributions
Known test fractions
|
Training set
|
Estimated fractions of pseudogenes
|
|||
---|---|---|---|---|---|
Functional | Pseudogene | Functional | Pseudogene | Averagea | SDb |
1000 | 0 | 659 | 1703 | 16.1 | 15 |
900 | 100 | 759 | 1603 | 109.6 | 24.8 |
800 | 200 | 859 | 1503 | 205.4 | 20.5 |
700 | 300 | 959 | 1403 | 310.2 | 23.7 |
600 | 400 | 1059 | 1303 | 401.9 | 19.7 |
500 | 500 | 1159 | 1203 | 500.8 | 21.3 |
400 | 600 | 1259 | 1103 | 600.5 | 24.2 |
300 | 700 | 1359 | 1003 | 694.7 | 24.1 |
200 | 800 | 1459 | 903 | 793.2 | 26.1 |
100 | 900 | 1559 | 803 | 893.2 | 26.4 |
0 | 1000 | 1659 | 703 | 983.9 | 20 |
Average estimation of the 100 iterations
Standard Deviation from the complete test set