Table 1.
Numbers of uORFs, protein-coding genes, and assembled EST/TSA and RefSeq sequences extracted at each step of ESUCA.
| Step | Drosophila melanogaster | | | Danio rerio | | | Gallus gallus | | | Homo sapiens | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | uORF<sup>a</sup> | Gene | EST/TSA + RefSeq | uORF<sup>a</sup> | Gene | EST/TSA + RefSeq | uORF<sup>a</sup> | Gene | EST/TSA + RefSeq | uORF<sup>a</sup> | Gene | EST/TSA + RefSeq |
| Before selection | – | 13,938 | – | – | 25,206 | – | – | 14,697 | – | – | 19,956 | – |
| Step 1 | 17,035 | 7066 | – | 39,616 | 14,453 | – | 8929 | 3535 | – | 44,085 | 12,321 | – |
| Step 2 | 5040 | 2343 | – | 3599 | 2323 | – | 1320 | 767 | – | 15,069 | 6568 | – |
| Step 3.1 | 4900 | 2308 | 1,854,900 | 3494 | 2271 | 1,822,408 | 1275 | 751 | 668,417 | 14,529 | 6408 | 7,577,191 |
| Step 3.2 | 4882 | 2297 | 873,484 | 3479 | 2261 | 846,829 | 1271 | 750 | 314,665 | 14,499 | 6399 | 3,711,515 |
| Step 4.1 | 4307 | 2076 | 40,982 | 2549 | 1689 | 37,125 | 1122 | 668 | 42,622 | 13,993 | 6217 | 383,797 |
| Step 4.2 | 4294 | 2067 | 40,894 | 2543 | 1688 | 36,434 | 1119 | 665 | 41,306 | 13,970 | 6215 | 378,480 |
| Step 4.3 | 49 | 40 | 1212 | 408 | 343 | 4082 | 774 | 485 | 8171 | 5262 | 3067 | 33,776 |
| Step 5 | 49 | 40 | 1212 | 192 | 180 | 2798 | 261 | 221 | 4074 | 1495 | 1201 | 12,402 |
| Step 7 | 37 | 36 | 1072 | 156 | 154 | 2729 | 230 | 209 | 3945 | 1094 | 969 | 9964 |
<sup>a</sup> When multiple uORFs in a transcript shared the same stop or start codon, they were counted as one.