Skip to main content
. 2020 Oct 1;10:16289. doi: 10.1038/s41598-020-73307-6

Table 1.

Numbers of uORFs, protein-coding genes, and assembled EST/TSA and RefSeq sequences extracted at each step of ESUCA.

Step Drosophila melanogaster Danio rerio Gallus gallus Homo sapiens
uORFa Gene EST/TSA + RefSeq uORFa Gene EST/TSA + RefSeq uORFa Gene EST/TSA + RefSeq uORFa Gene EST/TSA + RefSeq
Before selection 13,938 25,206 14,697 19,956
Step 1 17,035 7066 39,616 14,453 8929 3535 44,085 12,321
Step 2 5040 2343 3599 2323 1320 767 15,069 6568
Step 3.1 4900 2308 1,854,900 3494 2271 1,822,408 1275 751 668,417 14,529 6408 7,577,191
Step 3.2 4882 2297 873,484 3479 2261 846,829 1271 750 314,665 14,499 6399 3,711,515
Step 4.1 4307 2076 40,982 2549 1689 37,125 1122 668 42,622 13,993 6217 383,797
Step 4.2 4294 2067 40,894 2543 1688 36,434 1119 665 41,306 13,970 6215 378,480
Step 4.3 49 40 1212 408 343 4082 774 485 8171 5262 3067 33,776
Step 5 49 40 1212 192 180 2798 261 221 4074 1495 1201 12,402
Step 7 37 36 1072 156 154 2729 230 209 3945 1094 969 9964

aWhen multiple uORFs in a transcript shared the same stop or start codon, they were counted as one.