Table 1.
OpenProt (v1.6) prediction pipeline output
Species | Genome assembly | Annotations | ORFeome (both annotations) | ||||
---|---|---|---|---|---|---|---|
NCBI RefSeq | Ensembl | Total # | Ref # | II_ # | IP_ # | ||
Homo sapiens | GRCh38.p12 | GRCh38.p12 | GRCh38.95 | 692 045 | 134 477 | 68 612 | 488 956 |
Pan troglodytes | Pan_tro_3.0 | Pan_tro_3.0 | Pan_tro_3.0.95 | 331 247 | 79 070 | 14 308 | 237 869 |
Mus musculus | GRCm38.p6 | GRCm38.p6 | GRCm38.95 | 558 632 | 87 339 | 40 870 | 430 423 |
Rattus norvegicus | Rnor_6.0 | Rnor_6.0 | Rnor_6.0.95 | 294 727 | 51 662 | 7872 | 235 193 |
Bos taurus | ARS-UCD1.2 | ARS-UCD1.2 | ARS-UCD1.2.95 | 285 565 | 67 753 | 11 382 | 206 430 |
Ovis aries | Oar_v3.1 | Oar_v3.1 | Oar_v3.1.95 | 162 972 | 30 283 | 6339 | 126 350 |
Danio rerio | GRCz11 | GRCz11 | GRCz11.95 | 287 990 | 68 272 | 11 896 | 207 822 |
Drosophila melanogaster | Release 6 plus ISO1 MT | BDGP6 | BDGP6.95 | 97 834 | 22 058 | 2125 | 73 651 |
Caenorhabditis elegans | WBcel235 | WBcel235 | WBcel235.95 | 94 890 | 28 516 | 3034 | 63 340 |
Saccharomyces cerevisiae S288c | R64-1-1 | R64 | R64-1-1.95 | 16 873 | 6615 | 28 | 10 230 |
Ref = currently annotated protein (RefProt); II_ = novel isoforms of known protein; IP_ = novel protein from alternative ORF (AltProt).