Table 1.
Group | Species Code | Species Name | Mean length (aa) | SD (aa) | 10% percentile (aa) | 25% percentile (aa) | 50% percentile (aa) | 75% percentile (aa) | 90% percentile (aa) |
---|---|---|---|---|---|---|---|---|---|
ARCHAEA | ARC_PRO | Archaeoglobus profundus DSM5631 | 263 | 187 | 80 | 128 | 221 | 346 | 479 |
ARCHAEA | CAN_KOR | Candidatus Korarchaeum cryptofilum OPF8 | 296 | 191 | 104 | 160 | 262 | 379 | 501 |
ARCHAEA | CEN_SYM | Cenarchaeum symbiosum A | 308 | 535 | 74 | 117 | 213 | 348 | 521 |
ARCHAEA | DES_KAM | Desulfurococcus kamchatkensis 1221n | 272 | 188 | 75 | 129 | 238 | 369 | 499 |
ARCHAEA | MET_JAN | Methanococcus jannaschii | 283 | 204 | 98 | 149 | 241 | 365 | 492 |
ARCHAEA | NAN_EQU | Nanoarchaeum equitans Kin4-M | 276 | 203 | 91 | 142 | 225 | 352 | 512 |
ARCHAEA | SUL_ACI | Sulfolobus acidocaldarius DSM 639 | 284 | 183 | 96 | 146 | 249 | 375 | 511 |
ARCHAEA | THE_NEU | Thermoproteus neutrophilus V24Sta | 268 | 182 | 91 | 142 | 230 | 346 | 463 |
ARCHAEA | THE_VOL | Thermoplasma volcanium GSS1 | 297 | 198 | 98 | 157 | 258 | 390 | 518 |
BACTERIA | ACI_FER | Acidimicrobium ferrooxidans DSM 10331 | 322 | 203 | 109 | 174 | 287 | 415 | 553 |
BACTERIA | BAC_FRA | Bacteroides fragilis NCTC 9343 | 361 | 249 | 107 | 182 | 310 | 455 | 691 |
BACTERIA | BAC_SUB | Bacillus subtilis 168 | 294 | 266 | 85 | 145 | 254 | 382 | 504 |
BACTERIA | BIF_ADO | Bifidobacterium adolescentis ATCC 15703 | 369 | 233 | 136 | 218 | 325 | 461 | 654 |
BACTERIA | BRA_JAP | Bradyrhizobium japonicum USDA 110 | 317 | 229 | 107 | 170 | 277 | 403 | 552 |
BACTERIA | BUR_CEP | Burkholderia cepacia AMMD | 330 | 250 | 110 | 180 | 295 | 410 | 549 |
BACTERIA | CAM_JEJ | Campylobacter jejuni RM1221 | 294 | 202 | 83 | 150 | 254 | 392 | 538 |
BACTERIA | CHL_MUR | Chlamydia muridarum Nigg | 355 | 296 | 105 | 172 | 290 | 446 | 650 |
BACTERIA | COR_AUR | Corynebacterium aurimucosum ATCC 700975 | 325 | 225 | 105 | 177 | 283 | 417 | 557 |
BACTERIA | DEI_DES | Deinococcus deserti VCD115 | 314 | 209 | 117 | 169 | 274 | 395 | 552 |
BACTERIA | ESC_COL | Escherichia coli O157:H7 str. EC4115 | 287 | 236 | 58 | 121 | 239 | 384 | 548 |
BACTERIA | GLO_VIO | Gloeobacter violaceus PCC 7421 | 313 | 233 | 95 | 151 | 256 | 398 | 593 |
BACTERIA | HYD_THE | Hydrogenobacter thermophilus TK-6 | 293 | 198 | 93 | 149 | 251 | 389 | 540 |
BACTERIA | KOC_RHI | Kocuria rhizophila DC2201 | 337 | 213 | 118 | 189 | 300 | 434 | 578 |
BACTERIA | LEP_BIF | Leptospira biflexa Patoc 1 (Ames) | 338 | 216 | 123 | 184 | 292 | 430 | 611 |
BACTERIA | MYC_ABS | Mycobacterium abscessus | 317 | 250 | 115 | 174 | 273 | 400 | 524 |
BACTERIA | PER_MAR | Persephonella marina EX-H1 | 304 | 240 | 95 | 152 | 256 | 392 | 569 |
BACTERIA | STA_AUR | Staphylococcus aureus subsp. aureus MW2 | 298 | 285 | 84 | 149 | 254 | 385 | 522 |
BACTERIA | STR_AVE | Streptomyces avermitilis MA-4680 | 341 | 308 | 115 | 182 | 289 | 422 | 578 |
BACTERIA | SUL_DEL | Sulfurospirillum deleyianum DSM 6946 | 312 | 223 | 101 | 166 | 266 | 403 | 577 |
BACTERIA | SYN_SP | Synechocystis sp. PCC 6803 | 319 | 256 | 96 | 153 | 264 | 404 | 584 |
BACTERIA | THE_ELO | Thermosynechococcus elongatus BP-1 | 314 | 214 | 98 | 157 | 273 | 403 | 577 |
BACTERIA | THE_THE | Thermus thermophilus HB27 | 303 | 199 | 109 | 167 | 264 | 390 | 529 |
BACTERIA | XAN_CAM | Xanthomonas campestris pv. armoraciae | 311 | 258 | 59 | 134 | 257 | 412 | 623 |
APICOMPLEXA | CRY_PAR | Cryptosporidium parvum | 597 | 628 | 155 | 251 | 433 | 729 | 1192 |
APICOMPLEXA | PLA_FAL | Plasmodium falciparum | 753 | 866 | 145 | 253 | 453 | 930 | 1707 |
APICOMPLEXA | TOX_GON | Toxoplasma gondii | 682 | 766 | 139 | 224 | 441 | 843 | 1486 |
CILIOPHORA | PAR_TET | Paramecium tetraurelia | 457 | 438 | 127 | 205 | 348 | 541 | 854 |
CILIOPHORA | TET_THE | Tetrahymena thermophila | 649 | 660 | 110 | 229 | 456 | 839 | 1396 |
AMOEBOZOA | DIC_DIS | Dictyostelium discoideum | 533 | 513 | 92 | 198 | 392 | 702 | 1123 |
DIPLOMONADIDA | GUI_LAM | Giardia lamblia | 543 | 630 | 84 | 180 | 369 | 689 | 1110 |
PLACOZOA | TRI_ADH | Trichoplax adhaerens | 453 | 426 | 141 | 217 | 345 | 539 | 854 |
FUNGI_ASC | PIC_STI | Pichia stipitis | 492 | 346 | 161 | 263 | 416 | 613 | 893 |
FUNGI_ASC | SAC_CER | Saccharomyces cerevisiae | 497 | 382 | 137 | 239 | 409 | 632 | 951 |
FUNGI_ASC | TRI_REE | Trichoderma reesei | 491 | 452 | 154 | 262 | 408 | 600 | 891 |
FUNGI_BAS | LAC_BIC | Laccaria bicolor | 370 | 312 | 88 | 153 | 289 | 488 | 749 |
FUNGI_BAS | PHA_CHR | Phanerochaete chrysosporium strain RP78 | 456 | 327 | 157 | 246 | 373 | 556 | 856 |
FUNGI_BAS | UST_MAY | Ustilago maydis | 613 | 454 | 176 | 298 | 501 | 793 | 1198 |
STRAM_DIA | PHA_TRI | Phaeodactylum tricornutum | 462 | 343 | 162 | 249 | 381 | 562 | 841 |
STRAM_DIA | THA_PSE | Thalassiosira pseudonana | 499 | 424 | 159 | 249 | 391 | 613 | 947 |
STRAM_OOM | PHY_RAM | Phytophthora ramorum | 479 | 407 | 152 | 237 | 373 | 584 | 903 |
STRAM_OOM | PHY_SOJ | Phytophthora sojae | 502 | 447 | 146 | 234 | 382 | 616 | 986 |
CNIDARIA | NEM_VEC | Nematostella vectensis | 335 | 336 | 95 | 145 | 250 | 405 | 646 |
INSECTA | ANO_GAM | Anopheles gambiae | 529 | 547 | 132 | 223 | 389 | 632 | 1065 |
INSECTA | DRO_MEL | Drosophila melanogaster | 584 | 642 | 141 | 242 | 427 | 700 | 1164 |
NEMATODA | CAE_ELE | Caenorhabditis elegans | 444 | 484 | 124 | 211 | 342 | 522 | 820 |
NEMATODA | PRI_PAC | Pristionchus pacificus | 288 | 285 | 76 | 116 | 206 | 359 | 583 |
VERT_AVE | GAL_GAL | Gallus gallus | 490 | 508 | 108 | 184 | 346 | 608 | 1007 |
VERT_AVE | MEL_GAL | Meleagris gallopavo | 479 | 463 | 116 | 197 | 351 | 595 | 968 |
VERT_MAM | BOS_TAU | Bos taurus | 495 | 490 | 145 | 246 | 356 | 592 | 947 |
VERT_MAM | EQU_CAB | Equus caballus | 564 | 606 | 147 | 247 | 393 | 688 | 1139 |
VERT_MAM | HOM_SAP | Homo sapiens | 456 | 540 | 98 | 163 | 311 | 562 | 947 |
VERT_MAM | MON_DOM | Monodelphis domestica | 574 | 489 | 174 | 295 | 457 | 719 | 1069 |
VERT_MAM | ORN_ANA | Ornithorhynchus anatinus | 445 | 416 | 123 | 202 | 327 | 540 | 868 |
VERT_MAM | RAT_NOR | Rattus norvegicus | 520 | 500 | 130 | 224 | 374 | 643 | 1039 |
VERT_SAU | ANO_CAR | Anolis carolinensis | 462 | 436 | 128 | 207 | 346 | 559 | 903 |
VERT_TEL | DAN_RER | Danio rerio | 473 | 456 | 151 | 234 | 363 | 565 | 879 |
VERT_TEL | TAK_RUB | Takifugu rubripes | 634 | 536 | 215 | 324 | 494 | 780 | 1177 |
PLANT_BRY | PHY_PAT | Physcomitrella patens | 363 | 308 | 115 | 165 | 278 | 461 | 711 |
PLANT_CHL | CHL_REI | Chlamydomonas reinhardtii | 503 | 589 | 97 | 173 | 335 | 608 | 1074 |
PLANT_CHL | MIC_CCM | Micromonas CCMP1545 | 426 | 390 | 123 | 202 | 334 | 522 | 799 |
PLANT_CHL | MIC_RCC | Micromonas RCC299 | 485 | 475 | 146 | 236 | 371 | 571 | 920 |
PLANT_CHL | OST_LUC | Ostreococcus lucimarinus | 397 | 343 | 121 | 199 | 319 | 486 | 726 |
PLANT_CHL | OST_TAU | Ostreococcus tauri | 387 | 349 | 114 | 186 | 307 | 476 | 716 |
PLANT_DIC | ARA_THA | Arabidopsis thaliana | 403 | 299 | 115 | 202 | 345 | 513 | 749 |
PLANT_DIC | CAR_PAP | Carica papaya | 296 | 249 | 68 | 112 | 225 | 411 | 611 |
PLANT_DIC | GLY_MAX | Glycine max | 422 | 354 | 139 | 220 | 353 | 529 | 768 |
PLANT_DIC | MED_TRU | Medicago truncatula | 245 | 245 | 59 | 78 | 149 | 334 | 550 |
PLANT_DIC | POP_TRI | Populus trichocarpa | 375 | 292 | 101 | 167 | 306 | 490 | 732 |
PLANT_LYC | SEL_MOE | Selaginella moellendorffii | 382 | 300 | 124 | 191 | 316 | 481 | 699 |
PLANT_MON | BRA_DIS | Brachypodium distachyon | 428 | 303 | 146 | 223 | 361 | 537 | 788 |
PLANT_MON | ORY_SAT | Oryza sativa ssp. japonica | 448 | 389 | 108 | 174 | 332 | 574 | 960 |
PLANT_MON | SOR_BIC | Sorghum bicolor | 361 | 282 | 103 | 167 | 288 | 476 | 706 |
PLANT_MON | ZEA_MAY | Zea mays | 345 | 258 | 97 | 164 | 286 | 455 | 655 |
RHODOPHYTA | CYA_MER | Cyanidioschyzon merolae | 504 | 404 | 158 | 259 | 412 | 628 | 918 |
Statistical summary of protein lengths (in amino acids, aa) in the proteomes of dataset 1, which comprises 84 species (9 archaeal, 24 bacterial and 51 eukaryotic organisms). The mean, standard deviation (SD) and the 10%, 25%, 50%, 75% and 90% percentiles were calculated for each organism individually (see Methods).
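As an illustration of how one row of such a table could be derived, the following Python sketch computes the same summary statistics from a proteome in FASTA format. It is not the procedure used for Table 1: the function names (`fasta_lengths`, `percentile`, `summarize`) are hypothetical, the SD is assumed to be the sample standard deviation, and percentiles are assumed to use linear interpolation between ranks. The conventions actually used are those described in the Methods and may differ.

```python
import math
import statistics
from typing import Dict, Iterable, List, Sequence


def fasta_lengths(lines: Iterable[str]) -> List[int]:
    """Return the length in amino acids of every record in FASTA-formatted lines."""
    lengths: List[int] = []
    current = None  # None until the first header line has been seen
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if current is not None:
                lengths.append(current)
            current = 0
        elif line and current is not None:
            # Assumption: a trailing '*' marks a stop codon and is not counted.
            current += len(line.rstrip("*"))
    if current is not None:
        lengths.append(current)
    return lengths


def percentile(sorted_values: Sequence[float], p: float) -> float:
    """Percentile p (0..100) of pre-sorted values, using linear interpolation."""
    if not sorted_values:
        raise ValueError("percentile() needs at least one value")
    rank = (len(sorted_values) - 1) * p / 100
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * fraction


def summarize(lengths: Iterable[int]) -> Dict[str, float]:
    """Mean, sample SD and the 10/25/50/75/90% percentiles of protein lengths."""
    ordered = sorted(lengths)
    summary = {
        "mean": statistics.mean(ordered),
        "sd": statistics.stdev(ordered),  # sample SD; needs at least two proteins
    }
    for p in (10, 25, 50, 75, 90):
        summary[f"p{p}"] = percentile(ordered, p)
    return summary


if __name__ == "__main__":
    # Toy input; a real run would read one proteome file per species.
    toy_fasta = [">p1", "MKV", ">p2", "MKVLAA", "AA*", ">p3", "M"]
    print(summarize(fasta_lengths(toy_fasta)))
```

Applying `summarize` to each proteome separately gives one row per species. Because protein length distributions are strongly right-skewed (the mean exceeds the median in every row of the table), the percentiles are often more informative than the mean and SD alone.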