2012 Feb 1;5:85. doi: 10.1186/1756-0500-5-85

Table 1.

Protein size summary.

Protein length in amino acids (aa): mean, standard deviation (SD) and length percentiles (10% to 90%).
Group Species Code Species Name Mean SD 10% 25% 50% 75% 90%
ARCHAEA ARC_PRO Archaeoglobus profundus DSM5631 263 187 80 128 221 346 479
ARCHAEA CAN_KOR Candidatus Korarchaeum cryptofilum OPF8 296 191 104 160 262 379 501
ARCHAEA CEN_SYM Cenarchaeum symbiosum A 308 535 74 117 213 348 521
ARCHAEA DES_KAM Desulfurococcus kamchatkensis 1221n 272 188 75 129 238 369 499
ARCHAEA MET_JAN Methanococcus jannaschii 283 204 98 149 241 365 492
ARCHAEA NAN_EQU Nanoarchaeum equitans Kin4-M 276 203 91 142 225 352 512
ARCHAEA SUL_ACI Sulfolobus acidocaldarius DSM 639 284 183 96 146 249 375 511
ARCHAEA THE_NEU Thermoproteus neutrophilus V24Sta 268 182 91 142 230 346 463
ARCHAEA THE_VOL Thermoplasma volcanium GSS1 297 198 98 157 258 390 518
BACTERIA ACI_FER Acidimicrobium ferrooxidans DSM 10331 322 203 109 174 287 415 553
BACTERIA BAC_FRA Bacteroides fragilis NCTC 9343 361 249 107 182 310 455 691
BACTERIA BAC_SUB Bacillus subtilis 168 294 266 85 145 254 382 504
BACTERIA BIF_ADO Bifidobacterium adolescentis ATCC 15703 369 233 136 218 325 461 654
BACTERIA BRA_JAP Bradyrhizobium japonicum USDA 110 317 229 107 170 277 403 552
BACTERIA BUR_CEP Burkholderia cepacia AMMD 330 250 110 180 295 410 549
BACTERIA CAM_JEJ Campylobacter jejuni RM1221 294 202 83 150 254 392 538
BACTERIA CHL_MUR Chlamydia muridarum Nigg 355 296 105 172 290 446 650
BACTERIA COR_AUR Corynebacterium aurimucosum ATCC 700975 325 225 105 177 283 417 557
BACTERIA DEI_DES Deinococcus deserti VCD115 314 209 117 169 274 395 552
BACTERIA ESC_COL Escherichia coli O157:H7 str. EC4115 287 236 58 121 239 384 548
BACTERIA GLO_VIO Gloeobacter violaceus PCC 7421 313 233 95 151 256 398 593
BACTERIA HYD_THE Hydrogenobacter thermophilus TK-6 293 198 93 149 251 389 540
BACTERIA KOC_RHI Kocuria rhizophila DC2201 337 213 118 189 300 434 578
BACTERIA LEP_BIF Leptospira biflexa Patoc 1 (Ames) 338 216 123 184 292 430 611
BACTERIA MYC_ABS Mycobacterium abscessus 317 250 115 174 273 400 524
BACTERIA PER_MAR Persephonella marina EX-H1 304 240 95 152 256 392 569
BACTERIA STA_AUR Staphylococcus aureus aureus MW2 298 285 84 149 254 385 522
BACTERIA STR_AVE Streptomyces avermitilis MA-4680 341 308 115 182 289 422 578
BACTERIA SUL_DEL Sulfurospirillum deleyianum DSM 6946 312 223 101 166 266 403 577
BACTERIA SYN_SP Synechocystis sp. PCC 6803 319 256 96 153 264 404 584
BACTERIA THE_ELO Thermosynechococcus elongatus BP-1 314 214 98 157 273 403 577
BACTERIA THE_THE Thermus thermophilus HB27 303 199 109 167 264 390 529
BACTERIA XAN_CAM Xanthomonas campestris pv armoraciae 311 258 59 134 257 412 623
APICOMPLEXA CRY_PAR Cryptosporidium parvum 597 628 155 251 433 729 1192
APICOMPLEXA PLA_FAL Plasmodium falciparum 753 866 145 253 453 930 1707
APICOMPLEXA TOX_GON Toxoplasma gondii 682 766 139 224 441 843 1486
CILIOPHORA PAR_TET Paramecium tetraurelia 457 438 127 205 348 541 854
CILIOPHORA TET_THE Tetrahymena thermophila 649 660 110 229 456 839 1396
AMOEBOZOA DIC_DIS Dictyostelium discoideum 533 513 92 198 392 702 1123
DIPLOMONADIDA GUI_LAM Giardia lamblia 543 630 84 180 369 689 1110
PLACOZOA TRI_ADH Trichoplax adhaerens 453 426 141 217 345 539 854
FUNGI_ASC PIC_STI Pichia stipitis 492 346 161 263 416 613 893
FUNGI_ASC SAC_CER Saccharomyces cerevisiae 497 382 137 239 409 632 951
FUNGI_ASC TRI_REE Trichoderma reesei 491 452 154 262 408 600 891
FUNGI_BAS LAC_BIC Laccaria bicolor 370 312 88 153 289 488 749
FUNGI_BAS PHA_CHR Phanerochaete chrysosporium strain RP78 456 327 157 246 373 556 856
FUNGI_BAS UST_MAY Ustilago maydis 613 454 176 298 501 793 1198
STRAM_DIA PHA_TRI Phaeodactylum tricornutum 462 343 162 249 381 562 841
STRAM_DIA THA_PSE Thalassiosira pseudonana 499 424 159 249 391 613 947
STRAM_OOM PHY_RAM Phytophthora ramorum 479 407 152 237 373 584 903
STRAM_OOM PHY_SOJ Phytophthora sojae 502 447 146 234 382 616 986
CNIDARIA NEM_VEC Nematostella vectensis 335 336 95 145 250 405 646
INSECTA ANO_GAM Anopheles gambiae 529 547 132 223 389 632 1065
INSECTA DRO_MEL Drosophila melanogaster 584 642 141 242 427 700 1164
NEMATODA CAE_ELE Caenorhabditis elegans 444 484 124 211 342 522 820
NEMATODA PRI_PAC Pristionchus pacificus 288 285 76 116 206 359 583
VERT_AVE GAL_GAL Gallus gallus 490 508 108 184 346 608 1007
VERT_AVE MEL_GAL Meleagris gallopavo 479 463 116 197 351 595 968
VERT_MAM BOS_TAU Bos taurus 495 490 145 246 356 592 947
VERT_MAM EQU_CAB Equus caballus 564 606 147 247 393 688 1139
VERT_MAM HOM_SAP Homo sapiens 456 540 98 163 311 562 947
VERT_MAM MON_DOM Monodelphis domestica 574 489 174 295 457 719 1069
VERT_MAM ORN_ANA Ornithorhynchus anatinus 445 416 123 202 327 540 868
VERT_MAM RAT_NOR Rattus norvegicus 520 500 130 224 374 643 1039
VERT_SAU ANO_CAR Anolis carolinensis 462 436 128 207 346 559 903
VERT_TEL DAN_RER Danio rerio 473 456 151 234 363 565 879
VERT_TEL TAK_RUB Takifugu rubripes 634 536 215 324 494 780 1177
PLANT_BRY PHY_PAT Physcomitrella patens 363 308 115 165 278 461 711
PLANT_CHL CHL_REI Chlamydomonas reinhardtii 503 589 97 173 335 608 1074
PLANT_CHL MIC_CCM Micromonas CCMP1545 426 390 123 202 334 522 799
PLANT_CHL MIC_RCC Micromonas RCC299 485 475 146 236 371 571 920
PLANT_CHL OST_LUC Ostreococcus lucimarinus 397 343 121 199 319 486 726
PLANT_CHL OST_TAU Ostreococcus tauri 387 349 114 186 307 476 716
PLANT_DIC ARA_THA Arabidopsis thaliana 403 299 115 202 345 513 749
PLANT_DIC CAR_PAP Carica papaya 296 249 68 112 225 411 611
PLANT_DIC GLY_MAX Glycine max 422 354 139 220 353 529 768
PLANT_DIC MED_TRU Medicago truncatula 245 245 59 78 149 334 550
PLANT_DIC POP_TRI Populus trichocarpa 375 292 101 167 306 490 732
PLANT_LYC SEL_MOE Selaginella moellendorffii 382 300 124 191 316 481 699
PLANT_MON BRA_DIS Brachypodium distachyon 428 303 146 223 361 537 788
PLANT_MON ORY_SAT Oryza sativa ssp. japonica 448 389 108 174 332 574 960
PLANT_MON SOR_BIC Sorghum bicolor 361 282 103 167 288 476 706
PLANT_MON ZEA_MAY Zea mays 345 258 97 164 286 455 655
RHODOPHYTA CYA_MER Cyanidioschyzon merolae 504 404 158 259 412 628 918

Statistical summary of protein length values in the proteomes of dataset 1, which includes 84 species (9 archaeal, 24 bacterial and 51 eukaryotic organisms). The mean, standard deviation (SD) and the 10%, 25%, 50%, 75% and 90% percentiles were calculated for each organism individually (see Methods).
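As an illustration of how one row of such a summary could be computed, the following is a minimal Python sketch, not the authors' actual pipeline. It assumes the proteome is available as a FASTA-formatted text stream. The function names (`read_fasta_lengths`, `percentile`, `summarize`) are hypothetical. Using the sample standard deviation and linear-interpolation percentiles is also an assumption, since the caption does not state which estimators were used.

```python
import statistics
from io import StringIO


def read_fasta_lengths(handle):
    """Yield the length (in residues) of each record in a FASTA text stream."""
    length = None  # None until the first header line is seen
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if length is not None:
                yield length  # emit the previous record
            length = 0
        elif line and length is not None:
            length += len(line)  # sequences may span several lines
    if length is not None:
        yield length  # emit the last record


def percentile(sorted_values, p):
    """Percentile p (0-100) by linear interpolation between closest ranks.

    Assumption: the source does not say which percentile definition was used.
    """
    if not sorted_values:
        raise ValueError("no values")
    k = (len(sorted_values) - 1) * p / 100
    lo = int(k)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (k - lo)


def summarize(lengths):
    """Return mean, SD and the 10/25/50/75/90% percentiles of protein lengths."""
    values = sorted(lengths)
    summary = {
        "mean": statistics.mean(values),
        # Assumption: sample SD (n - 1); needs at least two proteins.
        "sd": statistics.stdev(values),
    }
    for p in (10, 25, 50, 75, 90):
        summary[f"p{p}"] = percentile(values, p)
    return summary


# Toy proteome with three made-up proteins of length 2, 4 and 6 aa.
demo = StringIO(">p1\nMK\n>p2\nMKTA\n>p3\nMKT\nAYI\n")
print(summarize(read_fasta_lengths(demo)))
```

Running `summarize` once per proteome file would give one table row per organism. Values in Table 1 appear rounded to whole residues, so a final `round()` would be needed to match that presentation.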