Skip to main content
. 2021 Feb 23;9:55. doi: 10.1186/s40168-020-00981-z

Table 2.

Composition of the multi-species iPtgxDB (trypsin)

Strains RefSeq proteins RefSeq sProteinsa Extensions to RefSeq sProteinsa Additional Prodigal sProteinsa Additional ChemGenome sProteinsa Additional in silico sProteinsa Total iPtgxDB annotation clusters Total iPtgxDB sProtein annotation clustersa
A. caccae 3440 295 129 106 2398 78,654 90,548 81,582
B. longum 1728 85 55 175 1338 43,506 59,156 45,159
B. producta 5682 577 305 306 3749 140,819 165,184 145,756
B. thetaiotaomicron 4941 463 274 279 3814 128,815 146,729 133,645
C. butyricum 4148 391 131 187 282 68,564 75,453 69,555
C. ramosum 3025 281 131 110 218 52,203 56,936 52,943
E. coli K-12 4411 551 295 128 3485 106,268 123,548 110,727
L. plantarum 3067 384 176 92 1611 72,764 80,137 75,027
Combined iPtgxDB 30,442 3027 1496 1383 16,895 691,593 797,691 714,394

aDue to our focus on novel sProtein discovery, we list the respective number of sProteins for a given category