Table 2.
Strains | RefSeq proteins | RefSeq sProteinsa | Extensions to RefSeq sProteinsa | Additional Prodigal sProteinsa | Additional ChemGenome sProteinsa | Additional in silico sProteinsa | Total iPtgxDB annotation clusters | Total iPtgxDB sProtein annotation clustersa |
---|---|---|---|---|---|---|---|---|
A. caccae | 3440 | 295 | 129 | 106 | 2398 | 78,654 | 90,548 | 81,582 |
B. longum | 1728 | 85 | 55 | 175 | 1338 | 43,506 | 59,156 | 45,159 |
B. producta | 5682 | 577 | 305 | 306 | 3749 | 140,819 | 165,184 | 145,756 |
B. thetaiotaomicron | 4941 | 463 | 274 | 279 | 3814 | 128,815 | 146,729 | 133,645 |
C. butyricum | 4148 | 391 | 131 | 187 | 282 | 68,564 | 75,453 | 69,555 |
C. ramosum | 3025 | 281 | 131 | 110 | 218 | 52,203 | 56,936 | 52,943 |
E. coli K-12 | 4411 | 551 | 295 | 128 | 3485 | 106,268 | 123,548 | 110,727 |
L. plantarum | 3067 | 384 | 176 | 92 | 1611 | 72,764 | 80,137 | 75,027 |
Combined iPtgxDB | 30,442 | 3027 | 1496 | 1383 | 16,895 | 691,593 | 797,691 | 714,394 |
aDue to our focus on novel sProtein discovery, we list the respective number of sProteins for a given category