Table 1:
Prokaryotic benchmark datasets statistics. Code column indicates the subcellular location representation in our predictive model. Gram-negative bacteria have five major subcellular localization sites, namely, the cytoplasm, the periplasm, the inner membrane, the outer membrane, and the extracellular space, whereas Gram-positive bacteria do not have an outer cell membrane. However in these benchmark datasets cell wall is absent in Gram-negative dataset and in Gram-positive bacteria, we observe the lack of periplasm proteins.
No | Subcellular location | Code | Proteins count | |
---|---|---|---|---|
Gram negative | Gram positive | |||
1 | Cytoplasm | C | 4,152 | 349 |
2 | Extracellular | S | 272 | 290 |
3 | Inner membrane | I | 1,415 | 1,779 |
4 | Outer membrane | O | 346 | – |
5 | Periplasm | P | 422 | – |
6 | Cell wall | W | – | 34 |
7 | Vacuole | V | 10 | 4 |
Multiple localizations | 39 | 8 | ||
Total | 6,578 | 2,448 |