Table 3.
Mining for candidate mucin-degrading enzymes in seven sequenced genomes
| Candidate mucin-degrading enzymes4 | ||||||||||
| Phylum | Species name1 | Size (Mbp) | # CDS | # Signal P2 (%) | # UF3 (%) | GH | P | Su | Si | Total |
| Verrucomicrobia | A. muciniphila | 2.7 | 2176 | 576 (26) | 270 (47) | 35 | 13 | 11 | 2 | 61 |
| Bacteroidetes | B. vulgatus | 5.2 | 4076 | 1200 (29) | 510 (43) | 145 | 42 | 4 | 7 | 198 |
| B. thetaiotaomicron | 6.3 | 4816 | 1574 (33) | 777 (49) | 171 | 30 | 11 | 9 | 221 | |
| P. gingivalis | 2.4 | 2090 | 451 (22) | 248 (55) | 9 | 28 | 0 | 1 | 38 | |
| Proteobacteria | H. pylori | 1.6 | 1494 | 311 (21) | 141 (45) | 0 | 6 | 0 | 0 | 6 |
| Firmicutes | S. mutans | 2.0 | 1960 | 331 (17) | 140 (42) | 4 | 10 | 0 | 0 | 14 |
| Actinobacteria | Bif. longum subsp. infantis | 2.8 | 2486 | 462 (19) | 181 (39) | 6 | 7 | 1 | 0 | 15 |
The accession numbers for the different species are NC_010655 (A. muciniphila), NC_009614 (B. vulgatus), NC_004663 (B. thetaiotaomicron), NC_010729 (P. gingivalis), NC_000921 (H. pylori), NC_004350 (S. mutans), NC_011593 (B. longum subsp. infantis).
These numbers (percentages) represent the protein coding genes that encode a putative signal peptide.
These numbers (percentages) represent the protein coding genes that encode a putative signal peptide and lack a predicted function.
The candidate mucin-degrading enzymes are categorized into four broad categories; glycosyl hydrolases (GH), proteases/peptidases (P), sulfatases (Su) and sialidases/neuraminidases (Si). For details of the list, see Table S2.