TABLE 1.
Gene | Predicted function of gene producta | Predicted localizationb | Mol massc (kDa) | Modular structured |
---|---|---|---|---|
CHU_0778 | Related to endo-β-1,4-glucanase | Unknown | 81.7 | CelD_N-GH9 |
CHU_1107 | Candidate endo-β-1,4-glucanase | Extracellular | 135.2 | GH5-X1-PKD-PKD-FN3 |
CHU_1280 | Candidate endo-β-1,4-glucanase | Unknown | 66.2 | CelD_N-GH9 |
CHU_1335 | Related to endo-β-1,4-glucanases | Extracellular/outer membrane | 207.3 | GH9-PKD-PKD-PKD-BIG-BIG-BIG-BIG-BIG-BIG-BIG-BIG-BIG-BIG-BIG-D5 |
CHU_1336 | Related to endo-β-1,4-glucanases | Extracellular | 105.3 | GH9-PKD-PKD-PKD |
CHU_1655 | Candidate endo-β-1,4-glucanase | Extracellular/outer membrane | 92.6 | CelD_N-GH9 |
CHU_1727 | Related to endo-β-1,4-glucanases | Unknown | 67.9 | GH5 |
CHU_2103 | Candidate endo-β-1,4-glucanase | Extracellular | 38.8 | GH5 |
CHU_2235 | Distantly related to endo-β-1,4-glucanases | Unknown | 64.2 | GH9 |
CHU_2268 | Candidate β-glucosidase | Periplasmic lipoprotein | 83.7 | GH3 |
CHU_2273 | Candidate β-glucosidase | Periplasmic lipoprotein | 89.7 | GH3 |
CHU_3577 | Candidate β-glucosidase | Periplasmic lipoprotein | 83.2 | GH3 |
CHU_3784 | Candidate β-glucosidase | Periplasmic | 81.8 | GH3 |
Assigned by routines used for updating the CAZY database (http://www.cazy.org/CAZY/), using the following criteria. Typically, ≥70% amino acid identity to a protein domain with a biochemically determined function at the time of analysis resulted in “candidate” status. A 30% to 70% amino acid identity to a protein domain with known function resulted in “related to” status. Less than 30% amino acid identity to a protein domain with known function resulted in “distantly related to” status. Because the threshold of similarity that correlates with a change of substrate specificity is variable from one glycoside hydrolase family to another, the criteria were tightened or loosened appropriately for several families. All analyses were conducted domain by domain because of the modular structure of many of the proteins.
Predicted using the default settings of PSORTb (21). Predicted lipoproteins were identified using LipoP (33).
Molecular mass of primary product of translation, including any predicted signal peptide.
Modular structure is indicated by the following abbreviations: BIG, bacterial immunoglobulin-like domain group 2 (Pfam number PF02368); CelD_N, N-terminal immunoglobulin-like domain of cellulase (Pfam number PF02927); D5, carboxy-terminal domain of Rhodothermus marinus xylanases predicted to be involved in attachment to the cell surface (34); FN3, fibronectin type 3 domain (Pfam number PF00041); GH, glycoside hydrolase (number indicates family), as assigned by CAZY; PKD, polycystic kidney disease protein PKD1 (Pfam number PF00801); and X1, conserved domain of unknown function.