2005 Mar;69(1):124–154. doi: 10.1128/MMBR.69.1.124-154.2005

TABLE 2.

Dockerin-containing proteins found in the draft genome of C. thermocellum

| Protein family or putative function group | Gene product^a | Putative function and/or reading frame^b | Mol mass (kDa) | Reference(s) | Module structure^c |
|---|---|---|---|---|---|
| Structural component | CipA (+) | Scaffoldin, Chte02002467, ZP_00312244 | 197 | 88, 96, 374 | 2(Coh1)-CBM3a-7(Coh1)-UN-Doc2 |
| Cellulase/β-glucanase | CelO | Cellobiohydrolase, Chte02003367, ZP_00311446 | 75 | 372 | CBM3b-GH5-Doc1 |
| | - | Chte02003044, ZP_00311743 | 103 | | GH5-CBM6-Fn3-Doc1 |
| | CelB | Endoglucanase, Chte02000960, ZP_00313593 | 64 | 105 | GH5-Doc1 |
| | CelG (+) | Endoglucanase, Chte02001813, ZP_00312831 | 63 | 188 | GH5-Doc1 |
| | - | Chte02001497, ZP_00313148 | 58 | | GH5-Doc1 |
| | CelA (+) | Endoglucanase, Chte02000171, ZP_00314359 | 53 | 30 | GH8-Doc1 |
| | CbhA (+) | Cellobiohydrolase, Chte02001488, ZP_00313140 | 138 | 375 | CBM4-Ig-GH9-2(Fn3)-CBM3b-Doc1 |
| | CelK (+) | Cellobiohydrolase, Chte02001490, ZP_00313141 | 101 | 152, 374 | CBM4-Ig-GH9-Doc1 |
| | CelD | Endoglucanase, Chte02001777, ZP_00312895 | 72 | 143 | Ig-GH9-Doc1 |
| | - | Chte02001890, ZP_00312800 | 110 | | GH9-CBM3c-CBM3b-Doc1 |
| | - | Chte02001086, ZP_00313565 | 105 | | GH9-CBM3c-CBM3b-Doc1 |
| | CelN (+) | Endoglucanase, Chte02000571, ZP_00314035 | 82 | 373 | GH9-CBM3c-Doc1 |
| | CelR (+) | Endoglucanase, Chte02001003, ZP_00313635 | 85 | | GH9-CBM3c-Doc1 |
| | CelQ (+) | Endoglucanase, Chte02001251, ZP_00313301 | 80 | 9 | GH9-CBM3c-Doc1 |
| | CelF | Endoglucanase, Chte02000967, ZP_00313600 | 82 | 245 | GH9-CBM3c-Doc1 |
| | - | Chte02001891, ZP_00312801 | 80 | | GH9-CBM3c-Doc1 |
| | - | Chte02001467, ZP_00313120 | 89 | | GH9-CBM3c-Doc1 |
| | - | Chte02002786, ZP_00311957 | 82 | | GH9-CBM3c-Doc1 |
| | - | Chte02000166, ZP_00314354 | 63 | | GH9-Doc1 |
| | CelT (+) | Endoglucanase, Chte02001327, ZP_00313235 | 69 | 172 | GH9-Doc1 |
| | CelS (+) | Exoglucanase, Chte02001227, ZP_00313420 | 83 | 341 | GH48-Doc1 |
| Xylanases | XynD (+) | Xylanase, Chte02000364, ZP_00314122 | 72 | | CBM22-GH10-Doc1 |
| | XynC (+) | Xylanase, Chte02001156, ZP_00313489 | 70 | 120 | CBM22-GH10-Doc1 |
| | XynA, XynU (+) | Xylanase, Chte02001934, ZP_00312745 | 74 | 121 | GH11-CBM4-Doc1-NodB |
| | XynB, XynV (+) | Xylanase connecting error? | 50 | 121 | GH11-CBM4-Doc1 |
| Other hemicellulases | LicB (+) | Lichenase, Chte02000203, ZP_00314391 | 38 | 367 | GH16-Doc1 |
| | ChiA (+) | Chitinase, Chte02000170, ZP_00314358 | 55 | 366 | GH18-Doc1 |
| | ManA (+) | Mannanase, Chte02001325, ZP_00313234 | 67 | 114 | CBM-GH26-Doc1 |
| | - | Chte02000583, ZP_00314046 | 67 | | GH26-Doc1 |
| | - | Chte02002707, ZP_00312042 | 71 | | GH30-CBM6-Doc1 |
| | - | Chte02002259, ZP_00312443 | 47 | | GH53-Doc1 |
| | - | Chte02002200, ZP_00312458 | 86 | | GH81-Doc1 |
| Putative glycosidases | - | Chte02003048, ZP_00311747 | 104 | | GH2-CBM6-Doc1 |
| | - | Chte02003396, ZP_00311430 | 88 | | GH39-2(CBM6)-Doc1 |
| | - | Chte02003047, ZP_00311746 | 59 | | GH43-CBM6-Doc1 |
| | - | Chte02002202, ZP_00312459 | 64 | | GH43-CBM13-Doc1 |
| | - | Chte02000090, ZP_00314291 | 75 | | GH43-2(CBM6)-Doc1 |
| Xyloglucan hydrolase | XghA (+) | Xyloglucanase, Chte02002261, ZP_00312445 | 92 | | GH74-CBM2-Doc1 |
| Putative carbohydrate esterases | - | Chte02001642, ZP_00312970 | 91 | | Fn3-CE12-Doc1-CBM6-CE12 |
| | - | Chte02001749, ZP_00312868 | 55 | | CE3-CE3-Doc1 |
| | - | Chte02001822, ZP_00312838 | 55 | | Doc1-CE6 |
| | - | Chte02003045, ZP_00311744 | 54 | | CE1-CBM6-Doc1 |
| Putative pectinases | - | Chte02001171, ZP_00313369 | 92 | | GH28-Doc1 |
| | - | Chte02002538, ZP_00312205 | 60 | | PL1-Doc1-CBM6 |
| | - | Chte02000923, ZP_00313704 | 98 | | PL1-Doc1-CBM6-PL9 |
| | - | Chte02002537, ZP_00312204 | 42 | | PL10-UN-Doc1 |
| | - | Chte02002767, ZP_00311987 | 89 | | Doc1-CBM6-PL11 |
| Multifunctional components | CelJ (+) | Cellulase, Chte02001252, ZP_00313302 | 178 | 1 | CBM30-Ig-GH9-GH44-Doc1-UN |
| | CelH | Endoglucanase, Chte02003109, ZP_00311690 | 102 | 361 | GH26-GH5-CBM9-Doc1 |
| | - | Chte02003393, ZP_00311428 | 111 | | GH30-GH54-GH43-Doc1 |
| | - | Chte02001578, ZP_00313008; Chte02001577, ZP_00313007, frame shift? | 79 | | GH54-Doc1-GH43 |
| | - | Chte02003394, ZP_00311429 | 66 | | GH54-GH43-Doc1 |
| | XynZ (+) | Xylanase, Chte02002412, ZP_00312306 | 92 | 106 | CE1-CBM6-Doc1-GH10 |
| | XynY | Xylanase, Chte02002344, ZP_00312390 | 120 | 83 | CBM22-GH10-CBM22-Doc1-CE1 |
| | CelE (+) | Endoglucanase, Chte02001748, ZP_00312867 | 90 | 122 | GH5-Doc1-CE2 |
| Putative protease and protease inhibitors | - | Chte02001648, ZP_00312975 | 40 | | Subtilisin-like serine protease-Doc1 |
| | - | Chte02000228, ZP_00314411 | 64 | | Fn3-Doc1-serpin |
| | - | Chte02000229, ZP_00314412 | 68 | | Doc1-serpin |
| Components with unknown function | - | Chte02002760, ZP_00311980 | 117 | | 2(UN)-UN-UN(CelP 550-870)-Doc1 |
| | - | Chte02003046, ZP_00311745 | 105 | | UN-CBM6-Doc1 |
| | CseP (+) | Chte02000570, ZP_00314034 (structural component) | 62 | 373 | UN-Doc1 |
| | - | Chte02002500, ZP_00312225 | 58 | | Doc1-UN |
| | - | Chte02000182, ZP_00314370 | 51 | | Doc1-UN |
| | - | Chte02001566, ZP_00313102 | 76 | | Doc1-UN |
| | - | Chte02002663, ZP_00312096 | 65 | | 2(UN)-Doc1 |
| | - | Chte02003357, ZP_00311454 | 135 | | UN-UN-UN-Doc1 |
| | - | Chte02001465, ZP_00313118 | 40 | | Doc1-UN |
| | - | Chte02001653, ZP_00312979 | 47 | | UN-Doc1 |
| | - | Chte02000318, ZP_00314080 | 37 | | UN-Doc1 |
| | - | Chte02001119, ZP_00313455 | 236 | | 2(UN)-UN-Doc2 |
| | - | Chte02002121, ZP_00312553 | 19 | | Doc1-Doc1 |
^a The ORFs in the draft genome sequence that contain at least one dockerin module are listed. A gene designation indicates that the gene encoding the component has been cloned and the encoded protein has been biochemically characterized; a dash indicates that this is not the case. +, the presence of the component in the cellulosome has been experimentally verified.

^b The enzymatic activity or function (putative or known) and the ORF number (Chte) from REFSEQ accession number AABG00000000 on the NCBI website (http://www.ncbi.nlm.nih.gov/RefSeq/; database as of 17 June 2004) are provided. The Chte number indicates the locus tag in the genome sequence (nucleotide search RefSeq_DNA); the ZP number indicates the protein identification code (protein search RefSeq_Prot).

^c Module classification according to P. M. Coutinho and B. Henrissat (Carbohydrate-Active Enzymes server at http://afmb.cnrs-mrs.fr/CAZY/index.html, 1999). Coh, cohesin; Doc, dockerin; CBM, carbohydrate-binding module; GH, glycosyl hydrolase family; Fn3, fibronectin III; Ig, Ig-like fold; CE, carbohydrate esterase; PL, pectin lyase; UN, module with unknown function; serpin, serine protease inhibitor homologous module. Adapted from reference 368 with permission of Wiley-VCH Verlag.
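The module structure strings in the last column follow a simple pattern: modules are joined by hyphens, and a leading number with parentheses, as in 2(Coh1), appears to mean that the module is repeated that many times. The following is a minimal Python sketch of how such a string could be expanded into a flat list of modules. The function names `split_top_level` and `expand_modules` are hypothetical and not part of any published tool, and reading N(X) as N tandem copies of X is an assumption drawn from the table itself. Entries whose module names contain a hyphen outside parentheses (for example, "Subtilisin-like serine protease-Doc1") would be split incorrectly and need special handling.

```python
import re

# Matches repeat notation such as "2(Coh1)" or "7(Coh1)".
# Assumption: N(X) means N tandem copies of module X.
REPEAT = re.compile(r"^(\d+)\((.+)\)$")


def split_top_level(architecture: str) -> list[str]:
    """Split a module string on hyphens that are not inside parentheses."""
    parts, current, depth = [], [], 0
    for ch in architecture:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == "-" and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    parts.append("".join(current))
    return parts


def expand_modules(architecture: str) -> list[str]:
    """Expand repeat notation into one list entry per module."""
    modules = []
    for token in split_top_level(architecture):
        match = REPEAT.match(token)
        if match:
            modules.extend([match.group(2)] * int(match.group(1)))
        else:
            # Annotated modules such as "UN(CelP 550-870)" are kept as-is.
            modules.append(token)
    return modules


# Example with the CipA scaffoldin entry from the table.
cipa = expand_modules("2(Coh1)-CBM3a-7(Coh1)-UN-Doc2")
print(cipa.count("Coh1"), "Coh1 modules in CipA")
```

Splitting only on hyphens outside parentheses keeps annotated entries such as UN(CelP 550-870) intact, since the hyphen in the residue range sits inside the parentheses.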