Ca. P. polyenzymogenes encodes a cellulolytic gene cluster. a Gene organisation of a putative cellulolytic gene cluster encoding four GH5s with varying modular arrangements. For multi-modular ORFs, both C-terminal (c) and N-terminal (n) domains are indicated, whereas “CTD” denotes a carboxy-terminal domain that infers export via T9SS. Hyp indicates domains with no known function. b Products released from crystalline cellulose (Avicel) by Cel5A, Cel5B, Cel5C_N, and Cel5D enzymes. A total of 1% (w/v) Avicel was incubated with 1 μM enzyme in 20 mM citrate buffer, pH 5.5 at 40 °C with 1000 rpm horizontal shaking. Products were analyzed by HPAEC-PAD at given time points after stopping the reactions by addition of NaOH to 0.1 M. Error bars represent standard deviations between three replicates (hidden by the markers). c Degradation of Glc(6) by Cel5B and Cel5C_N enzymes, illustrating differences in substrate specificity. Glc(6) (0.1 mg/ml) was incubated with 0.25 μM enzyme in 20 mM citrate buffer pH 5.5. Samples were taken at indicated intervals, and the reaction was stopped by adding NaOH to 0.1 M. Products were analyzed using HPAEC-PAD with cellodextrins as standards. A more complete analysis for all four cellulases shown in Additional file 13: Figure S7. d Crystal structure of Cel5C_N. Surface view of the structure, seen from above the active-site cleft, and − 45° rotated. The surface of the catalytic residues is shown in red. Figures were created using The PyMOL Molecular Graphics System, Version 1.3 Schrödinger, LLC