Abstract
Lytic polysaccharide monooxygenases (LPMOs) are redox-enzymes involved in biomass degradation. All characterized LPMOs possess an active site of two highly conserved histidine residues coordinating a copper ion (the histidine brace), which are essential for LPMO activity. However, some protein sequences that belong to the AA9 LPMO family display a natural N-terminal His to Arg substitution (Arg-AA9). These are found almost entirely in the phylogenetic fungal class Agaricomycetes, associated with wood decay, but no function has been demonstrated for any Arg-AA9. Through bioinformatics, transcriptomic, and proteomic analyses we present data, which suggest that Arg-AA9 proteins could have a hitherto unidentified role in fungal degradation of lignocellulosic biomass in conjunction with other secreted fungal enzymes. We present the first structure of an Arg-AA9, LsAA9B, a naturally occurring protein from Lentinus similis. The LsAA9B structure reveals gross changes in the region equivalent to the canonical LPMO copper-binding site, whereas features implicated in carbohydrate binding in AA9 LPMOs have been maintained. We obtained a structure of LsAA9B with xylotetraose bound on the surface of the protein although with a considerably different binding mode compared with other AA9 complex structures. In addition, we have found indications of protein phosphorylation near the N-terminal Arg and the carbohydrate-binding site, for which the potential function is currently unknown. Our results are strong evidence that Arg-AA9s function markedly different from canonical AA9 LPMO, but nonetheless, may play a role in fungal conversion of lignocellulosic biomass.
Keywords: polysaccharide, copper monooxygenase, crystal structure, glycobiology, phosphorylation, biomass degradation, His-brace, N-terminal Arg-AA9, xylooligosaccharide
Introduction
Fungal degradation of plant biomass involves a collection of carbohydrate active enzymes (CAZymes),2 which include glycoside hydrolases, carbohydrate esterases, polysaccharide lyases, and oxidoreductases. Lytic polysaccharide monooxygenases (LPMOs) are copper-dependent oxidoreductases shown to be pivotal in efficient plant biomass degradation (1). These enzymes activate molecular oxygen or hydrogen peroxide to introduce oxidative chain breaks into polysaccharide chains (2, 3) (Fig. 1). Some LPMOs are single domain enzymes, whereas others are linked to CBMs (carbohydrate binding modules) or domains of unknown function (4, 5). LPMOs are classified into seven Auxiliary Activity families in the CAZy database, AA9-AA11 and AA13-AA16 (6), and have been found in bacteria, viruses, fungi, and recently in arthropod species and in plants (7, 8). In the active site a copper ion is held by a motif denoted the histidine brace (His-brace) formed by the highly conserved N-terminal histidine and a second histidine later in the sequence (9). The His-brace motif is strictly conserved in all LPMOs studied to date (10–12) and is considered essential for LPMO activity. This is supported by mutational analysis of both AA9 and AA10 LPMOs (first performed on TtAA9E and SmAA10A, followed by later studies on MtAA9D and ScAA10C) in which replacement of these critical residues within the His-brace or the secondary coordination sphere of the copper ion resulted in reduced LPMO activity and hence reduced glycoside hydrolase boosting (13–16).
Figure 1.
Schematic representation of oxidative cleavage of cellooligosaccharides/cellulose by LPMOs.
LPMOs display different substrate specificities, with activity demonstrated on cellulose, hemicelluloses, chitin, and starch (2, 9, 14, 17–19) and interact with their polysaccharide substrates through a surface on which the His-brace is found (20–24). This surface can have varying contour and polarity, presumably dictating substrate specificity (19, 23, 25, 26). Additionally, for some AA9 LPMOs activity has been demonstrated on soluble β-1,4-linked polysaccharides and also oligosaccharides (17, 22, 23, 27, 28), a feature that was instrumental in the determination of the first enzyme-substrate complex structures of an AA9 from Lentinus similis (LsAA9A) (22, 29). The structures revealed polar residues in a loop denoted L3 (after notation in Ref. 25) interacting with cellooligosaccharides near the active site and demonstrated for the first time that a conserved Tyr (Tyr-203 in LsAA9A) located on the same surface was involved in substrate binding (22, 23). In addition these structures revealed a new and highly unusual lone pair- π aromatic interaction between the pyranose ring O5 and the imidazole of the N-terminal His of the His-brace, further highlighting its importance for LPMO function (29).
Both in Basidiomycetes and Ascomycetes fungal species multiple AA9-encoding genes are found (more than 20 and 30 in Schizophyllum commune and Podospora anserina, respectively), the vast majority of which encode proteins with N-terminal His (either demonstrated or predicted). For several of these genes the expression and subsequent secretion of the proteins is readily induced by growth on plant biomass (30). Interestingly, some AA9 members have an intriguing substitution of the N-terminal His to an Arg (from here on referred to as Arg-AA9). This can be observed in genomes of species belonging to the Lentinus genus, where for example, Lentinus tigrinis has 16 AA9s one of which is an Arg-AA9 (https://genome.jgi.doe.gov/mycocosm/proteins-browser/browse;3gQkyP?p=Lenti7_1). Similarly, for L. similis at least seven proteins classified as AA9s are described in patent literature (31, 32)), and one of these proteins (GH61-5 in Ref. 32 and denoted LsAA9B from this point) has an N-terminal Arg.
This type of His to Arg substitution was first noted by Yakovlev and co-workers (33) in the Russulales fungus Heterobasidion irregulare in an AA9 denoted HiGH61G (HiAA9G from this point). Yakovlev et al. (33) found that genes encoding six AA9s, including HiAA9G, were up-regulated during growth on lignocellulosic biomass. We have previously reported comprehensive integrative omics studies of the saprotrophic white-rot fungi Pycnoporus coccineus BRFM310 (34) and Polyporus brumalis BRFM985 (35) of the Polyporales order. There we reported co-regulation of glycoside hydrolases (GH) commonly involved plant biomass breakdown (e.g. GH5_5) and Arg-AA9 members (protein ID 1430659 and 1403153 on the JGI Mycocosm public database), which in each species was highly up-regulated during fungal growth on wheat straw (for the latter during solid-state fermentation). In addition, in a recent bioinformatics study (5) it was shown that Arg-AA9s are found in several fungal species almost entirely restricted to one phylogenetic class of largely wood-decaying members namely the Agaricomycetes. These Arg-AA9 sequences clustered together (cluster 63 in Ref. 5), and appear to have been maintained for a novel function rather than simply being a decaying AA9 gene product. This indicates a putative biological function for the Arg-AA9s related to lignocellulosic biomass degradation, but to date no function has been ascribed to this novel AA9 subgroup.
Using bioinformatics, transcriptomic, and secretomic approaches, we have obtained data supporting a potential role for Arg-AA9s from Agaricomycetes species in plant biomass conversion. We present a high-resolution X-ray crystal structure of a naturally occurring protein from the Polyporale fungus L. similis, LsAA9B, the first for any AA9 with an N-terminal Arg. The LsAA9B structure has an overall fold highly similar to AA9 LPMOs, but in addition to the N-terminal His to Arg substitution, reveals extensive changes in residues equivalent to those forming the copper-binding site and the secondary coordination sphere of the copper (essential for fully functioning AA9 LPMOs). Thus, perhaps predictably, no LPMO activity could be detected for LsAA9B. Unexpectedly, however, the structure appears with a post-translational modification in the form of a phosphoserine (pSer-25), in a position adjacent to the N-terminal Arg side chain. In addition, we have obtained a complex structure of LsAA9B with xylotetraose (Xyl4) bound at the protein surface in a conserved cleft near the speculated phosphorylation site. Sequence alignment indicates also that other Arg-AA9 proteins contain potentially similar putative phosphorylation sites and carbohydrate-binding sites.
Our results strongly suggest a role related to biomass degradation for these Arg-AA9 proteins, which seem to be strikingly different compared with canonical AA9 LPMOs. The functional significance of both the N-terminal His to Arg substitution, the phosphoserine, and the Xyl4 ligand require further investigation, but the characterization and structure determination of LsAA9B presented here will serve as an important starting point for the elucidation of the function and exact biological role of Arg-AA9s.
Results
Phylogenetic distribution and regulation of AA9s with N-terminal Arg
To assess the phylogenetic distribution of fungal AA9 family sequences with N-terminal Arg (Arg-AA9s), we analyzed AA9 sequences belonging to cluster 63 (5) in the CAZy database (36), which we found were restricted to species of the Polyporales, Agaricales, and Russulales orders (all of the Agaricomycetes class in Basidiomycetes fungi). Notably, the sequence alignment showed that residues critical for function in canonical AA9 LPMOs (His-1, His-68, His-142, Gln-151, and Tyr-153 in TtAA9E (14)) were all changed in the Arg-AA9 sequences (Fig. S1). Interestingly, among the Agaricales and Russulales sequences a small number displayed N-terminal Lys instead (and in a few occasions Asn). The Arg and Lys sequences clustered together phylogenetically; meaning that for species of these orders Arg and Lys are perhaps interchangeable at the N-terminal position. We found that many Arg-AA9 members have signal peptides that indicate they would be secreted. As an example, we found that for the Arg-AA9 of P. coccineus BRFM310 (JGI protein ID 1430659) the SignalP server (37) reports with high certainty a putative signal peptide with predicted cleavage before the Arg, meaning that the protein is most likely targeted for the secretory pathway.
To augment previous reports on transcriptional regulation of Arg-AA9–encoding genes during fungal growth on plant biomass (33–35), we sought additional evidence in other fungal species. Using RNA-seq analysis (as previously described in Ref. 38), we identified a number of genes in several Polyporales species (with around 70% sequence identity to LsAA9B, Fig. 2), which were up-regulated when the fungi were cultivated on cellulose or complex biomass compared with control growth on maltose as an easily up-takable carbon source (Table 1). We found Arg-AA9s in the Polyporales fungi Trametes ljubarskyi BRFM 1659 (JGI protein ID 971189), Leiotrametes sp. BRFM 1775 (JGI protein ID 236437), Pycnoporus sanguineus BFRM 1264 (JGI protein ID 1740610), P. coccineus BRFM 1662 (JGI protein ID 248104), and Trametes elegans BRFM 1663 (JGI protein ID 360271), which were highly up-regulated at day 3 on Avicel, pine, aspen, and wheat straw. In addition, these were co-regulated with several other secreted CAZymes targeting plant biomass polysaccharides (Table 1 and Fig. 2).
Figure 2.
AA9s with N-terminal Arg up-regulated on lignocellulosic biomass. Multiple sequence alignment of Arg-AA9 sequences found to be up-regulated during growth on aspen, pine, and wheat straw. The signal peptides are included for each sequence. The N-terminal Arg of the mature proteins is indicated with a star above the sequence. Numbering above the sequence alignment is according to the mature LsAA9B protein with Arg in the N terminus. Secondary structure elements of LsAA9B are shown above the alignment in yellow and red for β-strands and α-helices, respectively. The JGI protein ID is given next to each sequence. Residues with more than 70% sequence identity are indicated with blue (i.e. residues shared by at least seven of the nine protein sequences).
Table 1.
Transcriptional regulation of genes coding for Arg-AA9 in six Polyporales species
Normalized transcript read counts after 3 days growth on Avicel (Av), aspen, (As), pine (Pi), or wheat straw (WS) were compared to normalized read counts after 3 days growth on maltose. Coregulated genes were identified that shared similar transcript levels and transcription profiles on the five carbon sources.
| Organism | JGI Prot ID | X-fold upregulation on |
Co-regulated genes | Reference | |||
|---|---|---|---|---|---|---|---|
| Av | As | Pi | WS | ||||
| P. coccineus | 1430659 | 110 | 10 | 52 | 110 | GH5_5, GH7, AA9, CE16, GH28, AA2 | 34 |
| BRFM 310 | |||||||
| T. ljubarskyi | 971189 | 675 | 388 | 64 | 256 | GH5_5, GH7, GH5_7, GH12, AA9, CE16 | This manuscript |
| BRFM 1659 | |||||||
| P. coccineus | 248104 | 14 | 9 | 5 | 6 | – | This manuscript |
| BRFM 1662 | |||||||
| Leiotrametes sp. | 236437 | 104 | 10 | 3 | 10 | GH1, GH3, GH131 | This manuscript |
| BRFM 1775 | |||||||
| P.sanguineus | 1740610 | 78 | 7 | nd | 3 | GH7, AA9 | 38 |
| BRFM 1264 | |||||||
| T.elegans | 360271 | 16 | 48 | 11 | 34 | GH10 | This manuscript |
| BRFM 1663 | |||||||
These results strongly suggest that these Arg-AA9s may have a role in the fungal adaptive response to (ligno)cellulosic substrates in coordination with the plant cell well targeting CAZymes that they are coexpresssed alongside (i.e. GH5_5, GH5_7, and GH12 endoglucanases and endomannanases, GH7 cellobiohydrolases, CE16 deacetylases, GH3 glucosidases, GH28 polygalacturonases, and canonical AA9, Table 1).
LsAA9B, an AA9 with an N-terminal Arg from L. similis, is secreted upon growth on hardwood pulp
Because the bioinformatics analysis and transcriptomic data suggested that Arg-AA9s might be functionally expressed during fungal growth on biomass, we sought to structurally and functionally characterize one of these proteins. The Polyporales fungus L. similis contains at least seven genes encoding proteins that are classified as AA9 of which LsAA9A has been well-characterized (22, 23, 29). Also among these proteins is a single domain Arg-AA9 protein of 221 amino acids (aa) residues, which we denote LsAA9B (GH61–5 in Ref. 32, GenBankTM accession MN265867). The LsAA9B sequence includes a signal peptide (MKTWAVLSSLALLASSVSA) that suggests that this protein is secreted by L. similis. Indeed, from secreted fractions obtained after induction experiments of L. similis with hardwood pulp, we identified peptides corresponding to the mature LsAA9B protein by electrospray ionization tandem-MS (ESI MS/MS) (Fig. 3 and Table S1), conclusively showing that the LsAA9B protein is produced extracellularly when the fungus grows on biomass. This is to our knowledge only the second report of identification of up-regulation of an Arg-AA9 at the protein level. The first one was of P. brumalis BRFM985 (JGI protein ID 1403153), which as reported in the table S3 in Ref. 35 was found in the fungal secretome when P. brumalis was grown on wheat straw.
Figure 3.
Sequence coverage of LsAA9B peptides found in the L. similis secretome following growth on hardwood pulp.
Sequence comparison with well-characterized canonical AA9 LPMOs indicated that residues Arg-1, Asn-84, Leu-158, Gln-167, and Phe-169 in LsAA9B were equivalent to the His-brace, conserved Gln and His residues in the secondary coordination sphere, and a Tyr that occupies the axial position of the active site copper (which in TtAA9E are His-1, His-68, His-142, Gln-151, and Tyr-153 (14)). Thus, it appeared that, except for Gln-167, the residues important for AA9 LPMO activity (13, 15) were changed in LsAA9B. LsAA9B was heterologously expressed in Aspergillus oryzae and purified for further biochemical studies (see “Experimental procedures” and Ref. 32). As expected, with the purified LsAA9B protein no enzymatic activity could be detected on cellulosic and hemicellulosic substrates (PASC, AZCL-cellulose, or AZO-Xylan), under the experimental conditions tested as compared with the previously characterized LsAA9A that showed clear activity on PASC and AZCL-cellulose.
The LsAA9B X-ray crystal structure reveals a fold very similar to active AA9 LPMOs
Crystals of LsAA9B were obtained in a range of conditions with combinations of polyethylene glycols (PEGs) (see Table 2 for further details). Structures could be determined by molecular replacement (MR) in space group P212121 to better than 1.60-Å resolution. We obtained structures of both a naturally glycosylated and a partly deglycosylated form of LsAA9B (LsAA9B and LsAA9B_deglyc). In addition, using the glycosylated LsAA9B batch, we obtained structures after soaking experiments with either a solution containing transition metals (LsAA9B_metalsoak) or a solution containing xylotetraose (LsAA9B-Xyl4). For all structures, all 221 residues corresponding to the mature protein could be modeled in the electron density and the final structures showed good refinement statistics (Table 3). The structure of LsAA9B revealed a typical LPMO topology (Fig. 4a and Fig. S2) consisting of a central immunoglobulin-like β-sandwich (β1:3–9; β2:52–55; β3:59–65; β4:88–94; β5:108–114; β6:134–138; β7:146–156; β8:167–178). The AA9 structure most closely related to LsAA9B (based on RMSD of the Cα trace calculated with Superpose (39), see Table S2) is NcAA9C (25) (RMSD of 1.04 Å) (PDB entry 4D7U) followed by NcAA9D (40) (PDB entry 4EIR), and LsAA9A (22) (PDB entry 5ACH) all harboring an extended L3 loop (25) implicated in cellulose binding. NcAA9F (PDB entry 4QI8) and in fact TtAA9E (PDB entry 3EJA, used as MR search model) are the more distantly related structures, whereas the TaAA9A (9), NcAA9M (40), and TrAA9B (41) structures (PDB entries 3ZUD, 4EIS, and 2VTC, respectively), which all possess an extended L2 loop (40) (which is absent in LsAA9B) show intermediate RMSDs. However, all are structurally quite similar to LsAA9B with RMSDs of less than 2 Å (Fig. S2 and Table S2).
Table 2.
Crystallization conditions for crystals from which datasets were collected
Drop sizes were 0.3 μl with protein:reservoir ratios of 3:1.
| Protein batch | Protein concentration | Primary precipitant | Additivea | Bufferb (pH) | Notes | Structure |
|---|---|---|---|---|---|---|
| LsAA9B | 4 mg/ml | PPT4 | NPS | Imidazole/MES | – | – |
| (initial) | 37.5% (w/v) | 0.1 m (pH 6.5) | ||||
| LsAA9B | 5 mg/ml | PPT1 | AA | Imidazole/MES | – | LsAA9B |
| (glycosylated) | 30% (w/v) | 0.1 m (pH 6.5) | ||||
| LsAA9B | 5 mg/ml | PPT4 | NPS | Imidazole/MES | – | LsAA9B_deglyc |
| (deglycosylated) | 48% (w/v) | 0.1 m (pH 6.5) | ||||
| LsAA9B | 11 mg/ml | PPT4 | CA | HEPES/MOPS | 30 min | LsAA9B_metalsoak |
| (glycosylated) | 37.5% (w/v) | 0.1 m (pH 7.5) | metal soak | |||
| LsAA9B | 11 mg/ml | PPT1 | Divalents | HEPES/MOPS | 1 hour 20 min. | LsAA9B_Xyl4 |
| (glycosylated) | 36% (w/v) | 0.1 m (pH 7.5) | xylotetraose soak |
a Primary precipitant mixtures were: PPT1: 2:1 ratio of PEGMME 500 and PEG 20,000; PPT4, 1:1:1 ratio of MPD (racemic), PEG 1000, PEG 3350. Additives were: amino acids (AA): 20 mm glutamate, alanine, glycine, lysine, and serine (some racemic); NPS: 30 mm of each of NaNO3, Na2HPO4, (NH4)2SO4: carboxylic acids (CA): 0.02 m of each of Na-formate, NH4-acetate, Na-citrate, Na/K-tartrate, Na-oxamate; and divalents: 30 mm each of CaCl2 and MgCl2.
b The buffer components were: imidazole, MES, MOPS, and HEPES.
Table 3.
Crystallographic data and refinement statistics
The highest resolution shell is shown in parentheses.
| Data collection | LsAA9B | LsAA9B_deglyc | LsAA9B_metalsoak | LsAA9B_Xyl4 |
|---|---|---|---|---|
| Beamline synchrotron | I911–3 | I911–3 | I911–3 | ID32–1 |
| MAX-IV | MAX-IV | MAX-IV | ESRF | |
| Wavelength (Å) | 0.9900 | 1.0000 | 1.3499 | 0.9760 |
| Temperature of data collections (K) | 100 | 100 | 100 | 100 |
| Space group | P212121 | P212121 | P212121 | P212121 |
| Cell dimensions | ||||
| a, b, c (Å) | 35.19, 72.48, 78.56 | 35.08, 72.99, 79.06 | 35.16, 72.58, 78.68 | 35.41, 72.79, 78.59 |
| α = β = γ (°) | 90 | 90 | 90 | 90 |
| Molecules/ASU | 1 | 1 | 1 | 1 |
| Resolution (Å) | 20.00–1.60 | 20.00–1.60 | 40.00–1.58 | 40.00–1.40 |
| (1.70–1.60) | (1.70–1.60) | (1.62–1.58) | (1.44–1.40) | |
| Rrim (%) | 10.8 (67.8) | 8.4 (82.0) | 5.5 (39.1) | 10.0 (156.0) |
| I/σ(I) | 14.07 (2.74) | 14.39 (1.94) | 28.46 (3.0) | 13.57 (1.36) |
| CC1/2(%) | 99.8 (76.4) | 99.8 (76.6) | 99.9 (82.6) | 99.9 (45.4) |
| Completeness (%) | 99.6 (98.5) | 98.5 (91.7) | 92.4 (72.4) | 100 (100) |
| Multiplicity | 7.83 (6.01) | 5.81 (3.59) | 8.60 (2.84) | 7.17 (7.27) |
| No. total reflections | 212379 (26267) | 157803 (14758) | 239233 (4271) | 292531 (40817) |
| No. unique reflections | 27139 (4369) | 27139 (4106) | 27805 (1501) | 40817 (2961) |
| Rfactor (%) | 13.67 (22.0) | 15.48 (29.9) | 11.27 (22.2) | 13.31 (29.9) |
| Rfree (%) | 18.22 (27.5) | 21.14 (37.5) | 17.31 (24.4) | 17.51 (34.9) |
| No. atoms/B factors (Å2) | ||||
| Proteina | 1703/14.84 | 1737/17.41 | 1740/12.39 | 1741/19.21 |
| Solventb | 241/30.10 | 291/31.47 | 270/26.94 | 231/31.12 |
| RMSD | ||||
| Bond lengths (Å) | 0.018 | 0.009 | 0.015 | 0.013 |
| Bond angles (°) | 1.90 | 1.48 | 1.83 | 1.81 |
| Ramachandranc (%) | ||||
| Favored/allowed/outliers | 90/10/0 | 88/12/0 | 88/11/0 | 89/11/0 |
| PDB accession codes | 6RS6 | 6RS7 | 6RS8 | 6RS9 |
a Glycosylation (N-acetylglucosamine units, mannosyl units) is included under “Protein.”
b PEG, MES, acetate, Cl−, and water molecules are included under “Solvent.”
c Calculated using Procheck within the CCP4 suite.
Figure 4.
Structure of LsAA9B and positions equivalent to critical residues in AA9 LPMOs. a, side (left) and top (right) view of LsAA9B showing the common immunoglobulin-like β-sandwich-fold. b, detailed view with electron density near the N-terminal Arg (Arg-1), including phosphorylated Ser-25 (pS25). Ser-25 is modeled in two conformations (70 and 30% occupancies) with the highest occupied conformation being phosphorylated. The side chain of Arg-1 makes a 2.9 Å interaction with the phosphorylation on Ser-25. LsAA9B Arg-1, Asn-84, Leu-158, Gln-167, and Phe-169 are equivalent to His-1, His-68, His-142, Gln-151, and Tyr-153 in TtAA9E (residues critical for AA9 LPMO function). The electron density maps (2Fo − Fc in blue and Fo − Fc in green, contoured at 1.0σ and 3.0σ, respectively) are calculated from a structure before modeling the phosphorylation. The logo-conservation depict most common residues at a given position and their frequency (letter size) in Arg-AA9 sequences. c, active site residues of the structurally closest homolog NcAA9C (cyan, PDB 4D7U) superimposed onto the equivalent residues of LsAA9B.
The LsAA9B structures show striking differences compared with canonical LPMO active sites, but share structural features with AA9 LPMOs active on oligosaccharides
In all the LsAA9B structures the N-terminal Arg (Arg-1) and the residues, Asn-84, Leu-158, and Gln-167, Phe-169 (equivalent to His-1, His-68, His-142, Gln-151, and Tyr-153 in TtAA9E are important for AA9 LPMO activity) were clearly defined in the electron density map (Fig. 4b). These residues in LsAA9B (Fig. 4c and Fig. S2) are arranged in a similar configuration compared with NcAA9C (the closest structural match, PDB 4D7U), as defined by the backbone positions, but are not associated with any metal or positioned compatibly with a likely metal binding function (guanidinium group of Arg-1 points away from Asn-84) (Fig. 4). This confirms that residues important for AA9 LPMO activity are changed in LsAA9B, as was indicated by sequence comparison (note that in other LPMO families with demonstrated activity Phe is found in positions equivalent to Phe-169). Furthermore, the structure does not indicate that nearby residues could functionally compensate for the loss of the conserved residues to provide a copper-binding site/LPMO activity.
Near the N-terminal Arg, LsAA9B Tyr-206 is found (Fig. S2) in a position equivalent to Tyr-204 in NcAA9C and Tyr-203 in LsAA9A, shown to be involved in AA9 LPMO oligosaccharide interactions (21–23). In those structures the Tyr residues are well-defined and make hydrogen bonds to the backbone amide of the conserved His of the secondary copper coordination sphere (His-147 in LsAA9A and His-155 in NcAA9C). However, in LsAA9B the rotamer of the apolar Leu-158 side chain interferes with the hydrogen bonding of the backbone amide to Tyr-206, which in many of the structures was not well-defined (Fig. S3, a and b). In addition, LsAA9B has a number of residues, such as Asn-26 and His-66, which are in positions equivalent to those involved in oligosaccharide binding in LsAA9A (LsAA9A Asn-26, His-66, Asn-67, Ser-77, Glu-148, Asp-150, and Tyr-203 are equivalent to LsAA9B Asn-26, His-66, Thr-67, Thr-83, Asn-159, Glu-161, and Tyr-206) (Fig. S2). Almost all of the Arg-AA9 sequences have Asn, His/Arg, and Tyr/Phe in the positions of Asn-26, His-66 (Arg is found in this position in the oligosaccharide active CvAA9A LPMO (23)), and Tyr-206 (Fig. S1). However, for Asn-159 and Glu-161 (equivalent to residues in LsAA9A interacting with the substrate in the minus subsites) there seems to be virtually no conservation (Fig. 5 and Fig. S1).
Figure 5.

Surface features and conservation of LsAA9B. a, structure of LsAA9B interacting with a MES buffer molecule in a small pocket (magenta). The MES molecule makes an H-bond to His-65, and makes additional interactions with Tyr-71 and a GlcNAc molecule (N-linked glycosylation at Asn-134). b, electron density (2Fo − Fc (1.0σ) in blue and Fo − Fc (3.0σ) in green) for MES before modeling the buffer molecule. c, view of LsAA9B Ser-6, Asp-13, Gln-16, His-63, His-65, Tyr-71, and Asn-134-GlcNAc that form the pocket. d, residues of LsAA9B colored in a rainbow spectrum according to their conservation within Arg-AA9 sequences (red, completely conserved; yellow, intermediately conserved; blue, not conserved). Positions equivalent to residues involved in copper coordination and secondary coordination sphere in canonical AA9 LPMOs (Arg-1, Asn-84, Phe-169, Leu-158, Gln-167) and in substrate interactions in LsAA9A (Asn-26, His-66, Asn-67, Thr-83, Asn-159, Glu-161, Tyr-206) are shown as sticks. The speculated phosphorylation sites at Ser-25 and Ser-24 and the completely conserved residues in the L3 loop (His-65, Tyr-71, Pro-76, Pro-79, and Pro-82) are also shown in stick representation.
Other structural features of LsAA9B
O-Linked and N-linked glycosylation at Thr-59 and Asn-134, respectively, were also observed in all of the LsAA9B structures. From Thr-59 one α-linked mannosyl unit could be modeled in all cases. In the LsAA9B structure, two N-acetylglucosamine (GlcNAc) units as well as one poorly defined mannosyl unit could be modeled from Asn-134. In the LsAA9B_deglyc structure, one well-defined and one less defined GlcNAc unit were modeled, indicating that the protein batch was not fully deglycosylated (as confirmed by MS).
Near Asn-134 a small pocket was found (mainly formed by Tyr-71 and a GlcNAc unit from the N-glycosylation site) (Fig. 5), which was often occupied by electron density that could be attributed to crystallization condition components (e.g. glycine, sulfate, or a MES molecule). In the LsAA9B_metalsoak structure a MES molecule could quite confidently be modeled in this position (Fig. 5b) interacting with His-65 and Tyr-71, and in part with the GlcNAc unit (Fig. 5c). A putative functional role is supported by full conservation of His-65 and Tyr-71 (in loop L3) among AA9-Arg, whereas the glycosylated Asn-134 can be substituted with Phe/Tyr, which could assume a similar structural role in forming a pocket. Additional full conservation of three prolines (Pro-76, -79, and -82) suggests that the extended L3 loop has a conserved rigid structure in this AA9 subgroup (Fig. 5d and Fig. S1). In some canonical AA9 LPMOs charged or polar residues in the extended L3 loop are putative determinants of specificity toward soluble substrates.
For many of the crystal structures determined during this work, Fourier difference map density was visible proximal to Ser-25. Because additional evidence suggested a putative phosphorylation site in LsAA9B (see “Indications of a potential phosphorylation site in LsAA9B”), the final structure includes a phosphoserine (pSer-25), which fits well the electron density in two of the four structures (LsAA9B_deglyc and LsAA9B_metalsoak). In these structures, the pSer-25 and Ser-25 are modeled in alternative conformations with 70 and 30% occupancy, respectively (Fig. 4b).
LsAA9B is not a copper-binding protein
When inspecting the LsAA9B structures no bound metals were found. We speculated whether Arg-1 would be able to adopt a different conformation in response to addition of transition metals allowing for LsAA9B to coordinate a metal cofactor at this surface. Co-crystallization and soaking experiments with transition metals were attempted with solution mixtures containing Fe2+, Cu2+, Mn2+, and Co2+. X-ray diffraction data were collected at an appropriate wavelength of 1.35 Å ∼9.184 keV to ensure an anomalous signal would be obtained to allow location of any bound metal ions in the crystal structure. Following map generation, no peaks that could be interpreted as bound metals could be found when inspecting an anomalous Fourier difference map. The highest peaks were found near the sulfur atoms in the Cys residues of the protein, confirming that these metals did not bind in any of these experiments. None of the structures inspected during this work showed any sign of bound metals.
Metal binding was also investigated by thermal shift assays with differential scanning fluorimetry (DSF) and differential scanning calorimetry (DSC). For three completely independent measurements with DSF, the Ti for LsAA9B was 65.5 ± 0.75 °C. The presence of copper ions at slightly over stoichiometric amounts (0.048 mm copper acetate) resulted in a decrease of Ti to 62.4 °C. A similar approximate 4 °C decrease of Tm was found with the addition of copper measured by DSC. No significant change of Tm was found for a range of other metals by DSC.
As a final step, microPIXE (Proton-induced X-ray Emission) analysis (42) was used to detect any additional elements present in the sample with no added metals, to ascertain whether LsAA9B copurified with any metal ions that might hint at function. The analysis did not reveal significant amounts of copper, iron, or any other metal typically associated with redox enzymes present in the sample above trace levels (Fig. S4 and Table S3) (calculations suggest a lower detection limit for copper of 4 × 10−3 atoms per protein molecule).
Indications of a potential phosphorylation site in LsAA9B
Although it did not reveal any natively bound metals, the PIXE analysis detected the presence of a single atom of phosphorus in two point spectra (Fig. S4 and Table S3) for every protein molecule in the sample. In many of the structures we obtained during this study, density was observed in close proximity to the side chain of Ser-25, upon inspection of the Fourier difference map. In light of these data, this was therefore interpreted as being a phosphoserine. Modeling a phosphorylated Ser (pSer-25), with the phosphate group hydrogen bonding to the side chain of the N-terminal Arg (2.9 Å), led to an improved fit to the electron density map and model quality (improved R-factor and clash score). The position of the phosphate group does not appear to interfere with possible copper binding (Fig. S3C).
Following this observation, we tried to identify phosphorylation using MS. Making several measurements of the Endo H-deglycosylated LsAA9B batches with matrix-associated laser desorption ionization time of flight (MALDI-TOF) we identified the largest peaks with m/z values that could correspond to LsAA9B with glycosylation (N-GlcNAc and O-Man) and a phosphorylation (23.255 Da + 203 Da + 162 Da + 80 Da = 23.670 Da). Using MALDI-TOF to measure the glycosylated LsAA9B we found that the m/z values could correspond with a single phosphorylation, two GlcNAc and varying degrees of mannosylation (m/z increasing by 162 Da for every mannosyl unit). MS techniques with a trypsin-digested preparation of LsAA9B only identified the nonmodified peptide (total sequence coverage of 55%), thus conclusive evidence for phosphorylation is lacking.
However, it is interesting to observe that in positions corresponding to Ser-24 and Ser-25 in LsAA9B most of the analyzed Arg-AA9 sequences have Ser or Thr (Figs. 4b and 5, and Fig. S1), which are potential phosphorylation sites, as predicted by the NetPhos 3.1 Server (43). Thus, at least theoretically, it seems that a number of the Arg-AA9 sequences could be phosphorylated.
Investigation of oligosaccharide binding
As already stated LsAA9B contains residues (e.g. Asn-26, His-66, and Tyr-206) equivalent to some of those involved in LsAA9A oligosaccharide binding (Fig. S2). When superimposing either the LsAA9A-Cell5 (PDB 5NLS) or LsAA9A-Xyl5 (PDB 5NLO) complex onto the LsAA9B structure, we found that some of these residues in LsAA9B came within hydrogen bonding distance of the Cell5 and Xyl5 ligands of the LsAA9A complex structures. In addition, we observed that the sugar moiety of the +2 subsite in LsAA9A (the C6-hydroxyl and the C2-hydroxyl for Cell5 and Xyl5, respectively) could interact with the putative phosphorylation in LsAA9B. Thus, LsAA9B carbohydrate binding was investigated by cocrystallization with monosaccharides, and crystal soaking experiments and thermal shift assays with oligosaccharides.
Addition of oligosaccharides at 50 mm concentration in solution caused very little change in Ti (66.1 °C with maltopentaose; 65.9 °C with Cell5; 64.7 °C for Xyl5; 65.0 °C for chitopentaose). Addition of stoichiometric amounts of copper (with or without 200 mm NaCl as described in Ref. 22) decreases the Ti to 62.4 °C (Ti of 61.9 °C in the presence of NaCl), still with no upward thermal shift on addition of oligosaccharides (Ti values of 59.9–61.4 °C with copper alone and Ti values of 61.3–62.2 °C with copper and NaCl). No crystals appropriate for data collection were obtained from cocrystallization experiments. Soaking experiments with cellotetraose (Cell4), cellopentaose (Cell5), or xylotetraose (Xyl4) oligosaccharides were done in conditions with MgCl2 and CaCl2 (30–100 mm) because the presence of chloride ions has been shown to enhance oligosaccharide binding to canonical AA9 (22). No electron density that could be interpreted as cellooligosaccharides was identified at the protein surface.
However, we did identify a Xyl4 ligand bound at the surface of LsAA9B, but the binding mode is distinct compared with LsAA9A-Xyl5 (Fig. 6a). One explanation for this could be that glycosylation of a symmetry related molecule occludes this binding site (Fig. 6b). Instead the Xyl4 ligand bound in a small cleft formed by residues 41–49 (TGFIQPVSK) and 99–10 (TQS). Three xylosyl units of the Xyl4 ligand (numbered 1–3 from the reducing end) are well-defined and could easily be modeled in the electron density, whereas the fourth xylosyl unit at the nonreducing end is only partially defined (Fig. 6c). Many of the glycosidic torsion angles of the Xyl4 ligand deviate from ideal values, e.g. between xylosyl units 1 and 2 (in particular the Ψ-torsion angle) (Table S4).
Figure 6.
Xylotetraose bound on the surface of LsAA9B. a and b, structure of LsAA9B-Xyl4 showing interaction with the xylotetraose ligand (Xyl4; yellow). The N-terminal Arg (Arg-1) and Asn-84 (equivalent to the His-brace of canonical AA9s) are shown in green. Positions involved in AA9-substrate interactions are colored magenta. The xylopentaose ligand (cyan) from the LsAA9A-Xyl5 complex structure (PDB 5NLO) are shown following superimposition onto LsAA9B. Blue surface indicates LsAA9B crystal contacts. The pSer-25 involved in crystal contacts are highlighted with an arrow. N-Linked glycosylation from a symmetry molecule (indicated with an asterisk) is found close to the Xyl5 ligand. c, electron density (2Fo − Fc (1.0σ) in blue and Fo − Fc (3.0σ) in green) calculated from a structure before modeling the Xyl4 ligand.
Discussion
LsAA9B can be heterologously expressed and is a well-folded and stable protein, with melting temperatures comparable with enzymatically active members of the family (44–47), but showed no detectable LPMO activity under the condition tested. The X-ray crystal structure of LsAA9B in fact confirms a complete disruption of the copper-binding site that is the hallmark of LPMOs (9) and absence of other metal-binding sites, even though the overall-fold remains very similar to that expected. We clearly demonstrated that residues Arg-1 and Asn-84 in LsAA9B are positioned equivalently to the His-brace of AA9 LPMOs, but that neither of these (nor any other) residues coordinate copper. In all structures (with or without the putative phosphorylation site modeled and/or metals added), Arg-1 in fact assumes a rotamer conformation that is not optimal for forming a His-brace like metal-binding site. PIXE spectroscopy (42), crystal soaking experiments with metal ions, and thermal shift analyses in solution failed to indicate binding of any metal. This is consistent with a survey of metal-binding sites in protein structures reporting that Asn and Arg interact with elements such as potassium, sodium, magnesium, and calcium more frequently than with transition metals (of which only manganese is reported), whereas in contrast His is the most common protein residue to bind copper (48). Thus, our work confirms the expectation that LsAA9B and other Arg-AA9s are very unlikely to be copper-dependent LPMOs.
At the same time, evidence presented here and elsewhere shows that a number of genes (in wood-degrading fungal Polyporales species) encoding Arg-AA9 proteins are up-regulated during fungal growth on plant biomass alongside lignocellulose degrading CAZymes (most commonly members of GH and AA families). The majority of the Arg-AA9 sequences possess a signal peptide (with predicted processing before Arg) expectedly targeting the mature N-terminal Arg proteins for the secretory pathway. Indeed both LsAA9B and the Arg-AA9 of P. brumalis BRFM 985 (JGI protein ID 1403153) have been detected in fungal secretomic data. In addition, the mature LsAA9B protein with N-linked glycosylation is secreted when recombinantly produced in Aspergillus from a construct with the native L. similis signal peptide. Thus, there is circumstantial evidence that Arg-AA9 proteins, like conventional LPMOs, could be involved in plant biomass degradation. The open question remains how, and although this work does not provide a conclusive answer, the structural analysis suggest a number of new avenues for further investigation.
For example, we found that the relatively conserved L3 loop is likely structurally rigid in the majority of Arg-AA9 proteins and together with the Asn-134 glycosylation forms a small pocket. In the LsAA9B_metalsoak structure, a MES molecule is interacting with the completely conserved His-65, Tyr-71 in this pocket. It is possible that the MES molecule mimics biologically relevant interactions and notably, MES bear some resemblance to phenolic compounds (e.g. p-coumaryl-coniferyl, sinapyl alcohols) that make up lignin, and thus the possible connection to lignin degradation/detoxification pathways should be investigated in the future.
Structural comparison suggested that LsAA9B could bind oligosaccharides in a manner similar to LsAA9A, but this could not be confirmed by our structural studies, perhaps because of glycosylation from a symmetry-related molecule occluding this binding site. In contrast, we obtained an LsAA9B structure with a Xyl4 ligand bound at a distinct binding site, in a small cleft made up mostly by relatively conserved residues (no similar cleft is present in LsAA9A where this space is occupied by the peptide stretch VDNRVV formed by residues 43 to 48). It is difficult to establish whether the binding site is biologically relevant: the expected structural conservation of this region in Arg-AA9 and the fact that there is only one hydrogen bond of Xyl4 with a symmetry-related molecule suggest this is not a crystal artifact. On the other hand the bound conformation of Xyl4 deviates considerably from what is usually observed in xylooligosaccharides and xylan (49) (Table S4), and Xyl5 in solution caused no thermal shift. The path connecting this binding site and the binding site in LsAA9A is definitely occluded by crystal contacts, and thus binding of longer oligosaccharides (connecting the two binding sites) is not possible in this crystal form. However, the discovery of a Xyl4-binding site warrants additional investigations of polysaccharide binding by this and other Arg-AA9.
Finally, we found indications that Ser-25 could be phosphorylated in LsAA9B as supported by some high-resolution structures and by PIXE data, and consistent with MALDI-TOF analysis, although a phosphorylated peptide could not be conclusively identified. Intriguingly, the phosphoserine modeled in the structure interacts with the N-terminal Arg (2.9 Å hydrogen bond) and could additionally potentially interact with bound polysaccharides near the binding sites observed for LsAA9A and LsAA9B.
Potential similar phosphorylation sites (equivalent to Ser-24 and Ser-25 of LsAA9B, Figs. 4b and 5d) are also predicted for Arg-AA9 protein sequences (presented here in Fig. 2 and Table 1 and previously (34, 35, 38)), for which the transcription of the corresponding genes were up-regulated during fungal growth on plant biomass. In addition, in the Fungi Phosphorylation Database (FPD) (50) documented Ser phosphorylation sites in fungal species (including Aspergillus) can be found that are consistent with the LsAA9B sequence surrounding the Ser-25 phosphorylation, supporting the notion that phosphorylation could play a role in fungal plant biomass degradation. It is known that proteins can be phosphorylated in the secretory pathway (51, 52), in mammalian cells by the Fam20C kinase (53, 54), and phosphorylation does appear to play a role extracellularly during some biological events (e.g. during microbial host infection (55–57)). Thus, it is possible that also phosphorylated Arg-AA9s could exist extracellularly, because the evidence points to an extracellular location for these proteins.
In summary, our results indicate the importance of Arg-AA9s in fungal adaptation to biomass degradation. We have shown that the transcription of Arg-AA9 genes are up-regulated in fungal species belonging to the phylogenetic class of Agaricomycetes (of which many are wood or litter decayers) when cultivated on plant biomass. We have determined this first structure of an Arg-AA9 protein, LsAA9B (belonging to the wood-decaying Lentinus genus (58)), and have identified interesting structural features, including a potential carbohydrate-binding cleft, a small conserved pocket, and a potential phosphorylation site. Sequences conservation suggests that these features are found in the majority of the Arg-AA9s as well. These findings provide a solid framework for future investigations of this subgroup of AA9 and for the pursuit of their potential role in fungal biology. The original structures of AA9 (14) and AA10 (24) family members were of key importance toward the subsequent identification of the nature of LPMO action on biomass. We hope that the structure of LsAA9B will likewise act as the starting point for further characterization of these unusual AA9 family members.
Experimental procedures
Fungal transcriptomic and secretomic data
L. similis was cultivated at 30 °C on a basic fungal media including 0.5% (w/v) hardwood BCTMP (bleached chemi-thermo mechanical pulp) for induction experiments. Samples were taken after 0, 3, 5, 7, and 10 days to analyze secreted proteins and identify LsAA9B with ESI-MS/MS. Tryptic digests were prepared by a filter-aided sample preparation method. Following digestion the extracted peptides were analyzed on a nano LC-MS/MS system: UltiMate 3000 RSLC nano/LTQ Orbitrap Velos Pro (Thermo/Dionex). For protein identification the data were searched against the complete L. similis proteome (internal database) using the Mascot search engine (Matrix science) in the Genedata Expressionist software (1% false discovery rate cutoff). Relative protein concentrations were calculated by label-free quantification from peptide volumes using a Hi3 standard method in Genedata Expressionist.
P. coccineus BRFM 1662, T. ljubarskyi BRFM 1659, Leiotrametes sp. BRFM 1775, and T. elegans BRFM 1663 strains were obtained from the CIRM collection at the National Institute of Agricultural Research. Transcriptome and secretome data were collected from triplicated independent 3-day cultures in the presence of 20 g × liter−1 of maltose, 15 g × liter−1 of Avicel, 15 g × liter−1 of ground wheat straw, 15 g × liter−1 of ground pine wood, or 15 g × liter−1 of ground aspen wood as the sole carbon source. RNA libraries were prepared and sequenced on Illumina HighSeq-2500 as described in Ref. 59. Secreted proteins were collected from the same cultures, diafiltered and identified by ESI-MS/MS (as in Ref. 59). Transcript reads were analyzed as described in Ref. 38. For each strain, the genes with similar transcription profiles on the five carbon sources were grouped into nodes using the Self-organizing maps Harboring Informative Nodes with Gene Ontology Pipeline (SHIN+GO (34)). All sequence data are available on the Mycocosm public database at Joint Genome Institute (https://genome.jgi.doe.gov/programs/fungi) (60).
LsAA9B protein production, purification, and activity measurements
An AA9 protein from L. similis (LsAA9B) was recombinantly produced in A. oryzae MT3568 and purified as described in the patent literature (32) (similar to the previously characterized LsAA9A (22, 31)). The mycelium was removed by filtration and the broth collected for protein purification by chromatography with a 50-mm diameter 167-ml Butyl-ToyoPearl 650 column (ToSoh BioSciences, Stuttgart, Germany) with a gradient of 0–100% of buffer A (25 mm Tris-HCl, 1.0 m ammonium sulfate, pH 7.5) and buffer B (25 mm Tris-HCl, pH 7.5). To ensure purity suitable for structural studies, two further purification steps were applied. After buffer change by ultrafiltration (Vivaspin, Sartorius) the sample was applied to a 100-ml Q-Sepharose column (Sigma) in 20 mm Tris-HCl, pH 8.0. The protein was eluted with a gradient of 0 to 500 mm NaCl. Fractions containing protein were further purified on a 26-mm Superdex 75 size exclusion (Sigma) column with isocratic 20 mm MES, 125 mm NaCl, pH 6.0. All protein batches were buffered in 20 mm MES, pH 6.0. Activity was measured as previously described by depolymerization of AZCL-cellulose (61) and PASC (9) in parallel on LsAA9A and LsAA9B in the presence of a variety of electron donors (pyrogallol, 4-OH-5-CH3–3-furanone, ascorbate, and cysteine) and under similar conditions. In addition, LsAA9B activity was measured (according to manufacturer instructions) on AZO-xylan (purchased from Megazyme; product code S-AXBL) without and with equimolar concentrations (1 μm) of copper or manganese in the presence of ascorbate (1 mm).
Mass spectrometry analysis of purified LsAA9B
MS was performed using MALDI-TOF. The matrix was prepared by saturating a TA solution (0.1% (v/v) trifluoroacetic acid (TFA) and acetonitrile in a 2:1 (v/v) ratio) with sinapic acid. Matrix and protein sample were sequentially applied to the target plate (in a final 2:1 ratio) and left for solvent to evaporate. The TOF experiments were performed in linear mode calibrated for 10–50 kDa molecules.
Products following a trypsin digest of the heterologously expressed LsAA9B batches were analyzed using a Thermo Fisher Scientific Q-Exactive HF-X Orbitrap mass spectrometer that was operated in positive mode with a MS1 resolution of 120,000. A top 10 method was utilized and a resolution of 45,000 for MS2 fragment scans was utilized. Peptides were separated on a 15-cm column (75 μm inner diameter) packed in-house with 1.9 μm C18 particles. EASY-nLC 1200 nano-LC system coupled to the mass spectrometer was utilized to separate injected peptides over an increasing gradient of buffer B (80% acetonitirile, 0.1% formic acid). The collected raw data were analyzed using MaxQuant (version 1.6.1.11).
Crystallization and data collection
Deglycosylation of protein for crystallization was performed in 20 mm MES, pH 6.0, 125 mm NaCl, by incubating about 10 μl of endoglycosidase H (from Roche Diagnostics, 11643053001) per mg of LsAA9B protein overnight at room temperature. Crystallization was carried out by sitting drop vapor diffusion in 96-well MRC-2 plates using an Oryx8 crystallization robot (Douglas Instruments) with drop sizes of 0.3–0.4 μl (with protein:reservoir:Milli-Q water ratios of 3:1:1, 3:1:0, 1:1:1, or 1:1:0) and with 100-μl reservoirs in 24-well VDX plates with drop sizes of 2–4 μl and reservoirs of 1 ml. Morpheus screens (62) were set up using both a naturally glycosylated batch (11 mg/ml) and a deglycosylated (with endoglycosidase H from Roche Diagnostics, 11643053001) batch (4–5 mg/ml) of LsAA9B. For all batches, crystals could be obtained in quite similar conditions (Table 2).
With an initial protein batch a crystal grew in Morpheus condition 28 in well C4 (Table 2), from which a complete 1.6 Å resolution X-ray diffraction dataset of 155 frames (155°) was collected. With a glycosylated LsAA9B, intergrown crystal plates were obtained in Morpheus condition 85 (Table 2). A single plate was mounted separately from a cluster and a dataset of 200 images (200°) was collected. Crystals were also obtained with a deglycosylated LsAA9B (Morpheus condition 6 in well A6) and were reproducible in MRC-2 plates with protein concentrations from 2 to 5 mg/ml and 24–48% (w/v) PPT4 (a 1:1:1 mixture of MPD(racemic), PEG100, and PEG3350), and a dataset of 324 images (162°) was collected (Table 2). For all datasets the data collection were carried out with the crystals mounted in nylon loops at 100 K without additional cryoprotectant, because the Morpheus screen composition is already cryo-protecting.
Cocrystallization experiments with transition metals were performed in VDX plates with 33–42% (w/v) of PPT1 (a 2:1 mixture of PEGMME500 and PEG20.000) or PPT4 at pH 6.5, 7.5, or 8.5 and a metal mixture additive of 5–50 mm CuSO4, MnCl2, and CoCl2, but failed to provide crystals of sufficient quality. For soaking experiments, crystals were grown in PPT4 at pH 7.5 (with either carboxylic acids or amino acids or monosaccharides additives). Transition metal mixture solutions were added to an approximate final concentration of 6–7 mm of each of Fe2+, Cu2+, Mn2+, and Co2+. Crystals were mounted at different time points from 1 min to 5 h 20 min and datasets (generally to better than 1.7 Å resolution) were collected as before but at a wavelength of 1.35 Å (9.184 keV) at which the anomalous signal should be significant to detect all of the metals. No peaks that could account for bound metals were found when inspecting an anomalous Fourier difference map. In fact, the highest peaks were found near the sulfur atoms in the cysteine residues.
Optimized cocrystallization with monosaccharides additives (20–100 mm) were carried out in VDX plates with 33–42% (w/v) of either PPT1 or PPT4 at pH 6.5, 7.5, or 8.5 (because intergrown protein crystals in conditions with monosaccharide additives were observed in the screens). However, no crystals appropriate for data collection were obtained. Soaking experiments with oligosaccharides were carried out in MRC-2 plates using crystals grown with 6–11 mg/ml of protein in 27–42% (w/v) PPT1 at either pH 6.5 or 8.5 with additives of either 30 or 100 mm MgCl2 and CaCl2. Crystals were transferred to reservoir solutions containing 400–800 mm of cellotetraose (Cell4), cellopentaose (Cell5), or xylotetraose (Xyl4) and soaked for 40–80 min.
All data were collected at the MX beamline I911-3 at the MAX-IV laboratory, Lund (Sweden), or by remote access at beamline ID23-1 at the ESRF, Grenoble (France). All crystals were isomorphous and processed and scaled with XDS/XSCALE (63) in P212121 with similar cell dimensions (Table 3).
Structure determination and refinement
From sequence alignment the closest AA9s with structures available at the time were TtAA9E (PDB entry 3EJA (14)) and NcAA9D (PDB entry 4EIR (40)) with 42.9 and 45.1% sequence identity, respectively. MR with MOLREP (64) using only the protein coordinates (or Sculptor (65) modified models) of either of PDB entries 3EJA or 4EIR as search models gave solutions, which following one round (10 cycles) of restrained refinement (with Refmac5 (66), CCP4 suite (67)) resulted in R-factors of ∼40% after which parts of the map could be relatively easily interpreted. Modeling in COOT (68, 69) followed by several rounds of restrained refinement resulted in an initial preliminary structure, which was used to refine additional structures obtained later. Structures of LsAA9B were refined in Refmac5 (CCP4 suite) (66, 67) using Rfree flags imported from the previous structure factor file. All LsAA9B structures obtained after this point were refined with anisotropic B factors for all protein atoms (including glycosylation) and isotropic B factors for all other atoms. Refinement and validation statistics are shown in Table 3.
Construction of multiple sequence alignments and structural comparison
Superpose (CCP4 suite) was used for calculation of the RMSD Cα trace using secondary-structure matching (39, 67). Multiple sequence alignment was constructed using a standalone version of STRAP (STRuctural Alignments of Proteins) and programs within (70, 71). For multiple sequence alignment, MAFFT was used (72). For the structure-based sequence alignment TMalign was used to align the Cα trace of the AA9 3D structures (73). Mapping of sequence on the LsAA9B structure was done using CAMPO (74) available in the PyMod 2.0 plugin for PyMOL (75).
Thermal shift assays
Thermal shift assays with DSF were carried out using intrinsic fluorescence using the NanoTemper Tycho NT.6 (Nano Temper Technologies) according to the manufacturer's instructions. Tycho NT.6 follows the unfolding process by recording sample fluorescence at 330 and 350 nm during thermal unfolding. A constant heating rate of 30 °C/min is applied to the sample, heating from 35 to 95 °C. A shift in Ti to higher temperatures is usually indicative of ligand binding. 10 μl of each sample were prepared, incubated for 5 min, and then loaded in capillaries for measurements. LsAA9B was at a concentration of 1 mg/ml in 20 mm MES, pH 6.0. All measurements were carried out at least in triplicates. Thermal shift assays with capillary DSC (Northampton, MA) were carried out with a heating rate of 1.5 °C/min with LsAA9B at pH 5.0 in the presence of 1 mm divalent metal ions (Cu2+, Ni2+, Fe2+, Zn2+, Mn2+, and Ca2+) or 1 mm DTPA (as chelating agent).
Particle-induced X-ray emission (microPIXE) analysis
LsAA9B, purified as described above, was prepared for microPIXE analysis by passing down a 16/600 Superdex 75 (GE Healthcare) column to buffer exchange into 20 mm ammonium acetate, pH 5.5, 200 mm KBr. The final sample was then concentrated to 7.7 mg/ml using a 10-kDa cut off VivaSpin Concentrator (Sartorius).
0.1 μl of sample was placed onto a 4-μm polypropylene film (Prolene supplied by Fluxana GmbH & Co., Germany) which was then allowed to dry naturally in a closed environment to prevent dust contamination. The sample was mounted in the path of a 2.5-MeV proton beam with a beam diameter of ∼2.5 μm at the Ion Beam Centre, University of Surrey. Prior to the experiment a glass standard was analyzed and validated to ensure accurate elemental quantitation. Data were collected by first scanning the proton beam across the sample to generate a set of elemental maps. Point spectra were then measured from two distinct regions of the sample based on the distribution of sulfur in the map, signifying where protein was located. The collected spectra were processed using the Q-factor method (76) as implemented in the OMDAQ-3 software and converted into the number of metal atoms per protein molecule as described in Ref. 42.
Author contributions
K. S. J. and L. L. L. funding acquisition; L. L. L. supervision; L. L. L. project administration; K. E. H. F. and L. L. L. planned most of the experimental work and drafted the manuscript. M.-N. R., G. R. H., E. F. G., K. S. J., B. H., and J.-G. B. contributed to the manuscript draft. All other authors commented on the manuscript. K. E. H.F. carried out MALDI-TOF experiments. K. E. H. F. crystallized protein. K. E. H. F. collected crystallographic X-ray data. K. E. H. F solved and analyzed crystallographic data. K. E. H. F refined the structures. K. E. H. F carried out the bioinformatics analysis; K. E. H. F. tested for activity on AZO-xylan. M. T. purified the protein and tested activity. C. I. J. carried out proteomic and secretomic experiments on L. similis. N. S. purified the protein and tested activity. M.-N. R. analyzed the RNA-seq data. G. R. H. performed the PIXE analysis and interpreted the data. E. F. G. performed the PIXE analysis and interpreted the data. G. W. G. performed the PIXE analysis and interpreted the data. J.-C. N. P. crystallized protein. J-C. N. P. collected crystallographic X-ray data. J-C. N. P. carried out the DSF analysis; T. S. B. carried out advanced mass spectrometry and analyzed phosphoproteomic data; S. M. performed the RNASeq analysis of up-regulated and co-regulated genes. S. M. analyzed the RNA-seq data; A. L. performed Polyporales genome and transcriptome sequencing at the Joint Genome Institute (JGI); C. D. performed Polyporales genome and transcriptome sequencing at the Joint Genome Institute (JGI); I. V. G. coordinated Polyporales genome and transcriptome sequencing at the Joint Genome Institute. K. S. J. purified the protein and tested activity. B. H. carried out the bioinformatics analysis. B. H. commented on the manuscript; J.-G. B. provided input for the bioinformatics analysis. J.-G. B. provided input for analysis of RNA-seq data. L. L. L. collected crystallographic X-ray data. L. L. L. solved and analyzed crystallographic data.
Supplementary Material
Acknowledgments
We thank the MAX-IV laboratory, Lund, Sweden, and the ESRF, Grenoble, France, for beamtime and staff assistance. The work conducted by the United States Department of Energy Joint Genome Institute (JGI), a DOE Office of Science User Facility, was supported by the Office of Science of the United States Department of Energy under Contract DE-AC02–05CH11231. We thank Rebecca A. Batters for assistance with the microPIXE data analysis and Jesper V. Olsen at the Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, for involvement with advanced MS and phosphoproteomic data. K. E. H. F., J.-C. N. P., and L. L. L. are members of ISBUC Integrative Structural Biology at the University of Copenhagen.
This work was supported in part by the Critical Enzymes for Sustainable Biofuels from Cellulose (CESBIC) project, grants from the European Research Agency-Industrial Biotechnology Initiative, Danish Council for Strategic Research Grants 12-134923 and 12-134922 (to L. L. L. and K. S. J.), Biotechnology and Biological Sciences (BBSRC) Grant BB/L000423/1 (to Paul H. Walton and Gideon Davies), Novo Nordisk Foundation Grant NNF17SA0027704 (to K. S. J. and L. L. L.), Carlsberg Foundation Internationalisation Postdoc Fellowship Grants CF16-0673 and CF17-0533 (to K. E. H. F.), EU Marie-Curie FP7 COFUND People Programme (grant agreement 609398), through an AgreenSkills+ Fellowship (to K. E. H. F.), Danish Ministry of Higher Education and Science through the Instrument Center DANSCATT, and the European Community's Seventh Framework Programme Grant FP7/2007–2013 under BioStruct-X Grant N283570 for access to synchrotron beamlines. M. T., C. I. J., and N. S. are employed by Novozymes A/S, which is a major enzyme producing company. The remaining authors declare that they have no conflicts of interest with the contents of this article.
This article contains Figs. S1–S4 and Tables S1–S4.
The atomic coordinates and structure factors (codes 6RS6, 6RS7, 6RS8, and 6RS9) have been deposited in the Protein Data Bank (http://wwpdb.org/).
The nucleotide sequence(s) reported in this paper has been submitted to the DDBJ/GenBankTM/EBI Data Bank with accession number(s) MN265867.
The protein sequences reported in this paper are available in the MycoCosm Database at the Joint Genome Institute (JGI) under Protein ID 1430659 (https://mycocosm.jgi.doe.gov/Pycco1), 971189 (http://genome.jgi.doe.gov/Tralj1), 248104 (http://genome.jgi.doe.gov/Pycco1662_1), 236437 (http://genome.jgi.doe.gov/Leisp1), 1740610 (http://genome.jgi.doe.gov/Pycsa1), 360271 (http://genome.jgi.doe.gov/Artel1), 1403153 (http://genome.jgi.doe.gov/Polbr1). The JGI Protein ID of HiAA9G is 173373 (https://mycocosm.jgi.doe.gov/Hetan2). The RNASeq data reported in this paper are available from the Sequence Read Archive (SRA) under accession nos. SRP047926, SRP083645, SRP047826, SRP077308 and from the Gene Expression Omnibus repository under accession nos. GSE82433, GSE82419, GSE112011.
- CAZymes
- Carbohydrate Active enZymes
- LPMO
- lytic polysaccharide monooxygenase
- AA
- auxiliary activity
- GH
- glycoside hydrolases
- CE
- carbohydrate esterases
- RNA-seq
- RNA sequencing
- PEG
- polyethylene glycol
- PIXE
- proton-induced X-ray emission
- MR
- molecular replacement
- ITC
- isothermal titration calorimetry
- Cell4
- cellotetraose
- Cell5
- cellopentaose
- Cell6
- cellohexaose
- Xyl5
- xylopentaose
- Xyl4
- xylotetraose
- DTPA
- 2-[bis[2-[bis(carboxymethyl)amino]ethyl]amino]acetic acid
- PASC
- phosphoric acid swollen cellulose
- ESI-MS/MS
- electrospray ionization tandem-mass spectrometry
- Ti
- inflection point temperature
- DSF
- differential scanning fluorimetry
- DSC
- differential scanning calorimetry
- Endo H
- endo-β-N-acetylglucosaminidase H
- PDB
- Protein Data Bank
- RMSD
- root mean square deviation
- PEGMME
- polyethylene glycol monomethyl ether
- MES
- 2-morpholin-4-ylethanesulfonic acid
- MOPS
- 3-morpholinopropane-1-sulfonic acid
- HEPES
- 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid.
References
- 1. Johansen K. S. (2016) Lytic polysaccharide monooxygenases: the microbial power tool for lignocellulose degradation. Trends Plant Sci. 21, 926–936 10.1016/j.tplants.2016.07.012 [DOI] [PubMed] [Google Scholar]
- 2. Vaaje-Kolstad G., Westereng B., Horn S. J., Liu Z., Zhai H., Sørlie M., and Eijsink V. G. (2010) An oxidative enzyme boosting the enzymatic conversion of recalcitrant polysaccharides. Science 330, 219–222 10.1126/science.1192231 [DOI] [PubMed] [Google Scholar]
- 3. Bissaro B., Røhr Å. K., Muller G., Chylenski P., Skaugen M., Forsberg Z., Horn S. J., Vaaje-Kolstad G., and Eijsink V. G. H. (2017) Oxidative cleavage of polysaccharides by monocopper enzymes depends on H2O2. Nat. Chem. Biol. 13, 1123–1128 10.1038/nchembio.2470 [DOI] [PubMed] [Google Scholar]
- 4. Hemsworth G. R., Henrissat B., Davies G. J., and Walton P. H. (2014) Discovery and characterization of a new family of lytic polysaccharide monooxygenases. Nat. Chem. Biol. 10, 122–126 10.1038/nchembio.1417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Lenfant N., Hainaut M., Terrapon N., Drula E., Lombard V., and Henrissat B. (2017) A bioinformatics analysis of 3400 lytic polysaccharide oxidases from family AA9. Carbohydr. Res. 448, 166–174 10.1016/j.carres.2017.04.012 [DOI] [PubMed] [Google Scholar]
- 6. Levasseur A., Drula E., Lombard V., Coutinho P. M., and Henrissat B. (2013) Expansion of the enzymatic repertoire of the CAZy database to integrate auxiliary redox enzymes. Biotechnol. Biofuels 6, 41 10.1186/1754-6834-6-41 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Sabbadin F., Hemsworth G. R., Ciano L., Henrissat B., Dupree P., Tryfona T., Marques R. D. S., Sweeney S. T., Besser K., Elias L., Pesante G., Li Y., Dowle A. A., Bates R., Gomez L. D., et al. (2018) An ancient family of lytic polysaccharide monooxygenases with roles in arthropod development and biomass digestion. Nat. Commun. 9, 756 10.1038/s41467-018-03142-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Yadav S. K., Archana Singh R., Singh P. K., and Vasudev P. G. (2019) Insecticidal fern protein Tma12 is possibly a lytic polysaccharide monooxygenase. Planta 249, 1987–1996 10.1007/s00425-019-03135-0 [DOI] [PubMed] [Google Scholar]
- 9. Quinlan R. J., Sweeney M. D., Lo Leggio L., Otten H., Poulsen J. C., Johansen K. S., Krogh K. B., Jørgensen C. I., Tovborg M., Anthonsen A., Tryfona T., Walter C. P., Dupree P., Xu F., Davies G. J., and Walton P. H. (2011) Insights into the oxidative degradation of cellulose by a copper metalloenzyme that exploits biomass components. Proc. Natl. Acad. Sci. U.S.A. 108, 15079–15084 10.1073/pnas.1105776108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Frandsen K. E., and Lo Leggio L. (2016) Lytic polysaccharide monooxygenases: a crystallographer's view on a new class of biomass-degrading enzymes. IUCrJ 3, 448–467 10.1107/S2052252516014147 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Vaaje-Kolstad G., Forsberg Z., Loose J. S., Bissaro B., and Eijsink V. G. (2017) Structural diversity of lytic polysaccharide monooxygenases. Curr. Opin. Struct. Biol. 44, 67–76 10.1016/j.sbi.2016.12.012 [DOI] [PubMed] [Google Scholar]
- 12. Tandrup T., Frandsen K. E. H., Johansen K. S., Berrin J. G., and Lo Leggio L. (2018) Recent insights into lytic polysaccharide monooxygenases (LPMOs). Biochem. Soc. Trans. 46, 1431–1447 10.1042/BST20170549 [DOI] [PubMed] [Google Scholar]
- 13. Lo Leggio L., Welner D., and De Maria L. (2012) A structural overview of GH61 proteins: fungal cellulose degrading polysaccharide monooxygenases. Comput. Struct. Biotechnol. J. 2, e201209019 10.5936/csbj.201209019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Harris P. V., Welner D., McFarland K. C., Re E., Navarro Poulsen J. C., Brown K., Salbo R., Ding H., Vlasenko E., Merino S., Xu F., Cherry J., Larsen S., and Lo Leggio L. (2010) Stimulation of lignocellulosic biomass hydrolysis by proteins of glycoside hydrolase family 61: structure and function of a large, enigmatic family. Biochemistry 49, 3305–3316 10.1021/bi100009p [DOI] [PubMed] [Google Scholar]
- 15. Span E. A., Suess D. L. M., Deller M. C., Britt R. D., and Marletta M. A. (2017) The role of the secondary coordination sphere in a fungal polysaccharide monooxygenase. ACS Chem. Biol. 12, 1095–1103 10.1021/acschembio.7b00016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Forsberg Z., Røhr A. K., Mekasha S., Andersson K. K., Eijsink V. G., Vaaje-Kolstad G., and Sørlie M. (2014) Comparative study of two chitin-active and two cellulose-active AA10-type lytic polysaccharide monooxygenases. Biochemistry 53, 1647–1656 10.1021/bi5000433 [DOI] [PubMed] [Google Scholar]
- 17. Agger J. W., Isaksen T., Várnai A., Vidal-Melgosa S., Willats W. G., Ludwig R., Horn S. J., Eijsink V. G., and Westereng B. (2014) Discovery of LPMO activity on hemicelluloses shows the importance of oxidative processes in plant cell wall degradation. Proc. Natl. Acad. Sci. U.S.A. 111, 6287–6292 10.1073/pnas.1323629111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Vu V. V., Beeson W. T., Span E. A., Farquhar E. R., and Marletta M. A. (2014) A family of starch-active polysaccharide monooxygenases. Proc. Natl. Acad. Sci. U.S.A. 111, 13822–13827 10.1073/pnas.1408090111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Couturier M., Ladevèze S., Sulzenbacher G., Ciano L., Fanuel M., Moreau C., Villares A., Cathala B., Chaspoul F., Frandsen K. E., Labourel A., Herpoël-Gimbert I., Grisel S., Haon M., Lenfant N., et al. (2018) Lytic xylan oxidases from wood-decay fungi unlock biomass degradation. Nat. Chem. Biol. 14, 306–310 10.1038/nchembio.2558 [DOI] [PubMed] [Google Scholar]
- 20. Aachmann F. L., Sørlie M., Skjåk-Braek G., Eijsink V. G., and Vaaje-Kolstad G. (2012) NMR structure of a lytic polysaccharide monooxygenase provides insight into copper binding, protein dynamics, and substrate interactions. Proc. Natl. Acad. Sci. U.S.A. 109, 18779–18784 10.1073/pnas.1208822109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Courtade G., Wimmer R., Rohr Å. K., Preims M., Felice A. K., Dimarogona M., Vaaje-Kolstad G., Sørlie M., Sandgren M., Ludwig R., Eijsink V. G., and Aachmann F. L. (2016) Interactions of a fungal lytic polysaccharide monooxygenase with β-glucan substrates and cellobiose dehydrogenase. Proc. Natl. Acad. Sci. U.S.A. 113, 5922–5927 10.1073/pnas.1602566113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Frandsen K. E., Simmons T. J., Dupree P., Poulsen J. C., Hemsworth G. R., Ciano L., Johnston E. M., Tovborg M., Johansen K. S., von Freiesleben P., Marmuse L., Fort S., Cottaz S., Driguez H., Henrissat B., Lenfant N., Tuna F., Baldansuren A., Davies G. J., Lo Leggio L., and Walton P. H. (2016) The molecular basis of polysaccharide cleavage by lytic polysaccharide monooxygenases. Nat. Chem. Biol. 12, 298–303 10.1038/nchembio.2029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Simmons T. J., Frandsen K. E. H., Ciano L., Tryfona T., Lenfant N., Poulsen J. C., Wilson L. F. L., Tandrup T., Tovborg M., Schnorr K., Johansen K. S., Henrissat B., Walton P. H., Lo Leggio L., and Dupree P. (2017) Structural and electronic determinants of lytic polysaccharide monooxygenase reactivity on polysaccharide substrates. Nat. Commun. 8, 1064 10.1038/s41467-017-01247-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Vaaje-Kolstad G., Houston D. R., Riemen A. H., Eijsink V. G., and van Aalten D. M. (2005) Crystal structure and binding properties of the Serratia marcescens chitin-binding protein CBP21. J. Biol. Chem. 280, 11313–11319 10.1074/jbc.M407175200 [DOI] [PubMed] [Google Scholar]
- 25. Borisova A. S., Isaksen T., Dimarogona M., Kognole A. A., Mathiesen G., Várnai A., Røhr Å. K., Payne C. M., Sørlie M., Sandgren M., and Eijsink V. G. (2015) Structural and functional characterization of a lytic polysaccharide monooxygenase with broad substrate specificity. J. Biol. Chem. 290, 22955–22969 10.1074/jbc.M115.660183 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Lo Leggio L., Simmons T. J., Poulsen J.-C., Frandsen K. E., Hemsworth G. R., Stringer M. A., von Freiesleben P., Tovborg M., Johansen K. S., De Maria L., Harris P. V., Soong C.-L., Dupree P., Tryfona T., Lenfant N., et al. (2015) Structure and boosting activity of a starch-degrading lytic polysaccharide monooxygenase. Nat. Commun. 6, 5961 10.1038/ncomms6961 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Isaksen T., Westereng B., Aachmann F. L., Agger J. W., Kracher D., Kittl R., Ludwig R., Haltrich D., Eijsink V. G., and Horn S. J. (2014) A C4-oxidizing lytic polysaccharide monooxygenase cleaving both cellulose and cello-oligosaccharides. J. Biol. Chem. 289, 2632–2642 10.1074/jbc.M113.530196 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Bennati-Granier C., Garajova S., Champion C., Grisel S., Haon M., Zhou S., Fanuel M., Ropartz D., Rogniaux H., Gimbert I., Record E., and Berrin J. G. (2015) Substrate specificity and regioselectivity of fungal AA9 lytic polysaccharide monooxygenases secreted by Podospora anserina. Biotechnol. Biofuels 8, 90 10.1186/s13068-015-0274-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Frandsen K. E. H., Poulsen J. N., Tandrup T., and Lo Leggio L. (2017) Unliganded and substrate bound structures of the cellooligosaccharide active lytic polysaccharide monooxygenase LsAA9A at low pH. Carbohydr. Res. 448, 187–190 10.1016/j.carres.2017.03.010 [DOI] [PubMed] [Google Scholar]
- 30. Berrin J. G., Rosso M. N., and Abou Hachem M. (2017) Fungal secretomics to probe the biological functions of lytic polysaccharide monooxygenases. Carbohydr. Res. 448, 155–160 10.1016/j.carres.2017.05.010 [DOI] [PubMed] [Google Scholar]
- 31. Spodsberg N., Shaghasi T., Sweeney M., Xu F., and Schnorr K. (May 1, 2014) Polypeptides having cellulolytic enhancing activity and polynucleotides encoding same. World Intellectual Property Organization (WIPO, PCT) Patent WO 2014/066141 A2
- 32. Spodsberg N. (September 3, 2015) Polypeptides having cellulolytic enhancing activity and polynucleotides encoding same. U. S. Patent US 2015/0247137 A1
- 33. Yakovlev I., Vaaje-Kolstad G., Hietala A. M., Stefańczyk E., Solheim H., and Fossdal C. G. (2012) Substrate-specific transcription of the enigmatic GH61 family of the pathogenic white-rot fungus Heterobasidion irregulare during growth on lignocellulose. Appl. Microbiol. Biotechnol. 95, 979–990 10.1007/s00253-012-4206-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Miyauchi S., Navarro D., Grisel S., Chevret D., Berrin J. G., and Rosso M. N. (2017) The integrative omics of white-rot fungus Pycnoporus coccineus reveals co-regulated CAZymes for orchestrated lignocellulose breakdown. PloS One 12, e0175528 10.1371/journal.pone.0175528 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Miyauchi S., Rancon A., Drula E., Hage H., Chaduli D., Favel A., Grisel S., Henrissat B., Herpoël-Gimbert I., Ruiz-Dueñas F. J., Chevret D., Hainaut M., Lin J., Wang M., Pangilinan J., et al. (2018) Integrative visual omics of the white-rot fungus Polyporus brumalis exposes the biotechnological potential of its oxidative enzymes for delignifying raw plant biomass. Biotechnol. Biofuels 11, 201 10.1186/s13068-018-1198-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Lombard V., Golaconda Ramulu H., Drula E., Coutinho P. M., and Henrissat B. (2014) The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 42, D490–D495 10.1093/nar/gkt1178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Petersen T. N., Brunak S., von Heijne G., and Nielsen H. (2011) SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods 8, 785–786 10.1038/nmeth.1701 [DOI] [PubMed] [Google Scholar]
- 38. Miyauchi S., Navarro D., Grigoriev I. V., Lipzen A., Riley R., Chevret D., Grisel S., Berrin J. G., Henrissat B., and Rosso M. N. (2016) Visual comparative omics of fungi for plant biomass deconstruction. Front. Microbiol. 7, 1335 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Krissinel E., and Henrick K. (2004) Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D Biol. Crystallogr. 60, 2256–2268 10.1107/S0907444904026460 [DOI] [PubMed] [Google Scholar]
- 40. Li X., Beeson W.T. 4th, Phillips C. M., Marletta M. A., and Cate J. H. (2012) Structural basis for substrate targeting and catalysis by fungal polysaccharide monooxygenases. Structure 20, 1051–1061 10.1016/j.str.2012.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Karkehabadi S., Hansson H., Kim S., Piens K., Mitchinson C., and Sandgren M. (2008) The first structure of a glycoside hydrolase family 61 member, Cel61B from Hypocrea jecorina, at 1.6 Å resolution. J. Mol. Biol. 383, 144–154 10.1016/j.jmb.2008.08.016 [DOI] [PubMed] [Google Scholar]
- 42. Garman E. F., and Grime G. W. (2005) Elemental analysis of proteins by microPIXE. Prog. Biophys. Mol. Biol. 89, 173–205 10.1016/j.pbiomolbio.2004.09.005 [DOI] [PubMed] [Google Scholar]
- 43. Blom N., Gammeltoft S., and Brunak S. (1999) Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J. Mol. Biol. 294, 1351–1362 10.1006/jmbi.1999.3310 [DOI] [PubMed] [Google Scholar]
- 44. Kadowaki M. A. S., Várnai A., Jameson J. K., T Leite A. E., Costa-Filho A. J., Kumagai P. S., Prade R. A., Polikarpov I., and Eijsink V. G. H. (2018) Functional characterization of a lytic polysaccharide monooxygenase from the thermophilic fungus Myceliophthora thermophila. PloS One 13, e0202148 10.1371/journal.pone.0202148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Kittl R., Kracher D., Burgstaller D., Haltrich D., and Ludwig R. (2012) Production of four Neurospora crassa lytic polysaccharide monooxygenases in Pichia pastoris monitored by a fluorimetric assay. Biotechnol. Biofuels 5, 79 10.1186/1754-6834-5-79 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Lo Leggio L., Weihe C. D., Poulsen J. N., Sweeney M., Rasmussen F., Lin J., De Maria L., and Wogulis M. (2018) Structure of a lytic polysaccharide monooxygenase from Aspergillus fumigatus and an engineered thermostable variant. Carbohydr. Res. 469, 55–59 10.1016/j.carres.2018.08.009 [DOI] [PubMed] [Google Scholar]
- 47. Kracher D., Andlar M., Furtmüller P. G., and Ludwig R. (2018) Active-site copper reduction promotes substrate binding of fungal lytic polysaccharide monooxygenase and reduces stability. J. Biol. Chem. 293, 1676–1687 10.1074/jbc.RA117.000109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zheng H., Chruszcz M., Lasota P., Lebioda L., and Minor W. (2008) Data mining of metal ion environments present in protein structures. J. Inorg. Biochem. 102, 1765–1776 10.1016/j.jinorgbio.2008.05.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Martínez-Abad A., Berglund J., Toriz G., Gatenholm P., Henriksson G., Lindström M., Wohlert J., and Vilaplana F. (2017) Regular motifs in xylan modulate molecular flexibility and interactions with cellulose surfaces. Plant Physiol. 175, 1579–1592 10.1104/pp.17.01184 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Bai Y., Chen B., Li M., Zhou Y., Ren S., Xu Q., Chen M., and Wang S. (2017) FPD: a comprehensive phosphorylation database in fungi. Fungal Biol. 121, 869–875 10.1016/j.funbio.2017.06.004 [DOI] [PubMed] [Google Scholar]
- 51. Klement E., and Medzihradszky K. F. (2017) Extracellular protein phosphorylation, the neglected side of the modification. Mol. Cell. Proteomics 16, 1–7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Yalak G., and Vogel V. (2012) Extracellular phosphorylation and phosphorylated proteins: not just curiosities but physiologically important. Sci. Signal. 5, re7 [DOI] [PubMed] [Google Scholar]
- 53. Tagliabracci V. S., Engel J. L., Wen J., Wiley S. E., Worby C. A., Kinch L. N., Xiao J., Grishin N. V., and Dixon J. E. (2012) Secreted kinase phosphorylates extracellular proteins that regulate biomineralization. Science 336, 1150–1153 10.1126/science.1217817 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Tagliabracci V. S., Wiley S. E., Guo X., Kinch L. N., Durrant E., Wen J., Xiao J., Cui J., Nguyen K. B., Engel J. L., Coon J. J., Grishin N., Pinna L. A., Pagliarini D. J., and Dixon J. E. (2015) A single kinase generates the majority of the secreted phosphoproteome. Cell 161, 1619–1632 10.1016/j.cell.2015.05.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. de Sá Pinheiro A. A., Amazonas J. N., de Souza Barros F., De Menezes L. F., Batista E. J., Silva E. F., De Souza W., and Meyer-Fernandes J. R. (2007) Entamoeba histolytica: an ecto-phosphatase activity regulated by oxidation-reduction reactions. Exp. Parasitol. 115, 352–358 10.1016/j.exppara.2006.09.019 [DOI] [PubMed] [Google Scholar]
- 56. Yalak G., Ehrlich Y. H., and Olsen B. R. (2014) Ecto-protein kinases and phosphatases: an emerging field for translational medicine. J. Transl. Med. 12, 165 10.1186/1479-5876-12-165 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Crane J. K., and Vezina C. M. (2005) Externalization of host cell protein kinase C during enteropathogenic Escherichia coli infection. Cell Death Differ. 12, 115–127 10.1038/sj.cdd.4401531 [DOI] [PubMed] [Google Scholar]
- 58. Karunarathna S. C., Yang Z. L., Zhao R.-L., Vellinga E. C., Bahkali A. H., Chukeatirote E., and Hyde K. D. (2011) Three new species of Lentinus from northern Thailand. Mycological Prog. 10, 389–398 10.1007/s11557-010-0701-6 [DOI] [Google Scholar]
- 59. Couturier M., Navarro D., Chevret D., Henrissat B., Piumi F., Ruiz-Dueñas F. J., Martinez A. T., Grigoriev I. V., Riley R., Lipzen A., Berrin J. G., Master E. R., and Rosso M. N. (2015) Enhanced degradation of softwood versus hardwood by the white-rot fungus Pycnoporus coccineus. Biotechnol. Biofuels 8, 216 10.1186/s13068-015-0407-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Grigoriev I. V., Nikitin R., Haridas S., Kuo A., Ohm R., Otillar R., Riley R., Salamov A., Zhao X., Korzeniewski F., Smirnova T., Nordberg H., Dubchak I., and Shabalov I. (2014) MycoCosm portal: gearing up for 1000 fungal genomes. Nucleic Acids Res. 42, D699–D704 10.1093/nar/gkt1183 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Badino S. F., Christensen S. J., Kari J., Windahl M. S., Hvidt S., Borch K., and Westh P. (2017) Exo-exo synergy between Cel6A and Cel7A from Hypocrea jecorina: role of carbohydrate binding module and the endo-lytic character of the enzymes. Biotechnol. Bioeng. 114, 1639–1647 10.1002/bit.26276 [DOI] [PubMed] [Google Scholar]
- 62. Gorrec F. (2009) The MORPHEUS protein crystallization screen. J. Appl. Crystallogr. 42, 1035–1042 10.1107/S0021889809042022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Kabsch W. (2010) Xds. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 10.1107/S0907444909047337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Vagin A., and Teplyakov A. (2010) Molecular replacement with MOLREP. Acta Crystallogr. D Biol. Crystallogr. 66, 22–25 10.1107/S0907444909042589 [DOI] [PubMed] [Google Scholar]
- 65. Bunkoczi G., and Read R. J. (2011) Improvement of molecular-replacement models with Sculptor. Acta Crystallogr. D Biol. Crystallogr. 67, 303–312 10.1107/S0907444910051218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Murshudov G. N., Skubak P., Lebedev A. A., Pannu N. S., Steiner R. A., Nicholls R. A., Winn M. D., Long F., and Vagin A. A. (2011) REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Biol. Crystallogr. 67, 355–367 10.1107/S0907444911001314 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Winn M. D., Ballard C. C., Cowtan K. D., Dodson E. J., Emsley P., Evans P. R., Keegan R. M., Krissinel E. B., Leslie A. G., McCoy A., McNicholas S. J., Murshudov G. N., Pannu N. S., Potterton E. A., Powell H. R., et al. (2011) Overview of the CCP4 suite and current developments. Acta Crystallogr. D Biol. Crystallogr. 67, 235–242 10.1107/S0907444910045749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Emsley P., and Cowtan K. (2004) Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 10.1107/S0907444904019158 [DOI] [PubMed] [Google Scholar]
- 69. Emsley P., Lohkamp B., Scott W. G., and Cowtan K. (2010) Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 10.1107/S0907444910007493 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Gille C., and Frommel C. (2001) STRAP: editor for STRuctural Alignments of Proteins. Bioinformatics 17, 377–378 10.1093/bioinformatics/17.4.377 [DOI] [PubMed] [Google Scholar]
- 71. Gille C., Fähling M., Weyand B., Wieland T., and Gille A. (2014) Alignment-Annotator web server: rendering and annotating sequence alignments. Nucleic Acids Res. 42, W3–W6 10.1093/nar/gku400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Katoh K., and Standley D. M. (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772–780 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Zhang Y., and Skolnick J. (2005) TMalign: a protein structure alignment algorithm based on the TMscore. Nucleic Acids Res. 33, 2302–2309 10.1093/nar/gki524 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Paiardini A., Bossa F., and Pascarella S. (2005) CAMPO, SCR_FIND and CHC_FIND: a suite of web tools for computational structural biology. Nucleic Acids Res. 33, W50–W55 10.1093/nar/gki416 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Janson G., Zhang C., Prado M. G., and Paiardini A. (2017) PyMod 2.0: improvements in protein sequence-structure analysis and homology modeling within PyMOL. Bioinformatics 33, 444–446 [DOI] [PubMed] [Google Scholar]
- 76. Grime G. W. (1996) The “Q factor” method: quantitative microPIXE analysis using RBS normalisation. Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms 109–110, 170–174 10.1016/0168-583X(95)00901-9 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





