Abstract
The sulfosugar sulfoquinovose (SQ) is produced by photosynthetic plants, algae, and cyanobacteria on a scale of 10 billion tons per annum. Its degradation, which is essential to allow cycling of its constituent carbon and sulfur, involves specialized glycosidases termed sulfoquinovosidases (SQases), which release SQ from sulfolipid glycoconjugates, so SQ can enter catabolism pathways. However, many SQ catabolic gene clusters lack a gene encoding a classical SQase. Here, we report the discovery of a new family of SQases that use an atypical oxidoreductive mechanism involving NAD+ as a catalytic cofactor. Three-dimensional X-ray structures of complexes with SQ and NAD+ provide insight into the catalytic mechanism, which involves transient oxidation at C3. Bioinformatic survey reveals this new family of NAD+-dependent SQases occurs within sulfoglycolytic and sulfolytic gene clusters that lack classical SQases and is distributed widely including within Roseobacter clade bacteria, suggesting an important contribution to marine sulfur cycling.
Introduction
Sulfur is essential for life, yet significant gaps remain in our understanding of sulfur cycling through the biosphere.1,2 These knowledge gaps are inspiring renewed efforts to discover and characterize the biochemical pathways that mineralize abundant, but poorly studied, organosulfur compounds. The new pathways provide archetypes that allow annotation of the dark matter of environmental metagenomic data sets and illuminate the pathways of sulfur utilization by microbial communities in diverse natural environments. Sulfoquinovose (SQ; 6-deoxy-6-sulfoglucose) is one such abundant but neglected organosulfur compound that has attracted significant attention in recent years. It forms the headgroup of plant and cyanobacterial sulfolipid, sulfoquinovosyl diacylglycerol,3 and it is estimated that 10 billion tons of SQ are biosynthesized and degraded each year.4 As such, it is a key species in the biogeochemical sulfur cycle. Microbial degradation of SQ occurs in all environments in which plant matter is found: by soil and marine bacteria,5,6 and even enteric bacteria within the gastrointestinal tract of herbivorous and omnivorous animals.7−9
The mineralization of SQ occurs through various sulfoglycolytic and sulfolytic pathways that enable utilization of its carbon and sulfur content (Figures 1a and S1, S2).10−12 Sulfoglycolytic pathways break the sugar chain to release C2 or C3 organosulfonates, which are substrates for secondary biomineralization pathways that catabolise these short-chain organosulfonates and release inorganic sulfite. Sulfolytic pathways cleave the C–S bond to release sulfite and provide glucose to fuel glycolysis. In all cases, these pathways require cleavage of the glycosidic bond present within sulfolipid or its delipidated form, sulfoquinovosyl glycerol (SQGro). Genes encoding sulfoquinovosidases (SQases) are diagnostic of SQ-utilizing gene clusters13,14 and are thus useful for discovery of new SQ degradation pathways,15 and for making functional inferences in (meta)genomic data sets.8
To date, only one class of SQase has been identified.13,14 It forms a subfamily of the carbohydrate active enzyme (CAZy) family of glycoside hydrolase 31 (GH31).16 Bioinformatic studies have shown that gene clusters encoding sulfoglycolytic and sulfolytic SQ degradation pathways often include genes encoding family GH31 SQases.15 However, some organisms harboring SQ degradation pathways lack GH31 SQases,6,17 raising questions as to how such organisms can utilize SQ glycosides. We recently reported two Arthrobacter spp. soil bacteria that contain a sulfoglycolytic Embden–Meyerhof–Parnas (sulfo-EMP) pathway but lack a candidate GH31 SQase, yet grow robustly on the SQ glycoside methyl α-sulfoquinovoside (SQMe) as sole carbon source.17 Here, we identify an alternative approach to sulfoquinovoside hydrolysis that is adopted by these organisms and discover a family of SQases belonging to a new glycoside hydrolase family (GH188). These new SQases operate through an oxidoreductive mechanism that results in net hydrolysis and involves a catalytic NAD+ cofactor. We show that these NAD+-dependent SQases adopt a complementary distribution to that of GH31 SQases and provide the missing SQase functionality in diverse sulfoglycolytic and sulfolytic gene clusters. Genes encoding these enzymes occur in bacteria, eukaryotes, and archaea and are especially abundant within marine bacteria.
Results
To confirm the ability of Arthrobacter sp. strain AK0117 to grow on SQ glycosides, we examined growth on the naturally occurring glycoside, SQGro, which is formed by delipidation of SQDG (through action of nonspecific sulfolipases). Arthrobacter sp. strain AK01 exhibited robust growth and produced only sulfolactate in culture media, showing that glycerol and half of the SQ molecule were consumed, and thus that SQGro is cleaved to give SQ and glycerol (Figures 1b and S3). Growth curves for Arthrobacter sp. strain AK01 on 5 mM Glc or SQGro achieved peak optical density approximately twice that of 5 mM SQ or SQMe, suggesting that Glc and SQGro function as 6-carbon substrates, while SQ and SQMe function as 3-carbon substrates. To identify the molecular basis of sulfoglycolysis in this organism, we grew strain AK01 on SQ to remove complicating effects from coincident glycerol catabolism with SQGro. Cultures were independently grown on SQ or glucose, and comparative proteomics was conducted. Proteins displaying increased abundance belong to the previously identified sulfo-EMP pathway, including SQ-sulfofructose (SF) isomerase, SF kinase, SF-1-phosphate (SFP) aldolase and sulfolactaldehyde dehydrogenase (Figure 1c,d). Also identified was a protein (LN996_0165) that belongs to the short-chain dehydrogenase/reductase (SDR) superfamily, a large grouping of nicotinamide-cofactor dependent oxidoreductases with activity on diverse substrates.18 Preliminary analysis revealed that sequence-related homologues were associated with other Arthrobacter spp. that possessed syntenic sulfo-EMP gene clusters. Based on data as described later, we named this enzyme SQ glycosidase, SqgA.
We synthesized several genes of homologues (Supplementary experimental) with codons harmonized for E. coli and screened their expression. FlSqgA from Flavobacterium sp. strain K172 was selected for further study. Optimization of the reaction conditions revealed that incubation of FlSqgA with DTT and NAD+ led to cleavage of SQGro to give SQ (Figure 2a). Experiments that involved the omission of DTT or NAD+ did not lead to the formation of SQ. Inclusion of Mn2+ led to an increase in product formation; however, kinetic analysis revealed no improvement in rate in the presence of Mn2+, and inclusion of EDTA caused a partial reduction of enzyme activity (Figure S4). Further analysis revealed that the small increase in product formation observed in the presence of Mn2+ was a result of improved enzyme stability; thus, Mn2+ was included in subsequent kinetic studies. These data show that FlSqgA is a metal-independent NAD+-dependent SQase. Michaelis–Menten parameters for SQGro were measured by liquid chromatography–mass spectrometry using a triple-quadrupole mass spectrometer. This showed kcat = 0.24 ± 0.02 s–1, KM = 3.7 ± 0.9 mM, and kcat/KM = 0.07 ± 0.02 mM–1 s–1 (Figure 2b). Another feature of NAD+-dependent glycosidases is the ability of many members to act on both α- and β-linked glycosides.19 Both 4-nitrophenyl α- and β-sulfoquinovosides (α-PNPSQ and β-PNPSQ) were substrates with the following Michaelis–Menten kinetic parameters: α-PNPSQ, kcat = 0.025 ± 0.002 s–1, KM = 2.9 ± 0.8 mM, and kcat/KM = 0.012 ± 0.001 mM–1 s–1; β-PNPSQ, kcat = 0.014 ± 0.002 s–1, KM = 1.7 ± 0.6 mM, and kcat/KM = 0.008 ± 0.001 mM–1 s–1 (Figure 2c,d). No activity was observed against 4-nitrophenyl α- and β-glucosides or 4-nitrophenyl β-glucuronic acid, showing that FlSqgA is specific for SQ glycosides (Figure S4). The Michaelis activation constant for NAD+ at a constant α-PNPSQ concentration was KA = 0.16 ± 0.02 mM (Figure S5). Two other orthologues, ArSqgA from Arthrobacter sp. U41 and CrSqgA from Cryobacterium sp. TMT2–4, also cleaved α-PNPSQ (Figure S6).
NAD+-dependent glycoside hydrolases have been described that belong to CAZy GH families 4, 109, 177, and 179.19 Among these, the best studied are those belonging to GH family 4, which are Mn2+-dependent and have been shown to operate through a multistep mechanism that involves oxidation at C3 to a ketone, elimination of the glycoside to give an α,β-unsaturated ketone, hydration and then reduction of the ketone to give the sugar hemiacetal.20 A diagnostic feature of this pathway is the incorporation of deuterium at C2 when the reaction is conducted in D2O, through hydration of the intermediate α,β-unsaturated ketone. A similar mechanism is invoked for enzymes of GH109, which like FlSqgA studied here, are metal-independent.21 We therefore incubated FlSqgA and the chromogenic glycoside 4-nitrophenyl α-sulfoquinovoside (α-PNPSQ) in D2O and analyzed the product by mass spectrometry. Upon exchange to H2O, the molecular ion ([M-H]−) increased by m/z of 1, consistent with incorporation of a nonexchangeable D atom. Fragmentation of the resulting [M-H]− ion with collision-induced dissociation supports the incorporation of D at C2 (Figure 3).
To gain insight into the molecular basis of catalysis, we determined the 3D structures of FlSqgA and ArSqgA using X-ray crystallography. Crystals of FlSqgA and ArSqgA were grown in the presence of NAD+ (or a ternary complex of ArSqgA with citrate and NAD+) and diffracted to 2.35 and 2.65 (or 1.95) Å, respectively (Tables S1 and S2). The structures reveal homodimers, consistent with molar mass analysis using size-exclusion chromatography-multiangle laser light scattering in solution (Figures 4a and S7–S9). SqgA proteins dimerize through hydrophobic and polar interactions over a nine-stranded, flat β-sheet surface that buries 7468 Å2 at the dimerization interface, corresponding to 22% of the monomer surface. The SqgA proteins adopt a two-domain fold with an N-terminal dinucleotide binding Rossmann domain and a C-terminal α/β dimerization domain. Clear, contiguous density was seen in both structures within a cleft between the two domains that allowed modeling of a single molecule of NAD+ bound in an extended conformation in each monomer with its nicotinamide ring projecting into a polar pocket.
Soaking and cocrystallization experiments provided structures of the ArSqgA·NAD+·SQ and FlSqgA·NAD+·SQ complexes, which diffracted to 2.4 and 2.3 Å, respectively (Figures 4b and S10). Both ternary complexes were essentially identical, with respect to the active site interactions. In the ArSqgA·NAD+·SQ complex, SQ is situated above the nicotinamide ring, with C3 of SQ 3.4 Å away from C4 of the nicotinamide ring, at an appropriate distance for hydride transfer and oxidation of C3 of an SQ glycoside. A range of specific interactions occur with the hydroxyl groups of SQ: C1–OH at 2.5 Å from His321 and a water molecule; C2-OH with Tyr136 (2.6 Å); C3–OH with His193 (2.7 Å) and Lys107 (2.6 Å); and C4–OH with Glu189 (2.6 Å; Figure 4b). The characteristic 6-sulfonate group of SQ engages in a triad of interactions in the ArSqgA·NAD+·SQ: one oxygen H-bonds to Arg166 (2.6 Å), a second to Lys172 (2.9 Å), and a third to the backbone amide of Leu170 (2.8 Å). Superposition of ArSqgA·NAD+ binary and ArSqgA·NAD+·SQ ternary complexes (backbone RMSD of 0.5 Å over 364 residues) revealed no major structural changes upon binding SQ. The active site was also largely unchanged with the notable exception of the Tyr136 residue in some of the individual protomer chains, which rotated 72° about the Cα–Cβ bond to engage with C2–OH of SQ in the ArSqgA·NAD+·SQ complex. All of these residues are conserved across the studied SqgA homologues (Figure S11).
Structural comparison of SqgA enzymes using the DALI server matched various oxidoreductases with high Z-score of >33, despite very low sequence similarity. These include an inositol-2-dehydrogenase (IDH) (PDB: 4MIE), backbone RMSD of 1.87 Å over 279 aligned residues and 19% sequence identity; a glucose-fructose/IDH/MocA-like oxidoreductase (PDB: 1ZH8) with backbone RMSD 2.1 Å over 313 residues and 24% sequence identity; and intriguingly, an NAD+-dependent N-acetylgalactosaminidase, AmGH109A, belonging to family GH109 (PDB ID: 6T2B with backbone RMSD 2.22 over 293 residues, 15% sequence identity). As noted earlier, GH109 glycosidases are NAD+-dependent but metal-independent enzymes, acting through an analogous oxidoreductive mechanism. In the case of AmGH109A, His404 is proposed to act as base/acid in the oxidation/reduction of C3, Tyr226 as general base/acid for proton transfer at C2, and His259 as general acid/base for cleavage of the glycosidic linkage and addition of water.22 Superposition of the ArSqgA·NAD+·SQ and AmGH109A·NAD+·GalNAc complexes shows that His259 (His193 in ArSqgA) and His404 (His321 in ArSqgA) are structurally conserved (Figure 4c). However, a difference occurs in the active site with respect to the location of Tyr226 of AmGH109A, where Glu163 occupies the equivalent position in ArSqgA; and Tyr136 hydrogen-bonds to C2–OH of SQ in our structure, implicating it in catalysis. Comparison of the superimposed structures of ArSqgA·NAD+·SQ with those of AmGH109A·NAD+·GalNAc and the Mn2+-dependent family GH4 member BglT·NAD+·G6P (from Thermotoga maritima, PDB: 1UP6) revealed conserved N-terminal domain folds, and cofactor and substrate binding sites, but differences in the C-terminal domain and active site interactions (Figures S12 and S13). Notably, BglT contains a well-defined Mn2+ coordination site that includes conserved cysteine (Cys162) and histidine (His192) residues, whereas both metal-independent enzymes lack a metal binding site at the equivalent position.
These NAD+-dependent SQases serve as prototypes for a new GH family. We used the C-terminal domain of AK01 (lacking the NAD+-binding domain) as a query to search for related sequences on the phmmer database that were used to build a hidden Markov model (HMM). Iterative searches and rebuilding of the HMM were conducted until a stable set of sequences was obtained containing approximately 1600 retrieved sequences (Figure S14). The retrieved sequences constitute a new GH family (GH188), which was studied further using sequence similarity network (SSN) analysis.23 We used analysis of network centrality24 to guide the selection of an appropriate alignment score for the SSN (Figures S15 and S16). At an alignment score of 133, this SSN breaks into clusters that align closely with taxonomy and showcases that these NAD+-dependent SQases are found in diverse bacteria, algae, plants, and a small number of archaea (Figure 5a). Eukaryotic members are present within agriculturally significant plants including Oryza sativa, Zea mays, Hordeum vulgare, Theobroma cacao, Nicotinia tabacam, and the red alga Gracilariopsis chorda. Bacterial representatives include Faecalicatena contorta, which has been isolated from human feces.25 We used the bacterial representatives to retrieve the genome neighborhood diagrams (open reading frame ± 10 around sqgA). Representative genome neighborhood diagrams reveal SqgA encoding genes are associated with sulfo-EMP, sulfo-ED, sulfo-SFT, sulfo-SMO, and sulfo-SDO pathways (Figures 5b,c, Table S3). These clusters lack a family GH31 SQase, and thus, the GH188 member will provide these organisms with a capacity to utilize SQ glycosides through these pathways. A significant observation is the near-universal occurrence of GH188 SQases in the sulfo-SMO gene clusters within marine Roseobacter clade bacteria (Figure S17). These organisms have previously been reported to lack GH31 SQases and were proposed to be restricted to growth only on the free sugar, SQ.6
Discussion
The ubiquity of SQ within photosynthetic tissues means that it is a major species in the biosulfur cycle. Its significance as a nutrient is highlighted by the evolution of a variety of sulfoglycolytic and sulfolytic pathways for SQ catabolism. Organisms that seek to utilize SQ are faced with the challenge of cleaving it from its glycoconjugates, which are the major forms present in the natural world. Previously, a single class of SQases were identified that belongs to CAZy family GH31.13,14 However, experimental work and bioinformatic analysis have identified many organisms that are capable of growth on SQ-glycosides and/or which lack a gene encoding a candidate GH31 SQase. Here, we discover a new family of NAD+-dependent SQases with a widespread distribution that is complementary to classical GH family 31 SQases, and which provides the missing enzymatic capability.
The new family of NAD+-dependent SQases described here possesses the ability to cleave α- and β-linked SQ glycosides, in contrast to family GH31 SQases, which are specific for α-SQ glycosides. The ability to cleave α- and β-linked glycosides is a consequence of an oxidoreductive mechanism (with a rate-limiting transition state that does not involve C1–O cleavage)20 that is distinct from that used by family GH31 glycosidases (involving C1–O cleavage at the rate-limiting transition state) (Figure 6).26 Although not investigated here, it is tempting to suggest that these anomer nonspecific SQases could allow the release of SQ from sulfolipid and related species (α-linked) and SQ-containing N-glycans (β-linked).3
Conclusions
Our work highlights that the distribution of NAD+-dependent SQases is complementary to that of classical GH31 SQases. We discover examples of NAD+-dependent SQase encoding genes within gene clusters of almost all known SQ degrading pathways, suggesting their importance, along with GH31 SQases, as gateway enzymes for SQ catabolism pathways. Of significance is the occurrence of NAD+-dependent SQases within Roseobacter clade bacteria. It has recently been argued that because Roseobacter clade bacteria lack a classical GH31 SQase, they cannot utilize SQ glycosides, and only utilize SQ, a very minor species within the overall budget of natural SQ-related molecules.6 In fact, essentially all Roseobacter clade bacteria investigated encode family GH188 SQases, highlighting that this new family of NAD+-dependent SQases contributes to SQ utilization within the marine environment. Collectively, the two classes of SQases that are now known provide a powerful bioinformatic signature for the identification of SQ degradation pathways and will support the survey of SQ degradation pathways across (meta)genomic data sets to provide a deeper understanding of the biogeochemical sulfur cycle.
Acknowledgments
This work was supported by the Australian Research Council (DP210100233, DP210100235, DP210100362, and DP230102668) and the National Health and Medical Research Council (NHMRC) (GNT2021638, GNT2000517). A.K. was supported by a Norma Hilda Schuster Scholarship. M.S. was funded by the Biotechnology and Biological Sciences Research Council (BB/W003805/1), I.B.P. by the European Research Council (ERC/951231) and G.J.D. acknowledges the Royal Society for the Ken Murray Research Professorship award. N.E.S. was supported by an Australian Research Council Future Fellowship (FT200100270). E.D.G.-B. acknowledges support from The Walter and Eliza Hall Institute of Medical Research; a Victorian State Government Operational Infrastructure support grant; and the Brian M. Davis Charitable Foundation Centenary Fellowship. We thank the Melbourne Mass Spectrometry and Proteomics Facility of The Bio21 Molecular Science and Biotechnology Institute for access to MS instrumentation. We thank Nicholas Terrapon for assistance with bioinformatics, Alexandre Tolotchkov for design and construction of the MicrobeMeter, Dr. Johan P. Turkenburg and Sam Hart for assistance with X-ray data collection, the Bioscience Technology Facility (University of York) for assistance with SEC-MALLS analysis, and the staff of the Diamond Light Source (U.K.) for provision of iO3 beamline facility under proposal number mx-32736.
Data Availability Statement
Atomic coordinate files and structure factors have been deposited in the Protein DataBank (PDB) with accession codes 8QC8 (FlSqgA·NAD), 8QC2 (FlSqgA·NAD·SQ), 8QC3 (ArSqgA·NAD), 8QC6 (ArSqgA·NAD·SQ) and 8QC5 (ArSqgA·NAD·citrate). Data collection and refinement statistics are presented in Supplementary Tables S1 and S2. PFAM codes for neighborhood genes from different sulfoglycolytic and sulfolytic pathways are presented in Supplementary Table S3. A list of organisms containing homologous sqgA genes used for bioinformatics analysis is provided in Supporting Information file 2. Proteomics data has been deposited to the PRIDE partner repository with the data set identifier: PXD043482. The family classification for NAD+-dependent SQases is available at www.cazy.org.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/jacs.3c11126.
The authors declare no competing financial interest.
Supplementary Material
References
- Tang K.; Liu L. Bacteria are driving the ocean’s organosulfur cycle. Trends Microbiol. 2023, 31, 772–775. 10.1016/j.tim.2023.05.003. [DOI] [PubMed] [Google Scholar]
- Durham B. P.; Boysen A. K.; Carlson L. T.; Groussman R. D.; Heal K. R.; Cain K. R.; Morales R. L.; Coesel S. N.; Morris R. M.; Ingalls A. E.; et al. Sulfonate-based networks between eukaryotic phytoplankton and heterotrophic bacteria in the surface ocean. Nat. Microbiol. 2019, 4, 1706–1715. 10.1038/s41564-019-0507-5. [DOI] [PubMed] [Google Scholar]
- Goddard-Borger E. D.; Williams S. J. Sulfoquinovose in the biosphere: occurrence, metabolism and functions. Biochem. J. 2017, 474, 827–849. 10.1042/BCJ20160508. [DOI] [PubMed] [Google Scholar]
- Harwood J. L.; Nicholls R. G. The plant sulpholipid - a major component of the sulphur cycle. Biochem. Soc. Trans. 1979, 7, 440–447. 10.1042/bst0070440. [DOI] [PubMed] [Google Scholar]
- Roy A. B.; Hewlins M. J.; Ellis A. J.; Harwood J. L.; White G. F. Glycolytic breakdown of sulfoquinovose in bacteria: a missing link in the sulfur cycle. Appl. Environ. Microbiol. 2003, 69, 6434–6441. 10.1128/AEM.69.11.6434-6441.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu L.; Chen X.; Ye J.; Ma X.; Han Y.; He Y.; Tang K. Sulfoquinovose is a widespread organosulfur substrate for Roseobacter clade bacteria in the ocean. ISME J. 2023, 17, 393–405. 10.1038/s41396-022-01353-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frommeyer B.; Fiedler A. W.; Oehler S. R.; Hanson B. T.; Loy A.; Franchini P.; Spiteller D.; Schleheck D. Environmental and intestinal phylum Firmicutes bacteria metabolize the plant sugar sulfoquinovose via a 6-deoxy-6-sulfofructose transaldolase pathway. iScience 2020, 23, 101510 10.1016/j.isci.2020.101510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanson B. T.; Dimitri Kits K.; Löffler J.; Burrichter A. G.; Fiedler A.; Denger K.; Frommeyer B.; Herbold C. W.; Rattei T.; Karcher N.; et al. Sulfoquinovose is a select nutrient of prominent bacteria and a source of hydrogen sulfide in the human gut. ISME J. 2021, 15, 2779–2791. 10.1038/s41396-021-00968-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Denger K.; Weiss M.; Felux A. K.; Schneider A.; Mayer C.; Spiteller D.; Huhn T.; Cook A. M.; Schleheck D. Sulphoglycolysis in Escherichia coli K-12 closes a gap in the biogeochemical sulphur cycle. Nature 2014, 507, 114–117. 10.1038/nature12947. [DOI] [PubMed] [Google Scholar]
- Snow A. J. D.; Burchill L.; Sharma M.; Davies G. J.; Williams S. J. Sulfoglycolysis: catabolic pathways for metabolism of sulfoquinovose. Chem. Soc. Rev. 2021, 50, 13628–13645. 10.1039/D1CS00846C. [DOI] [PubMed] [Google Scholar]
- Wei Y.; Tong Y.; Zhang Y. New mechanisms for bacterial degradation of sulfoquinovose. Biosci. Rep. 2022, 42, BSR20220314 10.1042/BSR20220314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ye Z.; Wei Y.; Jiang L.; Zhang Y. Oxygenolytic sulfoquinovose degradation by an iron-dependent alkanesulfonate dioxygenase. iScience 2023, 26, 107803 10.1016/j.isci.2023.107803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abayakoon P.; Jin Y.; Lingford J. P.; Petricevic M.; John A.; Ryan E.; Wai-Ying Mui J.; Pires D. E. V.; Ascher D. B.; Davies G. J.; et al. Structural and biochemical insights into the function and evolution of sulfoquinovosidases. ACS Cent. Sci. 2018, 4, 1266–1273. 10.1021/acscentsci.8b00453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Speciale G.; Jin Y.; Davies G. J.; Williams S. J.; Goddard-Borger E. D. YihQ is a sulfoquinovosidase that cleaves sulfoquinovosyl diacylglyceride sulfolipids. Nat. Chem. Biol. 2016, 12, 215–217. 10.1038/nchembio.2023. [DOI] [PubMed] [Google Scholar]
- Liu J.; Wei Y.; Ma K.; An J.; Liu X.; Liu Y.; Ang E. L.; Zhao H.; Zhang Y. Mechanistically diverse pathways for sulfoquinovose degradation in bacteria. ACS Catal. 2021, 11, 14740–14750. 10.1021/acscatal.1c04321. [DOI] [Google Scholar]
- Arumapperuma T.; Li J.; Hornung B.; Soler N. M.; Goddard-Borger E. D.; Terrapon N.; Williams S. J. A subfamily classification to choreograph the diverse activities within glycoside hydrolase family 31. J. Biol. Chem. 2023, 299, 103038 10.1016/j.jbc.2023.103038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaur A.; van der Peet P. L.; Mui J. W.; Herisse M.; Pidot S.; Williams S. J. Genome sequences of Arthrobacter spp. that use a modified sulfoglycolytic Embden-Meyerhof-Parnas pathway. Arch. Microbiol. 2022, 204, 193. 10.1007/s00203-022-02803-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kavanagh K. L.; Jörnvall H.; Persson B.; Oppermann U. Medium- and short-chain dehydrogenase/reductase gene and protein families: the SDR superfamily: functional and structural diversity within a family of metabolic and regulatory enzymes. Cell. Mol. Life Sci. 2008, 65, 3895–3906. 10.1007/s00018-008-8588-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jongkees S. A.; Withers S. G. Unusual enzymatic glycoside cleavage mechanisms. Acc. Chem. Res. 2014, 47, 226–235. 10.1021/ar4001313. [DOI] [PubMed] [Google Scholar]
- Yip V. L.; Varrot A.; Davies G. J.; Rajan S. S.; Yang X.; Thompson J.; Anderson W. F.; Withers S. G. An unusual mechanism of glycoside hydrolysis involving redox and elimination steps by a family 4 β-glycosidase from Thermotoga maritima. J. Am. Chem. Soc. 2004, 126, 8354–8355. 10.1021/ja047632w. [DOI] [PubMed] [Google Scholar]
- Liu Q. P.; Sulzenbacher G.; Yuan H.; Bennett E. P.; Pietz G.; Saunders K.; Spence J.; Nudelman E.; Levery S. B.; White T.; et al. Bacterial glycosidases for the production of universal red blood cells. Nat. Biotechnol. 2007, 25, 454–464. 10.1038/nbt1298. [DOI] [PubMed] [Google Scholar]
- Teze D.; Shuoker B.; Chaberski E. K.; Kunstmann S.; Fredslund F.; Nielsen T. S.; Stender E. G. P.; Peters G. H. J.; Karlsson E. N.; Welner D. H.; et al. The catalytic acid–base in GH109 resides in a conserved GGHGG loop and allows for comparable α-retaining and β-inverting activity in an N-acetylgalactosaminidase from Akkermansia muciniphila. ACS Catal. 2020, 10, 3809–3819. 10.1021/acscatal.9b04474. [DOI] [Google Scholar]
- Zallot R.; Oberg N.; Gerlt J. A. The EFI web resource for genomic enzymology tools: leveraging protein, genome, and metagenome databases to discover novel enzymes and metabolic pathways. Biochemistry 2019, 58, 4169–4182. 10.1021/acs.biochem.9b00735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hornung B. V. H.; Terrapon N. An objective criterion to evaluate sequence-similarity networks helps in dividing the protein family sequence space. PLOS Comp. Biol. 2023, 19, e1010881 10.1371/journal.pcbi.1010881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sakamoto M.; Iino T.; Ohkuma M. Faecalimonas umbilicata gen. nov., sp. nov., isolated from human faeces, and reclassification of Eubacterium contortum, Eubacterium fissicatena and Clostridium oroticum as Faecalicatena contorta gen. nov., comb. nov., Faecalicatena fissicatena comb. nov. and Faecalicatena orotica comb. nov.. Int. J. Syst. Evol. Microbiol. 2017, 67, 1219–1227. 10.1099/ijsem.0.001790. [DOI] [PubMed] [Google Scholar]
- Zechel D. L.; Withers S. G. Glycosidase mechanisms: Anatomy of a finely tuned catalyst. Acc. Chem. Res. 2000, 33, 11–18. 10.1021/ar970172+. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Atomic coordinate files and structure factors have been deposited in the Protein DataBank (PDB) with accession codes 8QC8 (FlSqgA·NAD), 8QC2 (FlSqgA·NAD·SQ), 8QC3 (ArSqgA·NAD), 8QC6 (ArSqgA·NAD·SQ) and 8QC5 (ArSqgA·NAD·citrate). Data collection and refinement statistics are presented in Supplementary Tables S1 and S2. PFAM codes for neighborhood genes from different sulfoglycolytic and sulfolytic pathways are presented in Supplementary Table S3. A list of organisms containing homologous sqgA genes used for bioinformatics analysis is provided in Supporting Information file 2. Proteomics data has been deposited to the PRIDE partner repository with the data set identifier: PXD043482. The family classification for NAD+-dependent SQases is available at www.cazy.org.