Abstract
Capsular polysaccharides are important virulence factors in pathogenic bacteria. Characterizing the structural components and biosynthetic pathways for these polysaccharides is key to our ability to design vaccines and other preventative therapies that target encapsulated pathogens. Many gram-negative pathogens such as Neisseria meningitidis and Escherichia coli express acidic capsules. The E. coli K15 serotype has been identified as both an enterotoxigenic and uropathogenic pathogen. Despite its relevance as a disease-causing serotype, the associated capsular polysaccharide remains poorly characterized. We describe in this report the chemical structure of the K15 polysaccharide, based on chemical analysis and nuclear magnetic resonance (NMR) data. The repeating structure of the K15 polysaccharide consists of 4)-α-GlcpNAc-(1 → 5)-α-KDOp-(2 → partially O-acetylated at 3-hydroxyl of GlcNAc. We also report, the organization of the gene cluster responsible for capsule biosynthesis. We identify genes in this cluster that potentially encode an O-acetyltransferase, an N-acetylglucosamine transferase, and a KDO transferase consistent with the structure we report.
Subject terms: Biochemistry, Carbohydrates, Glycobiology
Introduction
Capsular polysaccharides are major virulence factors of pathogenic Escherichia coli. These surface glycans form a coat exterior to most other surface structures such as lipopolysaccharides, protecting the bacteria from a multitude of environmental stress factors and serving as a barrier to the host immune responses1. The presence of specific polysaccharide capsules on the surfaces of some strains of bacteria that cause disease, make capsules, useful serological targets for monitoring and prevention of infection. Escherichia coli has more than 80 reported serologically distinct capsular polysaccharides, referred to as K-antigens2. Capsules in E. coli have been classified into 4 groups based on organization of the capsule synthesizing genes, mechanism of biosynthesis, and mode of transport across the membranes3. Groups 1 and 4 capsules use the Wzy-dependent pathway, while groups 2 and 3 make use of the ABC transporter dependent pathway of assembly and export.
Most capsules of pathogenic E. coli contain anionic glycan residues like hexuronic acid, N-acetyl neuraminic acid (NeuNAc), 2-keto-3-deoxy-D-mannooctulonic acid (KDO), and phosphodiesters4. Among these negatively charged components, KDO has only been identified in group 2 capsules. In capsular polysaccharide structures, KDO is usually present in a β-pyranosidic (β-KDOp, in E. coli K12, K16, K20, K23, etc.) and β-furanosidic forms (β-KDOf, in E. coli K74 and K95)5–9. The only known examples of KDO as α-KDOp in capsular polysaccharides were reported for E. coli K6 (LP1092) and K1610–13. On the other hand, α-KDOp is widely reported as part of the lipid A core component of the lipopolysaccharide (LPS). The genetic organization of several gene clusters encoding the biosynthesis of capsular polysaccharides containing KDO with putative glycosyltransferases have been described14. However, a description of the genes encoding the specific glycosyltransferases and O-acetyltransferase for synthesis of the K15 polysaccharides has not been reported. Therefore, determining the structure of these polysaccharides as well as the genetic organization of the biosynthetic machinery, is crucial to understanding infection and developing of effective treatments.
E. coli K15 is an enterotoxigenic K antigen type originally isolated from children with diarrhea and occurring at high frequency in combination with the O6 antigen and a mannose resistant hemagglutinin15,16. Orskov et al. demonstrated, in a large study of enteropathogenic E. coli strains from adults and children from widespread geographic locations, that some O and K serotypes frequently occurred together17,18. They suggested that these strains represent clones which have adapted to growth in the small intestine. A structure was reported for the E. coli K15 polysaccharide without experimental details of how this structure was determined nor the source the polysaccharide7. Significantly, this reported structure differs from our findings. In this study, we describe structural analysis of capsular polysaccharide isolated from E. coli str. F8316/41 (O6:K15:H16). We also report the partial characterization of the gene cluster responsible for biosynthesis of the K15 capsular polysaccharide in the same strain of E. coli K15. This gene cluster is very similar to that of E. coli K15 strain 53619 whose polysaccharide structure has not been reported.
Results and discussion
Composition of the K15 polysaccharide
The composition of the K15 capsular polysaccharide was determined by a combination of chemical degradation methods and analysis of its one-dimensional carbon nuclear magnetic resonance (1D 13C-NMR) spectrum. The polysaccharide is composed of two monosaccharides, N-acetylglucosamine (GlcNAc) and KDO. N-acetylglucosamine and KDO were detected exclusively in acid methanolysates by gas-chromatography mass-spectrometry (GC–MS) as trimethylsilyl (TMS) derivatives. The identity of the hexosamine in the polysaccharide was confirmed by ninhydrin degradation of an acid hydrolysate. The product of the ninhydrin degradation was identified chromatographically as arabinose, the expected degradation product for glucosamine20. KDO was identified by acid hydrolysis of the carboxyl reduced polysaccharide, conversion to alditol acetates and analysis by GC–MS for characteristic fragmentation patterns. These composition results are in agreement with the chemical shift location and number of carbon signals observed in 1-D 13C -NMR spectra (Fig. 1). Treatment of the native polysaccharide (Fig. 1A) with dilute sodium hydroxide results in loss of methyl and carbonyl carbon signals, simplifying the spectrum (Fig. 1B). The alkali treated polysaccharide has two chemical shifts in the anomeric region at 98.5 and 100 ppm, two carbonyl chemical shifts at 174.46 and 174.83 ppm and a single methyl chemical shift at 22.09 ppm. The loss of some methyl and carbonyl carbon signals under alkaline conditions with a corresponding simplification of the spectrum is characteristic of an O-acetylated polysaccharide21.
The presence of only two resonances of similar intensities in the anomeric region of the 13C -NMR spectra of the de-O-acetylated polysaccharide suggests a disaccharide repeat unit. The reduction of the number of signals after de-O-acetylation to 16 and their chemical shift positions are expected for a disaccharide repeat unit composed of an N-acetylhexosamine and a deoxy acidic sugar with eight carbons, i.e., KDO.
An oligosaccharide consisting of KDO and GlcNAc was isolated by selective cleavage at the acid labile KDO linkage with 1% acetic acid. This oligosaccharide eluted on a BioGel P2 gel filtration as a disaccharide, which is in agreement with the suggestion of a disaccharide repeat unit from the NMR evidence above. Methylation analysis of the purified oligosaccharide yielded only 3,4,6-O-methyl-N-acetylglucosamine, as expected for a terminal GlcNAc, indicating that the repeat unit consists of a disaccharide of GlcNAc and KDO.
Substitution of repeat unit
The substitution pattern of the monosaccharides in the polysaccharide repeat unit was determined by two dissimilar methylation analysis experiments, due to the difference in lability of GlcNAc and KDO. The GlcNAc linkage was determined by permethylation of the polysaccharide and subsequent analysis by GC–MS of the resulting alditol acetate, which yielded only 3,6-O-methyl-N-acetylglucosamine, suggesting that this glycan is 4-substituted. A permethylated KDO residue was not conveniently detected during this analysis, thus another approach was taken. The permethylated K15 was treated with LiBD4 to reduce the carboxylate group, and subsequently hydrolyzed with TFA and converted to alditol acetates before GC–MS. The major fraction detected in this analysis corresponded to a 2,5 linked KDO suggesting that the KDO is 5-substituted (Fig. 2). This was further confirmed by the alkali lability of the periodate oxidized polysaccharide. The K15 polysaccharide is oxidized by periodate with concomitant loss of KDO but remains polymeric. The oxidized product is depolymerized by mild alkali, suggesting a β-elimination reaction. Depolymerization was easily followed by TLC, which showed the disappearance of immobile polysaccharide and the appearance of a component with RfGlcNAc = 0.75. Thus, the chemical analysis indicates that the repeat unit of the K15 CPS is a disaccharide of 4-substituted GlcNAc and 5-substituted KDO.
The analysis above was confirmed by NMR experiments. We used the following process to confirm the composition and substitution of the repeat unit. The HSQC-TOCSY22,23 experiment produces signals from single-bonded CHs correlated to 1H signals from Hs belonging to the same ring as the CH. Since there are multiple carbon atoms in each ring, this experiment yields redundant information that allows us to confirm 1Hs assignments. An overlay of the 1H–13C HSQC and HSQC-TOCSY (Fig. 3A), allows assignment of the 1H and 13C chemical shifts in the KDO and GlcNAc rings, as follows. In the HSQC-TOCSY, cross-peak intensities originating from a given 1H decrease as the number of bonds between the 1Hs increases, thus suggesting tentative assignments based on relative intensities. For instance, all 1H resonances in the GlcNAc ring can be identified and assigned tentatively from the 13C slice corresponding to G-C1 (Fig. 3A). The anomeric G1 is found at ~ (98.6, 5.07) ppm in 13C and 1H, respectively (Table 1). From G1 it is possible to use the HSQC-TOCSY to provisionally assign all 1H resonances through G5. Signals from H5-C5 and H4-C4 have lower intensity than those from H3-C3 and H2-C2. Similarly, the characteristic signals for the equatorial and axial H3′s in KDO, H3e,a, respectively in KDO24,25, allow tentative assignment of H4 and H5 at the C3 slice (~ 34.7 ppm).
Table 1:
Bond | K15(-OAc) | K15(G3 + OAc) | ||
---|---|---|---|---|
δC (ppm) | δH (ppm) | δC (ppm) | δH (ppm) | |
G1 | 98.55 | 5.070 | 98.46 | 5.127 |
G2 | 53.97 | 3.988 | 52.05 | 4.219 |
G3 | 71.25 | 3.769 | 74.20 | 5.132 |
G4 | 70.25 | 3.589 | 67.84 | 3.796 |
G5 | 70.41 | 4.257 | 70.14 | 4.369 |
G6' | 61.1 | 3.387 | 60.5 | 3.400 |
G6′' | 61.1 | 3.692 | 60.5 | 3.708 |
CH3(NAc) | 22.09 | 2.083 | 21.88 | 2.027 |
C(NAc) | 174.46 | – | 174.36 | – |
CH3(OAc) | – | – | 20.34 | 2.103 |
CO(OAc) | – | – | 173.76 | – |
K1 | 174.83 | – | 174.81 | – |
K2 | 100.01 | – | 99.95 | – |
K3e | 34.75 | 1.887 | 34.67 | 1.946 |
K3a | 34.75 | 2.098 | 34.67 | 2.125 |
K4 | 65.92 | 4.167 | 66.02 | 4.187 |
K5 | 75.1 | 4.037 | 75.25 | 4.079 |
K6 | 72.18 | 3.612 | 72.30 | 3.592 |
K7 | 69.2 | 3.891 | 69.2 | 3.891 |
K8' | 63.6 | 3.636 | 63.7 | 3.576 |
K8′' | 63.6 | 3.915 | 63.7 | 3.923 |
To confirm the initial assignments and complete the remaining ones, we added to our analysis a low-pass filtered HMBC26,27, which produces only long-range 1H–13C correlations. Figure 3B shows an overlay of the HSQC (red) with the HMBC (blue) spectra. The presence of peaks depends on both, the 1H–13C couplings between each specific pair of atoms and multiple nuclear relaxation rates. Consequently, not all potential signals are observed. For instance, there is no cross-peak for (GH1- GC2) but there is one between a related pair of nuclei, GH2 and GC1. The HMBC also provided evidence of the G-K linkage: a cross-peak between G1 and K5, indicated a G1-K5 linkage between the glycan residues. In addition, the 13C chemical shift assignments of K2 (δC = 100.0 ppm) and N-acetyl carbonyl group (δC = 174.5 ppm) are confirmed through the cross-peaks K(H3e, C2) and G(CH3, C7), respectively. This indicates that the other carbonyl peak (δC = 174.8 ppm) corresponds to K1.
The analysis continued with the 1H assignments of the H3a and H3e of KDO in the K15 polysaccharide using a 1H–13C high-resolution HSQC at 70 °C, which allowed determination of 1H–1H couplings between K-H3a/e and K-H4 (Fig. 4). H4 is axial in KDOp, the resulting 3JH3aH4 is large (H4-C4-C3-H3a torsion of ~ 174°) and similar to 2JH3aH3e in magnitude, while 3JH3eH4 is small (H4-C4-C3-H3e torsion of ~ 56°) compared to 2JH3aH3e. Thus, K-H3e displays a doublet at 2.10 ppm (3JHH = 12.6 Hz) while K-H3a results in a triplet at 1.89 ppm (3JHH = 13.0 Hz).
Multiple NMR parameters are indicative of the anomeric configuration of pyranose KDO (KDOp, Fig. 5A), most notably the δH of the axial and equatorial H3 (H3a and H3e, respectively)28, and the C-H three-bond coupling (3JCH) between H3a and C129. From δH analysis, it has been observed that a reliable criterion for distinguishing αKDO and βKDO is the signal separation between δH3a and δH3e (or Δ(δH3a, δH3e))13; for αKDO Δ(δH3a, δH3e) < 0.4 ppm, while for βKDO Δ(δH3a, δH3e) > 0.4 ppm (Supplementary Table S1). For K15 Δ(δH3a, δH3e) = 0.21 ppm, further validating that KDO is in the α-configuration in K15. Based on a representative survey (Supplementary Table S1), we propose that the same conclusion can be reached using only the δH3e, as for all reported values it is verified that δH3e(αKDO) < 2.3 ppm and δH3e(βKDO) > 2.3 ppm. For K15 the δH3e signal is ~ 2.1 ppm, well below the 2.3 ppm threshold, further confirming the configuration as αKDO.
The 3JCH criteria is based on the fact that the H3a-C3-C2-C1 torsion for the β-configuration is ~ 166°, which results in a 3JCH coupling of ~ 6 Hz. For the α-configuration the same torsion is ~ 60°, yielding a K(H3a-C1) 3JCH coupling of < 1 Hz (Fig. 5B). The torsion between H3e and C1 is about the same (~ 60°) for both α- and β-KDO, thus, the 3JCH value for the H3e-C3-C2-C1 torsion is not informative for establishing the α or β anomeric configuration in KDO. For the K15 PS, the KC1 (174.8 ppm) and KC2 (100.0 ppm) 13C 1D signals (Fig. 5C) are broad preventing a precise measurement of the couplings, however, these peaks are similar to those in Fig. 5B arising from αKDO-C1 (176.8 ppm) and C2 (96.4 ppm), and dissimilar to those arising from βKDO-C1 (175.5 ppm) and C2 (97.4 ppm) signals, providing independent confirmation of the presence of α-KDO in the K15 PS.
The δH of ~ 5.1 ppm indicates that GlcNAc is in the α-configuration, as δH ranges between 4.4 and 4.8 ppm for β-GlcNAc-H130,31. The measured one-bond coupling (1JCH) for GlcNAc-C1H1 is ~ 172 Hz (Fig. 5C), also indicating that the GlcNAc ring is in the α configuration, expected 1JCH values for β-GlcNAc-C1H1 are about 161 Hz32.
The native (O-acetylated) PS shows several NMR signals that cannot be directly identified by comparison between the native and de-O-acetylated NMR spectra (Supplementary Fig. S2A). Signals for the native PS, were completely assigned running the same set of experiments as for the de-O-acetylated sample and indicated that G-C3 is O-acetylated. O-acetylation deshields the G-H3 signal by ~ 1.35 ppm, by far the most pronounced change in δH for K15 resulting from O-acetylation. The δH at ~ 5.1 ppm of G-H3-OAc is typical for O-acetylated positions33.
Table 1 contains the 1H and 13C NMR assignments for de-O-acetylated K15, and the major population of native K15. Except for the O-acetylated G3 and its immediate neighbors, few chemical shifts differ for the native and de-O-acetylated forms.
To obtain an approximate O-acetylation ratio in the K15 CPS, we ran a long 1D inverse-gated 13C experiment to quantitate, by integration, isolated NMR signals of the O-acetylated PS and compare it with signals comprising total PS. Supplementary Fig. S2B shows this comparison, illustrating that ⪞ 60% of GlcNAc residues are O-acetylated at the C3 position. Altogether, the data indicate that the K15 repeating unit consists of the structure in Scheme 1.
Sequencing the E. coli strain F8316/41(O6:K15:H16) and capsule gene cluster identification
The genetic organization of the capsular gene cluster (in pathogenic island V) of E. coli strain 536 (O6:K15:H31) was previously reported to be a combination of structural features from group 2 and 3 capsule determinants19. However, the lack of insight into the capsule specific genes in region 2 and the unusual organization of the strain 536 gene cluster, prompted us to study the capsular gene cluster from E. coli strain F8316/41 (O6:K15:H16), from which the capsular polysaccharide was isolated for determining the structure. The chromosomal DNA isolated from E. coli strain F8316/41 was sequenced using single-molecule real-time (SMRT) sequencing technology34 yielding five contigs (GenBank accession numbers: JAADZB010000001-JAADZB010000005). The annotated DNA contigs were screened for gene markers reported to exist in proximity to Group 2 and Group 3 capsular gene clusters of E. coli that use the ABC transporter pathway, namely pgk (gene encoding phosphoglycerate kinase), pheV (gene encoding phenylalanine tRNA)and serA (gene encoding phosphoglycerate dehydrogenase)35–37. All these gene markers along with a second 22 bp truncated copy of pheV (pheV’) were identified in the second contig (GenBank accession numbers are provided in Supplementary Table S2). We also observed that 20 kb of DNA downstream to pheV’, contained open reading frames (ORF’s) that closely resembled K15 capsular gene cluster from E. coli 53619 (GenBank accession number: AJ617685) with a few variations. Given the relatively high error rate of long-read SMRT platforms we amplified 20 specific areas of interest within the gene cluster. Sanger sequencing of the amplified DNA segments identified two misreads in the SMRT sequencing data. Correcting these misreads resulted in a capsule gene cluster which had 99.6% sequence identity with the E. coli 536 gene cluster19.
The K15 capsular gene cluster from E. coli F8316/41 (GenBank accession number: NDK77120.1-NDK77133.1), when screened against the nucleotide collection (nt) database using BLASTN identified 13 different strains of E. coli carrying a similar gene cluster in their genome38 Of these, genomes of 7 strains were found to be annotated (NCTC11105, FORC_031, FMU073332, 536, 743, MRY15-131 and MRY15-117) and were further analyzed and compared against F8316/41. While all the 7 strains were enterobacterial, MRY15-131 and MRY15-117 were isolated from Bos taurus. The capsular gene cluster from all the 8 strains compared earlier belongs to a composite, pheV-associated pathogenicity island (PAI). A PAI represents a mosaic-like gene structure with multiple functional and fragmented mobile genetic elements39. Fragments of pheV-PAI are virulence-associated and highly homologous to chromosomal regions of other entero- and uropathogenic E. coli40. While DNA sequence and the virulence-associated fragments are a perfect match in pheV-PAIs of E. coli F8316/41 and E. coli NCTC11105 (99.97% identity), they are distinct from the remaining E. coli strains listed earlier. Another major distinction between E. coli F8316/41, E. coli NCTC11105 and the rest of the K15 capsular gene cluster containing E. coli strains is the relative position of serA, pgk , pheV and K15 capsular gene cluster. In E. coli F8316/41 and E. coli NCTC11105, serA and pgk genes are located within the pheV-PAI with the pheV gene placed 44,010 and 58,008 bp upstream to pgk and serA respectively, while the K15 gene cluster is 32,765 bp downstream to the serA gene. On the other hand, the pheV-PAI in the remaining six E. coli strains (FORC_031, FMU073332, 536, 743, MRY15-131 and MRY15-117) is located upstream to pgk which is in turn located upstream to the serA gene.
The K15 capsular gene cluster in E. coli F8316/41 is flanked by genes yeeUV (type IV toxin-antitoxin family)41 and gspM-C (type II secretion system)42. Presence of type II secretion system downstream to capsule gene cluster is the only common feature among all K15 E. coli strains discussed here. Pix fimbria-encoding gene cluster and the phosphoglycerate transporter system identified in the pheV-PAI of E. coli 53619 are absent from the pheV-PAI of E. coli F8316/41. Further in-depth discussion on the composition of E. coli F8316/41 pheV-PAI is not in the scope of this article. In summation of findings from the genetic analysis section, we identified the K15 capsular gene cluster from E. coli F8316/41 as a part of pheV associated pathogenicity island. Although the location of PAI in proximity to serA gene is consistent with observations from other E. coli strains, a different order of arrangement and orientation of genes in PAIs of E. coli F8316/41 and NCTC11105 strains point to a divergent evolution from the base E. coli genome. As E. coli F8316/41 is known to be entero- and uropathogenic, a detailed structural study of PAIs and their genomic elements can help understand the origins of virulence and their role in development of disease.
Organization of the E. coli K15 strain F8316/41 capsule gene cluster
The arrangement of genes in E. coli K15 capsule gene cluster from strain F8316/41 (Fig. 6) shares similarities with other Group 2 capsule gene clusters of the ABC transporter dependent pathway43. The gene cluster is made up of three regions, with region 1 containing a total of seven ORFs (kpsF, -E, -D, -U, -C’ and two unknown putative ORFs). The first five ORFs have a high degree of sequence similarity to other region 1 genes from group 2 capsule forming E. coli, except for kpsC’, which is a truncated version of kpsC. KpsC’ was reported to be a non-functional gene, inactivated due to truncation.
Regions 2 and 3 are transcribed as a single polycistronic mRNA in the opposite direction to that of region 1, with a JUMPstart sequence located upstream of Region 344. Region 3 codes for KpsM and KpsT, that are homologous to their counterparts of other group 2 capsules. Region 2 is made-up of seven ORFs and flanked by regions 1 and 3. ORFs 1 through 5 are capsule specific genes followed by kpsC and kpsS that are usually grouped with region 1 genes in group 2 capsule gene clusters. However, these two genes are described to be homologues to kpsC and S from group 3 capsule forming E. coli (K10), and are essential for K15 capsule production19. A low G + C content (30.9%) of regions 2 and 3 compared to region 1 (50.9%) of K15 capsular gene cluster was used as evidence to hypothesize origin of these DNA regions from different sources19. Functional analysis of capsule specific genes in region 2 was initially performed using a BLASTP search, which identified ORF 1 as a glycosyltransferase and ORF 5 as an α/β hydrolase and the remaining three ORFs as hypothetical E. coli proteins without any known functions. As region 2 genes are known to be capsule structure specific, using the K15 PS structure described in the earlier sections, we hypothesized the presence an α-KDOp transferase, an α-GlcpNAc transferase and an O-acetyltransferase among the five ORFs. Although capsules with acetylation and α-GlcpNAc units are common, α-KDOp is unusual with only two other capsules reported10,13.
Enzymes that catalyze capsular O-acetylation in bacteria can be classified into one of the following two protein families45. The first one is a hexapeptide repeat family, that contains tandem repeats of (LIV)(GAED)X2(STAV)X consensus sequence and folds into a left-handed β-helix domain (also termed LβH family)46. Members of this family include CssF (both OatW and OatY)47 from N. meningitidis serogroups W and Y48, NeuO from E. coli K149 and NeuD from S. agalactiae50. In addition to the characteristic hexapeptide repeat, enzymes of this family arrange into homotrimers with an active site at each interface. The second family of capsular O-acetyltransferases is α/β-hydrolase family. They feature a Ser-Asp-His catalytic triad with a conformationally strained serine located in a conserved nucleophile elbow motif (GXSXGG). Previously reported members of this family are CssE (OatC) and CsaC (SacC/MynC) from N. meningitidis serogroups C and A respectively47,51. Although crystal structure data are lacking for the α/β-hydrolase family, both CssE and CsaC have been thoroughly investigated and reported to catalyze O-acetylation of their respective capsular polysaccharide. Upon bioinformatics analysis (BLASTP, pfam, InterPro, etc.)52–54, of the five serogroup specific ORFs in region 2 of K15 capsular gene cluster, we identified ORF5 (GenBank accession number: NDK77127.1) to contain an α/β hydrolase fold (pfam-A family: UPF0227 and clan: CL0028, InterPro homologous family: IPR029058, CATH code: 3.40.50.1820) ORF5 encodes a 315 amino acid protein. Sequence alignment with CssE and CsaC (Supplementary Fig. S3) further highlighted a highly conserved catalytic triad composed of Ser-179, Asp-253 and His-281 in addition to the presence of Ser-179 in a conserved nucleophile elbow motif (GGS179MGG). α/β hydrolases are a versatile family of enzymes that primarily catalyze hydrolytic reactions through a double-displacement mechanism (hydrolases, thioesterases, haloperoxidases, halogenases etc.). There is considerable structural and experimental evidence for α/β hydrolase family catalyzing O-acetylation reaction using a Ser-Asp-His catalytic triad (homoserine O-acetyltransferase from H. influenza)55,56. In spite of the sequence similarity between ORF5, OatC and SacC being around 30%, grouping of the three proteins into the same family via bioinformatics analysis, presence of a highly conserved catalytic triad, nucleophile elbow motif, and finally the absence of O-acetylation related protein structures in other region 2 ORFs is consistent with ORF5 being the capsular O-acetyltransferase.
Identification of specific glycosyl transferases in region 2 for transfer of GlcNAc and KDO was done by searching for capsules from other bacteria, that contained monosaccharide units and glycosidic linkages similar to the K15 PS. In this process, the CPS from Actinobacillus pleuropneumoniae serogroup 5a str. J45 (with 6)-α-D-GlcpNAc-(1 → 5)-β-KDOp-(2 → repeating units) was identified containing a GlcpNAc linked to the C5 of KDOp, in its α-anomeric conformation57. In addition to this CPS structural similarity the capsular gene cluster of A. pleuropneumoniae shares a high degree of homology with ABC transporter dependent capsule export genes from H. influenzae type b, N. meningitidis group B, and E. coli58. The capsule specific region 2 of A. pleuropneumoniae serogroup 5a contains four genes, cps5A, 5B, 5C and 5D (sequence AF053723.1)59, of which only cps5A, (GenBank accession number: AAC26630.1) classified as a putative glycosyltransferase exhibited a 46.2% sequence similarity with ORF1 (GenBank accession number: NDK77131.1) of K15 E .coli. This observation along with the absence of similarities between any other region 2 genes of either organisms suggests that ORF1 might encode for an α-GlcpNAc transferase. Bioinformatic analysis using primary protein sequence predicted ORF1 to be a member of GT4 family of glycosyltransferases (pfam-A family: Glyco trans 1_4 and clan: CL0113, CDD: cd03801-GT4 PimA-like, CATH code: 3.40.50.2000). As supporting evidence to this hypothesis, a characteristic signature motif, typical to retaining α-glycosyltransferases was observed in the C-terminal end of ORF1 (S211PYEGGPACLPEALA225). This conserved secondary structure is an EX7E motif, first reported using hydrophobic cluster analysis60–62. The EX7E motif in ORF1 of E. coli K15 perfectly aligns with S224SHEGGPANIPEALA238 in Cps5A of A. pleuropneumoniae (Supplementary Fig. S4). Several other retaining glycosyltransferases belonging to GT4 families carry this conserved motif (for example: E. coli lipopolysaccharide α-glucosyltranferase WaaG; Mycobacterium smegmatis α-mannosyltransferase, PimA and Corynebacterium glutamicum glycosyltransferase, MshA) (Supplementary Fig. S4).
The K15 structure reported previously contains a β-KDO in the repeat unit, thus predicting the presence of β-KDO transferase. Our analysis of the gene cluster did not find evidence of a sequence belonging to the β-KDO transferase family GT-99. The uncommon presence of α-KDOp in the capsular polysaccharide made it difficult to use the same approach we employed in identifying α-GlcpNAc transferase.
Bioinformatic analysis using CDD, Pfam, dbCAN and InterPro did not generate any hits for ORFs 2,3, and 4. As a result, the second capsule specific glycosyl transferase (α-KDOp transferase) was identified using the Phyre2 web portal for protein structure modeling and functional analysis. This tool generated hits for ORF4 (GenBank accession number: NDK77128.1) with 100 and 99.5% confidence to Maf (Motility associated factor) glycosyltransferase, from Magnetospirillum magneticum AMB-1, and α-2,3/8-sialyltransferase CstII from Campylobacter jejuni, respectively. Though the sequence identity was only 19%, models generated with high confidence have been shown to be predictive of structural features and function63. Maf glycosyltransferase belongs to a class of bacterial glycosyltransferases involved in transfer of nonulosonic acids like moieties to an acceptor64. The Maf central domain has been shown to exhibit similarity to a sialyltransferase for C. jejuni64. Taking all these findings into account, we predict ORF4 to be an α-KDOp transferase with no structural homology to currently documented glycosyltransferases. In summary, we have identified genes in the K15 gene cluster that potentially encode an O-acetyltransferase, an N-acetylglucosamine transferase, and a KDO transferase consistent with the structure we report. The β-KDO transferases encoded by CPS gene clusters have been reported to belong to the glycosyltransferase family GT99 and have a different fold than KDO transferases associated with LPS biosynthesis14. The relationship of the α-KDO transferases of E. coli K6, K15, and K16 to these two different families of glycosyltransferases awaits further structural and function analysis.
Methods
Growth of the bacteria and preparation of polysaccharide
Escherichia coli strain F8316/41 (O6:K15:H16) was obtained from Drs. F. and I. Orskov at Staten Serum Institute, Copenhagen, Denmark. For the isolation of polysaccharide, bacteria were grown on a low molecular weight medium and the polysaccharide was purified by published methods65. During isolation, volumes were kept to a minimum to improve yields. Carboxylate reduced polysaccharide was prepared by the carbodiimide-borohydride method of Taylor66.
NMR spectroscopy
20 mg of K15 polysaccharide were dissolved in 0.6 mL of 99.9% D2O. The resulting solution was approximately neutral. De-O-acetylation of K15 was achieved by increasing the solution pD to ~ 12 by addition of NaOH from a stock solution. NMR experiments were run on a Bruker 700 MHz NMR instrument equipped with a triple gradient TCI cryoprobe. The temperature was set to 50 °C to reduce the line widths. The internal reference standard was ~ 0.1 (wt %) DSS-d6. NMR spectra were obtained using Bruker’s TopSpin 3.5 software (www.bruker.com), analysis was performed with both Topspin and Mnova 11 (www.mestrelab.com). A spin-lock time of 60 ms was used in the HSQC-TOCSY experiment. For the evolution of long-range couplings in the 1H,13C HMBC experiment a JH,C coupling constant of 6 Hz was used.
Other analytical methods
Sugar components were detected by paper chromatography, paper electrophoresis, gas chromatography and automated sugar analysis. Thin layer chromatography was performed on TLC-cellulose plates in butanol-pyridine-water, 6/4/3, v/v/v67. Hexosamine was released by hydrolysis in 4 N HCl. KDO was quantified in hydrolysates by the thiobarbituric acid assay. Alditol acetates were prepared and analyzed as described previously65.
Glycosyl composition
Glycosyl composition analysis was performed by combined gas chromatography/mass spectrometry (GC/MS) of the per-O-trimethylsilyl (TMS) derivatives of the monosaccharide methyl glycosides produced from the sample by acidic methanolysis as described previously by Santander et al.68. Briefly, the samples (230 and 240 μg) were heated with methanolic HCl in a sealed screw-top glass test tube for 17 h at 80 °C. After cooling and removal of the solvent under a stream of nitrogen, the samples were treated with a mixture of methanol, pyridine, and acetic anhydride for 30 min to re–N-acetylate the hexosamines. The solvents were evaporated, and the samples were derivatized with Tri-Sil (Pierce) at 80 °C for 30 min. GC/MS analysis of the TMS methyl glycosides was performed on an Agilent 7890A GC interfaced to a 5975C MSD, using a Supelco Equity-1 fused silica capillary column (30 m 0.25 mm ID).
Degradation procedures
Oligosaccharides were prepared by mild acid hydrolysis in 1% acetic acid at 100 °C for 1 h. The resulting disaccharide was purified by gel filtration on Bio-Gel P-2 in 0.1 M ammonium acetate and subsequent paper electrophoresis in pyridine acetate, pH 5.4. K15 was oxidized with excess sodium metaperiodate for 40 h at 4 °C and desalted after quenching. Oxidized polysaccharide was reduced with sodium borohydride at pH 7.0 on a pH stat. Beta elimination of periodate oxidized polysaccharide was performed as follows. Oxidized polysaccharide was dialyzed, lyophilized, and then treated with 0.1 N sodium hydroxide at 37 °C for 2 h. The reaction mixture was applied to a TLC cellulose plate in butanol-pyridine-water, 6/4/3, v/v/v.
Hexosamine released by acid hydrolysis was degraded to a pentose by ninhydrin degradation in pyridine as described previously20. The products were identified by automated sugar analysis.
Glycosyl linkage analysis
For glycosyl linkage analysis, the samples were permethylated, reduced, hydrolyzed under mild conditions, reduced, hydrolyzed again and acetylated; and the resultant partially methylated alditol acetates (PMAAs) analyzed by gas chromatography-mass spectrometry (GC–MS). The procedure is a slight modification of the one described by Willis et al.69.
About 500 μg of the samples were used for linkage analysis. The samples were suspended in 200 μl of dimethyl sulfoxide and left to stir for 1 day. Permethylation of the sample was affected by two rounds of treatment with sodium hydroxide (15 min) and methyl iodide (45 min). The permethylated sample carboxylic acids were reduced by adding 200 μl of a 5 mg/ml solution of LiBD4 in 90% THF and reacting overnight at room temperature, followed by 1 h at 100 °C. The samples were then hydrolyzed using 0.1 M TFA (0.5 h in sealed tube at 100 °C), reduced with NaBD4, hydrolyzed again using 2 M TFA (2 h in sealed tube at 100 °C) and acetylated using acetic anhydride/TFA. The resulting PMAAs were analyzed on an Agilent 7890A GC interfaced to a 5975C MSD (mass selective detector, electron impact ionization mode); separation was performed on a 30 m Supelco SP-2331 bonded phase fused silica capillary column.
Whole genome sequencing and capsule cluster analysis
The chromosomal DNA isolated from E. coli str. F8316/41 was sequenced using single-molecule real-time (SMRT) sequencing technology34. The raw sequencing data was processed using Canu v. 1.570. Canu is a long read single-molecule sequence de novo assembly tool for PacBio and/or Nanopore reads. This process resulted in five contiguous DNA segments with base lengths as follows: 2,558,639, 2,393,415, 99,428, 88,757, and 29,847 bases (GenBank accession numbers: JAADZB010000001 – JAADZB010000005). The second contiguous DNA segment with 2,393,415 bases was amplified in 20 different regions using forward and reverse primers (see Supplementary Information) to perform Sanger sequencing. The contiguous DNA sequences were annotated using NCBI’s Prokaryotic Genome Annotation Pipeline (PGAP) and submitted to GenBank71. The K15 capsular gene cluster and the pheV-PAI genetic region were analyzed and compared with genomes of other E. coli K15 strains using BLASTN38 and BLASTP53. The ORFs in the K15 gene cluster were generated using ORFfinder (https://www.ncbi.nlm.nih.gov/orffinder/). A wide array of bioinformatic resources like the pfam database52, CDD search72, InterPro classification of protein families54, CATH/Gene3D v4.273, Phyre2 protein fold recognition server74 and dbCAN meta server75 for automated CAZyme annotation were used for identification and functional characterization of capsule specific region 2 genes.
Supplementary information
Acknowledgements
Permethylation and composition analysis was performed by the Analytical Group at CCRC at the University of Georgia. The work performed at CCRC was supported by the Chemical Sciences, Geosciences and Biosciences Division, Office of Basic Energy Sciences, U.S. Department of Energy grant (DE-SC0015662) to Parastoo Azadi at the Complex Carbohydrate Research Center. The PacBio single-molecule real-time (SMRT) sequencing was performed at Biomedical Research Core Facilities DNA sequencing core, University of Michigan and the raw data was processed using Canu v. 1.5 with the help of Luis Santana-Quintero and Vahan Simonyan at High-performance Integrated Virtual Environment (HIVE) Laboratory, Center for Biologics Evaluation and Research, U.S. FDA.
Author contributions
H.F.A. and S. F, performed the NMR analysis, V.V. did the genetic characterization, S.F. V.V and W.V. were responsible for performing and procuring the chemical analysis, F.L. purified K15 polysaccharide, H.F.A., V.V., D.I.F., and W.F.V. prepared the manuscript. H.F.A. and V.V. made equivalent contributions to this manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
is available for this paper at 10.1038/s41598-020-69476-z.
References
- 1.Holst, O., Moran, A.P. & Brennan, P.J. in Microbial Glycobiology: Structures, Relevance and Applications. (ed. A.P. Moran) 3–13 (Academic Press, London; 2009).
- 2.Jann K, Jann B. The K antigens of Escherichia coli. Prog. Allergy. 1983;33:53–79. doi: 10.1159/000407421. [DOI] [PubMed] [Google Scholar]
- 3.Willis LM, Whitfield C. Structure, biosynthesis, and function of bacterial capsular polysaccharides synthesized by ABC transporter-dependent pathways. Carbohydr. Res. 2013;378:35–44. doi: 10.1016/j.carres.2013.05.007. [DOI] [PubMed] [Google Scholar]
- 4.Lindberg B. Components of bacterial polysaccharides. Adv. Carbohydr. Chem. Biochem. 1990;48:279–318. doi: 10.1016/S0065-2318(08)60033-5. [DOI] [PubMed] [Google Scholar]
- 5.Jann B, Hofmann P, Jann K. Structure of the 3-deoxy-D-manno-octulosonic acid-(KDO)-containing capsular polysaccharide (K14 antigen) from Escherichia coli 06:K14:H31. Carbohydr. Res. 1983;120:131–141. doi: 10.1016/0008-6215(83)88012-4. [DOI] [PubMed] [Google Scholar]
- 6.Jann B, Jann K. Structure and biosynthesis of the capsular antigens of Escherichia coli. Curr. Top. Microbiol. Immunol. 1990;150:19–42. doi: 10.1007/978-3-642-74694-9_2. [DOI] [PubMed] [Google Scholar]
- 7.Jann K, Jann B. Polysaccharide antigens of Escherichia coli. Rev. Infect. Dis. 1987;9:S517–S526. doi: 10.1093/clinids/9.Supplement_5.S517. [DOI] [PubMed] [Google Scholar]
- 8.Schmidt MA, Jann K. Structure of the 2-keto-3-deoxy-D-manno-octonic-acid-containing capsular polysaccharide (K12 antigen) of the urinary-tract-infective Escherichia coli O4:K12:H. Eur. J. Biochem. 1983;131:509–517. doi: 10.1111/j.1432-1033.1983.tb07291.x. [DOI] [PubMed] [Google Scholar]
- 9.Griffiths AJ, Davies DB. Type-specific carbohydrate antigens of pathogenic bacteria. Part 1: enterobacteriaceae. Carbohydr. Polym. 1990;14:241–279. doi: 10.1016/0144-8617(91)90079-R. [DOI] [Google Scholar]
- 10.Jennings HJ, Rosell KG, Johnson KG. Structure of the 3-deoxy-D-manno-octulosonic acid-containing polysaccharide (K6 antigen) from Escherichia coli LP 1092. Carbohydr. Res. 1982;105:45–56. doi: 10.1016/S0008-6215(00)81853-4. [DOI] [PubMed] [Google Scholar]
- 11.Birnbaum GI, Roy R, Brisson JR, Jennings HJ. Conformations of ammonium 3-deoxy-D-manno-2-octulosonate (KDO) and methyl α- and β-ketopyranosides of KDO: x-ray structure and proton NMR analyses. J. Carbohydr. Chem. 1987;6:17–39. doi: 10.1080/07328308708058858. [DOI] [Google Scholar]
- 12.Neszmelyi, A., Jann, K., Messner, P. & Unger, F. Constitutional and configurational assignments by carbon-13 NMR spectroscopy of Escherichia coli capsular polysaccharides containing ribose and 3-deoxy-D-manno-2-octulosonic acid (KDO). J. Chem. Soc. Chem. Commun., 1017–1019 (1982).
- 13.Lenter M, Jann B, Jann K. Structure of the K16 antigen from Escherichia coli O7:K16:H-, a Kdo-containing capsular polysaccharide. Carbohydr. Res. 1990;197:197–204. doi: 10.1016/0008-6215(90)84142-H. [DOI] [PubMed] [Google Scholar]
- 14.Ovchinnikova OG, et al. Bacterial beta-Kdo glycosyltransferases represent a new glycosyltransferase family (GT99) Proc. Natl. Acad. Sci. USA. 2016;113:E3120–3129. doi: 10.1073/pnas.1603146113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Binsztein N, et al. Colonization factors of enterotoxigenic Escherichia coli isolated from children with diarrhea in Argentina. J. Clin. Microbiol. 1991;29:1893–1898. doi: 10.1128/jcm.29.9.1893-1898.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Blum G, Marre R, Hacker J. Properties of Escherichia coli strains of serotype O6. Infection. 1995;23:234–236. doi: 10.1007/BF01781204. [DOI] [PubMed] [Google Scholar]
- 17.Orskov F, et al. Special Escherichia coli serotypes among enterotoxigenic strains from diarrhoea in adults and children. Med. Microbiol. Immunol. 1976;162:73–80. doi: 10.1007/BF02121318. [DOI] [PubMed] [Google Scholar]
- 18.Orskov I, Orskov F. Special O:K: H serotypes among enterotoxigenic E. coli strains from diarrhea in adults and children. Occurrence of the CF (colonization factor) antigen and of hemagglutinating abilities. Med. Microbiol. Immunol. 1977;163:99–110. doi: 10.1007/BF02121825. [DOI] [PubMed] [Google Scholar]
- 19.Schneider G, et al. The pathogenicity island-associated K15 capsule determinant exhibits a novel genetic structure and correlates with virulence in uropathogenic Escherichia coli strain 536. Infect. Immun. 2004;72:5993–6001. doi: 10.1128/IAI.72.10.5993-6001.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Stoffyn PJ, Jeanloz RW. Identification of amino sugars by paper chromatography. Arch. Biochem. Biophys. 1954;52:373–379. doi: 10.1016/0003-9861(54)90137-X. [DOI] [PubMed] [Google Scholar]
- 21.Jennings HJ, Smith ICP. 3 Polysaccharide structures using carbon-13 nuclear magnetic resonance. Methods Enzymol. 1978;50:39–50. doi: 10.1016/0076-6879(78)50005-0. [DOI] [PubMed] [Google Scholar]
- 22.Lerner L, Bax A. Sensitivity-enhanced two-dimensional heteronuclear relayed coherence transfer nmr-spectroscopy. J. Magn. Reson. 1986;69:375–380. [Google Scholar]
- 23.Lerner L, Bax A. Application of new, high-sensitivity 1H–13C-N.M.R.- spectral techniques to the study of oligosaccharides. Carbohydr. Res. 1987;166:35–46. doi: 10.1016/0008-6215(87)80042-3. [DOI] [PubMed] [Google Scholar]
- 24.Bhattacharjee AK, Jennings HJ, Kenny CP. Structural elucidation of the 3-deoxy-D-manno-octulosonic acid containing meningococcal 29-e capsular polysaccharide antigen using carbon-13 nuclear magnetic resonance. Biochemistry. 1978;17:645–651. doi: 10.1021/bi00597a013. [DOI] [PubMed] [Google Scholar]
- 25.Jennings HJ, Rosell KG, Johnson KG. Structure of the 3-deoxy-D-manno-octulosonic acid-containing polysaccharide (K6 antigen) from Escherichia coli LP 1092. Carbohydr Res. 1982;105:45–56. doi: 10.1016/S0008-6215(00)81853-4. [DOI] [PubMed] [Google Scholar]
- 26.Kogler, H. 0. W. Sorensen, G. Bodenhausen, and RR Ernst. J. Magn. Reson.55, 1 (1983). [DOI] [PubMed]
- 27.Bax A, Summers MF. Proton and carbon-13 assignments from sensitivity-enhanced detection of heteronuclear multiple-bond connectivity by 2D multiple quantum NMR. J. Am. Chem. Soc. 1986;108:2093–2094. doi: 10.1021/ja00268a061. [DOI] [Google Scholar]
- 28.Birnbaum GI, Roy R, Brisson J-R, Jennings HJ. Conformations of ammonium 3-Deoxy-D-manno-2-octulosonate (KDO) and methyl α- and β-ketopyranosides of KDO: x-ray structure and 1H NMR analyses. J. Carbohydr. Chem. 1987;6:17–39. doi: 10.1080/07328308708058858. [DOI] [Google Scholar]
- 29.Kohlbrenner WE, Fesik SW. Determination of the anomeric specificity of the Escherichia coli CTP:CMP-3-deoxy-D-manno-octulosonate cytidylyltransferase by 13C NMR spectroscopy. J. Biol. Chem. 1985;260:14695–14700. doi: 10.1016/S0021-9258(17)38627-1. [DOI] [PubMed] [Google Scholar]
- 30.Bush CA. HIgh resolution NMR in the determination of structure in complex carbohydrates. Bull. Magn. Reson. 1988;10:73–95. [Google Scholar]
- 31.Pomin VH. Unravelling Glycobiology by NMR Spectroscopy. London: Intech; 2012. pp. 63–98. [Google Scholar]
- 32.Yu BEA. More accurate 1JCH coupling measurement in the presence of 3JHH strong coupling in natural abundance. J. Magn. Reson. 2012;215:10–22. doi: 10.1016/j.jmr.2011.09.037. [DOI] [PubMed] [Google Scholar]
- 33.Shashkov AS, Arbatsky NP, Knirel YA. Structures and genetics of Kdo-containing O-antigens of Cronobacter sakazakii G2706 and G2704, the reference strains of serotypes O5 and O6. Carbohyd. Res. 2011;346:1924–1929. doi: 10.1016/j.carres.2011.05.014. [DOI] [PubMed] [Google Scholar]
- 34.Eid J, et al. Real-time DNA sequencing from single polymerase molecules. Science. 2009;323:133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
- 35.Oerskov I, Sharma V, Oerskov F. Genetic mapping of the K1 and K4 antigens (L) of Escherichia coli. Non-allelism of K(L) antigens with K antigens of O8:K27(A), O8:K8(L) and O9:K57(B) Acta Pathol. Microbiol. Scand. Sect. B Microbiol. 1976;84B:125–131. [PubMed] [Google Scholar]
- 36.Orskov I, Nyman K. Genetic mapping of the antigenic determinants of two polysaccharide K antigens, K10 and K54, in Escherichia coli. J. Bacteriol. 1974;120:43–51. doi: 10.1128/jb.120.1.43-51.1974. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vimr ER. Map position and genomic organization of the kps cluster for polysialic acid synthesis in Escherichia coli K1. J. Bacteriol. 1991;173:1335–1338. doi: 10.1128/jb.173.3.1335-1338.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Camacho C, et al. Blast+: architecture and applications. BMC Bioinf. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dobrindt U, et al. Analysis of genome plasticity in pathogenic and commensal Escherichia coli isolates by use of DNA arrays. J. Bacteriol. 2003;185:1831–1840. doi: 10.1128/JB.185.6.1831-1840.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Beatson SA, et al. Molecular analysis of asymptomatic bacteriuria Escherichia coli strain VR50 reveals adaptation to the urinary tract by gene acquisition. Infect. Immun. 2015;83:1749–1764. doi: 10.1128/IAI.02810-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Brown JM, Shaw KJ. A novel family of Escherichia coli toxin-antitoxin gene pairs. J. Bacteriol. 2003;185:6600–6608. doi: 10.1128/JB.185.22.6600-6608.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Patrick M, Gray MD, Sandkvist M, Johnson TL. Type II secretion in Escherichia coli. EcoSal Plus. 2014;4:1–18. doi: 10.1128/ecosalplus.4.3.4. [DOI] [PubMed] [Google Scholar]
- 43.Roberts IS. The biochemistry and genetics of capsular polysaccharide production in bacteria. Annu. Rev. Microbiol. 1996;50:285–315. doi: 10.1146/annurev.micro.50.1.285. [DOI] [PubMed] [Google Scholar]
- 44.Bailey MJA, Hughes C, Koronakis V. RfaH and the ops element, components of a novel system controlling bacterial transcription elongation. Mol. Microbiol. 1997;26:845–851. doi: 10.1046/j.1365-2958.1997.6432014.x. [DOI] [PubMed] [Google Scholar]
- 45.Muehlenhoff, M. & Bergfeld, A.K. 209–235 (Bentham Science Publishers Ltd., 2014).
- 46.Raetz CRH, Roderick SL. A left-handed parallel β helix in the structure of UDP-N-acetylglucosamine acyltransferase. Science (Washington, D. C.) 1995;270:997–1000. doi: 10.1126/science.270.5238.997. [DOI] [PubMed] [Google Scholar]
- 47.Bergfeld AK, et al. The polysialic acid-specific O-acetyltransferase OatC from Neisseria meningitidis serogroup C evolved apart from other bacterial sialate O-acetyltransferases. J. Biol. Chem. 2009;284:6–16. doi: 10.1074/jbc.M807518200. [DOI] [PubMed] [Google Scholar]
- 48.Lee HJ, et al. Structural and kinetic characterizations of the polysialic acid O-acetyltransferase OatWY from Neisseria meningitidis. J. Biol. Chem. 2009;284:24501–24511. doi: 10.1074/jbc.M109.006049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Schulz EC, Bergfeld AK, Ficner R, Muehlenhoff M. Crystal structure analysis of the polysialic acid specific O-acetyltransferase NeuO. PLoS ONE. 2011;6:e17403. doi: 10.1371/journal.pone.0017403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Lewis AL, Hensler ME, Varki A, Nizet V. The group B streptococcal sialic acid O-acetyltransferase is encoded by neuD, a conserved component of bacterial sialic acid biosynthetic gene clusters. J. Biol. Chem. 2006;281:11186–11192. doi: 10.1074/jbc.M513772200. [DOI] [PubMed] [Google Scholar]
- 51.Gudlavalleti SK, et al. The Neisseria meningitidis serogroup A capsular polysaccharide O-3 and O-4 acetyltransferase. J. Biol. Chem. 2004;279:42765–42773. doi: 10.1074/jbc.M313552200. [DOI] [PubMed] [Google Scholar]
- 52.El-Gebali S, et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019;47:D427–D432. doi: 10.1093/nar/gky995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gish W, States DJ. Identification of protein coding regions by database similarity search. Nat. Genet. 1993;3:266–272. doi: 10.1038/ng0393-266. [DOI] [PubMed] [Google Scholar]
- 54.Mitchell AL, et al. InterPro in 2019: improving coverage, classification and access to protein sequence annotations. Nucl. Acids Res. 2019;47:D351–D360. doi: 10.1093/nar/gky1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Mirza IA, Nazi I, Korczynska M, Wright GD, Berghuis AM. Crystal structure of homoserine transacetylase from Haemophilus influenzae reveals a new family of α/β-hydrolases. Biochemistry. 2005;44:15768–15773. doi: 10.1021/bi051951y. [DOI] [PubMed] [Google Scholar]
- 56.Wang M, et al. Crystal structure of homoserine O-acetyltransferase from Leptospira interrogans. Biochem. Biophys. Res. Commun. 2007;363:1050–1056. doi: 10.1016/j.bbrc.2007.08.153. [DOI] [PubMed] [Google Scholar]
- 57.Altman E, Brisson JR, Perry MB. Structure of the capsular polysaccharide of Haemophilus pleuropneumoniae serotype 5. Eur. J. Biochem. 1987;170:185–192. doi: 10.1111/j.1432-1033.1987.tb13685.x. [DOI] [PubMed] [Google Scholar]
- 58.Ward CK, Inzana TJ. Identification and characterization of a DNA region involved in the export of capsular polysaccharide by Actinobacillus pleuropneumoniae serotype 5a. Infect. Immun. 1997;65:2491–2496. doi: 10.1128/iai.65.6.2491-2496.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Ward CK, Lawrence ML, Veit HP, Inzana TJ. Cloning and mutagenesis of a serotype-specific DNA region involved in encapsulation and virulence of Actinobacillus pleuropneumoniae serotype 5a: concomitant expression of serotype 5a and 1 capsular polysaccharides in recombinant A. pleuropneumoniae serotype 1. Infect. Immun. 1998;66:3326–3336. doi: 10.1128/IAI.66.7.3326-3336.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Amor PA, Whitfield C. Molecular and functional analysis of genes required for expression of group IB K antigens in Escherichia coli: characterization of the his-region containing gene clusters for multiple cell-surface polysaccharides. Mol. Microbiol. 1997;26:145–161. doi: 10.1046/j.1365-2958.1997.5631930.x. [DOI] [PubMed] [Google Scholar]
- 61.Geremia RA, Petroni EA, Ielpi L, Henrissat B. Towards a classification of glycosyltransferases based on amino acid sequence similarities: prokaryotic α-mannosyltransferases. Biochem. J. 1996;318:133–138. doi: 10.1042/bj3180133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Saxena IM, Brown RM, Jr, Fevre M, Geremia RA, Henrissat B. Multidomain architecture of β-glycosyl transferases: implications for mechanism of action. J. Bacteriol. 1995;177:1419–1424. doi: 10.1128/jb.177.6.1419-1424.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Sulzenbacher G, et al. Glycosylate and move! The glycosyltransferase Maf is involved in bacterial flagella formation. Environ. Microbiol. 2018;20:228–240. doi: 10.1111/1462-2920.13975. [DOI] [PubMed] [Google Scholar]
- 65.Vann WF, Jann K. Structure and serological specificity of the K13-antigenic polysaccharide (K13 antigen) of urinary tract-infective Escherichia coli. Infect. Immun. 1979;25:85–92. doi: 10.1128/iai.25.1.85-92.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Taylor RL, Conrad HE. Stoichiometric depolymerization of polyuronides and glycosaminoglycuronans to monosaccharides following reduction of their carbodiimide-activated carboxyl groups. Biochemistry. 1972;11:1383–1388. doi: 10.1021/bi00758a009. [DOI] [PubMed] [Google Scholar]
- 67.Vann WF, et al. Serological, chemical, and structural analyses of the Escherichia coli cross-reactive capsular polysaccharides K13, K20, and K23. Infect. Immun. 1983;39:623–629. doi: 10.1128/iai.39.2.623-629.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Santander J, et al. Mechanisms of intrinsic resistance to antimicrobial peptides of Edwardsiella ictaluri and its influence on fish gut inflammation and virulence. Microbiology. 2013;159:1471–1486. doi: 10.1099/mic.0.066639-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Willis LM, et al. Conserved glycolipid termini in capsular polysaccharides synthesized by ATP-binding cassette transporter-dependent pathways in gram-negative pathogens. Proc. Natl. Acad. Sci. USA. 2013;110:7868–7873. doi: 10.1073/pnas.1222317110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Koren S, et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Tatusova T, et al. NCBI prokaryotic genome annotation pipeline. Nucl. Acids Res. 2016;44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Marchler-Bauer A, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucl. Acids Res. 2017;45:D200–D203. doi: 10.1093/nar/gkw1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Lewis TE, et al. Gene3D: Extensive prediction of globular domains in proteins. Nucl. Acids Res. 2018;46:D435–D439. doi: 10.1093/nar/gkx1069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Zhang H, et al. DbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucl. Acids Res. 2018;46:W95–W101. doi: 10.1093/nar/gky418. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.