Abstract

Despite their many important physiological functions, past work on the diverse sequences of human milk oligosaccharides (HMOs) has been focused mainly on the highly abundant HMOs with a relatively low degree of polymerization (DP) due to the lack of efficient methods for separation/purification and high-sensitivity sequencing of large-sized HMOs with DP ≥ 10. Here we established an ultrahigh-temperature preparative HPLC based on a porous graphitized carbon column at up to 145 °C to overcome the anomeric α/β splitting problem and developed further the negative-ion ESI-CID-MS/MS into multistage MSn using a combined product-ion scanning of singly charged molecular ion and doubly charged fragment ion of the branching Gal and adjacent GlcNAc residues. The separation and sequencing method allows efficient separation of a neutral fraction with DP ≥ 10 into 70 components, among which 17 isomeric difucosylated nona- and decasaccharides were further purified and sequenced. As a result, novel branched difucosyl heptaose and octaose backbones were unambiguously identified in addition to the conventional linear and branched octaose backbones. The novel structures of difucosylated DF-novo-heptaose, DF-novo-LNO I, and DF-novo-LNnO I were corroborated by NMR. The various fucose-containing Lewis epitopes identified on different backbones were confirmed by oligosaccharide microarray analysis.
Human milk oligosaccharides (HMOs) make up the third most abundant component and consist of a lactose core decorated with N-acetylglucosamine (GlcNAc), d-galactose (Gal), l-fucose (Fuc), and sialic acid (NeuAc). Fucosylated HMOs are most abundant and form the blood group ABH(O) and Lewis (Le) epitopes.1,2 HMOs have been identified as decoy receptors preventing infections by many pathogens and regulate cell surface receptors triggering immune responses.3−5 HMOs also play important roles in regulation of gut microbiota,6,7 promote the survival of beneficial Bifidobacterium species, and suppress potentially harmful or pathogenic bacteria.8,9
Despite their many important physiological functions, past work has been focused mainly on the high-abundance HMOs with a relatively low degree of polymerization (DP) due to the lack of efficient methods for separation/purification and high-sensitivity sequencing. Separation and sequence assignment of the large sized HMOs (e.g., DP ≥ 10) have been difficult10,11 because of their hugely diverse and isomeric structural features, particularly the multiply fucosylated structures carrying different recognition motifs on backbones with different branching patterns,12,13 in addition to their low abundance (<1% of total HMOs). The slow progress in method development to tackle the problem of high sequence complexity of HMOs has hindered the in-depth understanding of the structure–function relationships of different types of HMOs.
In the recent past, HPLC with a porous graphitic carbon (PGC) column has been widely used for separation/purification of different types of carbohydrate molecules because of its high resolution. However, α/β anomeric forms of reducing glycans are a unique feature of naturally derived sugars and can cause considerable problems for separation/purification. The α/β splitting increases the number of eluted peaks.14 In many cases, one component’s α can be merged with another component’s β, and this has been an obstacle for HPLC separation. For analytical LC-MS using single-ion monitoring, this may not be a major problem, but for preparative fractionation to isolate individual oligosaccharides, the splitting problem is a big headache in chromatography. Although chemical reduction can eliminate the α/β splitting,15 it is not ideal for subsequent mass spectrometry (MS) sequencing as some of the well-established methods for neutral sugars require reducing termini, as some characteristic fragmentations observed for typing of blood group and Lewis antigens,16 branching pattern,17 and partial linkage18 analysis can only be obtained for reducing sugars with the hemiacetal functionality but not for reduced alditols.19 Moreover, the alditols cannot be used for reducing terminal tagging, a strategy conventionally employed for high-sensitivity HPLC and activity detection by glycan microarrays after their conversion into glycan probes.20 As PGC is stable under some extreme conditions (e.g., high and low pH, and high temperature) without any loss in performance,21,22 ammonium hydroxide has been added to alleviate the anomeric problem.23 Recently, ultrahigh column temperature has been attempted to circumvent the α/β splitting by acceleration of the interconversion between α and β anomers, and satisfactory results were obtained for N-glycans using analytical scale UPLC-MS.24 However, preparative scale HPLC at a high temperature has not been attempted. Preparative HPLC is important for the isolation and preparation of reducing glycans from natural sources for functional glycomics.
For glycan structural analysis, MS has become the major player.16,17,25−27 Negative-ion electrospray tandem MS with collision-induced dissociation (ESI-CID-MS/MS) has been developed for the high-sensitivity sequencing of HMOs. However, for multifucosylated and highly branched HMOs with high DPs, MS2 is not sufficient while quasi-MS3 is not always achievable25 for assignment of branched sequences. The lack of direct sequence information on the 3-branch remains a problem.17
In the present work, we aimed to develop methods to tackle both separation and sequencing problems. A preparative-scale PGC-HPLC using ultrahigh temperature (UHT) is developed for separation and purification of isomeric multifucosylated HMOs in the deca- and dodecasaccharide range, and off-line negative-ion multistage product-ion scanning ESI-CID-MSn is established for branching pattern analysis and detailed assignment of the difucosylated neutral HMOs on different backbones. As a result, novel heptaose and octaose backbones with different fucosylation patterns were identified. The novel structural features were corroborated by NMR and peripheral epitopes, which were corroborated by microarray analysis.
Experimental Section
Materials
All solvents were of HPLC grade, and other reagents used were of analytical grade or higher. Human breast milk samples (∼20 L) were collected from healthy mothers at different lactation periods and stored at −20 °C before use. Oligosaccharide standards lactose-N-fucopentaose (LNFP) I, II, III, 3′-sialyllactose (3′-SL) and 6′-sialyllactose (6′-SL) were purchased from Dextra Laboratories (Reading, England). All the carbohydrate-binding proteins were purchased from commercial sources as listed in Supplementary Methods.
Fucosylated Decasaccharide Fraction from Human Milk
Pretreatment of HMOs was as described previously.28 The total HMOs was subjected to preparative HILIC on a Click TE-GSH column (100 × 250 mm, 5 μm, Acchrom) to remove the large amount of lactose and separate neutral from sialylated oligosaccharides.29 The pooled fraction F5 containing neutral HMOs with DP ≥ 10 was then further fractionated on a preparative amide column (100 × 250 mm, 5 μm, Acchrom) to obtain the “DP10” fraction (Supplementary Methods) which was then subfractionated as described below.
Preparative-Scale Ultrahigh-Temperature PGC-HPLC of HMOs
A preparative-scale UHT HPLC system was built based on a Waters Alliance HPLC system with a UV photodiode array detector 2998. A purpose built high-temperature oven, postcolumn cooling device, and high-temperature and cooling controllers were added into the system as modular components (Figure S1a). The high-temperature oven contains an aluminum shell heater and insulation plates wrapped by aluminum alloy heat sink with a maximum operating temperature of at least 180 °C (Figure S1b). The postcolumn cooling device consists of a Peltier thermoelectrical cooler module, an aluminum alloy heat sink, and a fan (Figure S1c). For fast reaching and better maintaining of column temperature, a column with shorter diameter is always preferable, although the oven can hold columns with larger diameters.
Initial separation of “DP10” fraction was carried out at 105 °C on a PGC column (4.6 × 150 mm, 5 μm) with the following solvent gradient: 0–120 min, 10/90 to 25/75 (CH3CN/H2O); 120–180 min, 25/75 to 50/50 (CH3CN/H2O), at a flow rate of 1.0 mL/min. The injection volume was 20 μL at a flow rate of 250 μg/μL (5 mg). Seventy fractions were manually collected based on 195 nm detection and concentrated and dried by lyophilization.
Twenty-four fractions containing difucosylated nona- and decasaccharides were further purified on the same column at a flow rate of 0.5 mL/min with an optimized gradient. For some selected fractions, a PGC HT column (3.0 × 100 mm, 3 μm) was used at a column temperature of 145 °C and a flow rate of 0.4 mL/min. The elution was by CH3CN/H2O gradient with or without 0.1% formic acid and monitored at 195 nm.
The purity of the isolated HMOs was assessed on a PGC HT column at 145 °C and a CH3CN/H2O gradient of 15/85 to 50/50 (v/v) in 70 min at a flow rate of 0.4 mL/min. The stabilities of neutral and sialylated HMOs under the UHT conditions were examined by HPLC and MS analysis using LNFP-I, 3′-SL, and 6′-SL as examples. The test samples were subjected to the 145 °C PGC-HPLC, and the collected samples were analyzed by PGC-HPLC at 40 °C and ESI-MS.
Negative-Ion ESI-MSn
Negative-ion ESI-MS was performed on an Orbitrap ID-X Tribrid mass spectrometer (Thermo Fisher Scientific) equipped with a Vanquish UHPLC system (Thermo Fisher Scientific). The purified samples were dissolved in CH3CN/H2O (1:1) at a concentration of 1 μg/μL, of which 2 μL was injected for MSn. CH3CN/H2O (1:1) was used as the mobile phase, with a flow rate of 0.2 mL/min. The spray voltage was at 3.0 kV with a source temperature of 400 °C, ion transfer tube temperature 300 °C, RF S-lens 50 V, and sheath velocity 40 psi. Higher-energy collisional dissociation was used for the MSn. For optimal fragmentation, normalized collision energy was adjusted to 15–30% . Precursor selection for product-ion scanning was made manually using the Xcalibur software Version 4.2 data system.
NMR Spectroscopy
Oligosaccharide samples were submitted to two cycles of dissolution in D2O followed by lyophilization to reduce H2O content, then taken up in 300 μL of D2O and transferred to a Shigemi tube for NMR spectroscopy. NMR spectra were recorded at 950 MHz (1D 1H and 2D HSQC, HSQC-TOCSY, HMBC, and H2BC) and 700 MHz (for the ROESY spectra) at 25 °C, using Bruker Avance spectrometers. Pulse sequences were supplied by the spectrometer manufacturer. Chemical shifts are relative to acetone at 2.218 ppm for proton and 33.0 ppm for carbon spectra.
Oligosaccharide Microarrays
The HMO probes were prepared by reductive-amination with a fluorescent amino-terminating bifunctional linker BABI (J. Yan, Y. Zheng and colleagues, unpublished) and printed in quadruplicate on NHS glass slides (Schott Nexterion H, Germany) at probe concentrations of 25 and 50 μM. After blocking and washing, the arrays were overlaid with biotinylated plant lectins and anticarbohydrate antibodies before incubation at ambient temperature for 90 min. The binding signals were detected using AlexaFluor-647-labeled streptavidin or the biotinylated anti-Mouse IgG with subsequent AlexaFluor-647-labeled streptavidin. Anti-B antibody was detected with antimouse IgM AlexaFluor-680. The fluorescence binding signals were measured and quantified using GenePix Pro 7 software (Molecular Devices). The detailed experimental conditions are in Supplementary Methods.
Results and Discussion
Optimization of Conditions for PGC-HPLC Separation of HMO “DP10” Fraction
Using a standard HMO mixture containing LNFP I/II/III, we assessed several conditions to suppress the α/β splitting, including alkaline condition of the mobile phase23 and high column temperatures (80 and 110 °C).24 As shown in Figure S2a, α- and β-anomers of LNFP I/II/III were well separated at 30 °C although the α-anomer of LNFP III and β-anomer of LNFP II overlapped. Change to alkaline condition (e.g., 0.4% NH4OH) improved separation of the three isomers, but the peak shape became very broad, which is not ideal for separation of complex mixtures (Figure S2b). Increasing column temperature to 80 °C (Figure S2c) or 105 °C (Figure S2d) can largely eliminate the α/β splitting while maintaining the good resolution, indicating that higher temperature is necessary for separation of more complex and/or higher oligomeric HMOs.
A UHT system for preparative-scale HPLC was then designed, purpose-built (Figure S1), and tested at 145 °C for resolution of isomeric HMOs in the “DP10” fraction. As shown in Figure S3a, a single peak of decasaccharide collected at 30 °C showed a shoulder at 105 °C, while at 145 °C a second peak with baseline resolution was obtained (Figure S3b, sequence assignment in sections below).
Using LNFP I, 3′-SL, and 6′-SL as examples, we examined the stabilities of HMOs under the UHT conditions. We found that the neutral HMOs are stable (Figure S4). Although sialic acid residues are labile, our experiments indicated that under UHT conditions they are reasonably stable at 145 °C in the HPLC elution time scale (Figure S4a). Even under acidic conditions (e.g., with 0.1% formic acid), a majority of the 6-linked 6′-SL can survive the high temperature but 3-linked 3′-SL cannot (Figure S4b). The results obtained from the sialylated oligosaccharides are different from a previous publication.24
Preparative PGC-HPLC at Ultrahigh Temperature for Fractionation and Purification of Human Milk Nona- and Decasaccharides
After removal of protein and lipid, the total HMOs was subjected to the first dimension HPLC using a Click-TE GSH column with mixed hydrophilic and charge exchange mechanism29 to separate the sialylated (F1, Figure S5a) HMOs and the large amount of lactose (F2) and to obtain 3 neutral fractions F3–F5 with increasing oligomeric size. F5 containing the largest HMOs was then further fractionated into 8 pools primarily based on DPs (DP5 to DP12–14, Figure S5b) using an amide HILIC as the second dimension HPLC.
Based on the assessment of PGC-HPLC conditions shown above, UHT was used for the third dimension PGC-HPLC for subfractionation of the “DP10” fraction. High-resolution separation was achieved at 105 °C, and a total of 70 fractions were obtained (Figure 1a). Negative-ion ESI-MS analysis (Table S1) showed that these were mainly the HMOs containing octaose backbone with 2–4 Fuc residues (e.g., fractions #9 to #54) and decaose backbone with 0–3 Fuc (e.g., #45 to #70), in addition to some minor fractions with hexaose (#1 to #8) and heptaose backbones (#18).
Figure 1.
Ultrahigh temperature PGC-HPLC fractionation of difucosylated HMOs with different octaose backbones. (a) PGC-HPLC of the “DP10” fraction at 105 °C. (b) Selected 24 difucosylated nona- and decasaccharide fractions were repurified using the same column at either 105 or 145 °C. Fraction numbers in blue: difucosylated HMOs with octaose backbone; fraction numbers in red: novel structures.
Selected 24 difucosylated decasaccharide fractions were repurified using the same column at either 105 or 145 °C (Figure 1b). A total of 16 difucosylated decasaccharides on different octaose backbones and a difucosylated nonasaccharide on a heptaose backbone were selected for detailed sequence analysis after purity assessment using analytical PGC-HPLC at 105 °C (Figure S6).
Sequence Determination of Difucosylated Decasaccharides with Linear and Branched Octaose Backbones by Negative-Ion ESI-MSn
The strategy for determination of linear16 and branched17 backbone sequences and Lewis epitopes16,30 using negative-ion ESI-MSn is illustrated in Figure S7.
Fraction #21a and #21b can be readily deduced as linear sequences DF-para-LNO I (Figure S8a) and DF-para-LNnO I (Figure S8b), respectively, from the complete sets of glycosidic C-type ions C1 to C7. The locations of the two Fuc residues were determined by the 146 Da increment (203 + 146 = 349 Da) between the respective adjacent C-ions (e.g., between C4 and C3: 893–544 = 349, and between C6 and C5: 1404–1055 = 349 Da).16 The Fuc 3-linkage to the internal −GlcNAc1–3 of Lex can be assigned by the double cleavage D ions (D4–3βm/z 729 and D6–5βm/z 1240), which is characteristic for 3-linaked residues.16 The nonreducing terminal Type 1 and Type 2 chains with Gal1–3GlcNAc1– and Gal1–4GclNAc1– sequences, respectively, can be determined by the D2–1 ion at m/z 202 for the 3-linked GlcNAc (Figure S8a) and the 0,2A2 doublet m/z 263/281 (Figure S8b).16 The reducing terminal −4Glc can be assigned by the doubly charged ions 2,4A8 and 0,2A8-h at m/z 803.7 and 824.8, respectively. Thus, the two linear decasaccharides DF-para-LNO I and DF-para-LNnO I were both obtained with two internal Lex (inLex) (Table S2). The former has not been reported previously.
For decasaccharides with a conventional branched octaose backbone, product-ion scanning using the doubly charged molecular ion [M-2H]2– (m/z 863.8) gave clear information on the branching point at the Gal of the reducing terminal lactose by D5–4β (m/z 1037), e.g. in the case of #24b (Figure S9a). As established previously,17 sequence information on both 3- and 6-branches can be obtained from the product-ion spectrum of [M-2H]2–. D4β-3βm/z 364 and 0,2A2α doublet m/z 281/263 indicate a nonreducing terminal Lex and type 2 chain (Gal1–4GlcNAc−),16 respectively. Further MS3 scanning using the branching ion D5–4β (m/z 1037) produced fragment ions only from the 6-branch (Figure S9b), in the product-ion spectrum of which the m/z 281/263 doublet unambiguously identified the type 2 chain on the 6-branch, and therefore, the Lex epitope shown in the spectrum of [M-2H]2– should be on the 3-branch. The internal Lex ion (D4α-3α′m/z 729) (Figure S8a) is also on the 6-branch. Other difucosylated decasaccharide isomers with conventional branched octaose backbones were similarly identified (Table S2).
Determination of Novel Difucosyated Branched Octa- and Heptaose Backbones by Negative-Ion ESI-MSn
The novel backbone sequence of #44c can be assigned by the branching fragment ion D3–2β at m/z = 672 (Figure 2a). However, the feature of both 3- and 6-branched sequences from product-ion scanning of [M-2H]2– is absent. Only the 6-branched sequence containing a single Fuc bearing the Lex epitope with an ion D2α-1αm/z 364 was observed. This is because the branching point is farther from the reducing terminus. For this novel octaose backbone, the branching point is at the second Gal residue from the reducing end, i.e., the internal Gal rather than the lactose Gal. As only the doubly charged ion from a reducing side Glc/GlcNAc residue next to the branching point can produce both 3- and 6-branch information, selection of a doubly charged ion from the reducing side GlcNAc is important. Unfortunately, although some intense singly charged ions from this GlcNAc were present in the MS2 spectrum, only a very weak doubly charged 0,2A4-h ion (m/z 642.2) can be found (Figure 2a) and used as the precursor for further product-ion scanning. Although very weak, the MS3 spectrum (Figure 2b) obtained from it still gave clear sequence information on both 3- and 6- branches. It is apparent that both Lea (D-ion m/z 348) and Lex (D-ion m/z 364) epitopes are present in fraction #44c. Similar to the conventional octaose branch backbones, further MS3 scanning using the singly charged branching ion 0,2A4-h ion (m/z 1285) as the precursor (Figure 2c) gave a spectrum showing that Lex-specific ion (m/z 364) at the 6-branch, and therefore, the Lea epitope (with m/z 348) should be at the 3-branch. Thus, a difucosylated decasaccharide on a novel branched octaose backbone with a Lea and a Lex terminal sequence can be proposed for fraction #44c (Figure 2 and Table S2).
Figure 2.
Negative-ion MSn analysis of fraction #44c (DF-novo-LNO I). (a) MS2 of [M-2H]2–m/z 863.8; (b) MS3 of [0,2A4-h]2–m/z 642.2; (c) MS3 of 0,2A4-h m/z 1285.
Fraction #42b with the same novel octaose backbone (Figure 3) can be deduced in a similar way. However, in this case, both branches contain the same Lex epitopes as only the Lex-specific ion m/z 364 was present in both product-ion spectra of the doubly charged 0,2A4-h ion (m/z 642.2) (Figure 3b) and the singly charged 0,2A4-h ion (m/z 1285) (Figure 3c). An additional two difucosylated HMOs on the novo-octaose backbone were proposed for fractions #52b with a blood group H and a Lex at the 3- and 6-branch, respectively (Figure S10 and Table S2), and #46d with a Ley at the 3-branch (Figure S11 and Table S2).
Figure 3.
Negative-ion MSn analysis of fraction #42b (DF-novo-LNnO I). (a) MS2 of [M-2H]2–m/z 863.8; (b) MS3 of [0,2A4-h]2–m/z 642.2; (c) MS3 of 0,2A4-h m/z 1285.
Fraction #18a gave a molecular mass of 1525.5 with a composition of Fuc2, Hex5, and GlcNAc2 (Table S1), indicating a potentially interesting and unusual heptaose backbone. The product-ion spectrum from the doubly charged [M-2H]2– (Figure 4a) clearly showed a Gal as a single monosaccharide residue at the 3-branch by the double cleavage of D5–4β and its dehydrate ion at m/z 1183 and 1165, respectively. Further demonstration of the 6-linked branch can be obtained by MS3 of D5–4βm/z 1183 (Figure 4b). A terminal Lea epitope and an internal Lex epitope were assigned by the two D-type ions at m/z 348 and 875 (Figure 4b and Table S2), respectively.
Figure 4.
Negative-ion MSn of fraction #18a (DF-novo-Hepta). (a) MS2 of [M-2H]2–m/z 762.3; (b) MS3 of D5–4βm/z 1183.
Validation of Structural Assignment by NMR Spectroscopy
NMR spectra for DF-novo-LNO I (fraction #44c) and DF-novo-LNnO I (fraction #42b) were recorded at 950 MHz and assigned using 2D heteronuclear 1H/13C NMR spectra. The anomeric region of the HSQC spectrum of DF-novo-LNO I is shown in Figure 5a. Heteronuclear spectroscopy can discriminate between signals that are overcrowded in the 1H dimension. Figure 5b shows an expansion of the β-anomeric region of the HSQC (blue) and HSQC-TOCSY (red) spectra of DF-novo-LNO I, overlaid with the HMBC spectrum (green). The HMBC peaks illustrated are inter-residue cross-peaks between H1 of one residue and the carbon immediately across the glycosidic linkage, defining both sequence and linkage positions. 1H and 13C chemical shifts for DF-novo-LNO I are summarized in Table S3a (GlcNAc, Glc, and Gal) and Table S3b (Fuc).
Figure 5.

NMR spectra of fraction #44c (DF-novo-LNO I). (a) The anomeric region (1H 5.3 ppm to 4.3 ppm) of the HSQC spectrum. (b) Expansion of the β-anomeric region (1H 4.8 ppm to 4.3 ppm) of the HSQC (blue) and HSQC-TOCSY (red) spectra, overlaid with the HMBC spectrum (green), to define sequence and linkage of monosaccharide residues. (c) Expansion of the 1H–1H ROESY spectrum for the differential assignment of residues II and IV.
In order to establish whether the 6-branch is attached to the backbone at residue Gal II or Gal IV, further reasoning and experimental work were necessary. As is common for Gal residues, the HSQC-TOCSY spectrum can be traced securely only between C1/H1 and C4/H4 for both Gal II and Gal IV. The two spectra are similar; both residues are substituted at the 3-position, and one of them is also substituted at the 6-position.
H1 of GlcNAc V is linked by an inter-residue HMBC connection to glycosylated C6 of Gal at 71.2 ppm (H, H′ 3.96, 3.82) (Figure 5b). H4 of II (4.142 ppm) and H4 of IV (4.135 ppm) are overlapped but are distinguishable. A faint HSQC-TOCSY cross-peak (not illustrated) links H4 of II to C6 at 63.7 ppm (H6, H6′ 3.78, 3.74 ppm), consistent with a nonsubstituted C6.
Further evidence was sought from 2D ROESY spectroscopy. The spectrum obtained (Figure 5c) showed clear ROESY cross-peaks from H4 of II to H6,6′ at 3.78 and 3.74 ppm, and from H4 of IV to H6 at 3.96 and 3.82 ppm, confirming the position of the 6-branch at residue IV.
Two fucose spin systems are traceable through HSQC-TOCSY, H2BC, and HMBC spectra, as described in Supplementary Results.
NMR assignments for DF-novo-LNnO I (fraction #42b) were obtained in the same manner, as illustrated in Figure S12, Table S3c,d, and Supplementary Results.
The relatively small amount of sample #18a available was insufficient for heteronuclear NMR spectroscopy except for HSQC, but 1H homonuclear 2-dimensional TOCSY and ROESY spectra gave almost complete 1H assignments and clear inter-residue ROESY connectivity corroborating the structures indicated by MS3 (Figure 4). Assignments for each monosaccharide residue are summarized in Table S4. Detailed ROESY interpretation is shown in the Supplementary Results; the Galβ1–3Gal motif is strongly supported by the two well-resolved inter-residue ROE cross peaks from Gal VI H1 to Gal II H3 and H4 (Figure S13).
Oligosaccharide Microarray Analysis of the Peripheral Epitopes
To validate the assignment of the epitopes, the 17 purified HMOs together with 14 controls selected from HMOs were converted to fluorescent BABI-probes for the construction of covalent microarrays on NHS-functionalized glass slides. The resulting microarrays were probed with 3 plant lectins (AAL, ECL, and UEA I) and 8 anticarbohydrate antibodies (antibodies against Lea, Leb, Lex, Ley, H-T1, H-T2, blood-group A, and blood-group B) with well-defined carbohydrate binding specificities (Table 1 and Figure S14; background color schemes in both the table and figures highlight the structural features and blood-group ABH and Lewis epitopes of the HMO probes and controls). As expected, AAL showed binding signals to all Fuc-containing oligosaccharides (Figure S14a), whereas UEA I (Figure S14b), anti-H-T2 (Figure S14c), and anti-Ley (Figure S14d) did not show any binding signals as H-type 2 and Ley motifs are not present in any of the HMOs identified in the isolated fractions or in any of the controls selected. ECL only recognized HMOs with terminal type 2 chains (Figure S14e). The anti-Lea exhibited specific binding signal to all HMOs with Lea epitope (Figure S14f), while anti-Leb recognized HMOs with Leb epitope (Figure S14g), and not unexpectedly, anti-Leb also showed weak binding signals to Lea-containing probes, consistent with previous knowledge.26 The anti-H type 1 (Figure S14h), antiblood-group A and antiblood-group B signals (Figure S14i,j) were all as expected.
Table 1. Microarray Analysis of the Difucosylated HMOs with Heptaose and Octaose Backbonesa.
T1, type 1 chain; T2, type 2 chain; in, internal Lewis (Le) epitope; b-g: blood group; samples in black font are included as controls; *purity <80%. Sample names in blue are described in the present study, and those in black are HMO controls. The color shades represent different features of the HMOs (see Figure S13 for details). HMO probes were printed in two concentrations: 25 μM and 50 μM, and 50 μM was used for the heatmap presentation, in which dark red indicates highest binding intensity and light blue indicates weakest or no binding signal.
In this study, the activity of anti-Lex antibody is relatively weak, and for most Lex-containing probes including the control LNFP III (probe 12) the fluorescent intensities were below 1,000 counts (Table 1 and Figure S14k). Only DF-novo-LNnO I (probe 20, fraction #42b) with double terminal Lex epitopes showed a good binding signal. It is interesting to note that the internal Lex epitope showed no binding signal, e.g., in the case of probes 23 and 24, in agreement with our recent results obtained from microarray binding analysis (Tajadura-Ortega, Chai and colleagues, unpublished) of a comprehensive panel of synthetic sequences containing Lewis antigens at different terminal and internal positions of the backbones.31 DF-LNO I (probe 17, fraction #30a/#31c) showed very weak ECL and Lex binding signals but also exhibited some Lea signal as this fraction is a mixture containing ∼40% impurity, among which DF-LNO II with Lea and T2 epitopes was the major one. This is consistent with ESI-MS4 (not shown) and purity analysis (Figure S6).
The microarray binding results of all 17 selected HMOs are consistent with ESI-MSn and NMR analyses in terms of epitope assignments.
Conclusions
Defining the fine structures of oligosaccharides is extremely important for understanding the structural basis of their diverse functions. Unlike N-glycans, HMOs and mucin type O-glycans are not only functionally but also structurally similar32 and can have many different backbones and multiple fucose residues forming different isomeric recognition motifs, posing considerable challenge to their separation and structural characterization.
UHT preparative PGC-HPLC, as demonstrated here, can overcome the α/β splitting problem due to the rapid switching between the α and β anomeric forms leading to peak splitting collapses and thus provides a high-efficiency and high-resolution method for separation of reducing glycans obtained from natural glycome sources. As the high temperature used in this study is much lower than the critical temperature of the supercritical fluid ACN/H2O, this ensures the solvent is in the liquid state during sample elution. Although the temperature used is high, the column back pressure is reduced due to the reduced solvent viscosity.
For the neutral oligosaccharides, negative-ion ESI-MS/MS is important for the analysis of linear and branched sequences. The combined use of singly and doubly charged molecular deprotonated molecular ions has been successfully used in assignment of many novel sequences of HMOs on branched backbones. However, for the more complex HMOs with DP ≥ 10 and novel branched backbones, MS/MS may not be sufficient, and MSn is required. For example, for MS3 scanning the dehydrated 0,2A-h ion of the GlcNAc residue at the reducing side of the branching Gal is selected as the unique precursor. As a result, from the highly complex HMO fraction, at least 70 subfractions were detected, and 16 isomeric difucosylated decasaccharides on different octaose backbones and one difucosylated nonasaccharide on a heptaose backbone were isolated, purified, and characterized, among which six are novel sequences. In this and several previous studies,16,17,25,30 negative-ion ESI-CID-MSn has played a key role in sequencing of HMOs and other reducing sugars.18 However, for complete structural assignment, NMR is always important when a sufficient amount of material is available, as demonstrated here. The sources and the synthetic pathways of oligosaccharides are also important knowledge in the determination of their structures.
The novo-octaose backbone and its monofucosylated analogue as AEAB derivatives have been described previously by MS analysis only.26 The latter was assigned in a complex mixture of three isomeric structures by product-ion scanning using exactly the same molecular ion precursor. The detection and assignment of the odd-numbered novo-heptaose backbone was unexpected as it has not been found in milk, although tetraose backbone from marsupial milk33 and pentaose backbone from human milk34 with Galβ1–3Gal linkage have been described.
The results presented here can serve to demonstrate further the complexity of HMOs. The method developed in this work can be used for high-efficiency separation and high-sensitivity sequence analysis of large-sized and complex reducing oligosaccharides with multiple isomeric sequences obtained from other glycome sources, e.g., the very challenging mucin O-glycomes when released in nonreducing conditions.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (22074143, 21934005, and 22174140), and in part by the Wellcome Trust Biomedical Resource grant (218304/Z/19/Z), the March of Dimes Prematurity Research Center grant (22-FY18-821), and the Francis Crick Institute through provision of access to the MRC Biomedical NMR Centre. The Francis Crick Institute receives its core funding from Cancer Research UK (CC1078), the UK Medical Research Council (CC1078), and the Wellcome Trust (CC1078). We are indebted to Haifeng Zhang of Acchrom Technologies for his support in the design of the UTH column oven.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.3c05008.
Additional information on results and methods; PGC-HPLC, SI-MSn, and NMR results (Tables S1–S4); HPLC system and additional optimimization, chromatography, negative-ion MSn, and NMR results (Figures S1–S14) (PDF)
The authors declare no competing financial interest.
Supplementary Material
References
- Feizi T. Demonstration by monoclonal antibodies that carbohydrate structures of glycoproteins and glycolipids are onco-developmental antigens. Nature 1985, 314, 53–57. 10.1038/314053a0. [DOI] [PubMed] [Google Scholar]
- Smilowitz J. T.; Lebrilla C. B.; Mills D. A.; German J. B.; Freeman S. L.. Breast milk oligosaccharides: structure-function relationships in the neonate. In Annual Review of Nutrition; Cousins R. J., Ed.; 2014; pp 143–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bode L. Human milk oligosaccharides: Every baby needs a sugar mama. Glycobiology 2012, 22, 1147–1162. 10.1093/glycob/cws074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blank D.; Dotz V.; Geyer R.; Kunz C. Human Milk Oligosaccharides and Lewis Blood Group: Individual High-Throughput Sample Profiling to Enhance Conclusions From Functional Studies. Advances in Nutrition 2012, 3, 440S–449S. 10.3945/an.111.001446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J.; Chen M. S.; Wang R. S.; Hu J. Q.; Liu S.; Wang Y. Y. F.; Xing X. L.; Zhang B. W.; Liu J. M.; Wang S. Current Advances in Structure-Function Relationships and Dose-Dependent Effects of Human Milk Oligosaccharides. J. Agric. Food Chem. 2022, 70, 6328–6353. 10.1021/acs.jafc.2c01365. [DOI] [PubMed] [Google Scholar]
- Akkerman R.; Faas M. M.; de Vos P. Non-digestible carbohydrates in infant formula as substitution for human milk oligosaccharide functions: Effects on microbiota and gut maturation. Crit. Rev. Food Sci. Nutr. 2019, 59, 1486–1497. 10.1080/10408398.2017.1414030. [DOI] [PubMed] [Google Scholar]
- Dinleyici M.; Barbieur J.; Dinleyici E. C.; Vandenplas Y. Functional effects of human milk oligosaccharides (HMOs). Gut Microbes 2023, 15, 2186115. 10.1080/19490976.2023.2186115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin C.; Lin Y.; Zhang H.; Wang G.; Zhao J.; Zhang H.; Chen W. Intestinal ’Infant-Type’ Bifidobacteria Mediate Immune System Development in the First 1000 Days of Life. Nutrients 2022, 14, 1498. 10.3390/nu14071498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hill D. R.; Chow J. M.; Buck R. H. Multifunctional Benefits of Prevalent HMOs: Implications for Infant Health. Nutrients 2021, 13, 3364. 10.3390/nu13103364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Auer F.; Jarvas G.; Guttman A. Recent advances in the analysis of human milk oligosaccharides by liquid phase separation methods. J. Chromatogr B 2021, 1162, 122497. 10.1016/j.jchromb.2020.122497. [DOI] [PubMed] [Google Scholar]
- Porfirio S.; Archer-Hartmann S.; Moreau G. B.; Ramakrishnan G.; Haque R.; Kirkpatrick B. D.; Petri W. A. Jr; Azadi P. New strategies for profiling and characterization of human milk oligosaccharides. Glycobiology 2020, 30, 774–786. 10.1093/glycob/cwaa028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Urashima T.; Katayama T.; Fukuda K.; Hirabayashi J.. Human Milk Oligosaccharides and Innate Immunity. In Comprehensive Glycoscience, 2nd ed.; Barchi J. J., Jr.. ; 2021; pp 389–439. [Google Scholar]
- Urashima T.; Hirabayashi J.; Sato S.; Kobata A. Human Milk Oligosaccharides as Essential Tools for Basic and Application Studies on Galectins. Trends Glycosci. Glycotechnol. 2018, 30, SE51–SE65. 10.4052/tigg.1734.1SE. [DOI] [Google Scholar]
- Chai W. G.; Leteux C.; Westling C.; Lindahl U.; Feizi T. Relative Susceptibilities of the Glucosamine-Glucuronic Acid and N-Acetylglucosamine-Glucuronic Acid Linkages to Heparin Lyase III†. Biochemistry. 2004, 43, 8590–8599. 10.1021/bi036250k. [DOI] [PubMed] [Google Scholar]
- Ninonuevo M. R.; Park Y.; Yin H.; Zhang J.; Ward R. E.; Clowers B. H.; German J. B.; Freeman S. L.; Killeen K.; Grimm R.; Lebrilla C. B. A strategy for annotating the human milk glycome. J. Agric. Food Chem. 2006, 54, 7471–7480. 10.1021/jf0615810. [DOI] [PubMed] [Google Scholar]
- Chai W. G.; Piskarev V.; Lawson A. M. Negative-Ion Electrospray Mass Spectrometry of Neutral Underivatized Oligosaccharides. Anal. Chem. 2001, 73, 651–657. 10.1021/ac0010126. [DOI] [PubMed] [Google Scholar]
- Chai W. G.; Piskarev V.; Lawson A. M. Branching pattern and sequence analysis of underivatized oligosaccharides by combined MS/MS of singly and doubly charged molecular ions in negative-ion electrospray mass spectrometry. J. Am. Soc. Mass Spectrom. 2002, 13, 670–679. 10.1016/S1044-0305(02)00363-X. [DOI] [PubMed] [Google Scholar]
- Palma A. S.; Liu Y.; Zhang H.; Zhang Y.; McCleary B. V.; Yu G.; Huang Q.; Guidolin L. S.; Ciocchini A. E.; Torosantucci A.; Wang D.; Carvalho A. L.; Fontes C. M. G. A.; Mulloy B.; Childs R. A.; Feizi T.; Chai W. Unravelling Glucan Recognition Systems by Glycome Microarrays Using the Designer Approach and Mass Spectrometry. Mol. Cell Proteomics 2015, 14, 974–988. 10.1074/mcp.M115.048272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H.; Zhang S.; Tao G.; Zhang Y.; Mulloy B.; Zhan X.; Chai W. Typing of blood-group antigens on neutral oligosaccharides by negative-ion electrospray ionization tandem mass spectrometry. Anal. Chem. 2013, 85, 5940–5949. 10.1021/ac400700e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feizi T.; Chai W. G. Oligosaccharide microarrays to decipher the glyco code. Nat. Rev. Mol. Cell Biol. 2004, 5, 582–588. 10.1038/nrm1428. [DOI] [PubMed] [Google Scholar]
- Lim C. K. Porous graphitic carbon in biomedical applications. Adv. Chromatogr. 1992, 32, 1–19. [PubMed] [Google Scholar]
- Pereira L. Porous graphitic carbon as a stationary phase in HPLC: Theory and applications. J. Liq. Chromatogr. Relat. Technol. 2008, 31, 1687–1731. 10.1080/10826070802126429. [DOI] [Google Scholar]
- Koizumi K. High-performance liquid chromatographic separation of carbohydrates on graphitized carbon columns. J. Chromatogr. A 1996, 720, 119–126. 10.1016/0021-9673(94)01274-1. [DOI] [PubMed] [Google Scholar]
- Chen C. H.; Lin Y. P.; Ren C. T.; Shivatare S. S.; Lin N. H.; Wu C. Y.; Chen C. H.; Lin J. L. Enhancement of fucosylated N -glycan isomer separation with an ultrahigh column temperature in porous graphitic carbon liquid chromatography-mass spectrometr. J. Chromatogr. A 2020, 1632, 461610. 10.1016/j.chroma.2020.461610. [DOI] [PubMed] [Google Scholar]
- Chai W. G.; Piskarev V. E.; Zhang Y. B.; Lawson A. M.; Kogelberg H. Structural determination of novel lacto-N-decaose and its monofucosylated analogue from human milk by electrospray tandem mass spectrometry and 1H NMR spectroscopy. Arch. Biochem. Biophys. 2005, 434, 116–127. 10.1016/j.abb.2004.09.035. [DOI] [PubMed] [Google Scholar]
- Ashline D. J.; Yu Y.; Lasanajak Y.; Song X.; Hu L.; Ramani S.; Prasad V.; Estes M. K.; Cummings R. D.; Smith D. F.; Reinhold V. N. Structural Characterization by Multistage Mass Spectrometry (MSn) of Human Milk Glycans Recognized by Human Rotaviruses. Mol. Cell. Proteomics 2014, 13, 2961–2974. 10.1074/mcp.M114.039925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ninonuevo M. R.; Lebrilla C. B. Mass spectrometric methods for analysis of oligosaccharides in human milk. Nutr. Rev. 2009, 67, S216–S226. 10.1111/j.1753-4887.2009.00243.x. [DOI] [PubMed] [Google Scholar]
- Li J.; Bi Y.; Zheng Y.; Cao C.; Yu L.; Yang Z.; Chai W.; Yan J.; Lai J.; Liang X. Development of high-throughput UPLC-MS/MS using multiple reaction monitoring for quantitation of complex human milk oligosaccharides and application to large population survey of secretor status and Lewis blood group. Food Chem. 2022, 397, 133750. 10.1016/j.foodchem.2022.133750. [DOI] [PubMed] [Google Scholar]
- Yan J.; Ding J.; Jin G.; Yu D.; Yu L.; Long Z.; Guo Z.; Chai W.; Liang X. Profiling of Sialylated Oligosaccharides in Mammalian Milk Using Online Solid Phase Extraction-Hydrophilic Interaction Chromatography Coupled with Negative-Ion Electrospray Mass Spectrometry. Anal. Chem. 2018, 90, 3174–3182. 10.1021/acs.analchem.7b04468. [DOI] [PubMed] [Google Scholar]
- Gao C.; Zhang Y.; Liu Y.; Feizi T.; Chai W. Negative-Jon Electrospray Tandem Mass Spectrometry and Microarray Analyses of Developmentally Regulated Antigens Based on Type 1 and Type 2 Backbone Sequences. Anal. Chem. 2015, 87, 11871–11878. 10.1021/acs.analchem.5b03471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ye J.; Xia H.; Sun N.; Liu C. C.; Sheng A.; Chi L.; Liu X.-W.; Gu G.; Wang S. Q.; Zhao J.; Wang P.; Xiao M.; Wang F.; Cao H. Reprogramming the enzymatic assembly line for site-specific fucosylation. Nature Catalysis 2019, 2, 514–522. 10.1038/s41929-019-0281-z. [DOI] [Google Scholar]
- Luis A. S.; Hansson G. C. Intestinal mucus and their glycans: A habitat for thriving microbiota. Cell host microbe 2023, 31, 1087–1100. 10.1016/j.chom.2023.05.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradbury J. H.; Collins J. G.; Jenkins G. A.; Trifonoff E.; Messer M. 13C-N.m.r. study of the structures of two branched oligosaccharides from marsupial milk. Carbohydr. Res. 1983, 122, 327–331. 10.1016/0008-6215(83)88344-X. [DOI] [PubMed] [Google Scholar]
- Gronberg G.; Lipniunas P.; Lundgren T.; Lindh F.; Nilsson B. Structural analysis of five new monosialylated oligosaccharides from human milk. Arch. Biochem. Biophys. 1992, 296, 597–610. 10.1016/0003-9861(92)90616-5. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





