Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Dec 16.
Published in final edited form as: Angew Chem Int Ed Engl. 2019 Nov 6;58(51):18577–18583. doi: 10.1002/anie.201908805

Expedient Synthesis of Core Disaccharide Building Blocks from Natural Polysaccharides for Heparan Sulfate Oligosaccharide Assembly

Nitin J Pawar 1,+, Lei Wang 2,+, Takuya Higo 3, Chandrabali Bhattacharya 4, Pavan K Kancharla 5, Fuming Zhang 6, Kedar Baryal 7, Chang-Xin Huo 8, Jian Liu 9, Robert J Linhardt 10, Xuefei Huang 11, Linda C Hsieh-Wilson 12
PMCID: PMC6901730  NIHMSID: NIHMS1052528  PMID: 31553820

Abstract

The complex sulfation motifs of heparan sulfate glycosaminoglycans (HS GAGs) play critical roles in many important biological processes. However, an understanding of their specific functions has been hampered by an inability to synthesize large numbers of diverse, yet defined, HS structures. Here, we describe a new approach to access the four core disaccharides required for HS/heparin oligosaccharide assembly from natural polysaccharides. The use of disaccharides as minimal precursors rather than monosaccharides greatly accelerates the synthesis of HS GAGs, providing key disaccharide and tetrasaccharide intermediates in about half the number of steps compared to traditional strategies. Rapid access to such versatile intermediates will enable the generation of comprehensive libraries of sulfated oligosaccharides for unlocking the ‘sulfation code’ and understanding the roles of specific GAG structures in physiology and disease.

Keywords: carbohydrates, heparan sulfate, glycosaminoglycans, oligosaccharides, synthesis design

Graphical Abstract

graphic file with name nihms-1052528-f0001.jpg

We report a highly efficient approach to obtain the four core disaccharides required for heparan sulfate (HS)/heparin oligosaccharide synthesis from natural polysaccharides. Rapid access to these key building blocks will greatly facilitate the generation of comprehensive libraries of HS oligosaccharides for unlocking the ‘sulfation code’ and understanding the roles of GAGs in physiology and disease.

Introduction

Heparan sulfate (HS) glycosaminoglycans (GAGs) are linear, sulfated polysaccharides that mediate a wide range of important biological and disease processes, including cell growth and proliferation, brain development, immune regulation, viral invasion, angiogenesis, and tumor metastasis.[1] The diverse activities of GAGs stem largely from their complex sulfation patterns, which facilitate the interactions of GAGs with hundreds of different proteins.[2] HS chains are composed of repeating disaccharide units of glucosamine (GlcN) joined via α−1,4-linkages to either D-glucuronic acid (GlcA) or L-iduronic acid (IdoA; Figure 1). Sulfation at the N-, 6-O- and 3-O-positions of GlcN and the 2-O-position of GlcA or IdoA produces many different sulfation patterns that are tissue-specific, age-specific, disease-related, and tightly regulated in vivo.[3] Indeed, a simple HS tetrasaccharide has the potential to display 1024 different sulfation sequences, highlighting the intriguing capacity of GAGs for molecular recognition. However, this chemical complexity has limited access to well-defined structures and hampered efforts to understand the biology of HS GAGs and to develop HS-based therapeutics.

Figure 1.

Figure 1.

Representative structure found in HS/heparin and core disaccharide building blocks.

Synthetic chemistry provides an elegant solution to this challenge. A notable example is the heparin-based pentasaccharide drug fondaparinux, which is approved for the treatment of deep vein thrombosis. The synthesis of various HS analogues was critical for identifying a rare 3-O-sulfated sequence that regulates antithrombin III activity.[1a] However, despite remarkable progress over the past two decades,[4] the synthesis of HS oligosaccharides remains a significant challenge. Only a small subset of the potential sulfation motifs has been generated, resulting in a paucity of structure-function information and hindering broad application of the compounds to biology.

A major roadblock lies in the preparation of suitable HS building blocks. The synthesis of selectively protected IdoA- and GlcA-containing disaccharides, which are traditionally derived from monosaccharide precursors, usually requires 18–29 steps each, depending on the complexity of the protecting groups (See Supporting Information, Figure S1, for representative literature examples). This is due to a lack of commercial sources for L-iduronic acid/L-idose, as well as the need for stereoselective formation of glycosidic bonds and elaborate protecting group strategies to direct regioselective sulfation. Another major roadblock is the lack of universal building blocks for the synthesis of HS GAGs. Ideally, virtually any sulfation motif could be obtained from a small set of building blocks. The lengthy, laborious processes required and the absence of universal building blocks have fundamentally limited the ability to produce large, comprehensive libraries of HS oligosaccharides. To date, sulfated libraries of compounds representing all possible sulfation motifs have only been attainable in the case of HS disaccharides.[4o]

Here, we report a novel, expedient approach to synthesize the four core disaccharides required for HS/heparin oligosaccharide assembly. We show that these key disaccharides can be obtained on a large scale from readily available natural polysaccharides and efficiently converted into versatile synthetic intermediates. Importantly, by employing disaccharides rather than monosaccharides as the minimum precursors, this approach eliminates half of the challenging glycosidic bond-forming reactions and significantly reduces the total number of steps. The new methods described herein should greatly streamline the synthesis of HS oligosaccharides and accelerate the production of diverse collections of HS GAG sequences.

Results and Discussion

Naturally occurring heparin is an attractive starting material because it is commercially available (~$14/g) and produced on a large scale for pharmaceutical use. We envisioned that key IdoA-containing building blocks for HS/heparin synthesis might be obtained through the controlled hydrolysis of heparin. However, hydrolysis of heparin under basic conditions results in β-elimination to form unsaturated uronic acid moieties,[5] while nitrous acid-mediated depolymerization of heparin is accompanied by deaminative ring contraction of GlcN.[6] Fortunately, elegant studies by Davidson and Meyer,[7] as well as Lopin and Jacquinet, [8] showed that chondroitin sulfate (CS) could be hydrolyzed using aqueous H2SO4 to provide GlcA-N-acetylgalactosamine (GalNAc) disaccharides. Selective cleavage of the GalNAc-GlcA bond was presumably facilitated by neighboring group participation from the N-acetyl group of GalNAc.

Encouraged by these reports, we generated N-acetylated (N-Ac) heparin by subjecting sodium heparinate (which is >85% N-sulfated) to N-desulfation and N-acetylation[9] and then explored the ability of various acids (e.g., H2SO4, TFA, CuCl, TMSOTf, BF3Et2O, TfOH) to produce intact HS disaccharides. We found that N-Ac-heparin was efficiently hydrolyzed to give disaccharides as the major product using 1 M TfOH at 100°C for approximately 6 h (Scheme 1 and Figure S2). The crude free disaccharide was then esterified using AcCl and MeOH, which also resulted in methylation of the anomeric hydroxyl group. Hydrolysis of this methyl glycoside and acetylation of the free hydroxyls using acetic anhydride, followed by treatment with acetic anhydride/pyridine to effect N-acetylation, resulted in peracetylated disaccharide 1.

Scheme 1.

Scheme 1.

Synthesis of GlcN-IdoA and GlcA-GlcN Building Blocks from Natural Polysaccharides. TfOH = trifluoromethanesulfonic acid, Ac = acetyl, TfN3 = trifluoromethanesulfonyl azide, CbzCl = benzyl chloroformate.

Unexpectedly, NMR structural analysis revealed that cleavage of the glycosidic bond had occurred predominantly at the reducing end of IdoA to give the GlcN-IdoA disaccharide. Formation of the other disaccharide, IdoA-GlcN, was not observed by NMR, although its presence was detected by hydrophilic interaction chromatography-Fourier transform mass spectrometry (HILIC-FTMS) (Figure S2). These results suggest that the N-acetyl group of heparin does not significantly promote the reaction. As further confirmation that an N-acetyl group was not required, we performed the reaction directly on sodium heparinate. We found that sodium heparinate was efficiently hydrolyzed using 2 M TfOH at 100°C (Figure S3). Extensive optimization was conducted to maximize scalability and reproducibility. Starting from 25 g of natural heparin, the key GlcN-IdoA disaccharide 1 was routinely obtained via a one-pot, four-step reaction sequence (single purification step) in 18% overall yield (4.4 g) (Scheme 1, Table S1). Importantly, direct access to this disaccharide obviates the need to perform the notoriously challenging stereoselective 1,2-cis glycosylation reaction[10] to generate GlcN-α(1,4)-IdoA linkages.

The amino groups of HS/heparin are known to be acetylated, sulfated, or unmodified in vivo. Thus, to expand the versatility of the method further, we sought to convert the N-acetamide of 1 to other synthetically useful functionalities. Treatment of the amine with TfN3, K2CO3 and ZnCl2 after the esterification step, followed by anomeric demethylation and peracetylation, resulted in smooth conversion to the desired GlcN3-IdoA disaccharide 1b in 20% yield over the four steps (Scheme 1). From 25 g of heparin, 4.8 g of disaccharide 1b was obtained in the course of 3–4 days (Table S1), highlighting the scalability and practicality of the route. Alternatively, the use of benzyl chloroformate introduced a N-Cbz protecting group in place of the N-acetyl group to give 1c in 8.5% yield over the four steps.

We next investigated whether we could obtain GlcA-containing disaccharides from heparosan, the biosynthetic precursor to heparin. Large-scale fermentation procedures have been developed for the purification of heparosan (>100 g) from Escherichia coli K5.[11] We found that heparosan was efficiently hydrolyzed under slightly milder acidic conditions of 1M TfOH at 100°C (Scheme 1 and Figure S4). After esterification and peracetylation using similar conditions as before, the GlcA-GlcN disaccharide 2 was obtained in 16% overall yield (4.9 g) over the four steps (single purification step) starting from 19 g of heparosan. In contrast to heparin, cleavage of the glycosidic bond occurred exclusively at the reducing end of GlcN to afford the GlcA-GlcN disaccharide (Figure S4). We also observed that N-deacetylated heparosan was hydrolyzed very slowly under the same conditions, suggesting the importance of the N-acetyl group in this case (Figure S5). The interesting differences in the mode of glycosidic bond cleavage between heparin and heparosan are presumably due to different stereoelectronic effects during oxocarbenium ion formation, which are possibly facilitated by the conformational flexibility of the IdoA ring. Possible explanations are provided in the SI (Figure S6). Independent of the mechanism, it is fortuitous that heparin and heparosan undergo distinct cleavages to form two of the four disaccharides required for HS assembly (Figure 1).

We envisaged that the other two disaccharides might be readily accessed through epimerization of 1 and 2. Only a few methods have been reported on epimerization as a synthetic means to access IdoA from GlcA, and all using monosaccharides.[12] Base- or metal-catalyzed interconversion of IdoA to GlcA in 1 led to poor overall yield due to β-elimination or significant disaccharide decomposition (data not shown). Inspired by a report by Wong and colleagues,[12a] we subjected the GlcA- containing 2 to α-bromination using NBS in the presence of UV light[12a] to produce the C-5 bromo compound 5 in 75% yield. Various radical initiators (AIBN, triethylborane), reducing agents (tributyltin hydride, triphenyltin hydride), and temperatures (0–110°C) were then explored to effect α-dehalogenation. We found that treatment of 5 with Et3B and Bu3SnH at 20°C gave the highest amount of epimerized IdoA-GlcN product 4 in 63% yield, along with recovery of the valuable GlcA-GlcN epimer 2 in 31% yield (Scheme 2 and Table S2). On the other hand, NBS-mediated bromination of 1 gave an epimeric mixture of the C-5 bromo compound 6. Subsequent α-dehalogenation of 6 using AIBN and Bu3SnH at 110°C afforded the GlcN-GlcA disaccharide 3 in 33% yield (Scheme 2 and Table S3). Although only moderate conversion to GlcN-GlcA was observed, the GlcN-IdoA building block 1 could be recovered in 62% yield and readily recycled to produce more GlcN-GlcA building block. Thus, this approach provides novel streamlined routes to all four of the core HS disaccharide modules (1-4; Figure 1).

Scheme 2.

Scheme 2.

Epimerization to Form All Four HS Core Disaccharides. NBS = N-bromosuccinimide, AIBN = 2,2′-Azobis(2-methylpropionitrile).

The assembly of HS oligosaccharides with defined sulfation sequences requires orthogonally protected disaccharides bearing functionalities that can be selectively removed to unmask hydroxyl or amino groups for sulfation. We therefore sought to synthesize strategically protected building blocks from the core disaccharides. Disaccharide 1b was treated with ZnI2 and TMSSPh to give the corresponding thioglycoside 7 in 88% yield (Scheme 3a). After global deacetylation, the 6-O- and 4-O- hydroxyl groups of GlcN were protected with a (2-naphthyl) methylene acetal to afford compound 8. Next, we investigated the selective 2-O-protection of IdoA, which is particularly challenging for trans-di-axial 1,2-diols. Treatment of 8 with dibutyltin oxide and benzoyl chloride at 70°C gave the desired compound 9 as the exclusive product. Although dibutyltin oxide has been widely used for the regioselective protection of cis-1,2 and di-equatorial trans-1,2 diols,[13] to our knowledge, this is the first successful application of dibutyltin oxide to the regioselective protection of the di-axial trans 2-O- or 3-O-positions of IdoA. We found that 9-fluorenylmethoxycarbonyl (Fmoc), 2,2,2-trichloroethoxycarbonyl (Troc), and benzoyl (Bz) groups could be selectively installed at the 2-O-position of GlcN-IdoA derivatives, whereas the monochloroacetyl (MCA) group could be selectively installed at the 3-O-position, highlighting the synthetic utility of this transformation (Table S4). Benzylation of the remaining hydroxyl groups in 9 using Ag2O and BnBr in 1:1 DCM/DMF gave the valuable intermediate 10 in only 9 steps (5 purifications) from heparin. For comparison, an analogous disaccharide containing a cyclic benzylidene acetal was synthesized by Gardiner and coworkers over 17 steps in the shortest route to date (Figure S1).[4m,4n]

Scheme 3.

Scheme 3.

Differential protection of the GlcN-IdoA building block. TMSSPh = trimethyl(phenylthio)silane, SPh = benzenethiol, DCM = dichloromethane, NapCH(OMe)2 = 2-naphthaldehyde dimethyl acetal, Naph = 2-naphthyl, p-TsOH = p-toluenesulfonic acid, TBAI = tetrabutylammonium iodide, BzCl = benzoyl chloride, MS = molecular sieves, BnBr = benzyl bromide, DMF = dimethylformamide, LevOH = levulinic acid, EDC = 1-ethyl-3-(3-dimethylaminopropyl)carbodiimide, DMAP = 4-(dimethylamino)pyridine, TBDPSCl = tert-butyl(chloro)diphenylsilane, Im = imidazole, Py = pyridine, TFAA = trifluoroacetic anhydride, NIS = N-iodosuccinimide. FmocCl = 9-fluorenylmethoxycarbonyl chloride, TFA = trifluoroacetic acid, TFAHN = trifluoroacetamide, STol = 4-methylbenzenethiol, All = allyl.

Disaccharide 10 is a highly versatile intermediate that can be used to synthesize various simple to highly complex HS oligosaccharides. For example, regioselective opening of the Naph acetal would lead to a 6-O-(2-naphthyl)methyl (Nap) ether group and unmask the 4-O hydroxyl group, providing a glycosyl acceptor for HS chain elongation. Here, we chose to convert disaccharide 10 into the highly differentially protected disaccharide donor 13 and acceptor 18 as general building blocks for the synthesis of various HS sulfation motifs. These building blocks have five O-protecting groups (tert-butyldiphenylsilyl (TBDPS), levulinoyl (Lev), Nap, Fmoc and CO2t-Bu) and two N-protecting groups (N3, N-trifluoroacetyl (TFA)), whose orthogonality is well documented.[14, 4b] This overall strategy maximizes the number of possible sulfation patterns from each disaccharide, enabling the generation of up to 64 different sulfation sequences from a single protected tetrasaccharide.

The required disaccharide donor 13 was synthesized by exchanging the 2-OBz group of 10 for a Lev group, removal of the Naph acetal, and protection of the resulting primary and secondary hydroxyl groups with TBDPS and Bz groups, respectively. To generate the disaccharide acceptor 18, the azido group of 10 was reduced and the resulting amine protected with a TFA group to afford 14 in 81% yield over two steps (Scheme 3b). Glycosylation of 14 using tert-butyl 2-hydroxyacetate gave compound 15 with a versatile linker[15] at the reducing end. Chemoselective deprotection of the 2-OBz group in 15, followed by Fmoc protection of the resultant secondary alcohol and regioselective opening of the Naph acetal, gave the desired acceptor 18 in 72% yield. Differentially protected disaccharides 19 - 20 were also successfully synthesized using similar reaction sequences (Scheme 3c, S1 and S2). In the future, orthogonally protected GlcN-GlcA disaccharides derived from 3 can be readily produced. Overall, strategic protection of the core disaccharides 1/1b with different sets of protecting groups was accomplished to provide key HS/heparin disaccharide building blocks in only 5–11 steps.

With the disaccharides in hand, we sought to generate a highly orthogonally protected tetrasaccharide that could serve as a ‘universal’ building block for the generation of many sulfation sequences. Surprisingly, glycosylation of donor 19 with model acceptor 20 gave the undesired β-anomer, as determined by HMQC and NOE analysis (Scheme S3). Although participation of the axially oriented 3-OAc moiety of 19 following anomeric activation could promote the observed β-selectivity, exchanging the 3-OAc with an OBn group did not alter the glycosylation selectivity (data not shown). Therefore, we reasoned that the Fmoc carbonate group may not provide strong enough neighboring group participation to overcome the intrinsic β-selectivity of the glycosylation reaction, which prompted us to examine the 2-OLev ester protected donor 13. Reaction of donor 13 with acceptor 18 using NIS and AgOTf at room temperature delivered tetrasaccharide 21 with exclusively the desired α- stereochemistry (1JC–H = 174.3 Hz) in 60% yield, in addition to 18% recovery of unreacted acceptor 18 (Scheme 4a). Thus, the new methods reported herein have enabled synthesis of the uniquely designed, highly orthogonally protected HS tetrasaccharide 21, with seven different protecting groups (TBDPS, Lev, Nap, Fmoc, CO2t-Bu, N3, and TFA) and differentiation of the two nitrogen atoms, in only 21 steps.

Scheme 4. A universal tetrasaccharide building block for the generation of diverse HS sequences.

Scheme 4.

a) Assembly of the strategically protected tetrasaccharide 21. b) Demonstration of the orthogonality of the TBDPS, Nap, Lev, Fmoc, and t-Bu protecting groups. DDQ = 2,3-dichloro-5,6-dicyano-p-benzoquinone.

We next demonstrated the orthogonality of the O-protecting groups in tetrasaccharide 21. Selective removal of the TBDPS or Nap group was achieved using HF·Py or DDQ, respectively, to give compounds 22 and 23 (Scheme 4b). Alternatively, the Lev- or Fmoc groups could be chemoselectively removed using hydrazine acetate or piperidine, respectively, to afford compounds 24 and 25. To provide a carboxylic acid handle for the potential attachment to a solid support, the t-Bu ester at the reducing end was selectively hydrolyzed using 3% TFA in DCM to obtain compound 26. Thus, each 2-O-, 6-O, or amino group in 21 can be selectively unmasked for regioselective sulfation, allowing in principle for the production of 64 different sulfation motifs from a single, universal tetrasaccharide building block.

To demonstrate the versatility of 21, we generated tetrasaccharides with regiodefined sulfation patterns. Sulfation of free hydroxyl groups in 22 and 25 using sulfur trioxide triethylamine complex gave the 6-O- and 2-O-monosulfated tetrasaccharides 27 and 28, respectively (Scheme 5). These results confirm that all of the protecting groups (TBDPS, Nap, Lev, Fmoc, N3 and TFA) remain intact under typical sulfation conditions and that our approach can allow for regioselective sulfation of the 2-O-, 6-O- and N-positions of tetrasaccharide 21. Deprotection of the TBDPS group of 28, global ester and trifluoroacetamide hydrolysis using LiOH, and acetylation of the resultant primary amine provided compound 29. Subsequent hydrogenolysis of the benzyl and Nap groups with concomitant reduction of the N3 group and chemoselective N-sulfation delivered the defined sulfated tetrasaccharide 30. On the other hand, treatment of 27 with thioacetic acid accomplished the one-pot conversion of the azide to the acetamide to give compound 31. Global hydrolysis, followed by hydrogenolysis and chemoselective N-sulfation as before, delivered the desired sulfated compound 32. Thus, tetrasaccharide 21 can serve as a versatile intermediate for the synthesis of HS GAGs displaying different regiodefined sulfation patterns. It is worth noting that structures such as 30 and 32 bearing distinct O-sulfation and N-sulfation patterns on its two disaccharide units are generally more tedious to prepare chemically and are inaccessible using current chemoenzymatic methods.

Scheme 5.

Scheme 5.

Regioselective sulfation of the 2-O, 6-O and N positions to give tetrasaccharides bearing asymmetrical, regiodefined sulfation patterns.

Conclusion

In summary, we have developed novel, concise and scalable routes to obtain key HS/heparin disaccharide building blocks from natural heparin and heparosan polysaccharides. We demonstrate that the use of naturally-derived disaccharides as minimal synthetic precursors accelerates the synthesis of HS GAGs, providing general and versatile disaccharide and tetrasaccharide building blocks for the assembly of HS/heparin oligosaccharides in about half the number of steps compared to traditional approaches starting from monosaccharides. Future studies will focus on obtaining large numbers of diverse, sulfated oligosaccharides through the exploration of solid-phase synthesis methodologies, library encoding strategies, and automation.[14a, 16] Rapid access to key, universal HS building blocks promises to significantly expand the scope of HS synthesis, enabling the future generation of large libraries of compounds for deciphering the ‘sulfation code’ and developing new GAG-based therapies.

Experimental Section

Full experimental details and compound characterizations are provided in the Supplementary Information, including supporting figures, tables, experimental procedures, characterization data, and NMR spectra.

Supplementary Material

Supp info

Acknowledgements

Financial support was provided by the NIH Common Fund grant U01 GM116262–03 and the NIH R01 grant HL62244. We thank Dr. Mona Shahgholi in the CCE Division Mass Spectrometry Facility and Dr. David Vander Velde in the CCE Division NMR Facility at Caltech.

Footnotes

Supporting information for this article is given via a link at the end of the document.

Contributor Information

Nitin J. Pawar, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA).

Lei Wang, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA).

Takuya Higo, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA).

Chandrabali Bhattacharya, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA).

Pavan K. Kancharla, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA)

Fuming Zhang, Departments of Chemical Biology and Chemical and Biological Engineering, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180 (USA).

Kedar Baryal, Departments of Chemistry and Biomedical Engineering, Michigan State University, East Lansing, MI 48824 (USA).

Chang-Xin Huo, Departments of Chemistry and Biomedical Engineering, Michigan State University, East Lansing, MI 48824 (USA).

Jian Liu, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599 (USA).

Robert J. Linhardt, Departments of Chemical Biology and Chemical and Biological Engineering, Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180 (USA)

Xuefei Huang, Departments of Chemistry and Biomedical Engineering, Michigan State University, East Lansing, MI 48824 (USA).

Linda C. Hsieh-Wilson, Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 (USA)

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp info

RESOURCES