Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2015 Mar 12.
Published in final edited form as: Biomacromolecules. 2015 Feb 16;16(3):951–961. doi: 10.1021/bm5018386

Proteoglycans and their heterogeneous glycosaminoglycans at the atomic scale

Benedict M Sattelle 1, Javad Shakeri 1, Matthew J Cliff 1, Andrew Almond 1,*
PMCID: PMC4356732  EMSID: EMS62149  PMID: 25645947

Abstract

Proteoglycan spatiotemporal organization underpins extracellular matrix biology but atomic scale glimpses of this microarchitecture are obscured by glycosaminoglycan size and complexity. To overcome this, multi-microsecond aqueous simulations of chondroitin and dermatan sulfates were abstracted into a prior coarse-grained model, which was extended to heterogeneous glycosaminoglycans and small leucine-rich proteoglycans. Exploration of relationships between sequence and shape led to hypotheses that proteoglycan size is dependent on glycosaminoglycan unit composition but independent of sequence permutation. Uronic acid conformational equilibria were modulated by adjacent hexosamine sulfonation and iduronic acid increased glycosaminoglycan chain volume and rigidity, while glucuronic acid imparted chain plasticity. Consequently, block copolymeric glycosaminoglycans contained microarchitectures capable of multivalent binding to growth factors and collagen, with potential for interactional synergy at greater chain number. The described atomic scale views of proteoglycans and heterogeneous glycosaminoglycans provide structural routes to understanding their fundamental signaling and mechanical biological roles and development of new biomaterials.

Keywords: Coarse grain, Extracellular matrix, Glycosaminoglycan, Microsecond dynamics, Proteoglycan

INTRODUCTION

Proteoglycans (PGs) are vital constituents of cellular membranes and extracellular matrices, where they confer tissue mechanical properties, facilitate constitutive cell-cell interactions and are implicated in neurodegeneration, arthritis and cancer.1-4 Human PGs, such as bikunin and decorin,5, 6 comprise a core protein plus one or more covalently linked glycosaminoglycan (GAG) polymers, which possess functionally important patterns of post-synthetic modification. In particular, domains rich in sulfate and iduronic acid maintain connective tissue integrity and mediate signal transduction via multivalent interactions with collagen and growth factors.7-10 However, the molecular basis for such bioactivity remains enigmatic, in part because current representations of PGs and attendant GAGs lack atomic scale conformational detail.

Obtaining high resolution 3D data for PGs is hampered by GAG sequence heterogeneity, conformational flexibility and molecular size. Electron microscopy and small-angle X-ray scattering have provided low resolution insights.11, 12 However, atomic resolution crystallographic studies have been limited to either the protein core (e.g., for small leucine-rich repeat PGs13-15) or isolated GAG components (e.g., fiber X-ray scattering data for chondroitin-4-sulfate16). Modeling offers an avenue to progress but accurate fine-grained simulations (i.e., including all atoms and water solvent) have been restricted to oligosaccharides and short sub-μs timescales (notwithstanding approaches that distort biologically important kinetics17). This limitation has, to some extent, been recently overcome by the use of application-specific integrated circuits and graphics processing units, which have enabled unbiased multi-μs computational studies.18, 19 However, application of fine-grained simulations to even small PGs will remain out of reach for the foreseeable future.

Less computationally demanding coarse-grained (CG) carbohydrate models have been successfully developed,20, 21 but in these only the polymer glycosidic linkage degrees of freedom remain and functionally important pyranose ring flexing (or puckering) is neglected. Recently, we developed a CG approach that includes both carbohydrate linkage and ring motions.22 In this method, linkage torsional and ring puckering free energy surfaces are calculated from microsecond timescale fine-grained simulations (containing implicit information on sugar substituents) and used to parametrize a simplified CG representation, which can be used to rapidly predict the motions of polymers using numerical integration. Although promising, the method’s ability to model heterogeneous GAG sequences or glycoconjugates, such as PGs, has not been established. Therefore, at present, neither experimental nor theoretical techniques have been shown capable of characterizing heterogeneous GAGs or PGs at high resolution. Consequently, the relationships between GAG post-synthetic modification, domain organization and PG dynamic shape, and their role in multivalent interactions with proteins, remain enigmatic at the atomic scale.

Here, we overcome this inability by broadening the applicability of our CG approach to characterize heterogeneous GAGs and PGs both accurately and at the atomic scale. Unbiased microsecond fine-grained simulations of hyaluronan and six key sequence units from chondroitin and dermatan sulfates (Figure 1) were performed to investigate the effect of post-synthetic modifications on molecular motions. Using these data to further parameterize our CG model enabled simulations of GAGs to be performed on complex heterogeneous sequences in general, and physiologically-relevant block co-polymeric structures in particular. In further key steps, the CG approach was extended to PGs, and by exploiting GAG sequencing information and X-ray diffraction data, realistic atomic models of small leucine-rich repeat PGs were constructed. Where possible, the derived models were validated by comparison with hydrodynamic measurements. The results represent the first accurate atomistic views of heterogeneous GAGs and PGs, allowing relationships between sequence, atomic scale shape and dynamics to be explored in these complex extracellular molecules. The reported models transcend simple schematic representations of PGs and advance understanding of their assembly and protein interactions, towards clarifying the roles of PGs in health and disease and furthering their use as therapeutics.

Figure 1.

Figure 1

Chemical structures of hyaluronan, chondroitin and dermatan repeating disaccharide units. All-atom explicit solvent (or fine-grained) 10 μs simulations were performed for model decasccharides (n=5) of: (A) hyaluronan, (B) the chondroitin O-unit (R1=R2=OH), A-unit (R1=OSO3, R2=OH) and C-unit (R1=OH, R2=OSO3) and (C) the dermatan iO-unit (R1=R2=OH), iA-unit (R1=OSO3, R2=OH) and iC-unit (R1=OH, R2=OSO3).

MATERIALS AND METHODS

Materials

Medical grade hyaluronan was obtained from Genzyme Corp. (Boston, USA). Shark cartilage chondroitin sulfate (cat. no. C4384), bovine trachea chondroitin sulfate (cat. no. 27042) and porcine intestinal mucosa dermatan sulfate (cat. no. C3788) were obtained from Sigma-Aldrich (Dorset, UK). Two disaccharide samples were produced by bacterial chondroitinase digestion of chondroitin and dermatan sulfate and isolated using high resolution gel filtration and ion exchange chromatography (Iduron Ltd., Manchester, UK) to use as reference compounds for NMR assignment of polysaccharides. Each disaccharide sample contained a mixture of unsulfonated uronic acids (UA) with unsaturated UA C4-C5 bonds (ΔUA) and a single O-sulfate at either C4 (4S) or C6 (6S) on the galactosamine (GalNAc); chemically they were: ΔUAβ(1→3)GalNAc-4S (cat. no. CD002) and ΔUAβ(1→3)GalNAc-6S (cat. no. CD003).

NMR Spectroscopy

For NMR analyses, sulfated polysaccharide and disaccharide samples (see Materials) were prepared from lyophilized material reconstituted in 595 μl D2O (CK Gas Products Ltd., Ibstock, UK) and 5 μl 4,4-dimethyl-4-silapentane-1-sulfonic acid (DSS). All samples were centrifuged at 12,000g for 10 minutes before transfer to 5mm NMR tubes (part no. 542-PP-7, Wilmad-LabGlass, Vineland, USA). Each was adjusted to pD 7.3-7.4 and final concentrations were in the range 0.5-2.0 mM. The [1H]-1D, [1H,13C]-HSQC, [1H]-COSY and [1H,13C]-HSQC-TOCSY spectra were recorded at 25°C with a 1H frequency of 800 MHz (Avance III spectrometer with TCI 1H-13C/15N-1H cryoprobe equipped with z-gradients, Bruker Corp., Billerica, USA). Heteronuclear experiments were gradient selected and sensitivity enhanced; chemical shifts were referenced relative to internal DSS. Spectra were processed using Topspin software (Bruker Corp., Billerica, USA); 1D experiments were apodized by an exponential function (0.3 Hz decay constant) and zero-filled to twice the acquired points; 2D spectra were linear predicted in the indirect dimension to twice the acquired points and apodized with a 90° sine-bell function in both dimensions. Assignments for two sulfated disaccharides (see Materials) were performed with reference to prior data for similar compounds.23-27 Glycosaminoglycan polysaccharide 1H and 13C assignments were aided by comparison with the spectra and assignments from these disaccharides. In polysaccharide [1H,13C]-HSQC spectra, overlap of GalNAc C4 and C6 peaks hindered integration and precluded estimation of 6-O-sulfate; in this case C5 peak integrals were used to the estimate relative levels of sulfate.

Laser Light Scattering

Glycosaminoglycan polysaccharides (4 mg) were dissolved in 0.5 ml of ammonium bicarbonate (100 mM) for 24 hr at 4°C and then centrifuged at 12000g for 10 minutes prior to application to a Superose 6 10/300GL gel filtration column (GE Healthcare, Little Chalfont, UK) running at a flow rate of 0.75 ml/min in the same eluent. Samples eluting from the column at room temperature passed through an in-line DAWN HELEOS-II laser photometer with a 658 nm laser (Wyatt Technology Corp., Santa Barbara, USA), WyattQELS detector model number Q-03 and an Optilab rEX refractometer; a refractive index increment (dn/dc) of 0.16 mlg−1 was used throughout.28 Light scattering and concentration during elution were recorded using ASTRA software (v5.3.4.13). Multi-angle laser light scattering was used to determine weight-averaged molecular masses (M) and radius of gyration (Rg) using analysis modules within the ASTRA software. For molecules too small to accurately determine Rg, quasi-elastic light scattering was used to measure hydrodynamic radii (Rh), again using analysis modules within the ASTRA software. Conversion of calculated Rg to Rh was achieved using the formula Rg = 1.5Rh, derived previously for a random coil.29

Fine-grained Simulations

Fine-grained molecular modeling of seven GAG decasaccharides and the GAG-PG linker trisaccharide, Galβ(1→3)Galβ(1→4)Xyl, were performed using the GLYCAM06 force-field topologies and parameters for carbohydrates.30 Each was built using the AMBER1231 tool “tleap” and then added to cubic boxes of TIP3P water,32 resulting in side lengths in the range 6.2-7.2 nm (≈12,000 water molecules). Solute atoms were positioned at least 1.2 nm from the solvent box edge and charge neutral assemblies were achieved by adding the appropriate number of explicit Na+ ions. Following initial conjugate-gradient energy minimization (1000 steps), each assembly was heated to 298 K and equilibrated in the NPT ensemble for 20 ns prior to 10.3 μs unbiased NVT production molecular dynamics, performed using ACEMD software33 and GTX TITAN graphics processing unit hardware (Nvidia Corp., Santa Clara, USA), as described previously.19, 34 The initial 300 ns was discarded and for the remaining 10 μs, atomic coordinate data were saved at 10 ps intervals for analyses. Molecular properties and axial helical rises (per disaccharide) were computed from complete 10 μs trajectories (i.e., one million datasets) using standard equations described previously.35 The distance r (graphed in Figure 2) was calculated over the four internal disaccharides within decasaccharides (i.e., between reducing-end and nonreducing-end glycosidic linkage oxygen atoms). Predominant decasaccharide conformers (illustrated in Figure 2) were derived using the AMBER1231 tool “ptraj” and a hierarchical 3D-clustering algorithm with a sampling frequency of 100 frames (10000 in total). Glycosidic torsion angles ϕ and ψ in (1→n) linkages (where n = 3 or 4) were computed using the IUPAC definitions: ϕ defined as O5-C1-O-Cn and ψ defined as C1-O-Cn-C(n-1). A charge neutral 1 μs simulation of bikunin with a GAG-PG linker trisaccharide attached at serine residue 10 (Ser-10) was also performed using the methods detailed above; in this case both the GLYCAM06 and AMBER FF0336 force-fields were used and the proteoglycan was solvated in a cubic box of TIP3P water with a side length of 7.3 nm.

Figure 2.

Figure 2

(A-G) Predominant conformers (left, with their percentage occupancy) and microsecond conformational dynamics in 10 μs fine-grained simulations of GAG decasaccharides, measured using the distance (r, right) between the two endmost glycosidic oxygen atoms. The structures comprise homogeneous repeats of: (A) hyaluronan (H-unit), (B, D, F) chondroitin sulfate containing glucuronic acid (O-, A- and C-units) and (C, E, G) dermatan sulfate containing iduronic acid (iO-, iA- and iC-units). The correlation between r and pyranose puckering is exemplified in grey (B) by the Cremer-Pople azimuthal angle (θ) for an internal glucuronic acid ring (number four counting from the reducing-end).

Coarse-grained Simulations

The coarse-grained (CG) model, utilizing empirical energy functions for glycosidic linkage and ring pucker, has been described previously.22 Two-dimensional free energy surfaces for glycosidic linkage torsions and ring azimuthal and meridian angles were derived from fine-grained 10 μs simulations and parameterization of the CG empirical energy functions was achieved by fitting to these surfaces, as described previously.22 For validation purposes only, all linkages and all rings present in decasaccharides were parameterized from their respective fine-grained simulation equivalent. For simulations of polymers, β(1→3) linked disaccharide units were parameterized from data for the central two linkages and two rings of respective fine-grained decasaccharides. These units were stitched together via their β(1→4) linkages to produce model chains, i.e., the constituent disaccharide [β(1→4)Uβ(1→3)N] is a representative unit with two linkages and two rings parameterized from a single fine-grained simulation (here, U and N represent uronic acid and hexosamine, respectively). If the ends of the chain did not naturally fit within this disaccharide register then individual linkages and rings (with identical parameters) were appended (N to the nonreducing end and/or β(1→4)U to the reducing end). This allowed the construction of heterogeneous polymers of arbitrary length and sequence. All CG simulations were performed for 10 μs, accomplished by integrating the Langevin equation of motion independently for the linkages and rings, using the Leapfrog algorithm and a time-step of 2.5 fs, described previously.22 During the simulations, only the positions of the glycosidic oxygen atom and pyranose ring atoms were calculated (i.e., ring substituents were not explicitly modeled) and coordinates were saved at regular intervals for analyses. No steric interactions were calculated during CG simulations (other than those implicit in the parameters). Therefore, when it was deemed necessary, and after completion of each simulation, coordinate sets with steric clashes were identified by detecting those in which CG atoms had approached within 1 nm (excluding those in the same or neighboring residues) and then removed from the simulated ensemble. Radius of gyration (Rg) calculations on polymers were performed without adding any substituents to the carbohydrate rings (test calculations on polymers showed this had a negligible effect on the outcome).

Proteoglycan Simulations

Models were constructed using Cartesian coordinates for small leucine-rich repeat PG core proteins (bikunin, decorin and biglycan); these were obtained from the Protein Data Bank (identification codes: 1BIK,13 1XKU15 and 2FT3,14 respectively). The decorin dimer was built by applying a transformation matrix (the PDB file BIOMT record). Terminal amino acids missing from these experimental data (residues 1-24 in bikunin and biglycan, 1-21 in decorin) were constructed using a knowledge-based algorithm with default settings in the “Loop Modeler” feature of MOE software (Chemical Computing Group Inc., Montreal, Canada). In each case the lowest energy loop conformer was selected. In the absence of experimental coordinates, this approach provides an energetically favorable N-terminal loop conformer that is based on experimental 3D-structures from the Protein Data Bank. Subsequently, all experimentally refined non-protein atoms (i.e., ions, waters, carbohydrate glycosylation fragments) and all amino acid substituents were removed, leaving only Cartesian coordinates of the protein backbone atoms.

Polymer chains representing GAGs with specific sequences (as described above) were attached to a linker trisaccharide and onto previously identified Ser residues (within the terminal loop regions) to form the final modeled sequence: [β(1→4)Uβ(1→3)N]n-β(1→4)Uβ(1→3)Galβ(1→3)Gal-β(1→4)-Xyl-Ser. Parameters for the CG model of the linker were derived from a fine grained simulation of the trisaccharide (see above) and the parameters for the CG model of the GAG portion were as described above. The glycoprotein linkages between Xyl and Ser were fixed using the following torsional angles (C-Cα-Cβ-OG: ±60°, Cα-Cβ-OG-C1: +100°, Cβ-OG-C1-O5: −60°), which are consistent with previous NMR measurements37 and a 1 μs fine grained simulation of bikunin with an attached linker trisaccharide. The PG core protein was also kept in a fixed geometry while 10 μs CG simulations were performed on the carbohydrate chains. Radii of gyration (Rg) calculations on PGs were performed without adding substituents to the GAG pyranose rings or sidechains to the PG core protein amino acids. Additional post-translational modifications beyond the GAG chains, such as N-linked glycans, were assumed to be a minor contributor to Rg and were not considered in CG simulations for practical reasons. Conversion of calculated Rg to Rh was achieved using the formula Rg = 0.77Rh, for a constant density sphere.

RESULTS AND DISCUSSION

Compositional Analyses and Nomenclature

The sulfate, d-glucuronic acid (GlcA) and l-iduronic acid (IdoA) content of shark cartilage and bovine trachea chondroitin sulfate (CS) and porcine intestinal dermatan sulfate (DS) were assayed using 13C NMR spectra, Figure S1. The GAG compositional nomenclature in this work follows prior descriptions,38 wherein CS β(1→3) linked disaccharide units comprising unsulfonated, 4-O-sulfo and 6-O-sulfo N-acetyl-d-galactosamine (GalNAc) are designated O, A and C, respectively. In DS these are prefixed with “i” to indicate the presence of IdoA (cf. GlcA in CS). For shark CS, the proportions of O-, A- and C-units were estimated to be 20%, 20% and 60%, respectively. Similarly, in bovine CS, NMR analyses indicated approximate levels of 20%, 60% and 20% for these units. Porcine DS was estimated to contain a mix of GalNAc 4-O-sulfo (A/iA) and 6-O-sulfo (C/iC) units at 80% and 20%, respectively; the GlcA:IdoA ratio was approximated at 1:4. It was concluded that neither 2-O-sulfo IdoA nor 4,6-O-sulfo GalNAc were present in substantial quantity (greater than 10%) in shark, bovine or porcine GAG samples. These findings concur with previous chromatographic studies of similar but not identical biological materials39 and for the purposes of this work shark (CS), bovine (CS) and porcine (DS) materials were deemed to be repeating blocks of [A][C]3[O], [C][A]3[O] and [A][iA]4, respectively. Medical grade hyaluronan (HA), with N-acetyl-d-glucosamine (GlcNAc), was assumed to be a homogeneous (100%) repeating disaccharide, referred to here (unconventionally) as the H-unit.

The Dynamics of Subunits from Heterogeneous GAGs

Decasaccharides comprising five repeats of subunits from HA, CS and DS (H, O, A, C, iO, iA and iC) were subjected to unbiased 10 μs explicit molecular dynamics simulations including all atoms (GAG, water solvent and counterions), referred to here as fine-grained. For each of the simulated molecules the distance r between the two endmost glycosidic oxygen atoms (across the internal four disaccharides and thereby excluding possible end effects40) was computed. Based on time series plots of r (Figure 2), each decasaccharide had a characteristic dynamic range and frequency of motion. In all molecules, transitions occurred between numerous conformations, but those containing iduronic acid were observed to spend more time in extended geometries than their glucuronic acid equivalents (predominant shapes are illustrated in Figure 2). Further inspection and analysis revealed extremes of maximally extended chains and compact turns with exchange being correlated to μs timescale linkage and ring flexing (exemplified in Figure 2B), as found previously in amylose.34 It was confirmed that in every simulation, arithmetic mean r-values for 1 μs subsets had a standard deviation of less than 0.1 nm from the mean for the entire 10 μs dataset, substantiating that motions in the model GAG decasaccharides had approached equilibrium.

In the H-unit decasaccharide simulation 67% of conformers were 2- to 4-fold left-handed helices and 81% had an axial rise per disaccharide (h) of >0.82 nm. Prior X-ray fiber diffraction observations for HA include 2-, 3- and 4-fold left-handed helices with h in the range 0.84-0.98 nm.41 In the A-unit simulation, 43% of conformers were 2- to 3-fold left-handed helices, with 42% having h>0.94 nm. Experimentally, chondroitin-4-sulfate has been observed as 2- to 3-fold left-handed helices with h in the range 0.96-0.98 nm.42 The C-unit decasaccharide simulation contained 65% of conformers between right-handed 8-fold helices and left-handed 3-fold helices, with 52% having h>0.94 nm, consistent with previously described chondroitin-6-sulfate extended left-handed 3-fold and right-handed 8-fold helices with h in the range 0.96-0.98 nm.43 The iA-unit decasaccharide simulation had 58% of conformers with h in the range 0.90-0.96 nm, 44% were between right-handed 8-fold helices and left-handed 3-fold helices. Dermatan-4-sulfate has been observed to form 2-, 3- and 8-fold helices with h in the range 0.92-0.94 nm.44 Considering that these prior experiments were performed in the solid state, and hence only a fraction of solution phase conformers could satisfy the packing constraints, it was concluded that the predicted conformer populations had a good overlap with published X-ray fiber diffraction refinements.

Computed glycosidic linkage free energy surfaces (Figure S2) showed all β(1→3) linkages to be similar and (φ,ψ) for the major minima were centered on ≈(−60°,−120°), with minor conformers at ≈(−60°,+60°) and ≈(+60°,−120°), Figure 3A-F. A slight difference in these was the relatively restricted range of the β(1→3) ψ torsion in DS decasaccharides. Furthermore, the illustrated free energy surfaces were derived from central linkages, and it should be noted that other minor differences were observed within each chain and at their ends. Constituent β(1→4) linkages, Figure 3G-L, were almost indistinguishable and (Φ,ψ) for the major minima were centered on ≈(−60°,120°), with minor populations at (−120°,−60°) and (+60°,+120°); the latter was not sampled in decasaccharides of A- or C-units. Prior NMR studies of homogeneous unsulfonated oligosaccharides have refined the H-unit β(1→3) and β(1→4) linkages to be (−68°,−111°) and (−71°,124°), respectively.45 Analogous refinements for the O-unit resulted in (−72°,−132°) and (−73°,122°), which was concluded to be virtually identical to the A-unit.46 Taken together, the computed linkage free energy surfaces for each of the decasaccharides are predicted to have surprising similarities, irrespective of their sulfate, hexosamine and uronic acid composition, and are consistent with available solution phase experimental data.

Figure 3.

Figure 3

Glycosidic linkage and pyranose ring motions in 10 μs unbiased fine-grained simulations of model homogeneous chondroitin sulfate and dermatan sulfate decasaccharide subunits. Two-dimensional (ϕ,ψ) surfaces of free energy for (A-F) the internal β(1→3) linkage 5 and (G-L) the adjacent β(1→4) linkage 6 (counting from the reducing end). (M, N) Puckering free energy graphs for uronic acid ring 6 (counting from the reducing end), with ring conformational space labeled as: C, chair, E, envelope; HC, half-chair; B, boat; SB, skew-boat.

A Molecular Mechanism for Encoding GAG Shape via Sequence

Puckering occurred in all decasaccharide pyranose rings and converged to equilibria over 2-5 μs (Table S3), consistent with previous simulations.19 Based on an average of the four internal hexosamines in each model, all were principally 4C1 chairs (>97%) and motions were invariant of C4-position stereochemistry (GlcNAc or GalNAc) and O-sulfonation (at the C4- or C6-position). While all GlcA pyranoses were predominantly 4C1-chairs, they also underwent 4C11C4 exchange (e.g., Figure 3M). This behavior is qualitatively similar to prior experimentally-consistent GlcA monosaccharide simulations,47 but with average 4C1 populations of: 64% (H-unit), 65% (O-unit), 95% (A-unit) and 75% (C-unit). The predicted ring conformational equilibria in the H- and O-unit decasaccharides were consistent with prior NMR measurements of pyranose 1H-1H vicinal couplings (3JH,H), Table S4.46, 48

The DS decasaccharide IdoA puckers were predicted to be predominantly 1C4-chairs (e.g., Figure 3N) with populations of: 82% (iO-unit), 90% (iA-unit) and 56% (iC-unit). All IdoA rings underwent 4C11C4 exchange and the mix of IdoA ring conformers produced a qualitative agreement with measured low 3JH,H values (<5 Hz) for iA regions of porcine DS.49 The deviation between predicted and experimentally measured 3JH,H values were as described and discussed previously.47 Although 2SO was the predominant non-chair conformer, it was predicted to be a minor ensemble component (<7%) in the iO-, iA- and iC-unit decasaccharides. This skew-boat is reportedly key for protein binding50, 51 but has been difficult to observe in free crystals52 and was infrequently populated in prior IdoA monosaccharide simulations (<10%).47

The fine-grained simulations performed here represent the most extensive for GAG oligosaccharides to date and suggest that CS and DS uronic acid ring puckering equilibria are influenced across β(1→3) glycosidic linkages by neighboring hexosamine sulfate groups. In particular, GalNAc 4-O-sulfonation biases GlcA and IdoA towards 4C1 and 1C4, respectively. To investigate, conformers with both uronic acid chair puckers were extracted from A- and iA-unit decasaccharide simulations. Inspection of the A-unit at the chain center suggested that repulsion of the GalNAc 4-O-sulfate and GlcA carboxyl groups disfavored the 1C4 chair, which was not present in the favored 4C1 pucker (Figure 4A). For the iA-unit, the IdoA 1C4 pucker was stabilized by interaction of its C3-position hydroxyl substituent with the adjacent GalNAc 4-O-sulfo group (Figure 4B). This allowed water to be excluded from the IdoA ring face, which was not the case when IdoA was in the less favored 4C1 chair conformation. It is hypothesized that within the fundamental β(1→3)-linked GAG unit, networks of through-space interactions between uronic acid, adjacent hexosamine sulfate groups and water establish characteristic uronic acid puckering equilibria that contribute to polymer hydrodynamic properties. This represents a mechanism for encoding shape via sequence and may be a key factor in both specific GAG-protein interactions and PG organization.

Figure 4.

Figure 4

Representative conformers with the most and least favored uronic acid chair puckers from 10 μs unbiased fine-grained simulations of homogeneous decasaccharides comprising: A-units (A) and iA-units (B). Geometries of disaccharides from the chain centers are illustrated, with putative molecular mechanisms for conformational preferences highlighted: intermolecular repulsion (red spheres), attraction (dashed line) and interactions with water solvent (blue spheres). Ring substituents are hidden for clarity, except at the C3-position in the iA-unit IdoA 1C4 conformer (B), highlighting a predicted through-space interaction with the adjacent hexosamine sulfate group.

Validating Simulations of Homogeneous and Heterogeneous GAG Polymers

Computed glycosidic linkage and pyranose ring free energy surfaces were used to parameterize a coarse-grained (CG) model, described previously.22 For validation purposes, the decasaccharides studied above were resimulated for 10 μs using the CG model. Comparing CG with reference fine-grained simulations, equivalent calculated mean r values (see above) were all within 0.3 nm and linkage and ring motions were faithfully reproduced (Figure 5).

Figure 5.

Figure 5

Homogeneous decasaccharide dynamics from fine-grained simulations (left, black) were faithfully reproduced by the coarse-grained model (right, grey) in equivalent 10 μs simulations. The dynamic range of the model A-unit β(1→3) and β(1→4) glycosidic linkages (numbers 3 and 4 counting from the reducing end) and iA-unit GalNAc and IdoA pyranose rings (numbers 5 and 6 counting from the reducing end) are shown, using glycosidic (ϕ,ψ) and ring puckering (θ,ϕ) parameters, respectively.

Radius of gyration (Rg) and molecular mass (M) were measured simultaneously, using multi-angle laser light scattering (MALLS), on a sample of medical grade HA. Model polymers in the range [H]100 to [H]400 (reflecting the range of measured molecular weights) were simulated for 10 μs using the CG method. Experimental data and simulated mean Rg are graphed in Figure 6A (see “CG all”). Within these simulations unrealistic atomic overlap occurred due to the absence of steric interactions in the CG model.22 Correcting for this by removing those computed structures containing steric clashes and recalculating Rg achieved excellent agreement with experiment (see Figure 6A “CG no clash”). Finally, recalculation of Rg (after removing steric clashes) from identical CG simulations but with all constituent monosaccharides fixed in rigid 4C1 chair puckers resulted in predictions that disagreed with experimental measurements (see Figure 6A “CG no clash, rigid rings”).

Figure 6.

Figure 6

Accurate, rapid prediction of hydrodynamic size for polymers of: (A) hyaluronan, (B) chondroitin sulfates and (C) dermatan sulfate from biological sources. Predictions from 10 μs coarse-grained (CG) polymer simulations (open symbols) and experimental radius of gyration (Rg) or hydrodynamic size (Rh) are plotted against molecular mass (M) on logarithmic axes. Note: n is the number of disaccharides in the model polymers and analyzed experimental data points are shown as dots in (A), while in (B-C) the experimental data are depicted as lines.

In the cases of CS and DS materials from shark, bovine and porcine sources, the small polymer sizes (≈10 nm) precluded reliable Rg measurements using MALLS. Therefore, hydrodynamic radii (Rh) at defined M were derived from quasi-elastic light scattering and MALLS (Figure 6B-C and Figure S5). Model CG polymers of these materials were built from blocks of five disaccharides (with compositions reflecting NMR analyses reported above) and simulations were performed within the experimentally observed mass ranges (again following removal of steric clashes). Simulated Rg values for representative shark and bovine CS polymers (in the range 50 to 200 disaccharides) concurred with the measurements after conversion to equivalent Rh (Figure 6B). Here, the experimental trend for shark CS to have slightly larger Rh than bovine CS at equivalent mass was reproduced in the CG simulations. For model porcine DS polymers of 35-55 disaccharides in length, simulated Rg values (converted to Rh) were also close to measured equivalents (Figure 6C). Marginal deviations in experimental and calculated data for sulfonated polymers may be attributed to sequence differences in the biological materials and models, the assumed refractive index increment value (determined from bovine nasal cartilage PG fractions28) or the conformationally-dependent Rg to Rh conversion factor.

This validation demonstrates the applicability of the CG approach to all human GAGs except keratan sulfate (the model’s predictive ability for heparan sulfate was shown previously22). It also reiterates that inclusion of both glycosidic linkage and pyranose ring μs timescale motions are necessary to produce accurate GAG polymer models. Furthermore, the CG method enables rapid and accurate predictions using standard computer hardware; for example, these 10 μs simulations for polymers of 100 disaccharides in length were each performed in less than one hour.

Effects of Combining and Permuting GAG Chain Units

The CG method is particularly suited to studying the relationship between chain dynamics and GAG sequence. To effectuate a study of this kind, polymer chains with 100 disaccharide units were constructed by repeating blocks of either 5 or 10 units (idiosyncratic patterns were chosen to explore the effect of both hexosamine sulfonation and uronic acid type). Performing 10 simulations of [A]100 resulted in a calculated Rg standard deviation of 0.03 nm, confirming the utility of mean Rg as a sensitive indicator of chain conformation. Chains were simulated for 10 μs using the CG approach; steric clashes occurred infrequently at this degree of polymerization and removal was not necessary for accurate predictions. Computed Rg values for each polymer are shown in Figure 7.

Figure 7.

Figure 7

Prediction of the effect of sequence and domain organization on radius of gyration (Rg) in model homogeneous and heterogeneous polymers of 100 disaccharide units in length: (A) chondroitin sulfate polymers, (B) dermatan sulfate polymers, (C) hybrid chondroitin/dermatan sulfate polymers, and (D) models of biological materials studied experimentally here. All Rg values were derived from 10 μs coarse-grained simulations.

Of any CS sequence considered, chains with [A]100 and [C]100 resulted in the smallest and largest Rg values, respectively (Figure 7A). Successive substitution of A- with C-units (or vice versa) resulted in a systematic increase (or decrease) in polymer size. Chains with only O-units were significantly longer than those with only A-units but shorter than polymers comprising only C-units. Thus, inclusion of O-units increased the average hydrodynamic size of A-unit polymers and decreased that of C-unit polymers. In model DS chains, equivalent sulfonation patterns (to CS) resulted in consistently larger Rg values (Figure 7B), a trend that reflected observations from fine-grained simulations (Figure 2). Here, the 4-O-sulfo [iA]100 chain had a smaller calculated Rg than either the [iO]100 or [iC]100 polymers. In contrast to CS, successive inclusion of iO- into iC-unit polymers marginally increased calculated radii. Polymers comprising exclusively IdoA-units had a relatively restricted range of Rg values; all were within 1.4 nm of each other (cf. 2.0 nm in GlcA-unit polymers). Progressive addition of A-units into an iA-based polymer decreased the overall size, and maintaining the A:iA stoichiometry while increasing the iA-block size had a negligible effect on calculated hydrodynamic radii (Figure 7C). Therefore, it was hypothesized that permutation of units (while maintaining the combination) has little effect on Rg and the combinations of uronic acid type and hexosamine sulfonation contribute to the final size of Rg in the following order (smallest to largest): A<O<C<iA<iO<iC. This suggests that combining these units in heterogeneous polymers provides a mechanism for tuning the hydrodynamic properties of GAG chains.

Block Copolymer Microarchitectures as a Basis for Multivalency

Supporting these hypotheses, permutations of a sequence combination mimicking shark CS were predicted to have identical Rg values, and the same was true for bovine CS (Figure 7D). Shark CS polymers were predicted to be slightly larger than their bovine equivalents, but the porcine DS models were, as expected, the most extended model biological sequences considered. As a consequence of this it would be expected that PGs comprising DS would have a larger hydrodynamic domain than their CS equivalents. However, in DS block copolymers overall polymer Rg did not provide the full picture. For example, inspection of conformational snapshots from a simulation of the {[A]4[iA]6}10 polymer revealed flexibility in the GlcA-containing regions, resulting in tight turns reminiscent of protein beta hairpins, and comparative rigidity in the IdoA-containing domains (Figure 8). These predicted microarchitectures could be a molecular basis for multivalent interactions with proteins, such as fibroblast growth factors.7, 53 The differential dynamics of these copolymeric sequences can thus be expected to have significance in signal transduction.

Figure 8.

Figure 8

Three representative conformers of a heterogeneous dermatan sulfate polymer with the sequence {[A]4[iA]6}10 (a 2D projection of 3D data). Constituent A- and iA-units are blue and sand colored, respectively. Emergent rigid, flexible and turn conformational features in the 10 μs coarse-grained simulation are labeled. These generate microarchitectures hypothesized to enable multivalent interactions with fibroblast growth factors; structures of FGF-2, FGF-7 and FGF-10 (Protein Data Bank entries: 1FQ9, 1QQK and 1NUN) are shown to scale and colored cyan.

Given the documented ability of IdoA to be an equilibrium of chair conformations, prior work had logically assumed IdoA to impart GAG chain flexibility and GlcA to confer rigidity.24 Our current studies, based on 10 μs fine- and coarse-grained simulations, have come to an alternative set of conclusions: (a) pyranose rings of both IdoA and GlcA are flexible, (b) their conformational equilibria are driven by neighboring hexosamine 4-O-sulfonation towards either 1C4 or 4C1, (c) equivalent chains with IdoA are more extended than those with GlcA (irrespective of sulfonation), and (d) IdoA does not substantially populate non-chair boat or skew-boat puckers.

Accurate Proteoglycan Models: Bikunin and Decorin

In a 10 μs fine-grained simulation of the GAG-PG interface trisaccharide, Galβ(1→3)Gal-β(1→4)-Xyl, both galactose (Gal) pyranose rings were predicted to be stable 4C1 puckers (>99%), while xylose (Xyl) was an equilibrium comprising mostly 4C1 chairs (70%) with a minor 1C4 component (26%). The Gal-Gal and Gal-Xyl glycosidic linkages were centered on (Φ,ψ)≈(−60°,−120°) and ≈(−60°,+120°), respectively, which are consistent with prior NMR37 and a 1 μs fine-grained simulation of bikunin glycosylated with the GAG-PG trisaccharide. Comparing linkage and ring dynamics in these two simulations (Figure S6) suggested that motions in this region do not differ substantially when free in water and when linked to the core protein. Models of PGs were constructed by parametrizing a fully dynamic CG model of the linker from these fine grained simulations and elaborating with specific GAG chain sequences; these polymers were then attached to previously identified PG core protein serine (Ser) residues. The PG core proteins were kept rigid while the dynamics of GAG chains were predicted in 10 μs CG simulations. Conformers that contained steric clashes were removed from resulting ensembles prior to calculation of Rg (as above).

A human bikunin PG model was built from an X-ray refinement of its core protein, the linker (attached to Ser-10) and a single CS chain. Several human bikunin GAG glycoforms have been sequenced, with 27-43 monosaccharides.5 Initially, an intermediate length chain (33 monosaccharides) was modeled: GalNAc-[O]7[A]2[O][A][O][A]2-GlcA-. In this simulation the GAG folded and transiently enveloped the core protein. The distance between the core protein and terminal monosaccharide geometric centers was on average 9.1 nm and thus significantly larger than the protein core (Figure 9A). This distance could range between 2.0 nm and 14.8 nm, highlighting the extent of sampled GAG conformations. Representative 3D structures from this human bikunin simulation are illustrated in Figure 9B and an Rg of 3.4 nm was predicted (repeated calculations of Rg had a standard deviation below the number of significant figures reported, see Table S7). Furthermore, performing this simulation with an alternative extended N-terminal loop conformer, derived from a 1 μs fine-grained simulation of the bikunin core protein (see Figure S8), resulted in an indistinguishable Rg value, suggesting that the N-terminal loop conformation has very little effect on intact PG hydrodynamic properties. Prefixing the sequence described above (at its non-reducing end) with [O]5 resulted in the longest CS chain identified previously.5 In an ancillary simulation of this glycoform, the computed Rg increased by almost a quarter, to a value of 4.1 nm. For comparison, an experimental Rh value for rat bikunin (thought to have similar hydrodynamic properties to human bikunin) was measured previously at 3.5 ± 0.1 nm.54 This is within the predicted range of Rg values for the two human bikunin glycoforms considered here.

Figure 9.

Figure 9

(A) An average conformer of human bikunin derived from a 10 μs coarse-grained simulation. The model comprises a heterogeneous chondroitin sulfate sequence of intermediate length and constituent O- and A-units are blue and sand colored, respectively. The mean chain extension from the core protein geometric center is depicted as a sphere (see Supporting Information for the model coordinates). (B) The 15 representative conformations from the same simulation illustrate the dynamic range of chondroitin sulfate conformational sampling within human bikunin.

Crystallographic data for the bovine skin decorin core protein (for modeling purposes assumed here to be a homomeric dimer) was used as the basis for a second PG model. In the absence of sequence data for bovine decorin GAG chains, they were initially based on recent sequence analysis of porcine decorin GAGs55 and the average molecular weight of bovine decorin GAG chains (≈24 kDa).56 A linker and a single DS chain with the sequence {[A][iA]4}10-GlcA- was appended to each subunit (at Ser-5). The calculated Rg for this dynamic PG model was 8.2 nm. Bovine decorin can be assumed to exhibit GAG glycoform variability, and thus two further simulations were performed with variations in the IdoA content and chain length. Increasing the IdoA block length in both GAG chains with the sequence {[A][iA]5}9-GlcA- (≈25 kDa) resulted in an Rg of 8.7 nm. A further PG simulation where both chains were extended with the sequence {[A][iA]4}12-GlcA- (≈27 kDa) resulted in a concomitant increase in Rg to 9.4 nm. These predictions are consistent with an experimentally measured Rg of 9.3 ± 0.1 for intact bovine decorin,56 underscoring the CG method’s ability to reproduce hydrodynamic data for intact PGs. The predictions also provide evidence for a dependence of PG size on the length and sequence combinations of the attendant GAGs.

In the absence of an X-ray diffraction refinement for human decorin core protein, the homologous bovine core protein was again used as a start point for building a human decorin PG model. Recent sequencing work has resulted in a human decorin GAG glycoform map, comprising multiple domains of variable length, sulfonation and IdoA content.6 Based on this, a presumed consensus sequence was constructed: [C]6[A]8[O]6-[iA]4[iC]3[iO]3-[A]3[C]4[O]3-[iC][iA][iO]-[O]3[iO][O]4-GlcA-. Two of these chains, one per subunit, were appended to the bovine decorin PG model at Ser-5 (a homomeric dimer was assumed). Representative conformations from the subsequent CG simulation are illustrated in Figure 10A. The predicted overall Rg value was 8.0 nm and thus, compared to a bovine skin decorin model, the hydrodynamic size of human skin decorin is predicted to be smaller, in part due to reduced IdoA composition. The dynamics of the two GAG chains featured bending and wrapping motions, with the average distance between the [iA]4 regions of each chain predicted to be 23 nm. As alluded to above, multivalent interactions are theorized to be possible between a protein and a single GAG chain (see Figure 8). In this PG model, [iA]4 regions of separate chains approached with a proximity of less than 5 nm, the approximate size of an FGF protein, confirming the feasibility of increased multivalent binding in a PG system with two GAG chains (Figure 10B).

Figure 10.

Figure 10

(A) Representative conformers derived from a 10 μs coarse-grained simulation of decorin with a single consensus human dermatan sulfate polymer attached to each core protein subunit (see text for the sequence). The 30 conformations reveal the dynamic range of polymer motion and their significant contribution to PG size; mean chain extension from the protein geometric center is depicted as a sphere of radius 21 nm. (B) Conformers of decorin and biglycan (two chains per subunit) with consensus human dermatan sulfate polymers (see text for sequences and Supporting Information for model coordinates). In decorin, GlcA and IdoA-containing blocks are blue and sand colored, respectively. A schematic collagen fibril with D-spacing of ≈70 nm and growth factor proteins (Protein Data Bank entries: 1FQ9, 1QQK and 1NUN) are drawn to scale, these (colored cyan) are positioned to illustrate the potential for multivalent binding.

Substantiating Functional Roles for GAG Chain Length and Density: Biglycan

Two human biglycan models were constructed using crystallographic data for the bovine core protein (again assumed to be a homomeric dimer). In each, both subunits were appended (via linkers at Ser-4 and Ser-11) with two GAG chains containing an average number of units found in human mitral valve tissue associated with either tensile ({[A][iA]4}12-GlcA-) or compressive ({[A][C]2[iA]2}17-GlcA-) regions.57 In these dynamic models, the respective predicted Rg values were 13.0 nm and 15.4 nm. The larger hydrodynamic radii of PGs in compressive tissue, as reproduced here, are thought to be important in resisting stress.57 Based on the biglycan simulations, it is simply concluded that the GAG chains with longer sequences found in compressive tissue are primarily responsible for the enhanced hydrodynamic radii. While the exact sequences have some effect, this is predicted to be a secondary factor. Indeed, the simulations imply that at equivalent GAG length the tension associated sequence, with a higher IdoA content, would be larger in size than the sequence found in compressive mitral valve tissue. If so, the specific sequences of IdoA and sulfonation are likely to be responsible for other functions such as protein recruitment. In the tensile tissue biglycan model, the average approach distances of the [iA]4 blocks at the chain centers was 16 nm, significantly closer than in the human decorin model (23 nm), implying that the synergistic effect of four GAG chains increased the possibility of multivalent interactions with other proteins (Figure 10B), as recently evidenced.8 These simulations also highlight the vast contribution of GAG chains to PG size. The computed Rg for the homodimeric biglycan core protein alone was around 3 nm, much less than the 13-15 nm found here for fully glycoconjugated biglycan dimer models (with a total mass of ≈200 kDa). Moreover, based on standard bovine serum albumin, a folded globular protein would require a mass of approximately 3 MDa to achieve this physical size.

CONCLUSIONS

It has been shown that accurate spatiotemporal PG models, including complex heterogeneous GAG sequences, can be constructed by abstracting glycosidic linkage and pyranose ring atomic motions from multi-μs fine-grained simulations. The current CG method implementation was successful for the cases of small PGs and isolated GAGs, resulting in predictions that supersede schematic “bottle brush” PG representations. These atomic scale dynamic perspectives are essential to uncover how PGs collectively perform constitutive signal transduction and mechanical functions, and extension to GAG sequences not considered here should be straightforward.

The described CG technique can address PG assemblies comprising millions of atoms, which remain beyond the capability of even hardware accelerated fine-grained modeling. Therefore, emerging GAG sequence and core protein 3D data, in concert with methodological developments enabling molecular interactions, will allow larger cell-surface and aggregating PGs, plus assemblies thereof, to be studied routinely at the atomic scale. In time, the approach could inform how GAG synthetic enzyme deficiencies lead to connective tissue diseases,58 provide mode-of-action insights for biopharmaceutical PGs and facilitate the development of new biologically inspired medicines and materials.

Supplementary Material

SupportingInformation

Acknowledgments

Funding Sources

Financial support was provided by the UK Biotechnology and Biological Sciences Research Council (Grant BB/J00040X/1).

ABBREVIATIONS

CS

chondroitin sulfate

DS

dermatan sulfate

DSS

4,4-dimethyl-4-silapentane-1-sulfonic acid

CG

coarse grained

GAG

glycosaminoglycan

HA

hyaluronan

MALLS

multi-angle laser light scattering

NMR

nuclear magnetic resonance spectroscopy

PG

proteoglycan

Footnotes

ASSOCIATED CONTENT

Supporting Information. The following supporting information is available: (S1) 800 MHz NMR spectra for glycosaminoglycan polymers from biological sources, (S2) decasaccharide linkage dynamics in fine- and coarse-grained 10 μs simulations, (S3) decasaccharide ring dynamics in fine- and coarse-grained 10 μs simulations, (S4) pyranose ring 1H-1H three-bond vicinal spin-couplings, (S5) light scattering data for glycosaminoglycan polymers from biological sources, (S6) protein-GAG linker dynamics during free and protein-attached μs fine-grained simulations, (S7) statistical analyses of radii of gyration from coarse-grained 10 μs simulations, (S8) representative bikunin conformers from a 1 μs fine-grained simulation, (S9) ring puckering definitions and (S10) decasaccharide r values from fine- and coarse-grained 10 μs simulations. Three-dimensional models from 10 μs coarse-grained simulations of bikunin, decorin and biglycan (as shown in Figures 9A and 10B) are also provided. This material is available free of charge via the Internet at http://pubs.acs.org.

REFERENCES

  • 1.Ariga T, Miyatake T, Yu RK. J Neurosci Res. 2010;88:2303–15. doi: 10.1002/jnr.22393. [DOI] [PubMed] [Google Scholar]
  • 2.Li L, Ly M, Linhardt RJ. Mol Biosyst. 2012;8:1613–25. doi: 10.1039/c2mb25021g. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kim SH, Turnbull J, Guimond S. J Endocrinol. 2011;209:139–51. doi: 10.1530/JOE-10-0377. [DOI] [PubMed] [Google Scholar]
  • 4.Sugahara K, Mikami T, Uyama T, Mizuguchi S, Nomura K, Kitagawa H. Curr Opin Struct Biol. 2003;13:612–20. doi: 10.1016/j.sbi.2003.09.011. [DOI] [PubMed] [Google Scholar]
  • 5.Ly M, Leach FE, Laremore TN, Toida T, Amster IJ, Linhardt RJ. Nat Chem Biol. 2011;7:827–833. doi: 10.1038/nchembio.673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Laremore TN, Ly M, Zhang Z, Solakyildirim K, McCallum SA, Owens RT, Linhardt RJ. Biochem J. 2010;431:199–205. doi: 10.1042/BJ20100788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Radek KA, Taylor KR, Gallo RL. Wound Repair Regen. 2009;17:118–26. doi: 10.1111/j.1524-475X.2008.00449.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nguyen TK, Tran VM, Sorna V, Eriksson I, Kojima A, Koketsu M, Loganathan D, Kjellen L, Dorsky RI, Chien CB, Kuberan B. ACS Chem Biol. 2013;8:939–48. doi: 10.1021/cb400132r. [DOI] [PubMed] [Google Scholar]
  • 9.Fransson LA, Coster L, Havasmark B, Malmstrom A, Sjoberg I. Biochem J. 1974;143:379–89. doi: 10.1042/bj1430379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Seidler DG, Faiyaz-Ul-Haque M, Hansen U, Yip GW, Zaidi SH, Teebi AS, Kiesel L, Gotte M. J Mol Med. 2006;84:583–94. doi: 10.1007/s00109-006-0046-4. [DOI] [PubMed] [Google Scholar]
  • 11.Buckwalter JA, Rosenberg LC. J Biol Chem. 1982;257:9830–9. [PubMed] [Google Scholar]
  • 12.Watanabe Y, Inoko Y. J Chromatogr A. 2013;1303:100–4. doi: 10.1016/j.chroma.2013.06.048. [DOI] [PubMed] [Google Scholar]
  • 13.Xu Y, Carr PD, Guss JM, Ollis DL. J. Mol. Biol. 1998;276:955–66. doi: 10.1006/jmbi.1997.1582. [DOI] [PubMed] [Google Scholar]
  • 14.Scott PG, Dodd CM, Bergmann EM, Sheehan JK, Bishop PN. J Biol Chem. 2006;281:13324–32. doi: 10.1074/jbc.M513470200. [DOI] [PubMed] [Google Scholar]
  • 15.Scott PG, McEwan PA, Dodd CM, Bergmann EM, Bishop PN, Bella J. Proc Natl Acad Sci U S A. 2004;101:15633–8. doi: 10.1073/pnas.0402976101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Cael JJ, Winter WT, Arnott S. J. Mol. Biol. 1978;125:21–42. doi: 10.1016/0022-2836(78)90252-8. [DOI] [PubMed] [Google Scholar]
  • 17.Schlick T. F1000 Biol. Rep. 2009;1:51. doi: 10.3410/B1-51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shaw DE, Maragakis P, Lindorff-Larsen K, Piana S, Dror RO, Eastwood MP, Bank JA, Jumper JM, Salmon JK, Shan Y, Wriggers W. Science. 2010;330:341–6. doi: 10.1126/science.1187409. [DOI] [PubMed] [Google Scholar]
  • 19.Sattelle BM, Almond A. Carbohydr. Res. 2014;383:34–42. doi: 10.1016/j.carres.2013.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lopez CA, Rzepiela AJ, de Vries AH, Dijkhuizen L, Hunenberger PH, Marrink SJ. J Chem Theory Comput. 2009;5:3195–3210. doi: 10.1021/ct900313w. [DOI] [PubMed] [Google Scholar]
  • 21.Ingolfsson HI, Lopez CA, Uusitalo JJ, de Jong DH, Gopal SM, Periole X, Marrink SJ. WIREs Comput Mol Sci. 2014;4:225–248. doi: 10.1002/wcms.1169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Sattelle BM, Shakeri J, Almond A. Biomacromolecules. 2013;14:1149–59. doi: 10.1021/bm400067g. [DOI] [PubMed] [Google Scholar]
  • 23.Bossennec V, Petitou M, Perly B. Biochem J. 1990;267:625–30. doi: 10.1042/bj2670625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Ferro DR, Provasoli A, Ragazzi M, Casu B, Torri G, Bossennec V, Perly B, Sinay P, Petitou M, Choay J. Carbohydr Res. 1990;195:157–67. doi: 10.1016/0008-6215(90)84164-p. [DOI] [PubMed] [Google Scholar]
  • 25.Silipo A, Zhang Z, Canada FJ, Molinaro A, Linhardt RJ, Jimenez-Barbero J. Chembiochem. 2008;9:240–52. doi: 10.1002/cbic.200700400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Yamada S, Yamane Y, Sakamoto K, Tsuda H, Sugahara K. Eur J Biochem. 1998;258:775–83. doi: 10.1046/j.1432-1327.1998.2580775.x. [DOI] [PubMed] [Google Scholar]
  • 27.Yang HO, Gunay NS, Toida T, Kuberan B, Yu GL, Kim YS, Linhardt RJ. Glycobiology. 2000;10:1033–1040. doi: 10.1093/glycob/10.10.1033. [DOI] [PubMed] [Google Scholar]
  • 28.Kitchen RG, Cleland RL. Biopolymers. 1978;17:759–83. doi: 10.1002/bip.1978.360170316. [DOI] [PubMed] [Google Scholar]
  • 29.Cohen DE, Thurston GM, Chamberlin RA, Benedek GB, Carey MC. Biochemistry. 1998;37:14798–814. doi: 10.1021/bi980182y. [DOI] [PubMed] [Google Scholar]
  • 30.Kirschner KN, Yongye AB, Tschampel SM, Gonzalez-Outeirino J, Daniels CR, Foley BL, Woods RJ. J. Comput. Chem. 2008;29:622–55. doi: 10.1002/jcc.20820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Case DA, Cheatham TE, 3rd, Darden T, Gohlke H, Luo R, Merz KM, Jr., Onufriev A, Simmerling C, Wang B, Woods RJ. J. Comput. Chem. 2005;26:1668–88. doi: 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
  • 33.Harvey MJ, Giupponi G, De Fabritiis G. J. Chem. Theory. Comput. 2009;5:1632–1639. doi: 10.1021/ct9000685. [DOI] [PubMed] [Google Scholar]
  • 34.Sattelle BM, Almond A. Phys Chem Chem Phys. 2014;16:8119–26. doi: 10.1039/c4cp00570h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Almond A, Brass A, Sheehan JK. J. Mol. Biol. 1998;284:1425–37. doi: 10.1006/jmbi.1998.2245. [DOI] [PubMed] [Google Scholar]
  • 36.Duan Y, Wu C, Chowdhury S, Lee MC, Xiong G, Zhang W, Yang R, Cieplak P, Luo R, Lee T, Caldwell J, Wang J, Kollman P. Journal of computational chemistry. 2003;24:1999–2012. doi: 10.1002/jcc.10349. [DOI] [PubMed] [Google Scholar]
  • 37.Agrawal PK, Jacquinet JC, Krishna NR. Glycobiology. 1999;9:669–77. doi: 10.1093/glycob/9.7.669. [DOI] [PubMed] [Google Scholar]
  • 38.Malmstrom A, Bartolini B, Thelin MA, Pacheco B, Maccarana M. J Histochem Cytochem. 2012;60:916–25. doi: 10.1369/0022155412459857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Volpi N. J Pharm Pharmacol. 2009;61:1271–80. doi: 10.1211/jpp/61.10.0002. [DOI] [PubMed] [Google Scholar]
  • 40.Blundell CD, DeAngelis PL, Day AJ, Almond A. Glycobiology. 2004;14:999–1009. doi: 10.1093/glycob/cwh117. [DOI] [PubMed] [Google Scholar]
  • 41.Winter WT, Smith PJ, Arnott S. J. Mol. Biol. 1975;99:219–35. doi: 10.1016/s0022-2836(75)80142-2. [DOI] [PubMed] [Google Scholar]
  • 42.Millane RP, Mitra AK, Arnott S. J. Mol. Biol. 1983;169:903–20. doi: 10.1016/s0022-2836(83)80142-9. [DOI] [PubMed] [Google Scholar]
  • 43.Arnott S, Guss JM, Hukins DW, Matthews MB. Biochem Biophys Res Commun. 1973;54:1377–83. doi: 10.1016/0006-291x(73)91139-x. [DOI] [PubMed] [Google Scholar]
  • 44.Mitra AK, Arnott S, Atkins ED, Isaac DH. J. Mol. Biol. 1983;169:873–901. doi: 10.1016/s0022-2836(83)80141-7. [DOI] [PubMed] [Google Scholar]
  • 45.Almond A, Deangelis PL, Blundell CD. J Mol Biol. 2006;358:1256–69. doi: 10.1016/j.jmb.2006.02.077. [DOI] [PubMed] [Google Scholar]
  • 46.Sattelle BM, Shakeri J, Roberts IS, Almond A. Carbohyd Res. 2010;345:291–302. doi: 10.1016/j.carres.2009.11.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Sattelle BM, Hansen SU, Gardiner J, Almond A. J. Am. Chem. Soc. 2010;132:13132–4. doi: 10.1021/ja1054143. [DOI] [PubMed] [Google Scholar]
  • 48.Blundell CD, Roberts IS, Sheehan JK, Almond A. J Mol Microbiol Biotechnol. 2009;17:71–82. doi: 10.1159/000215933. [DOI] [PubMed] [Google Scholar]
  • 49.Gatti G, Casu B, Torri G, Vercellotti JR. Carbohyd Res. 1979;68:C3–C7. [Google Scholar]
  • 50.Pellegrini L, Burke DF, von Delft F, Mulloy B, Blundell TL. Nature. 2000;407:1029–34. doi: 10.1038/35039551. [DOI] [PubMed] [Google Scholar]
  • 51.Hricovini M, Guerrini M, Bisio A, Torri G, Petitou M, Casu B. Biochem J. 2001;359:265–272. doi: 10.1042/0264-6021:3590265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Ochsenbein P, Bonin M, Schenk-Joss K, El-Hajji M. Angew Chem Int Ed Engl. 2011;50:11637–9. doi: 10.1002/anie.201105172. [DOI] [PubMed] [Google Scholar]
  • 53.Taylor KR, Rudisill JA, Gallo RL. J Biol Chem. 2005;280:5300–6. doi: 10.1074/jbc.M410412200. [DOI] [PubMed] [Google Scholar]
  • 54.Lindstrom KE, Blom A, Johnsson E, Haraldsson B, Fries E. Kidney Int. 1997;51:1053–8. doi: 10.1038/ki.1997.147. [DOI] [PubMed] [Google Scholar]
  • 55.Zhao X, Yang B, Solakyildirim K, Joo EJ, Toida T, Higashi K, Linhardt RJ, Li L. J Biol Chem. 2013;288:9226–37. doi: 10.1074/jbc.M112.437236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Scott PG, Grossmann JG, Dodd CM, Sheehan JK, Bishop PN. J Biol Chem. 2003;278:18353–18359. doi: 10.1074/jbc.M211936200. [DOI] [PubMed] [Google Scholar]
  • 57.Grande-Allen KJ, Calabro A, Gupta V, Wight TN, Hascall VC, Vesely I. Glycobiology. 2004;14:621–33. doi: 10.1093/glycob/cwh076. [DOI] [PubMed] [Google Scholar]
  • 58.Miyake N, Kosho T, Mizumoto S, Furuichi T, Hatamochi A, Nagashima Y, Arai E, Takahashi K, Kawamura R, Wakui K, Takahashi J, Kato H, Yasui H, Ishida T, Ohashi H, Nishimura G, Shiina M, Saitsu H, Tsurusaki Y, Doi H, Fukushima Y, Ikegawa S, Yamada S, Sugahara K, Matsumoto N. Hum Mutat. 2010;31:966–74. doi: 10.1002/humu.21300. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SupportingInformation

RESOURCES