Abstract
While DNA encodes protein structure, glycans provide a complementary layer of information to protein function. As a prime example of the significance of glycans, the ability of the cell surface receptor CD44 to bind its ligand, hyaluronan, is modulated by N-glycosylation. However, the details of this modulation remain unclear. Based on atomistic simulations and NMR, we provide evidence that CD44 has multiple distinct binding sites for hyaluronan, and that N-glycosylation modulates their respective roles. We find that non-glycosylated CD44 favors the canonical sub-micromolar binding site, while glycosylated CD44 binds hyaluronan with an entirely different micromolar binding site. Our findings show (for the first time) how glycosylation can alter receptor affinity by shielding specific regions of the host protein, thereby promoting weaker binding modes. The mechanism revealed in this work emphasizes the importance of glycosylation in protein function and poses a challenge for protein structure determination where glycosylation is usually neglected.
Subject terms: Molecular modelling, NMR spectroscopy, Carbohydrates, Computational chemistry, Glycobiology, Post-translational modifications, Proteins
Introduction
Glycosylation is a fundamental process where proteins are linked to complex oligosaccharides, glycans1. Most of the proteins at the extracellular side of eukaryotic cells contain covalently linked glycans2. Their structural roles include the mediation of interactions with the surrounding environment3, facilitation of correct folding4–6, and involvement in the assembly of membrane proteins7, also by direct interaction with lipids8. Glycans are also known to modulate the binding of ligands with several proteins, e.g., by masking the binding site9–11. Such regulation is relevant, especially in most immune processes, such as activation and homing, guided by regulated remodeling of the glycans12. However, the details of these modulation mechanisms are often poorly understood due to the glycans’ structural flexibility and dynamic nature13,14.
The transmembrane protein called CD44 is a key example of glycoproteins, whose functions are modulated by N-glycosylation9,15–19. Its primary task is to serve as a receptor for a carbohydrate polymer, hyaluronic acid (hyaluronan (HA))20,21. This ligand–protein interaction mediates a variety of physiological processes such as white blood cell homing, healing of injuries, embryonic development, and controlled cell death22. Recently, the CD44–HA interaction has also been utilized in the design of functional biomaterials23. CD44 binds HA exclusively via its lectin-like hyaluronate binding domain (HABD). In the canonical form, CD44 is a 722 residue-long type I transmembrane protein from which HABD comprises the first 150 amino acids (20-169) after the signal peptide24,25. Notably, human CD44-HABD contains five possible N-glycosylation sites (N25, N57, N100, N110, and N120)24 (see Fig. 1) that are known to be occupied by highly branched N-glycans, especially in various cancer cell lines18,19,26. The N-glycans elicit a dual effect on HA binding: while some glycan content favors the recognition of HA, the presence of negatively-charged sialic acids generally interferes or even blocks it16,27,28. However, the molecular mechanisms underlying such a dual effect remain unclear. In fact, most of the currently available structural data of HA–CD44 complexes are derived from non-glycosylated constructs24,25,29,30, leaving the structural details of fully N-glycosylated HABD elusive.
A shallow groove on the surface of the HABD forms the canonical binding site for HA. There the residue R41 stabilizes the binding in a pincer-like fashion25,31,32. In addition to this so-called crystallographic binding mode (blue chain in Fig. 1), in our previous work, we postulated the existence of two potential lower-affinity binding modes called parallel (green chain in Fig. 1) and upright (red chain in Fig. 1) modes14. These modes occupy the same general face of CD44-HABD, sharing to a large extent the R41-containing binding epitope. Additionally, each of these modes involves a second arginine residue that is distinct from that of the other binding poses14. As a result, each mode covers a unique region of the CD44-HABD surface. Such separation of the binding sites allows their selective silencing via antibodies that target different regions of CD4433. It also suggests that the presence of N-glycosylations may affect each of the binding modes differently. This idea is the central hypothesis of this work.
In this study, we employed atomistic molecular dynamics (MD) simulations to unravel how complex N-glycans at N25, N100, and N110 cooperatively cover the canonical binding groove of CD44-HABD. This sugar shield hinders the accessibility and ligand availability of the canonical binding groove significantly, thereby promoting the secondary upright HA–CD44 binding mode over the crystallographic binding site. We then used NMR complemented by atomistic MD simulations to show that a few short HA oligomers can bind CD44-HABD simultaneously at distinct binding sites. The observed binding sites correspond to the previously characterized crystallographic25, parallel, and upright binding modes14. We further reveal that anti-CD44 antibody MEM-85 does not cross-block the canonical HA binding site in non-glycosylated CD44. Instead, it blocks HA binding to glycosylated CD4434,35. These findings provide compelling evidence for the existence of a lower-affinity upright binding mode for HA. This binding mode overlaps with the binding site of MEM-85 and is promoted by N-glycosylation. The results demonstrate the existence of a new mechanism to control the ligand binding affinity of receptor proteins by promoting alternate binding sites by N-glycosylation.
Results
Complex N-glycans on CD44-HABD can cooperatively block its canonical binding site for hyaluronate
To characterize how N-glycans behave and fold on CD44-HABD, we in silico glycosylated a HABD structure (PDB:1UUH) with myeloma asialo, myeloma monosialo, partial monosialo, and full pentasaccharide N-glycan profiles (Systems G1–4 in Table 2 depicted in Fig. 2c). We then simulated each glycoform through 15 replicas. An average minimum distance between the complex N-glycans and the protein, as mapped onto the surface of HABD (Fig. 2b), reveals that in the myeloma glycoforms, the N-glycans cover a significant fraction of the protein surface. That is, with the complex oligosaccharides in myeloma monosialo and myeloma asialo glycoforms, the N25 glycan can interact intimately with the nearby N100 and N110 glycans, forming a sugar shield that covers the canonical binding site of hyaluronate (Fig. 2a). Furthermore, the contact map for the five N-glycans in the myeloma monosialo glycoform (Fig. 2d) shows the glycans at N25, N100, and N110 to establish, on average, several hundred intermolecular contacts, which are possible only if the three N-glycans become interconnected in the region that resides over the crystallographic hyaluronate binding groove. These results clearly show how complex N-glycans, facilitated by inter-N-glycan interactions, shield a significant portion of the hyaluronate binding face of HABD.
Table 2.
System | Glycoform | HA | FF | Length (ns) | DOI data |
---|---|---|---|---|---|
G1 | Myeloma asialo | None | GLYCAM06 | | 10.5281/zenodo.3742147 |
G2 | Myeloma monosialo | None | GLYCAM06 | | 10.5281/zenodo.3742149 |
G3 | Partial monosialo | None | GLYCAM06 | | 10.5281/zenodo.3742154 |
G4 | Full pentasaccharide | None | GLYCAM06 | | 10.5281/zenodo.3742158 |
B1 | Non-glycosylated | | GLYCAM06 | | 10.5281/zenodo.4005682 |
B2 | Full GlcNAc | | GLYCAM06 | | 10.5281/zenodo.4005689 |
B3 | Full asialo | | GLYCAM06 | | 10.5281/zenodo.4005691 |
B4 | Full monosialo | | GLYCAM06 | | 10.5281/zenodo.4005695 |
B5 | Partial monosialo | | GLYCAM06 | | 10.5281/zenodo.4005701 |
B6 | Full polysialo | | GLYCAM06 | | 10.5281/zenodo.4005707 |
B7 | Full extended asialo | | GLYCAM06 | | 10.5281/zenodo.4005740 |
B8 | Full pentasaccharide | | GLYCAM06 | | 10.5281/zenodo.4005743 |
C1 | Myeloma asialo | | CHARMM36 | | 10.5281/zenodo.3742175 |
C2 | Myeloma monosialo | | CHARMM36 | | 10.5281/zenodo.3742177 |
G5 | Non-glycosylated | | GLYCAM06 | | 10.5281/zenodo.3742160 |
G6 | Non-glycosylated | | GLYCAM06 | | 10.5281/zenodo.3742167 |
Glycoform gives the N-glycan content on the CD44 HABD in each system. HA indicates whether HA was present and of which kind. FF lists the simulation force field. Length lists the duration of the simulations. Glycoforms of CD44 were experimentally found in: Ref.26, Ref.19, Ref.9, Ref.18, Ref.24, Ref.16, and Ref.17.
To study the spontaneous binding of HA to the N-glycosylated CD44-HABD, we performed simulations where both molecules were initially significantly separated. In this setting, the molecules can interact in a spontaneous manner without any apparent bias. These simulations refer to sets B1–8 in Table 2. Typical binding complexes arising from this set-up are shown in Fig. 3a–d.
Comparing the final ( ns) HA–HABD and HA–N-glycans interface areas after the spontaneous binding of HA to the glycosylated HABD (Fig. 3e) reveals how the glycans hinder the recognition and how different glycoforms influence this process. While the HA–N-glycans interface obtains average values of 6–10 nm² with all glycoforms larger than full GlcNAc, the HA–CD44 interface varies significantly depending on the N-glycan content. The shortest full GlcNAc glycoform displays HA–CD44 binding similar to that of the non-glycosylated reference with recognizable binding modes, showing that these neutral sugar units do not obstruct the ligand binding. Instead, they offer more binding surface for HA compared to non-glycosylated HABD. Medium-sized neutral glycans (i.e., full pentasaccharide and full asialo) display HA–HABD interfaces (6 nm²) slightly lower than the non-glycosylated reference (8 nm²). While these glycoforms also provide additional interaction sites for HA through the larger size of the N-glycans, they also cover the important binding residues, preventing the formation of clear HA–CD44 binding modes. Agreeing with previous experimental findings, the sialylated glycoforms (full monosialo, partial monosialo, and full polysialo) display relatively low HA–HABD interfaces, with the partially glycosylated form showing the strongest binding to the protein. Together these results indicate that both the size of the N-glycans and their charge (number of sialic acids) abrogate the binding of HA.
N-glycans foster the occupancy of a secondary hyaluronate–CD44 binding mode
Table 1 lists the coverage of each of the three binding sites by the N-glycans. In the tested glycoforms (systems G1–4 in Table 2), amino acid residues distinct to the CD44-HABD binding modes exhibit a coverage of about 20 to 50%. The somewhat high standard errors indicate a large replica-to-replica variance in the folding of the N-glycans, as well as slow interconversion between the folding patterns. The use of 15 replicas, however, ensures a reasonable sampling of the possible patterns. In all cases, the crystallographic binding site is most significantly obstructed by the N-glycans, while the upright site is obstructed the least. Furthermore, coverage values calculated for the key hyaluronate binding residues of CD44-HABD (see Note SE) reveal how the key residues that are specific to the upright mode, such as K38 and R162, are generally less covered by the N-glycans. These observations imply that the lower-affinity upright mode is the most accessible binding configuration in a glycosylated CD44-HABD.
Table 1.
Binding mode | Realistic asialo coverage (%) | Realistic monosialo coverage (%) | Partial monosialo coverage (%) | Pentasaccharide coverage (%) |
---|---|---|---|---|
Cryst. | ||||
Parallel | ||||
Upright |
Strikingly, we observe minimal differences between the myeloma monosialo and myeloma asialo glycoforms, where the oligosaccharides are of the same length. However, the coverage values decrease notably with reduced glycan content in the partial monosialo or the shorter full pentasaccharide glycoforms. Like the myeloma-derived CD44-HABDs, the partial monosialo glycoform also displays a large number of contacts between the glycans N100 and N110 (Note SD). Their interaction is, however, less prone to disturb the crystallographic binding site as the N25-linked glycan is missing (Note SD). The full pentasaccharide glycoform is fully glycosylated but entails shorter oligosaccharides, which therefore limit the degree of protein coverage. These observations suggest that it is predominantly the degree of glycosylation and the size of the attached oligosaccharides that determine the coverage of the binding site. The inclusion of sialic acids has little effect on the coverage when compared to similar-sized non-sialylated N-glycans.
Figure S8 in Note SH compiled from the spontaneous binding simulations shows that the non-glycosylated HABD expresses the most interactions between HA and the arginines at the crystallographic binding groove (R41 and R78) compared to all the glycosylated HABDs. This indicates that the presence of N-glycans generally decreases the accessibility of these key HA binding residues. Consistently, the flanking arginines (R150, R154, and R162) are relatively more prone to interact with the ligand in the glycosylated cases, further suggesting that the binding modes involving these flanking arginines are activated in the glycosylated receptor. For additional observations from the spontaneous binding simulations, see Note SI.
Antibody MEM-85 does not cross-block hyaluronate binding to non-glycosylated CD44
The scFv MEM-85 antibody prevents the binding of hyaluronate to glycosylated CD4434,35. We used NMR to probe whether the same antibody prevents hyaluronate from binding to a non-glycosylated CD44-HABD. The hyaluronate- and antibody-induced changes are clearly visualized in an overlay of the ¹⁵N/¹H HSQC spectra for the free ¹⁵N-CD44-HABD, ¹⁵N-CD44-HABD in complex with either hyaluronate hexamer (in a threefold molar excess) or scFv MEM-85 (in a twofold molar excess), and ¹⁵N-CD44-HABD with both scFv MEM-85 (in a twofold molar excess) and hyaluronate hexamer (in a threefold molar excess), see Fig. 4a. The observed changes can be interpreted as local perturbations or contacts in the vicinity of a given residue, but they may also reflect a non-local secondary perturbation.
Residues from the hyaluronate-perturbed region such as K38, N39, and G40 exhibit similar spectral behaviour for the mixture of hyaluronate and antibody as for hyaluronate alone, i.e., their signals disappear. On the other hand, residues from the antibody-perturbed region33 such as A138, I145, and G159 (necessary for the upright mode) exhibit similar perturbations for the complex with both hyaluronate and the antibody as for the complex with the antibody alone. Moreover, we calculated the histograms of the minimal combined chemical shift perturbation with respect to the free CD44-HABD spectra along its sequence (Fig. 4c). The obtained chemical shift perturbations indicate that the spectrum of the complex of CD44-HABD with both the antibody and hyaluronate still possesses the antibody-induced changes (residues mainly within the C-terminal segment of CD44-HABD) in addition to the hyaluronate-induced changes (residues mainly within the N-terminal segment of CD44-HABD). This clearly suggests the simultaneous binding of both hyaluronate hexamer and scFv MEM-85 to the non-glycosylated recombinant CD44-HABD.
In addition, the signals in the spectrum obtained for ¹⁵N-CD44-HABD in the presence of both antibody and hyaluronate hexamer are significantly broadened relative to the signals in the spectra obtained for binary mixtures, as expected for the higher molecular weight of the ternary complex.
Short hyaluronate oligomers bind to CD44-HABD simultaneously at distinct binding sites
We analyzed the individual signals in the HSQC spectra for ¹⁵N-CD44-HABD titrated with hyaluronate hexamer; signals located in crowded areas of the spectra, including R41, were not taken into account to avoid ambiguity. This analysis revealed two trends (Fig. 5). Certain backbone amide group signals exhibited an instant shift or disappearance already at the hyaluronate to CD44-HABD ratio of 1:1, which indicates a strong interaction in the sub-µM range of the respective residues with hyaluronate, while other signals shifted gradually during the individual titration steps, suggesting a relatively weaker interaction (>10 µM) (Fig. 5a). In addition, several signals exhibited doubling, connected either with an instant shift or a gradual shift (Fig. 5b). This points to residues that interact with hyaluronate in only a fraction of the CD44-HABD molecules, and/or interact in two different modes. Specifically, the signals of the following residues exhibited instant disappearance: K38, G40, G80, Y114; instant shift: S43, I44, Y79; instant shift with doubling: D140, R150, R154, V156, T174; gradual shift: D23, N25, E37, E75, I96, Y105, Q113, E127, V148, G159, R162, E166; and gradual shift with doubling: N39, R78, L107, K158, N164, D175.
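To make the distinction between instant and gradual titration behaviour concrete, the following minimal sketch evaluates the exact 1:1 binding isotherm at the protein and ligand concentrations used in the titration (200 µM protein with 210, 415, and 620 µM hexamer, as given in Methods). The two dissociation constants (0.5 µM and 100 µM) are illustrative assumptions chosen to represent a sub-µM and a >10 µM site, they are not fitted values from this work, and the sketch treats each site as independent, which ignores competition between sites.

```python
import math

def fraction_bound(p_total_uM, l_total_uM, kd_uM):
    """Fraction of protein in complex for simple 1:1 binding.

    Uses the exact quadratic solution of Kd = [P][L]/[PL] with mass balance.
    """
    b = p_total_uM + l_total_uM + kd_uM
    complex_uM = (b - math.sqrt(b * b - 4.0 * p_total_uM * l_total_uM)) / 2.0
    return complex_uM / p_total_uM

PROTEIN_UM = 200.0                          # protein concentration in the titration
LIGAND_STEPS_UM = (210.0, 415.0, 620.0)     # hexamer concentrations in the titration
ASSUMED_KD_UM = {"strong (assumed)": 0.5, "weak (assumed)": 100.0}  # illustrative only

for label, kd in ASSUMED_KD_UM.items():
    occupancy = [fraction_bound(PROTEIN_UM, lig, kd) for lig in LIGAND_STEPS_UM]
    print(label, [round(x, 2) for x in occupancy])
```

Under these assumptions, the tight site is nearly saturated already at the first titration step, whereas the occupancy of the weak site keeps rising over the later steps, which is the qualitative pattern described above.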
Next, we mapped the critical residues involved in either strong or weak interaction with hyaluronate (Fig. 5c) and the residues interacting with hyaluronate in a single/double mode (Fig. 5d–e) onto the surface of a computational model of CD44-HABD (residues 20–169)14. This illustrates that the surface patches associated with all the three modes are affected in our hyaluronate titration experiments. Notably, the linear patch including residues K38, S43, I44, Y79, G80, Y105, Q113, Y114, R162, and E166 outlines the binding site for the upright binding mode. Moreover, the doubling of the signals in the C-terminal portion of CD44-HABD (residues D140, R150, R154, V156, K158, and N164) indicates the coexistence of the parallel and upright modes with the crystallographic mode. The NMR data, therefore, demonstrate that the short hyaluronate hexamer can, especially in higher molar excess, bind to non-glycosylated recombinant CD44-HABD simultaneously in several modes at distinct binding sites.
To further explore the simultaneous binding of hyaluronate on CD44-HABD, we performed a set of MD simulations with three hyaluronate hexamers binding to CD44-HABD (simulation G5 in Table 2). In these systems, the hyaluronate hexamers are initially in an unbound state (see Note SB) and are thus readily able to sample the space and find their respective binding sites during the course of the simulations (see Table S2 in Note SF). Fig. 5c shows the probability of the HABD surface to be in contact with HA, which correlates with the combined chemical shift perturbations observed in NMR. Additionally, Fig. S6 in Note SF shows a contact profile similar to the chemical shift profile recorded in NMR (Fig. 4), indicating that our experimental and computational results are in agreement.
Discussion
We employed atomistic MD simulations and NMR to shed light on ligand–receptor interactions of CD44 and hyaluronate and to unravel how N-glycosylation modulates these interactions. MD simulations showed that in the crystallographic mode (sub-µM), N-glycans on CD44-HABD collectively shield the primary binding residues for hyaluronate. The shielding effect in this canonical binding mode is the strongest when complex-type N-glycans occupy the N-glycosylation sites N25, N100, and N110. They are the most typical oligosaccharides found in these N-glycosylation sites26 and are sufficiently long to interlock over the canonical hyaluronate binding groove, thereby severely hindering its availability for the ligand.
Backing these observations, our HA binding simulations with glycosylated HABD show how the smaller N-glycan types, such as a single GlcNAc residue, lack both the reach and charge necessary to influence the binding of HA in a negative way. Instead, the presence of GlcNAc residues offers additional binding surface for HA, thereby possibly promoting the recognition of HA by providing additional polar interaction sites and minimal hindrance to the binding. This observation is in line with previous research that has shown with metabolic glycosidase enzymes that GlcNAc residues on CD44-HABD have a positive effect on the binding of HA16. Our simulations also show that once the size of the glycans on HABD increases, they start to prevent the entry of the ligand into its main binding site. A high concentration of sialic acids further amplifies this effect through the increased size and negative charge of the glycans. Overall, our results from the spontaneous binding of HA to glycosylated HABD are in good qualitative agreement with previous experiments that have assessed the effect of different glycoforms, showing similar N-glycan-related size and charge dependence for the binding of HA9,16,27.
We also found that the N-glycosylation of CD44-HABD promotes a secondary, less shielded but weaker (>10 µM) hyaluronate binding site, which corresponds to the upright binding mode characterized previously by us14 and also suggested by others24. The results also revealed the degree of glycosylation and the size of the attached oligosaccharides to be the key factors in determining the coverage of the binding site, while the inclusion of single sialic acids at the glycan termini was found to have only a minor additional effect when glycans of equal length were compared to one another. Thus, it can be speculated, in the case of CD44–HA binding, that the binding-inhibiting role of monosialic acids stems from the more extended nature of the oligosaccharides and the resulting increase in the degree of coverage. The negative charge may play a more significant role in the case of polysialylated sugars, see Fig. 3e. Furthermore, if the glycosylation site N25 lacks sufficiently long glycans, the propensity to interlock with the N100 and N110 glycans decreases, thereby substantially decreasing the coverage of the crystallographic site and leaving the site more exposed to the ligand. This is again in line with findings suggesting that some glycosylation patterns do not decrease the hyaluronate binding16.
Our NMR experiments support the notion of distinct hyaluronate binding sites on non-glycosylated CD44-HABD, which provides substantial evidence for the existence of separate hyaluronate binding modes. Strikingly, the residues perturbed in NMR closely match those involved in the crystallographic, parallel, and upright binding modes. In our previous computational work, we illustrated the dynamic nature of the HABD–HA interactions outside the R41 epitope, especially in the case of the crystallographic binding mode (see Note SG). Similarly, the strong versus weak combined chemical shift perturbations in Fig. 5a show both the R41 epitope and upright groove to give predominantly strong interaction signals, while other regions flanking the R41 epitope tend to give weak interaction signals, corresponding with the increased mobility of the bound HA in those regions. Despite the dynamic interactions, the importance of such weak binding sites to the overall strength of the binding has been found to be high in a related protein–carbohydrate interaction37. The dynamics of the bound HA can be visualized in Note SG. Notably, while our NMR readouts support our computational findings, we cannot rule out the possibility of conformational changes as a reason for some of the observed perturbations.
The experimental results also agree well with the findings of our simulations of multiple hyaluronate hexamers with CD44-HABD, showing a similar hyaluronate–HABD binding profile (Fig. S6 in Note SF). The NMR readouts also show that the anti-CD44 antibody MEM-85 co-binds with hyaluronate on a non-glycosylated CD44, thus having a minimal effect on hyaluronate binding in this case. Conversely, the literature clearly states that MEM-85 blocks the hyaluronate binding of a glycosylated CD4434,35, implying the existence of a lower-affinity binding mode, whose binding site overlaps with the binding site of MEM-85. The MEM-85 epitope is known to be located around the residues Glu160, Tyr161, and Thr16333. As these residues are also a part of the upright mode, our results hint towards the existence of such binding.
Providing further evidence for the existence of the upright mode, when CD44-Ig (immunoglobulin) fusion proteins were expressed in COS cells and hence were presumably glycosylated, both MEM-85 and hyaluronate binding were significantly reduced by the mutation of K38 to arginine34. According to our previous work, K38 is exclusive to the upright mode14, which further implies that glycosylated CD44 preferentially binds hyaluronate in the upright mode rather than in the canonical crystallographic mode. We also note that distinct N-glycosylation profiles, e.g., ones that include an increased number of sialic acids, might cause different alterations to the binding.
CD44–HA interaction is known to display glycosylation-dependent levels of activation9 and binding affinities16. The activation levels have been attributed to varying degrees of sialylation19,38, yet the glycosylation dependent binding affinities could stem from the simultaneous masking of high-affinity binding sites and promotion of secondary sites. Such activation-dependent regulation of glycan remodeling is undoubtedly known to be a major mechanism driving cell motility, e.g., in the immune response12. CD44, in particular, is a hyaluronate-dependent leukocyte homing receptor that mediates both rolling interactions39 and cellular transmigration40. In such processes, tightly regulated affinity is required to enable dynamic velcro-like interactions between leukocytes and endothelial cells at inflamed tissue.
It is known that glycans stabilize or promote specific protein conformations4–6, dimer interfaces3, or orientations8,41, which ultimately affect ligand binding. There is also evidence of oligosaccharides that mask and shield specific parts of the protein surface11,42. N-glycosylations are also generally quite well known to protect large regions of the protein surface from, e.g., non-specific interactions or proteolytic cleavage43. The novelty of the present work lies in the fact that, in addition to all these features, N-glycosylation has an extremely valuable and hitherto unknown mechanism of action: N-glycosylation can control the affinity of ligand–receptor interaction by selectively blocking binding sites and promoting others.
Methods
Simulation system construction and models
We generated computational simulation models of glycosylated CD44-HABD. As the primary oligosaccharides, we employed fucosylated complex-type triantennary N-glycans, containing zero (asialo) or one (monosialo) terminal sialic acids per antenna, i.e., non-reducing termini. These oligosaccharide structures represent the predominant types in the so-called inducible (monosialo) hyaluronate binding phenotypes, together with a non-sialylated reference (asialo)9,18,26. To mimic the predominant CD44 glycovariants found recently in mouse myeloma cells26, we glycosylated N25, N57, N100, and N110 with the above-described complex type N-glycans and N120 with a triantennary high-mannose type structure without fucosylation (Fig. 2c). We call these glycoforms myeloma asialo and myeloma monosialo, depending on the degree of sialylation. Additionally, to emulate the mutant proteins lacking the N25 and N120 glycans that also lead to the inducible phenotype19, we constructed a monosialo glycoform, which lacks N-glycans at N25 and N120 (partial monosialo). Finally, we designed a fourth glycoform, where each of the five N-glycans is a charge-neutral core pentasaccharide (full pentasaccharide), to represent mildly glycosylated, less-cancerous cell types.
We constructed the simulation systems using the crystal structure of human CD44-HABD (PDB:1UUH44). We then followed the steps described in our previous work to curate the 1UUH structure14. This was followed by the in silico N-glycosylation of the HABD structure with the doGlycans45 tool. Before simulations, we inspected the ready-made glycan structures visually46 to confirm their correct configuration and stereochemistry. In every system, sodium and chloride ions were added to reach a typical physiological salt concentration of 150 mM, and to neutralize the charge of the system (Dang ions47 for AMBER99SB-ILDN and default ions for CHARMM36 systems). The systems were solvated with the recommended TIP3P water model48.
For each GLYCAM06-modeled system without a HA ligand (systems G1–4 in Table 2), we generated three different N-glycan starting configurations. Each configuration was used to start five replica simulations of 1000 ns, totalling 15 replicas per glycoform. The CHARMM36 systems were simulated with three replicas (systems C1–2 in Table 2). Those three additional CHARMM36 systems had an added hyaluronate oligomer (18 monosaccharide units). We set them up initially to study HA binding to HABD, yet the oligomer never bound during the trajectories. Hence, we do not expect the hyaluronate to interfere with the folding of the N-glycans in those systems.
To understand how the CD44 glycoprotein binds HA, we constructed GLYCAM06-modeled systems where HA was allowed to spontaneously form a complex with different glycoforms of HABD (simulations B1–8 in Table 2). In the initial frame, we positioned the HA oligomer in the water phase, roughly 2.5 nm away from the R41 residue (the most important binding residue). The reasoning behind this initial distancing is to avoid any bias in the binding, see Note SB. In total, we studied seven different glycoforms: full asialo, full monosialo, partial monosialo, full extended asialo, full pentasaccharide, full GlcNAc, and full polysialo. These glycoform names are further explained in Note SA. Lastly, we used the non-glycosylated HABD from Ref.14 as a reference system without glycosylation. For each glycoform, we performed eight replicas of 1000 ns, as listed in Table 2.
We also constructed an additional GLYCAM06-modeled system (20 replicas of 1000 ns) having non-glycosylated CD44-HABD together with three unbound (i.e., 1.5–2 nm from the protein surface as explained in Note SB) hyaluronate hexamers to study their spontaneous and simultaneous binding (system G5 in Table 2). That is, the carbohydrate fragments associated with and/or dissociated from the protein spontaneously during the course of the simulation trajectories. Similarly, we generated systems (10 replicas of 1000 ns each) with CD44-HABD and two hyaluronate hexamers, of which one was initially complexed to the crystallographic binding site while the other was unbound (system G6 in Table 2). Tables S2 and S3 in Note SF list the observed association/dissociation cycles between the hexamers and HABD in simulations G5 and G6, showing that the sampling is adequate. All simulation data are publicly available at zenodo.org.
Parameters for molecular dynamics simulations
Simulations were conducted using the GROMACS simulation software package49. For every simulation, we employed the following protocol. First, to relax clashes produced in the building process, we performed a short energy minimization run with the steepest descent algorithm (1000 steps). Subsequently, we performed 1 and 2 ns equilibration runs in the NVT and NpT ensembles, respectively, with coordinates of the protein and glycans restrained. Finally, we conducted production runs of different lengths (see Table 2).
The production runs, along with equilibration, employed the leap-frog integrator with a time step of 2 fs. During the runs, periodic boundary conditions were used in all three directions, and the LINCS algorithm was used to keep all bonds constrained50. Electrostatic interactions were treated with particle-mesh Ewald (PME)51 with a cut-off of 1.0 nm for the real part. Lennard–Jones interactions were cut off at 1 nm. Neighbour searching for long-range interactions was carried out every ten steps. The V-rescale52 thermostat was used to couple the systems to a heat bath of 310 K, while the Parrinello–Rahman53 barostat was employed to keep the pressure at 1 bar. At the beginning of each production simulation, we assigned random initial velocities using the Boltzmann distribution at the target temperature. The CHARMM36 simulations used the default parameters provided by CHARMM-GUI v1.754. The simulation trajectories were saved every 100 ps. For other non-specified parameters, we refer to the GROMACS 4.6.755,56 defaults for the AMBER99SB-ILDN/GLYCAM06 systems or to the GROMACS 5.1.449 defaults for the CHARMM36 systems.
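As a rough illustration of how the stated production settings map onto GROMACS run parameters, the sketch below collects them as .mdp options and derives the step counts from the 2 fs time step. It is a minimal sketch, not the input files used in this work: the run length assumes the 1000 ns replicas mentioned above, `nstxtcout` is the GROMACS 4.6 name for the compressed-trajectory output interval (GROMACS 5.x calls it `nstxout-compressed`), and options the text does not specify (coupling groups, coupling time constants, compressibility) are deliberately left out even though a real run needs them.

```python
TIME_STEP_PS = 0.002        # 2 fs time step
SAVE_INTERVAL_PS = 100.0    # trajectories saved every 100 ps
RUN_LENGTH_NS = 1000.0      # assumed: one 1000 ns replica

def steps(duration_ps, dt_ps=TIME_STEP_PS):
    """Number of integration steps needed to cover duration_ps."""
    return int(round(duration_ps / dt_ps))

mdp = {
    "integrator": "md",                  # leap-frog integrator
    "dt": TIME_STEP_PS,
    "nsteps": steps(RUN_LENGTH_NS * 1000.0),
    "pbc": "xyz",                        # periodic in all three directions
    "constraints": "all-bonds",
    "constraint_algorithm": "lincs",
    "coulombtype": "PME",
    "rcoulomb": 1.0,                     # nm, real-space cut-off
    "rvdw": 1.0,                         # nm, Lennard-Jones cut-off
    "nstlist": 10,                       # neighbour search every ten steps
    "tcoupl": "V-rescale",
    "ref_t": 310,                        # K
    "pcoupl": "Parrinello-Rahman",
    "ref_p": 1.0,                        # bar
    "gen_vel": "yes",                    # random initial velocities
    "gen_temp": 310,                     # K
    "nstxtcout": steps(SAVE_INTERVAL_PS),
}

def render_mdp(options):
    """Format the options in the 'key = value' layout of an .mdp file."""
    return "\n".join(f"{key:<24}= {value}" for key, value in options.items())

print(render_mdp(mdp))
```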
Analysis of simulations
All distances and numbers of contacts were calculated with the gmx mindist tool from the GROMACS 5.1.4 package, using a cutoff of 0.3 nm unless stated otherwise.
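To make the two quantities concrete, the following minimal sketch computes a minimum distance and a contact count between two atom groups for a single frame. It is a plain-Python stand-in and not the gmx mindist implementation: the function name is hypothetical, periodic images are ignored, and counting every atom pair within the cutoff as one contact is an assumption about the counting convention.

```python
from math import dist

CUTOFF_NM = 0.3  # contact cutoff stated in the text


def min_distance_and_contacts(group_a, group_b, cutoff=CUTOFF_NM):
    """Return (minimum distance, number of atom pairs closer than cutoff).

    Each group is a list of (x, y, z) coordinates in nm for one frame.
    Periodic boundary conditions are NOT taken into account in this sketch.
    """
    if not group_a or not group_b:
        raise ValueError("both groups must contain at least one atom")
    distances = [dist(a, b) for a in group_a for b in group_b]
    return min(distances), sum(d < cutoff for d in distances)


if __name__ == "__main__":
    protein = [(0.0, 0.0, 0.0)]
    ligand = [(0.1, 0.0, 0.0), (0.29, 0.0, 0.0), (1.0, 0.0, 0.0)]
    print(min_distance_and_contacts(protein, ligand))
```

In this toy frame, two of the three ligand atoms lie within 0.3 nm of the protein atom, so the contact count is 2.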
N-glycan coverage for a given binding mode is calculated by comparing the interactions of HA with a non-glycosylated HABD to the interactions of N-glycans with their core HABD. The calculation, averaged over the replicas, is conducted according to Eq. (1):
[Equation (1)]
The first quantity in Eq. (1) is the average coverage of the protein residue “res” by HA in a given binding “mode”. The second is the average coverage of the protein residue “res” by the N-glycans in each replica (“rep”). The average coverage is calculated as the fraction of frames in which any atom of the HABD residue “res” is closer than 0.3 nm to any atom of the target HA or N-glycans.
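The per-residue average coverage defined above (the fraction of frames in which any atom of the residue is closer than 0.3 nm to any atom of the target) can be sketched as follows. The function names and residue labels are hypothetical, and periodic images are ignored. The combination step in `glycan_coverage`, an HA-weighted average of the glycan coverage taken per replica and then averaged over replicas, is an assumption made purely for illustration and should not be read as the expression of Eq. (1).

```python
from math import dist

CUTOFF_NM = 0.3  # distance criterion stated in the text


def in_contact(residue_atoms, target_atoms, cutoff=CUTOFF_NM):
    """True if any residue atom is closer than `cutoff` to any target atom."""
    return any(dist(a, b) < cutoff for a in residue_atoms for b in target_atoms)


def residue_coverage(frames, cutoff=CUTOFF_NM):
    """Fraction of frames in which the residue touches the target.

    `frames` is a list of (residue_atoms, target_atoms) pairs, one per frame,
    each a list of (x, y, z) coordinates in nm.
    """
    if not frames:
        raise ValueError("need at least one frame")
    hits = sum(in_contact(res, tgt, cutoff) for res, tgt in frames)
    return hits / len(frames)


def glycan_coverage(ha_cov, glycan_cov_per_replica):
    """ASSUMED combination (not the source equation).

    ha_cov: residue -> coverage by HA in one binding mode.
    glycan_cov_per_replica: list of dicts, residue -> coverage by N-glycans.
    """
    weight = sum(ha_cov.values())
    if weight == 0 or not glycan_cov_per_replica:
        raise ValueError("need HA contacts and at least one replica")
    per_replica = [
        sum(ha_cov[r] * rep.get(r, 0.0) for r in ha_cov) / weight
        for rep in glycan_cov_per_replica
    ]
    return sum(per_replica) / len(per_replica)


if __name__ == "__main__":
    frames = [([(0, 0, 0)], [(0.2, 0, 0)]), ([(0, 0, 0)], [(0.5, 0, 0)])]
    print(residue_coverage(frames))
    print(glycan_coverage({"res_a": 1.0, "res_b": 0.0}, [{"res_a": 0.5}, {"res_a": 1.0}]))
```

Under this assumed weighting, residues that HA never touches in a given binding mode do not contribute, so the result reflects how much of the HA footprint is occupied by the N-glycans.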
NMR spectroscopy
The proteins were expressed and purified as described in Ref.33. The ¹⁵N/¹H heteronuclear single quantum coherence (HSQC) spectra were acquired as described in Ref.33 using the following samples: (i) a 350 μl sample containing 100 μM ¹⁵N-labeled CD44-HABD; (ii) a 350 μl sample containing 200 μM ¹⁵N-labeled CD44-HABD and 210, 415, and 620 μM hyaluronate hexamer (Contipro Group, Dolni Dobrouc, Czech Republic); (iii) a 350 μl sample containing 100 μM ¹⁵N-labeled CD44-HABD and 200 μM unlabeled scFv MEM-85; and (iv) a 320 μl sample containing 90 μM ¹⁵N-labeled CD44-HABD, 180 μM unlabeled scFv MEM-85, and 300 μM hyaluronate hexamer. The sequence-specific resonance assignment for free CD44-HABD was obtained as published in Ref.33; signals could not be assigned for the following residues: Tyr42, Ser95, Asn100, Thr108, Ser109, Asn110, Ser112, Cys129, and the prolines. The perturbations of ¹⁵N-labeled CD44-HABD signals in the HSQC spectra were monitored employing the minimal backbone chemical shift method (¹⁵N and ¹H)57.
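The idea behind a minimal chemical shift analysis can be sketched as follows: for each assigned peak of the free protein, the combined ¹H/¹⁵N distance to every peak in the spectrum of the complex is computed, and the smallest value is taken as a lower bound on the perturbation of that residue. This is a generic sketch rather than the procedure of Ref.57; the function name, the peak-list layout, and in particular the ¹⁵N scaling factor of 0.2 are assumptions.

```python
from math import hypot

N_SCALE = 0.2  # ASSUMED weighting of 15N shifts relative to 1H shifts


def minimal_shifts(free_peaks, bound_peaks, n_scale=N_SCALE):
    """Minimal combined shift (ppm) for each assigned residue.

    free_peaks:  dict mapping residue label -> (h_ppm, n_ppm), assigned.
    bound_peaks: list of (h_ppm, n_ppm) from the complex, unassigned.
    """
    if not bound_peaks:
        raise ValueError("need at least one peak in the bound spectrum")
    result = {}
    for residue, (h_free, n_free) in free_peaks.items():
        # Distance to the closest bound peak, whichever residue it belongs to.
        result[residue] = min(
            hypot(h - h_free, (n - n_free) * n_scale) for h, n in bound_peaks
        )
    return result


if __name__ == "__main__":
    free = {"res_1": (8.0, 120.0), "res_2": (7.5, 115.0)}  # hypothetical peaks
    bound = [(8.0, 120.0), (7.6, 116.0)]
    for residue, shift in minimal_shifts(free, bound).items():
        print(residue, round(shift, 3))
```

Because the closest peak is used regardless of its identity, the method needs no assignment of the complex spectrum and can only underestimate the true perturbation.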
Acknowledgements
HMS acknowledges support from the Czech Science Foundation (19-19561S). IV acknowledges financial support from the Academy of Finland Center of Excellence program, Sigrid Juselius Foundation, and the European Research Council (CROWDED-PRO-LIPIDS). JŠ, MF, VV, PŘ acknowledge funding from projects RVO 61388963 and 68378050 awarded by the Academy of Sciences of the Czech Republic and by the Ministry of Education of the Czech Republic, projects LO1304 (program ’NPU I’) and CZ.02.1.01/0.0/0.0/16_019/0000729 (program OP RDE). We also acknowledge CSC-IT Center for Science (Espoo, Finland) for providing the computing resources that rendered this work possible.
Author contributions
H.M.-S. and J.V. designed the research with contributions from V.V., J.Š., I.V., and P.Ř. J.V. performed the simulations under the supervision of H.M.-S. and I.V. J.Š., M.F., V.V., and P.Ř. performed the NMR experiments. J.V. and H.M.-S. wrote the paper with contributions from J.Š. and V.V., and comments from all authors.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-84569-z.
References
- 1. Corfield AP, Berry M. Glycan variation and evolution in the eukaryotes. Trends Biochem. Sci. 2015;40:351–359. doi: 10.1016/j.tibs.2015.04.004.
- 2. Apweiler R, Hermjakob H, Sharon N. On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. Biochim. Biophys. Acta (BBA)-Gen. Subj. 1999;1473:4–8. doi: 10.1016/s0304-4165(99)00165-8.
- 3. Halder S, Surolia A, Mukhopadhyay C. Dynamics simulation of soybean agglutinin (SBA) dimer reveals the impact of glycosylation on its enhanced structural stability. Carbohydr. Res. 2016;428:8–17. doi: 10.1016/j.carres.2016.04.009.
- 4. Huang X, et al. Glycosylation affects both the three-dimensional structure and antibody binding properties of the HIV-1IIIB GP120 peptide RP135. Biochemistry. 1997;36:10846–10856. doi: 10.1021/bi9703655.
- 5. Arshad N, Ballal S, Visweswariah SS. Site-specific N-linked glycosylation of receptor guanylyl cyclase C regulates ligand binding, ligand-mediated activation and interaction with vesicular integral membrane protein 36, VIP36. J. Biol. Chem. 2013;288:3907–3917. doi: 10.1074/jbc.M112.413906.
- 6. Lowery JW, Amich JM, Andonian A, Rosen V. N-linked glycosylation of the bone morphogenetic protein receptor type 2 (BMPR2) enhances ligand binding. Cell. Mol. Life Sci. 2014;71:3165–3172. doi: 10.1007/s00018-013-1541-8.
- 7. Moremen KW, Tiemeyer M, Nairn AV. Vertebrate protein glycosylation: diversity, synthesis and function. Nat. Rev. Mol. Cell Biol. 2012;13:448. doi: 10.1038/nrm3383.
- 8. Kaszuba K, et al. N-Glycosylation as determinant of epidermal growth factor receptor conformation in membranes. Proc. Natl. Acad. Sci. 2015;201503262. doi: 10.1073/pnas.1503262112.
- 9. Lesley J, English N, Perschl A, Gregoroff J, Hyman R. Variant cell lines selected for alterations in the function of the hyaluronan receptor CD44 show differences in glycosylation. J. Exp. Med. 1995;182:431–437. doi: 10.1084/jem.182.2.431.
- 10. Lee HS, Qi Y, Im W. Effects of N-glycosylation on protein conformation and dynamics: Protein Data Bank analysis and molecular dynamics simulation study. Sci. Rep. 2015;5:8926. doi: 10.1038/srep08926.
- 11. Liwosz A, Lei T, Kukuruzinska MA. N-glycosylation affects the molecular organization and stability of E-cadherin junctions. J. Biol. Chem. 2006;281:23138–23149. doi: 10.1074/jbc.m512621200.
- 12. Van Kooyk Y, Rabinovich GA. Protein-glycan interactions in the control of innate and adaptive immune responses. Nat. Immunol. 2008;9:593. doi: 10.1038/ni.f.203.
- 13. van Oosten AS, Janmey PA. Extremely charged and incredibly soft: Physical characterization of the pericellular matrix. Biophys. J. 2013;104:961. doi: 10.1016/j.bpj.2013.01.035.
- 14. Vuorio J, Vattulainen I, Martinez-Seara H. Atomistic fingerprint of hyaluronan-CD44 binding. PLoS Comput. Biol. 2017;13:e1005663. doi: 10.1371/journal.pcbi.1005663.
- 15. Rudy W, et al. The two major CD44 proteins expressed on a metastatic rat tumor cell line are derived from different splice variants: each one individually suffices to confer metastatic behavior. Cancer Res. 1993;53:1262–1268.
- 16. Skelton TP, Zeng C, Nocks A, Stamenkovic I. Glycosylation provides both stimulatory and inhibitory effects on cell surface and soluble CD44 binding to hyaluronan. J. Cell Biol. 1998;140:431–446. doi: 10.1083/jcb.140.2.431.
- 17. Katoh S, Zheng Z, Oritani K, Shimozato T, Kincade PW. Glycosylation of CD44 negatively regulates its recognition of hyaluronan. J. Exp. Med. 1995;182:419–429. doi: 10.1084/jem.182.2.419.
- 18. Zheng Z, Cummings RD, Pummill PE, Kincade PW. Growth as a solid tumor or reduced glucose concentrations in culture reversibly induce CD44-mediated hyaluronan recognition by Chinese hamster ovary cells. J. Clin. Investig. 1997;100:1217. doi: 10.1172/jci119635.
- 19. English NM, Lesley JF, Hyman R. Site-specific de-N-glycosylation of CD44 can activate hyaluronan binding, and CD44 activation states show distinct threshold densities for hyaluronan binding. Cancer Res. 1998;58:3736–3742.
- 20. Aruffo A, Stamenkovic I, Melnick M, Underhill CB, Seed B. CD44 is the principal cell surface receptor for hyaluronate. Cell. 1990;61:1303–1313. doi: 10.1016/0092-8674(90)90694-a.
- 21. Toole BP. Hyaluronan: from extracellular glue to pericellular cue. Nat. Rev. Cancer. 2004;4:528–539. doi: 10.1038/nrc1391.
- 22. Ponta H, Sherman L, Herrlich PA. CD44: from adhesion molecules to signalling regulators. Nat. Rev. Mol. Cell Biol. 2003;4:33–45. doi: 10.1038/nrm1004.
- 23. Wolf KJ, Kumar S. Hyaluronic acid: Incorporating the bio into the material. ACS Biomater. Sci. Eng. 2019;5:3753–3765. doi: 10.1021/acsbiomaterials.8b01268.
- 24. Teriete P, et al. Structure of the regulatory hyaluronan binding domain in the inflammatory leukocyte homing receptor CD44. Mol. Cell. 2004;13:483–496. doi: 10.1016/s1097-2765(04)00080-2.
- 25. Banerji S, et al. Structures of the Cd44-hyaluronan complex provide insight into a fundamental carbohydrate-protein interaction. Nat. Struct. Mol. Biol. 2007;14:234–239. doi: 10.1038/nsmb1201.
- 26. Han H, et al. Comprehensive characterization of the N-glycosylation status of CD44s by use of multiple mass spectrometry-based techniques. Anal. Bioanal. Chem. 2012;404:373–388. doi: 10.1007/s00216-012-6167-4.
- 27. Katoh S, et al. A crucial role of sialidase Neu1 in hyaluronan receptor function of CD44 in T helper type 2-mediated airway inflammation of murine acute asthmatic model. Clin. Exp. Immunol. 2010;161:233–241. doi: 10.1111/j.1365-2249.2010.04165.x.
- 28. Faller CE, Guvench O. Terminal sialic acids on CD44 N-glycans can block hyaluronan binding by forming competing intramolecular contacts with arginine sidechains. Proteins Struct. Funct. Bioinf. 2014;82:3079–3089. doi: 10.1002/prot.24668.
- 29. Takeda M, et al. Hyaluronan recognition mode of CD44 revealed by cross-saturation and chemical shift perturbation experiments. J. Biol. Chem. 2003;278:43550–43555. doi: 10.1074/jbc.m308199200.
- 30. Liu L-K, Finzel BC. Fragment-based identification of an inducible binding site on cell surface receptor CD44 for the design of protein-carbohydrate interaction inhibitors. J. Med. Chem. 2014;57:2714–2725. doi: 10.1021/jm5000276.
- 31. Jamison FW, II, Foster TJ, Barker JA, Hills RD, Jr, Guvench O. Mechanism of binding site conformational switching in the CD44-hyaluronan protein-carbohydrate binding interaction. J. Mol. Biol. 2011;406:631–647. doi: 10.1016/j.jmb.2010.12.040.
- 32. Favreau AJ, Faller CE, Guvench O. CD44 receptor unfolding enhances binding by freeing basic amino acids to contact carbohydrate ligand. Biophys. J. 2013;105:1217–1226. doi: 10.1016/j.bpj.2013.07.041.
- 33. Škerlová J, et al. Molecular mechanism for the action of the anti-CD44 monoclonal antibody MEM-85. J. Struct. Biol. 2015;191:214–223. doi: 10.1016/j.jsb.2015.06.005.
- 34. Bajorath J, Greenfield B, Munro SB, Day AJ, Aruffo A. Identification of CD44 residues important for hyaluronan binding and delineation of the binding site. J. Biol. Chem. 1998;273:338–343. doi: 10.1074/jbc.273.1.338.
- 35. Sandmaier BM, Storb R, Bennett KL, Appelbaum FR, Santos EB. Epitope specificity of CD44 for monoclonal antibody-dependent facilitation of marrow engraftment in a canine model. Blood. 1998;91:3494–3502. doi: 10.1182/blood.v91.9.3494.3494_3494_3502.
- 36. Varki A, et al. Symbol nomenclature for graphical representations of glycans. Glycobiology. 2015;25:1323–1324. doi: 10.1093/glycob/cwv091.
- 37. van Bueren AL, Boraston AB. Binding sub-site dissection of a carbohydrate-binding module reveals the contribution of entropy to oligosaccharide recognition at “non-primary” binding subsites. J. Mol. Biol. 2004;340:869–879. doi: 10.1016/j.jmb.2004.05.038.
- 38. Katoh S, et al. Cutting edge: an inducible sialidase regulates the hyaluronic acid binding ability of CD44-bearing human monocytes. J. Immunol. 1999;162:5058–5061.
- 39. DeGrendele HC, Estess P, Picker LJ, Siegelman MH. CD44 and its ligand hyaluronate mediate rolling under physiologic flow: a novel lymphocyte-endothelial cell primary adhesion pathway. J. Exp. Med. 1996;183:1119–1130. doi: 10.1084/jem.183.3.1119.
- 40. DeGrendele HC, Estess P, Siegelman MH. Requirement for CD44 in activated T cell extravasation into an inflammatory site. Science. 1997;278:672–675. doi: 10.1126/science.278.5338.672.
- 41. Polley A, et al. Glycosylation and lipids working in concert direct CD2 ectodomain orientation and presentation. J. Phys. Chem. Lett. 2017;8:1060–1066. doi: 10.1021/acs.jpclett.6b02824.
- 42. Peiris D, et al. Cellular glycosylation affects Herceptin binding and sensitivity of breast cancer cells to doxorubicin and growth factors. Sci. Rep. 2017;7:43006. doi: 10.1038/srep43006.
- 43. Rudd PM, Wormald MR, Dwek RA. Sugar-mediated ligand-receptor interactions in the immune system. Trends Biotechnol. 2004;22:524–530. doi: 10.1016/j.tibtech.2004.07.012.
- 44. Berman HM, et al. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1201/9780203911327.ch14.
- 45. Danne R, et al. doGlycans-tools for preparing carbohydrate structures for atomistic simulations of glycoproteins, glycolipids, and carbohydrate polymers for GROMACS. J. Chem. Inf. Model. 2017;57:2401–2406. doi: 10.1021/acs.jcim.7b00237.
- 46. Humphrey W, Dalke A, Schulten K. VMD: Visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5.
- 47. Dang LX. Development of nonadditive intermolecular potentials using molecular dynamics: solvation of Li+ and F- ions in polarizable water. J. Chem. Phys. 1992;96:6970–6977. doi: 10.1063/1.462555.
- 48. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869.
- 49. Abraham MJ, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1:19–25. doi: 10.1016/j.softx.2015.06.001.
- 50. Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: a linear constraint solver for molecular simulations. J. Comput. Chem. 1997;18:1463–1472. doi: 10.1002/(sici)1096-987x(199709)18:12<1463::aid-jcc4>3.0.co;2-h.
- 51. Darden T, York D, Pedersen L. Particle mesh Ewald: An N log (N) method for Ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. doi: 10.1063/1.464397.
- 52. Bussi G, Donadio D, Parrinello M. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007;126:4101. doi: 10.1063/1.2408420.
- 53. Parrinello M, Rahman A. Polymorphic transitions in single crystals: A new molecular dynamics method. J. Appl. Phys. 1981;52:7182–7190. doi: 10.1063/1.328693.
- 54. Jo S, Kim T, Iyer VG, Im W. CHARMM-GUI: A web-based graphical user interface for CHARMM. J. Comput. Chem. 2008;29:1859–1865. doi: 10.1002/jcc.20945.
- 55. Hess B, Kutzner C, van der Spoel D, Lindahl E. GROMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation. J. Chem. Theory Comput. 2008;4:435–447. doi: 10.1021/ct700301q.
- 56. Pronk S, et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics. 2013;btt055. doi: 10.1093/bioinformatics/btt055.
- 57. Veverka V, et al. Structural characterization of the interaction of mTOR with phosphatidic acid and a novel class of inhibitor: compelling evidence for a central role of the FRB domain in small molecule-mediated regulation of mTOR. Oncogene. 2008;27:585. doi: 10.1038/sj.onc.1210693.