Abstract
The structural features of MUC1-like glycopeptides bearing the Tn antigen (α-O-GalNAc-Ser/Thr) in complex with an anti MUC-1 antibody are reported at atomic resolution. For the α-O-GalNAc-Ser derivative, the glycosidic linkage adopts a high-energy conformation, barely populated in the free state. This unusual structure (also observed in an α-S-GalNAc-Cys mimic) is stabilized by hydrogen bonds between the peptidic fragment and the sugar. The selection of a particular peptide structure by the antibody is thus propagated to the carbohydrate through carbohydrate/peptide contacts, which force a change in the orientation of the sugar moiety. This seems to be unfeasible in the α-O-GalNAc-Thr glycopeptide owing to the more limited flexibility of the side chain imposed by the methyl group. Our data demonstrate the non-equivalence of Ser and Thr O-glycosylation points in molecular recognition processes. These features provide insight into the occurrence in nature of the APDTRP epitope for anti-MUC1 antibodies.
Keywords: antibodies, conformation analysis, glycopeptides, molecular recognition, X-ray diffraction
The Tn antigen (α-O-GalNAc-Ser/Thr) is one of the most specific human tumor-associated structures.[1] This entity, which is also implicated in HIV infection,[2] is expressed in approximately 90 % of carcinomas, and a direct correlation between the aggressiveness of the carcinoma and the occurrence of the antigen has been observed.[3] Consequently, the Tn antigen has found widespread use as biomarker and as a potential therapeutic target against cancer.[1, 4–7] Structural analysis of Tn antigen bound to its biological targets is thus of great significance for elucidating the mechanism of recognition, as well as for engineering novel antibodies and biosensors. In general, the Tn antigen is referred to as N-acetylgalactosamine (GalNAc) α-O-linked to serine (Ser) or threonine (Thr), without specifying which of the two amino acids the GalNAc is linked to. However, we and others have observed the existence of subtly different conformational behaviors in solution of the basic Ser- and Thr-containing structures.[8–14]
Herein, we present a detailed analysis of the interaction of these two Tn determinants, as MUC1 glycopeptides, to an anti-MUC1 antibody. MUC1 is a heavily O-glycosylated membrane glycoprotein consisting of tandem repeats of 20 amino acids (AHGVTSAPDTRPAPGSTAPP), with five possible glycosylation sites.[15, 16] This protein is overexpressed and partially glycosylated in cancer cells. Consequently, some peptide fragments that are masked in healthy cells, such as APDTRP and their glycosylated analogues, are now accessible and can interact with the immune system. Although the observed enhancement of antibody affinity has been attributed to conformational changes induced by the glycan in the peptide backbone,[17–20] the molecular basis for this observation remains unclear.
To our knowledge, the only reported crystal structure of a complex between an antibody and a GalNAc-containing glycopeptide is unrelated to mucins.[21] In addition, the X-ray structure of a model anti-MUC1 antibody (SM3) in complex with a mucin is limited to a naked peptide.[22] Moreover, an NMR study on this peptide and its corresponding GalNAc-glycopeptide bound to SM3 led to the hypothesis that the sugar residue fixes the bioactive conformation of the peptide fragment and interacts via the N-acetyl group with the surface of the antibody.[23] However, no detailed information on the intermolecular interactions could be deduced from this ligand-based NMR analysis.
A detailed analysis of the interactions between the SM3 antibody and two synthetic glycopeptides bearing the α-O-GalNAc-Thr and α-O-GalNAc-Ser antigens (m1* and m2* respectively; Figure 1) is presented herein. These molecules include the tandem repeat sequence of MUC1. The SM3 antibody was selected because its epitope recognition mode is similar to that of other anti-MUC1 antibodies,[24] which expands the scope of these results, and also because of its potential for use in the early diagnosis and treatment of breast cancer.[22]
Figure 1.

The MUC1-like peptides and glycopeptides studied in this work (upper panel). Bio-layer interferometry (BLI) curves and fit obtained for glycopeptide m1* and scFv-SM3, together with the KD constants derived from BLI experiments for all of the MUC1-related compounds (lower panel).
The influence of the chemical nature of the underlying amino acid, as well as the GalNAcylation, on antibody binding affinity was first evaluated. For this purpose, the related naked peptides (m1 and m2, respectively; Figure 1) were also synthesized and tested. The KD constants for the MUC1 variants with the scFv-SM3 antibody were experimentally determined through bio-layer interferometry (BLI) experiments. The higher affinity (around 3-fold) of SM3 for glycosylated m1* compared the naked peptide m1 was confirmed by these tests[24] (Figure 1). Furthermore, the Ser-containing compounds showed significantly lower affinity, thus highlighting the differences between the two Tn antigens. These results were corroborated by ELISA tests (see the Supporting Information).
To explain these results at the atomic level, a scFv-SM3 antibody was produced and purified. High quality crystals of the SM3:1, SM3:1*, and SM3:2* complexes were obtained, where 1, 1* and 2* are simplified models of m1, m1* and m2*, respectively (Figure 2 a, PDB IDs 5a2j, 5a2k, and 5a2i). These compounds include the peptide fragment that represents the minimal epitope recognized by most anti-MUC1 antibodies.[24]
Figure 2.

a) Simplified MUC1 variants used for crystallization. b) Surface representation of SM3 in complex with 1*. The antigen is shown as a stick model with carbon atoms in green. c) Superposition of the peptide backbone of compounds 1, 1*, 2*, and the SAPDTRPAP peptide[22] in complex with SM3.
The obtained crystals enabled solution of the structures at high resolution (<2.0 Å, see the Supporting Information). Crystallographic analysis revealed that the surface groove of the recombinant SM3 antibody nicely fits all of the peptide residues in the three studied complexes (Figure 2 b), independent of the presence of the sugar moiety. Moreover, the overall conformation of the peptide fragment of the different simplified MUC1 variants is nearly identical and is similar to that found in the crystal structure reported for the naked peptide[22] (Figure 2 c and Figure 3).
Figure 3.

Key binding interactions of peptide 1 (a), glycopeptide 1* (b), and glycopeptide 2* (c) with SM3 mAb, as observed in the X-ray crystal structures. Peptide and glycopeptide carbon atoms are shown in green. GalNAc carbon atoms are shown in cyan. Carbon atoms of key residues of SM3 are colored yellow. Blue dashed lines indicate interactions between GalNAc and SM3 surface, and pink dashed lines indicate hydrogen bonds between peptide backbones and SM3 antibody.
Therefore, the presence of the GalNAc moiety, regardless of the attached amino acid (Ser or Thr), does not significantly affect the conformation of the peptide backbone in the SM3-bound state. In fact, the antibody–antigen pairs display the same pattern of interactions with each of the three substrates (1, 1* and 2*) and with the reported peptide[22] (Figure 3). The stabilizing contacts in these complexes involve several hydrogen bonds, some of them mediated by water molecules, as well as several stacking interactions (Figure 3 and the Supporting Information). Pro2 stacks with Trp91L, Trp96L, and Tyr32L, while the side-chains of Asp3 and Arg5 are engaged in hydrophobic contacts with Trp 33H and Tyr 32H, respectively. Additionally, in compounds 1* and 2*, the NH group of Ala1 and the carbonyl group of Thr4/Ser4 are involved in hydrogen bonds with Tyr 32L and Gln97H, respectively. The 3D presentation of the side chain of Arg5 in glycopeptide 2* (Figure 2 c) differs slightly from that found for the other analogues. As a result, there is no hydrogen bond between the side chain of Arg5 and the carbonyl group of Asn31H. This interaction, among other factors (see below), may be at the cause of the low affinity of SM3 for 2*.
Interestingly, the major difference between glycopeptides 1* and 2* bound to SM3 resides in the geometry of the glycosidic linkage (Figure 4 a). In fact, in the SM3:1* complex, this linkage adopts the expected exo-anomeric/syn conformation, with ϕ and ψ values of around 63° and 91°, respectively.[9] This conformation is similar to that found for a non-related MUC1 glycopeptide bound to 237-mAb[21] (Figure 4 b) and that exhibited for 1* bound to a model lectin (Soybean agglutinin).[25] This geometry allows the formation of an intermolecular hydrogen bond between the hydroxymethyl group of GalNAc and the side chain of Tyr32L on the SM3 antibody. Moreover, the N-acetyl group of the sugar stacks with the aromatic ring of Trp 33H, thus providing the impetus for the observed selectivity of SM3 for GalNAc-containing antigens. This interaction is compatible with the previous solution NMR data.[23] By contrast, the GalNAc unit of compound 2* establishes only weak water-mediated hydrogen bonds with the antibody (see the Supporting Information). The glycosidic linkage shows a high-energy conformation, with a ψ value of around −97° (Figure 4 a, c). This conformation is barely populated for the α-O-GalNAc-Ser motif in solution. In fact, the conformational analysis performed on glycopeptides 1* and 2* in the free state in water by means of NMR experiments and Molecular Dynamics (MD) simulations with time-averaged restraints[9] indicates that compound 2* displays this conformation with a population approximately 20 % (see the Supporting Information). As previously observed by us,[8, 9] while in compound 1*, the glycosidic linkage adopts mainly the typical eclipsed conformation, in variant 2* it prefers to adopt the alternate conformation. According to the X-ray structure, the unusual conformation of the glycosidic linkage in compound 2* bound to the antibody is stabilized by two intramolecular hydrogen bonds with the peptide chain (Figure 5). The GalNAc endocyclic oxygen O5 engages in a hydrogen bond with the NH group of the attached Ser and the O6 interacts with the side chain of Asp3. To our knowledge, this geometry of the glycosidic linkage has not been previously observed for protein-bound Tn-containing peptides. It is important to note that the quality of the SM3:2* structure was lower and that the B-factor for 2* was higher compared to the other structures, which may indicate some degree of flexibility.
Figure 4.

a) Distribution of ϕ/ψ torsional angles found for α-O-GalNAc-Ser and α-O-GalNAc-Thr in water,[8, 9] together with the geometries found in X-ray structures for these determinants when bound to some biological targets.[21, 25–28] Newman projections of Cβ–O1 bond are shown. b) Superposition of the Tn antigen moiety α-O-GalNAc-Thr bound to SM3 (in green) and 237-mAb (in yellow; mAb=monoclonal antibody).[21] c) Superposition of the Tn antigen moiety α-O-GalNAc-Ser bound to SM3 mAb (in green) and to HPA lectin (in yellow).[27]
Figure 5.

Conformation of glycopeptides 2* (in green and cyan) and 3* (in brown) in complex with scFv-SM3 antibody, together with the geometry of the glycosidic linkage and the hydrogen bonds established with the peptide fragment.
MD simulations performed for the SM3:2* complex corroborated this flexibility in the bound state. The X-ray conformation was retained only for the first 4 ns of the trajectory (see the Supporting Information). Then, the glycosidic linkage exhibits the typical parallel orientation found for this compound in the free state.[8] To reinforce the significance of the β-methyl group of the threonine residue for the conformation of the glycosidic linkage, and to corroborate the exceptional 3D conformation of glycopeptide 2* in the complex with the antibody, we solved the structure of the cysteine analogue in complex with scFv-SM3 (compound 3* in Figure 2, PDB ID: 5a2L). Notably, the GalNAc unit and the peptide backbone adopt an almost identical spatial conformation in glycopeptides 2* and 3*, with the main difference being the conformation of the side chain of Arg5 (Figure 5). Therefore, the selection of a particular peptide structure by the antibody is somehow propagated to the carbohydrate through specific carbohydrate/peptide contacts, thereby forcing a drastic change in the orientation of the sugar moiety. This situation is not possible in the Thr-containing derivative owing to the limited conformational freedom of its side chain imposed by the methyl group (Figure 5). As a result, the GalNAc unit adopts a completely different presentation when linked to the threonine residue, with most of the hydroxy groups exposed and able to interact with the corresponding partners of the immune system.
In conclusion, we have uncovered, at the atomic level, the reasons why Ser- and Thr-linked glycopeptides bind differently to SM3. We have provided experimental evidences for distinct presentations of the Tn-carrying serine (α-GalNAc-Ser) and threonine (α-GalNAc-Thr) antigens when bound to SM3. The reasons for the observed limited enhancement in SM3 affinity in 1* versus 1 can be attributed to the weak hydrogen bond between O6 and Tyr32L, and a hydrophobic contact between the methyl group of the GalNAc unit and Trp33H. These findings emphasize the differences between these two Tn antigens in the context of recognition by anti-MUC1 antibodies and may have important implications for the design of novel antibodies and biosensors. In addition, our findings may provide insight into the occurrence in nature of the APDTRP epitope for anti-MUC1 antibodies.
Supporting Information
As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re-organized for online delivery, but are not copy-edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.
miscellaneous_information
References
- 1.Ju T, Otto VI, Cummings RD. Angew. Chem. Int. Ed. 2011;50:1770–1791. doi: 10.1002/anie.201002313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Angew. Chem. 2011;123 [Google Scholar]
- 2.Hansen JE, Nielsen C, Arendrup M, Olofsson S, Mathiesen L, Nielsen JO, Clausen H. J. Virol. 1991;65:6461–6467. doi: 10.1128/jvi.65.12.6461-6467.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Springer GF. J. Mol. Med. 1997;75:594–602. doi: 10.1007/s001090050144. [DOI] [PubMed] [Google Scholar]
- 4.Buskas T, Thompson P, Boons GJ. Chem. Commun. 2009:5335–5349. doi: 10.1039/b908664c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lakshminarayanan V, Thompson P, Wolfert MA, Buskas T, Bradley JM, Pathangey LB, Madsen CS, Cohen PA, Gendler SJ, Boons G-J. Proc. Natl. Acad. Sci. USA. 2012;109:261–266. doi: 10.1073/pnas.1115166109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wilson RM, Danishefsky SJ. J. Am. Chem. Soc. 2013;135:14462–14472. doi: 10.1021/ja405932r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cai H, Chen M-S, Sun Z-Y, Zhao Y-F, Kunz H, Li Y-M. Angew. Chem. Int. Ed. 2013;52:6106–6110. doi: 10.1002/anie.201300390. [DOI] [PubMed] [Google Scholar]
- Angew. Chem. 2013;125 [Google Scholar]
- 8.Corzana F, Busto JH, Jiménez-Osés G, Asensio JL, Jiménez-Barbero J, Peregrina JM, Avenoza A. J. Am. Chem. Soc. 2006;128:14640–14648. doi: 10.1021/ja064539u. [DOI] [PubMed] [Google Scholar]
- 9.Corzana F, Busto JH, Jiménez-Osés G, García de Luis M, Asensio JL, Jiménez-Barbero J, Peregrina JM, Avenoza A. J. Am. Chem. Soc. 2007;129:9458–9467. doi: 10.1021/ja072181b. [DOI] [PubMed] [Google Scholar]
- 10.Madariaga D, Martínez-Sáez N, Somovilla VJ, García-García L, Berbis MÁ, Valero-González J, Martín-Santamaría S, Hurtado-Guerrero R, Asensio JL, Jiménez-Barbero J, Avenoza A, Busto JH, Corzana F, Peregrina JM. Chem. Eur. J. 2014;20:12616–12627. doi: 10.1002/chem.201403700. [DOI] [PubMed] [Google Scholar]
- 11.Mazal D, Lo-Man R, Bay S, Pritsch O, Dériaud E, Ganneau C, Medeiros A, Ubillos L, Obal G, Berois N, Bollati-Fogolin M, Leclerc C, Osinaga E. Cancer Immunol. Immunother. 2013;62:1107–1122. doi: 10.1007/s00262-013-1425-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang Y, Li Q, Rodriguez LG, Gildersleeve JC. J. Am. Chem. Soc. 2010;132:9653–9662. doi: 10.1021/ja100608w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kanekura T, Sakuraba H, Matsuzawa F, Aikawa S, Doi H, Hirabayashi Y, Yoshii N, Fukushige T, Kanzaki T. J. Dermatol. Sci. 2005;37:15–20. doi: 10.1016/j.jdermsci.2004.09.005. [DOI] [PubMed] [Google Scholar]
- 14.Brister MA, Pandey AK, Bielska AA, Zondlo NJ. J. Am. Chem. Soc. 2014;136:3803–3816. doi: 10.1021/ja407156m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Taylor-Papadimitriou J, Burchell J, Miles DW. Biochim. Biophys. Acta Mol. Basis Dis. 1999;1455:301–313. doi: 10.1016/s0925-4439(99)00055-1. [DOI] [PubMed] [Google Scholar]
- 16.Tarp MA, Clausen H. Biochim. Biophys. Acta Gen. Subj. 2008;1780:546–563. doi: 10.1016/j.bbagen.2007.09.010. [DOI] [PubMed] [Google Scholar]
- 17.Barchi JJ., Jr Biopolymers. 2013;99:713–723. doi: 10.1002/bip.22313. [DOI] [PubMed] [Google Scholar]
- 18.Schuman J, Campbell AP, Koganty RR, Longenecker BM. J. Pept. Res. 2003;61:91–108. doi: 10.1034/j.1399-3011.2003.00031.x. [DOI] [PubMed] [Google Scholar]
- 19.Kinarsky L, Suryanarayanan G, Prakash O, Paulsen H, Clausen H, Hanisch F-G, Hollingsworth MA, Sherman S. Glycobiology. 2003;13:929–939. doi: 10.1093/glycob/cwg109. [DOI] [PubMed] [Google Scholar]
- 20.Matsushita T, Ohyabu N, Fujitani N, Naruchi K, Shimizu H, Hinou H, Nishimura S-I. Biochemistry. 2013;52:402–414. doi: 10.1021/bi3013142. [DOI] [PubMed] [Google Scholar]
- 21.Brooks CL, Schietinger A, Borisova SN, Kufer P, Okon M, Hirama T, Mackenzie CR, Wang L-X, Schreiber H, Evans SV. Proc. Natl. Acad. Sci. USA. 2010;107:10056–10061. doi: 10.1073/pnas.0915176107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Dokurno P, Bates PA, Band HA, Stewart LM, Lally JM, Burchell JM, Taylor-Papadimitriou J, Snary D, Sternberg MJ, Freemont PS. J. Mol. Biol. 1998;284:713–728. doi: 10.1006/jmbi.1998.2209. [DOI] [PubMed] [Google Scholar]
- 23.Möller H, Serttas N, Paulsen H, Burchell JM, Taylor-Papadimitriou J, Meyer B. Eur. J. Biochem. 2002;269:1444–1455. doi: 10.1046/j.1432-1033.2002.02787.x. [DOI] [PubMed] [Google Scholar]
- 24.Karsten U, Serttas N, Paulsen H, Danielczyk A, Goletz S. Glycobiology. 2004;14:681–692. doi: 10.1093/glycob/cwh090. [DOI] [PubMed] [Google Scholar]
- 25.Madariaga D, Martínez-Sáez N, Somovilla VJ, Coelho H, Valero-Gónzalez J, Castro-López J, Asensio JL, Jiménez-Barbero J, Busto JH, Avenoza A, Marcelo F, Hurtado-Guerrero R, Corzana F, Peregrina JM. ACS Chem. Biol. 2015;10:747–756. doi: 10.1021/cb500855x. [DOI] [PubMed] [Google Scholar]
- 26.Sousa BL, Silva Filho JC, Kumar P, Pereira RI, Łyskowski A, Rocha BAM, Delatorre P, Bezerra GA, Nagano CS, Gruber K, Cavada BS. Int. J. Biochem. Cell Biol. 2015;59:103–110. doi: 10.1016/j.biocel.2014.12.002. [DOI] [PubMed] [Google Scholar]
- 27.Lescar J, Sanchez J-F, Audfray A, Coll J-L, Breton C, Mitchell EP, Imberty A. Glycobiology. 2007;17:1077–1083. doi: 10.1093/glycob/cwm077. [DOI] [PubMed] [Google Scholar]
- 28.Babino A, Tello D, Rojas A, Bay S, Osinaga E, Alzari PM. FEBS Lett. 2003;536:106–110. doi: 10.1016/s0014-5793(03)00037-1. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
miscellaneous_information
