Abstract
Spermatozoon represents a very special cell type in human body, and glycosylation plays essential roles in its whole life including spermatogenesis, maturation, capacitation, sperm–egg recognition, and fertilization. In this study, by mapping the most comprehensive N-glycoproteome of human spermatozoa using our recently developed site-specific glycoproteomic approaches, we show that spermatozoa contain a number of distinctive glycoproteins, which are mainly involved in spermatogenesis, acrosome reaction and sperm:oocyte membrane binding, and fertilization. Heavy fucosylation is observed on 14 glycoproteins mostly located at extracellular and cell surface regions in spermatozoa but not in other tissues. Sialylation and Lewis epitopes are enriched in the biological process of immune response in spermatozoa, while bisected core structures and LacdiNAc structures are highly expressed in acrosome. These data deepen our knowledge about glycosylation in spermatozoa and lay the foundation for functional study of glycosylation and glycan structures in male infertility.
Keywords: spermatozoa, glycoproteomics, intact glycopeptides, glycan structures, mass spectrometry
Abbreviations: ACN, acetonitrile; BP, biological processes; CC, cellular component; DAVID, database for annotation, visualization, and integrated discovery; DTT, dithiothreitol; FA, formic acid; FDR, false discovery rate; GO, gene ontology; IGP, intact glycopeptide; KEGG, Kyoto Encyclopedia of Genes and Genomes; MF, molecular function; RPLC, reverse-phase liquid chromatography; RT, room temperature; TFA, trifluoroacetic acid
Graphical Abstract
Highlights
-
•
A precision site-specific glycoproteome is documented in human spermatozoa.
-
•
Distinctive glycoproteins and heavy fucosylation are detected in spermatozoa.
-
•
Sialylation and Lewis epitopes are related to immune response of spermatozoa.
-
•
Bisected core structures and LacdiNAc are enriched on acrosome of spermatozoa.
In Brief
An in-depth N-glycoproteome map of human spermatozoa has been established by interpreting precise glycan structures at each glycosite, which allowed revealing a number of distinctive glycoproteins and heavily fucosylated glycans in spermatozoa. Sialylation and Lewis epitopes were enriched in the biological process of immune response, and highly expressed bisected core structure and LacdiNAc were detected in spermatozoa acrosome. These findings lay the foundation for subsequent functional studies of glycosylation and glycan structures in spermatozoa and male reproduction diseases.
Glycosylation is an important co- and posttranslational modification of proteins, playing fundamental roles in various biological processes (BP) and cellular activities, such as adhesion, interactions, structural and functional regulation (1, 2). Glycosylation can be further sorted into N-glycosylation (normally N-X-S/T and rarely N-X-C/V, X≠P) (3, 4), O-glycosylation (Ser/Thr residues via N-acetylgalactosamine), and rarely some other types (5, 6). Currently, N-glycosylation raises widespread concerns due to its huge potentials for current therapeutic targets and clinical biomarkers (7). While the complexity of glycosylation heterogeneity poses great challenges for comprehensive glycoproteome analysis.
Over the past decades, considerable progress has been made in the large-scale characterization of glycosylation events (8). Glycomic and deglycosylated glycoproteomic studies characterized liberated glycans or glycopeptides, which unfortunately lost information of glycosites or attached glycans (9, 10, 11). With the demand of knowing detailed information between glycosites and their corresponding glycans, intact glycopeptide (IGP) approaches have been developed to identify glycoproteins with site-specific glycan information in biological samples (5, 12, 13). However, most developed IGP analysis approaches assign compositions from the glycan database, which neither identify functional glycan structures nor distinguish different isomeric structures (13, 14, 15). Recently, a new IGP analysis software termed StrucGP developed by our team has been put forward to interpret precise glycan structures and identify new/rare glycans, as well as distinguish glycan isomeric glycoforms without relying on glycan database (16). This approach provides valuable insights into the interdependence of glycan structures and glycoprotein function.
Spermatozoon carries the male genetic information and works to complete its mission for successful fertilization (17). Glycosylation has been reported to accompany the whole process of gene transportation during accomplishment of fertilization task (18), such as spermatogenesis, maturation (19), capacitation (20, 21), sperm–egg recognition, and fertilization (22, 23). Spermatozoa surface possess glycoconjugates and glycocalyx for functioning in immune response, capacitation, and subsequent fertilization (24). Current glycomic and glycoproteomic studies on human spermatozoa have performed on either liberated glycan compositions or deglycopeptides, which cannot obtain the corresponding information between glycan structures and glycopeptides (25, 26). Consequently, the functional interactions of glycan structures and glycoproteins keep largely mysterious in human spermatozoa.
In this study, we applied our recently developed site-specific glycoproteomic method combined with 2D-LC-MS/MS analysis for in-depth characterization of glycan structures at each glycosite of glycoproteins in human spermatozoa. These sperm glycoproteomic data were compared with previously published glycoproteome and glycome data to determine the glycoproteins and glycan structures distinctively expressed in spermatozoa. Finally, the association of glycan substructures with different BP and cellular localizations was systematically explored to understand the structural function of glycans and glycoproteins in human spermatozoa.
Experimental Procedures
Experimental Design and Statistical Rationale
Spermatozoa proteins from ten healthy human donors were extracted and digested by trypsin individually, and IGPs were then enriched using hydrophilic cotton columns. Ten glycopeptide samples were pooled into one sample with the equal amount prior to separation into 12 fractions via high-pH reverse-phase liquid chromatography (RPLC) and subsequently subjected to triplicate LC-MS/MS analyses per fraction (Fig. 1A). Hence, a total of 36 LC-MS raw data (12 fractions x 3 runs/fraction) were generated for IGP identification using StrucGP. The false discovery rate of less than 1% (FDR<1%) was required for both peptide and glycan portions of IGPs. Functional analysis of glycoproteins, including gene ontology (GO) enrichment, and KEGG and reactome pathway used p value less than 0.05 as the significance.
Sample Collection
Human semen was collected from healthy donors with the age between 27 and 34 years old (n = 10), according to the World Health Organization 2010 recommendations (27), in the Northwest Women's and Children's Hospital, Shaanxi Province, China. This study was approved by the Ethics Committee of Northwest Women's and Children's Hospital and Northwest University and abided by the Declaration of Helsinki principles. The clinical parameters of semen from ten healthy donors used in the present study are given in supplemental Table S1. Semen after 30 min liquefaction was centrifuged at 2000g, 4 °C for 15 min to separate spermatozoa (bottom pellets) from seminal plasma (upper supernatant). Spermatozoa were washed three times in phosphate-buffered saline (PBS, pH 7.4).
Protein Digestion and Desalination
The collected human spermatozoa samples were denatured individually in 8 M urea/1 M NH4HCO3 and went through ultrasonication on ice by Ultrasonic Cell Distribution System. Thereafter, denatured samples were centrifuged at 15,000g for 20 min, and supernatant was collected for protein concentration measurement by BCA reagent (Beyotime). Spermatozoa proteins (1 mg) were reduced by 5 mM dithiothreitol (DTT) at 37 °C for 1 h with gentle shaking, then alkylated by 15 mM iodoacetamide at room temperature (RT) for 30 min in the dark. Subsequently, another 2.5 mM DTT was added and incubated for 10 min at RT. Protein samples were diluted twofold with deionized water and digested by sequencing grade trypsin (protein: enzyme, 100:1, w/w; Promega) at 37 °C for 2 h with gentle shaking as the first digestion. Afterward, samples were diluted fourfold with deionized water and sequencing grade trypsin (protein: enzyme, 100:1, w/w; Promega) was used to digest proteins into peptides again by incubation at 37 °C with gentle shaking overnight, namely the second digestion. The samples were acidified with trifluoroacetic acid (TFA) and centrifuged at 15,000g for 15 min to remove any particulate matter. The digested peptides were desalted with C18 column (Waters) and eluted with 50% acetonitrile (ACN)/0.1% TFA. The peptide concentration was measured by BCA reagent (Beyotime).
Enrichment of N-Linked Intact Glycopeptides
The desalted peptides mixtures eluted from C18 column were adjusted to a final solvent composition of 80% ACN/1%TFA. Three microgram cotton was pushed into a 200 μl pipet tip as in house cotton column. Cotton columns were washed ten times by ddH2O and sequentially conditioned ten times by 80% ACN/1%TFA via aspirating. Samples in 80% ACN/1%TFA were loaded onto cotton columns 20 to 30 times and washed ten times by 80% ACN/1%TFA. The IGPs bound to cotton column were eventually eluted in 0.1% formic acid (FA). The concentrations of enriched IGPs were estimated by Ultra Trace Ultraviolet Spectrophotometer (Denovix). Afterward, equal amount of enriched IGPs (20 μg) from different individuals were pooled and dried by vacuum concentration with an RVC 2-18 CDplus concentrator (Christ).
High-Performance Liquid Chromatography Fractionation
The dried IGPs were resuspended in 2% ACN/20 mM NH4COOH and fractionated through high pH reverse-phase high-performance liquid chromatography with Agilent RP Zorbax 300 Å Extend C18 column (250 mm × 4.6 mm, OD 5 μm) in an Agilent 1260 LC instrument. Intact glycopeptides were first separated into 96 fractions through a gradient of mobile phases A (2% ACN/5 mM NH4COOH, pH 10) and B (90% ACN/5 mM NH4COOH, pH 10) at a flow rate of 0.2 ml/min. A total of 120 min was used for gradient elution, 0 to 2% B for 7 min, 2 to 8% B for 35 min, 8 to 16% B for 35 min, 16 to 35% B for 10 min, 35 to 95% B for 5 min, 95 to 95% B for 28 min. Then, IGPs were integrated into 12 fractions and dried by vacuum centrifuging.
LC-MS/MS Analysis
Intact glycopeptides in each fraction underwent three LC-MS/MS runs on an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific). About 1 μg IGPs were separated by an Easy-nLC 1200 system with a 75 μm × 50 cm Acclaim PepMap-100 C18 analytical column protected by a 75 μm × 2 cm trapping column. The mobile phase flow rate was 0.2 μl/min and consisted of 0.1% FA in ddH2O (A) and 0.1% FA/80% ACN (B). A complete run of 240 min was set as follows: 3 to 9% B for 88 min, 9 to 31% B for 120 min, 31 to 40% B for 15 min, 40 to 99% B for 4 min, 99% B for 13 min. MS analysis was performed using Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific). The spray voltage (+) was set at 2300V. Orbitrap spectra (AGC 4 × 105) were collected from 375 to 2000 m/z at a resolution of 120K followed by data-dependent HCD-MS/MS (AGC 2 × 105, collected from 120 to 3000 m/z at a resolution of 30K, collision energy was 20% and 33%, respectively). An isolation window of 2.0 M/Z was used. Charge state screening enabled to reject unassigned and singly charged ions. A dynamic exclusion time of 20 s was set for each precursor ion after being selected once.
Intact Glycopeptide Identification
The identification of IGPs was performed by our newly developed software, StrucGP 1.0 (16). Briefly, all “Raw” format MS data were first converted to “mzML” format by Trans-Proteomic Pipeline (TPP, v5.0.0) (28). IGPs analyses were performed by StrucGP using the built-in glycan branch structure database from StrucGP 1.0 (16) and the UniProtKB of human protein databases (20,341 entries, downloaded from http://www.uniprot.org May 2020). The protein enzymatic digestion was set as trypsin with max two missed cleavage sites and potential glycosite-containing peptides were screened with the N-X-S/T motif (X is any amino acid except Proline). The carbamidomethylation (C, +57.0215 Da) was as a fixed modification, and oxidization (M, +15.9949 Da) as a dynamic modification. The mass tolerances for MS1 and MS2 were set at 10 ppm and 20 ppm, respectively. For the Y ions determination, an optional mass shift of ±1 Da or ±2 Da was allowed in addition to the 20 ppm mass tolerance in MS2. Finally, FDR evaluation of peptides and glycans (FDR < 1%) was determined by a decoy database and a decoy spectrum, respectively (16).
Bioinformatics Analysis
GO enrichment analysis was performed by the database for annotation, visualization, and integrated discovery (DAVID) (https://david.ncifcrf.gov/) to identify biological themes, including BP, cellular component (CC), and molecular function (MF) (29). Kyoto Encyclopedia of Genes and Genomes (KEGG) and reactome pathway analysis was performed by using ClueGO plug-in and Cluepedia of Cytoscape software (30). These pathway enrichment analyses were performed to search for the associated important pathway information and key glycoproteins (after correcting for multiple term testing using the two-sided hypergeometric test and procedure of Bonferroni−Hochberg). p value less than 0.05 was regarded as the significant pathway.
Results
In-Depth Glycoproteome Characteristics of Human Spermatozoa
With well-established site-specific glycoproteomic approaches coupled with 2D-LC-MS/MS (Fig. 1A), a total of 10,355 unique N-linked IGPs from 968 glycoproteins, consisting of 1489 glycosites and 719 glycans with distinct structures (292 glycan compositions) were screened out in human spermatozoa (supplemental Fig. S1 and supplemental Table S2), which is the most comprehensive glycoproteome map of human spermatozoa till now. Over 95% of glycoproteins (Fig. 1B) and 87% of glycopeptides (Fig. 1C) were overlapped among triplicate LC-MS analyses of a single fraction, indicating a reasonably high reproducibility of the mass spectrometry data. Based on the heat map of site-specific N-glycans from spermatozoa (Fig. 1D and supplemental Table S3), the majority of glycosites were occupied by high mannose (36.3% of PSMs) and complex glycans (29.2%), followed by a combination of three different types of glycans (11.5%). The top ten glycans appearing at different glycosites (based on PSMs) were mostly high mannose (N2H5, N2H8, N2H6, N2H7, N2H9, N2H4) and diantennary glycans (N4H5F1, N4H5F1S1, N4H5S2, N4H5F2, N4H5F3) (Fig. 1E). The majority of glycoproteins (71.8%) contained one glycosite (Fig. 1F), while other 273 glycoproteins (28.2%) contained two to ten glycosites.
Precise N-Glycan Structure Interpretation in Human Spermatozoa
As described above, a total of 719 precise glycan structures have been identified in human spermatozoa (supplemental Fig. S1). These glycans included high mannose, bi-, tri-, and tetra-antennary bisecting types, and bi-, tri-, and tetra-antennary core-fucosylated oligosaccharides terminated with Lewis structures (including Lewisx/a/y/b), as well as bi-, tri-, and tetraantennary core-fucosylated complex type N-glycans with antennae capped with sialic acid. Among the glycans on unique IGPs, 57.31% were complex glycans, followed by hybrid (17.87%) and oligomannose glycans (24.83%) (Fig. 2A).
These spermatozoa N-glycans comprised four types of core structures and 13 types of branch structures. Among 13 distinct branch structures, oligo-mannose (labeled as Hex) and LacNAc (HexNAc+Hex) accounted for 42.69% and 39.22% of unique IGPs, respectively. Sole HexNAc and HexNAc+Hex+Neu5Ac took up 20.30% and 20.04% of IGPs (Fig. 2B). Lewisy/b, sialylated Lewisx/a, and other remaining structures existed merely at relatively low abundances (Fig. 2B). Referring to four types of core structures, the common core structure (HexNAc2Hex3, 46.69%) and fucosylated core structure (38.55%) accounted for the predominance of all unique IGPs, while the bisected core structure with and without core fucosylation only occupied 6.63% and 8.12%, respectively (Fig. 2C).
Since 719 N-glycan structures were sorted into 292 different compositions in present study, lots of glycan compositions actually consisted of different isoforms (supplemental Table S4). Indeed, our data showed that up to ten distinct glycan structure isoforms can be discriminated from one single composition, such as N4H5F1S1 and N4H5F1 (Fig. 2D). Most glycans with isomers were hybrid and complex glycans. These glycan isoforms attached at the same peptide could be identified at the MS/MS level, like different isomers of the N4H5F1 glycan composition distinguished by feature B and Y ions in the MS/MS spectra (supplemental Fig. S2).
Distinctive Functional Glycoproteins Identified in Human Spermatozoa
Spermatozoa glycoproteins identified in this study were compared with previously published spermatozoa glycoproteome data, and glycoproteins in two N-linked glycoprotein databases—UniProtKB (http://www.uniprot.org) and N-glycositeAtlas (31). In current study, 968 identified glycoproteins covered 229 of 297 human spermatozoa glycoproteins published previously (25), and other 739 glycoproteins were characterized newly in human spermatozoa (Fig. 3A). Moreover, 660 glycoproteins coincided with known human glycoproteins in UniProtKB database, while other 308 glycoproteins had no reports in the database (Fig. 3B). Importantly, 212 glycoproteins were uniquely identified in spermatozoa, which have never been detected in other human tissues or cells using mass-spectrometry-based glycoproteomic approaches according to the N-GlycositeAtlas database (31) (Fig. 3B and supplemental Table S5). In-depth bioinformatic analysis of these 212 glycoproteins demonstrated that these proteins mainly participated in very specific BP of spermatozoa, such as binding of sperm to zona pellucida, reproductive process, spermatogenesis, spermatid development, sperm–egg recognition, fertilization (Fig. 3C). Similarly, based on the reactome pathway analysis, these glycoproteins were mainly involved in fertilization, reproduction, acrosome reaction, and sperm:oocyte membrane binding, as well as sperm motility and taxes (Fig. 3D).
Heavy Fucosylation in Human Spermatozoa
Heavy fucosylation was discovered in human spermatozoa. There were 52% of unique IGPs fucosylated in human spermatozoa (Fig. 4A), and 14 glycoproteins were modified by glycans with at least six fucoses per glycan (supplemental Table S6). Remarkably, up to ten fucoses per glycan were detected on glycosite N-374 of clusterin (Fig. 4A). Subsequent cellular component of GO enrichment revealed most glycoproteins with heavy fucosylation (above six fucoses per glycan) localizing on extracellular and cell surface regions (Fig. 4B). As a case study, four N-glycopeptides from clusterin, a glycoprotein that was mainly involved in sperm capacitation and immune tolerance in the female reproductive tract (32), were modified by heavy fucosylation (up to ten fucose per glycan) in human spermatozoa. The majority of glycans attached at all four N-glycosylation sites of clusterin were complex and hybrid glycans with Lewisx/a (30%), Lewisy/b (21%), and sialylated Lewisx/a (19%). The maximal fucoses per glycan and glycan structures on each glycosite of clusterin were displayed (supplemental Figs. S3–S6, and supplemental Table S7). Especially, the interpretation of glycan structures N7H8F10 on peptide:LANLTQGEDQYYLR of clusterin by spectrogram was drawn (supplemental Fig. S7). Intriguingly, no heavy fucosylation was identified on clusterin in other human tissues and body fluids. For example, maximal three fucoses per glycan were identified in the liver (33), and only one fucose per glycan was identified in serum (34) (Fig. 4D). Specially, maximal five fucoses were exhibited on intact glycopeptides of clusterin in seminal plasma (35) (Fig. 4C), although heavy fucosylation (up to nine) has also been identified in seminal plasma glycans without peptides information (36). These data indicated that heavy fucosylation might be a distinctive feature of spermatozoa.
Enriched Sialylation and Lewis Epitopes in the BP of Immune Response
A functional annotation of glycoproteins from human spermatozoa was performed using DAVID and divided into different categories: “BP,” “cellular components,” and “MFs” (Fig. 5A). Although there are different categories for each of the divisions made, the most important were those involved in immune response, cell motility, spermatogenesis, fertilization, sperm–egg recognition, binding of sperm to zona pellucida in case of BP (Fig. 5A); extracellular region, plasma membrane, cytoplasmic vesicle, Golgi apparatus, lysosome, nuclear inner membrane in case of cellular components; cation binding, hydrolase activity, peptidase activity, ion channel activity, peptide binding, and vitamin binding for MFs (Fig. 5A).
To predict the relevant molecular interaction, reaction, and relation networks of glycoproteins in human spermatozoa, the KEGG pathway analysis was conducted using ClueGO plug-in from Cytoscape software. The most important pathways enriched by spermatozoa glycoproteins were glycosaminoglycan degradation, calcium signaling pathway, protein processing in endoplasmic reticulum, lysosome, notch signaling pathway, cell adhesion molecules, other glycan degradation, glycosylphosphatidylinositol-anchor biosynthesis, various types of N-glycan biosynthesis, IL-17 signaling pathway, MAPK signaling pathway, regulation of actin cytoskeleton, PI3K-Akt signaling pathway, ECM–receptor interaction, TGF-beta signaling pathway, galactose metabolism, cAMP signaling pathway (Fig. 5B).
To further explore the functional interactions between glycan structures and glycoproteins, we compared the core and branch glycan structures on glycoproteins from different BP subsequently. Branch structures with salic acid were expressed outstanding in the BP of immune response in comparison with other BP (Fig. 5C). Similarly, branch structures of Lewis structures were also displayed highly in the BP of immune response and sperm motility (Fig. 5C). Correspondingly, the fucosylated core structure was much higher on glycoproteins involved in immune response and sperm motility than other BP (Fig. 5D). The integrating information revealed that sialylation and Lewis structures may play special roles in the specific spermatozoa BP of immune response.
Highly Expressed Bisecting N-Glycans on Acrosome
According to different distributions of glycoproteins in human spermatozoa, we also compared the branch and core structures to gain insight into the demand of glycan structures in different cellular localization. Intriguingly, bisected core structure (with and without core fucosylation) was highly expressed in acrosome compared with the glycoproteins in other localization and total spermatozoa (Fig. 6A and supplemental Fig. S8). The presence of core bisecting N-glycans was known to simplify the branch structures by precluding further processing and elongation of N-glycans (37, 38). Within expectation, the complex branch structures such as Lewis and sialylation were significantly decreased on acrosome. Contrarily, simple branch structures of HexNAc and even the lack of one branch increased (Fig. 6, B and C). Besides, special branch structures of LacdiNAc (with and without fucosylation) were also increased in acrosome in comparison with total spermatozoa (Fig. 6, B and C). Interestingly, all 11 glycoproteins with LacdiNAc glycans in acrosome were also the glycoproteins with bisected core structures (supplemental Table S8), and these glycoproteins mainly participated in the BP of fertilization, sperm–egg recognition and fusion, acrosome reaction, etc. (Fig. 6D). All these data indicated that the LacdiNAc structures might work together with bisected core structures for special functions of acrosome.
Discussion
As an important generative cell in males, spermatozoon owns many distinct features comparing with other types of human cells, likely progressive movement, survival in allogeneic immune response, and acrosome structure. These features are prerequisites for spermatozoa to accomplish their mission of fertilization. Genomics and proteomics studies have performed broadly on spermatozoa to understand the mechanism of spermatogenesis and further reproduction (39, 40, 41). In addition, glycomic and glycoproteomic studies have indicated the accompanying of glycosylation in the whole process of spermatogenesis, maturation, and fertilization (24). Nevertheless, these studies focused on either a small number of glycoproteins without glycan information or speculative glycan structures without corresponding glycosite (25, 26). This makes it difficult to further investigate the functions of N-glycosylation in depth, such as the mediation of glycan structures on the specific glycosites of glycoprotein function. In current study, we applied a site-specific glycoproteomic approach (16) to characterize glycoproteins with both glycosites and attached glycan structures precisely in human spermatozoa. Furthermore, 2D-LC-MS/MS ensured the landscape of glycoprotein identification even with low abundance. Based on the above strategies, a total of 10,355 IGPs with 719 glycan structures were identified from 968 spermatozoa glycoproteins, representing the largest and most comprehensive spermatozoa glycoproteome map till now. These achievements possessed distinct advantages in comparison with 554 N-glycosylation sites and 297 glycoproteins identified previously with loss of glycan information in human spermatozoa (25). Within our expectation, a portion of glycoproteins were uniquely identified in human spermatozoa, involving in the specific BP and pathways, likely spermatozoa development and fertilization. This further attested the comprehensiveness, depth, and reliability of glycoproteins identified in present study.
Heavy fucosylation has been detected in spermatozoa previously by using the glycomic approach (26), which unfortunately could not obtain the corresponding glycoproteins information. In this study, we are capable of detecting heavy fucosylation on specific glycoproteins precisely and further analyze their localizations and potential functions. It was indicated that glycoproteins modified by glycans with above six fucoses mostly localized on plasma membrane and extracellular region. Comparing with other tissues, such as human liver with maximal three fucoses per glycan, we realized the uniqueness of heavy fucosylation in human spermatozoa. Especially, we detected heavy fucosylation (up to ten fucoses) on clusterin in spermatozoa, which was in sharp contrast to few fucoses on clusterin in the serum (34) and liver (33), as well as in some mouse tissues based on IGPs (16). These findings indicated that N-glycans on clusterin might be tissue-specific. Since heavy fucosylation encompassed clusterin distinctively in human spermatozoa, the structural study of heavy fucosylation on clusterin function will be facilitated in human spermatozoa.
Since the corresponding glycan structure information has been obtained on glycoproteins of human spermatozoa, analysis of glycan structure regulation on glycoprotein function became possible during cellular activities and biological function (19). In present study, we discovered the high expression of sialylated branch structures and Lewis epitopes in the BP of immune response. Abundant sialylation was demonstrated to facilitate tolerance of female innate pattern recognition molecules when “foreign cells” spermatozoa passed through the onslaught of female immune factors for fertilization (20, 42). During this journey, appropriate sialylation acted as a mask for sperm antigens and protected spermatozoa from excessive phagocytosis by leucocytes (43). Lewis structures with core fucosylation contributed to block adaptive antigen directed against human sperm in both male and female reproductive systems (26). Coordinately, glycoproteins with heavy fucosylation (above six fucoses per glycan) were mainly localized on extracellular region and plasma membrane. This was consistent with complicated branch (Lewis structures), and core structures (fucosylated core structures) tend to be involved in the BP of immune response, which occurred in spermatozoa plasma membrane and extracellular region.
Different from the abundant common core structure and fucosylated core structure, bisecting N-glycans only occupied a minor portion of the total IGP profile in human spermatozoa. Surprisingly, significant highly expressed bisected core structures were detected on acrosome in comparison with total spermatozoa. Accordingly, the branch structures became shorter and rare LacdiNAc structures increased on acrosome as well. This agreed with the opinion that presence of bisected core structure prevented the formation of highly-branched species, such as the β1-6-GlcNAc (44, 45). Moreover, overexpression of bisected core structure was demonstrated to dramatically suppress α2-3-sialylation, but not α2-6-sialylation (38). Consistently, branch structures of all sialylated Lewisx/a, LacNAc, and LacdiNAc were significantly decreased, but not disappeared completely on acrosome glycoproteins. In addition, bisected core structures were demonstrated to be on sperm surface (26), likely enriched on acrosome in the present study. Based on all above analyses, highly expressed bisected core structures, along with simple and special branch structures on acrosome, may make a significant contribution in spermatozoa function.
It should be emphasized that the left/right branches of glycan structures were not actually distinguished using StrucGP v1.0 (16), and the structural interpretation of fucose-containing glycans might be affected by the rearrangement of the fucose residues in these glycans during mass-spectrometry-based fragmentation (46). In addition, the chemical modifications of N-glycans such as phosphorylation and sulfation, which might exist on sperm N-glycans (47, 48), were not considered and therefore could not be identified in this study. Also, additional experimental data are still required for further validation of the distinctive glycosylation and subglycan structures, as well as their biological functions in spermatozoa.
Conclusion
Glycosylation participates in the whole process of spermatogenesis, maturation, capacitation, sperm–egg recognition, and fertilization. However, we are only beginning to truly understand what glycosylation looks like and how glycosylation functions in spermatozoa. In the present study, an in-depth glycoproteome map of human spermatozoa has been established by determining the precision glycan structures at each glycosylation site, which overcomes the drawbacks of glycomics and traditional glycoproteomics focusing on either released glycan compositions or deglycosylated peptides. The IGPs based on glycan structure interpretation allowed to characterize the distinctive glycosylation in spermatozoa systematically and at meticulous levels. These include many glycoproteins and heavily fucosylated glycans uniquely identified in the whole spermatozoa, as well as the special glycan characteristics in specific organisms or BP of spermatozoa. These achievements will be greatly beneficial for further in-depth functional investigation of glycan structures and glycoproteins in human spermatozoa, as well as the glycosylation regulation of sperm dysfunction and male reproductive diseases.
Data Availability
The mass spectrometry data, and all spectra for identified glycopeptides in human spermatozoa, have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository (49) with the dataset identifier PXD026649.
Supplemental data
This article contains supplemental data.
Conflict of interest
The authors declare no competing interests.
Acknowledgments
This work was supported by the National Key R&D Program of China (2019YFA0905200), National Natural Science Foundation of China (Grant No. 91853123, 81773180, and 21705127), and Natural Sciences Foundation of Shaanxi Province (Grant No. 2021JQ-447).
Author contributions
S. S. and M. X. conceptualization; M. X., B. Z., Y. S., J. W., Z. C., and J. S. data curation; S. Y., M. X., and Y. X. formal analysis; S. S. and M. X. funding acquisition; S. Y., M. X., Y. X., and C. L. investigation; S. S. and M. X. methodology; W. S. and J. S. resources; M. X., B. Z., Y. S., J. W., Z. C., and J. S. software; S. S. supervision; M. X. writing—original draft; S. S. and M. X. writing—review and editing.
Contributor Information
Juanzi Shi, Email: shijuanzi@126.com.
Shisheng Sun, Email: suns@nwu.edu.cn.
Supplemental Data
References
- 1.Sprovieri P., Martino G. The role of the carbohydrates in plasmatic membrane. Physiol. Res. 2018;67:1–11. doi: 10.33549/physiolres.933593. [DOI] [PubMed] [Google Scholar]
- 2.Moremen K.W., Tiemeyer M., Nairn A.V. Vertebrate protein glycosylation: Diversity, synthesis and function. Nat. Rev. Mol. Cell Biol. 2012;13:448–462. doi: 10.1038/nrm3383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zielinska D.F., Gnad F., Wisniewski J.R., Mann M. Precision mapping of an in vivo N-glycoproteome reveals rigid topological and sequence constraints. Cell. 2010;141:897–907. doi: 10.1016/j.cell.2010.04.012. [DOI] [PubMed] [Google Scholar]
- 4.Sun S., Zhang H. Identification and validation of atypical N-glycosylation sites. Anal. Chem. 2015;87:11948–11951. doi: 10.1021/acs.analchem.5b03886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shu Q., Li M., Shu L., An Z., Wang J., Lv H., Yang M., Cai T., Hu T., Fu Y., Yang F. Large-scale identification of N-linked intact glycopeptides in human serum using HILIC enrichment and spectral library search. Mol. Cell. Proteomics. 2020;19:672–689. doi: 10.1074/mcp.RA119.001791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cervoni G.E., Cheng J.J., Stackhouse K.A., Heimburg-Molinaro J., Cummings R.D. O-glycan recognition and function in mice and human cancers. Biochem. J. 2020;477:1541–1564. doi: 10.1042/BCJ20180103. [DOI] [PubMed] [Google Scholar]
- 7.Wang M., Zhu J., Lubman D.M., Gao C. Aberrant glycosylation and cancer biomarker discovery: A promising and thorny journey. Clin. Chem. Lab. Med. 2019;57:407–416. doi: 10.1515/cclm-2018-0379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Xiao H., Suttapitugsakul S., Sun F., Wu R. Mass spectrometry-based chemical and enzymatic methods for global analysis of protein glycosylation. Acc. Chem. Res. 2018;51:1796–1806. doi: 10.1021/acs.accounts.8b00200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jensen P.F., Comamala G., Trelle M.B., Madsen J.B., Jorgensen T.J., Rand K.D. Removal of N-linked glycosylations at acidic pH by PNGase A facilitates hydrogen/deuterium exchange mass spectrometry analysis of N-linked glycoproteins. Anal. Chem. 2016;88:12479–12488. doi: 10.1021/acs.analchem.6b03951. [DOI] [PubMed] [Google Scholar]
- 10.Szigeti M., Bondar J., Gjerde D., Keresztessy Z., Szekrenyes A., Guttman A. Rapid N-glycan release from glycoproteins using immobilized PNGase F microcolumns. J. Chromatogr. B Analyt. Technol. Biomed. Life Sci. 2016;1032:139–143. doi: 10.1016/j.jchromb.2016.02.006. [DOI] [PubMed] [Google Scholar]
- 11.Yamamoto S., Ueda M., Kasai M., Ueda Y., Kinoshita M., Suzuki S. A fast and convenient solid phase preparation method for releasing N-glycans from glycoproteins using trypsin- and peptide-N-glycosidase F (PNGase F)-impregnated polyacrylamide gels fabricated in a pipette tip. J. Pharm. Biomed. Anal. 2020;179:112995. doi: 10.1016/j.jpba.2019.112995. [DOI] [PubMed] [Google Scholar]
- 12.Wang S., Qin H., Mao J., Fang Z., Chen Y., Zhang X., Hu L., Ye M. Profiling of endogenously intact N-linked and O-linked glycopeptides from human serum using an integrated platform. J. Proteome Res. 2020;19:1423–1434. doi: 10.1021/acs.jproteome.9b00592. [DOI] [PubMed] [Google Scholar]
- 13.Liu M.Q., Zeng W.F., Fang P., Cao W.Q., Liu C., Yan G.Q., Zhang Y., Peng C., Wu J.Q., Zhang X.J., Tu H.J., Chi H., Sun R.X., Cao Y., Dong M.Q., et al. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat. Commun. 2017;8:438. doi: 10.1038/s41467-017-00535-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Polasky D.A., Yu F., Teo G.C., Nesvizhskii A.I. Fast and comprehensive N- and O-glycoproteomics analysis with MSFragger-Glyco. Nat. Methods. 2020;17:1125–1132. doi: 10.1038/s41592-020-0967-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bern M., Kil Y.J., Becker C. Byonic: Advanced peptide and protein identification software. Curr. Protoc. Bioinformatics. 2012 doi: 10.1002/0471250953.bi1320s40. Chapter 13:Unit13.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shen J., Jia L., Dang L., Su Y., Zhang J., Xu Y., Zhu B., Chen Z., Wu J., Lan R., Hao Z., Ma C., Zhao T., Gao N., Bai J., et al. StrucGP: De novo structural sequencing of site-specific N-glycan on glycoproteins using a modularization strategy. Nat. Methods. 2021;18:921–929. doi: 10.1038/s41592-021-01209-0. [DOI] [PubMed] [Google Scholar]
- 17.Aitken R.J., Henkel R.R. Sperm cell biology: Current perspectives and future prospects. Asian J. Androl. 2011;13:3–5. doi: 10.1038/aja.2010.155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cheon Y.P., Kim C.H. Impact of glycosylation on the unimpaired functions of the sperm. Clin. Exp. Reprod. Med. 2015;42:77–85. doi: 10.5653/cerm.2015.42.3.77. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Tecle E., Gagneux P. Sugar-coated sperm: Unraveling the functions of the mammalian sperm glycocalyx. Mol. Reprod. Dev. 2015;82:635–650. doi: 10.1002/mrd.22500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ma F., Wu D., Deng L., Secrest P., Zhao J., Varki N., Lindheim S., Gagneux P. Sialidases on mammalian sperm mediate deciduous sialylation during capacitation. J. Biol. Chem. 2012;287:38073–38079. doi: 10.1074/jbc.M112.380584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Liu M. Capacitation-associated glycocomponents of mammalian sperm. Reprod. Sci. 2016;23:572–594. doi: 10.1177/1933719115602760. [DOI] [PubMed] [Google Scholar]
- 22.Redgrove K.A., Nixon B., Baker M.A., Hetherington L., Baker G., Liu D.Y., Aitken R.J. The molecular chaperone HSPA2 plays a key role in regulating the expression of sperm surface receptors that mediate sperm-egg recognition. PLoS One. 2012;7 doi: 10.1371/journal.pone.0050851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Uchida H., Maruyama T., Nishikawa-Uchida S., Miyazaki K., Masuda H., Yoshimura Y. Glycodelin in reproduction. Reprod. Med. Biol. 2013;12:79–84. doi: 10.1007/s12522-013-0144-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lan R., Xin M., Hao Z., You S., Xu Y., Wu J., Dang L., Zhang X., Sun S. Biological functions and large-scale profiling of protein glycosylation in human semen. J. Proteome Res. 2020;19:3877–3889. doi: 10.1021/acs.jproteome.9b00795. [DOI] [PubMed] [Google Scholar]
- 25.Wang G., Wu Y., Zhou T., Guo Y., Zheng B., Wang J., Bi Y., Liu F., Zhou Z., Guo X., Sha J. Mapping of the N-linked glycoproteome of human spermatozoa. J. Proteome Res. 2013;12:5750–5759. doi: 10.1021/pr400753f. [DOI] [PubMed] [Google Scholar]
- 26.Pang P.C., Tissot B., Drobnis E.Z., Sutovsky P., Morris H.R., Clark G.F., Dell A. Expression of bisecting type and Lewisx/Lewisy terminated N-glycans on human sperm. J. Biol. Chem. 2007;282:36593–36602. doi: 10.1074/jbc.M705134200. [DOI] [PubMed] [Google Scholar]
- 27.Lu J.C., Huang Y.F., Lü N.Q. WHO laboratory manual for the examination and processing of human semen: Its applicability to andrology laboratories in China. Zhonghua Nan Ke Xue. 2010;16:867–871. [PubMed] [Google Scholar]
- 28.Deutsch E.W., Mendoza L., Shteynberg D., Slagel J., Sun Z., Moritz R.L. Trans-Proteomic Pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics. Proteomics Clin. Appl. 2015;9:745–754. doi: 10.1002/prca.201400164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Huang da W., Sherman B.T., Lempicki R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 30.Otasek D., Morris J.H., Boucas J., Pico A.R., Demchak B. Cytoscape automation: Empowering workflow-based network analysis. Genome Biol. 2019;20:185. doi: 10.1186/s13059-019-1758-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sun S., Hu Y., Ao M., Shah P., Chen J., Yang W., Jia X., Tian Y., Thomas S., Zhang H. N-GlycositeAtlas: A database resource for mass spectrometry-based human N-linked glycoprotein and glycosylation site mapping. Clin. Proteomics. 2019;16:35. doi: 10.1186/s12014-019-9254-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Janiszewska E., Kratz E.M. Could the glycosylation analysis of seminal plasma clusterin become a novel male infertility biomarker? Mol. Reprod. Dev. 2020;87:515–524. doi: 10.1002/mrd.23340. [DOI] [PubMed] [Google Scholar]
- 33.Zhao T., Jia L., Li J., Ma C., Wu J., Shen J., Dang L., Zhu B., Li P., Zhi Y., Lan R., Xu Y., Hao Z., Chai Y., Li Q., et al. Heterogeneities of site-specific N-glycosylation in HCC tumors with low and high AFP concentrations. Front. Oncol. 2020;10:496. doi: 10.3389/fonc.2020.00496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sun S., Hu Y., Jia L., Eshghi S.T., Liu Y., Shah P., Zhang H. Site-specific profiling of serum glycoproteins using N-linked glycan and glycosite analysis revealing atypical N-glycosylation sites on albumin and α-1B-glycoprotein. Anal. Chem. 2018;90:6292–6299. doi: 10.1021/acs.analchem.8b01051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Saraswat M., Joenvaara S., Tomar A.K., Singh S., Yadav S., Renkonen R. N-glycoproteomics of human seminal plasma glycoproteins. J. Proteome Res. 2016;15:991–1001. doi: 10.1021/acs.jproteome.5b01069. [DOI] [PubMed] [Google Scholar]
- 36.Pang P.C., Tissot B., Drobnis E.Z., Morris H.R., Dell A., Clark G.F. Analysis of the human seminal plasma glycome reveals the presence of immunomodulatory carbohydrate functional groups. J. Proteome Res. 2009;8:4906–4915. doi: 10.1021/pr9001756. [DOI] [PubMed] [Google Scholar]
- 37.Link-Lenczowski P., Bubka M., Balog C.I.A., Koeleman C.A.M., Butters T.D., Wuhrer M., Litynska A. The glycomic effect of N-acetylglucosaminyltransferase III overexpression in metastatic melanoma cells. GnT-III modifies highly branched N-glycans. Glycoconj. J. 2018;35:217–231. doi: 10.1007/s10719-018-9814-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lu J., Isaji T., Im S., Fukuda T., Kameyama A., Gu J. Expression of N-acetylglucosaminyltransferase III suppresses alpha2,3-sialylation, and its distinctive functions in cell migration are attributed to alpha2,6-sialylation Levels. J. Biol. Chem. 2016;291:5708–5720. doi: 10.1074/jbc.M115.712836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nowicka-Bauer K., Kurpisz M. Current knowledge of the human sperm proteome. Expert Rev. Proteomics. 2013;10:591–605. doi: 10.1586/14789450.2013.846221. [DOI] [PubMed] [Google Scholar]
- 40.Bell A.D., Mello C.J., Nemesh J., Brumbaugh S.A., Wysoker A., McCarroll S.A. Insights into variation in meiosis from 31,228 human sperm genomes. Nature. 2020;583:259–264. doi: 10.1038/s41586-020-2347-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jodar M., Soler-Ventura A., Oliva R., Molecular Biology of Reproduction and Development Research Group Semen proteomics and male infertility. J. Proteomics. 2017;162:125–134. doi: 10.1016/j.jprot.2016.08.018. [DOI] [PubMed] [Google Scholar]
- 42.Varki A., Gagneux P. Multifarious roles of sialic acids in immunity. Ann. N. Y. Acad. Sci. 2012;1253:16–36. doi: 10.1111/j.1749-6632.2012.06517.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ma X., Pan Q., Feng Y., Choudhury B.P., Ma Q., Gagneux P., Ma F. Sialylation facilitates the maturation of mammalian sperm and affects its survival in female uterus. Biol. Reprod. 2016;94:123. doi: 10.1095/biolreprod.115.137810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kizuka Y., Taniguchi N. Enzymes for N-glycan branching and their genetic and nongenetic regulation in cancer. Biomolecules. 2016;6:25. doi: 10.3390/biom6020025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Nakano M., Mishra S.K., Tokoro Y., Sato K., Nakajima K., Yamaguchi Y., Taniguchi N., Kizuka Y. Bisecting GlcNAc is a general suppressor of terminal modification of N-glycan. Mol. Cell. Proteomics. 2019;18:2044–2057. doi: 10.1074/mcp.RA119.001534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mucha E., Lettow M., Marianski M., Thomas D.A., Struwe W.B., Harvey D.J., Meijer G., Seeberger P.H., Helden G., Pagel K. Fucose migration in intact protonated glycan ions: A universal phenomenon in mass spectrometry. Angew. Chem. Int. Ed. Engl. 2018;57:7440–7443. doi: 10.1002/anie.201801418. [DOI] [PubMed] [Google Scholar]
- 47.Young S.A., Aitken J., Baker M.A. Phosphorylation of izumo1 and its role in male infertility. Asian J. Androl. 2015;17:708–710. doi: 10.4103/1008-682X.156119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Dutta S., Aoki K., Doungkamchan K., Tiemeyer M., Bovin N., Miller D.J. Sulfated Lewis A trisaccharide on oviduct membrane glycoproteins binds bovine sperm and lengthens sperm lifespan. J. Biol. Chem. 2019;294:13445–13463. doi: 10.1074/jbc.RA119.007695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Vizcaino J.A., Cote R.G., Csordas A., Dianes J.A., Fabregat A., Foster J.M., Griss J., Alpi E., Birim M., Contell J., O'Kelly G., Schoenegger A., Ovelleiro D., Perez-Riverol Y., Reisinger F., et al. The PRoteomics IDEntifications (PRIDE) database and associated tools: Status in 2013. Nucleic Acids Res. 2013;41:D1063–D1069. doi: 10.1093/nar/gks1262. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mass spectrometry data, and all spectra for identified glycopeptides in human spermatozoa, have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository (49) with the dataset identifier PXD026649.