Abstract
The coronavirus disease 2019 (COVID-19) pandemic has led to worldwide efforts to understand the biological traits of the newly identified human coronavirus (HCoV-19) virus. In this mass spectrometry (MS)-based study, we reveal that out of 21 possible glycosites in the HCoV-19 spike protein (S protein), 20 are completely occupied by N-glycans, predominantly of the oligomannose type. All seven glycosylation sites in human angiotensin I converting enzyme 2 (hACE2) were found to be completely occupied, mainly by complex N-glycans. However, glycosylation did not directly contribute to the binding affinity between HCoV-19 S protein and hACE2. Additional post-translational modification (PTM) was identified, including multiple methylated sites in both proteins and multiple sites with hydroxylproline in hACE2. Refined structural models of HCoV-19 S protein and hACE2 were built by adding N-glycan and PTMs to recently published cryogenic electron microscopy structures. The PTM and glycan maps of HCoV-19 S protein and hACE2 provide additional structural details for studying the mechanisms underlying host attachment and the immune response of HCoV-19, as well as knowledge for developing desperately needed remedies and vaccines.
Keywords: N-glycosylation, COVID-19, Spike protein, hACE2, Structure
1. Introduction
The current coronavirus pandemic, which has been ongoing since December 2019 (coronavirus disease 2019 (COVID-19)), has become a global health emergency [1], [2], [3]. The disease is caused by a newly identified beta-coronavirus, human coronavirus 2019 (HCoV-19) (also known as human coronavirus 2019 (SARS-CoV-2)), which is closely related to the SARS-CoV. Like SARS, HCoV-19 causes lower respiratory tract infection (LRTI), which may eventually progress to atypical pneumonia, requiring intensive care and mechanical ventilation [4], [5], [6]. Compared with SARS-CoV, HCoV-19 is more contagious and has infected far more people worldwide, thus causing substantially more casualties [4], [5]. Although possible viral reservoirs including bats and pangolins have been suggested as sources of HCoV-19 [3], [7], more investigation is needed to ascertain its source and routes of zoonotic transmission [8], [9], [10]. To date, no vaccine or remedy specific for coronaviruses infection—including that of HCoV-19—is available.
Similar to SARS-CoV, the virion surface spike glycoprotein encoded by the HCoV-19 S gene is essential for target cell attachment and fusion processes [11], [12], [13]. Given the importance of the spike protein (S protein) in COVID-19 pathogenesis and in its potential immunogenicity for vaccine development, global efforts have been made to elucidate the structure of the S protein, beginning shortly after the first HCoV-19 sequence was published [1], [13], [14]. The HCoV-19 S glycoprotein is a 1273-amino acid precursor polypeptide; it can be cleaved by host cell proteases (cathepsin L, TMPRSS2) into the S1 fragment, which contains the receptor binding domain (RBD) to attach host receptor human angiotensin I converting enzyme 2 (hACE2), and the S2 fragment, which is responsible for the subsequent membrane fusion [1], [12], [13]. The S protein is predicted to have a cleavable N-terminal signal sequence (1–15), which presumably directs the protein toward the endoplasmic reticulum (ER) for extensive glycan decoration prior to virion packing.
Glycosylation is one of the most prominent post-translational modifications (PTMs) in many viral spike or envelope proteins, and has been shown to mediate host attachment, immune response, and virion packaging and budding [15], [16], [17], [18], [19], [20], [21], [22], [23], [24]. In particular, the binding of coronavirus S proteins to their respective receptors has been shown to be mediated by viral oligomannose N-glycans [19], [20], [23]. In addition, C-type lectin dendritic cell-specific ICAM-3-grabbing non-integrin 1 (DC-SIGN) and liver/lymph node-specific ICAM-3-grabbing non-integrin (L-SIGN) can enhance viral entry via their binding to the S protein glycans [25], [26]. Therefore, numerous antiviral strategies have been designed to interfere with protein glycosylation or glycan-based interaction [21], [24], [27], [28]. Glycosylation is also considered to be a key aspect in the development of an effective vaccine [29], [30].
Like the SARS-CoV S protein, which contains 23 N-linked glycosylation sequons (N-X-S/T, X ≠ P), the HCoV-19 S protein is predicted to host 22 sequons per protomer or 66 per trimer [13]. However, the potential glycosite pattern in the S1 of the HCoV-19 S protein is different from that of SARS-CoV, while the glycosylation sites in the S2 region are significantly conserved between HCoV-19 and SARS-CoV. Such a difference in the S1 glycosylation pattern may be associated with the fact that the biological and clinical characteristics of HCoV-19 are profoundly different from those of other coronaviruses. In this study, we report a comprehensive N-glycosylation profile—as well as other PTMs—of the HCoV-19 S protein and hACE2, elucidated by high-resolution mass spectrometry (MS) analyses. Based on these analyses, we reinterpret the current HCoV-19 S protein structural model to highlight important glycan features related to COVID-19 pathogenesis. Nonetheless, we demonstrate that the binding of the HCoV-19 S protein and hACE2 is independent of their glycosylation status.
2. Material and methods
2.1. Expression and purification of HCoV-19 spike ectodomain and hACE2
HCoV-19 S gene (GenBank number QHD43416.1) was synthesized (Genscript) with its codons optimized for insect cell expression. Its ectodomain (Val16-Pro1213) was cloned into a pFastBac vector (Life Technologies Inc., USA) with an N-terminal honeybee melittin signal peptide and C-terminal His6 and Flag tags. HCoV-19 S protein was expressed in Sf9 insect cells using the Bac-to-Bac system (Life Technologies Inc.) and harvested from the cell culture medium; this was followed by a purification procedure using a Ni-nitrilotriacetic acid (NTA) column and Superdex 200 gel filtration column (GE Healthcare, UK) in tandem. The extracellular domain of hACE2 (Gln18-Ser740, NP_068576.1) with a C-terminal Fc tag was expressed in HEK 293 cells and was purified by protein A sepharose beads (GE Healthcare).
2.2. Sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE) analysis
To test the glycosylation status of the HCoV-19 spike and the hACE2 protein, both proteins were deglycosylated by PNGase F (NEB, 1:50) overnight at 37 °C in phosphate-buffered saline (PBS). Both the glycosylated and deglycosylated forms of the HCoV-19 spike and the hACE2 protein were then analyzed by 15% sodium dodecyl sulfate–polyacrylamide gel electrophoresis (SDS-PAGE), followed by Coomassie blue staining.
2.3. Glycopeptide sample preparation
The proteins were digested into peptides according to Ref. [31]. In brief, hACE2 and the S protein were first reduced by 10 mmol∙L−1 dithiothreitol in 50 mmol∙L−1 ammonium bicarbonate (ABC) for 45 min at room temperature and then alkylated by iodoacetamide for 45 min at room temperature in the dark. The proteins were then cleaned using acetone precipitation and resuspended in 50 mmol∙L−1 ABC. The alkylated glycoproteins were then digested for 14 h at 37 °C using sequencing-grade trypsin, chymotrypsin, or endoproteinase Lys-C (all purchased from Promega, USA), with a protein:protease ratio of 1:50 in 50 mmol∙L−1 ABC. The peptides were desalted using hydrophilic–lipophilic balance columns (Waters, USA).
The peptide samples were separated into duplicates. The first duplicate was deglycosylated by PNGase F (NEB, 1:100) overnight at 37 °C in 50 mmol∙L−1 ABC prepared in pure H2O18. The second duplicate was used to enrich intact N-glycopeptides by hydrophilic interaction liquid chromatography (HILIC). In brief, GlycoWorks (Waters) cartridges were preconditioned with loading buffer comprising 15 mmol∙L−1 ammonium acetate and 0.1% trifluoroacetic acid (TFA) in 80% acetonitrile. Peptides in the loading buffer were applied to the cartridges and the unbound flow-through fraction was collected. The column was washed two times with loading buffer. Glycopeptides were eluted in 0.1% TFA. All peptide fractions were desalted by house-made C18 stage tips before liquid chromatography–tandem mass spectrometry (LC-MS/MS) analyses were performed.
2.4. Liquid chromatography–tandem mass spectrometry experiment
About 500 ng of peptide was analyzed using an Ultimate 3000 nanoflow liquid chromatography system (Thermo Scientific, USA) connected to a hybrid Q-Exactive HFX mass spectrometer (Thermo Scientific). The mass spectrometer was operated in data-dependent mode with full-scan MS spectrum, followed by MS2 scans recording the top 20 most intense precursors sequentially isolated for fragmentation using high-energy collision dissociation. The MS and MS/MS spectra were recorded using Xcalibur software 2.3 (Thermo Scientific). Detailed parameters for separation and MS acquisition can be found in Tables S1 and S2 in Appendix A, respectively.
2.5. Bioinformatics
The acquired MS raw files from the deglycosylated peptides were searched by MaxQuant (version 1.6.10.43; Max-Planck-Institute of Biochemistry, Germany) against the hACE2 sequence from UniProtKB and the HCoV-19 S sequence (YP_009724390.1_3) from the National Center for Biotechnology Information (NCBI). Cysteine carbamidomethylation was set as a fixed modification, while methionine oxidation and O18 deamidation on glutamic acid were set as variable modifications. Trypsin was set as having up to two missed cleavages. A mass tolerance of 15 and 4.5 ppm was set for the first and main search, respectively. For comprehensive PTM analysis, phosphorylation (S/T/Y), acetylation (K), methylation (K/R/E), succinylation (K), crotonylation (K), farnesylation (K/N-terminal (Nterm)), myristoylation (K/Nterm), palmitoylation or prenylation, glycosylphosphatidylinositol addition, and oxidation on proline were individually investigated in parallel database searches for respective variable modification. A peptide-level false discovery rate of 1% was set to filter the result. Confident identification of PTM was based on a localization probability of 99%. Site occupancy for each PTM was calculated by dividing the peak intensities of the modified peptides (MP) and corresponding non-modified peptides (NP) using the following equation: MP/(MP + NP) × 100.
To investigate the N-glycosylation forms, the acquired MS raw files from the HILIC experiments were searched by pGlyco (version 2.2.2, Chinese Academy of Sciences, China) [32] against the same hACE2 and HCoV-19 S sequences. Cysteine carbamidomethylation was set as a fixed modification, while methionine oxidation was set as a variable modification. Trypsin was set as having up to two missed cleavages. A mass tolerance of 5 and 15 ppm was set for the precursor and fragment mass tolerance, respectively. Potential glycan fragments within the MS2 spectra were annotated by the built-in pGlyco.gdb glycan structure database [32]. The precursor intensity of each glycopeptide was extracted by the MaxQuant feature-detection algorithm. All LC-MS/MS raw data were deposited in an iProX database with the following link: https://www.iprox.cn/page/PSV023.html;?url=1631583654953TDY8, password: th2J; password: B73w.
2.6. Structural model refinement
Glycan structures in protein databank (PDB) format were downloaded or predicted by GLYCAM-Web (https://dev.glycam.org/gp). N-linked glycan models of hACE2 and the trimeric S protein were made by manually adding the MS-identified glycan at each site based on previous models (PDB codes 6M18 for hACE2 and 6VXX for S protein) within Coot [33]. The modified residues within each model were generated with Coot [33]. In addition, models of the HCoV-19 S protein and hACE2 complex (PDB 6M0J) were used to show the location of the PTMs surrounding the binding area. All structural analyses were performed by PyMOL (Schrödinger Inc., USA).
2.7. Evaluation of binding of HCoV-19 S protein and hACE2 by bio-layer interferometry
Binding of the HCoV-19 S protein and hACE2 was measured by bio-layer interferometry (BLI) on an Octet Re96E (ForteBio, USA) interferometry system. In brief, the HCoV-19 S protein was immobilized using aminopropylsilane biosensors (18-5045; ForteBio). To evaluate the influence of the glycosylation of HCoV-19 S protein on the binding, the S protein was first incubated with either 1000 U∙mL−1 PNGase F (NEB) or deactivated PNGase F at 37 °C for 14 h. After washing by kinetic buffer (1× PBS, pH 7.4, 0.01% bovine serum albumin (BSA), and 0.002% Tween 20), the association between HCoV-19 S protein and hACE2 was measured for 180 s at 30 °C by exposing sensors to hACE2 in kinetic buffer at concentrations of 6.25, 12.5, 25, 50, 100, and 200 nmol∙L−1, respectively. After the binding phase, the sensors were exposed to 1× kinetic buffer for dissociation for 300 s at 30 °C. The signal baseline was subtracted before data fitting using the equimolar binding model. Mean association rate constant (k on) and dissociation rate constant (k off) were determined with a global fit model using all data. A parallel experiment was also performed using hACE2-loaded sensors incubated with HCoV-19 S protein solutions at different concentrations. Analogously, immobilized hACE2 was pretreated by either PNGase F or deactivated PNGase F to evaluate the influence of hACE2 glycosylation on binding. In all experiments, PNGase F was deactivated by heating at 75 °C for 10 min.
3. Results
3.1. Determination of N-linked glycosylation sites on HCoV-19 S protein and hACE2
As shown in Fig. 1 (a), PNGase F deglycosylation resulted in a decrease in the molecular weight of both HCoV-19 S protein and hACE2 on SDS-PAGE. Digestion by PNGase F releases N-linked glycans from Asn within the N-X-S/T motif, which also results in Asn deamidation by losing one hydrogen and one nitrogen while incorporating one oxygen derived from the solvent [34]. To perform a thorough survey of glycosylated sites, protease-digested peptides from both proteins were subjected to PNGase F deglycosylation in H2O18; this enables the incorporation of O18 and leads to a +2.98 Da mass increment, which was used to mark the glycosylated sites.
Analysis of the deglycosylated peptides from the S protein by LC-MS/MS confirmed the presence of 20 N-linked glycosylation sites, including N61, N74, N122, N165, N234, N282, N331, N343, N603, N616, N657, N709, N717, N801, N1074, N1098, N1134, N1158, N1173, and N1194 (Table 1 ). Two remaining N-X-S/T sites—the N17 and N149 sites—were not identified in any deglycosylated peptide. However, a peptide with glycosylated N149 was identified directly without PNGase F treatment, as described later, resulting in a total of 21 glycosylation sites out of 22 potential sites with N-X-S/T sequons in the HCoV-19 S protein (Fig. 1(b)). In addition, quantitative analysis of site occupancy showed that 18 sites were completely glycosylated, while N603 and N657 reached 43% and 74% occupancy, respectively (Table 1). Fig. 1(c) provides an example of mass spectra showing evidence of N331 and N343 glycosylation.
Table 1.
Sites | PTM | Sequences | Occupancy (%) |
---|---|---|---|
HCoV-19 S protein | |||
61, 74 | N-glycosylation | SSVLHSTQDLFLPFFSNaVTWFHAIHVSGTNaGTK | 100 |
74 | N-glycosylation | HAIHVSGTNaGTKRF | 100 |
78 | Methylation | RaFDNPVLPFNDGVYFASTEK | 100 |
122 | N-glycosylation | TQSLLIVNNaATNVVIK IVNNaATNVVIKVCEF |
100 100 |
165 | N-glycosylation | VYSSANNaCTFEYVSQPFLMDLEGK | 100 |
224 | Methylation | DLPQGFSALEaPLVDLPIGINITR | 100 |
234 | N-glycosylation | VDLPIGINaITRF | 100 |
282 | N-glycosylation | YNENaGTITDAVDCALDPLSETK NENaGTITDAVDCALDPLSETKCTL |
100 100 |
331 | N-glycosylation | RVQPTESIVRFPNaITNL | 100 |
331, 334 | N-glycosylation | FPNaITNLCPFGEVFNaATR | 100 |
340 | Methylation | FPNITNLCPFGEaVFNATR | 2.02 |
603 | N-glycosylation | GGVSVITPGTNaTSNQVAVL | 43.50 |
616 | N-glycosylation | YQDVNaCTEVPVAIHADQL | 100 |
654, 661 | Methylation | AGCLIGAEaHVNNSYEaCDIPIGAGICASYQTQTNSPR | 100 |
657 | N-glycosylation | AGCLIGAEHVNaNSYECDIPIGAGICASYQTQTNSPR IGAEHVNaNSYECDIPIGAGICASY |
74.80 79.76 |
709, 717 | N-glycosylation | SNaNSIAIPTNaF | 100 |
801 | N-glycosylation | TPPIKDFGGFNaFSQILPDPSKPSK NaFSQILPDPSKPSKRSF |
100 100 |
1074 | N-glycosylation | NaFTTAPAICHDGK | 100 |
1098 | N-glycosylation | VSNaGTHWF | 100 |
1134 | N-glycosylation | VSGNCDVVIGIVNaNTVY | 100 |
1158, 1173 | N-glycosylation | NaHTSPDVDLGDISGINaASVVNIQKEIDRLNEVAK | 99.17 |
1194 | N-glycosylation | NLNaESLIDLQELGK | 99.73 |
hACE2 | |||
53 | N-glycosylation | FNHEAEDLFYQSSLASWNYNTNaITEENVQNMNNAGDK NYNTNaITEENVQNMNNAGDKW |
100 100 |
68 | Methylation | NYNTNITEENVQNMNNAGDKaW | 100 |
84 | Hydroxyproline | EQSTLAQMYPaLQEIQNLTVK | 20.74 |
90 | N-glycosylation | WSAFLKEQSTLAQMYPLQEIQNaLTVK | 100 |
103 | N-glycosylation | LQLQALQQNaGSSVLSEDKSK QLQALQQNaGSSVL |
100 100 |
253 | Hydroxyproline | LMNAYPaSYISPIGCLPAHLLGDMWGR | 98.35 |
263 | Hydroxyproline | LMNAYPSYISPIGCLPaAHLLGDMWGR | 88.57 |
284 | Hydroxyproline | FWTNLYSLTVPaFGQKPNIDVTDAMVDQAWDAQR | 4.06 |
289 | Hydroxyproline | PaNIDVTDAMVDQAWDAQR | 1.70 |
321 | Hydroxyproline | FFVSVGLPaNMTQGFWENSMLTDPGNVQK | 58.24 |
322 | N-glycosylation | FFVSVGLPNaMTQGFWENSMLTDPGNVQK VSVGLPNaMTQGF |
100 100 |
329 | Methylation | FFVSVGLPNMTQGFWEaNSMLTDPGNVQK | 100 |
346 | Hydroxyproline | AVCHPaTAWDLGK | 88.88 |
415 | Hydroxyproline | NGANEGFHEAVGEIMSLSAATPaK | 0.68 |
432 | N-glycosylation | SIGLLSPDFQEDNaETEINFLLK LSPDFQEDNaETEINF |
100 100 |
451 | Hydroxyproline | QALTIVGTLPaFTYMLEK | 43.59 |
469 | Hydroxyproline | GEIPaKDQWMK | 4.09 |
546 | N-glycosylation | CDISNaSTEAGQK HKCDISNaSTEAGQKLF |
100 100 |
565 | Hydroxyproline | SEPaWTLALENVVGAK | 0.37 |
583 | Hydroxyproline | NMNVRPaLLNYFEPLFTWLK | 2.12 |
612 | Hydroxyproline | NSFVGWSTDWSPaYADQSIK | 0.21 |
690 | N-glycosylation | ISFNFFVTAPKNaVSDIIPR VTAPKNaVSDIIPRTEVEKAIRM NaVSDIIPRTEVEK |
100 100 100 |
Modified sites.
Quantitative analysis of site occupancy showed that all seven possible glycosylation sites in hACE2—namely, N53, N90, N103, N322, N432, N546, and N690—were completely glycosylated (Table 1). Fig. 1(d) provides an example of the mass spectra showing N90 glycosylation. These data suggest that both HCoV-19 S protein and hACE2 are highly decorated by N-glycans.
3.2. Identification of PTMs of HCoV-19 S protein and hACE2
To unveil possible PTMs other than glycosylation, LC-MS/MS data of deglycosylated peptides from both HCoV-19 S protein and hACE2 were subjected to several rounds of spectra-database matching for common PTMs. The major PTMs found in both proteins were methylation on lysine, arginine, or glutamic acid, as summarized in Table 1. Using an occupancy greater than 50% as a criterion to identify sites dominantly decorated with PTMs, we found that R78, E224, E654, and E661 in HCoV-19 S protein and E57, K68, and E329 in hACE2 were highly methylated. In addition, the proline at sites 253, 263, 321, and 346 in hACE2 was found to be prominently oxidized and converted to hydroxyproline. However, common PTMs such as phosphorylation, acetylation, or other forms of fatty acid acylations were not found in HCoV-19 S protein and hACE2.
3.3. Global N-glycosylation profile of HCoV-19 S protein and hACE2
To resolve glycan camouflage on the surface of the HCoV-19 S protein and hACE2, intact glycopeptides derived from protease digestion and fractionated by HILIC were directly subjected to LC-MS/MS analysis specifically designed to detect peptides with extra molecular weight due to N-glycan attachment. Due to the highly heterogeneous nature of the glycan chain, a unique glycopeptide was designated in our study as a combination of both a unique peptide sequence and a specific N-glycan composition. According to these criteria, 419 and 467 unique N-glycopeptides were identified from the HCoV-19 S protein and hACE2, respectively (Table 2, Table 3 ). Except for N1134, 19 of the 20 HCoV-19 S glycosylated sides identified in the PNGase F experiment were confirmed by intact glycopeptide profiling. Furthermore, N-glycans were also found to present at site N149. Although all seven glycosylation sites in hACE2 were identified previously in the deglycosylated peptides labeled with deamidation, an N-glycan profile was obtained at five sites: N90, N103, N432, N546, and N690. Fig. S1 in the Appendix A provides an example of N-glycopeptides spectra from HCoV-19 S protein and hACE2.
Table 2.
Domain | Site | Sequences | Most abundant composition | Most likely structure |
---|---|---|---|---|
S1 | 61 | FSNaVTWF FSNaVTW |
Hex6HexNAc2 | |
74 | SSVLHSTQDLFLPFFSNaVTWFHAIHVSGTNaGTK SSVLHSTQDLFLPFFSNaVTWFHAIHVSGTNaGTKR HAIHVSGTNaGTKRF |
Fuc1NeuGc2Hex6HexNAc4 | ||
122 | TQSLLIVNNaATNVVIK IVNNaATNVVIKVCEF LIVNNaATNVVIKVCEF |
Hex3HexNAc2 | ||
149 | VCEFQFCNDPFLGVYYHKNNaK YHKNNaKSWMESEF HKNNaKSWMESEF |
Hex3HexNAc2 | ||
165 | VYSSANNaCTFEYVSQPFLMDLEGK SSANNaCTFEY SSANNaCTFEYVSQPF |
Hex4HexNAc2 | ||
234 | DLPQGFSALEPLVDLPIGINaITR VDLPIGINaITRF |
Hex5HexNAc2 | ||
282 | YNENaGTITDAVDCALDPLSETK | Hex4HexNAc2 | ||
331 | FPNaITNLCPFGEVFNaATR RVQPTESIVRFPNaITNLCPF RVQPTESIVRFPNaITNL |
Hex3HexNAc2 | ||
343 | FPNITNLCPFGEVFNaATR CPFGEVFNaATRF GEVFNaATRF |
Fuc1NeuGc2Hex5HexNAc4 | ||
603 | GGVSVITPGTNaTSNQVAVLY GGVSVITPGTNaTSNQVAVL |
Fuc1Hex3HexNAc2 | ||
616 | YQDVNaCTEVPVAIHADQLTPTW QDVNaCTEVPVAIHADQLTPTW QDVNaCTEVPVAIHADQLTPTWRVY YQDVNaCTEVPVAIHADQL QDVNaCTEVPVAIHADQL |
Hex3HexNAc2 | ||
657 | AGCLIGAEHVNaNSYECDIPIGAGICASYQTQTNSPR QTRAGCLIGAEHVNaNSYECDIPIGAGICASY IGAEHVNaNSYECDIPIGAGICASY QTRAGCLIGAEHVNaNSY |
Fuc1Hex3HexNAc2 | ||
S2 | 709 | SNaNSIAIPTNF | Hex3HexNAc | |
717 | SNNSIAIPTNaF | Fuc1NeuGc2Hex4HexNAc4 | ||
801 | DFGGFNaFSQILPDPSKPSK DFGGFNaFSQILPDPSKPSKR TPPIKDFGGFNaFSQILPDPSK TPPIKDFGGFNaFSQILPDPSKPSK DFGGFNaFSQILPDPSK NaFSQILPDPSKPSKRSF |
Hex3HexNAc2 | ||
S2(S2′) | 1074 | NaFTTAPAICHDGK LHVTYVPAQEKNaF |
Hex3HexNAc2 | |
109 | EGVFVSNaGTHWFVTQR AHFPREGVFVSNaGTHWFVTQR VSNaGTHWF VSNaGTHW |
Hex3HexNAc2 | ||
1134b | — | — | — | |
1158 | NaHTSPDVDLGDISGINASVVNIQK YFKNaHTSPDVDLGDISGINASVVNIQK |
Fuc1Hex3HexNAc2 | ||
1173 | YFKNHTSPDVDLGDISGINaASVVNIQK NHTSPDVDLGDISGINaASVVNIQK |
Fuc2NeuGc1Hex5HexNAc4 | ||
1194 | NLNaESLIDLQELGKYEQYIK NLNaESLIDLQELGK |
Fuc1Hex3HexNAc2 |
Glycosylated N sites.
Sites identified only on deglycosylated peptides.
Table 3.
Domain | Site | Sequences | Most abundant composition | Most likely structure |
---|---|---|---|---|
PD | 53a | — | — | — |
90 | EQSTLAQMYPLQEIQNbLTVK | NeuGc1Hex5HexNAc4 | ||
103 | LQLQALQQNbGSSVLSEDK LQLQALQQNbGSSVLSEDKSK LQLQALQQNbGSSVLSEDKSKR |
Fuc1NeuGc1Hex5HexNAc4 | ||
322a | — | — | — | |
423 | SIGLLSPDFQEDNbETEINFLLK | Fuc1Hex3HexNAc4 | ||
546 | CDISNbSTEAGQK | Fuc1Hex5HexNAc4 | ||
CLD | 690 | NbVSDIIPR ISFNFFVTAPKNbVSDIIPR |
NeuGc1Hex5HexNAc4 |
Sites identified only on deglycosylated peptides.
Glycosylated N sites.
A total of 144 N-glycans were found in the HCoV-19 S protein, with the majority containing the common N-acetylglucosamine core (Table 2). All N-glycosites in the S protein were attached to multiple types of N-glycans, with N343 being decorated by the most diverse N-glycans. The HCoV-19 S N-glycan composition was preferentially comprised of pauci- or high-mannose oligosaccharides, except in four N-glycosites (N73, N343, N717, and N1173) that contained a high proportion of complex and hybrid N-glycans (Figs. 2 (a) and (b) and Table 2). Based on LC-MS intensity, pauci-mannose Hex3HexNAc2 was the most common N-glycan in the HCoV-19 S protein. Interestingly, there was no evidence of a sialic acid component in the N-glycans of the HCoV-19 S protein.
A total of 220 N-glycans were found in hACE2; of these, 93% were complex, while hybrid, high-mannose, and pauci-mannose glycan only constituted 4%, 2%, and 1%, respectively (Fig. 2(a) and Table 3). Based on LC-MS intensity, the most dominant hACE2 glycan was NeuGc1Hex5HexNAc4, which was predicted to be a bi-antennary complex containing a sialic acid in the distal end. The N-glycan profile in all hACE2 sites was primarily complex. It is notable that N90, N103, and N690 all contained more than 100 types of N-glycans, with their most dominant forms all containing sialic acid.
Collectively, our LC-MS/MS data confirm that both the HCoV-19 S protein and its receptor hACE2 are indeed heavily N-glycosylated at most of their predicted N-X-S/T sequons. Fig. 2(c) provides a summary of the most dominant N-glycan composition and predicted structure for each site of the HCoV-19 S protein and hACE2.
3.4. Structure modeling of the HCoV-19 S–hACE2 complex refined with glycan and PTM details
By adding the chemical structures of the most abundant N-glycans at each site, based on the LC-MS/MS results, to the most updated cryo-electron microscopy (cyro-EM) model of the HCoV-19 S protein and hACE2, we generated atomic models that represent the most likely spatial distribution of the N-glycans on both proteins (Figs. 3 (a) and (b)). Although four sites (N74, N149, N1158, and N1194) of the S protein are not shown in the model due to the missing residues in model 6VXX, our model suggests that the camouflaging N-glycans shield more than two-thirds of the HCoV-19 S protein surface, which could potentially lead to host attachment and immune evasion. In addition, both the HCoV-19 S protein and the hACE2 model showed that the glycans at N331 and N343 of the HCoV-19 S protein and N90 of hACE2 were in proximity to—albeit not exactly inside of—the binding area of both proteins (Figs. 3(c) and (d)). The model of the S protein–hACE2 complex also showed that three methylation sites in hACE2 (E57, K68, and E329) form a trident structure enclosing the contact area formed between K353–R357 of hACE2 and N501 of HCoV-19 S protein (Fig. S2 in the Appendix A).
3.5. Binding of HCoV-19 S protein and hACE2 does not depend on N-glycosylation
To understand the contribution of protein glycosylation to the interaction between HCoV-19 S protein and hACE2, we compared the binding kinetics and affinity of the purified hACE2 ectodomain to glycosylated and deglycosylated HCoV-19 S extracellular domain (ECD) immobilized at the surface of biosensors in a BLI experiment. We found that hACE2 was bound to HCoV-19 S protein that had been pretreated with active or inactive PNGase F with the comparable equilibrium dissociation constants K d of 1.7 nmol∙L−1 (Fig. 4 (a)) and 1.5 nmol∙L−1 (Fig. 4(b)), respectively. The affinity of hACE2 binding to HCoV-19 S protein that was determined in this study aligns with the previous report [13]. Comparable affinity was also observed when HCoV-19 S ECD was bound to the immobilized hACE2 (K d = 16.7 nmol∙L−1, Fig. 4(c)) or deglycosylated hACE2 (K d = 18.2 nmol∙L−1, Fig. 4(d)). Therefore, our data suggest that the glycosylation status does not contribute directly to the binding between HCoV-19 S protein and hACE2. A detailed summary of the BLI binding kinetics can be found in Table S3 in Appendix A.
4. Discussion
Glycosylation is an ubiquitous and complex PTM that greatly extends the structural and functional diversity of many proteins. However, comprehensive characterization of protein N-glycosylation is technically challenging. In this study, deglycosylated peptides, which are easier to be analyzed by LC-MS than their glycosylated precursors, were profiled to confirm the N-glycosylated sites in HCoV-19 S protein and hACE2. The O18 deamidation modification ensured confidence in the glycosite identifications, as the isotope was incorporated during PNGase F-mediated hydrolysis in H2O18. Intact N-glycopeptide analyses were then performed to further verify the glycosites and to provide detailed information about the compositional and structural heterogeneity of the associated N-glycans in each site. Our investigation showed that all sites except N17 were highly glycosylated in the HCoV-19 S protein. According to the N-linked glycan model of the S protein, a large proportion of the S protein surface is covered by glycans, which is consistent with previous reports [13], [14], [35], [36]. When the S proteins from HCoV-19 and SARS-CoV were compared, it was noticeable that the majority of differences in the glycosylation sites occurred in the distal S1 subunit, resulting in a significant difference in the glycan profile in the outermost canopy of the virus formed by spike trimer clusters. This alteration in glycosites might be the result of host or environmental selective pressure, and could imply dramatic differences in viral infectivity, pathogenesis, and host responses. During our research, we noticed that several other research groups also made attempts to decode the glycan profile of the HCoV-19 S protein. All these studies have confirmed that the HCoV-19 S protein is heavily glycosylated [35], [36].
The N-glycans in the S protein are markedly heterogeneous and may greatly extend the conformational flexibility and epitope diversity for the S protein. Although a small proportion of complex and hybrid N-glycans were found in the S protein, most sites were occupied by oligomannose glycans, which is in line with previous reports on the glycosylation profile of SARS-CoV [37], [38], [39]. Many viral proteins are hallmarked by a high level of oligomannose glycans, probably as a sign of incomplete glycan maturation due to a high glycosite density that results in steric hindrance [40]. Interestingly, N343, the glycosite closet to the binding surface, is decorated by the most diverse N-glycans and primarily by hybrid and complex forms. Moreover, as in the case of SARS-CoV and Marburg viral proteins [41], [42], we found that sialic acid incorporation in the SARS-CoV S protein glycans was negligible. In the ongoing global efforts to generate a vaccine, which primarily targets the S protein as the candidate antigen, vaccine developers should consider the diverse glycan forms decorating a large swath of the S trimer surface, as glycans could drastically modulate the protein immunogenicity. It was interesting to learn that different expression systems could affect glycan forms substantially, as we compared our data with other glycan profiling studies [35], [36]. In particular, mammalian cells prefer to assemble complex glycans to the S protein, as compared with insect cells. Therefore, we should also expect that vaccines that are produced based on different S proteoforms with different glycosylation statuses could show varying efficacy against future infection.
The binding of coronavirus S proteins to their respective receptors has been shown to be mediated by viral oligomannose N-glycan [19], [20], [23]. The SARS-CoV S protein contains three RBD-associated glycosites (N318, N330, and N357), while HCoV-19 only has two (N331 and N343) surrounding the binding pocket. Contrary to our postulation that the N-glycan at these two sites might contribute polar interaction to receptor binding, the BLI binding assay demonstrated that deglycosylation did not change the affinity of the SARS-CoV S protein to hACE2. However, this negative BLI binding result does not exclude the possibility that glycosylation could affect other viral entry steps, including protease cleavage and glycan-mediated interactions with dendritic cell/liver/lymph node-specific ICAM-3-grabbing non-integrin, which were reported to enhance SARS-CoV entry [25], [26]. Further investigations are required to ascertain the role of glycosylation in the infectivity of HCoV-19.
Antigen glycosylation greatly determines host immune responses. One of the prominent consequences of glycosylation is immune evasion by shielding off immunogenic epitopes in viral envelope or surface proteins [17], [43], [44]. The complete occupancy of glycans in most of the glycosites of the HCoV-19 S protein suggests that the virus is able to invade the host in a stealthy fashion. Successful innate immune evasion by HCoV-19 during the early stage of infection might explain the long asymptomatic incubation period during which transmissible virions are produced [4], [6]. Cases of HCoV-19 reinfection have also been reported, suggesting that the virus is capable of escaping from antibody-mediated neutralization [45]. On the other hand, protein glycosylation can also facilitate immune response by means of sensor molecules or antibodies that are specific for glycan recognition. N-glycans are known ligands for galectins, and can trigger inflammation events such as cytokine release and immune cell infiltration [46], [47]—which may contribute to the HCoV-19 pathogenesis and correlate with the disease severity. Moreover, epidemiological studies have shown that ABO polymorphism is linked to different susceptibilities to both HCoV-19 and SARS-CoV infections [48], [49]. Since the ABH carbohydrate epitopes and viral protein glycans are likely to be synthesized by the same ER-resident glycosylation enzymes, anti-A or -B antibodies can partially block the viral–host interactions [50]; thus, individuals with blood type O would be more resistant to virus infection. It remains to be explored whether the glycosylation status of the S protein has implications toward inter-individual variation in terms of viral response and clinical outcome within the infected population.
The glycosylation status of hACE2 was also profiled in this study. All seven glycosites in hACE2 were completely occupied by glycan, including N90, which is in the vicinity of the binding surface. However, we found that glycans did not directly contribute to the binding of hACE2 with the HCoV-19 S protein. This finding is in agreement with a previous finding that disruption of ACE2 glycosylation did not affect its binding with the SARS-CoV S protein [51]. However, deglycosylated ACE2 did compromise the cellular entry and the subsequent production of infectious SARS-CoV virion [51]. Therefore, ACE2 glycosylation is still considered to be an important target for intervention of coronavirus infection. In fact, the ability of chloroquine to counter HCoV-19 infection is thought to be due to its inhibition of ACE2 glycosylation, in addition to its ability to increase the endosomal pH level [28], [52].
In addition to glycosylation, additional PTM forms were investigated in this study. Methylation was identified in several sites in both the S protein and hACE2. In particular, we found that the E57, K68, and E329 sites in hACE2, which surround its binding site with the RBD of the S protein, are completed methylated. Methylation causes a loss of charge and increases the hydrophobicity of these sites. Four hydroxyprolines (at 253, 263, 321, and 346) were identified in the protease domain of hACE2. The extra hydroxyl group is likely to increase the hydrophilicity of proline in this extracellular region of hACE2. Further study is needed to investigate possible biological roles of these PTMs. We did not find phosphorylation or acetylation in our dataset, and previous PTM investigation of SARS-CoV also failed to identify these PTMs [38]. Fatty acid acylations were also considered in our study, since they are commonly found in surface viral proteins to facilitate membrane fusion and virus entry. However, our study found no evidence of an acylation PTM presence in the HCoV-19 S protein. Given that our multi-protease proteomic experiment provides almost full coverage for both proteins, we believe that the PTMome of the HCoV-19 S protein and hACE2 mainly consists of glycosylation, methylation, and proline oxidation.
Acknowledgments
Acknowledgments
This work was supported by the National Key Research and Development Program (2017YFC1200204, 2017YFA0504803, and 2018YFA0507700), Emergency Project of Zhejiang Provincial Department of Science and Technology (2020C03123-1), Fundamental Research Funds for the Central Universities (2018XZZX001-13), and Independent Project Fund of the State Key Laboratory for Diagnosis and Treatment of Infectious Disease. We thank the proteomics and metabolomics platform in the State Key Laboratory for Diagnosis and Treatment of Infectious Diseases at Zhejiang University for glycoproteomic analysis. We thank Shuangtian Shengwu Biotech. Co., Ltd. for assistance of BIL analysis. We thank pGlyco team at Institute of Computing Technology (Chinese Academy of Sciences, Beijing, China) for technical support for pGlyco analysis. We thank Dr. Ma Ping for consultation on protein expression and purification.
Authors’ contributions
Study concept and design: Zeyu Sun, Lanjuan Li. Samples preparation: Zeyu Sun, Keyi Ren, Feiyang Ji, Xiaoxi Ouyang. LC-MS/MS experiments: Zeyu Sun, Keyi Ren. Analysis and interpretation of data: Zeyu Sun, Jing Jiang, Keyi Ren. Structure modeling: Xing Zhang, Jinghua Chen, Zeyu Sun. Drafting of the manuscript: Zeyu Sun, Zhengyi Jiang. Critical revision of the manuscript: Lanjuan Li. All authors approved the final version of the manuscript, including the authorship list.
Compliance with ethics guidelines
Zeyu Sun, Keyi Ren, Xing Zhang, Jinghua Chen, Zhengyi Jiang, Jing Jiang, Feiyang Ji, Xiaoxi Ouyang, and Lanjuan Li declare that they have no conflict of interest or financial conflicts to disclose.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.eng.2020.07.014.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Wu F., Zhao S., Yu B., Chen Y.M., Wang W., Song Z.G., et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Dong E., Du H., Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis. 2020;20(5):533–534. doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zhou P., Yang X.L., Wang X.G., Hu B., Zhang L., Zhang W., et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Guan W.J., Ni Z.Y., Hu Y., Liang W.H., Ou C.Q., He J.X., et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382:1708–1720. doi: 10.1056/NEJMoa2002032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhou F., Yu T., Du R., Fan G., Liu Y., Liu Z., et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet. 2020;395(10229):1054–1062. doi: 10.1016/S0140-6736(20)30566-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wang D., Hu B., Hu C., Zhu F., Liu X., Zhang J., et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China. JAMA. 2020;323(11):1061–1069. doi: 10.1001/jama.2020.1585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zhang T., Wu Q., Zhang Z. Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak. Curr Biol. 2020;30(7):1346–1351. doi: 10.1016/j.cub.2020.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wu A., Peng Y., Huang B., Ding X., Wang X., Niu P., et al. Genome composition and divergence of the novel coronavirus (2019-nCoV) Cell Host Microbe. 2020;27:325–328. doi: 10.1016/j.chom.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Xu Y. Unveiling the origin and transmission of 2019-nCoV. Trends Microbiol. 2020;28(4):239–240. doi: 10.1016/j.tim.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhang Y.Z., Holmes E.C. A genomic perspective on the origin and emergence of SARS-CoV-2. Cell. 2020;181(2):223–227. doi: 10.1016/j.cell.2020.03.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wan Y., Shang J., Graham R., Baric R.S., Li F. Receptor recognition by the novel coronavirus: an analysis based on decade-long structural studies of SARS coronavirus. J Virol. 2020;94(7):e00127–220. doi: 10.1128/JVI.00127-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hoffmann M., Kleine-Weber H., Schroeder S., Krüger N., Herrler T., Erichsen S., et al. SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell. 2020;181(2) doi: 10.1016/j.cell.2020.02.052. 271–80.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Walls A.C., Park Y.J., Tortorici M.A., Wall A., McGuire A.T., Veesler D. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020;181(2) doi: 10.1016/j.cell.2020.02.058. 281–92.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wrapp D., Wang N., Corbett K.S., Goldsmith J.A., Hsieh C.L., Abiona O., et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367(6483):1260–1263. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Fukushi M., Yoshinaka Y., Matsuoka Y., Hatakeyama S., Ishizaka Y., Kirikae T., et al. Monitoring of S protein maturation in the endoplasmic reticulum by calnexin is important for the infectivity of severe acute respiratory syndrome coronavirus. J Virol. 2012;86(21):11745–11753. doi: 10.1128/JVI.01250-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.de Groot R.J. Structure, function and evolution of the hemagglutinin-esterase proteins of corona- and toroviruses. Glycoconj J. 2006;23(1–2):59–72. doi: 10.1007/s10719-006-5438-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Raman R., Tharakaraman K., Sasisekharan V., Sasisekharan R. Glycan-protein interactions in viral pathogenesis. Curr Opin Struct Biol. 2016;40:153–162. doi: 10.1016/j.sbi.2016.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chang D., Zaia J. Why glycosylation matters in building a better flu vaccine. Mol Cell Proteomics. 2019;18(12):2348–2358. doi: 10.1074/mcp.R119.001491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Li W., Hulswit R.J.G., Widjaja I., Raj V.S., McBride R., Peng W., et al. Identification of sialic acid-binding function for the Middle East respiratory syndrome coronavirus spike glycoprotein. PNAS. 2017;114(40):E8508–E8517. doi: 10.1073/pnas.1712592114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Parsons L.M., Bouwman K.M., Azurmendi H., de Vries R.P., Cipollo J.F., Verheije M.H. Glycosylation of the viral attachment protein of avian coronavirus is essential for host cell and receptor binding. J Biol Chem. 2019;294:7797–7809. doi: 10.1074/jbc.RA119.007532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shih Y.P., Chen C.Y., Liu S.J., Chen K.H., Lee Y.M., Chao Y.C., et al. Identifying epitopes responsible for neutralizing antibody and DC-SIGN binding on the spike glycoprotein of the severe acute respiratory syndrome coronavirus. J Virol. 2006;80(21):10315–10324. doi: 10.1128/JVI.01138-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.York I.A., Stevens J., Alymova I.V. Influenza virus N-linked glycosylation and innate immunity. Biosci Rep. 2018;39(1) doi: 10.1042/BSR20171505. BSR20171505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zheng J., Yamada Y., Fung T.S., Huang M., Chia R., Liu D.X. Identification of N-linked glycosylation sites in the spike protein and their functional impact on the replication and infectivity of coronavirus infectious bronchitis virus in cell culture. Virology. 2018;513:65–74. doi: 10.1016/j.virol.2017.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhou Y., Lu K., Pfefferle S., Bertram S., Glowacka I., Drosten C., et al. A single asparagine-linked glycosylation site of the severe acute respiratory syndrome coronavirus spike glycoprotein facilitates inhibition by mannose-binding lectin through multiple mechanisms. J Virol. 2010;84(17):8753–8764. doi: 10.1128/JVI.00554-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Han D.P., Lohani M., Cho M.W. Specific asparagine-linked glycosylation sites are critical for DC-SIGN- and L-SIGN-mediated severe acute respiratory syndrome coronavirus entry. J Virol. 2007;81(21):12029–12039. doi: 10.1128/JVI.00315-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Jeffers S.A., Tusell S.M., Gillim-Ross L., Hemmila E.M., Achenbach J.E., Babcock G.J., et al. CD209L (L-SIGN) is a receptor for severe acute respiratory syndrome coronavirus. PNAS. 2004;101(44):15748–15753. doi: 10.1073/pnas.0403812101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zheng L., Li H., Fu L., Liu S., Yan Q., Leng S.X. Blocking cellular N-glycosylation suppresses human cytomegalovirus entry in human fibroblasts. Microb Pathog. 2020;138 doi: 10.1016/j.micpath.2019.103776. [DOI] [PubMed] [Google Scholar]
- 28.Vincent M.J., Bergeron E., Benjannet S., Erickson B.R., Rollin P.E., Ksiazek T.G., et al. Chloroquine is a potent inhibitor of SARS coronavirus infection and spread. Virol J. 2005;2:69. doi: 10.1186/1743-422X-2-69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chen W.H., Du L., Chag S.M., Ma C., Tricoche N., Tao X., et al. Yeast-expressed recombinant protein of the receptor-binding domain in SARS-CoV spike protein with deglycosylated forms as a SARS vaccine candidate. Hum Vaccines Immunother. 2014;10(3):648–658. doi: 10.4161/hv.27464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kumar S., Maurya V.K., Prasad A.K., Bhatt M.L.B., Saxena S.K. Structural, glycosylation and antigenic variation between 2019 novel coronavirus (2019-nCoV) and SARS coronavirus (SARS-CoV) VirusDisease. 2020;31:13–21. doi: 10.1007/s13337-020-00571-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sun Z., Liu X., Jiang J., Huang H., Wang J., Wu D., et al. Toward biomarker development in large clinical cohorts: an integrated high-throughput 96-well-plate-based sample preparation workflow for versatile downstream proteomic analyses. Anal Chem. 2016;88(17):8518–8525. doi: 10.1021/acs.analchem.6b01333. [DOI] [PubMed] [Google Scholar]
- 32.Liu M.Q., Zeng W.F., Fang P., Cao W.Q., Liu C., Yan G.Q., et al. pGlyco 2.0 enables precision N-glycoproteomics with comprehensive quality control and one-step mass spectrometry for intact glycopeptide identification. Nat Commun. 2017;8:438. doi: 10.1038/s41467-017-00535-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Emsley P., Lohkamp B., Scott W.G., Cowtan K. Features and development of Coot. Acta Crystallogr. 2010;D66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bailey U.M., Jamaluddin M.F., Schulz B.L. Analysis of congenital disorder of glycosylation-Id in a yeast model system shows diverse site-specific under-glycosylation of glycoproteins. J Proteome Res. 2012;11:5376–5383. doi: 10.1021/pr300599f. [DOI] [PubMed] [Google Scholar]
- 35.Shajahan A, Supekar NT, Gleinich AS, Azadi P. Deducing the N- and O- glycosylation profile of the spike protein of novel coronavirus SARS-CoV-2. Glycobiology. 2020;30(12):981–8. [DOI] [PMC free article] [PubMed]
- 36.Watanabe Y., Allen J.D., Wrapp D., McLellan J.S., Crispin M. Site-specific glycan analysis of the SARS-CoV-2 spike. Science. 2020;369(6501):330–333. doi: 10.1126/science.abb9983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Krokhin O., Li Y., Andonov A., Feldmann H., Flick R., Jones S., et al. Mass spectrometric characterization of proteins from the SARS virus: a preliminary report. Mol Cell Proteomics. 2003;2(5):346–356. doi: 10.1074/mcp.M300048-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ying W., Hao Y., Zhang Y., Peng W., Qin E., Cai Y., et al. Proteomic analysis on structural proteins of severe acute respiratory syndrome coronavirus. Proteomics. 2004;4(2):492–504. doi: 10.1002/pmic.200300676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Song H.C., Seo M.Y., Stadler K., Yoo B.J., Choo Q.L., Coates S.R., et al. Synthesis and characterization of a native, oligomeric form of recombinant severe acute respiratory syndrome coronavirus spike glycoprotein. J Virol. 2004;78(19):10328–10335. doi: 10.1128/JVI.78.19.10328-10335.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Watanabe Y., Bowden T.A., Wilson I.A., Crispin M. Exploitation of glycosylation in enveloped virus pathobiology. Biochim Biophys Acta Gen Subj. 2019;1863(10):1480–1497. doi: 10.1016/j.bbagen.2019.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Feldmann H., Nichol S.T., Klenk H.D., Peters C.J., Sanchez A. Characterization of filoviruses based on differences in structure and antigenicity of the virion glycoprotein. Virology. 1994;199(2):469–473. doi: 10.1006/viro.1994.1147. [DOI] [PubMed] [Google Scholar]
- 42.Ritchie G., Harvey D.J., Feldmann F., Stroeher U., Feldmann H., Royle L., et al. Identification of N-linked carbohydrates from severe acute respiratory syndrome (SARS) spike glycoprotein. Virology. 2010;399(2):257–269. doi: 10.1016/j.virol.2009.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Yang T.J., Chang Y.C., Ko T.P., Draczkowski P., Chien Y.C., Chang Y.C., et al. Cryo-EM analysis of a feline coronavirus spike protein reveals a unique structure and camouflaging glycans. PNAS. 2020;117(3):1438–1446. doi: 10.1073/pnas.1908898117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Vigerust D.J., Shepherd V.L. Virus glycosylation: role in virulence and immune interactions. Trends Microbiol. 2007;15(5):211–218. doi: 10.1016/j.tim.2007.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Biswas A., Bhattacharjee U., Chakrabarti A.K., Tewari D.N., Banu H., Dutta S. Emergence of novel coronavirus and COVID-19: whether to stay or die out? Crit Rev Microbiol. 2020;46(2):182–193. doi: 10.1080/1040841X.2020.1739001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Robinson B.S., Arthur C.M., Evavold B., Roback E., Kamili N.A., Stowell C.S., et al. The sweet-side of leukocytes: galectins as master regulators of neutrophil function. Front Immunol. 2019;53(6):925–935. doi: 10.3389/fimmu.2019.01762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang W.H., Lin C.Y., Chang M.R., Urbina A.N., Assavalapsakul W., Thitithanyanont A., et al. The role of galectins in virus infection—a systemic literature review. J Microbiol Immunol Infect. 2019;53(6):925–935. doi: 10.1016/j.jmii.2019.09.005. [DOI] [PubMed] [Google Scholar]
- 48.Cheng Y., Cheng G., Chui C.H., Lau F.Y., Chan P.K., Ng M.H., et al. ABO blood group and susceptibility to severe acute respiratory syndrome. JAMA. 2005;293(12):1450–1451. doi: 10.1001/jama.293.12.1450-c. [DOI] [PubMed] [Google Scholar]
- 49.Zhao J, Yang Y, Huang H, Li D, Gu D, Lu X, et al. Relationship between the ABO blood group and the COVID-19 susceptibility. Clin Infect Dis 2020;73(2):328–31. [DOI] [PMC free article] [PubMed]
- 50.Guillon P., Clément M., Sébille V., Rivain J.G., Chou C.F., Ruvoën-Clouet N., et al. Inhibition of the interaction between the SARS-CoV spike protein and its cellular receptor by anti-histo-blood group antibodies. Glycobiology. 2008;18(12):1085–1093. doi: 10.1093/glycob/cwn093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Zhao X., Guo F., Comunale M.A., Mehta A., Sehgal M., Jain P., et al. Inhibition of endoplasmic reticulum-resident glucosidases impairs severe acute respiratory syndrome coronavirus and human coronavirus NL63 spike protein-mediated entry by altering the glycan processing of angiotensin I-converting enzyme 2. Antimicrob Agents Chemother. 2015;59(1):206–216. doi: 10.1128/AAC.03999-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Liu J., Cao R., Xu M., Wang X., Zhang H., Hu H., et al. Hydroxychloroquine, a less toxic derivative of chloroquine, is effective in inhibiting SARS-CoV-2 infection in vitro. Cell Discovery. 2020;6:16. doi: 10.1038/s41421-020-0156-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.