Significance
We report here a 3.3-Å cryo-EM structure of feline infectious peritonitis virus (FIPV) S protein derived from the serotype I FIPV UU4 strain. The near-atomic EM map enabled ab initio modeling of 27 out of the 33 experimentally verified high-mannose and complex-type N-glycans that mask most of the protein surface. We demonstrated the feasibility to directly visualize the core fucose of a complex-type glycan, which was independently cross-validated by glycopeptide mass spectrometry analyses. There exist 3 N-glycans that wedge between 2 galectin-like domains within the S1 subunit of FIPV-UU4 S protein, resulting in a propeller-like conformation unique to all reported CoV S proteins. The results highlight a structural role of glycosylation in maintaining complex protein structures.
Keywords: cryoelectron microscopy, mass spectrometry, protein glycosylation, alphacoronavirus, feline infectious peritonitis virus
Abstract
Feline infectious peritonitis virus (FIPV) is an alphacoronavirus that causes a nearly 100% mortality rate without effective treatment. Here we report a 3.3-Å cryoelectron microscopy (cryo-EM) structure of the serotype I FIPV spike (S) protein, which is responsible for host recognition and viral entry. Mass spectrometry provided site-specific compositions of densely distributed high-mannose and complex-type N-glycans that account for 1/4 of the total molecular mass; most of the N-glycans could be visualized by cryo-EM. Specifically, the N-glycans that wedge between 2 galectin-like domains within the S1 subunit of FIPV S protein result in a unique propeller-like conformation, underscoring the importance of glycosylation in maintaining protein structures. The cleavage site within the S2 subunit responsible for activation also showed distinct structural features and glycosylation. These structural insights provide a blueprint for a better molecular understanding of the pathogenesis of FIP.
Coronaviruses (CoVs) are enveloped viruses with single-stranded positive-sensed RNA. They are major infectious pathogens for a myriad of mammals and birds (1). Interspecies transmission due to genetic mutations in CoVs are responsible for life-threatening pandemics, such as severe acute respiratory syndrome-related CoV (SARS-CoV) in 2003 and Middle East respiratory syndrome-related CoV (MERS-CoV) in 2012 (1–3), resulting in detrimental economic and societal impacts. Sporadic CoV outbreaks further cause animal endemics and major economic losses (4–6). Crucially, there is a constant risk of zoonotic outbreaks that argues for a better molecular understanding of CoV-associated pathogenesis (7, 8).
First reported 4 decades ago, feline infectious peritonitis (FIP) is one of the most fatal infectious diseases in cats, particularly kittens (9). The disease results from infection by a feline CoV (FCoV), feline infectious peritonitis virus (FIPV) (10, 11). There is no effective treatment and the fatality rate is essentially 100% (10, 11). FCoV can be serologically categorized into 2 types: serotypes I and II. Serotype I FCoV is the most epidemiologically prevalent, contributing to more than 70% of circulating virus isolates worldwide (12). Progress in FIP research has been limited not least because of difficulties in isolating and propagating serotype I FCoV in vitro for mechanistic studies on pathogenesis at the molecular level, which are urgently needed for the development of vaccines and therapeutics (2, 9, 13, 14).
CoVs use their spike (S) proteins for host recognition and subsequent membrane fusion to introduce their viral genomes into the host for replication. Preventing CoV infection by blocking S-protein binding to host receptors therefore represents the first line of defense. CoV S proteins consist of 2 functional units, the S1 and S2 subunits, which are responsible for cell attachment and membrane fusion, respectively (1, 2, 15). Mutations in the receptor-binding motifs (RBMs) or cleavage sites of CoV S proteins can lead to zoonotic spillover and alteration of cell/tissue tropism, as exemplified by SARS and MERS (1, 16).
To better understand the molecular basis of FIPV, we constructed a trimeric S glycoprotein of a serotype I FIPV UU4 strain and overexpressed it in a mammalian cell line, as opposed to insect cell lines that were used in previous structural studies on CoV S proteins (3, 17–21). This ensured a better resemblance of the posttranslationally modified glycosylation patterns on the S protein, the chemical structures of which were determined by mass spectroscopy (MS). Cryoelectron microscopy (cryo-EM) single-particle reconstruction was employed to determine the 3-dimensional (3D) structure of FIPV-UU4 S protein to a resolution of 3.3 Å, enabling ab initio model building of not only the protein but also most of the N-glycan structures. The result enabled direct visualization of the camouflaging N-glycans on the surface of FIPV-UU4 S protein. Nearly 1/3 of the observed N-linked glycans correspond to the complex type, which has not been studied in such detail in previously reported CoV S proteins (3, 17–21). Our cryo-EM structure of FIPV-UU4 S protein therefore serves as a starting point for further studies on virus–host interactions and development of better vaccines and therapeutics for the intervention of FIP.
Results
Structure of FIPV-UU4 S Protein.
The ectodomain of FIPV-UU4 S protein was fused with a trimerization domain of T4 fibritin followed by a V5 tag and a His6 tag at the C terminus to replace the transmembrane helix. The recombinant FIPV-UU4 S protein was overexpressed in a human embryonic kidney 293 (HEK 293) cell line. With 1,391 amino acids in its sequence, the expected molecular mass of the homotrimeric recombinant protein is 465 kDa. Nevertheless, size-exclusion chromatography coupled with multiangle static light scattering (SEC-MALS) indicated an apparent molecular mass of 722 kDa for the single elution peak, of which 192 kDa (27% of the total molecular mass) was ascribed to glycan moieties (SI Appendix, Fig. S1A). The extensive glycosylation and the trimeric state were confirmed by gel electrophoresis under denaturing and native conditions, respectively (SI Appendix, Fig. S1B). Importantly, immunocytochemistry analysis demonstrated that mouse anti–FIPV-UU4 S protein serum was able to differentiate serotype I- and II-infected cells, thus establishing the potential of FIPV-UU4 S protein as a potent vaccine candidate for FIP treatment (SI Appendix, Fig. S1 C and D). Using a 200-keV electron microscope, we determined the 3D electron density map of the FIPV-UU4 S protein to 3.3-Å resolution, which revealed a propeller-like homotrimer structure with an overall dimension of 180 Å in diameter and 170 Å in height (Fig. 1A, Table 1, SI Appendix, Figs. S2A and S3 A and B, and Movie S1). In contrast, all previously reported cryo-EM structures of CoV S proteins have adopted compact ellipsoidal shapes of which the host receptor-recognizing S1 subunits pack against the spring-loaded coiled-coil S2 subunits (SI Appendix, Table S1) (3, 17–21). Local resolution analysis indicated that the propeller-like regions of the EM map were less defined compared with that of the core regions, indicative of local dynamics (SI Appendix, Fig. S3E).
Table 1.
Detection mode | Defocus phase contrast | Volta phase plate |
Microscope | Talos Arctica | Titan Krios |
Voltage, keV | 200 | 300 |
Defocus range, μm | 1.8 to 2.8 | 0.2 to 0.8 |
Movies | 2,436 | 1,405 |
Frames per movie | 32 | 30 |
Exposure time per frame, s | 2.5 | 2.27 |
Magnification | 120k | 120k |
Pixel size, Å | 0.87 | 0.85 |
Total accumulated dose, e−/Å2 | 48 | 30 |
Particles | 102,586 | 191,810 |
Map resolution, Å | 3.3 | 3.7 |
Map-sharpening B factor, Å2 | −90 | −143 |
3DFSC (sphericity) | 0.91 | |
Cryo-EF (efficiency) | 0.66 | |
Model composition | ||
Nonhydrogen atoms | 32,115 | |
Protein residues | 3,735 | |
Ligands | 219 | |
B factor, Å | ||
Protein | 170.75 | |
Ligand | 150.62 | |
Model validation | ||
MolProbity score | 2.12 | |
EMRinger score | 2.62 | |
CC (mask) | 0.79 | |
d 99 (Å) | ||
Masked | 4.1 | |
Unmasked | 3.9 | |
d FSC model, 0/0.143/0.5 Å | ||
Masked | 3.1/3.4/3.9 | |
Unmasked | 3.2/3.5/4.2 | |
Ramachandran, % | ||
Favored | 87.33 | |
Allowed | 12.43 | |
Outliers | 0.24 | |
Rotamer outliers, % | 0.82 | |
Clashscore | 9.50 | |
Rms deviations | ||
Bond length, Å | 0.011 | |
Bond angle, ° | 1.604 |
The high-resolution EM map enabled ab initio model building and refinement to generate an atomic model of FIPV-UU4 S protein. The 3 monomers are intertwined through extensive interactions between the conserved S2 subunits, while the 3 S1 subunits form the propeller blades (Fig. 1B and Table 1). Detailed inspection of the EM map showed well-defined side-chain densities in most regions, including the interstrand disulfide bonds, attesting to the high quality of our EM-derived structural model (SI Appendix, Fig. S3D and Movie S2). After considering the electron densities of individual amino acids, many electron densities remained unaccounted for. It transpired that these additional densities correspond to the N-glycan structures (vide infra). Based on our cryo-EM structure, we defined 5 distinct domains within the S1 subunit: domain 0 (residues 1 to 275), domain A (residues 276 to 540), domain B (residues 541 to 695), domain C (residues 696 to 754), and domain D (residues 755 to 791). The S2 subunit consists of an S2′ cleavage site (residues 969 to 978), a fusion peptide (residues 979 to 1055), and 2 heptad repeats (HR1 and HR2), corresponding to residues 1056 to 1156 and 1342 to 1403, respectively. The electron density map corresponding to the HR2 domain of the C terminus (residues 1338 to 1391) could not be resolved, suggesting conformational heterogeneity that leads to loss of contrast after averaging over a large number of particle images (Fig. 1C). Likewise, the model of the 4 loop segments (residues 1 to 61, 391 to 410, 781 to 796, and 1337 to 1361) could not be built because of the loss of the corresponding EM maps, potentially due to intrinsic dynamics.
Site-Specific N-Glycosylation Mapped by MS and Cryo-EM.
Glycosylation of CoV S proteins is implicated in protein folding (22), structure stability (22), virus entry, and receptor recognition (23–27). According to the sequence, 37 N-linked glycosylation sites are predicted for FIPV-UU4 S protein. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis of the de–N-glycosylated tryptic peptides led to identification of 29 occupied N-glycosites based on the conversion of Asn to Asp upon PNGase F treatment (Fig. 2A). The remaining 8 predicted N-glycosylation sites, namely 27NTSH, 31NNSK, 482NYTD, 585NGSV, and 590NVTS in the S1 subunit, 1308NTTH in the S2 subunit, and 1352NQTK and 1357NLTA in HR2, were not verified due to the lack of corresponding tryptic peptides. To obtain an unbiased view of the range of N-glycans carried, the pool of released N-glycans was profiled by MALDI-MS after permethylation. Both high-mannose and ±core-fucosylated, ±sialylated complex-type N-glycans were evidently present, with Man9GlcNAc2 (M9) and a simple core-fucosylated biantennary structure contributing to the 2 most intense molecular ion signals detected (SI Appendix, Fig. S4). Taking the average size of these 2 predominant N-glycan structures—1,768 Da for biantennary and 1,864 Da for M9—that were released from the FIPV-UU4 S protein, a total of 37 glycosylated sites would contribute to a total of 66.6 kDa of glycan per monomer, namely ca. 200 kDa for a trimer, consistent with the 192-kDa glycan contribution as inferred from SEC-MALS data (SI Appendix, Fig. S1A). Collectively, it confirms that FIPV-UU4 S protein is indeed heavily N-glycosylated at most, if not all, of the predicted sites.
The extensive N-glycosylation of FIPV-UU4 S protein was evident in the 2-dimensional classifications of the raw cryo-EM particle images. The use of the Volta phase plate (VPP) in combination with a 300-keV electron microscope enhanced the contrast of the blurry density around the core protein densities. The VPP-derived dataset was used to construct a 3D EM map, which showed better-defined protrusions with lower local resolutions as a result of conformational heterogeneity (SI Appendix, Figs. S2B and S5). Through different image-processing procedures, we could unambiguously build 28 N-linked glycan structures onto the atomic model, including 2 N-glycosylation sites, N585 and N590, which were not identified by LC-MS/MS analysis of the de–N-glycosylated peptides (SI Appendix, Fig. S6). The reconstituted glycan EM densities were mostly limited to 1 or 2 N-acetylglucosamine (GlcNAc) moieties linked to the asparagine side chains due to their intrinsic dynamics. Nonetheless, a number of well-defined glycan densities could be traced and modeled with up to 7 monosaccharide residues. In the case of N357 and N1218, we could resolve the densities that correspond to 2 GlcNAcs and up to 5 mannoses, which is indicative of these 2 sites being mostly glycosylated by high-mannose-type structures (Fig. 2 D and F and Movie S3).
To further define the distribution of high-mannose versus complex-type N-glycans over the various sites, tryptic digests of FIPV-UU4 S protein were subjected to LC-MS/MS analysis without first removal of the N-glycans. By directly identifying the intact glycopeptides, the site-specific N-glycosylation pattern of 24 sites could be profiled, including 482NYTD and 1308NTTH, not detected by previous analysis of de–N-glycosylated peptides. This brings the total of MS-verified N-glycosylation to 31 out of the predicted 37 sites (summarized in Fig. 2A). For each of the identified sites, the most abundant and hence most representative glycoforms were deduced from the relative peak intensities of the manually extracted ion chromatograms (XICs) of the corresponding glycopeptides (SI Appendix, Table S2). For example, the XICs of glycopeptides derived from N1218 and N949 indicated that the major glycoforms correspond to an M9 high mannose and a nonsialylated, fucosylated biantennary complex-type glycan structure, respectively (Fig. 2 C and E, Left). Close examination of the EM density protruding from the side chain of N949 revealed additional density next to the first GlcNAc moiety linked to N949. We attributed this to a core fucose (Fuc), which is consistent with and fully corroborated by the corresponding glycopeptide MS/MS spectrum, which contained a diagnostic peptide backbone fragment ion carrying a GlcNAc and a Fuc (Y1+F; Fig. 2E).
Combining all our cryo-EM and MS data, 33 out of the 37 predicted N-linked glycosylated sites were identified, including 18 and 15 sites in the S1 and S2 subunits, respectively (SI Appendix, Table S3). The 4 unaccounted sites included N27 and N31 at the N terminus and N1352 and N1357 at the C terminus, all of which were not resolved by cryo-EM. Based on the chemical structures of the most populated N-glycans derived from glycopeptide analysis and the complementary cryo-EM results, we generated an atomic model that represented the most likely spatial distribution of the N-glycans on the FIPV-UU4 S protein (Figs. 2B and 3 and SI Appendix, Table S3). These glycans shield more than 2/3 of the protein surface, potentially camouflaging protein functional motifs to evade detection by the host immune system.
Structural Characteristics of Domain 0 Unique to Alphacoronaviruses.
Compared with the cryo-EM structure of the S protein of human CoV NL63 (HCoV-NL63), which represents the only reported alphacoronavirus S-protein structure (19), domain 0 of FIPV-UU4 is rotated 90° with respect to the adjacent domain A (Fig. 4 A and B). Such a pronounced conformational difference could stem from the extensive glycosylation at N254, N357, and N491 that wedge between domains 0 and A (Fig. 3A). Domain 0 is only present in alphacoronaviruses, while it is structurally similar to domain A of other CoV genera (SI Appendix, Table S4). Structure-based bioinformatics analysis indicated that both domains 0 and A of FIPV-UU4 S protein are structurally homologous to galectins, and galectin-4 in particular (SI Appendix, Tables S4 and S5). Nevertheless, domains 0 and A of FIPV-UU4 S protein have distinct glycosylation patterns, leading to differential steric hindrance and physiochemical characteristics on their surfaces that may influence host recognition (Figs. 2A and 3A and SI Appendix, Tables S2 and S3).
Glycan Array Analysis of Domain 0 Unique to Alphacoronaviruses.
Despite sharing a similar 3D fold with galectins (SI Appendix, Table S4), domain 0 of FIPV-UU4 S protein lacks the conserved tryptophan residues that galectins use to bind β-galactoside through CH–π interactions (28). Furthermore, the putative carbohydrate-binding pocket in domain 0 is sequestered by a long loop in the cryo-EM structure. Nonetheless, the ill-defined electron density was indicative of local dynamics such that the putative carbohydrate-binding pocket may still be accessible in solution (SI Appendix, Table S4).
Because hemagglutinin activity has been reported for several CoVs, we asked whether individual domains of FIPV-UU4 S protein indeed exhibit lectin activity as a way to bind to host cell-surface glycans for mediating FIPV infection. We employed an array of 100 glycan structures to test the lectin activities of full-length FIPV-UU4 S protein and 3 truncated variants, namely domain 0 only, domains 0 and A, and domains 0, A, and B (SI Appendix, Materials and Methods). Three groups of glycan structures were found to be recognized by all 4 variants, namely sialylated or core-Fuc Galβ(1→4)Glcβ-core structures, sialylated Galβ(1→3)GalNAcβ-core structures, and oligo-glucose (Glc) structures (SI Appendix, Fig. S7). Notably, NeuAc sialylation at the 6 position of the inner GalNAc seems to be required (compound 67) while additional sialylation at the 6 position of Gal (compound 69) increased its affinity slightly. However, sialylation at the 6 position of Gal itself without the presence of sialylation on GalNAc (compound 68) did not support binding, whereas the additional presence of sialylation at the 3 position of Gal (compound 70) actually inhibited the binding slightly. Collectively, the glycan array data suggest that positive recognition and binding by domain 0 prefer a minimum Galβ(1→3)GalNAcβ-core structure sialylated at the 6 position of GalNAc. Di- and tri-Glc with α-linkages could be recognized. The structural basis and functional implications of recognizing these sialylated disaccharides in the context of FIPV host recognition remain to be established.
Structure Comparison of the Putative Receptor-Binding Domain B.
Previous studies on CoVs indicated that domain B within subunit S1 is responsible for primary host receptor binding (1). The structures of domain B from different CoVs share the same overall β-sandwich fold structure, encompassing 2 3-stranded antiparallel β-sheets with divergent loop sequences and conformations that constitute the putative receptor-binding motifs (Fig. 4C). The RBMs play a key role in host specificity; mutations in them are responsible for interspecies transmission (29). Among the 4 selected alphacoronaviruses, FIPV-UU4, HCoV-NL63, human CoV 229E (HCoV-229E), and porcine respiratory coronavirus (PRCV), FIPV-UU4 S protein has the longest loop 1 in domain B with a distinct helical conformation (Fig. 4 C–E and SI Appendix, Table S6), rendering it more solvent-exposed and potentially more accessible to host receptors than that of HCoV-NL63 (19). Meanwhile, other regions of domain B of FIPV-UU4 S protein are shielded by N-glycans (Fig. 4D).
Structural Characteristics of the S2 Subunit in Its Prefusion State.
CoVs share conserved sequences and structures of their S2 subunits (30, 31). The S2 subunit of FIPV-UU4 is structurally similar to that of HCoV-NL63 and porcine deltacoronavirus (PdCoV), with which FIPV-UU4 shares 57 and 52% sequence identity, respectively. FIPV-UU4 S protein shows a positional rmsd of 3.3 and 2.2 Å with respect to that of HCoV-NL63 and PdCoV, respectively (SI Appendix, Table S7). Our cryo-EM analysis identified 12 N-glycosylation sites in the S2 subunit of FIPV-UU4. Except for N841 and N862, all these N-glycosylation sites were confirmed by MS, including 6 high-mannose types, 3 complex types, and 1 that showed a mixture of high-mannose and complex-type N-glycans (Fig. 3B and SI Appendix, Tables S2 and S3).
Activation of CoV S proteins through site-specific proteolysis at the conserved S1/S2 and S2′ cleavage sites is an essential step for viral entry. The S2′ cleavage site of FIPV-UU4 (969LLPPRVGMR↓978S), which connects the upstream S2′ activation loop and a fusion peptide, is implicated in the pathogenicity of serotype I FIPVs (2). The underscore refers to the recognition residue for protease cleavage. Compared with other CoVs, FIPV-UU4 has a short S2′ activation loop with only 1 N-glycan (N949), located upstream of the cleavage site (R977; Fig. 5). Another N-glycan stemming from the other monomer (N841′) is in close proximity to the S2′ cleavage site. These 2 N-glycans may provide steric hindrance for proteolysis of the S2′ activation loop, thereby changing the protease activation requirements and host cell tropism (2).
Bioactivity of FIPV-UU4 S Protein.
To investigate the antigenicity of the HEK293-expressed recombinant FIPV-UU4 S protein, the recombinant FIPV-UU4 S protein was used to immunize mice. The mouse serum was collected and subjected to immunohistochemistry (IHC) staining of clinical serotype I FIPV-infected cat tissues, which were characterized by typical granulomatous inflammation, and the immunocytochemistry (ICC) staining of serotype II FIPV (NTU156)-inoculated Fcwf-4 cells. The result showed positive cytoplasmic signals in macrophages associated with granulomatous lesions by IHC staining but negative signals in serotype II FIPV NTU156-inoculated Fcwf-4 cells by ICC staining, confirming that the recombinant FIPV-UU4 S protein indeed exhibits antigenicity that is close to that of native serotype I FIPVs (SI Appendix, Fig. S1 D and E).
Discussion
We report here a high-resolution cryo-EM structure of FIPV-UU4 S protein with a distinct domain architecture that sets it apart from all previously reported CoV S protein structures (Fig. 1 and SI Appendix, Table S2). The most significant structural difference is the unique 3-blade propeller-like structure formed by the S1 subunit. The combined efforts of cryo-EM and MS analyses identified and delineated the positions and structures of 33 out of the 37 predicted N-glycosylation sites, accounting for 27% of the total molecular mass. Our results represent the unprecedentedly detailed analysis of the N-linked glycans of a CoV S protein in terms of their chemical compositions and 3D structures, made possible through the use of a mammalian expression system to introduce physiological mammalian glycosylation patterns. The resulting models revealed some spatial partitioning of high-mannose versus complex-type glycans. Several high-mannose-type glycans are located at the junctions between domains and appear to serve as wedges to maintain the domain architecture (Fig. 2B). It is plausible that the protein domain junctions generate significant steric hindrance, thereby preventing subsequent trimming and addition of the complex-type glycan structures (32).
In addition to protein folding, viral protein glycosylation plays a pivotal role in viral infection. Glycosylation of virus envelope proteins helps shield their antibody-neutralizing epitopes as a way to evade host immunity. One of the best-known examples is the envelope glycoprotein (gp120) of HIV-1, of which glycosylation accounts for almost half of its molecular mass and plays an essential role in host immunity evasion (33). In CoVs, a recent study on HCoV-NL63 suggests that the glycosylation at N358 within its RBMs may serve as molecular trickery for evading host immunity (19). However, the role of the absent glycan shield at the corresponding putative RBMs in FIPV-UU4 S protein should be further investigated.
In addition to their protective roles in shielding viruses from being neutralized by host immune systems, spike-protein glycosylation can be selectively recognized by host cells to facilitate viral infections. Lectin-dependent enhancements of viral infections have been reported for SARS-CoV (34), Ebola virus (35, 36), Marburg virus (34), and Dengue virus (37). Both serotype I and serotype II FIPVs use feline dendritic cell-specific intercellular adhesion molecule 3-grabbing nonintegrin (fDC-SIGN) as a coreceptor to recognize high-mannose glycans during viral entry and transmission (34). Domain A of FIPV-UU4 S protein is densely decorated with high-mannose-type glycans, which could be implicated in interacting with fDC-SIGN during viral infection (Figs. 2A and 3A).
While viral envelope or S-protein glycosylation is targeted by host cells, several viral envelope or S proteins also exhibit lectin activities to recognize host surface glycans in trans (38). For example, a number of CoVs have been reported to exhibit hemagglutinin activity with some preference for sialylated oligosaccharides (39). Through glycan array analysis, we obtained experimental evidence of lectin activity for domain 0 of FIPV-UU4 S protein, which showed a distinct binding preference for a Galβ(1→3)GalNAcβ-core structure sialylated at the 6 position of the inner GalNAc. If one disregards the anomericity of the GalNAc, this minimal NeuAcα(2→6)GalNAcα-determinant corresponds to the sialyl Tn epitope widely implicated as a cancer and CART antigen, and also being developed as a vaccine candidate. Follow-up binding studies including the use of custom-made O-glycans containing the actual sialyl Tn epitope and other core 1 O-glycans sialylated at different positions would be required to substantiate this intriguing finding that sialylated O-glycans on host cell surfaces might play an important role in viral recognition and infection of serotype I FIPV.
In the present study, 33 N-glycosylation sites were confirmed on the ectodomain of the trimeric S protein. M9 high mannoses were identified on N1092 and N1218 (SI Appendix, Fig. S8) of the Th1 and/or Th2 epitopes (residues 1051 to 1110 and 1208 to 1235) of the FIPV-UU4 S protein (40). Viral protein glycosylation might determine the host tropism (41), immunogenicity, and pathogenicity (42–46). The N-linked glycans in Th1 epitopes might function as hindering structures that constrict T cell recognition or mislead the host immune defense by producing an ineffective immune response (42–46), as demonstrated in HIV (47). The role of these glycans in the pathogenesis of FIP should be further studied.
In summary, we describe, at the atomic level, the structure and glycosylation of FIPV-UU4 S protein. This represents structural work on the serotype I FIPV family, which inflicts an exceptionally high mortality rate for infected cats. The structure reveals major structural differences in the S1 subunit from the only known example within the alphacoronavirus genera, HCoV-NL63 S protein, for which structural information is available. These results demonstrate how structurally divergent these S proteins can be despite their sequence homology (SI Appendix, Table S1). The integrated use of cryo-EM and MS reveals the unique domain architecture with detailed structural information regarding its glycosylation, the galectin-like N-terminal domains, and the location of the putative RBMs of serotype I FIPV. The high-resolution structure of FIPV-UU4 S protein may therefore serve as the blueprint for further mechanistic insights into viral host interactions of FIPV.
Materials and Methods
Purification of FIPV-UU4 S Protein.
HEK293 cells stably expressing FIPV-UU4 S protein were washed by Dulbecco’s phosphate-buffered saline (PBS) (Gibco) 2 times prior to inoculation into 9 175T flasks, each containing 350 mL FreeStyle 293 expression medium (Gibco), and cultured in the CELLSPIN System (INTEGRA Biosciences) at 37 °C for 5 d. Cells were harvested by centrifugation at 1,000 rpm for 20 min, and the supernatant was collected and filtered through a membrane with a pore size of 0.22 μm. The supernatant was supplemented with 10× binding buffer (500 mM sodium phosphate, 1.5 M sodium chloride, 100 mM imidazole, pH 8). HisPur cobalt resin (Thermo Fisher Scientific) was added to the supernatant (20 mL resin per L) following the manufacturer’s protocols. The eluent was concentrated using Vivaspin 20 (GE Healthcare) with 100-kDa molecular mass cutoff, supplemented with cOmplete EDTA-free Protease Inhibitor Mixture (Roche), aliquotted, flash-frozen in liquid nitrogen, and stored at −80 °C until further use.
Cryo-EM sample preparation and data collection.
Defocus phase-contrast cryo-EM.
Four microliters of purified FIPV-UU4 S protein (0.2 mg/mL) in PBS (Sigma-Aldrich; 79382) was applied onto glow-charged 200-mesh Quantifoil R2/1 holey carbon grids. The grids were blotted for 3 s at 4 °C and 100% humidity, and vitrified using a Vitrobot Mark IV (Thermo Fisher). Cryoelectron data of FIPV-UU4 S protein were collected using a 200-keV Talos Arctica microscope with an exposure time of 2.5 s and pixel size of 0.87 Å, using a Falcon III detector (Thermo Fisher) in a linear mode. In total, 2,436 micrographs were collected with defocus ranging between 1.8 and 2.8 µm and accumulated exposure of 48 e−/Å2 distributed over 32 frames.
Volta phase plate cryo-EM.
The grid preparation was the same as aforementioned for defocus phase-contrast (DPC) cryo-EM data collection. Cryoelectron micrographs of FIPV-UU4 S protein were collected using a 300-keV Titan Krios microscope (Thermo Fisher) with an exposure time of 2.27 s and pixel size of 0.85 Å, using a Falcon III detector (Thermo Fisher) in a linear mode with contrast enhancement by using a VPP. In total, 1,405 micrographs were collected with defocus ranging between 0.2 and 0.8 µm and accumulated exposure of 30 e−/Å2 in 30 output frames.
Data Availability.
The atomic coordinates of FIPV-UU4 S protein have been deposited in the Protein Data Bank (PDB) under ID code 6JX7. The cryo-EM maps, including unsharpened and sharpened maps, have been deposited in the Electron Microscopy Data Bank (EMDB) under ID code EMD-9891.
Further detailed information on materials and methods is provided in SI Appendix, Materials and Methods.
Supplementary Material
Acknowledgments
We are thankful for the excellent support from the Academia Sinica Common Mass Spectrometry Facilities for glycan analyses, Academia Sinica Cryo-EM Center (Grant AS-CFII-108-110) for cryo-EM data collection, and Academia Sinica Grid Computing for cryo-EM data processing. This work was supported by the Ministry of Science and Technology (MOST), Taiwan (Grants MOST 106-2311-B-002-028-MY3 to H.-W.C.; 107-2628-M-001-005-MY3 to S.-T.D.H.), National Taiwan University (108L7842 to H.-W.C.), Taiwan Protein Project (AS-KPQ-105-TPP), and intramural funding from Academia Sinica (to S.-T.D.H. and K.-H.K.).
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission.
Data deposition: The cryo-EM map and atomic model coordinates have been deposited in the Electron Microscopy Data Bank (EMDB) and Protein Data Bank (PDB) (ID codes EMD-9891 and 6JX7, respectively).
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.1908898117/-/DCSupplemental.
References
- 1.Millet J. K., Whittaker G. R., Host cell proteases: Critical determinants of coronavirus tropism and pathogenesis. Virus Res. 202, 120–134 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jaimes J. A., Whittaker G. R., Feline coronavirus: Insights into viral pathogenesis based on the spike protein structure and function. Virology 517, 108–121 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yuan Y., et al. , Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nat. Commun. 8, 15092 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sun R. Q., et al. , Outbreak of porcine epidemic diarrhea in suckling piglets, China. Emerg. Infect. Dis. 18, 161–163 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Boileau M. J., Kapil S., Bovine coronavirus associated syndromes. Vet. Clin. North Am. Food Anim. Pract. 26, 123–146 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Cavanagh D., Coronavirus avian infectious bronchitis virus. Vet. Res. 38, 281–297 (2007). [DOI] [PubMed] [Google Scholar]
- 7.Martina B. E., et al. , Virology: SARS virus infection of cats and ferrets. Nature 425, 915 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Guan Y., et al. , Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China. Science 302, 276–278 (2003). [DOI] [PubMed] [Google Scholar]
- 9.Kipar A., Meli M. L., Feline infectious peritonitis: Still an enigma? Vet. Pathol. 51, 505–526 (2014). [DOI] [PubMed] [Google Scholar]
- 10.Pedersen N. C., An update on feline infectious peritonitis: Diagnostics and therapeutics. Vet. J. 201, 133–141 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bálint Á., Farsang A., Szeredi L., Zádori Z., Belák S., Recombinant feline coronaviruses as vaccine candidates confer protection in SPF but not in conventional cats. Vet. Microbiol. 169, 154–162 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Benetka V., et al. , Prevalence of feline coronavirus types I and II in cats with histopathologically verified feline infectious peritonitis. Vet. Microbiol. 99, 31–42 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Pedersen N. C., An update on feline infectious peritonitis: Virology and immunopathogenesis. Vet. J. 201, 123–132 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hohdatsu T., Izumiya Y., Yokoyama Y., Kida K., Koyama H., Differences in virus receptor for type I and type II feline infectious peritonitis virus. Arch. Virol. 143, 839–850 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Belouzard S., Millet J. K., Licitra B. N., Whittaker G. R., Mechanisms of coronavirus cell entry mediated by the viral spike protein. Viruses 4, 1011–1033 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Klenk H. D., Garten W., Host cell proteases controlling virus pathogenicity. Trends Microbiol. 2, 39–43 (1994). [DOI] [PubMed] [Google Scholar]
- 17.Kirchdoerfer R. N., et al. , Pre-fusion structure of a human coronavirus spike protein. Nature 531, 118–121 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Walls A. C., et al. , Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer. Nature 531, 114–117 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Walls A. C., et al. , Glycan shield and epitope masking of a coronavirus spike protein observed by cryo-electron microscopy. Nat. Struct. Mol. Biol. 23, 899–905 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Xiong X., et al. , Glycan shield and fusion activation of a deltacoronavirus spike glycoprotein fine-tuned for enteric infections. J. Virol. 92, e01628-17 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shang J., et al. , Cryo-EM structure of infectious bronchitis coronavirus spike protein reveals structural and functional evolution of coronavirus spike proteins. PLoS Pathog. 14, e1007009 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Delmas B., Laude H., Assembly of coronavirus spike protein into trimers and its role in epitope expression. J. Virol. 64, 5367–5375 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Cavanagh D., Coronavirus IBV glycopolypeptides: Size of their polypeptide moieties and nature of their oligosaccharides. J. Gen. Virol. 64, 1187–1191 (1983). [DOI] [PubMed] [Google Scholar]
- 24.Feinberg H., Mitchell D. A., Drickamer K., Weis W. I., Structural basis for selective recognition of oligosaccharides by DC-SIGN and DC-SIGNR. Science 294, 2163–2166 (2001). [DOI] [PubMed] [Google Scholar]
- 25.Ritchie G., et al. , Identification of N-linked carbohydrates from severe acute respiratory syndrome (SARS) spike glycoprotein. Virology 399, 257–269 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Zheng J., et al. , Identification of N-linked glycosylation sites in the spike protein and their functional impact on the replication and infectivity of coronavirus infectious bronchitis virus in cell culture. Virology 513, 65–74 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhou Y., et al. , A single asparagine-linked glycosylation site of the severe acute respiratory syndrome coronavirus spike glycoprotein facilitates inhibition by mannose-binding lectin through multiple mechanisms. J. Virol. 84, 8753–8764 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chien C. H., Ho M. R., Lin C. H., Hsu S. D., Lactose binding induces opposing dynamics changes in human galectins revealed by NMR-based hydrogen-deuterium exchange. Molecules 22, E1357 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li F., Structure, function, and evolution of coronavirus spike proteins. Annu. Rev. Virol. 3, 237–261 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zheng Q., et al. , Core structure of S2 from the human coronavirus NL63 spike glycoprotein. Biochemistry 45, 15205–15215 (2006). [DOI] [PubMed] [Google Scholar]
- 31.Xu Y., et al. , Crystal structure of severe acute respiratory syndrome coronavirus spike protein fusion core. J. Biol. Chem. 279, 49414–49419 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Helenius A., Aebi M., Intracellular functions of N-linked glycans. Science 291, 2364–2369 (2001). [DOI] [PubMed] [Google Scholar]
- 33.Pancera M., et al. , Structure and immune recognition of trimeric pre-fusion HIV-1 Env. Nature 514, 455–461 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Marzi A., et al. , DC-SIGN and DC-SIGNR interact with the glycoprotein of Marburg virus and the S protein of severe acute respiratory syndrome coronavirus. J. Virol. 78, 12090–12095 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Alvarez C. P., et al. , C-type lectins DC-SIGN and L-SIGN mediate cellular entry by Ebola virus in cis and in trans. J. Virol. 76, 6841–6844 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lozach P. Y., et al. , DC-SIGN and L-SIGN are high affinity binding receptors for hepatitis C virus glycoprotein E2. J. Biol. Chem. 278, 20358–20366 (2003). [DOI] [PubMed] [Google Scholar]
- 37.Navarro-Sanchez E., et al. , Dendritic-cell-specific ICAM3-grabbing non-integrin is essential for the productive infection of human dendritic cells by mosquito-cell-derived dengue viruses. EMBO Rep. 4, 723–728 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Van Breedam W., Pöhlmann S., Favoreel H. W., de Groot R. J., Nauwynck H. J., Bitter-sweet symphony: Glycan-lectin interactions in virus biology. FEMS Microbiol. Rev. 38, 598–632 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Schwegmann-Wessels C., Herrler G., Sialic acids as receptor determinants for coronaviruses. Glycoconj. J. 23, 51–58 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Satoh R., et al. , Screening and identification of T helper 1 and linear immunodominant antibody-binding epitopes in the spike 2 domain and the nucleocapsid protein of feline infectious peritonitis virus. Vaccine 29, 1791–1800 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Wasik B. R., Barnard K. N., Parrish C. R., Effects of sialic acid modifications on virus binding and infection. Trends Microbiol. 24, 991–1001 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Botarelli P., et al. , N-glycosylation of HIV-gp120 may constrain recognition by T lymphocytes. J. Immunol. 147, 3128–3132 (1991). [PubMed] [Google Scholar]
- 43.Selby M., et al. , Hepatitis C virus envelope glycoprotein E1 originates in the endoplasmic reticulum and requires cytoplasmic processing for presentation by class I MHC molecules. J. Immunol. 162, 669–676 (1999). [PubMed] [Google Scholar]
- 44.Werdelin O., Meldal M., Jensen T., Processing of glycans on glycoprotein and glycopeptide antigens in antigen-presenting cells. Proc. Natl. Acad. Sci. U.S.A. 99, 9611–9613 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Carbone F. R., Gleeson P. A., Carbohydrates and antigen recognition by T cells. Glycobiology 7, 725–730 (1997). [DOI] [PubMed] [Google Scholar]
- 46.Speir J. A., Abdel-Motal U. M., Jondal M., Wilson I. A., Crystal structure of an MHC class I presented glycopeptide that generates carbohydrate-specific CTL. Immunity 10, 51–61 (1999). [DOI] [PubMed] [Google Scholar]
- 47.Dimitrov J. D., Kazatchkine M. D., Kaveri S. V., Lacroix-Desmazes S., “Rational vaccine design” for HIV should take into account the adaptive potential of polyreactive antibodies. PLoS Pathog. 7, e1002095 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The atomic coordinates of FIPV-UU4 S protein have been deposited in the Protein Data Bank (PDB) under ID code 6JX7. The cryo-EM maps, including unsharpened and sharpened maps, have been deposited in the Electron Microscopy Data Bank (EMDB) under ID code EMD-9891.
Further detailed information on materials and methods is provided in SI Appendix, Materials and Methods.