Abstract
Porcine coronavirus SADS-CoV has been identified from suckling piglets with severe diarrhea in southern China in 2017. The SADS-CoV genome shares ~95% identity to that of bat α-coronavirus HKU2, suggesting that SADS-CoV may have emerged from a natural reservoir in bats. Here we report the cryo-EM structures of HKU2 and SADS-CoV spike (S) glycoprotein trimers at 2.38 Å and 2.83 Å resolution, respectively. We systematically compare the domains of HKU2 spike with those of α-, β-, γ-, and δ-coronavirus spikes, showing that the S1 subunit N- and C-terminal domains of HKU2/SADS-CoV are ancestral domains in the evolution of coronavirus spike proteins. The connecting region after the fusion peptide in the S2 subunit of HKU2/SADS-CoV adopts a unique conformation. These results structurally demonstrate a close evolutionary relationship between HKU2/SADS-CoV and β-coronavirus spikes and provide insights into the evolution and cross-species transmission of coronaviruses.
Subject terms: Cryoelectron microscopy, Viral evolution, Virus structures
Several coronaviruses infecting humans and animals have emerged in recent years. Here, the authors provide structures of the spike proteins of the porcine coronavirus SADS-CoV and closely related bat coronavirus HKU2, providing insights into evolution of coronavirus spike proteins.
Introduction
Coronaviruses, categorized into the order Nidovirales, family Coronaviridae, and subfamily Coronavirinae, are a large group of viral pathogens with a wide host range1. Their infections in humans, other mammals, and birds can cause respiratory, hepatic, enteric and neurological diseases with varying severity2. The zoonotic transmission of novel coronaviruses into humans poses severe threats to the global health, evidenced by the severe acute respiratory syndrome coronavirus (SARS-CoV), Middle East respiratory syndrome coronavirus (MERS-CoV), and the SARS-CoV-2 that is causing the ongoing pandemic COVID-19 in the world1,3–6. In the meantime, coronaviruses infecting domestic animals also bring substantial economic losses7. For example, the swine acute diarrhea syndrome coronavirus (SADS-CoV) (also known as SeACoV and PEAV) isolated in 2017 caused outbreaks of severe watery diarrhea of suckling piglets with a mortality up to 90% in several commercial pig farms in Guangdong Province of China8–13. SADS-CoV is an α-coronavirus and other representative members in the α-genus are porcine epidemic diarrhea virus (PEDV), porcine transmissible gastroenteritis coronavirus (TGEV), porcine respiratory coronavirus (PRCV), feline infectious peritonitis virus (FIPV), and human NL63 and 229E coronaviruses (HCoV-NL63 and HCoV-229E)1. Representative members in other three genera include mouse hepatitis coronavirus (MHV), bovine coronavirus (BCoV), SARS-CoV, SARS-CoV-2, MERS-CoV, HCoV-OC43, and HCoV-HKU1 in the β-genus, avian infectious bronchitis virus (IBV) in the γ-genus, and porcine deltacoronavirus (PdCoV) in the δ-genus1.
Cross-species transmission promoted by genetic recombination and/or mutations underlies the host range expansion of coronaviruses3,14–16. Bats are the natural reservoir of more than 30 different α- and β-coronaviruses that have great potential for interspecies transmission by recombination and/or mutation15,17–19. Data on genetic evolution, receptor binding, and pathogenesis have demonstrated that human SARS-CoV and MERS-CoV most likely originate from bats1. Palm civets and dromedary camels are the intermediate hosts of SARS-CoV and MERS-CoV from bats to humans, respectively1. The newly identified porcine SADS-CoV isolates are also found to share ~95% sequence identity with Rhinolophus bat coronavirus HKU2, and this further stressed the severe results of coronavirus spillover from bats to domestic animals8–13. However, the molecular mechanisms underlying the transmission of SADS-CoV from bats to pigs are still unknown and need to be further explored. Recently it was shown that SADS-CoV is able to infect cells from a broad range of species including mouse, chicken, pig, monkey, and human, indicating a high potential of the SADS-CoV for interspecies transmission20.
The spike glycoprotein of coronaviruses mediates viral entry by binding host receptor with the S1 subunit and fusing viral and cellular membranes with the S2 subunit, thereby determining viral host range and tissue tropism21,22. As a class I viral fusion protein, the spike exists on the envelope of virion as a homotrimer and each monomer contains more than 1000 amino acid residues that can be cleaved into S1 and S2 subunits21. For most coronaviruses, the N-terminal domain (NTD) of the S1 subunit recognizes cell-surface carbohydrates, while the C-terminal domain (CTD) specifically binds to cellular protein receptors21–23. SARS-CoV, SARS-CoV-2, and HCoV-NL63 utilize CTD to bind human receptor ACE224–30; MERS-CoV utilizes CTD to bind human receptor DPP431,32; and TGEV, PRCV, and 229E utilize CTD to bind receptor APN33–35; HCoV-OC43 utilizes NTD to recognize glycans36; and one exception is MHV, which utilizes the NTD to bind mouse receptor CEACAM1a37,38. Therefore, the S1 subunit, especially its NTD and CTD, is the most variable region of the spike, and is responsible for different tropisms of coronaviruses. In comparison, the S2 subunit containing the fusion peptide (FP) and heptad repeats (HR1 and HR2) for membrane fusion are more conserved in both sequence and structure21,22. For the SADS-CoV, receptor analysis indicated that none of the known coronavirus protein receptors including ACE2, DPP4, and APN are essential for cell entry10,20. There are also no reports regarding the recognition of glycans by the NTD of SADS-CoV.
Structural studies of the spike and its binding with glycans and protein receptors have provided important insights into the origin, evolution, and interspecies transmission of coronaviruses. Cryo-EM structures of spike trimer from all four coronavirus genera have been reported: the α-coronavirus spike structures are determined for HCoV-NL6339, HCoV-229E34, PEDV40, and FIPV41; the β-coronavirus spike structures are determined for MHV38,42,43, HCoV-HKU144, HCoV-OC4336, SARS-CoV24,45–47, MERS-CoV45,48,49, and SARS-CoV-250,51; and the γ-coronavirus spike structure is determined for IBV52 and the δ-coronavirus spike structure is determined for PdCoV53,54. The cryo-EM structures of bat coronavirus spike trimers have not been reported, and only crystal structures of the CTD from HKU455, HKU556, and HKU957 were determined.
The spikes of SADS-CoV (1130 amino acid residues) and HKU2 (1128 amino acid residues) are among the shortest coronavirus spike glycoproteins58 and their amino acid identities to other known coronavirus spikes are lower than 28%, indicating the spikes of HKU2 and SADS-CoV are unique8–13,59. In this study, we report the cryo-EM structures of the SADS-CoV and HKU2 spike trimers at 2.83 and 2.38 Å resolution, respectively. The HKU2 spike trimer structure is the first one from bat coronavirus. We analyze the HKU2 and SADS-CoV trimer structures and also compare the NTD, CTD, SD1, and SD2 domains of the S1 subunit and the S2 subunit of HKU2 with other spikes from α-, β-, γ-, and δ-coronaviruses. Our results strongly support that HKU2 and SADS-CoV preserve primitive structural features in their spikes that have a close evolutionary relationship with β-coronavirus spikes and provide insights into the evolution and cross-species transmission of coronaviruses.
Results
Protein expression and structure determination
The cDNAs encoding HKU2 spike (YP_001552236) and SADS-CoV spike (AVM41569.1) were synthesized with codons optimized for insect cell expression. HKU2 ectodomains (residues 1–1066) and SADS-CoV ectodomains (residues 1–1068) were separately cloned into pFastBac-Dual vector (Invitrogen) with C-terminal foldon tag and Strep tag. After expression in Hi5 insect cells and purification to homogeneity, the cryo-EM images on these two spike ectodomains were recorded using FEI Titan Krios microscope operating at 300 kV with a Gatan K2 Summit direct electron detector (Supplementary Fig. 1). About 1,400,000 particles for HKU2 spike and 900,000 particles for SADS-CoV spike were subjected to 2D classification, and a total of 421,490 particles of HKU2 spike and 152,334 particles of SADS-CoV spike were selected and subjected to 3D refinement with C3 symmetry to generate density maps (Supplementary Fig. 2). The overall density maps were solved to 2.38 Å for HKU2 spike and 2.83 Å for SADS-CoV spike (gold-standard Fourier shell correlation (FSC) = 0.143) (Supplementary Figs. 1–3). The atomic-resolution density map enabled us to build nearly all residues of HKU2 spike ectodomains (residues 17–995) except for a few breaks (residues 129–141 and 203–205), as well as 48 N-linked glycans and 71 water molecules (Supplementary Figs. 3a, 4a, and 5a). The final refined model of SADS-CoV spike contains residues 19–998 with some short breaks (residues 82–83, 134–143, and 488–490) and 45 N-linked glycans (Supplementary Figs. 3b, 4b, and 5b). Data collection and refinement statistics for these two structures are listed in Table 1.
Table 1.
HKU2 (EMDB-30037) (PDB 6M15) | SADS-CoV (EMDB-30038) (PDB 6M16) | |
---|---|---|
Data collection and processing | ||
Magnification | ×130,000 | ×130,000 |
Voltage (kV) | 300 | 300 |
Electron exposure (e−/Å2) | 49.784 | 49.784 |
Defocus range (μm) | −1 to −3 | −1 to −3 |
Pixel size (Å) | 1.061 | 1.061 |
Symmetry imposed | C3 | C3 |
Initial particle images (no.) | ~1,400,000 | ~900,000 |
Final particle images (no.) | 421,490 | 152,334 |
Map resolution (Å) | 2.38 | 2.83 |
FSC threshold | 0.143 | 0.143 |
Map resolution range (Å) | 2.38–4 | 2.83–6 |
Refinement | ||
Initial model used (PDB code) | 5SZS | 5SZS |
Model resolution (Å) | 2.38 | 2.83 |
FSC threshold | 0.143 | 0.143 |
Model resolution range (Å) | 2.38 | 2.83 |
Map sharpening B factor (Å2) | −87.05 | −101.51 |
Model composition | ||
Nonhydrogen atoms | 23,471 | 23,433 |
Protein residues | 2895 | 2895 |
Ligands | 72 | 60 |
B factors (Å2) | ||
Protein | 25.63 | 55.82 |
Ligand | 50.21 | 65.92 |
R.m.s. deviations | ||
Bond lengths (Å) | 0.004 | 0.008 |
Bond angles (°) | 0.624 | 0.763 |
Validation | ||
MolProbity score | 1.69 | 2.39 |
Clashscore | 3.88 | 7.99 |
Poor rotamers (%) | 3.46 | 3.97 |
Ramachandran plot | ||
Favored (%) | 97.5 | 91.68 |
Allowed (%) | 2.5 | 8.22 |
Disallowed (%) | 0 | 0.1 |
Overall structures of HKU2 and SADS-CoV spikes
The overall structures of HKU2 and SADS-CoV spikes we determined here resemble the previously reported prefusion structures of coronaviruses spikes. Both spike trimers have a mushroom-like shape (~150 Å in height and ~115 Å in width) (Fig. 1a), consisting of a cap mainly formed by β-sheets of the S1 subunit, a central stalk mainly formed by α-helices of the S2 subunit, and a root formed by twisted β-sheets and loops of the S2 subunit (Fig. 1a). In each trimer there is a C3 axis along the central stalk (Fig. 1a). The amino acid identity between HKU2 and SADS-CoV spikes is 86%, and these two spike structures are quite similar with the root mean square deviation (r.m.s.d.) being 0.53 Å for 962 aligned Cα atoms of the monomer and 0.56 Å for 2886 aligned Cα atoms of the trimer. Due to the high structural similarity, we will use the HKU2 structure to present the features of both spikes in the subsequent description, whereas significant differences between them will be pointed out only when necessary.
The S1 subunit of the HKU2 spike comprises two major domains, NTD and CTD, which are followed by two subdomains 1 and 2 (SD1 and SD2) connecting them to the S2 subunit (Fig. 1b, c). The S1 subunits from three monomers form the cap of the spike, in which the three CTDs in the inner part are at the apex sitting on top of the central stalk and the three NTDs are located outside the CTDs surrounding the central stalk (Fig. 1a). The NTD, CTD, SD1, and SD2 of the S1 subunit are all mainly composed of β strands (Fig. 1c). In contrast, the upstream helix (UH), FP, connecting region (CR), heptad repeat 1 (HR1), and central helix (CH) of the S2 subunit are mainly composed of helices, whereas the β-hairpin (BH) and subdomain 3 (SD3) at the bottom part of the S2 subunit mainly consist of β strands and loops (Fig. 1c). Moreover, the residues after the SD3, which contain the heptad repeat 2 (HR2), are not resolved in the HKU2 and SADS-CoV spike structures, as well as in all other reported coronavirus spike structures in the prefusion state.
The SD1 and SD2 of the S1 subunit and the S2 subunit are highly similar in amino acid sequence (85, 84, and 95% identities) and structure (Cα r.m.s.d. less than 0.5 Å) between HKU2 and SADS-CoV spikes (Supplementary Fig. 6). The NTD has the lowest sequence identity of 70% among all domains. Structural superimposition also gave a higher Cα r.m.s.d. value of 1.2 Å between these two NTDs. The core β-sheet structure is structurally conserved and conformational variations reside in the loops (Supplementary Fig. 6a). The CTD sequence identity between HKU2 and SADS-CoV spikes is 82%, and the Cα r.m.s.d. between these two CTDs is 1.1 Å, also indicating more structural variations in the CTD in comparison with the highly conserved SD1, SD2, and S2 subunit. The NTD and CTD of the S1 subunit are commonly utilized by coronaviruses for binding cell-surface carbohydrates or protein receptors for cell attachment23. Therefore, the sequence and/or structural variations indicate that HKU2 and SADS-CoV would also bind different host receptors by NTD and/or CTD of the S1 subunit, although their receptors in bat and pig are unknown and the receptor-binding sites on spike have not been defined.
NTD structure and comparisons
The NTD of HKU2 has three layers of antiparallel β-sheet with the top one consisting of six strands, the middle one consisting of five strands, and the bottom one consisting of three strands. Below the bottom sheet is a short α-helix (Fig. 2a). The top and middle β-sheets form a galectin-like β-sandwich fold, which is inserted between two strands of the bottom sheet (Fig. 2a). To supplement, three disulfide bonds are detected in the HKU2 NTD structure: C17–C56 connecting the N-terminus of the NTD to its upper loop, C124–C149 connecting β6 and β7 strands in the top sheet, and C234–C244 connecting the bottom helix to the bottom sheet (Fig. 2a).
Although the NTDs of all coronaviruses adopt a similar overall architecture, the NTD of HKU2 has the highest structural similarity with the NTD1 (named domain 0 in previous reports) of α-coronavirus HCoV-NL63 with an r.m.s.d. of 2.7 Å for 186 aligned Cα atoms (Fig. 2b and Supplementary Fig. 8d). The NTD1 of α-coronavirus PEDV is not completely modeled in the spike trimer structure; however, the partial model still fits well with the NTD of HKU2 with an r.m.s.d. of 2.3 Å for 73 aligned Cα atoms. Both HCoV-NL63 and PEDV have a second NTD (NTD2, also named domain A in previous reports), and the NTD of HKU2 is structurally less similar to the NTD2 with an r.m.s.d. of 4.3 Å against HCoV-NL63 NTD2 and of 4.1 Å against PEDV NTD2 (Fig. 2b and Supplementary Fig. 8d). Recent structural determination showed that another α-coronavirus, HCoV-229E, also has one NTD, which is more structurally similar to the NTD2 (Cα r.m.s.d. of 2.0 Å) than the NTD1 (Cα r.m.s.d. of 3.7 Å) of HCoV-NL63 (Fig. 2c). All above comparisons indicate that there are two subtypes of NTD in the α-coronaviruses: the subtype I is represented by the NTDs of HKU2, SADS-CoV, the NTD1s of HCoV-NL63 and PEDV, and the subtype II is represented by the NTD of HCoV-229E, the NTD2s of HCoV-NL63 and PEDV (Fig. 2b, c). Although sharing an overall architecture, these two NTD subtypes have a structural difference in the galectin-like β-sandwich fold containing the top and middle sheets stacked together through hydrophobic interactions. These two β-sheets are well aligned in the galectin-like domain of subtype I, whereas there is an alignment shift in the galectin-like domain of subtype II (Fig. 2b). The other notable difference is the distribution of signature disulfide bonds. A signature disulfide bond C124–C149 (numbered in HKU2 and connecting β6 and β7 strands in the top sheet) is conserved in all subtype I NTDs (Fig. 2b), and the subtype II NTDs have two signature disulfide bonds: the first one C145–C168 (numbered in HCoV-229E) connecting β5 and β6 strands and the second one C81–C105 (numbered in HCoV-229E) connecting β2 and neighbor loop (Fig. 2c). Besides, C234–C244 (numbered in HKU2 and connecting the bottom helix to the bottom sheet) is conserved in both subtype I and subtype II NTDs (Fig. 2b, c).
The NTDs of β-coronaviruses including BCoV, HCoV-HKU1, HCoV-OC43, MERS-CoV, SARS-CoV, and MHV resemble the subtype I, rather than the subtype II NTD in the topology and distribution of the disulfide bonds (Fig. 2d). These β-coronavirus NTDs have additional loops in the N-terminus, between β1 and β2 strands, and between β6 and β7 strands (numbered in HKU2 structure), forming an extensive ceiling-like structure on top of the galectin-like fold (Fig. 2d). It has been found that the evolvement of this ceiling-like structure has functional outcomes such as immune evasion or receptor binding52. The NTD of γ-coronavirus IBV also resemble the subtype I NTD in the topology, although its disulfide bond positions are not conserved as in the subtype I NTD (Fig. 2e). Of note, the NTD of δ-coronavirus PdCoV resemble the subtype II NTD in the topology and distribution of the disulfide bonds (Fig. 2f). Both IBV and PdCoV NTDs also have additional insertions including loops and short helices in the galectin-like fold compared with the two subtypes of NTD in α-coronaviruses (Fig. 2e, f).
CTD structure and comparisons
The CTD of HKU2 has a twisted five-stranded antiparallel β-sheet as the core with connecting loops between the strands (Fig. 3a). It contains four disulfide bonds: C277–C300 and C285–C290 at the N-terminus, C341–C397 at the C-terminus, and the last one C331–C369 connecting the β2 and β5 strands in the core β-sheet (Fig. 3a). Interestingly, the CTD core of HKU2 is of high structural similarity with the conserved CTD core of β-coronaviruses and the disulfide bonds in the CTD of HKU2 except for C285–C290 are also detected in all β-coronavirus CTDs (Fig. 3b). These CTDs have the core of one twisted β-sheet and here we name them as one-layer CTD subtype (Fig. 3a, b). The β-coronavirus CTDs always have an insertion consisting of loops and/or strands between the β5 and β6 strands of the core (Fig. 3b). SARS-CoV26, SARS-CoV-227–30, MERS-CoV31,32, HKU455, and HKU556 have receptor-binding motif (RBM) in this insertion region responsible for binding their respective protein receptors. In the CTD of HKU2, there is only one short loop between the β5 and β6 strands of the core twisted β-sheet (Fig. 3a).
Although as members in the α-genus, HKU2, and SADS-CoV CTD structures are significantly different from those of other α-coronaviruses HCoV-NL63, HCoV-229E, PEDV, TGEV, and PRCV that contain two layers of β-sheets (Fig. 3c). And we named these CTDs as two-layer CTD subtype. All available two-layer CTD structures can be well aligned with Cα r.m.s.d. in the range of 1.0–3.4 Å. These two-layer CTDs contain two highly conserved disulfide bonds: C540–C586 and C569–C596 (numbered in PEDV CTD) (Fig. 3c). The C569–C596 is conserved among all coronaviruses, whereas the C540–C586 is conserved in all α-coronaviruses (except for HKU2 and SADS) and δ-coronavirus PdCoV (Fig. 3c).
The CTD of δ-coronavirus PdCoV has a core of two β-sheets, belonging to the two-layer CTD subtype (Fig. 3c). As for the γ-coronavirus IBV, the core of its CTD is also similar to the typical two-layer CTD (Fig. 3c). However, several β strands are replaced by loops and the disulfide bonds are in different positions from the two-layer CTD (Fig. 3c). IBV CTD also has an extra region of loops, reminiscent of the extra domain in the CTDs of β-coronaviruses (Fig. 3c).
SD1 and SD2 structures and comparisons
The SD1 and SD2 are two subdomains following the CTD in the S1 subunit, linking the CTD to the S2 subunit. The HKU2 SD1 is a partial β barrel consisting of five β strands and a disulfide bond (C409–C458) connecting its C-terminus to the β1 strand (Fig. 4a). This five-stranded β barrel and the linking disulfide bond are conserved among all four genera of coronavirus (Fig. 4a). The HKU2 SD2 has a structure of two layers of β-sheet with an additional short α-helix over the top sheet (Fig. 4b). The additional α-helix and the top sheet is linked by a disulfide bond (C482–C509), and another disulfide bond (C524–C533) links the C-terminal loop to the bottom sheet (Fig. 4b). The two-layer core structure and the second disulfide bond are conserved among all genera of coronavirus; however, the additional α-helix and the first linking disulfide bond is a distinct feature of β-coronaviruses plus α-coronavirus HKU2 and SADS-CoV (Fig. 4b). This additional helix appears to be an insertion between the β2 and β3 strands of the SD2, and is retained during evolution of β-coronaviruses.
Quaternary packing of the NTD and CTD in the spike
It has been observed that coronaviruses have two types of quaternary packing mode of the S1 subunits in the trimer: intrasubunit packing and cross-subunit packing52. Actually, this is mainly caused by different positioning and interaction between NTD and CTD in the spike monomer. The HKU2 S1 subunit, similar to those in α-coronaviruses HCoV-NL63, HCoV-229E, and PEDV and δ-coronavirus PdCoV, has an inward CTD that contacts with the NTD (Fig. 5a). The three structural NTD–CTD modules in the cap region of these spikes are composed of NTD and CTD from the same monomer, forming the intrasubunit packing in the spike trimer (Fig. 5a). The S1 subunits of other coronaviruses in the β- and γ-genera including MHV, SARS-CoV, MERS-CoV, HCoV-OC43, and IBV have an outward CTD that is far away from the NTD (Fig. 5b). Therefore, the three structural NTD–CTD modules in the cap region of these spikes have the NTD from one monomer and the CTD from the adjacent monomer, forming the cross-subunit packing in the spike trimer (Fig. 5b). Interestingly, we found that the outward CTDs always have an insertion in the core structure, such as β-coronavirus CTDs and γ-coronavirus IBV CTD (Fig. 5b). In contrast, all inward CTDs only have the one-layer or two-layer core structure without obvious inserted region.
Conserved S2 subunit and a distinct CR
Sequence analysis suggested that the S1/S2 protease cleavage site at the boundary between the S1 and S2 subunits is R544-M545 in HKU2 spike and R546-M547 in SADS-CoV spike8,11,59. Compared with the S1 subunit, the topology and structure of S2 subunit are highly conserved in all coronavirus spikes. The HKU2 S2 subunit contains a 3-helix UH (residues 589–639), an FP helix (residues 672–684), a CR (residues 689–747), a 4-helix HR1 (residues 748–836), a CH (residues 837–887), a twisted BH (residues 888–929), and a β-sandwich like SD3 (residues 930–995) (Figs. 1b and 6a). Like in other coronavirus spikes in the prefusion state, the model of HR2 after SD3 was not built in the structure due to poor density. Five disulfide bonds in S2 are detected. Two of them (C590–C612 and C595–C601) stabilize the folded helices of UH, C696–C706 bends the CR, C884–C895 links the CH and the BH, and C934–C943 is within the SD3 (Fig. 6a). The first four disulfide bonds are conserved in all coronaviruses, and the last one in the SD3 has different positions in different spikes. Specifically, it links the β2 and β3 strands of SD3 in the spikes of HKU2, SADS-CoV, and MERS-CoV (numbered in MERS-CoV), and in other coronavirus spikes it links the β2 strand to the C-terminal loop of SD3 (numbered in MERS-CoV) (Supplementary Fig. 7).
All coronavirus spikes have the S2′ protease site upstream from the FP in the S2 subunit, which is essential for proteolytic fusion activation of the spike. Receptor binding and cleavage at the S2′ site promote large-scale conformational changes of the FP, CR, HR1, and HR2, allowing the insertion of FP into host cell membrane and the formation of six-helix bundle. The FP and CR, which are often not well and totally resolved in other coronavirus spike structures, can be clearly modeled in the HKU2 spike due to the atomic resolution of the map (Supplementary Fig. 5). The typical CR in the S2 subunit contains three helices and one short strand, with a disulfide bond bending the first and second helix to form a turn (Fig. 6b). In HKU2, the second helix is replaced by a short strand (713–716) and the third helix is replaced by a loop (721–741), therefore there are two short strands and only one helix in HKU2 CR (Fig. 6c). The conserved disulfide bond C696–C706 makes the first helix of CR in HKU2 spike turn upside down. The S2′ cleavage site (between R671 and S672) is then covered by the reversed CR helix and loops, and R671 interacts with E723 in the loop and K697 and K698 in helix 1 (Fig. 6c). In other coronaviruses, taking the MHV S2 for example, the helix 1 does not cover the S2′ site (between R869 and S870), and R869 only loosely interacts with T929 in helix 3 (Fig. 6b). After the dissociation of the S1 subunit triggered by receptor binding, the exposure of the S2′ site for cleavage is a prerequisite for the proteolytic activation of the coronavirus spike to mediate membrane fusion. The buried S2′ site indicates that HKU2 spike, compared with other coronavirus spikes, would require more conformational changes around the S2′ site for the exposure.
Discussion
A porcine coronavirus SADS-CoV (also named SeACoV and PEAV in other reports) was recently identified from suckling piglets with diarrhea in southern China, and its genome sequence was most identical (~95% identity) to that of Rhinolophus bat α-coronavirus HKU28–13. The SADS-CoV and HKU2 are phylogenetically located in a sublineage closely related to the proposed α-coronavirus group-1b lineage at the complete genome level8–13. However, phylogenetic analysis based on the spike glycoprotein indicated that they are members of a separate lineage clustered within β-coronavirus (Supplementary Fig. 8a), suggesting that HKU2 and SADS-CoV probably resulted from recombination of an α-coronavirus with an unrecognized β-coronavirus S gene8–13. These results, together with the lower than 28% amino acid identities to other known coronavirus glycoproteins, strongly indicate that the spike glycoproteins of HKU2 and SADS-CoV are unique8–13. In this study, we determined the cryo-EM structures of HKU2 and SADS-CoV spike glycoproteins at atomic resolutions. Pairwise comparisons demonstrated nearly identical overall structures, and differences mainly locate in the loops of NTD and CTD of the S1 subunit between the spikes of HKU2 and SADS-CoV (Supplementary Fig. 6). A series of structural analysis and comparisons were also performed at the domain level between HKU2 spike with other coronavirus spikes with determined structures. Our results show that HKU2 and SADS-CoV spikes maintain primitive structural features, especially in the NTD and CTD, and provide more insights into the evolution of coronaviruses.
The HKU2 and SADS-CoV have one NTD in the S1 subunit, and their structures are more similar to the NTD1 than the NTD2 of α-coronaviruses HCoV-NL63 and PEDV, whereas the only NTD of HCoV-229E is structurally more similar to the NTD2 than the NTD1 (Fig. 2). Therefore, we suggest that α-coronaviruses have two subtypes of NTD. The evolution relationship between them are not clear yet. It was once suggested that the presence of two NTDs in HCoV-NL63 is a result of gene duplication39. However, the sequence identity between these two NTDs is only 15.7% in HCoV-NL63 and 12.9% in PEDV. Considering that HKU2 (SADS-CoV) and HCoV-229E have one NTD belonging to either subtype I or subtype II, a more plausible evolution way of the NTD in α-coronaviruses is the recombination of two separate primitive domains into the genome, resulting in the presence of two NTDs in the S1 subunit of α-coronaviruses including HCoV-NL63 and PEDV. Of note, these two NTD subtypes may represent primitive structures that could be the evolutionary ancestors of NTDs of other genera coronaviruses. For example, in the current available spike structures, the NTDs of β-coronavirus are similar to the HKU2 NTD representing the subtype I in both architecture and disulfide bond positions. These β-coronavirus NTDs also have additional loop ceiling over the top sheet, functionally facilitating immune evasion or protein receptor binding such as in MHV37,38. The NTD of γ-coronavirus IBV is also architecturally similar to the subtype I, although the disulfide bond positions are not conserved (Fig. 2e). In contrast, the δ-coronavirus PdCoV NTD is similar to the HCoV-229E NTD representing the subtype II in both architecture and disulfide bond positions (Fig. 2f). A previous study of the IBV spike proposed that α-coronavirus NTDs are probably the most ancestral and the NTDs of the four genera form an evolutionary spectrum in the order of α-, δ-, γ-, and β-genus52. Our proposal here is similar to the previous one in the point that two NTD subtypes in α-coronaviruses may represent primitive structures that could be the evolutionary ancestors of NTDs. However, we argue that the evolution pathways may not be in the order of α-, δ-, γ-, and β-genus. A more plausible pathway is that the β-, γ-, and δ-coronavirus NTDs may evolve independently and parallelly from subtype I (β- and γ-coronavirus NTDs) or subtype II (δ-coronavirus NTDs) (Supplementary Fig. 8b).
The HKU2 and SADS-CoV CTD structure has a one-layer core consisting of a twisted five-stranded antiparallel β-sheet. Interestingly, β-coronavirus CTDs also have the similar one-layer core structure and three strictly conserved disulfide bonds are also present in the core of HKU2 CTD. Currently, all identified RBM of β-coronavirus CTDs are within an inserted domain between two strands of the core sheet, and this insertion responsible for receptor binding of β-coronaviruses is replaced by a short loop in HKU2 CTD. This result firstly indicates HKU2 CTD represent a primitive structure in the one-layer CTD family, while the inserted domain in β-coronaviruses results from a recombinant event during evolution (Supplementary Fig. 8c). The second indication is that HKU2 and SADS-CoV may not utilize the CTD to bind protein receptors that have not been identified yet, and their different receptor usage may be determined by the NTD that harbors almost 50% of residue difference between them. Of note, the CTDs from other α-coronaviruses, γ-coronavirus IBV, and δ-coronavirus PdCoV all belong to the two-layer subtype consisting of two layers β-sheets, although with structural variations in different viruses. These results further confirmed the previous phylogenetic analysis suggesting that HKU2 and SADS-CoV probably resulted from a recombination of an α-coronavirus genomic backbone with an unrecognized β-coronavirus spike gene8,11,59.
In contrast with the NTD and CTD having unique structural features, the SD1 and SD2 of the S1 subunit and the S2 subunits of HKU2 and SADS-CoV are structurally conserved as those of other coronaviruses. In the evolutionary aspect, it is not surprising because this region either connects the CTD to the S2 subunit (SD1 and SD2) or mediates the membrane fusion (S2 subunits), whereas the NTD and CTD are key factors determining tissue tropism and host range of coronaviruses23. Even highly conserved in overall structure, the S2 subunit in HKU2 and SADS-CoV still have a secondary structure arrangement in the CR after the FP, resulting in a more buried S2′ cleavage site (Fig. 6c). It indicates that although the membrane fusion mechanism is highly conserved, the dynamic fusion procedures of HKU2 and SADS-CoV may still have their unique features that need to be addressed in the future.
The ongoing COVID-19 pandemic caused by SARS-CoV-2 has raised worldwide alert4–6. We compared the domain structures of HKU2 and SARS-CoV-2 S proteins (Supplementary Fig. 9). The NTD of SARS-CoV-2 has the two conserved disulfide bonds in subtype I; the CTD core of SARS-CoV-2 has one layer of β-sheets; the SD1 and SD2 of SARS-CoV-2 have the same disulfide bonds with HKU2, although the additional helix of SD2 is not built in the SARS-CoV-2 S model owing to poor density in this region. SARS-CoV-2 has the typical β-coronavirus structural features in each domain, and the inserted RBM in the CTD could be the major impetus for cross-species transmission from bats to human.
In comparing the structures of HKU2 and SADS-CoV spikes with other coronavirus spikes, we observed that additional NTDs could be recombined, ceiling loops could be inserted into NTD core, and extra domain containing RBM could be inserted into the CTD core structure. These phenomena indicate the subdomains are gradually recruited into the S1 subunit during evolution, and the recruitments are required for cross-species transmission, adapting to different host range, and responding to the updating of host immune system, which provides a vivid example for the co-evolution of virus and host.
Methods
Expression and purification of HKU2 and SADS-CoV spike ectodomains
The cDNAs encoding HKU2 spike (YP_001552236) and SADS-CoV spike (AVM41569.1) were synthesized with codons optimized for insect cell expression. HKU2 ectodomains (1–1066) and SADS-CoV ectodomains (1–1068) were separately cloned into pFastBac-Dual vector (Invitrogen) with C-terminal foldon tag for trimerization and Strep tag for purification in the frame of pH promoter. They were expressed in Hi5 insect cell using Bac-to-Bac baculovirus system (Invitrogen). Briefly, the construct was transformed into DH10Bac competent cells and the extracted bacmid was transfected into Sf9 cell using Cellfectin II reagent (Invitrogen). The baculoviruses were harvested after 7–9 days. The high-titer viruses were generated after one more amplification that used to infect Hi5 cells at a density of 1.5 × 106/ml. After 60 h in culture, the cell medium containing spike ectodomains were concentrated and exchanged to binding buffer (50 mM Tris, pH 8.0, 150 mM NaCl). The spike ectodomains were purified by StrepTactin beads (IBA) and then purified by gel-filtration chromatography using Superose 6 gel filtration column (GE Healthcare) pre-equilibrated with HBS buffer (10 mM HEPES pH 7.2, 150 mM NaCl). Purified ectodomains were concentrated for electron microscopy analysis.
Cryo-electron microscopy
Aliquots of spike ectodomains (4 μl, 0.33 mg/ml, in buffer containing 10 mM HEPES pH 7.2, 150 mM NaCl) were applied to glow-discharged holey carbon grids (Quantifoil grid, Au 300 mesh, R1.2/1.3). The grids were then blotted and then plunge-frozen in liquid ethane using FEI Vitrobot system (FEI).
Images were recorded using FEI Titan Krios microscope operating at 300 kV with a Gatan K2 Summit direct electron detector (Gatan Inc.) at Tsinghua University. The automated software (AutoEMation) was used to collect 7663 movies for HKU2 and 4568 movies for SADS-CoV at ×130,000 magnification and at a defocus range between 1 and 3 μm. Each movie has a total accumulate exposure of 49.784 e−/Å2 fractionated in 32 frames of 175 ms exposure. Data collection statistics are summarized in Table 1.
Whole frames in each movie were corrected for beam-induced motion using MotionCo260. The final image was bin averaged to give a pixel size of 1.061 Å. The parameters of contrast transfer function (CTF) was estimated for each micrograph using GCTF61. Particles were automatically picked using Gautomatch (http://www.mrc-lmb.cam.ac.uk/kzhang/) and extracted using RELION62. Initially, ~1,400,000 particles for HKU2 and ~900,000 particles for SADS were subjected to 2D classification. After two or three additional 2D classifications, the best class consisting ~750,000 particles (HKU2) and ~320,000 particles (SADS-CoV) were applied for creating 3D initial model, 3D refinement, and 3D classification. A total of 421,490 particles (HKU2) and 152,334 particles (SADS-CoV) of best classes were selected and subjected to 3D refinement with C3 symmetry to generate density map. The reported resolutions based on the gold-standard FSC cutoff of 0.143 criterion were 2.38 Å for HKU2 spike and 2.83 Å for SADS-CoV spike after RELION post processing. Local resolution variations were estimated using ResMap63. Data processing statistics are summarized in Table 1.
Model building and refinement
As for the HKU2 spike model building, the initial model of HKU2 S1 NTD was generated using the SWISS-MODEL64 and fit into the map using UCSF Chimera65. The other part of HKU2 spike ectodomains was obtained using Map to Model in PHENIX suit66. As for the SADS-CoV spike model building, the initial model was obtained by fit HKU2 S2, NTD, and CTD separately into the map using UCSF Chimera65. Manual model rebuilding was carried out using Coot67 and refined with PHENIX real-space refinement66. The quality of the final model was analyzed with Molprobity68 and EMRinger69. The validation statistics of the structural models are summarized in Table 1.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank the Tsinghua University Branch of China National Center for Protein Sciences (Beijing) for the cryo-EM facility and the computational facility support, and L. Zhao, X. Li, J. Wen, H. Zhou, Y. Wang, A. Jia, and S. Zhang for technical support. This work was supported by funds from Beijing Advanced Innovation Center for Structural Biology at Tsinghua University and the National Key Plan for Scientific Research and Development of China (grant number 2016YFD0500307).
Author contributions
J.Y. carried out protein expression, purification, electron microscopy sample preparation, data collection, image processing, and model building with the help of S.Q. Author X.W. conceived, designed, and directed the study. X.W., J.Y., and R.G analyzed the structure, made the figures, and wrote the manuscript.
Data availability
The atomic coordinates of HKU2 spike and SADS-CoV spike have been deposited in the Worldwide Protein Data Bank with the accession codes 6M15 and 6M16, respectively; the corresponding maps have been deposited in the Electron Microscopy Data Bank with the accession codes EMD-30037 and EMD-30038, respectively.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Jean K Millet and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-020-16876-4.
References
- 1.Cui J, Li F, Shi ZL. Origin and evolution of pathogenic coronaviruses. Nat. Rev. Microbiol. 2019;17:181–192. doi: 10.1038/s41579-018-0118-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Fehr AR, Perlman S. Coronaviruses: an overview of their replication and pathogenesis. Methods Mol. Biol. 2015;1282:1–23. doi: 10.1007/978-1-4939-2438-7_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Su S, et al. Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends Microbiol. 2016;24:490–502. doi: 10.1016/j.tim.2016.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wu F, et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhou P, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med.382, 727–733 (2020). [DOI] [PMC free article] [PubMed]
- 7.Wang Q, Vlasova AN, Kenney SP, Saif LJ. Emerging and re-emerging coronaviruses in pigs. Curr. Opin. Virol. 2019;34:39–49. doi: 10.1016/j.coviro.2018.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Fu X, et al. Newly emerged porcine enteric alphacoronavirus in southern China: Identification, origin and evolutionary history analysis. Infect. Genet. Evol. 2018;62:179–187. doi: 10.1016/j.meegid.2018.04.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Xu Z, et al. Isolation and characterization of a highly pathogenic strain of porcine enteric alphacoronavirus causing watery diarrhoea and high mortality in newborn piglets. Transbound. Emerg. Dis. 2019;66:119–130. doi: 10.1111/tbed.12992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhou P, et al. Fatal swine acute diarrhoea syndrome caused by an HKU2-related coronavirus of bat origin. Nature. 2018;556:255–258. doi: 10.1038/s41586-018-0010-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pan Y, et al. Discovery of a novel swine enteric alphacoronavirus (SeACoV) in southern China. Vet. Microbiol. 2017;211:15–21. doi: 10.1016/j.vetmic.2017.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gong L, et al. A new bat-HKU2-like coronavirus in swine, China, 2017. Emerg. Infect. Dis. 2017;23:1607–1609. doi: 10.3201/eid2309.170915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wang L, Su S, Bi Y, Wong G, Gao GF. Bat-origin coronaviruses expand their host range to pigs. Trends Microbiol. 2018;26:466–470. doi: 10.1016/j.tim.2018.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Parrish CR, et al. Cross-species virus transmission and the emergence of new epidemic diseases. Microbiol. Mol. Biol. Rev. 2008;72:457–470. doi: 10.1128/MMBR.00004-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lu G, Wang Q, Gao GF. Bat-to-human: spike features determining ‘host jump’ of coronaviruses SARS-CoV, MERS-CoV, and beyond. Trends Microbiol. 2015;23:468–478. doi: 10.1016/j.tim.2015.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Peck KM, Burch CL, Heise MT, Baric RS. Coronavirus host range expansion and middle east respiratory syndrome coronavirus emergence: biochemical mechanisms and evolutionary perspectives. Annu. Rev. Virol. 2015;2:95–117. doi: 10.1146/annurev-virology-100114-055029. [DOI] [PubMed] [Google Scholar]
- 17.Woo PC, Lau SK, Huang Y, Yuen KY. Coronavirus diversity, phylogeny and interspecies jumping. Exp. Biol. Med. 2009;234:1117–1127. doi: 10.3181/0903-MR-94. [DOI] [PubMed] [Google Scholar]
- 18.Woo PC, et al. Discovery of seven novel mammalian and avian coronaviruses in the genus Deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of Gammacoronavirus and Deltacoronavirus. J. Virol. 2012;86:3995–4008. doi: 10.1128/JVI.06540-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fan Y, Zhao K, Shi ZL, Zhou P. Bat coronaviruses in China. Viruses. 2019;11:210. doi: 10.3390/v11030210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Yang YL, et al. Broad cross-species infection of cultured cells by bat HKU2-related swine acute diarrhea syndrome coronavirus and identification of its replication in murine dendritic cells in vivo highlight its potential for diverse interspecies transmission. J. Virol. 2019;93:e01448–19. doi: 10.1128/JVI.01448-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Li F. Structure, function, and evolution of coronavirus spike proteins. Annu. Rev. Virol. 2016;3:237–261. doi: 10.1146/annurev-virology-110615-042301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Tortorici MA, Veesler D. Structural insights into coronavirus entry. Adv. Virus Res. 2019;105:93–116. doi: 10.1016/bs.aivir.2019.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Li F. Receptor recognition mechanisms of coronaviruses: a decade of structural studies. J. Virol. 2015;89:1954–1964. doi: 10.1128/JVI.02615-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Song W, Gui M, Wang X, Xiang Y. Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLoS Pathog. 2018;14:e1007236. doi: 10.1371/journal.ppat.1007236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wu K, Li W, Peng G, Li F. Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor. Proc. Natl Acad. Sci. USA. 2009;106:19970–19974. doi: 10.1073/pnas.0908837106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Li F, Li W, Farzan M, Harrison SC. Structure of SARS coronavirus spike receptor-binding domain complexed with receptor. Science. 2005;309:1864–1868. doi: 10.1126/science.1116480. [DOI] [PubMed] [Google Scholar]
- 27.Lan J, et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature. 2020;581:215–220. doi: 10.1038/s41586-020-2180-5. [DOI] [PubMed] [Google Scholar]
- 28.Shang J, et al. Structural basis of receptor recognition by SARS-CoV-2. Nature. 2020;581:221–224. doi: 10.1038/s41586-020-2179-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wang Q, et al. Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2. Cell. 2020;181:894–904.e9. doi: 10.1016/j.cell.2020.03.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Yan R, et al. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020;367:1444–1448. doi: 10.1126/science.abb2762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang N, et al. Structure of MERS-CoV spike receptor-binding domain complexed with human receptor DPP4. Cell Res. 2013;23:986–993. doi: 10.1038/cr.2013.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lu G, et al. Molecular basis of binding between novel human coronavirus MERS-CoV and its receptor CD26. Nature. 2013;500:227–231. doi: 10.1038/nature12328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Reguera J, et al. Structural bases of coronavirus attachment to host aminopeptidase N and its inhibition by neutralizing antibodies. PLoS Pathog. 2012;8:e1002859. doi: 10.1371/journal.ppat.1002859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Li Z, et al. The human coronavirus HCoV-229E S-protein structure and receptor binding. Elife. 2019;8:e51230. doi: 10.7554/eLife.51230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wong AHM, et al. Receptor-binding loops in alphacoronavirus adaptation and evolution. Nat. Commun. 2017;8:1735. doi: 10.1038/s41467-017-01706-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Tortorici MA, et al. Structural basis for human coronavirus attachment to sialic acid receptors. Nat. Struct. Mol. Biol. 2019;26:481–489. doi: 10.1038/s41594-019-0233-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Peng G, et al. Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor. Proc. Natl Acad. Sci. USA. 2011;108:10696–10701. doi: 10.1073/pnas.1104306108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Shang J, et al. Structure of mouse coronavirus spike protein complexed with receptor reveals mechanism for viral entry. PLoS Pathog. 2020;16:e1008392. doi: 10.1371/journal.ppat.1008392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Walls AC, et al. Glycan shield and epitope masking of a coronavirus spike protein observed by cryo-electron microscopy. Nat. Struct. Mol. Biol. 2016;23:899–905. doi: 10.1038/nsmb.3293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wrapp D, McLellan JS. The 3.1-Angstrom Cryo-electron Microscopy Structure of the Porcine Epidemic Diarrhea Virus Spike Protein in the Prefusion Conformation. J. Virol. 2019;93:e00923–19. doi: 10.1128/JVI.00923-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Yang TJ, et al. Cryo-EM analysis of a feline coronavirus spike protein reveals a unique structure and camouflaging glycans. Proc. Natl Acad. Sci. USA. 2020;117:1438–1446. doi: 10.1073/pnas.1908898117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Walls AC, et al. Cryo-electron microscopy structure of a coronavirus spike glycoprotein trimer. Nature. 2016;531:114–117. doi: 10.1038/nature16988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Walls AC, et al. Tectonic conformational changes of a coronavirus spike glycoprotein promote membrane fusion. Proc. Natl Acad. Sci. USA. 2017;114:11157–11162. doi: 10.1073/pnas.1708727114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kirchdoerfer RN, et al. Pre-fusion structure of a human coronavirus spike protein. Nature. 2016;531:118–121. doi: 10.1038/nature17200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yuan Y, et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nat. Commun. 2017;8:15092. doi: 10.1038/ncomms15092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Gui M, et al. Cryo-electron microscopy structures of the SARS-CoV spike glycoprotein reveal a prerequisite conformational state for receptor binding. Cell Res. 2017;27:119–129. doi: 10.1038/cr.2016.152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kirchdoerfer RN, et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Sci. Rep. 2018;8:15701. doi: 10.1038/s41598-018-34171-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Pallesen J, et al. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proc. Natl Acad. Sci. USA. 2017;114:E7348–E7357. doi: 10.1073/pnas.1707304114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Park YJ, et al. Structures of MERS-CoV spike glycoprotein in complex with sialoside attachment receptors. Nat. Struct. Mol. Biol. 2019;26:1151–1157. doi: 10.1038/s41594-019-0334-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Walls AC, et al. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020;181:281–292 e286. doi: 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Wrapp D, et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367:1260–1263. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Shang J, et al. Cryo-EM structure of infectious bronchitis coronavirus spike protein reveals structural and functional evolution of coronavirus spike proteins. PLoS Pathog. 2018;14:e1007009. doi: 10.1371/journal.ppat.1007009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Shang J, et al. Cryo-Electron Microscopy Structure of Porcine Deltacoronavirus Spike Protein in the Prefusion State. J Virol. 2018;92:e01556–17. doi: 10.1128/JVI.01556-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Xiong X, et al. Glycan Shield and Fusion Activation of a Deltacoronavirus Spike Glycoprotein Fine Tuned for Enteric Infections. J. Virol. 2018;92:e01628–17. doi: 10.1128/JVI.01628-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wang Q, et al. Bat origins of MERS-CoV supported by bat coronavirus HKU4 usage of human receptor CD26. Cell Host Microbe. 2014;16:328–337. doi: 10.1016/j.chom.2014.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Han X, et al. Structure of the S1 subunit C-terminal domain from bat-derived coronavirus HKU5 spike protein. Virology. 2017;507:101–109. doi: 10.1016/j.virol.2017.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Huang C, et al. Putative receptor binding domain of bat-derived coronavirus HKU9 spike protein: evolution of betacoronavirus receptor binding motifs. Biochemistry. 2016;55:5977–5988. doi: 10.1021/acs.biochem.6b00790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Dong BQ, et al. Detection of a novel and highly divergent coronavirus from asian leopard cats and Chinese ferret badgers in Southern China. J. Virol. 2007;81:6920–6926. doi: 10.1128/JVI.00299-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lau SK, et al. Complete genome sequence of bat coronavirus HKU2 from Chinese horseshoe bats revealed a much smaller spike gene with a different evolutionary lineage from the rest of the genome. Virology. 2007;367:428–439. doi: 10.1016/j.virol.2007.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Zheng SQ, et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods. 2017;14:331–332. doi: 10.1038/nmeth.4193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Zhang K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 2016;193:1–12. doi: 10.1016/j.jsb.2015.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Zivanov J, et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. Elife. 2018;7:e42166. doi: 10.7554/eLife.42166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kucukelbir A, Sigworth FJ, Tagare HD. Quantifying the local resolution of cryo-EM density maps. Nat. Methods. 2014;11:63–65. doi: 10.1038/nmeth.2727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Waterhouse A, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Pettersen EF, et al. UCSF chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 66.Adams PD, et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 2010;66:213–221. doi: 10.1107/S0907444909052925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Emsley P, Cowtan K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 2004;60:2126–2132. doi: 10.1107/S0907444904019158. [DOI] [PubMed] [Google Scholar]
- 68.Chen VB, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 2010;66:12–21. doi: 10.1107/S0907444909042073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Barad BA, et al. EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat. Methods. 2015;12:943–946. doi: 10.1038/nmeth.3541. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The atomic coordinates of HKU2 spike and SADS-CoV spike have been deposited in the Worldwide Protein Data Bank with the accession codes 6M15 and 6M16, respectively; the corresponding maps have been deposited in the Electron Microscopy Data Bank with the accession codes EMD-30037 and EMD-30038, respectively.