Insights into SARS-CoV transcription and replication from the structure of the nsp7–nsp8 hexadecamer

Yujia Zhai; Fei Sun; Xuemei Li; Hai Pang; Xiaoling Xu; Mark Bartlam; Zihe Rao

Nat Struct Mol Biol. 2005 Oct 16;12(11):980–986. doi: 10.1038/nsmb999

PMCID: PMC7096913 PMID: 16228002

Abstract

Coronavirus replication and transcription machinery involves multiple virus-encoded nonstructural proteins (nsp). We report the crystal structure of the hexadecameric nsp7–nsp8 supercomplex from the severe acute respiratory syndrome coronavirus at 2.4-Å resolution. nsp8 has a novel 'golf-club' fold with two conformations. The supercomplex is a unique hollow, cylinder-like structure assembled from eight copies of nsp8 and held tightly together by eight copies of nsp7. With an internal diameter of ∼30 Å, the central channel has dimensions and positive electrostatic properties favorable for nucleic acid binding, implying that its role is to confer processivity on RNA-dependent RNA polymerase.

Supplementary information

The online version of this article (doi:10.1038/nsmb999) contains supplementary material, which is available to authorized users.

Main

Coronaviruses are enveloped positive-stranded RNA viruses with the largest currently known RNA genomes. Expression of their genomes begins with the translation of two large replicase polyproteins, pp1a (>4,000 residues) and pp1ab (>7,000 residues), which are encoded by the viral replicase gene that comprises open reading frame 1a (orf1a) and orf1b¹. pp1a and pp1ab are extensively processed by orf1a-encoded proteases to yield 15 or 16 mature nonstructural (replicase) proteins that assemble to form the membrane-associated viral replication and transcription machinery, which is vital to the viral life cycle². Together with a number of cellular factors, this machinery synthesizes not only genome-sized RNA but also a nested set of eight subgenomic mRNAs. These subgenomic mRNAs are predicted to express all ORFs downstream of orf1b, encoding a variety of structural and accessory proteins^3,4,5. Knowledge of the structure and organization of coronavirus replication and transcription machinery at the molecular level is limited. However, the extraordinary size of the coronavirus replicative polyproteins, their generally large phylogenetic distance from those of other RNA viruses and the presence of several predicted RNA-processing activities that are not found in other positive-stranded RNA viruses suggest that this machinery is of unparalleled complexity^3,6,7.

Recently, many studies of coronaviruses have focused on a newly identified coronavirus: severe acute respiratory syndrome coronavirus (SARS-CoV)^8,9,10,11,12, the etiological agent responsible for the 2003 global SARS outbreak^13,14,15. Its genome is ∼29.7 kilobases long (excluding the 3′ poly(A) tail) and is predicted to contain 14 functional ORFs^3,4,5, encoding 16 replicase proteins, 4 structural proteins and 8 accessory proteins³. The replicase proteins are expected to have multiple enzymatic activities, and some of these have been ascertained experimentally⁶. The activities of a papain-like protease (PL2^pro, also known as nsp3), a main protease (M^pro, also known as 3CL^pro or nsp5), a single-stranded (ss) RNA–binding protein (nsp9), an RNA-dependent RNA polymerase (RdRp, also known as nsp12), a superfamily 1–like helicase (HEL1, also known as nsp13) and a uridylate-specific endoribonuclease (NendoU, also known as nsp15) were recently established and characterized^3,6,16,17,18,19,20. In addition, nsp3, nsp14 and nsp16 were predicted to have ADP-ribose 1′-phosphatase, 3′ → 5′ exonuclease and 2′-O-ribose methyltransferase domains, respectively⁶. However, no functions have been definitively assigned to the other replicase proteins.

We present here the crystal structure of the hexadecameric nsp7–nsp8 supercomplex of SARS-CoV at 2.4-Å resolution. To our knowledge, it is the first structure to show interactions between coronavirus replicase proteins, and it offers a glimpse of the sophisticated architecture of the coronavirus replication and transcription machinery at the atomic level. Our experiments suggest that the supercomplex could encircle RNA and could function as a general processivity factor for RdRp (nsp12). The structure should help in understanding the replication and transcription mechanisms of SARS-CoV and other coronaviruses such as mouse hepatitis virus (MHV), human coronavirus strain 229E (HCoV-229E) and the recently reported human coronavirus strain HKU1 (HCoV-HKU1)²¹.

Note: Supplementary information is available on the Nature Structural & Molecular Biology website.

Results

Structural overview

The structure of the hexadecameric nsp7–nsp8 supercomplex resembles a hollow cylinder with a central channel and two handles protruding from opposite sides (Fig. 1). The cylinder has a height of ∼90 Å, an internal diameter of ∼30 Å and an external diameter of ∼95 Å (∼120 Å if the two handles are included). Each asymmetric unit contains four nsp7 molecules (chains A–D) interacting with four nsp8 molecules (chains E–H), and the whole hexadecamer comprises two asymmetric units related by the crystallographic two-fold axis along c. This, together with two other pseudo two-fold axes parallel to the a-axis and b-axis, endows the structure with high symmetry. Along the b-axis, the supercomplexes are packed together to form a channel.
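
The dimensions quoted above can be turned into a few derived quantities. The following is a minimal sketch, assuming the supercomplex is idealized as a right circular hollow cylinder and that the two handles protrude symmetrically; neither assumption is stated in the text, and the function names are illustrative only.

```python
import math

# Approximate dimensions quoted in the text (in angstroms).
HEIGHT = 90.0
INNER_DIAMETER = 30.0
OUTER_DIAMETER = 95.0
OUTER_DIAMETER_WITH_HANDLES = 120.0


def wall_thickness(inner: float = INNER_DIAMETER,
                   outer: float = OUTER_DIAMETER) -> float:
    """Radial thickness of the protein wall of an idealized hollow cylinder."""
    return (outer - inner) / 2.0


def handle_protrusion(outer: float = OUTER_DIAMETER,
                      with_handles: float = OUTER_DIAMETER_WITH_HANDLES) -> float:
    """Protrusion of each handle, assuming the two handles are symmetric."""
    return (with_handles - outer) / 2.0


def channel_cross_section(inner: float = INNER_DIAMETER) -> float:
    """Cross-sectional area of the central channel (square angstroms)."""
    return math.pi * (inner / 2.0) ** 2


def radial_clearance(duplex_diameter: float,
                     inner: float = INNER_DIAMETER) -> float:
    """Gap between a coaxial duplex of the given diameter and the channel wall."""
    return (inner - duplex_diameter) / 2.0


if __name__ == "__main__":
    print(f"wall thickness:        {wall_thickness():.1f} A")
    print(f"handle protrusion:     {handle_protrusion():.1f} A")
    print(f"channel cross-section: {channel_cross_section():.0f} A^2")
    # 20 A is an illustrative duplex diameter (roughly that of B-form DNA);
    # it is an assumption for this example, not a value taken from the text.
    print(f"clearance for 20 A duplex: {radial_clearance(20.0):.1f} A")
```

Under these assumptions the protein wall is roughly as thick as the channel is wide, which fits the description of a cylinder rather than a thin ring.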

Figure 1. (a) Overall structure of the nsp7–nsp8 hexadecameric supercomplex. nsp7, nsp8I and nsp8II are colored green, blue and gold, respectively. All diagrams for ribbons, sticks and balls were generated by BobScript 2.6b⁴⁰. (b,c) Sequence alignment of coronavirus proteins homologous to SARS-CoV nsp7 (NP_828865) and nsp8 (NP_828866*): HCoV-229E (NP_835348, NP_835349*), transmissible gastroenteritis virus (TGEV; NP_840005, NP_840006*), porcine epidemic diarrhea virus (PEDV; NP_839961, NP_839962*), bovine coronavirus (BCoV; NP_742134, NP_742135*), murine hepatitis virus strain A59 (MHV; NP_740612, NP_740613*) and avian infectious bronchitis virus (AIBV; NP_740625, NP_740626*). Accession codes in parentheses are for GenBank; asterisks indicate nsp8 homologs. Residues boxed in red are completely conserved and those in yellow have a conservation of >70%. Residues marked by solid circles (involved in forming hydrogen bonds) and triangles (involved in hydrophobic interactions) are responsible for interactions between nsp7 and nsp8. Magenta marks residues at site 1, black marks site 2 and empty circles mark residues surrounding the channel. Secondary structure elements are labeled according to the structures of SARS-CoV nsp7 and nsp8. The alignment was generated by ClustalW 1.7 (ref. 41) and colored by ESPript 2.1 (ref. 42).

Structure of nsp7

nsp7 is an all–α-helical protein (Fig. 2a). Its central core is an N-terminal helical bundle (HB), with helices HB1, HB2 and HB3 (residues 5–26, 30–47 and 49–68, respectively), forming a triple-stranded antiparallel coiled coil with a right-handed superhelical pitch. A search in DALI²² did not identify any similar structures. The HB regions of four nsp7 monomers in one asymmetric unit superimpose well (r.m.s. deviation < 0.8 Å), but the short helix HCT (residues 70–78) retains some mobility (Supplementary Fig. 1 online). HB1–3 interact with one another mainly through hydrophobic residues. Analysis of the sequence conservation among known coronaviruses shows that the HB region is more conserved than the HCT region, an observation presumably related to the HB's role in interacting with nsp8 (Fig. 1b).

Figure 2. (a–c) Ribbon representations of SARS-CoV nsp7 (a), nsp8I (b) and nsp8II (c). Secondary structure elements are labeled as in Figure 1b,c.

The 'golf-club' fold of nsp8

The four nsp8 monomers in one asymmetric unit adopt two markedly different conformations: nsp8I (chains G and H) has a 'golf club'–like structure composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain (residues 6–104 and 105–196, respectively; Fig. 2b). The shaft domain contains three helices (NH1–3), one of which (NH3) is very long. Another three α-helices (CH1–3) and seven β-strands (β1–7) form the head domain, which has an α/β fold. The seven β-strands form an open β-barrel with two antiparallel β-sheets packed orthogonally. More than half the residues in the C-terminal domain are hydrophobic, and the whole domain forms a tight hydrophobic core.

nsp8II (chains E and F) resembles a golf club with a bent shaft (Fig. 2c). Although its head domain is similar to that of nsp8I (r.m.s. deviation < 0.5 Å), the shaft helix NH3 bends into two shorter helices, NH3α and NH3β, linked by a coil, C3 (Supplementary Fig. 1). Residues before Leu43 on chain E and residues before Asp55 on chain F could not be assigned from the electron density map. SDS-PAGE analysis of the supercomplex crystals showed that their absence was not due to protein degradation or cleavage; thus, the flexibility of the peptide resulting from crystal packing might be the reason. The presence of two conformations of nsp8 agrees with the results of PONDR analysis²³, which strongly suggests that residues 43–84 of nsp8 are disordered.

Multiple sequence alignment of corresponding coronavirus nsp8 proteins reveals high conservation, with the N-terminal domain more conserved than the C-terminal domain (Fig. 1c). This suggests that the N-terminal domain might have an important role in interaction with other molecules and complex assembly. No similar structures were identified from the DALI server²².

Interactions between nsp7 and nsp8

Both nsp8I and nsp8II interact tightly with nsp7 to give a buried surface area of ∼1,400 Å² (∼26% of the whole surface area of nsp7), forming two types of heterodimers, D1 and D2, respectively. nsp8I and nsp8II interact with nsp7 via the same sites (sites 1 and 2, Figs. 1 and 3 and Supplementary Fig. 2 online), suggesting that the conformational change of nsp8 is not induced by nsp7 binding. Site 1 is situated in the C-terminal region of the nsp8 shaft domain. Residues on the NH3/NH3β helix of nsp8 (Met92, Met95, Leu96, Met99 and Leu103) and residues on the HB1 helix of nsp7 (Val11, Cys13, Val17 and Val21) form a hydrophobic core. An additional hydrogen bond is formed between the side chains of nsp8 Thr89 and nsp7 Gln24. Site 2 is located in the CH1 helix of nsp8. Helices HB3 and HCT of nsp7 interact with CH1 of nsp8 at site 2. More specifically, the side chains of residues on CH1 of nsp8 (Phe97, Leu100, Leu108, Ile111, Ile112 and Ala115) are involved in hydrophobic interactions with Met57, Val58, Leu61, Leu64, Leu65 and Ile73 of nsp7. The side chain and the main chain carbonyl group of nsp8 Arg116 form hydrogen bonds with the main chain carbonyl group of nsp7 Cys77 and the side chain of nsp7 Asn74, respectively. Furthermore, another hydrogen bond is formed between the main chain amide group of nsp8 Ile125 and the side chain of nsp7 Ser62.

Figure 3. (a) The interaction sites between nsp7 and nsp8I/II. Green, nsp7; blue, nsp8. (b) T1 and T2 heterotetramer formation. The two dimers from one tetramer are illustrated by the surface representation and ribbon diagram, respectively. Residues: green, polar; yellow, hydrophobic; red, acidic; blue, basic. Ribbons: green, nsp7; blue, nsp8I; orange, nsp8II. (c) Possible pathway for assembly of the complex by heterotetramers T1 and T2. (d) Hexadecameric supercomplex construction with 'bricks' of nsp8 and 'mortar' of nsp7. The angle between nsp8I and nsp8II is labeled in magenta. In c and d, nsp7 molecules interacting with nsp8I and nsp8II are colored yellow-green and blue-green, respectively.

Assembly and architecture of the supercomplex

The D1 and D2 heterodimers can further dimerize to form heterotetramers T1 and T2, with a total buried surface area of ∼1,500 Å² and 1,700 Å², respectively (Fig. 3b). The legs (L1 and L2) of T1 are clamped by opposite regions (R1 and R2, respectively) of two T2s, with a buried surface area of ∼2,000 Å². The interactions link two T1s and two T2s together in the order T1-T2-T1′-T2′(-T1) (primes distinguish identical tetramers and parenthetical interaction indicates closing of a ring of four tetramers), enabling the full construction of the hexadecameric supercomplex (Fig. 3c).

The four monomers of nsp8II are oriented approximately perpendicular to those of nsp8I. Such an arrangement constitutes the framework of the supercomplex (Fig. 3d). Helices NH1 and NH2 and the N-terminal region of NH3 from nsp8I (residues 6–57) form the handles. The central channel is created by the middle parts (residues 60–82) of four NH3 helices and is surrounded by NH3α helices, with β1 and β2 strands situated on either side (Fig. 1a). In the preliminary framework, nsp7 interacts extensively with nsp8 to make this configuration more compact and stable (Fig. 3d). Residues 24–36 between HB1 and HB2 of nsp7 also participate in the formation of the channel. nsp8I and nsp8II are equally important for the assembly of the supercomplex. The spatial arrangement of the 16 monomers becomes possible because the long α-helix NH3 of nsp8I bends into NH3α and NH3β in nsp8II. It is notable that most nsp8 residues around the channel are highly conserved among coronaviruses, suggesting that they have biological significance (Fig. 1c).

The architecture of the hexadecameric nsp7–nsp8 supercomplex is unique among macromolecular complexes reported so far that contain two kinds of protein (or subunits). In other structures, homologous multimers of one type always stack on those of another type as layers with high degrees of symmetry. The absence of any one kind of protein markedly affects the global shape of the complex. Two such examples are the structures of the multienzyme complex Rubisco²⁴ and the GroEL–GroES–ATP complex²⁵. In contrast, the relationship between nsp7 and nsp8 multimers in the supercomplex is not stacking but cross-linking. nsp8I and nsp8II constitute the framework of the supercomplex as 'bricks,' and nsp7 stabilizes and fills this configuration as 'mortar.' Loss of nsp7 should not markedly change the shape of the structure (Fig. 3d). Thus, we conclude that the nsp7–nsp8 hexadecamer demonstrates a new mode of protein architecture in large macromolecular complexes. The relationships between the 16 monomers are shown in Supplementary Figure 3 online. Overall, each monomer of nsp7, nsp8I or nsp8II interacts with other neighboring monomers to give an average buried surface area of ∼2,000 Å², 3,200 Å² or 3,500 Å², respectively, which accounts for 37%, 31.5% or 23.5% of its total solvent-accessible surface area, respectively.
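
The buried areas and percentages quoted above imply a total solvent-accessible surface area for each monomer type. The sketch below simply divides one by the other; it uses the rounded values from the text, so the results are approximate, and the function and dictionary names are illustrative.

```python
# Buried surface area per monomer (square angstroms) and the fraction of the
# monomer's total solvent-accessible surface it represents, as quoted in the text.
BURIED = {
    "nsp7": (2000.0, 0.37),
    "nsp8I": (3200.0, 0.315),
    "nsp8II": (3500.0, 0.235),
}


def implied_total_sasa(buried_area: float, buried_fraction: float) -> float:
    """Total solvent-accessible surface area implied by area / fraction."""
    if not 0.0 < buried_fraction <= 1.0:
        raise ValueError("buried_fraction must be in (0, 1]")
    return buried_area / buried_fraction


if __name__ == "__main__":
    for name, (area, fraction) in BURIED.items():
        total = implied_total_sasa(area, fraction)
        print(f"{name}: ~{total:,.0f} A^2 total accessible surface")
    # Cross-check with the nsp7-nsp8 heterodimer interface quoted earlier
    # (~1,400 A^2, ~26% of the nsp7 surface).
    print(f"nsp7 from dimer interface: ~{implied_total_sasa(1400.0, 0.26):,.0f} A^2")
```

The two independent estimates for nsp7 (from the full hexadecamer and from the heterodimer interface) agree to within about 1%, which is a simple consistency check on the quoted figures.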

The cross-linking behavior of the mixture of nsp7 and nsp8 further supports the assembly mode of the supercomplex. In cross-linking experiments, both the nsp7–nsp8 heteromultimer and the nsp8 framework were detected (Supplementary Fig. 4 online). To examine whether the hexadecamer is the natural state of the nsp7–nsp8 complex in solution, we used negative-staining electron microscopy to obtain two-dimensional average images of the supercomplex. The images show particles with similar dimensions to the hexadecamer, indicating that the architecture of the nsp7–nsp8 supercomplex crystal structure also exists in solution (Supplementary Fig. 5 online).

Interaction with dsRNA in an encircling mode

The electrostatic properties and dimensions of the nsp7–nsp8 supercomplex imply that its role is to bind nucleic acids. The inner channel is coated by positive potential, whereas the outer surface of the cylinder is mainly covered by negative potential (Fig. 4a). This bipartite charge distribution ensures that the phosphate backbone of nucleic acids can pass through the channel without electrostatic repulsions, as with other DNA/RNA-binding proteins such as PCNA and the β subunit^26,27. Furthermore, the central channel of the supercomplex has an average internal diameter of ∼30 Å and can suitably accommodate duplex DNA/RNA (Figs. 1 and 4).

Figure 4. (a) The electrostatic potential surface of the hexadecamer modeled with (right) and without (left) duplex RNA in the positive channel. Blue, positive charge (+10 k_BT); red, negative charge (−10 k_BT). (b) Model of the hexadecameric nsp7–nsp8 supercomplex with hypothetical duplex RNA. Left, top view, showing the channel's proper dimensions to accommodate dsRNA. Right, side view, showing a possible mode of interaction where the four nsp8II NH3 helices insert into the dsRNA grooves. (c,d) Results of EMSAs of nsp7, nsp8 and their mutants. Each lane contains 75 pmol dsRNA (c) or dsDNA (d). Lanes 1–7 contain 650 pmol of each protein; lanes 8–13 contain 90 pmol of hexadecamer.

It is widely accepted that coronavirus replication occurs in the cytoplasm of infected cells and that no DNA is involved²⁸. A double-stranded (ds) RNA intermediate is required for genomic replication of all coronaviruses during the RNA synthesis process. The hollow cylindrical structure of the hexadecamer suggests that its function is to encircle and stabilize dsRNA, thus holding the nascent and template strands together to facilitate efficient replication and transcription.

On the basis of this presumption, we constructed a model with a dsRNA fragment inserted into the channel to analyze the possible mode of interaction between them (Fig. 4b). We found that the four long helices of nsp8I could insert into the grooves of dsRNA and the positively charged residues on these helices are conserved in all homologous coronavirus nsp8 proteins. To test whether these residues are related to nucleic acid binding, we identified several basic residues located around the channel (Supplementary Fig. 6 online) and designed three mutants: nsp7m (nsp7 R26A K32A), nsp8m1 (nsp8 K77A R80A) and nsp8m2 (nsp8 K63A R84A R85A). We then performed electrophoretic mobility shift assays (EMSAs) to examine the nucleic acid binding affinity of each mutant. The results showed that the nucleic acid binding affinities of nsp8m1 and nsp8m2 were much weaker than that of the wild-type protein, whereas mutations in nsp7 had no effect (Fig. 4c,d). With a calculated isoelectric point of 6.5, nsp8 should not bind nucleic acids by electrostatic interaction, unlike basic proteins. As nsp8 contains a total of 22 positively charged residues (lysines and arginines), the change in overall charge caused by the loss of two or three of these should not result in a marked change in affinity. Furthermore, the mutations did not affect supercomplex formation and stability, as we ascertained by gel filtration and crystallization experiments (data not shown). The only explanation is that the locations of these mutated residues make them essential for nucleic acid binding. As they are all located around the central channel, we conclude that the channel should encircle nucleic acid. In addition, the results of EMSAs showed that nsp8 mutants have higher affinity for dsRNA than for dsDNA, as they could still bind dsRNA but hardly bound dsDNA. This suggests that dsRNA is the likely natural binding partner of the nsp7–nsp8 supercomplex.
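
The charge argument above and the amounts given in the EMSA figure legend reduce to simple ratios. The following sketch works them out; it assumes that "90 pmol of hexadecamer" refers to assembled 16-subunit particles, which the legend does not state explicitly, and all names are illustrative.

```python
# Amounts per lane from the EMSA figure legend (pmol).
NUCLEIC_ACID_PMOL = 75.0
FREE_PROTEIN_PMOL = 650.0   # lanes 1-7
HEXADECAMER_PMOL = 90.0     # lanes 8-13 (assumed to count whole particles)

NSP8_PER_HEXADECAMER = 8
NSP8_BASIC_RESIDUES = 22    # lysines plus arginines, as stated in the text

# Mutants of nsp8 described in the text.
MUTANTS = {
    "nsp8m1": ("K77A", "R80A"),
    "nsp8m2": ("K63A", "R84A", "R85A"),
}


def molar_ratio(protein_pmol: float,
                nucleic_acid_pmol: float = NUCLEIC_ACID_PMOL) -> float:
    """Moles of protein (or particle) per mole of nucleic acid duplex."""
    return protein_pmol / nucleic_acid_pmol


def fraction_basic_removed(mutations: tuple,
                           total_basic: int = NSP8_BASIC_RESIDUES) -> float:
    """Fraction of the basic residues of nsp8 replaced by alanine."""
    return len(mutations) / total_basic


if __name__ == "__main__":
    print(f"free protein : duplex = {molar_ratio(FREE_PROTEIN_PMOL):.1f} : 1")
    print(f"hexadecamer : duplex  = {molar_ratio(HEXADECAMER_PMOL):.1f} : 1")
    nsp8_pmol = HEXADECAMER_PMOL * NSP8_PER_HEXADECAMER
    print(f"nsp8 chains in hexadecamer lanes: {nsp8_pmol:.0f} pmol")
    for name, mutations in MUTANTS.items():
        print(f"{name}: {fraction_basic_removed(mutations):.1%} of basic residues removed")
```

The mutants remove roughly a tenth of the basic residues of nsp8, which is the quantitative basis for arguing that position, rather than net charge, accounts for the loss of binding.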

Discussion

The high conservation of nsp7 and nsp8 in known coronaviruses suggests that the hexadecamer should be a general component for all coronaviruses. The electrostatic properties of the hexadecamer and the diameter of its central channel are similar to those of PCNA and the β subunit ring, the processivity factors of DNA polymerase^26,27, which encircle dsDNA and interact with the polymerase to confer high processivity on it²⁹. Coincidentally, experiments on MHV have shown that RdRp co-immunoprecipitates with nsp8, nsp9, nsp5 (main protease, also called M^pro or 3CL^pro) and nsp13 (helicase)³⁰, which also implies an interaction between RdRp and the nsp7–nsp8 hexadecamer. Their remarkably large genomes and putative proofreading activities suggest that coronaviruses may differ from other RNA viruses and share unprecedented similarities with DNA-based life forms in the mechanisms of genome biosynthesis⁶. The hexadecamer might be a factor that binds and trails RdRp, conferring high processivity on it for efficient replication of the extremely large coronavirus genome. Such a binding mode would give a molar ratio of 8:1 between nsp7–nsp8 and RdRp, which agrees with the natural abundance of orf1a-encoded nsps in the replication and transcription machinery (three- to five-fold in excess of orf1b-encoded nsps such as RdRp, helicase, exonuclease and others) resulting from a ribosomal frameshifting mechanism¹.

The colocalization of nsp8 with nsp7, nsp9 and nsp10 in experiments on MHV³¹ provides very strong evidence for their interaction in this virus. Analytical ultracentrifugation experiments further indicate that SARS-CoV nsp8 interacts with nsp9, a ssRNA-binding protein²⁰. In addition, the disorder of the nsp8 N-terminal region has been seen to decrease upon the addition of nsp9 to nsp8 (ref. 20). On the basis of the nsp7–nsp8 hexadecameric structure, the most probable nsp9-binding site should be in the region formed by the N-terminal 50 residues of nsp8II, which is located at the entrance of the channel and has high flexibility with missing electron density. Wrapping of ssRNA around the nsp9 dimer is suggested by the nsp9 structure and by a study using tryptophan fluorescence quenching¹⁹. We therefore consider that the function of the nsp9 dimer might be to protect the newly unwinding nascent and template strands emerging from the channel of the nsp7–nsp8 complex, which have not yet formed a stable secondary structure, from nuclease processing.

The crystal structure of the SARS-CoV hexadecameric nsp7–nsp8 supercomplex is the first obtained so far to show atomic interactions between coronavirus nonstructural proteins. Sixteen molecules associate tightly with one another to form a handled hollow cylindrical structure in which the coexistence of two conformations (nsp8I and nsp8II) is observed. Its novel architecture and unique mode of assembly should provide new insights in the field of macromolecular complex structures. Designing peptides or nonpeptidyl compounds that mimic the interaction interface between nsp7 and nsp8 is one strategy to block supercomplex formation and interfere with virus replication. Besides M^pro, this structure could provide a new candidate for drug design targeting those serious diseases caused by SARS-CoV, HCoV-229E and HCoV-HKU1.

Methods

Protein expression, purification and supercomplex assembly.

The coding sequences for SARS-CoV nsp7 and nsp8 were amplified by PCR from the SARS-CoV BJ01 strain (corresponding to 3837Ser–3919Gln and 3920Ala–4117Gln of orf1a replicative polyprotein, respectively) and inserted into the pGEX-6p-1 plasmid using BamHI and XhoI sites. The proteins were expressed in Escherichia coli strain BL21 (DE3) as GST fusion proteins. A selenomethionyl (SeMet) derivative of nsp7 was prepared using the method of methionine-biosynthesis pathway inhibition³². The GST fusion proteins were first purified by glutathione affinity column. The GST was released by GST–rhinovirus 3C protease (Amersham Biosciences), leading to five additional residues (GPLGS) at the N terminus, and SeMet-nsp7 and nsp8 were further purified by Superdex 200 (10/30) gel filtration column (Amersham Biosciences) in 25 mM sodium HEPES, 150 mM NaCl, 1 mM EDTA and 5 mM DTT (pH 7.5). nsp8 was then mixed with ∼1 molar excess of SeMet-nsp7 and passed over the Superdex 200 column in the same buffer. Fractions of the SeMet-nsp7–nsp8 complex were then concentrated and used for crystallization.
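
The polyprotein residue ranges given above determine the lengths of the mature proteins and the offset between polyprotein and mature-protein numbering. A minimal sketch of that arithmetic follows; the helper names are illustrative, and the tagged lengths assume that all five extra residues (GPLGS) remain on both proteins after cleavage, as the text describes.

```python
# Residue ranges in the orf1a replicative polyprotein, as given in the text.
NSP7 = (3837, 3919)   # Ser3837 to Gln3919
NSP8 = (3920, 4117)   # Ala3920 to Gln4117
TAG_RESIDUES = len("GPLGS")  # extra N-terminal residues left after GST removal


def segment_length(first: int, last: int) -> int:
    """Number of residues in an inclusive residue range."""
    return last - first + 1


def to_mature_numbering(polyprotein_position: int, segment: tuple) -> int:
    """Convert a polyprotein residue number to numbering within the mature protein."""
    first, last = segment
    if not first <= polyprotein_position <= last:
        raise ValueError("position lies outside the segment")
    return polyprotein_position - first + 1


if __name__ == "__main__":
    for name, segment in (("nsp7", NSP7), ("nsp8", NSP8)):
        length = segment_length(*segment)
        print(f"{name}: {length} residues ({length + TAG_RESIDUES} with GPLGS)")
```

The resulting lengths (83 residues for nsp7, 198 for nsp8) are consistent with the residue numbers used in the structural description, for example the nsp8 head domain ending at residue 196.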

Crystallization and data collection.

Crystals were grown at 291 K by the hanging drop vapor diffusion method from an ammonium sulfate system with a complex concentration of 5 mg ml⁻¹ in the gel filtration buffer. Crystals were flash-frozen in liquid nitrogen in a crystallization buffer supplemented with 25% glycerol. MAD data were collected to 2.8-Å resolution on beamline 3W1B of the Beijing Synchrotron Radiation Facility. Higher-resolution (2.4 Å) data were collected from a single crystal of the native complex on beamline BL19-ID of the Advanced Photon Source (Argonne National Laboratory, Argonne, Illinois, USA). Data were processed with the HKL2000 suite of programs³³ (Table 1).

Table 1.

Data collection, phasing and refinement statistics

	Native	nsp7(SeMet)–nsp8 crystal
		Peak	Inflection	Remote
Data collection
Space group	P2₁2₁2	P2₁2₁2
Cell dimensions
a, b, c (Å)	93.6, 94.0, 150.9	93.6, 94.0, 150.9
α, β, γ (°)	90, 90, 90	90, 90, 90
Wavelength (Å)	1.0332	0.9791	0.9793	0.9500
Resolution (Å)	2.3	2.8	2.8	2.8
R_merge	0.099 (0.476)^a	0.104 (0.422)	0.092 (0.369)	0.127 (0.604)
I/σI	24.0 (1.7)	15.5 (2.0)	15.5 (2.0)	10.1 (1.6)
Completeness (%)	96.8 (82.0)	99.0 (95.2)	97.1 (81.0)	96.0 (90.3)
Redundancy	8.4 (3.1)	5.9 (3.8)	5.4 (3.2)	5.4 (4.1)
Refinement
Resolution (Å)	50–2.4
No. reflections	51,504
R_work / R_free	21.3/25.1
No. atoms	7,782
Protein	7,598
Ligand/ion	35
Water	149
B-factors
Protein	56.5
Ligand/ion	76.2
Water	55.2
R.m.s. deviations
Bond lengths (Å)	0.006
Bond angles (°)	1.14

^aHighest-resolution shell is shown in parentheses (2.90–2.80 Å for selenium energies and 2.38–2.30 Å for the native data set).
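
The wavelengths listed in Table 1 can be converted to photon energies with E = hc/λ. The sketch below uses hc ≈ 12.39842 keV·Å, a standard constant that is not given in the paper; the function name is illustrative.

```python
# h*c expressed in keV * angstrom (standard physical constant, rounded).
HC_KEV_ANGSTROM = 12.39842

# Wavelengths from Table 1 (angstroms).
WAVELENGTHS = {
    "native": 1.0332,
    "peak": 0.9791,
    "inflection": 0.9793,
    "remote": 0.9500,
}


def photon_energy_kev(wavelength_angstrom: float) -> float:
    """Photon energy in keV for an X-ray wavelength in angstroms."""
    if wavelength_angstrom <= 0:
        raise ValueError("wavelength must be positive")
    return HC_KEV_ANGSTROM / wavelength_angstrom


if __name__ == "__main__":
    for label, wavelength in WAVELENGTHS.items():
        print(f"{label:>10}: {wavelength:.4f} A -> {photon_energy_kev(wavelength):.3f} keV")
```

The peak and inflection wavelengths correspond to energies close to 12.66 keV, which is about where the selenium K absorption edge lies (a value quoted from general knowledge, not from this paper), as expected for a SeMet MAD experiment.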

Phasing, model building and refinement.

MAD data (2.8 Å) were used to locate selenium sites and calculate an initial electron density map for model building. Twelve selenium sites were located using SOLVE³⁴, which identified noncrystallographic symmetry (four operators), and the initial phases were calculated up to 3.2 Å. Density modification and phase extension to 2.8 Å were performed with RESOLVE³⁵, and an interpretable electron density map was calculated. Automatic model building was performed using RESOLVE and ∼43% of the asymmetric unit was traced, including 166 full residues and 521 residues lacking side chains. The remainder of the model was built manually using O³⁶. Finally, four nsp7 molecules and four nsp8 molecules were located in one asymmetric unit. This model was refined to a resolution of 2.8 Å, with R_work = 23.2% and R_free = 28.2% for the MAD data, using CNS³⁷. The model was further refined, using the higher-resolution native data, to 2.4 Å, with R_work = 21.5% and R_free = 25.4%. The final model was confirmed to have good stereochemistry according to a Ramachandran plot calculated by PROCHECK³⁸. Phasing and refinement statistics are summarized in Table 1.
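
For readers unfamiliar with the R_work and R_free values quoted above, the conventional crystallographic R-factor is R = Σ| |F_obs| − |F_calc| | / Σ|F_obs|, evaluated over the working reflections (R_work) or over a held-out test set (R_free). The sketch below illustrates this definition only; it assumes the calculated amplitudes are already on the same scale as the observed ones, and it is not the procedure implemented in CNS.

```python
from typing import Sequence, Tuple


def r_factor(f_obs: Sequence[float], f_calc: Sequence[float]) -> float:
    """Conventional R-factor for pre-scaled structure-factor amplitudes."""
    if len(f_obs) != len(f_calc):
        raise ValueError("f_obs and f_calc must have the same length")
    denominator = sum(abs(o) for o in f_obs)
    if denominator == 0:
        raise ValueError("sum of observed amplitudes is zero")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    return numerator / denominator


def r_work_and_free(f_obs: Sequence[float],
                    f_calc: Sequence[float],
                    is_free: Sequence[bool]) -> Tuple[float, float]:
    """Split reflections by the free-set flag and return (R_work, R_free)."""
    work_obs = [o for o, free in zip(f_obs, is_free) if not free]
    work_calc = [c for c, free in zip(f_calc, is_free) if not free]
    free_obs = [o for o, free in zip(f_obs, is_free) if free]
    free_calc = [c for c, free in zip(f_calc, is_free) if free]
    return r_factor(work_obs, work_calc), r_factor(free_obs, free_calc)


if __name__ == "__main__":
    # Toy amplitudes for illustration only; not data from this study.
    observed = [10.0, 20.0, 30.0, 40.0]
    calculated = [9.0, 22.0, 30.0, 44.0]
    flags = [False, False, False, True]
    r_work, r_free = r_work_and_free(observed, calculated, flags)
    print(f"R_work = {r_work:.3f}, R_free = {r_free:.3f}")
```

Because the free reflections are excluded from refinement, R_free is normally somewhat higher than R_work, as in the values reported here.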

Electron microscopy and image processing.

For electron microscopy, the mixture of nsp7 and nsp8 was diluted to a final concentration of 0.3 mg ml⁻¹ in 20 mM sodium HEPES (pH 7.5) and 150 mM NaCl. The sample was applied to 400 mesh copper grids using the carbon sandwich technique with 1% uranyl acetate as a negative stain. Each micrograph was recorded with a JEM-100CX transmission electron microscope at a magnification of × 33,000 and an accelerating voltage of 80 kV. The images were processed with EMAN³⁹ and about 2,500 particles were extracted from the best image after manual screening of the automatic extracted particles. The reference-free alignment and two-dimensional averaging were performed by Boxer³⁹.

Cross-linking gel assay.

nsp7, nsp8 and mixtures of the two were diluted to 5 mg ml⁻¹ in 25 mM HEPES (pH 7.5), 300 mM NaCl, 1 mM EDTA and 1 mM DTT. Ethylene glycol bis(succinimidylsuccinate) was dissolved in DMSO to a concentration of 50 mM and then added to 10 μl of protein sample to a final concentration of 5 mM. After the mixture was incubated on ice for 2 h, the reaction was quenched for 15 min by adding 1 M Tris-HCl (pH 7.5) to a final concentration of 50 mM. An equal volume of 2× SDS-PAGE sample buffer was added and a small amount was analyzed on a 4–15% SDS polyacrylamide gel.
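
The stock and final concentrations above fix the volumes that must have been added, although the volumes themselves are not reported. The following sketch solves the dilution equation while accounting for the added volume; it is an illustration of the arithmetic under that assumption, not a record of what was pipetted, and the function name is hypothetical.

```python
def volume_to_add(stock_conc: float, final_conc: float, sample_volume: float) -> float:
    """Volume of stock needed to reach final_conc after mixing with sample_volume.

    Solves stock_conc * v = final_conc * (sample_volume + v) for v.
    Concentrations share one unit (here mM); volumes share one unit (here microlitres).
    """
    if not 0 < final_conc < stock_conc:
        raise ValueError("final_conc must be positive and below stock_conc")
    if sample_volume <= 0:
        raise ValueError("sample_volume must be positive")
    return final_conc * sample_volume / (stock_conc - final_conc)


if __name__ == "__main__":
    sample_ul = 10.0
    # Cross-linker: 50 mM stock in DMSO to 5 mM final.
    egs_ul = volume_to_add(50.0, 5.0, sample_ul)
    # Quench: 1 M (1000 mM) Tris-HCl to 50 mM final in the cross-linked mixture.
    tris_ul = volume_to_add(1000.0, 50.0, sample_ul + egs_ul)
    print(f"cross-linker stock: {egs_ul:.2f} ul")
    print(f"Tris-HCl stock:     {tris_ul:.2f} ul")
    print(f"DMSO fraction after cross-linker addition: {egs_ul / (sample_ul + egs_ul):.0%}")
```

Under these assumptions about 1.1 μl of cross-linker stock is needed per 10 μl of sample, which corresponds to roughly 10% DMSO in the reaction.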

Electrophoretic mobility shift assays.

The dsDNA (5′-CTTGCAAAAGACACAACTGA-3′) was synthesized by BioAsia. The dsRNA (5′-NGGAGACCAUGUGAUUGGCA-3′) was a gift from G. Gao (Chinese Academy of Sciences, Beijing, China). Nucleic acid was incubated with protein in 20 mM HEPES (pH 7.5), 100 mM NaCl, 1 mM EDTA and 5% glycerol for 30 min at room temperature. The samples were run on 4% nondenaturing TBE polyacrylamide gel, and the gel was then stained with ethidium bromide.

Accession codes.

Protein Data Bank: Coordinates have been deposited with accession code 2AHM.

Supplementary information

Supplementary Fig. 1

Superposition of the structures of nsp8 and nsp7. (PDF 491 kb)

Supplementary Fig. 2

Stereo view of the interface at site 1 and site 2. (PDF 882 kb)

Supplementary Fig. 3

Sketch map of the interactions between the 16 monomers. (PDF 358 kb)

Supplementary Fig. 4

Cross-linking gel of nsp7, nsp8 and nsp7–nsp8. (PDF 465 kb)

Supplementary Fig. 5

Raw electron micrograph and two-dimensional averaged images. (PDF 694 kb)

Supplementary Fig. 6

Stereo view of the mutated residues. (PDF 3119 kb)

Acknowledgements

We would like to thank R. Zhang and A. Joachimiak of the Advanced Photon Source, P. Liu and Y. Dong of the Beijing Synchrotron Radiation Facility, and P. Li and Q. Guo for help with data collection; J. Ziebuhr and L.-S. Su for comments and critical discussion; W. Xu for help with electron microscopy; J.-S. Jiang for image processing; and D. Su and Y. Xu for technical assistance. This work was supported by Projects 863 and 973 of the Ministry of Science and Technology of China (grants 200BA711A12, G199075600 and 2003CB514103), the National Natural Science Foundation of China (grant 30221003), the Sino-German Center (grant GZ236(202/9)) and the Sino-European Project on SARS Diagnostics and Antivirals of the European Commission (grant 003831).

Competing interests

The authors declare no competing financial interests.

Footnotes

Yujia Zhai and Fei Sun: These authors contributed equally to this work.

References

1. Brierley I, Digard P, Inglis SC. Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot. Cell. 1989;57:537–547. doi: 10.1016/0092-8674(89)90124-4.
2. Ziebuhr J, Snijder EJ, Gorbalenya AE. Virus-encoded proteinases and proteolytic processing in the Nidovirales. J. Gen. Virol. 2000;81:853–879. doi: 10.1099/0022-1317-81-4-853.
3. Thiel V, et al. Mechanisms and enzymes involved in SARS coronavirus genome expression. J. Gen. Virol. 2003;84:2305–2315. doi: 10.1099/vir.0.19424-0.
4. Marra MA, et al. The Genome sequence of the SARS-associated coronavirus. Science. 2003;300:1399–1404. doi: 10.1126/science.1085953.
5. Rota PA, et al. Characterization of a novel coronavirus associated with severe acute respiratory syndrome. Science. 2003;300:1394–1399. doi: 10.1126/science.1085952.
6. Snijder EJ, et al. Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 2003;331:991–1004. doi: 10.1016/S0022-2836(03)00865-9.
7. Gorbalenya AE. Big nidovirus genome. When count and order of domains matter. Adv. Exp. Med. Biol. 2001;494:1–17.
8. Drosten C, et al. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 2003;348:1967–1976. doi: 10.1056/NEJMoa030747.
9. Fouchier RA, et al. Aetiology: Koch's postulates fulfilled for SARS virus. Nature. 2003;423:240. doi: 10.1038/423240a.
10. Ksiazek TG, et al. A novel coronavirus associated with severe acute respiratory syndrome. N. Engl. J. Med. 2003;348:1953–1966. doi: 10.1056/NEJMoa030781.
11. Kuiken T, et al. Newly discovered coronavirus as the primary cause of severe acute respiratory syndrome. Lancet. 2003;362:263–270. doi: 10.1016/S0140-6736(03)13967-0.
12. Peiris JS, et al. Coronavirus as a possible cause of severe acute respiratory syndrome. Lancet. 2003;361:1319–1325. doi: 10.1016/S0140-6736(03)13077-2.
13. Peiris JS, Yuen KY, Osterhaus AD, Stohr K. The severe acute respiratory syndrome. N. Engl. J. Med. 2003;349:2431–2441. doi: 10.1056/NEJMra032498.
14. Zhao Z, et al. Description and clinical treatment of an early outbreak of severe acute respiratory syndrome (SARS) in Guangzhou, PR China. J. Med. Microbiol. 2003;52:715–720. doi: 10.1099/jmm.0.05320-0.
15. Zhong NS, et al. Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People's Republic of China, in February, 2003. Lancet. 2003;362:1353–1358. doi: 10.1016/S0140-6736(03)14630-2.
16. Yang H, et al. The crystal structures of severe acute respiratory syndrome virus main protease and its complex with an inhibitor. Proc. Natl. Acad. Sci. USA. 2003;100:13190–13195. doi: 10.1073/pnas.1835675100.
17. Ivanov KA, et al. Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase. J. Virol. 2004;78:5619–5632. doi: 10.1128/JVI.78.11.5619-5632.2004.
18. Ivanov KA, et al. Major genetic marker of nidoviruses encodes a replicative endoribonuclease. Proc. Natl. Acad. Sci. USA. 2004;101:12694–12699. doi: 10.1073/pnas.0403127101.
19. Egloff MP, et al. The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl. Acad. Sci. USA. 2004;101:3792–3796. doi: 10.1073/pnas.0307877101.
20. Sutton G, et al. The nsp9 replicase protein of SARS-coronavirus, structure and functional insights. Structure (Camb). 2004;12:341–353. doi: 10.1016/j.str.2004.01.016.
21. Woo PC, et al. Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia. J. Virol. 2005;79:884–895. doi: 10.1128/JVI.79.2.884-895.2005.
22. Holm L, Sander C. Dali: a network tool for protein structure comparison. Trends Biochem. Sci. 1995;20:478–480. doi: 10.1016/s0968-0004(00)89105-7.
23. Dunker AK, Brown CJ, Lawson JD, Iakoucheva LM, Obradovic Z. Intrinsic disorder and protein function. Biochemistry. 2002;41:6573–6582. doi: 10.1021/bi012159+.
24. Taylor TC, Andersson I. The structure of the complex between rubisco and its natural substrate ribulose 1,5-bisphosphate. J. Mol. Biol. 1997;265:432–444. doi: 10.1006/jmbi.1996.0738.
25. Xu Z, Horwich AL, Sigler PB. The crystal structure of the asymmetric GroEL-GroES-(ADP)7 chaperonin complex. Nature. 1997;388:741–750. doi: 10.1038/41944.
26. Kong XP, Onrust R, O'Donnell M, Kuriyan J. Three-dimensional structure of the beta subunit of E. coli DNA polymerase III holoenzyme: a sliding DNA clamp. Cell. 1992;69:425–437. doi: 10.1016/0092-8674(92)90445-i.
27. Krishna TS, Kong XP, Gary S, Burgers PM, Kuriyan J. Crystal structure of the eukaryotic DNA polymerase processivity factor PCNA. Cell. 1994;79:1233–1243. doi: 10.1016/0092-8674(94)90014-0.
28. Siddell SG. The Coronaviridae. 1995. The coronaviridae: an introduction; pp. 1–9.
29. Bauer GA, Burgers PM. The yeast analog of mammalian cyclin/proliferating-cell nuclear antigen interacts with mammalian DNA polymerase delta. Proc. Natl. Acad. Sci. USA. 1988;85:7506–7510. doi: 10.1073/pnas.85.20.7506.
30. Brockway SM, Clay CT, Lu XT, Denison MR. Characterization of the expression, intracellular localization, and replication complex association of the putative mouse hepatitis virus RNA-dependent RNA polymerase. J. Virol. 2003;77:10515–10527. doi: 10.1128/JVI.77.19.10515-10527.2003.
31. Bost AG, Carnahan RH, Lu XT, Denison MR. Four proteins processed from the replicase gene polyprotein of mouse hepatitis virus colocalize in the cell periphery and adjacent to sites of virion assembly. J. Virol. 2000;74:3379–3387. doi: 10.1128/jvi.74.7.3379-3387.2000.
32. Doublie S. Preparation of selenomethionyl proteins for phase determination. Methods Enzymol. 1997;276:523–530.
33. Otwinowski Z, Minor W. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 1997;276:307–326.
34. Terwilliger TC, Berendzen J. Automated MAD and MIR structure solution. Acta Crystallogr. D Biol. Crystallogr. 1999;55:849–861. doi: 10.1107/S0907444999000839.
35. Terwilliger TC. Maximum-likelihood density modification. Acta Crystallogr. D Biol. Crystallogr. 2000;56:965–972. doi: 10.1107/S0907444900005072.
36. Jones TA, Zou JY, Cowan SW, Kjeldgaard M. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr. A. 1991;47:110–119. doi: 10.1107/s0108767390010224.
37. Brunger AT, et al. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 1998;54:905–921. doi: 10.1107/s0907444998003254.
38. Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 1993;26:283–291.
39. Ludtke SJ, Baldwin PR, Chiu W. EMAN: semiautomated software for high-resolution single particle reconstructions. J. Struct. Biol. 1999;128:82–97. doi: 10.1006/jsbi.1999.4174.
40. Esnouf RM. An extensively modified version of MolScript that includes greatly enhanced coloring capabilities. J. Mol. Graph. 1997;15:132–134. doi: 10.1016/S1093-3263(97)00021-1.
41. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–4680. doi: 10.1093/nar/22.22.4673.
42. Gouet P, Courcelle E, Stuart DI, Metoz F. ESPript: analysis of multiple sequence alignments in PostScript. Bioinformatics. 1999;15:305–308. doi: 10.1093/bioinformatics/15.4.305.

Associated Data

Supplementary Materials

Supplementary Fig. 1. Superposition of the structures of nsp8 and nsp7. (PDF 491 kb)

Supplementary Fig. 2. Stereo view of the interface at site 1 and site 2. (PDF 882 kb)

Supplementary Fig. 3. Sketch map of the interactions between the 16 monomers. (PDF 358 kb)

Supplementary Fig. 4. Cross-linking gel of nsp7, nsp8 and nsp7–nsp8. (PDF 465 kb)

Supplementary Fig. 5. Raw electron micrograph and two-dimensional averaged images. (PDF 694 kb)

Supplementary Fig. 6. Stereo view of the mutated residues. (PDF 3119 kb)

Data Availability Statement

Accessions

Protein Data Bank

2AHM

