Abstract
Arterivirus replicase polyproteins are cleaved into at least 13 mature nonstructural proteins (nsps), and in particular the nsp5-to-nsp8 region is subject to a complex processing cascade. The function of the largest subunit from this region, nsp7, which is further cleaved into nsp7α and nsp7β, is unknown. Using nuclear magnetic resonance (NMR) spectroscopy, we determined the solution structure of nsp7α of equine arteritis virus, revealing an interesting unique fold for this protein but thereby providing little clue to its possible functions. Nevertheless, structure-based reverse genetics studies established the importance of nsp7/nsp7α for viral RNA synthesis, thus providing a basis for future studies.
TEXT
Arteriviruses are enveloped viruses with an ∼13- to ∼16-kb plus-stranded RNA genome (17, 18). In general, the consequences of arterivirus infection can range from an asymptomatic, persistent, or acute infection to abortion or lethal hemorrhagic fever (11). The family currently comprises four members: equine arteritis virus (EAV) (1), lactate dehydrogenase-elevating virus (LDV) of mice, simian hemorrhagic fever virus (SHFV), and porcine reproductive and respiratory syndrome virus (PRRSV). The latter arterivirus has become an economically important pathogen, recently causing major outbreaks in the Asian swine industry (25).
Together with the distantly related families Coronaviridae and Roniviridae, the Arteriviridae belong to the order Nidovirales (7, 17). Despite major differences in virion size and morphology, as well as pathogenesis, the evolutionary relationship among nidoviruses is evident in particular from the organization and composition of the large replicase gene (4). This gene consists of the two 5′-most open reading frames (ORFs) of the genome—ORF1a and ORF1b, which are followed by a variable number of downstream ORFs encoding structural and accessory proteins.
Translation of the 5′-capped and 3′-polyadenylated arterivirus genome results in two large precursor polyproteins, pp1a and the C-terminally extended frameshift product pp1ab. These are cleaved by up to four virus-encoded proteases to yield at least 13 nonstructural proteins (nsps) (nsp1 to nsp12, including nsp7α and 7β) plus a substantial number of processing intermediates of unknown function (5, 19, 20, 21, 23, 26). Together, the nsps direct the formation of a membrane-anchored replication/transcription complex (RTC) that mediates both viral genome replication and the synthesis of a 3′-coterminal nested set of subgenomic (sg) mRNAs, a nidovirus hallmark (13, 15).
The replicase of the arterivirus prototype EAV encodes pp1a and pp1ab products of 1,727 and 3,175 amino acids (aa), respectively, which are processed by three ORF1a-encoded proteinases residing in nsp1, nsp2, and nsp4 (5, 26). Other important functional domains include three proposed transmembrane proteins (nsp2, nsp3, and nsp5), key enzymes for RNA synthesis, such as the RNA-dependent RNA polymerase (RdRp; nsp9) (1) and helicase (nsp10) (16), and a uridylate-specific endoribonuclease (NendoU; nsp11) (12) of unknown function.
Despite recent progress in arterivirus replicase dissection, various uncharacterized domains and nsps remain, including in particular the nsp5-to-nsp8 region (nsp5-8), which is extensively processed via alternative proteolytic cascades (23) and yields at least seven long-lived processing intermediates. The largest subunit in this region is nsp7 (225 aa in EAV), which is flanked by two small nsps (nsp6 and nsp8, of only 22 and 50 aa, respectively) that may modulate its function, whereas nsp5-8 and nsp5-7 likely constitute membrane-associated forms of nsp6-8 and nsp6-7. nsp7 was recently found to be internally cleaved by the nsp4 protease, yielding small amounts of nsp7α and nsp7β (20). Mutagenesis of the nsp7α/7β site suggested that it is critical for EAV replication, although the function of nsp7 remains unknown, and no homolog of the protein appears to be present in other RNA virus families.
In order to further our understanding of the role of arterivirus nsp7, we have now determined the solution structure of the 123-aa nsp7α, which is far more conserved among arteriviruses than nsp7β. The protein was found to have a unique fold and possesses no recognizable functional motifs. Subsequently, residues suggested to be functionally important by sequence comparison and the derived structure were probed by reverse genetics, yielding a variety of crippled and nonviable EAV mutants that underscore the relevance of nsp7/nsp7α for arterivirus biology.
EAV nsp7α is comprised of amino acid residues Gly1453 to Glu1575 of the pp1a replicase protein (EAV Bucyrus isolate), which are renumbered 1 to 123 hereafter for convenience. Recombinant nsp7α was expressed in Escherichia coli as described previously (6; see also the supplemental material), and its structure was determined by nuclear magnetic resonance (NMR) spectroscopy using standard protocols executed with the Crystallography and NMR System (CNS) software (2) (Table 1 ) as previously described (3). The 20 lowest-energy structures were superimposed using the backbone coordinates for residues 11 to 122 (Fig. 1 A), giving an overall root mean square deviation (rmsd) between the family and the mean coordinate position of 0.505 Å for backbone atoms. The structural statistics for the family of 20 structures are reported in Table 1. The structural quality was assessed in terms of restraints violation (analyzed within CNS) and deviation from the ideal geometry (using the program PROCHECK-NMR) (Table 1).
Table 1.
NMR restraint or other structural statistic | Value for nsp7α |
---|---|
Total distance restraints (inter-residue) | 1,707 |
Short range (residue i to i + j, j = 1) | 644 |
Medium range (residue i to I + j, 2 ≤ j ≤ 4) | 437 |
Long range (residue i to i + j, j > 4) | 626 |
Hydrogen bonds | 32 |
Total dihedral angle restraints | 100 |
φ | 50 |
ψ | 50 |
Restraint violations | |
Distance restraint violation > 0.2 Å | None |
Dihedral restraint violation > 5° | None |
Average rmsd (Å) among the 20 refined structures | |
Residues | 11–122 |
Backbone of structured regionsa | 0.505 |
Heavy atoms of structured regions | 1.230 |
Ramachandran statistics of 20 structures (% residues) | |
Most favored regions | 78.4 |
Additional allowed regions | 19.3 |
Generously allowed regions | 0.8 |
Disallowed regions | 1.5 |
Residues selected on the basis of 15N backbone dynamics (residues 11 to 122).
The overall fold of nsp7α consists of a bundle of three α-helices packed against a mixed β-sheet formed by 4 antiparallel strands (β2 to β5) and one short parallel strand (β1) arranged in a β1β2β5β4β3 topology (Fig. 1B). The secondary-structure elements are as follows: α1 (Asp11 to Leu21), β1 (Ala25 to Ser28), α2 (Ser38 toVal54), α3 (Ala58 to Asp70), β2 (Asp81to Ile86), β3 (Ala93 to Lys96), β4 (Leu99 to Lys107), and β5 (Cys114 to Lys121). The two central strands, β5 and β4, are particularly long and twist around each other to form a central backbone onto which the other strands bolt.
nsp7α has four distinct hydrophobic regions essential for the correct folding of the protein (Fig. 1C). Most of the hydrophobic residues participating in the formation of those regions are either fully conserved among arteriviruses (Phe16, Leu17, Leu48, Leu52, Phe71, and Val83) or replaced by other hydrophobic residues (Phe14, Ala25, Val26, Leu36, Leu64, Ala72, Val77, Leu101, Val109, Ile116, and Val119), as shown in the sequence alignment presented in Fig. 2 G.
Analysis of the nsp7α structure using EBI Web tools (PDBsum/ProFunc, Catalytic site search; http://www.ebi.ac.uk), Dali (http://ekhidna.biocenter.helsinki.fi/dali_server/), and GRATH (http://protein.hbu.cn/cath/cathwww.biochem.ucl.ac.uk/cgi-bin/cath/Grath.html) did not result in any significant indicators of function or similarities in structure. The structure, therefore, represents a new fold and provides little clue about potential functions of nsp7α. Furthermore, electrostatic surface potential analysis revealed a rather mixed set of surface charges, with no clear electropositive or electronegative patches that would point toward the nsp7α function. (Fig. 2B and E).
To assess the relationship between nsp7α and its precursor, nsp7, we performed a comparison of the amide group chemical shifts for the two proteins using NMR spectroscopy. The full-length nsp7 coding sequence (nucleotides [nt] 4583 to 5255, encoding pp1a/pp1ab residues Ser1453 to Glu1677; EAV Bucyrus isolate) was also expressed in E. coli and purified as described for nsp7α (see the supplemental material). Many of the resonances of nsp7α do not appear to change significantly in the context of the full-length nsp7 (see Fig. S1 in the supplemental material). Overall, our data indicate that the nsp7α core structure is also present in the full-length protein and that the nsp7α/7β cleavage is very unlikely to be accompanied by a large conformational change in the nsp7α core.
On the basis of the EAV nsp7α structure and an nsp7 sequence alignment (Fig. 2G), a number of conserved and less-conserved nsp7α residues were selected for site-directed mutagenesis and reverse genetics. Mutations were introduced into an EAV infectious cDNA clone and, after electroporation of in vitro-transcribed full-length RNA, the viability of the mutant viruses was analyzed using immunofluorescence assays for the viral replicase subunit nsp3 and the nucleocapsid (N) protein, which can be used as markers for the synthesis of genomic and sg mRNA7, respectively (22). Furthermore, cell culture supernatants were harvested and tested in a plaque assay for the production of infectious virus progeny (Fig. 3) . Finally, the occurrence of reversion was monitored by virus passaging and sequence analysis.
Figure 3 summarizes the results of our mutagenesis study. The residues selected for mutagenesis either were thought to be important for protein folding, as deduced from the structure (Leu17, Arg24, Phe71, Asp81, and Gly111), or were residues with solvent-accessible side chains (Arg33, Phe39, Arg62, Arg63, and Lys67). Leu17 and Phe71 are buried within the hydrophobic cores presented above and are fully conserved among arteriviruses (Fig. 2G). Gly111 is conserved in the sense that the residue is small (either Gly or Ala), which is important for the stability of the β-turn between strands β4 and β5. Mutation of these residues to Asp might disrupt the core, and therefore, the protein fold, explaining the nonviable (Leu17→Asp and Phe71→Asp) and crippled (Gly111→Asp) phenotype of these mutants. Arg24 (in the loop between helices α1 and α2) and Asp81 (in β-strand β1) form a salt bridge that keeps the α-helical bundle stacked against the β-sheet (Fig. 1B). Its disruption by replacement of Arg24 with Ala indeed crippled virus replication. A range of phenotypes was observed for substitutions of the fully conserved Asp81 residue. The drastic change to Ala led to a quasiviable phenotype that did not produce plaques upon analysis of early virus harvests but subsequently reverted to the wild-type residue, indicating that a low level of viral RNA synthesis must have been maintained by this mutant. The more conservative Asp81→Glu substitution resulted in a viable but crippled mutant.
Residues Arg33, Phe39, Arg62, Arg63, and Lys67 are exposed on the protein surface and therefore could be of functional importance. In particular, the positively charged Arg33 (semiconserved), Arg62 (semiconserved), and Arg63 (not conserved) may provide suitable binding regions for the phosphate backbone of an oligonucleotide. Substitution of these residues resulted in crippled mutants, whereas replacement of Lys67 with another positively charged residue (i.e., Arg) had only a mild effect, strengthening our hypothesis. Interestingly, a replacement of Lys67 with Ser, the residue present at the corresponding position in SHFV nsp7α, pseudoreverted to Arg, suggesting that indeed a positively charged residue is preferred at position 67. For the Lys67→Ala mutant, some cells were positive in immunofluorescence assays but we repeatedly could not detect progeny virus. The solvent-accessible side chain of the (conserved aromatic) Phe39, which could provide stacking interactions with nucleotide bases, was found to be essential for virus viability.
In the complex proteolytic cascade used to mature the nsp3-8 region of the EAV pp1a and pp1ab replicase polyproteins, nsp7 (nsp7α) stands out for the number of long-lived processing intermediates that contain this subunit. The nsp4 cleavage sites involved in these proteolytic events are all conserved throughout the arterivirus group, suggesting that this multitude of nsp7-containing processing intermediates may be a common feature of arterivirus-infected cells (5, 20, 26). The structural analysis presented in this paper defines EAV nsp7α, the most conserved region of nsp7 (Fig. 2G), as a distinct replicase domain with a novel fold, a fold that is likely preserved in the C-terminally extended full-length nsp7. Even though the nsp7α structure and reverse genetics analysis provided us few clues about the (putative) function(s) of this protein, some valuable conclusions can be drawn, and we provide the first direct evidence that nsp7α indeed is a critical protein domain for arterivirus RNA synthesis (Fig. 3). Interpretation of the observed mutant phenotypes is complicated by their diversity and the fact, outlined above, that nsp7α is part of so many replicase-processing intermediates. As in other plus-stranded RNA virus systems, e.g., picornaviruses and alphaviruses (9, 10, 24), such intermediates may perform specific functions themselves and proteolytic cleavages may serve as a switch to (in)activate them. In any case, all replacements of nsp7α residues tested in this study affected some aspect of the EAV replicative cycle, indicating that they are structurally or functionally important and justifying the future in-depth characterization of these mutant phenotypes.
Protein structure accession number.
Structure data for the arterivirus nsp7α are available in the Protein Data Bank under accession number 2L8K.
Supplementary Material
Acknowledgments
We thank our LUMC colleagues Nancy Beerens, Linda Boomaars-van der Zanden, and Ali Tas; Bruno Coutard (AFMB Marseille); and Luigi Martino (King's College London) for excellent technical assistance.
This work was supported in part by the European project “VIZIER” (“Comparative Structural Genomics of Viral Enzymes Involved in Replication”) funded by the 6th Framework Programme of the European Commission under the reference LSHG-CT-2004-511960, and by NWO-CW TOP grant 700.57.301 from the Netherlands Organization for Scientific Research. M. R. Conte is indebted to the Wellcome Trust for financial support.
Footnotes
Supplemental material for this article may be found at http://jvi.asm.org/.
Published ahead of print on 11 May 2011.
REFERENCES
- 1. Beerens N., et al. 2007. De novo initiation of RNA synthesis by the arterivirus RNA-dependent RNA polymerase. J. Virol. 81:8384–8395 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Brünger A. T. 2007. Version 1.2 of the crystallography and NMR system. Nat. Protoc. 2:2728–2733 [DOI] [PubMed] [Google Scholar]
- 3. Conte M. R., et al. 2006. Structure of the eukaryotic initiation factor (eIF) 5 reveals a fold common to several translation factors. Biochemistry 45:4550–4558 [DOI] [PubMed] [Google Scholar]
- 4. den Boon J. A., et al. 1991. Equine arteritis virus is not a togavirus but belongs to the coronaviruslike superfamily. J. Virol. 65:2910–2920 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Fang Y., Snijder E. J. 2010. The PRRSV replicase: exploring the multifunctionality of an intriguing set of nonstructural proteins. Virus Res. 154:61–76 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Gaudin C., Manolaridis I., Tucker P. A., Conte M. R. 2010. Resonance assignment of nsp7alpha from arterivirus. Biomol. NMR Assign. 5:23–25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Gorbalenya A. E., Enjuanes L., Ziebuhr J., Snijder E. J. 2006. Nidovirales: evolving the largest RNA virus genome. Virus Res. 117:17–37 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Larkin M. A., et al. 2007. Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948 [DOI] [PubMed] [Google Scholar]
- 9. Lawson M. A., Semler B. L. 1992. Alternate poliovirus nonstructural protein processing cascades generated by primary sites of 3C proteinase cleavage. Virology 191:309–320 [DOI] [PubMed] [Google Scholar]
- 10. Lemm J. A., Rumenapf T., Strauss E. G., Strauss J. H., Rice C. M. 1994. Polypeptide requirements for assembly of functional Sindbis virus replication complexes: a model for the temporal regulation of minus- and plus-strand RNA synthesis. EMBO J. 13:2925–2934 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Maclachlan N. J., Balasuriya U. B., Murtaugh M. P., Barthold S. W., Lowenstine L. J. 2007. Arterivirus pathogenesis and immune response, p. 325–337 In Perlman S., Gallagher T., Snijder E. J. (ed.), Nidoviruses. ASM Press, Washington, DC [Google Scholar]
- 12. Nedialkova D. D., et al. 2009. Biochemical characterization of arterivirus nonstructural protein 11 reveals the nidovirus-wide conservation of a replicative endoribonuclease. J. Virol. 83:5671–5682 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Pasternak A. O., Spaan W. J. M., Snijder E. J. 2006. Nidovirus transcription: how to make sense.? J. Gen. Virol. 87:1403–1421 [DOI] [PubMed] [Google Scholar]
- 14. Potterton E., McNicholas S., Krissinel E., Cowtan K., Noble M. 2002. The CCP4 molecular-graphics project. Acta Crystallogr. 58:1955–1957 [DOI] [PubMed] [Google Scholar]
- 15. Sawicki S. G., Sawicki D. L., Siddell S. G. 2007. A contemporary view of coronavirus transcription. J. Virol. 81:20–29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Seybert A., van Dinten L. C., Snijder E. J., Ziebuhr J. 2000. Biochemical characterization of the equine arteritis virus helicase suggests a close functional relationship between arterivirus and coronavirus helicases. J. Virol. 74:9586–9593 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Snijder E. J., Meulenberg J. J. M. 1998. The molecular biology of arteriviruses. J. Gen. Virol. 79:961–979 [DOI] [PubMed] [Google Scholar]
- 18. Snijder E. J., Spaan W. J. M. 2006. Arteriviruses, p. 1337–1355 In Knipe D. M., Howley P. M. (ed.), Fields virology. Lippincott, Williams & Wilkins, Philadelphia, PA [Google Scholar]
- 19. van Aken D., Snijder E. J., Gorbalenya A. E. 2006. Mutagenesis analysis of the nsp4 main proteinase reveals determinants of arterivirus replicase polyprotein autoprocessing. J. Virol. 80:3428–3437 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. van Aken D., Zevenhoven-Dobbe J. C., Gorbalenya A. E., Snijder E. J. 2006. Proteolytic maturation of replicase polyprotein pp1a by the nsp4 main proteinase is essential for equine arteritis virus replication and includes internal cleavage of nsp7. J. Gen. Virol. 87:3473–3482 [DOI] [PubMed] [Google Scholar]
- 21. van Dinten L. C., Rensen S., Spaan W. J. M., Gorbalenya A. E., Snijder E. J. 1999. Proteolytic processing of the open reading frame 1b-encoded part of arterivirus replicase is mediated by nsp4 serine protease and is essential for virus replication. J. Virol. 73:2027–2037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. van Dinten L. C., van Tol H., Gorbalenya A. E., Snijder E. J. 2000. The predicted metal-binding region of the arterivirus helicase protein is involved in subgenomic mRNA synthesis, genome replication, and virion biogenesis. J. Virol. 74:5213–5223 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Wassenaar A. L. M., Spaan W. J. M., Gorbalenya A. E., Snijder E. J. 1997. Alternative proteolytic processing of the arterivirus replicase ORF1a polyprotein: evidence that NSP2 acts as a cofactor for the NSP4 serine protease. J. Virol. 71:9313–9322 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Ypma-Wong M. F., Filman D. J., Hogle J. M., Semler B. L. 1988. Structural domains of the poliovirus polyprotein are major determinants for proteolytic cleavage at Gln-Gly pairs. J. Biol. Chem. 263:17846–17856 [PubMed] [Google Scholar]
- 25. Zhou L., Yang H. 2010. Porcine reproductive and respiratory syndrome in China. Virus Res. 154:31–37 [DOI] [PubMed] [Google Scholar]
- 26. Ziebuhr J., Snijder E. J., Gorbalenya A. E. 2000. Virus-encoded proteinases and proteolytic processing in the Nidovirales. J. Gen. Virol. 81:853–879 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.