Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2022 Jan 17;50(2):1000–1016. doi: 10.1093/nar/gkab1302

Crystal structures of alphavirus nonstructural protein 4 (nsP4) reveal an intrinsically dynamic RNA-dependent RNA polymerase fold

Yaw Bia Tan 1,2, Laura Sandra Lello 3, Xin Liu 4, Yee-Song Law 5,6, Congbao Kang 7, Julien Lescar 8,9, Jie Zheng 10, Andres Merits 11,, Dahai Luo 12,13,14,
PMCID: PMC8789068  PMID: 35037043

Abstract

Alphaviruses such as Ross River virus (RRV), chikungunya virus (CHIKV), Sindbis virus (SINV), and Venezuelan equine encephalitis virus (VEEV) are mosquito-borne pathogens that can cause arthritis or encephalitis diseases. Nonstructural protein 4 (nsP4) of alphaviruses possesses RNA-dependent RNA polymerase (RdRp) activity essential for viral RNA replication. No 3D structure has been available for nsP4 of any alphaviruses despite its importance for understanding alphaviral RNA replication and for the design of antiviral drugs. Here, we report crystal structures of the RdRp domain of nsP4 from both RRV and SINV determined at resolutions of 2.6 Å and 1.9 Å. The structure of the alphavirus RdRp domain appears most closely related to RdRps from pestiviruses, noroviruses, and picornaviruses. Hydrogen-deuterium exchange mass spectrometry (HDX-MS) and nuclear magnetic resonance (NMR) methods showed that in solution, nsP4 is highly dynamic with an intrinsically disordered N-terminal domain. Both full-length nsP4 and the RdRp domain were capable to catalyze RNA polymerization. Structure-guided mutagenesis using a trans-replicase system identified nsP4 regions critical for viral RNA replication.

INTRODUCTION

Alphaviruses (family Togaviridae) and flaviviruses (family Flaviviridae) comprise numerous important arthropod-borne viruses (arboviruses) that cause diseases in humans. Fast urbanization, massive international travel, and global warming have triggered the re-emergence of several arboviral diseases, such as dengue, Zika, and chikungunya fevers, affecting millions worldwide (1–3). Many arboviruses infect humans through Aedes mosquitoes which are abundantly present in tropical/subtropical regions and are expanding into temperate climate territories. Arbovirus members of the genus Alphavirus have been divided into Old World and New World alphaviruses, and cause clinically distinct symptoms (4,5). New World alphaviruses such as eastern equine encephalitis virus (EEEV) and Venezuelan equine encephalitis virus (VEEV) are primarily associated with encephalitis in horses and humans that causes death or long-term neurological impairment. Despite the generally nonfatal nature of infections, Old World alphaviruses, such as chikungunya virus (CHIKV), cause a debilitating illness with persisting painful arthritis (6). Between March 2005 and April 2006, a major outbreak in the Reunion resulted in ∼255 000 cases of chikungunya fever, affecting approximately one-third of the total population of this French island. Human diseases caused by Ross River virus (RRV) and Sindbis virus (SINV) are generally milder, although they are also arthritogenic. Without any approved treatment, re-emerging outbreaks of alphaviruses highlight the unmet needs for vaccines and specific antiviral drugs. Available treatments for alphaviral infection are mostly symptom relief to minimize fever, joint pain, and associated inflammation. Several compounds, such as β-d-N 4-hydroxycytidine (NHC), favipiravir, and Compound-A, were claimed to be effective in inhibiting VEEV, CHIKV and SINV RNA synthesis but have thus far not progressed towards clinical trials as anti-alphavirus compounds (7).

Alphaviruses have a positive-sense (+) RNA genome of ∼11.8 kilobases in length with a 5′ type-0 cap and a 3′ polyadenylated tail. The two open reading frames (ORFs) in alphavirus RNA are flanked by cis-acting sequence elements: 5′ and 3′ untranslated regions (UTRs) and intergenic region partly overlapping with subgenomic (SG) promoter (8). RRV and CHIKV invade host cells by binding to the receptor Mxra8 (9), while SINV recognizes NRAMP as the main receptor (10). Receptor binding is followed by clathrin-assisted endocytosis, entry by fusion between the host cell and the viral membrane, and the release of the nucleocapsid containing the viral RNA genome into the cytoplasm. Upon its release, the genome functions as mRNA for polyproteins P123 and P1234 translated from its first ORF, and the expression of P1234 occurs through the readthrough of an inframe opal codon. These polyproteins are precursors for four nonstructural proteins (nsP1-4) that are formed via tightly regulated autoproteolysis and assemble into a viral RNA replication complex (RC). Alphavirus RCs are vesicular organelles called spherules and represent sites for viral RNA synthesis that are protected from the host immune response. Three types of viral RNAs are synthesized during the infectious cycle: the full-length negative-strand intermediate, the full-length positive-strand genomic RNA, and the shorter positive-strand SG RNA. SG RNA synthesis is driven by the SG promoter, and the RNA is used to express structural proteins encoded by the second ORF of the alphavirus genome. The newly made RNA genome and structural proteins assemble into progeny virions that are released by budding from the plasma membrane and are thereafter capable of further infection.

Alphavirus P1234 and its cleavage products contain all virus-encoded components required for RC assembly and all enzymatic activities needed for the synthesis and modification of viral RNAs. nsP4 contains RNA-dependent RNA polymerase (RdRp) activity responsible for de novo RNA synthesis and terminal adenylyltransferase (TATase) activity (11–13). In addition to nsP4, all other nsPs are also important for effective genome replication (11,12). Many molecular and structural details of the alphaviral nsP1-3 proteins have already been reported except for the C-terminal hypervariable region of nsP3, which is intrinsically disordered. In contrast, structural information for nsP4 has been lacking, in large part due to its intrinsic flexibility, which is important for its biological functions (14–22). Out of a total of ∼610 amino acid residues, the N-terminal region of nsP4 (∼aa 1–100) comprises an N-terminal domain (NTD) unique to alphaviruses with a poorly defined function. The C-terminal region of the protein comprises the alphaviral RdRp domain. Over the years, several attempts to express nsP4 from alphaviruses in an active form for structural studies have yielded poorly soluble recombinant proteins with a tendency to aggregate. Substantial differences were noted, however, between nsP4 proteins of various alphaviruses: recombinant nsP4 of SINV possesses significant RNA polymerase activity (23), while the recombinant nsP4 RdRp domain of CHIKV displays only low template elongation activity (13). As a result, alphavirus nsP4 has resisted structural determination for several decades, which has hampered rational design for the development of compounds to target its activities. The availability of experimentally determined structures of nsP4 from any alphavirus would therefore constitute an important step to accelerate the high-throughput screening of inhibitors in silico, while the development of robust enzymatic RdRp assays would facilitate testing of these compounds. Furthermore, structural information on nsP4 would also be crucial for a better understanding of the structure and function of the alphavirus RC as a whole, and thus represent a strong foundation for further drug discovery efforts.

Here, we report crystal structures of alphavirus RdRp from both RRV and SINV determined at 2.6 Å and 1.9 Å resolution respectively (Table 1). Having a dynamic structure, the RRV RdRp contains several disordered segments and some electron density in the RNA binding tunnel remains unassigned. The SINV RdRp adopts an overall fold similar to RRV RdRp but appears more ordered: residues that were disordered in the RRV nsP4 crystal structure are now well resolved in SINV RdRp and almost fully occupy the central RNA binding tunnel. Thus, this alphavirus unique RdRp conformation seems to be incompatible with RNA binding and polymerization. Complementary structural analysis performed using hydrogen–deuterium exchange mass spectrometry (HDX-MS) and nuclear magnetic resonance (NMR) helped to better understand nsP4 structural dynamics and revealed the intrinsically flexible conformations adopted by its NTD and RdRp domains in solution. Both full-length nsP4 (nsP4FL) and its RdRp domain displayed weak RNA polymerase activities in vitro. Finally, structure-guided mutagenesis experiments were used to gain functional insights into nsP4 in alphaviral RNA replication.

Table 1.

Data collection, phasing and refinement statistics

RRV SeM-RdRpSER3 RRV RdRpSER3 SINV RdRp SINV RdRpCS
Data collection Se-SAD phasing Native Native Native
PDB ID - 7F0S 7VB4 7VW5
Space group P 21 21 21 P 21 21 21 P 1 P 1
Unit cell dimensions
a, b, c (Å) 64.15, 68.58, 100.98 64.84, 68.35, 99.65 67, 68.29, 71.07 67.45, 68.94, 71.42
 α, β, γ (°) 90, 90, 90 90, 90, 90 115.97, 106.71, 95.90 116.311, 107.05, 95.10
Wavelength 0.979 (peak) 0.999 (native) 1.000 0.946
Resolution range (Å) 46.85–2.83 (2.93–2.83) 23.41–2.6 (2.69–2.6) 40.52–1.9 (1.968–1.9) 34.48–2.3 (2.382–2.3)
Total reflections 296 954 (29 212) 148 510 (15 410) 524 533 (36 101) 161 646 (17 056)
Unique reflections 11 129 (1107) 14 118 (1399) 78 572 (6646) 45 591 (4603)
Mean I/sigma(I) 17.19 (0.30) 23.81 (1.78) 18.84 (1.90) 13.25 (3.46)
Completeness (%) 99.93 (99.82) 99.66 (100.00) 92.26 (71.47) 94.41 (93.72)
Multiplicity 26.7 (26.3) 10.5 (11.0) 6.7 (5.4) 3.5 (3.7)
R merge b - 0.066 (1.469) 0.06941 (0.9519) 0.05755 (0.3683)
R meas c - 0.070 (1.544) 0.07531 (1.054) 0.06833 (0.4318)
R pim d - 0.023 (0.470) 0.02882 (0.4429) 0.03631 (0.2233)
CC1/2 e - 0.984 (0.632) 0.999 (0.708) 0.998 (0.901)
FOM 0.70 (0.66) - - -
Phase error (°) 35.60 (40.21) - - -
Refinement Statistics
 Reflections used in refinement - 14 112 (1399) 78 574 (5923) 45 652 (4415)
 Reflections used for R-free - 696 (67) 1858 (146) 1169 (112)
Rwork f - 0.265 (0.3877) 0.1875 (0.3210) 0.1950 (0.2564)
Rfree g - 0.288 (0.4466) 0.2239 (0.3549) 0.2446 (0.3374)
Number of nonhydrogen atoms - 3109 8064 7648
 macromolecules - 3108 7160 7267
 ligands - - 28 4
 solvent (water) - 1 876 377
Protein residues - 419 917 930
RMSD (bonds) - 0.006 0.003 0.002
RMSD (angles) - 0.99 0.58 0.53
Ramachandran statistics
 Ramachandran favored (%) - 91.81 97.45 97.6
 Ramachandran allowed (%) - 8.19 2.44 2.4
 Ramachandran outliers (%) - 0.00 0.11 0
Rotamer outliers (%) - 0.00 0.13 0.25
Clashscore - 12.18 3.21 4.69
Average B-factor - 108.36 28.32 44.61
 macromolecules - 108.37 27.71 44.54
 ligands - - 34.43 52.67
 solvent - 69.40 33.15 45.86
Number of TLS groups - 6 7 7

The values presented in parentheses are of the highest resolution shell.

Inline graphic .

Inline graphic , where Rmeas is the precision indicator of individual observation for unmerged data.

Inline graphic , where Rpim is the precision indicator for the merged data.

Inline graphic , where CC1/2 is of two half datasets.

Inline graphic .

Inline graphic , where Rfree was randomly sampled from 5% reflections.

MATERIALS AND METHODS

Molecular cloning

The sequences encoding RRV nsP4FL (aa 1–611) and its RdRp domain (aa 110–611) were obtained using PCR on an RRV-T48 cDNA plasmid template. The sequences encoding SINV nsP4FL (aa 1–610) and its RdRp domain (aa 91–610) were obtained using PCR on a SINV cDNA plasmid template (pSIN ReP5-GFP-GluR1 from Addgene). The PCR fragments were subcloned into the pSUMO-LIC vector using a ligation-independent cloning method in which the expressed recombinant protein had an N-terminal hexahistidine-linked small ubiquitin-like modifier protein (His6-SUMO) fusion tag which can act as solubility enhancer. For RRV RdRp, the RdRp surface entropy reduction construct (RdRpSER3) predicted by SERp server (https://services.mbi.ucla.edu/SER/) harboring two glutamine-to-alanine substitutions (Q192A and Q196A), nsP4FL polymerase-weakened mutant (D466N and D467N; named as GNN) and construct for expression of NTD (aa 1–109) were obtained using site-directed mutagenesis (SDM) with the Toyobo KOD-FX Neo cloning kit. For SINV RdRp, the C164S and C164N mutants (named as SINV RdRpCS and RdRpCN) were obtained via the same SDM method. These plasmids were propagated using the E. coli TOP10 strain.

Human RNA polymerase I (HSPolI) promoter-based plasmids for the production of replication-competent RNA templates of RRV (HSPolI-FG-RRV) and SINV (HSPolI-FG-SINV) have been previously described (24). Plasmids encoding P123 of RRV (CMV-P123-RRV) and SINV (CMV-P123-SINV) as well as plasmids expressing the corresponding nsP4 fused to ubiquitin at their N-termini (CMV-ubi-nsP4-RRV and CMV-ubi-nsP4-SINV) have been described elsewhere (25). Variants of CMV-ubi-nsP4-RRV and CMV-ubi-nsP4-SINV encoding the polymerase negative variant of nsP4 with GDD to GAA mutation in the polymerase active site were obtained using site-directed mutagenesis and designated CMV-ubi-nsP4GAA-RRV and CMV-ubi-nsP4GAA-SINV, respectively. Variants of CMV-ubi-nsP4-RRV and CMV-ubi-nsP4-SINV containing mutations shown in Supplementary Figure S8 were constructed using synthetic DNA fragments (Genscript). Sequences of all plasmids were verified using Sanger sequencing and are available from the authors upon request.

Recombinant protein expression and purification

The recombinant proteins were expressed using the phage-resistant E. coli Rosetta II T1R strain developed in-house. Bacteria were grown in 2–4 l Luria-Bertani (LB) Millar Broth (Axil Scientific) supplemented with 50 μg/ml kanamycin (Gold Biotechnology) and 34 μg/ml chloramphenicol (Gold Biotechnology) in a 37°C shaking incubator (Infors HT) until reaching an optical density (OD600) measurement between 0.8 and 1.0. The shaker's temperature was then adjusted to 18°C, expression was induced by the addition of isopropyl β-d-1-thiogalactopyranoside (IPTG) to a final concentration of 0.5 mM, and expression was carried out for 16–24 h. For selenomethionine labeling, an adapted protocol from Molecular Dimension was used for extended-expression at 18°C for 16 h (26). All the expression cultures were harvested by centrifugation for 10 min at 5000 × g, and bacterial pellets were frozen and stored at -80°C until recombinant protein purification. For 13C/15N-labeled NTDs from RRV, expression and induction were performed as described above except for the use of M9 minimal medium supplemented with 2 g/l 13C-glucose and/or 1 g/l 15NH4Cl.

The purifications of the recombinant nsP4 proteins (labeled or unlabeled) were conducted at 4°C using gentle treatments to ensure the structural integrity of the obtained samples. The bacterial pellet was suspended in IMAC A buffer (2× PBS, 500 mM NaCl, 5 mM β-mercaptoethanol, and 10% glycerol) supplemented with 1× protease inhibitor cocktail (DMSO free, MedChemExpress), 0.1–0.25% Tween-20 (Bio Basic) and 5 mM imidazole (Sigma Aldrich). The suspension was briefly sonicated before homogenization through 1000-bar pressurized PANDAPLUS2000 (GEA). The crude lysate was clarified via centrifugation in an ultracentrifuge XPN-100 (Beckman Coulter) at 235 000 × g. The clarified lysate was incubated with Ni-NTA FF beads (Bio Basic). Beads with bound recombinant protein were transferred to a column and subsequently washed with 15 column volumes (CVs) of IMAC A2 buffer (2× PBS, 500 mM NaCl, 5 mM β-mercaptoethanol, 10% glycerol, and 10 mM imidazole). The recombinant protein was eluted in 5 CV of IMAC B buffer (2× PBS, 500 mM NaCl, 5 mM β-mercaptoethanol, 10% glycerol and 300 mM imidazole) and treated with Ulp1-SUMO-protease (1:25 w/w ratio of protease: protein) for 16–20 h at 4°C in the presence of 250 mM NaCl to remove the N-terminal His6-SUMO tag. The cleaved recombinant protein was loaded onto a Heparin 5 HP column (Cytiva) at an adjusted concentration of 100 mM NaCl and subsequently eluted using 900 mM NaCl. The eluate was further separated through size-exclusion chromatography (SEC) in a HiLoad S75 or S200 16/600 Superdex® column (Cytiva) on an AKTA Pure 25 M fast protein liquid chromatography (FPLC) system (Cytiva).

Products obtained at all purification steps were examined for their purity by SDS-PAGE. The fractions corresponding to the elution peak were selected and concentrated to 2–50 mg/ml through the use of Amicon® Ultra-15 Centrifugal Unit (Merck), frozen and stored at –80°C for downstream experiments. The storage buffer is 25 mM HEPES pH 7.4, 300 mM NaCl, 2 mM tris(2-carboxyethyl) phosphine (TCEP) and 5% glycerol.

X-ray crystallography

Crystallization conditions for purified recombinant nsP4 proteins were screened with Hampton Research commercial kits (PEG-ION HT, PEG-RX HT, INDEX HT, SALT-RX, CRYSTAL-SCREEN HT) at concentrations 100–121 μM with and without ligands, aided by a Mosquito robotic liquid handler (SPT Labtech) and crystallization drops were periodically imaged in the RockMaker (Formulatrix®) maintained at 20°C.

Under optimized conditions, 121 μM RRV RdRp was incubated with 2–5 mM MgCl2, 2–5 mM deoxyadenosine triphosphate or adenosine triphosphate (dATP/ATP) and 135 μM synthesized self-annealing duplex-RNA R2 (dsRNA: 5′ GCAUGGGCGCCC 3′) for 1–2 h. This protein-RNA mixture led to crystal formation with sizes of 0.2–0.5 mm on a vapor-diffusion hanging-drop of 24% (w/v) PEG 3350, 0.1 M Na HEPES/Tris HCl pH 8.0, 0.2 M ammonium acetate, and 3% trimethylamine N-oxide (TMAO) in 5–7 days. The crystals were dehydrated using the same buffer supplemented with 30% (w/v) PEG 3350 and 10% glycerol as cryoprotectants before snap freezing on a cryoloop in liquid nitrogen. Crystals of SINV RdRp (both wild-type and RdRpCS mutant) were obtained with a concentration of 110 μM incubated with 2 mM ATP, 2 mM MgCl2, 18–20% PEG3350 and 0.2 M magnesium acetate. These crystals grew to a final size of 0.3–0.5 mm under vapor-diffusion hanging drop and were snap-frozen with the addition of 10–20% glycerol.

X-ray diffraction datasets (Table 1) were collected at Taiwan Photon Source (TPS) Beamline TPS-05A, Japan Spring-8 Beamline BL32XU (27,28), Swiss Light Source Beamline X06DA (PXIII), and Australian Synchrotron Beamline MX2 (29,30), all with a nitrogen stream to maintain crystals at 100 K. Data were collected for at least a total consecutive sweep of 200 degrees with an oscillation step of 0.1–0.5° and a 70–90% attenuated beam. The detector was placed at a distance of 260–300 mm from the crystal for native data collection at wavelengths of ∼0.95–1 Å. Single-wavelength anomalous diffraction (SAD) data collected at the selenium K-edge for the seleniated RRV RdRp protein were used for SAD phasing and the SeMet positions were used as markers to aid model building. Data were preprocessed on-the-fly by an XDS-based program (31) and further processed in the CCP4 7.1 (32) and Phenix 1.19 software suites (33) for SAD phasing and automated model building. Crystal structures were displayed at the computer graphics using Coot and refined with Phenix (33,34). Table 1 summarizes the crystallographic refinement parameters.

Hydrogen–deuterium exchange detected by mass spectrometry

For both RRV and SINV, 5 μM of either nsP4FL or RdRp or NTD recombinant protein in a buffer containing 50 mM Na HEPES, pH 7.4, 150 mM NaCl, 5% glycerol, 5 mM MgCl2, and 2 mM DTT was used to carry out hydrogen-deuterium exchange (HDX) reactions at 4°C. Four microliters of protein/protein complex with ligand/peptide were diluted in 16 μl D2O in exchange buffer (50 mM Na HEPES, pH 7.4, 50 mM NaCl, and 2 mM DTT), incubated for various HDX time points (e.g. 0, 10, 60, 300, 900 s) at 4°C and quenched by mixing with 20 μl of ice-cold 1 M TCEP 100 mM NaH2PO4. Each quenched sample was immediately injected into the LEAP Pal 3.0 HDX platform. Upon injection, samples were passed through an immobilized pepsin column (2 mm × 2 cm) at 120 μl min−1, and the digested peptides were captured on a C18 PepMap300 trap column (Thermo Fisher) and desalted. Peptides were separated across a 2.1 mm × 5 cm C18 separating column (1.9 mm Hypersil Gold, Thermo Fisher) with a linear gradient of 4–40% CH3CN and 0.3% formic acid over 6 min. Sample handling, protein digestion, and peptide separation were conducted at 4°C. Mass spectrometric data were acquired using a Fusion Orbitrap mass spectrometer (Thermo Fisher) with a measured resolving power of 65 000 at m/z 400. HDX analyses were performed in duplicate or triplicate with single preparations of each protein state. The intensity-weighted mean m/z centroid value of each peptide envelope was calculated and subsequently converted into a percentage of deuterium incorporation. Statistical significance for the differential HDX data was determined by an unpaired Student's t-test for each time point, a procedure that is integrated into the HDX Workbench software (35). Corrections for back exchange were made based on an estimated 70% deuterium recovery and accounting for the known 80% deuterium content of the deuterium exchange buffer.

For peptide identification, product ion spectra were acquired in data-dependent mode with the top eight most abundant ions selected for the production analysis per scan event. The MS/MS data files were submitted to Proteome Discover 2.4 (Thermo Fisher) for high-confidence peptide identification. For data rendering, the HDX data from all overlapping peptides were consolidated to individual amino acid values using a residue averaging approach. Briefly, for each residue, the deuterium incorporation values and peptide lengths from all overlapping peptides were assembled. A weighting function was applied in which shorter peptides were weighted more heavily and longer peptides were weighted less. Each of the weighted deuterium incorporation values was then averaged to produce a single value for each amino acid. The initial two residues of each peptide, as well as prolines, were omitted from the calculations.

NMR data collection and analysis

The NMR spectrum of the RRV NTD was collected at 298 K on a Bruker Avance II magnet with a proton frequency of 600 MHz equipped with a cryoprobe. A13C/15N-labeled (0.3 mM) NTD in a buffer that contained 20 mM Na-PO4, pH 6.5, 150 mM NaCl and 1 mM DTT was used for data acquisition. Experiments including 3D-HNCACB, HNCOCACB, HNCA, HNCOCA and HNCO were collected for resonance assignment. Assignment was challenging due to the absence of signals from several residues. Data were processed using NMRPipe (36) and visualized using NMRView (37).

Structure analysis

The 3D crystal structures of RRV and SINV RdRp (this work) were queried against the full PDB database through the Dali server (Dali; http://ekhidna2.biocenter.helsinki.fi/dali/) for matching structural homologs. Amino acid sequences of the top hits from Dali were aligned through the COBALT Multiple Alignment Tool (https://www.ncbi.nlm.nih.gov/tools/cobalt/cobalt.cgi) or MUSCLE (https://www.ebi.ac.uk/Tools/msa/muscle/) to generate a multiple sequence alignment (MSA) file. The sequence-conservation map overlay with the RdRp secondary structure was generated through ESPRIPT (https://espript.ibcp.fr) (38) with the input of the MSA file and RdRpSER3 PDB. The nsP4FL, RdRp and NTD homology models of RRV or SINV nsP4 were predicted based on comparative modeling (CM) mode with the template of the best-refined RdRpSER3 structure by sampling for 5 models through the Robetta server (https://robetta.bakerlab.org/) (39). The homology models were superimposed on other experimental crystal structures for comparison and analysis. These homology models were colored and annotated for the HDX-MS experiment. All structural displays were created in PyMOL 2.4 (Schrodinger LLC) (40).

RNA polymerase assay

The optimized RNA polymerase assay from (13) was conducted here using 1 μM recombinant nsP4 proteins and 50 nM internally labeled fluorescent T1 hairpin-RNA template in a buffer containing 25 mM Na HEPES pH 8.0, 2.5 mM MnCl2, 5.0 mM MgCl2, 20 mM KCl, 5 mM TCEP, 3 mM Lauryldimethylamine oxide (LDAO), 0.01% Triton X-100 and 2 U/μl murine RNase inhibitor (NEB) in nuclease-free water (HyClone). Reactions were initiated with 2 mM ATP and incubated for 0 to 24 h at 25°C in the dark. At selected incubation time points, the reactions were quenched using 2× RNA loading dye (95% formamide, 0.02% SDS, 0.01% xylene cyanol, 0.02% bromophenol blue, and 1 mM EDTA). Samples were loaded onto 17.5% urea–PAGE and run at 200 V for 1.5–2 h. The urea-PAGE was imaged on a Bio-Rad Gel-Doc XR+ Imager using the Fluorescence Blot ALEXA488 protocol to analyze the RNA polymerase activity. (T1 RNA sequence: 5′ FAM-UUUUUUUUUUAGGACCGCAUUGCACUCCGCGGUCCUAA 3′)

Trans-replication assay

U2OS human bone osteosarcoma cells (ATCC HTB-96) were maintained in Iscove's Modified Dulbecco's medium (Gibco) containing 10% Fetal Bovine Serum (FBS), 2 mM L-glutamine, 100 U/ml penicillin, and 0.1 mg/ml streptomycin at 37°C in a 5% CO2 atmosphere. Cells grown on 12-well plates were cotransfected with the following combination of plasmids (each used at 1 μg per transfection): HSPolI-FG-RRV, CMV-P123-RRV, CMV-ubi-nsP4-RRV or its mutant versions (Supplementary Data, Figure S8) for RRV trans-replicase and HSPolI-FG-SINV, CMV-P123-SINV, CMV-ubi-nsP4-SINV or its mutant versions for SINV trans-replicase. Transfections were performed using Lipofectamine LTX with PLUS reagent (Thermo Fisher Scientific) according to the manufacturer's instructions. Transfected cells were incubated at 37°C for 18 h. All transfections were performed in triplicate, and experiments were repeated at least twice. After incubation, cells were lysed, and the activities of firefly luciferase (Fluc) and Gaussia luciferase (Gluc) were measured using the Dual-Luciferase-Reporter assay (Promega). Fluc and Gluc activities measured for cells transfected using plasmids expressing active replicases or their mutants under investigation were normalized to those obtained for the corresponding control cells transfected with HSPolI-FG-RRV, CMV-P123-RRV and CMV-ubi-nsP4GAA-RRV or HSPolI-FG-SINV, CMV-P123-SINV and CMV-ubi-nsP4GAA-SINV.

RESULTS

Determination of the crystal structures of nsP4 RdRp from RRV and SINV.

We obtained soluble recombinant nsP4FL and RdRp proteins of both RRV and SINV (Figure 1A) using the E. coli expression system. The RdRp recombinant proteins, lacking the N-terminal 109 or 90 amino acids of nsP4 respectively, were purified as soluble active enzymes that could be crystallized. The size exclusion chromatography profiles of these proteins indicate they are monomeric in solution. Crystals of the RRV RdRp initially diffracted to a maximum resolution of 3.8–4.0 Å. To improve RRV RdRp crystals diffraction, we substituted hydrophilic glutamine residues (Q192 and Q196) that are presumably exposed at the surface of the folded RdRp protein with alanines. The obtained surface-entropy double mutant RdRpSER3 formed crystals with an increased size that allowed native X-ray diffraction intensities to be collected to a maximum resolution of 2.60 Å (Table 1). Because no convincing solution was found using molecular replacement (MR), l-selenomethionine (SeM)-labeled RRV RdRpSER3 was used for single-wavelength anomalous diffraction (SAD) phasing at the Se K edge. SAD data were collected at 2.83 Å resolution using a crystal isomorphous to native RdRpSER3 crystals (Table 1). A total of 419 out of 502 residues could be built, including eighteen α-helices, three 310 helices, and ten β-strands (Figures 1B, C and Table 1). Of note, no bound ligands, such as RNA, ATP, Mg2+, or trimethylamine N-oxide (TMAO), were identified in the RdRpSER3 crystal structure despite their presence under crystallization conditions (see Methods). An anomalous Fourier difference map using data collected at the Se K-edge allowed unambiguous mapping of eleven Se-Met residues (out of a total of thirteen) onto the structure, giving confidence for the correctness of the atomic model (Supplementary Figure S1). While we were making progress in building and refining the RRV RdRp crystal structure, we managed to obtain crystals of the wild-type (wt) SINV RdRp spanning amino acids 91–610 in a crystallization condition different from the RRV protein (see Materials and Methods). SINV RdRp crystals diffract to 1.9 Å resolution and belong to the P1 space group with two RdRp molecules per asymmetric unit (ASU) (Table 1 and Supplementary Figure S3). The SINV RdRp structure was readily determined by molecular replacement using the RRV RdRp structure as a search probe and refined to a Rfree value of 22.4% with good stereochemistry (Table 1, Figure 1 and Supplementary Figure S3).

Figure 1.

Figure 1.

Crystal structure of Alphavirus nsP4 RdRp domain. (A) Schematic presentation of RRV and SINV nsP4. The crystallized regions of nsP4: SINV RdRp (aa 91–610) and RRV RdRp (aa 109–611) are annotated. Their flexible region (aa 135–184; denoted as ‘flex’ in smudge green) occupying the central tunnel, and (sub)domain distribution is shown as follows: N-terminal domain (NTD) in white and RdRp domain consisting of fingers with colored fingertips (index in green, middle in orange, ring in yellow and pinky in pink), palm in gray and thumb in purple. The mutations targeting disulfide-bond forming cysteine (C164S) in SINV RdRp and surface-entropy site (labeled as SER3 to represent surface-entropy mutations of Q192A and Q196A) in RRV RdRp are marked. The locations of the A-G motifs are shown in colors (colored as A: teal; B: marine; C: red; D: brown; E: yellow; F: black; G: forest green). (B, C) The annotated crystal structures of RRV RdRp (RdRpSER3) in (B) and SINV RdRp (wt) in (C) are colored according to (A) for displaying their subdomain spatial distribution from the front right-hand view (left panel) and their motifs in the 180° rear view (right panel). Motif C that comprises conserved GDD active-site residues is highlighted in red sticks. Polyalanine considered in the flex region of RRV RdRpSER3 and built residues in the SINV RdRp flex region are both represented as a smudge-green ribbon which is traced by backbone Cα residues as a sphere. Additional features in SINV RdRp in (C): the cysteines (C164 in flex, C323 in pinky) are labeled as black sticks (left panel) while magnesium cations are colored navy blue. (D, E) The contrast of RdRp crystallographic configurations for (D) monomeric RRV and (E) dimeric SINV is presented in both crystal structure (top view) and simplified block (non-hydrophobic-interacting part colored as gray and zig-zag part symbolizes flex region) to present the hydrophobic interactions between the index finger (green region) and thumb (magenta region) on the left panel respectively. The hydrophobic interface of RRV and SINV between the N-terminal index finger and C-terminal thumb region is highlighted in the respective zoom-in box on the right panel of both (D) and (E). In these zoom-in boxes, all hydrophobic residues (hydrophobic interaction distance within 4Å) are shown as sticks and listed accordingly to the color of the index finger and thumb. The top list is the common residues shared between RdRp of RRV and SINV while the bottom list is SINV-RdRp-unique residues at the hydrophobic interface.

Structural comparison of the two alphavirus RdRps

Both RRV and SINV RdRps adopt the characteristic right-hand fold of viral RdRps, where fingers–palm–thumb subdomains are sequentially arranged from the amino-to-carboxylic ends. The overall structure of SINV RdRp is similar to RRV within an RMSD of 1.97 Å (417 Cα) (Table 1 and Supplementary Table S1; Figure 1 and Supplementary Figure S3). The core catalytic domain from both RRV and SINV RdRps including the palm region is well defined in continuous electron density and structurally well conserved between both RdRp proteins following superposition at RMSD of 1.38 Å (Figure 1B-E, Supplementary Figures S2D and S3 and Supplementary Table S1). Compared to RRV RdRp, the higher-resolution structure of the SINV RdRp reveals more details of the structure and also bound ligands (magnesium and glycerol). Two magnesium cations are coordinated by aspartates residues 392 and 466, with the latter being part of the active site ‘GDD’ motif (Supplementary Figure S3A).

A total of 83 residues are missing from the highly dynamic RRV RdRp crystal structure, while there are only 39 missing residues from the SINV RdRp structure. The 83 mobile residues in RRV RdRp belong to the N-terminal fingertips and C-terminal thumb subdomains: index finger (residues 135–184; this flexible region is denoted as ‘flex’ region hereafter) (Figures 1, 2A, Supplementary Figures S2C and S3C), pinky, and ring fingers (residues 210–213 and 292–307, respectively) (Figure 2A and Supplementary Figure S2A), and C-terminus of the protein (residues 575–578, 588–593 and 600–611) (Figure 2A and Supplementary Figure S2B). Conversely, missing residues in SINV RdRp are located in N-terminal fingertips (index residues 91–103, flex residues 167–178, pinky residues 206–210, and ring residues 288–307) and C-terminus residues (residues 603–611) (Figure 2). The relatively large number of missing residues of the RRV RdRp structure suggests a highly dynamic structure which is also detected in the HDX measurement (see below, Figure 2).

Figure 2.

Figure 2.

Structure dynamics of RRV nsP4 in solution. (A) The HDX profiles of the recombinant constructs of both RRV and SINV: nsP4FL (blue), RdRp (red), and NTD (green), in solution. On the graph, the deuteration incorporation levels expressed in percentage (D%) are plotted against the nsP4 residue numbers (RRV: aa 1–611, SINV: aa 1–610). The residues missing in the resolved structure are shaded as gray areas, and their positions are indicated. The longest unresolved region in RRV RdRp (residues 135–184) is designated as ‘flex’. (B, C) The color-coded ternary structures of Robetta homology models for RRV (B) nsP4FL, (C) RdRp and NTD, and for SINV nsP4FL and RdRp in (D), according to the representation of D% in rainbow heatmap to visualize structural flexibilities. These models were generated via Robetta (39) to fill the missing residues not covered in the current crystal structures, including the flexibly orientated NTD. The HDX-protection profiles of RRV nsP4 in (A) were color-mapped to these RRV homology models at single-amino-acid resolutions to guide the visual analyses and 3D comparisons between nsP4FL and RdRp and NTD. RRV nsP4FL is annotated with N/C-terminus, subdomains, motifs, and highlighted regions (Ring Motif F, Pinky, flex, Index-to-NTD linker, HLH substructure) as a reference to other models in (C) and (D). The missing residues are presented as Cα spheres in these homology models. The residues colored in black are those not covered/excluded by HDX data rendering. Rainbow bar as reference for (B),(C), and (D): D% is visualized in the rainbow heatmap spectrum, where the red end represents high D% (structured region) and the blue end represents low D% (disordered region).

There are noticeable structural differences in the fingers and thumb subdomains. The highly flexible fingers subdomain of RRV (subdivided into the index, pinky, ring, and middle finger subdomains) is significantly different from the corresponding SINV region with an RMSD of 1.90 Å (Figure 1BE, Supplementary Figure S3B–C and Supplementary Table S1). The RRV RdRp fingertips extend from the fingers towards the thumb to form the distinctive encircled ring conformation for RNA polymerization first seen in the HCV polymerase structure (41,42). To fold into this topology, the RRV nsP4 index finger extends its N-terminal helix horizontally towards the thumb to which it docks via the formation of a hydrophobic interface (Figure 1BE and Supplementary Figure S3B–C). The flexible SINV index fingertip appears detached from the core domain and crosses over to a neighboring RdRp molecule within the ASU, forming a dimeric configuration within the crystal (Figure 1C, E and Supplementary Figure S3B–C). This difference between RRV and SINV index fingers suggests that the N-terminus end is dynamic and potentially undergoes structural reorganization during RC assembly and RNA loading. However, this dimeric form of SINV nsP4 RdRp is likely a crystallographic artifact, as all reported alphaviral nsP4 proteins are monomeric in solution based on the present and past gel-filtration data (11–13).

While the flex region of the RRV RdRp is largely disordered, the equivalent region of SINV RdRp folds into the central RNA binding and catalysis tunnel (Figure 1, Supplementary Figures S2C and S3C). Unexpectedly, a disulfide bond between cysteine residues C164 and C323 is formed. The cysteine-to-serine residue mutation (C164S) at the less conserved residue C164 did not alter the enzyme's overall folding or catalytic ability so this disulfide bond appears not to be important for virus replication (Figure 1C, Supplementary Figures S3B and S7C–D). Other fingers such as middle, ring, and pinky fingers are more conserved structurally between the two RdRps. The middle finger of RRV RdRp maintains the classical RdRp loop substructure but with its loop positioned lower than that of SINV RdRp, in which the middle and index fingers interact (Supplementary Figure S3C). The ring and pinky fingers are the most conserved fingertips between the two RdRps. Importantly, the occupation of the central tunnel by residues from the finger domain would exclude RNA substrates (template or primer) binding in both RRV and SINV RdRps. Therefore, nsP4 must undergo a very large conformational rearrangement of the finger domain to accommodate the RNA template and to catalyze viral RNA polymerization. Overall, the fingers subdomain indicates flexibilities for conformational changes, likely for its enzymatic roles, nsPs interactions, and viral RNA binding.

Further, the thumb subdomain in the RdRps of SINV and RRV has limited structural similarity in their α-helical arrangement that superimposed with a RMSD of 2.05 Å (Figure 1DE and Supplementary Figure S3C and Supplementary Table S1). The loop between the first two α-helices (a1 and a2) is longer in RRV (residues 525–557) showing higher flexibilities than the equivalent of SINV with a shorter loop and more structured α-helices. The unbuilt regions in the C-terminus end of RRV RdRp (Supplementary Figure S2B and S3C) due to weak electron densities also contribute to the increased RMSD despite the spatial outlines of the α-helices in both RdRps of RRV and SINV remain inframe. The higher dynamics in RRV RdRp thumb region observed here is also supported by comparing the HDX profiles between RRV and SINV nsP4 (see below, Figure 2).

In summary, the conformations captured in the present crystal structures are likely to have been partly induced by crystal packing forces leading to shifts in the fingers and thumb subdomains compared to catalytically fully competent forms. Moreover, beside crystal packing forces, the structural context (e.g. interactions with other members of the alphavirus RC) is also likely to be important to determine the RdRp conformation, given the high plasticity of the isolated recombinant protein.

The structural dynamics of nsP4 and its domains in solution

To gain insights into the conformational dynamics of nsP4 and to probe intramolecular interactions in solution, we performed HDX-MS and NMR studies on recombinant nsP4 and proteins corresponding to its domains. HDX-MS is a proteomics approach to study protein dynamics by measuring the HDX rates between solvent D2O and the hydrogen of target protein. A high percentage of deuteration (D%; hydrogen exchanged to deuterium) reflects the high solvent accessibility and flexibility of the regions on RRV and SINV nsP4. To illustrate the structural dynamics in 3D, we plotted the HDX profiles in D% in heatmap format to these structural models for comparison (Figures 2 and Supplementary Figure S4 and S5). The missing residues in the crystal structures were modeled using Robetta ternary-structure prediction (39). Based on the high D% HDX profile observed, the NTD of RRV nsP4FL appears quite flexible in solution, except at residues 53–74, where a predicted helix-loop-helix (HLH) substructure is located (Figure 2BC). The linker between NTD and RdRp appears to have higher D% values, indicating increased dynamics in solution (Figure 2A). Given its low molecular mass of 12.6 kDa, the recombinant NTD is amenable to structural characterization via NMR in solution. The NTD selected from RRV contains 107 amino acids with an expected number of 99 cross-peaks (excluding Pro residues) in the 1H-15N-HSQC spectrum. From these, ∼60 cross-peaks could be observed, making backbone resonance assignments challenging (Supplementary Figure S6). The narrow dispersion of the cross-peaks in the spectrum suggests that the NTD contains flexible and helical regions that support the NTD homology model and are in agreement with the HDX profile (Figure 2A, C and Supplementary Figure S6).

The palm and thumb subdomains as well as the middle and ring fingers are largely protected from the deuterations and therefore constitute a buried structural core in the RRV nsP4. In contrast, the flexible N-terminal index and pinky fingers harboring the respective flex region and motif G (residues 215–221) and the thumb C-terminus region (residues 560–585) became highly deuterated in the HDX experiment (Figures 2AC and Supplementary Figure S4). Thus, these are less-protected regions, a finding that is in agreement with their high mobility found in the crystal structure of RRV RdRp (Figure 1B, Supplementary Figures S2 and S3C). The HDX protection profile of the ring fingers and the last 20 amino acids at the protein C-terminus is in contrast with the RRV RdRp crystal structure, where these regions had weak or missing electron densities (Supplementary Figures S2B and S3C).

Overall, the similar HDX profiles of the NTD protein by itself and within the nsP4 from both SINV and RRV indicate that there is no strong intramolecular interaction between the NTD and the RdRp region of the nsP4.

Comparison of the alphavirus nsP4 structures with other viral RdRps

The polymerase domain of nsP4 of RRV and SINV adopts a conserved tertiary structure harboring features common to viral RdRps. A structural homology search of the Protein Data Bank (PDB) database (43) revealed that the closest structural homologs of both RRV and SINV RdRps are RdRps from the human Norwalk virus from the Caliciviridae family (NV; PDB 5TSN) (44), enterovirus-71 (EV71, PDB 6KWQ) from the Piconaviridae family (45) and classical swine fever virus (CSFV, PDB 5Y6R) as well as bovine viral diarrhea virus (BVDV) from the Flaviviridae family (46) (Figure 3). In RRV nsP4, the conserved polymerase motifs forming a fully encircled right-hand fold are arranged in the primary structure as G-F-A-B-C-D-E, showing evolutionary conservation with the Picornaviridae (but not to the birnavirus) RNA polymerase (42) (Figure 1 and Figure 3). The SINV nsP4 did not form the encircled right-hand fold due to the index finger interacting with the neighbor crystallographic ASU but this intermolecular contact does not affect the overall structure and their motif arrangements. The core polymerase domain at the palm of alphaviral RdRp contains motifs A, B, C and E that are structurally well-conserved compared to known positive-strand viral RdRps (Figure 3). The Phe residue on motif E of alphaviral nsP4 points towards the GDD residues in the β-turn of motif C, as in other viral RdRps (Figure 3; motif E in yellow and motif C in red). In contrast, motif D is likely to constitute an alphavirus distinctive feature that separates it from other viral RdRps. Residues from motif D of nsP4 adopt an elongated β-strand conformation, which forms a Greek-key-like β-sheet motif, next to motifs A–C (Figure 3A). The least conserved motifs lie in the pinky finger (motif G) and ring finger (motif F) subdomains which are known for poor sequence homology in viral RdRps (Figure 3B). The fingertips are typically folded in β-sheets extending towards the thumb like a lid over the palm. The thumb subdomain of nsP4 shows poor structural homology with RdRp homologs, and its C-terminal region is folded in a different orientation without β-sheet formation. These structural differences make the alphavirus nsP4 an α-helix rich structure compared to its closest viral RdRp homologs.

Figure 3.

Figure 3.

Comparison of the structure of alphavirus nsP4 with those of other viral RdRps. (A) The RRV RdRpSER3 (RRV, PDB 7F0S) and SINV RdRp (SINV, PDB 7VB4), and their shared top structural homologs, including Norwalk virus (NV, PDB 5TSN), enterovirus-71 (EV71, PDB 6KWQ), and classical swine fever virus (CSFV, PDB 5Y6R). The motifs of these PDB structures were structurally aligned for comparison. The summarized motif and subdomain arrangement at the center are colored the same as Figure 1. The conserved aspartate residues in motifs C and A are displayed as peptide carbon backbone Cα spheres, while the conserved phenylalanine residue in motif E is shown in stick format. (B) The conservation of RdRps of +RNA virus based on primary sequence and secondary structure presented through Espript*. The sequence and secondary structure of RRV nsP4 (genus Alphavirus, family Togaviridae) were compared to those of different alphaviruses (RRV-to-SPDV) and members of the Caliviridae (NV), Flaviviridae (CSFV-to-HCV), and Picornaviridae (PV-to-COXV) families. The secondary structure is labeled on top of the alignment and shows only α-helices (spring), β-strands (arrow), and β-turns (TT). Motifs A-G are boxed and colored according to (A). [*Espript coloring: The red-boxed columns are the most conserved, while the blue-boxed columns classify amino acid conservation based on their similar functional groups. *Espript label: Togaviridae - Ross River virus (RRV), Getah virus (GETV), Sindbis virus (SINV), chikungunya virus (CHIKV), o’nyong’nyong virus (ONNV), Venezuela equine encephalitis virus (VEEV), Barmah forest virus (BFV), and salmon pancreas disease virus (SPDV); Caliviridae - Norwalk virus (NV); Flaviviridae - classical swine fever virus (CSFV), bovine viral diarrhea virus (BVDV), and hepatitis C virus (HCV); Picornaviridae - poliovirus (PV), encephalomyocarditis virus (EMCV), enterovirus-71 (EV71), and coxsackievirus (COXV).

Both RRV and SINV nsP4 display RNA polymerase activity

A previously described fluorescent RNA-based assay (13) was optimized to improve robustness and sensitivity and used for the analysis of the RNA polymerase activities of the obtained recombinant proteins. In this primer extension assay, RNA intermediates (RIs) result from the incorporation of 1–7 adenosine nucleotide residues (A residues), while the RNA product (RP) is the fully extended RNA obtained by incorporation of 8 A residues on the T1 RNA template (Figure 4A). Reactions were run for 24 h with time-point sampling at 0, 2, 4, 6, 8, 10 and 24 h.

Figure 4.

Figure 4.

RNA polymerase activities of nsP4FL, RdRp, and RdRpSER3. (A) Schematic presentation of the discontinuous time point RNA polymerase reaction. Extension of fluorescently labeled (green star on the 5′ end) RNA template T1 by recombinant nsP4 protein (gray particle) results in ATP incorporations as A-residue (orange circle) into the RNA intermediates (RIs; a snapshot of one possible RI; right panel) and RNA product (RP; right panel), both of which are shown below with expected band migration with each band corresponding to the number of A-residue added to T1 on a urea-PAGE gel. (B, C) The biochemical characterization of recombinant nsP4 proteins using discontinuous time-point gel-based polymerase assay was performed for 0–24 h. The denaturing urea-PAGE gels are annotated with T1, time-point (t) in hour unit (h), products made by dengue virus RdRp used as a positive control (named as D), and the tested recombinant nsP4 proteins from RRV (nsP4FL, polymerase weakened mutant of nsP4FL GNN, RdRp, RdRpSER3, and NTD) and SINV (nsPFL and RdRp, in purple text). For the polymerase reactions between RRV RdRp or RdRpSER3 and NTD, the stoichiometric molar ratio is shown in (C), that is marked with lane number for reference. The incorporation of each base is marked with a short white line shown between the last 2 time points.

The recombinant nsP4FL, RdRp, and RdRp mutants from RRV and SINV were enzymatically active and capable of extending the T1 template (Figure 4 and Supplementary Figure S7). The SINV nsP4FL and RdRp had weaker activities than those of RRV. The RIs were observed from 2 h onwards for all of these enzymes. The SINV nsP4FL is weaker enzyme than the corresponding RdRp protein or their counterparts of RRV. The difference between two proteins was probably caused by the high degradation rate of RNA template or RIs by the SINV nsP4 proteins (including their mutants) during the assay (Figure 4B right panel and Supplementary Figure S7C-D). Therefore, the following RNA polymerase assay was focused on RRV nsP4 recombinant proteins that had minimal RNA degradation.

For all RRV nsP4 recombinant proteins, their polymerase reactions remained largely stagnant for over 10 h and subsequently accelerated to form a mixture of RP and RIs at 24 h (Figure 4B left panel and Supplementary Figure S7A, B). After 24 h, nsP4FL ranks as the most efficient for the synthesis of RP and RIs with 6 or 7 A incorporations, followed by RdRpSER3 and RdRp. The conclusion that the activity of the surface-engineered mutant RdRpSER3 exceeded that of its wt counterpart was based on the observation that for the mutant enzyme, no RIs of less than 4 A added was detected at 24 h, while for its wt RdRp, a faint ladder consisting of the template and RIs with 1–3-A added was still visible even at this time point (Figure 4B left panel). As expected, polymerase-weakened nsP4FL protein with GNN mutation at its motif C active site formed much smaller amounts of RIs, and the level of RP remained below the detection limit even at the 24 h time point (13).

To examine whether there is a functional interaction between the NTD and the RdRp domains, we measured the polymerase activity of RdRp in the presence of the recombinant NTD protein at 1:1, 1:2 and 1:5 molar ratios. No primer extension was observed for the recombinant protein corresponding to the RRV nsP4 NTD (Figure 4C: lane 11–13). In the presence of the NTD protein, no increase in polymerase activity of RdRpSER3 or RdRp was observed at any time point (Figure 4C). Instead, an inhibitory effect of the NTD on the polymerase activity was observed; more evidently for the RdRp than for RdRpSER3 (Figure 4C: lane 3, 6, 9, 17, 20, 23). Thus, the enhanced activity observed for nsP4FL could not be reproduced by mixing NTD and RdRp in trans.

Activities of RRV replicase are affected by structure-guided mutations in nsP4

Having determined the crystal structure of RRV RdRp, we next analyzed the functional importance of several of its key structural features. As the introduction of unfavorable mutations into an infectious clone tends to result in no data (lethal phenotype) or the rapid appearance of revertants or compensating mutations (17,24), we instead used a novel two-component trans-replication system (Figure 5A), which has extremely high sensitivity (25). Briefly, the system is based on the ability of separately expressed P123 and nsP4 of alphaviruses to form a functional RNA replicase. Coexpression of P123 and nsP4 with replication-competent template RNA also triggers the formation of RCs (spherules) similar to those formed in virus-infected cells (47), i.e., represents a relevant tool to study alphavirus RNA replication. Plasmids for the expression of RRV P123 and nsP4 as well as the corresponding template RNA have been described (25). The template RNA contains sequences coding for the following two reporters: Fluc under the genomic promoter and Gluc under the SG promoter (Figure 5A). For simplicity, the synthesis of full-length RNA serving as the template for Fluc expression is hereafter termed ‘replication’, and the synthesis of RNA serving as the template for Gluc expression is termed ‘transcription’. The efficiency of replication and transcription was estimated by the variations in corresponding reporter expression (24).

Figure 5.

Figure 5.

Effects of mutations in nsP4 on the activities of RRV and SINV trans-replicases. (A) The trans-replication system is composed of two plasmids (CMV-P123 and CMV-ubi-nsP4) for the expression of RRV and SINV replicase and a plasmid HSPolI-FG for the production of replication-competent RNA containing Fluc and Gluc reporters. In this assay, the increase in expression levels of Fluc and Gluc represented the production of full-length genomic RNA and SG RNA, respectively. HSPolI, a truncated promoter (residues −211 to −1) for human RNA polymerase I; 5′ UTR, full length 5′ UTR of an alphavirus; nsP1 N*, region encoding the N-terminal 77 (RRV) or 114 (SINV) amino acid residues of nsP1; SG, SG promoter spanning (with respect to termination codon of nsP4) from position –79 to the end of the intergenic region; 3′ UTR, truncated (last 110 residues) 3′ UTR of an alphavirus; HDV RZ, antisense strand ribozyme of hepatitis delta virus; MmTer, a terminator for RNA polymerase I in mice; CMV, an immediate early promoter of human cytomegalovirus; LI, leader sequence of the herpes simplex virus thymidine kinase gene with artificial intron; SV40Ter, simian virus 40 late polyadenylation region; Ubi, sequence encoding for human ubiquitin. Positions of mutations in nsP4 (residues numbered as in nsP4 of RRV) are indicated. (B, C) U2OS cells in 12-well plates were cotransfected with matching combinations of CMV-P123 and CMV-ubi-nsP4 or its mutant versions (in a 1:1 molar ratio) and the corresponding HSPolI-FG plasmids. As negative controls, CMV-ubi-nsP4GAA, which encodes nsP4 lacking RNA polymerase activity, was used instead of CMV-ubi-nsP4. Cells were incubated at 37°C and lysed 18 h post-transfection. Fluc and Gluc activities produced by wt and mutant replicases were normalized to the nsP4GAA controls. Values obtained for nsP4GAA controls were taken as 1. Means + standard deviation of three independent experiments are shown. (B) Effects of introduced mutations on RRV replicase activity. (C) Effects of introduced mutations on SINV replicase activity.

Mutations were introduced into nsP4 of RRV (Supplementary Figure S8), and the activities of the replicates harboring mutant nsP4 proteins were compared with those of the wt replicase. Substitution of Arg63 and Arg67 with Ala residues did not result in significant changes in RNA replication, while transcription was slightly but significantly (P < 0.001) reduced (Figure 5B and Supplementary Figure S8). In contrast, only minimal transcription activity was detected for replicase harboring the Strep-FLAG tag after residue 92. The same tag inserted after residue 109 had a somewhat smaller impact; however, it still reduced the transcription level ∼230-fold when compared with wt replicase (P < 0.0001) (Figure 5B and Supplementary Figure S8). Thus, the replacement of charged residues with Ala residues in the NTD of RRV nsP4 was well-tolerated compared to the insertion of peptide tags.

Replacement of Cys125 with Val residue had no significant effect on the replication and transcription activities of the RRV RNA replicase. However, replacement of the same residue with Asp resulted in an ∼60-fold decrease in replication and an ∼14-fold decrease in transcription (P < 0.0001 in both cases). Analysis of the nsP4 structure revealed that residues 136–185 of RRV nsP4 form a flexible loop. Replacement of this region with a Strep-FLAG tag resulted in the complete loss of RNA replicase activity, indicating a crucial function played by this region. This conclusion was corroborated by substituting Asp143 and Asp150 in this loop with alanine residues, which resulted in a large ∼43-fold decrease in replication and an even larger ∼310-fold decrease in transcription (p < 0.0001 in both cases). Interestingly, replacing Asp146 and Asp153 residues with Ala had a smaller impact: replication was only reduced ∼5-fold (P < 0.0001) (Figure 5B and Supplementary Figure S8). Likewise, substituting charged amino acid residues with Ala in the thumb subdomain resulted in a decrease in RNA replicase activity: replacement of Asp543 and Asp545 caused an ∼25-fold decrease in replication and an ∼35-fold decrease in transcription (P < 0.0001). The effect of substitution of the Arg546 and Arg548 residues caused a less prominent ∼11-fold decrease in replication (P < 0.0001), while the impact on transcription was more pronounced (∼120-fold, P < 0.0001) (Figure 5B and Supplementary Figure S8). Finally, the introduction of a stop codon after residue 601, resulting in the loss of the ten C-terminal residues of nsP4 that could not be resolved in the present crystal structure, resulted in almost complete loss of RNA replicase activity. The insertion of a Strep-FLAG tag after residue 601 had a similar effect (Figure 5B and Supplementary Figure S8). Taken together, these data indicate that the presence of the ten C-terminal residues and their location relative to the rest of nsP4 is crucial for RRV nsP4 activity.

DISCUSSION

The nsP4 polymerase crystal structures from two alphaviruses feature a conserved rigid fold in the palm subdomain but rather less so in the fingers and thumb subdomains (Figure 1 and Supplementary Figure S3C). This observation is unlikely due to crystal lattice packing forces as the two proteins were crystallized in different conditions and packed in different crystal lattices (Table 1). The structures of alphavirus RdRps are likely the result of divergent evolution leading to the peculiar folds of the fingers and thumb at odds with the non-alphaviral RdRps. One of the most noticeable changes from both SINV and RRV RdRp crystal structures is the elongated β-strand adopted by motif D next to the catalytic GDD motif C located in the palm subdomain (Figure 1 and Figure 3). This structure contrasts with the canonical β-strand-loop motif found in other viral RdRps (Figure 3A). Having a reduced flexibility, this elongated motif D could inhibit the NTP entry in the alphaviral RdRp apoenzyme state (42). Indeed, we observed no bound ATP or dATP but only magnesium ions in the central catalytic cleft above the palm in SINV RdRp despite their presence in the crystallization buffer (Figure 1 and Supplementary Figure S3A). A high number of residues in the RRV RdRp flex region are disordered compared to SINV RdRp, possibly due to the absence of a disulfide bond and the intrinsic plasticity of this region. Overall, structural analysis of the two RdRp structures points to a partially unfolded, less active pre-initiation state for alphavirus nsP4, before RC assembly and viral RNA replication and transcription. It is expected that a series of extensive structural rearrangement occurs throughout the multistage viral RNA biogenesis process, when nsP4 dynamically interacts with other nsPs and viral RNAs (25,42,48–50).

The combination of HDX-MS, NMR, and Robetta homology modeling provided insights into the structural dynamics of nsP4 of both SINV and RRV in solution, especially for the NTD region. The homology model of RRV NTD features a mixture of disordered regions and a predicted helical secondary structure consisting of an HLH substructure (aa 53–74) (Figure 2B-C). Despite the challenging assignments of the cross-peaks to the RRV NTD residues in the incomplete NMR spectra, these missing cross-peaks provided clues that its NTD may have active conformational exchanges (Supplementary Figure S6). The HDX-protected region at the HLH substructure and the NMR-detected cross-peaks both suggest in the presence of secondary structure elements in the RRV NTD, which agree well with the NTD homology model. The HDX-MS analysis also indicates that the NTD does not interact with the catalytic RdRp domain of RRV and SINV nsP4 (Figure 2A and Supplementary Figure S4 and S5). The NTD of the viral RdRp is an adaptive domain that often undergoes divergent structural evolution to perform novel functions in various virus families or genera to target different hosts (48).

Analyses of RNA polymerase activity provide clues about the function of the nsP4 NTD. NTD is dispensable for viral RNA polymerase activity but appears to enhance de novo RNA synthesis for RRV nsP4FL (Figure 4B left panel). The highest accumulation of fully extended RPs was observed when the NTD and RdRp domains were present in cis (i.e. in the context of the RRV nsP4FL). The NTD could also constitute an additional layer to regulate alphavirus RNA polymerase activity in a way reminiscent of how the fidelity of BVDV RdRp is modulated during RNA synthesis (48). Furthermore, the NTD from alphavirus nsP4 could act as a chaperone required to initiate interaction with nsP1-3. Mutual stabilization and organisation of nsP1-4 into assemblies of active RC conformers would result in the formation of the spherules for effective viral RNA synthesis. The SINV nsP4 was also previously shown in (23) to activate its de novo viral RNA synthesis via interactions with nsP1-3 in an in vitro reconstitutions. This may explain why the isolated RRV and SINV nsP4 proteins only showed weak and inefficient polymerase activity compared to the dengue virus RdRp that was used as positive control (Figure 4, marked as D). The disulfide bond at C164-C323 in SINV nsP4, which is suspected to cause misfolding of fingers to block the RNA binding tunnel could also have further dampen the SINV polymerase activity with a greater degree of RNA degradation (Figures 1, 4 and Supplementary Figure S7C). It is possible that actively elongating RRV nsP4 recombinant proteins (Figure 4) only constitute a the minor population, just like the CHIKV nsP4 reported in (13), that is stabilized with the help of cations like manganese and magnesium and detergents like LDAO and Triton X-100. In the absence of NTD, recombinant RdRpSER3 had slightly higher RNA polymerase activity than its wt counterpart, possibly due to its lowered surface entropy preventing precipitation and the retention of a higher amount of active proteins.

To characterize the function of selected regions of the alphavirus RdRp for RNA replication and transcription in the cellular environment, a biologically relevant trans-replicase system was used (Figure 5A). The importance of the NTD for viral RNA replication was reflected by the strong negative impact caused by the Strep-FLAG tag inserted at various sites within the first 110 residues of nsP4 (Figure 5B-C). The flex region in the index fingertip and the thumb are equally critical for viral RNA replication and transcription. This is in agreement with their known function for RNA binding and stabilization of the elongation complex formation in poliovirus RdRp (51). We found that several charged residues in the flex region (Asp143 and Asp150) in the fingers, and at the helix tip of the thumb (Arg543 and Arg545) are essential (Figure 5B-C), presumably by providing electrostatic interactions either within the RdRp or with an RNA template. Finally, the last ten thumb residues protected from the solvent exchange in the HDX experiment were shown to be indispensable for RNA replicase activity (Figures 2 and 5B-C). These observations are consistent with the presence of an intramolecular interface within alphavirus nsP4 critical for viral RNA replication. In this respect, the C-terminus of the alphavirus RdRp could constitute a regulatory motif similar to that found in the flavivirus RdRp (42).

Although nsP4 is the most conserved nsP, several known properties of recombinant nsP4 of SINV (11,23) indicate higher activity than what was observed for the RRV counterpart. We have also found that nsP4 of SINV and RRV is not fully functionally interchangeable: RRV P123 can form active RNA replicase with nsP4 either from RRV or SINV, while SINV P123 forms active replicase only with its cognate nsP4 (25). Based on sequence similarities, alphaviruses are divided into different complexes and clades (52). RRV is a member of the Semliki Forest virus complex, while SINV, the type member of the genus, belongs to the western equine encephalitis complex, making nsP4 of SINV a good candidate for validating to what extent findings made for nsP4 of RRV are applicable to nsP4 proteins of alphaviruses in general. Matching experiments performed using SINV trans-replicase provided results very similar to that of RRV (Figure 5C). The exceptions were mutations introduced in the NTD, which is the least conserved region of nsP4 (Supplementary Figure S8). The negative impact of the Strep-FLAG tag insertion was much smaller in the case of SINV nsP4: if the tag was introduced after residue 108, this insertion resulted in a relatively minor ∼7-fold reduction in replication and a ∼5-fold reduction in transcription. From a practical point of view, inserting an affinity tag at this position could be used for affinity pull-down of a functional SINV nsP4 and complexes formed by this protein. These findings are also consistent with our data, indicating that the NTD of nsP4 is flexible and not directly involved in RNA synthesis. In contrast, all mutations introduced into the RdRp domain of SINV nsP4 caused effects very similar to those observed for RRV. The largest difference between the two viruses was observed for the substitution of the Asp145 and Asp152 residues with Ala residues, which resulted in prominent reductions in both replication and transcription for SINV (by factors of ∼38-fold and ∼15-fold, respectively); these effects were significantly larger than those observed for corresponding mutations in RRV nsP4.

Overall, our data indicate that there is structural and functional conservation in the RdRp domain even for distantly related alphaviruses. Therefore, it can be assumed that structural information for RRV and SINV RdRp presented here is highly applicable for other alphaviruses and can be used for the rational design of drugs targeting many, or possibly all, medically important alphaviruses.

DATA AVAILABILITY

The atomic coordinates and structure factors have been deposited with the Protein Data bank under accession codes 7F0S, 7VB4 and 7VW5.

Supplementary Material

gkab1302_Supplemental_File

ACKNOWLEDGEMENTS

We would like to thank the Protein Production Platform from NTU (PPP, NTU), NTU Institute of Structural Biology (NISB, NTU), Lee Kong Chian School of Medicine for providing the facilities and the general support from members of the D.L. and A.M. laboratories to complete our research. We thank the technical services provided by the ‘Synchrotron Radiation Protein Crystallography Facility of the National Core Facility Program for Biotechnology, Ministry of Science and Technology’ and the ‘National Synchrotron Radiation Research Center’, a national user facility supported by the Ministry of Science and Technology of Taiwan, ROC – for data collection at Beamline NSRRC TPS 05A. We would also like to acknowledge the technical support from Japan SPring-8 Center RIKEN Targeted Proteins Beamline BL32XU and Australian Synchrotron, which is part of ANSTO, for the access to MX1/MX2 Beamlines (MX2 for the utilization of the Australian Cancer Research Foundation detector). Lastly, we thank the Paull Scherrer Institute (Villigen, Switzerland) for the provision of synchrotron beamtime at beamline X06DA of the SLS for data collection.

Author contributions: A.M. and D.L. designed research; Y.B.T., L.S.L., X.L., Y.-S.L., C.K. and J.Z. performed research; Y.B.T., L.S.L., X.L., Y.-S.L., C.K., J.L., J.Z., A.M. and D.L. analyzed data; and Y.B.T., L.S.L., A.M. and D.L. prepared the manuscript with input from all authors.

Contributor Information

Yaw Bia Tan, Lee Kong Chian School of Medicine, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921; NTU Institute of Structural Biology, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921.

Laura Sandra Lello, University of Tartu, Institute of Technology, Nooruse 1, 50411 Tartu, Estonia.

Xin Liu, Shanghai Institute of Materia Medica, China Academy of Sciences, 555 Zu Chong Zhi Road, Zhang Jiang Hi-Tech Park, Pudong, Shanghai, China.

Yee-Song Law, Lee Kong Chian School of Medicine, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921; NTU Institute of Structural Biology, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921.

Congbao Kang, Experimental Drug Development Centre, Agency for Science, Technology and Research (A*STAR), 10 Biopolis Rd, #05-01/06 Chromos, Singapore, 138670.

Julien Lescar, NTU Institute of Structural Biology, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921; School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore.

Jie Zheng, Shanghai Institute of Materia Medica, China Academy of Sciences, 555 Zu Chong Zhi Road, Zhang Jiang Hi-Tech Park, Pudong, Shanghai, China.

Andres Merits, University of Tartu, Institute of Technology, Nooruse 1, 50411 Tartu, Estonia.

Dahai Luo, Lee Kong Chian School of Medicine, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921; NTU Institute of Structural Biology, Nanyang Technological University, 59 Nanyang Drive, Singapore 636921; School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Singapore Ministry of Education under its Singapore Ministry of Education Academic Research Fund Tier 2 [MOE-T2EP30220-0009, MOE2016T22097]; Estonian Research Council [PRG1154]. Funding for open access charge: Ministry of Education of Singapore [MOE2016T22097].

Conflict of interest statement. None declared.

REFERENCES

  • 1. Wilder-Smith A., Gubler D.J., Weaver S.C., Monath T.P., Heymann D.L., Scott T.W.. Epidemic arboviral diseases: priorities for research and public health. Lancet Infect. Dis. 2017; 17:e101. [DOI] [PubMed] [Google Scholar]
  • 2. Nsoesie E.O., Kraemer M.U., Golding N., Pigott D.M., Brady O.J., Moyes C.L., Johansson M.A., Gething P.W., Velayudhan R., Khan K.. Global distribution and environmental suitability for chikungunya virus, 1952 to 2015. Eurosurveillance. 2016; 21:30234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Lessler J., Chaisson L.H., Kucirka L.M., Bi Q., Grantz K., Salje H., Carcelen A.C., Ott C.T., Sheffield J.S., Ferguson N.M.. Assessing the global threat from zika virus. Science. 2016; 353:aaf8160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Jain J., Mathur K., Shrinet J., Bhatnagar R.K., Sunil S.. Analysis of coevolution in nonstructural proteins of chikungunya virus. Virol J. 2016; 13:86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Forrester N.L., Palacios G., Tesh R.B., Savji N., Guzman H., Sherman M., Weaver S.C., Lipkin W.I.. Genome-scale phylogeny of the alphavirus genus suggests a marine origin. J. Virol. 2012; 86:2729–2738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. de Andrade D.C., Jean S., Clavelou P., Dallel R., Bouhassira D. Chronic pain associated with the chikungunya fever: long lasting burden of an acute illness. BMC Infect. Dis. 2010; 10:31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Wada Y., Orba Y., Sasaki M., Kobayashi S., Carr M.J., Nobori H., Sato A., Hall W.W., Sawa H.. Discovery of a novel antiviral agent targeting the nonstructural protein 4 (nsP4) of chikungunya virus. Virology. 2017; 505:102–112. [DOI] [PubMed] [Google Scholar]
  • 8. Strauss J.H., Strauss E.G.. The alphaviruses: gene expression, replication, and evolution. Microbiol. Rev. 1994; 58:491–562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Zhang R., Kim A.S., Fox J.M., Nair S., Basore K., Klimstra W.B., Rimkunas R., Fong R.H., Lin H., Poddar S.et al.. Mxra8 is a receptor for multiple arthritogenic alphaviruses. Nature. 2018; 557:570–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Rose P.P., Hanna S.L., Spiridigliozzi A., Wannissorn N., Beiting D.P., Ross S.R., Hardy R.W., Bambina S.A., Heise M.T., Cherry S.. Natural resistance-associated macrophage protein is a cellular receptor for sindbis virus in both insect and mammalian hosts. Cell Host Microbe. 2011; 10:97–104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Tomar S., Hardy R.W., Smith J.L., Kuhn R.J.. Catalytic core of alphavirus nonstructural protein nsP4 possesses terminal adenylyltransferase activity. J. Virol. 2006; 80:9962–9969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Rubach J.K., Wasik B.R., Rupp J.C., Kuhn R.J., Hardy R.W., Smith J.L.. Characterization of purified sindbis virus nsP4 RNA-dependent RNA polymerase activity in vitro. Virology. 2009; 384:201–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Chen M.W., Tan Y.B., Zheng J., Zhao Y., Lim B.T., Cornvik T., Lescar J., Ng L.F.P., Luo D. Chikungunya virus nsP4 RNA-dependent RNA polymerase core domain displays detergent-sensitive primer extension and terminal adenylyltransferase activities. Antiviral Res. 2017; 143:38–47. [DOI] [PubMed] [Google Scholar]
  • 14. Götte B., Liu L., McInerney G.M.. The enigmatic alphavirus non-structural protein 3 (nsP3) revealing its secrets at last. Viruses. 2018; 10:105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Foy N.J., Akhrymuk M., Akhrymuk I., Atasheva S., Bopda-Waffo A., Frolov I., Frolova E.I.. Hypervariable domains of nsP3 proteins of new world and old world alphaviruses mediate formation of distinct, virus-specific protein complexes. J. Virol. 2013; 87:1997–2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Law Y.-S., Wang S., Tan Y.B., Shih O., Utt A., Goh W.Y., Lian B.-J., Chen M.W., Jeng U.-S., Merits A.et al.. Interdomain flexibility of chikungunya virus nsP2 helicase-protease differentially influences viral RNA replication and infectivity. J. Virol. 2021; 95:e01470-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Law Y.S., Utt A., Tan Y.B., Zheng J., Wang S., Chen M.W., Griffin P.R., Merits A., Luo D. Structural insights into RNA recognition by the chikungunya virus nsP2 helicase. Proc. Natl. Acad. Sci. U.S.A. 2019; 116:9558–9567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Karpe Y.A., Aher P.P., Lole K.S.. NTPase and 5′-RNA triphosphatase activities of chikungunya virus nsP2 protein. PLoS One. 2011; 6:e22336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Li C., Guillen J., Rabah N., Blanjoie A., Debart F., Vasseur J.J., Canard B., Decroly E., Coutard B.. mRNA capping by venezuelan equine encephalitis virus nsP1: functional characterization and implications for antiviral research. J. Virol. 2015; 89:8292–8303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Jones R., Bragagnolo G., Arranz R., Reguera J.. Capping pores of alphavirus nsP1 gate membranous viral replication factories. Nature. 2021; 589:615–619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Gottipati K., Woodson M., Choi K.H.. Membrane binding and rearrangement by chikungunya virus capping enzyme nsP1. Virology. 2020; 544:31–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Zhang K., Law Y.-S., Law M.C.Y., Tan Y.B., Wirawan M., Luo D. Structural insights into viral RNA capping and plasma membrane targeting by chikungunya virus nonstructural protein 1. Cell Host Microbe. 2021; 29:757–764. [DOI] [PubMed] [Google Scholar]
  • 23. Rubach J.K., Wasik B.R., Rupp J.C., Kuhn R.J., Hardy R.W., Smith J.L.. Characterization of purified sindbis virus nsP4 RNA-dependent RNA polymerase activity in vitro. Virology. 2009; 384:201–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Lello L.S., Utt A., Bartholomeeusen K., Wang S., Rausalu K., Kendall C., Coppens S., Fragkoudis R., Tuplin A., Alphey L.et al.. Cross-utilisation of template RNAs by alphavirus replicases. PLoS Pathog. 2020; 16:e1008825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Lello L.S., Bartholomeeusen K., Wang S., Coppens S., Fragkoudis R., Alphey L., Arien K.K., Merits A., Utt A.. nsP4 is a major determinant of alphavirus replicase activity and template selectivity. J. Virol. 2021; 95:e0035521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Ramakrishnan V., Finch J., Graziano V., Lee P., Sweet R.. Crystal structure of globular domain of histone H5 and its implications for nucleosome binding. Nature. 1993; 362:219–223. [DOI] [PubMed] [Google Scholar]
  • 27. Hirata K., Yamashita K., Ueno G., Kawano Y., Hasegawa K., Kumasaka T., Yamamoto M.. ZOO: an automatic data-collection system for high-throughput structure analysis in protein microcrystallography. Acta Crystallogr. D: Struct. Biol. 2019; 75:138–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Yamashita K., Hirata K., Yamamoto M.. KAMO: towards automated data processing for microcrystals. Acta Crystallogr. D: Struct. Biol. 2018; 74:441–449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Aragao D., Aishima J., Cherukuvada H., Clarken R., Clift M., Cowieson N.P., Ericsson D.J., Gee C.L., Macedo S., Mudie N.et al.. MX2: a high-flux undulator microfocus beamline serving both the chemical and macromolecular crystallography communities at the australian synchrotron. J. Synchrotron Radiat. 2018; 25:885–891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Cowieson N.P., Aragao D., Clift M., Ericsson D.J., Gee C., Harrop S.J., Mudie N., Panjikar S., Price J.R., Riboldi-Tunnicliffe A.et al.. MX1: a bending-magnet crystallography beamline serving both chemical and macromolecular crystallography communities at the australian synchrotron. J. Synchrotron Radiat. 2015; 22:187–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Kabsch W. Xds. Acta Crystallogr. Sect. D. 2010; 66:125–132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Winn M.D., Ballard C.C., Cowtan K.D., Dodson E.J., Emsley P., Evans P.R., Keegan R.M., Krissinel E.B., Leslie A.G., McCoy A.. Overview of the CCP4 suite and current developments. Acta Crystallogr. Sect. D. 2011; 67:235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Liebschner D., Afonine P.V., Baker M.L., Bunkóczi G., Chen V.B., Croll T.I., Hintze B., Hung L.-W., Jain S., McCoy A.J.. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in phenix. Acta Crystallogr. D: Struct. Biol. 2019; 75:861–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Emsley P., Lohkamp B., Scott W.G., Cowtan K.. Features and development of coot. Acta Crystallogr. Sect. D. 2010; 66:486–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Pascal B.D., Willis S., Lauer J.L., Landgraf R.R., West G.M., Marciano D., Novick S., Goswami D., Chalmers M.J., Griffin P.R.. HDX workbench: software for the analysis of H/D exchange MS data. J. Am. Soc. Mass Spectrom. 2012; 23:1512–1521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Delaglio F., Grzesiek S., Vuister G.W., Zhu G., Pfeifer J., Bax A.. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995; 6:277–293. [DOI] [PubMed] [Google Scholar]
  • 37. Johnson B.A. Using NMRView to visualize and analyze the NMR spectra of macromolecules. Methods Mol. Biol. 2004; 278:313–352. [DOI] [PubMed] [Google Scholar]
  • 38. Robert X., Gouet P.. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 2014; 42:W320–W324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Yang J., Anishchenko I., Park H., Peng Z., Ovchinnikov S., Baker D. Improved protein structure prediction using predicted interresidue orientations. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:1496–1503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Schrödinger L. 2020; The PyMOL Molecular Graphics System, Version 2.4.
  • 41. Lesburg C.A., Cable M.B., Ferrari E., Hong Z., Mannarino A.F., Weber P.C.. Crystal structure of the RNA-dependent RNA polymerase from hepatitis c virus reveals a fully encircled active site. Nat. Struct. Biol. 1999; 6:937–943. [DOI] [PubMed] [Google Scholar]
  • 42. Venkataraman S., Prasad B., Selvarajan R.. RNA dependent RNA polymerases: insights from structure, function and evolution. Viruses. 2018; 10:76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Holm L. Benchmarking fold detection by dalilite v. 5. Bioinformatics. 2019; 35:5326–5327. [DOI] [PubMed] [Google Scholar]
  • 44. Shaik M.M., Bhattacharjee N., Feliks M., Ng K.K., Field M.J.. Norovirus RNA-dependent RNA polymerase: a computational study of metal-binding preferences. Proteins. 2017; 85:1435–1445. [DOI] [PubMed] [Google Scholar]
  • 45. Shi W., Ye H.Q., Deng C.L., Li R., Zhang B., Gong P.. A nucleobase-binding pocket in a viral RNA-dependent RNA polymerase contributes to elongation complex stability. Nucleic. Acids. Res. 2020; 48:1392–1405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Choi K.H., Groarke J.M., Young D.C., Kuhn R.J., Smith J.L., Pevear D.C., Rossmann M.G.. The structure of the RNA-dependent RNA polymerase from bovine viral diarrhea virus establishes the role of GTP in de novo initiation. Proc. Natl. Acad. Sci. U.S.A. 2004; 101:4425–4430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Hellström K., Kallio K., Utt A., Quirin T., Jokitalo E., Merits A., Ahola T.. Partially uncleaved alphavirus replicase forms spherule structures in the presence and absence of RNA template. J. Virol. 2017; 91:e00787-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Jia H., Gong P.. A structure-function diversity survey of the RNA-Dependent RNA polymerases from the positive-strand RNA viruses. Front. Microbiol. 2019; 10:1945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Pietila M.K., Hellstrom K., Ahola T.. Alphavirus polymerase and RNA replication. Virus Res. 2017; 234:44–57. [DOI] [PubMed] [Google Scholar]
  • 50. Spuul P., Balistreri G., Hellstrom K., Golubtsov A.V., Jokitalo E., Ahola T.. Assembly of alphavirus replication complexes from RNA and protein components in a novel trans-replication system in mammalian cells. J. Virol. 2011; 85:4739–4751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Gong P., Peersen O.B.. Structural basis for active site closure by the poliovirus RNA-dependent RNA polymerase. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:22505–22510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Chen R., Mukhopadhyay S., Merits A., Bolling B., Nasar F., Coffey L.L., Powers A., Weaver S.C., Ictv Report C.. ICTV virus taxonomy profile: togaviridae. J. Gen. Virol. 2018; 99:761–762. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkab1302_Supplemental_File

Data Availability Statement

The atomic coordinates and structure factors have been deposited with the Protein Data bank under accession codes 7F0S, 7VB4 and 7VW5.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES