Skip to main content
RNA logoLink to RNA
. 2007 Mar;13(3):351–360. doi: 10.1261/rna.313707

NMR structure of stem–loop D from human rhinovirus-14

Stephen J Headey 1,5, He Huang 2,5,6, Jolyon K Claridge 1, Giselle A Soares 1, Kaushik Dutta 3, Martin Schwalbe 1, Daiwen Yang 4, Steven M Pascal 1
PMCID: PMC1800519  PMID: 17194719

Abstract

The 5′-cloverleaf of the picornavirus RNA genome is essential for the assembly of a ribonucleoprotein replication complex. Stem–loop D (SLD) of the cloverleaf is the recognition site for the multifunctional viral protein 3Cpro. This protein is the principal viral protease, and its interaction with SLD also helps to position the viral RNA-dependent RNA polymerase (3Dpol) for replication. Human rhinovirus-14 (HRV-14) is distinct from the majority of picornaviruses in that its SLD forms a cUAUg triloop instead of the more common uYACGg tetraloop. This difference appears to be functionally significant, as 3Cpro from tetraloop-containing viruses cannot bind the HRV-14 SLD. We have determined the solution structure of the HRV-14 SLD using NMR spectroscopy. The structure is predominantly an A-form helix, but with a central pyrimidine–pyrimidine base-paired region and a significantly widened major groove. The stabilizing hydrogen bonding present in the uYACGg tetraloop was not found in the cUAUg triloop. However, the triloop uses different structural elements to present a largely similar surface: sequence and underlying architecture are not conserved, but key aspects of the surface structure are. Important structural differences do exist, though, and may account for the observed cross-isotype binding specificities between 3Cpro and SLD.

Keywords: NMR, picornavirus, rhinovirus, RNA, replication, 3Cpro

INTRODUCTION

Human rhinoviruses (HRVs) are members of the picornaviridae family, which also includes poliovirus, hepatitis A virus, foot and mouth virus, and enteroviruses, including the coxsackieviruses. They are also closely related to SARS and other coronaviruses. Rhinoviruses are estimated to be responsible for >50% of all respiratory tract infections, and thus are a significant economic pest through missed work (Greenberg 2003). Upper respiratory tract infections can result in complications such as otitis media and sinusitis. HRV infections of the lower respiratory tract are the second most common infectious agents associated with pneumonia and bronchiolitis in young children, and commonly cause exacerbations and hospitalizations in patients with chronic airway diseases, including asthma (Hayden 2004). There are 110 identified HRV serotypes.

Like all picornaviruses, HRV-14 possesses a relatively small (7.2 kb) single-stranded RNA genome that is replicated without the use of a DNA intermediate. The genome is divided into three regions: 5′ and 3′ untranslated regions (UTR), separated by a single gene encoding a 250 kDa polyprotein. After translation, the nascent polyprotein is autolytically cleaved into component proteins that form the viral capsid (VP4, VP2, VP3, and VP1) and those involved in viral replication (2A, 2B, 2C, 3A, 3B, 3Cpro, and 3Dpol). The majority of cleavage events are performed by a multifunctional viral protein, 3Cpro, a cysteine protease similar in structure to the serine proteases (Lawrence and Thatch 1975; Pelham 1978; Matthews et al. 1994). In addition to its role in processing viral proteins, 3Cpro cleaves host proteins involved in mRNA transcription and translation, thereby diverting resources to viral reproduction (Amineva et al. 2004; Kuyumcu-Martinez et al. 2004). 3Cpro is also intimately involved in viral genome replication (Xiang et al. 1995). 3Cpro for a time remains covalently joined to the RNA-dependent RNA polymerase (3Dpol) to form the 3CD fusion protein. 3CD binds to RNA stem–loop D (SLD), near the start of the 5′-UTR, through a mechanism that is not completely understood, but is thought to involve positively charged residues of 3Cpro located on the surface opposite to its protease active site (Andino et al. 1990; Leong et al. 1993; Matthews et al. 1994; Walker et al. 1995). The interaction of 3CD with SLD is essential for the assembly of the ribonucleoprotein complex and the initiation of transcription (Xiang et al. 1995). Host proteins are recruited to the process, including the poly(A) binding protein (PABP), which can bind both to the poly(A) tail of the short 3′ UTR and to the 5′ ribonucleoprotein complex. This suggests that the viral RNA may be effectively circularized prior to replication (Meredith et al. 1999; Herold and Andino 2001).

While viral genomes with a deleted 3′ UTR still produce viable viruses, albeit with greatly reduced efficiency (Sarnow 1989; Todd and Semler 1996), mutations in 3Cpro or SLD that abrogate their interaction abolish viral reproduction (Walker et al. 1995; Zell et al. 2002). HRV-14 is distinct from most picornaviridae and the majority of HRV isotypes in containing a triloop (Fig. 1A) instead of the more common tetraloop (Fig. 1B,C) in its SLD. Intriguingly, although recombinant HRV-14 3Cpro can bind the tetraloop SLD of poliovirus (Walker et al. 1995; Blair et al. 1998), recombinant 3Cpro from poliovirus or coxsackievirus B3 (CVB3) cannot bind to HRV-14 SLD (Xiang et al. 1995; Blair et al. 1998; Zell et al. 2002; Rieder et al. 2003). Substituting the HRV-14 SLD triloop with certain tetraloop sequences results in a stem–loop that can interact with CVB3 3Cpro (Rieder et al. 2003). Taken together, these observations suggest that important differences between tetraloop and triloop structures can affect 3Cpro binding, but also suggest that important similarities must exist as well.

FIGURE 1.

FIGURE 1.

SLD sequences for which 3D structures have been determined. (A) Sequence of the HRV-14 stem–loop D (SLD) region investigated in this study. (B) Consensus sequence derived from rhinoviral and enteroviral SLDs, as defined by Du et al. (2004). (C) Coxsackievirus-B3 (CVB3) SLD sequence (Ohlenschlager et al. 2004). Figures 38 follow the coloring scheme introduced in this figure, to allow easy identification of mismatch and loop nucleotides. To facilitate comparison with the HRV-14 sequence, the third and fourth nucleotides in the tetraloops of parts B and C are numbered 15.1 and 15.2, respectively. Nucleotides in bold italic indicate points of divergence between the three sequences.

We have used NMR spectroscopy to determine the solution structure of the 27 nucleotide (nt) HRV-14 SLD region sufficient for 3Cpro binding (Walker et al. 1995). Comparison with recently studied tetraloop SLDs (Du et al. 2003, 2004; Ohlenschlager et al. 2004) shows that the HRV-14 triloop SLD does indeed retain many of the structural features of the tetraloop SLDs while also possessing distinct characteristics that may account for its unique 3Cpro binding properties.

RESULTS

Exchangeable proton resonance assignment and interpretation

The HRV-14 SLD 27mer contains 16 imino protons, of which 13 were observed in 1H 1D and 1H, 1H 2D NOESY spectra in H2O (Fig. 2), consistent with their participation in hydrogen-bonded base pairs. Nine of these 13 resonances lie in the spectral range typically found in standard Watson–Crick base pairing (12.5–14 ppm) and were assigned to stem I and stem II nucleotides. The four imino resonances lying outside of this range, two each at ∼10.0 and 11.4 ppm, display unusually strong imino–imino NOE cross-peaks. Weak NOE connectivities to bases in the Watson–Crick regions enabled unambiguous assignment of these resonances to uracil nucleotide participating U–U mismatched base pairs in the central region of the helix (Fig. 2). The imino resonance of G1 was not observed, suggesting that the G1–C27 base pair does not form or is not sufficiently shielded from solvent to prevent fast imino proton exchange on the NMR time scale. Similarly, no imino resonance could be assigned for U22, which might be expected to participate in a noncanonical CU base pair in the central region of the helix. However, this observation is consistent with studies of other SLDs where the homologous imino resonance is either not observed (Du et al. 2003; Ohlenschlager et al. 2004) or only detected at low temperature (Du et al. 2004). The absence of detectable imino resonances for U13 and U15 is consistent with the location of these bases in a loop region where they do not base pair.

FIGURE 2.

FIGURE 2.

Portion of 700 MHz 2D NOESY spectrum of HRV-14 SLD in H2O containing imino–imino connectivities. A 1D 1H NMR spectrum of the full imino region is plotted across the top. Key cross-peaks are labeled by nucleotide number.

Nonexchangeable resonance assignments

Assignment of stem II and triloop resonances of the 27mer was greatly assisted by reference to previously published assignments of a 14 nt region of HRV-14 SLD (Huang et al. 2001), which included nucleotides 8–20 of Figure 1A. The chemical shifts of nucleotides 9–19 showed little departure from the previous assignments. All nonexchangeable base resonances as well as all sugar H1′ and H2′ resonances, with the exception of the H2′ of G1, were assigned. Of the remaining sugar resonances 23 of the 27 H3′, 24 of the 27 H4′ and 18 of the 54 H5′ and H5″ resonances were assigned. In total, 167 of the 211 nonexchangeable 1H resonances of the 27mer were unambiguously assigned. The extent of assignment was higher in the pyrimidine-mismatch (83%) and triloop (92%) regions than in stem I (71%) or stem II (79%). Strong intranucleotide and sequential (H1′)–(H6 or H8) and (H2′)–(H6 or H8) NOE cross-peaks observed for most nucleotides are consistent with the classical base stacking of RNA double helices.

Sugar conformations and backbone geometry

DQF-COSY spectra show strong intranucleotide (H1′)–(H2′) multiplets for U13, A14, and U15 of the triloop, consistent with predominantly C2′-endo sugar pucker. The ribose rings of these nucleotides were therefore constrained to C2′-endo during structure calculations. The nucleotides of the closing base pair of the triloop (C12–G16), as well as C27, have weak but detectable (H1′)–(H2′) multiplets, typical of a mixture of C2′-endo and C3′-endo sugar puckers. Therefore, the sugar conformations of these nucleotides were not constrained. The absence of (H1′)–(H2′) multiplets for the sugars of other nucleotides, including those of the mismatch region, is consistent with the C3′-endo conformation found in A-form RNA. C3′-endo constraints were consequently applied to these residues during structure calculations. The intensities of intranucleotide NOEs between (H1′) and aromatic protons provided no evidence of syn glycosidic bond angles for any nucleotide. The 1H,31P-HETCOR revealed significantly downfield shifted 31P resonances for U15, U17, and to a lesser extent, U13 and G16, indicating possible departures from A-form backbone geometry in these regions.

Structural calculation

The solution structure of HRV-14 SLD is shown in Figure 3A. Structural statistics are given in Table 1. HRV-14 SLD is comprised of two A-form helical stem regions containing standard Watson–Crick base pairs. These stems are separated by a pyrimidine-mismatch region containing a C–U base pair flanked by two U–U base pairs. The structure is capped by a hairpin triloop. The ribose moieties of U13, A14, and U15 of the triloop and G16 are in the C2′-endo conformation, whereas those of the stem and mismatch regions and C12 are in the C3′-endo conformation typical of A-form RNA. All glycosidic bonds are in the more common trans conformation. The solution structure is relatively well defined (pair-wise heavy atom RMSD = 0.59 ± 0.16 Å).

FIGURE 3.

FIGURE 3.

Stereo views of the superposition of the 10 lowest-energy structures of HRV-14 SLD determined (A) with RDCs, and (B) without RDCs. Bases are colored as in Figure 1A.

TABLE 1.

Structural and refinement statistics of HRV-14 SLD

graphic file with name 351tbl1.jpg

Pyrimidine-mismatch conformation

The central mismatch region of SLD is pyrimidine-rich, containing the sequences UCU and UUU (Fig. 1A). Hydrogen bonding between U–U base pairs is evident from the presence of unusual imino–imino NOEs as discussed above and shown in Figure 2. This pattern arises from the imino proton of U5/U21 (∼10.0 ppm) hydrogen bonding to the O4 of U23/U7, respectively, while the imino of U23/U7 (∼11.4 ppm) hydrogen bonds to the O2 of U5/U21, respectively (see Fig. 4). The intervening C6–U22 base pair has hydrogen bonds between the amino group of C6 and O4 of U22 and also between the imino proton of U22 and N6 of C6. No hydrogen bond constraints were included in structure calculations for this base pair, as its labile protons were not observed in NOE spectra. The incorporation of RDC constraints appears to have been critical in maintaining correct hydrogen bond geometry for C6–U22, and in structures calculated without RDCs this base pair is not well formed (Fig. 3B). RDC constraints also encourage base planarity for the U7–U21 pair. Although A-form geometry is largely maintained in the pyrimidine-mismatch region, the minor groove width is reduced by ∼2–3 Å relative to stems I and II. This reduction is sufficient to explain the observed C6(H1′)–U23(H1′) NOE.

FIGURE 4.

FIGURE 4.

A zoom view of the pyrimidine–pyrimidine-mismatch region. Base-pairing hydrogen bonds are indicated by dotted lines, while base hydrogens that do not participate in hydrogen bonds are omitted. Each of the two U–U base pairs form an H3–O2 and an O4–H3 hydrogen bond. However, in the lower pairing (U5–U23), the O2 is from U5 while the O4 is from U23. In the top pairing, the O2 is from U21 while the O4 is from U7. The central C6–U22 base pair forms hydrogen bonds from C6(amino group) to U22(O4) and from U22(H3) to C6(N3). The phoshpodiester backbone is shown as a worm, and the color scheme is that of Figure 1A, with the addition of red, white, and blue for base oxygen, hydrogen, and nitrogen atoms, respectively.

Triloop conformation

The triloop (Fig. 5A) is comprised of cU13A14U15g. The U13 base is in the major groove where it stacks onto the C12 base, giving rise to relatively strong sequential NOEs from both C12(H1′) and C12(H2′) to U13(H6) that are comparable in intensity to analogous NOEs in the stem regions (Fig. 5B). However, a less typical NOE of moderate intensity between C12(H1′) and U13(H5) supports an enhanced degree of base stacking. In contrast, the sequential H1′-aromatic NOEs of U13 to A14, A14 to U15, and U15 to G16 are very weak, which is consistent with the lack of base stacking for these pairs of nucleotides. The base of A14 is located in the minor groove. Its position is constrained by a network of NOEs to A14 base protons, including A14(H2) and A14(H8) resonances to U13(H4′). The U15 base is somewhat disordered, and has relatively few internucleotide NOE connectivities. The incorporation of three RDC constraints for this nucleotide stabilized its base in a position tilted toward the backbone in the direction of A14. The closing base pair (C12, G16) forms a nearly standard Watson–Crick pair but with the sugar of G16 adopting a C2′-endo conformation. As mentioned, both C12 and G16 show weak (H1′)–(H2′) multiplets in COSY spectra, which suggest mixed sugar puckers, and consequently, no sugar pucker constraints were applied to these nucleotides. However, in each of the 10 lowest-energy structures, the G16 and C12 sugars adopt C2′-endo and C3′-endo conformations, respectively.

FIGURE 5.

FIGURE 5.

The HRV-14 SLD triloop. (A) Cartoon representation. U13 (green) lies on the major groove side, while A14 (blue) lies along the minor groove. U15 is shown in pink. (B) Stereo view showing key NOE connectivities (red lines) and CH bonds specifically oriented by RDC data (red bonds). Nucleotides are otherwise colored as in Figure 1A, with the addition of violet phosphorous atoms to assist in viewing the path of the backbone.

DISCUSSION

Advantage of RDC analysis

Not surprisingly, the precision of the structural family determined with RDCs (Fig. 3A) is significantly higher than that of the family determined without RDCs (Fig. 3B), with pair-wise RMSDs of 0.59 and 1.00 Å, respectively. Due to the planarity of nucleic acid bases, the presence of base pairing, and the restricted conformational space available to the cyclic ribose moiety, a modest number of RDCs is sufficient to afford a large increase in precision (Mauffret et al. 2002). Moreover, the RDC-refined structures presented here are more compact, with a mean axial length of 47 Å compared to 52 Å for the non-RDC refined structures. Similar shortening of the helix upon refinement with RDCs also occurred in structure calculations for the consensus SLD (Du et al. 2004), suggesting that at least in some cases, elongated structures may be an inherent bias of structure calculation algorithms performed without RDCs.

A portion of the SLD structure presented here is a refinement of a previously reported stem–loop structure (Huang et al. 2001), which included stem II and the triloop, but not stem I or the mismatch region. In this previous study (of a 14mer), performed without the benefit of RDC data, the A14 base was thought to occupy the major groove. Current structural analysis (of a 27mer) places the A14 base in the minor groove. When RDC restraints are omitted from the present refinement, the base of A14 resides in the major groove in 30% of structures, but in the minor groove in the remaining 70% of acceptable structures. This second possibility, that the A14 base may occupy the minor groove, was only identified upon inclusion of additional NOEs observed with the 27mer, including from the previously unassigned A14(H2) to the H4′ and H5″ resonances of U13. These two potential conformational groups were then used to back-calculate NOEs. The results favored the A14 base residing in the minor groove. Importantly, the incorporation of RDC data conclusively precluded the major groove configuration without necessitating recourse to NOE back-calculation. These findings underscore the utility of RDC data for resolving structural ambiguities in nucleic acids, particularly in nonstandard loop regions.

HRV-14 SLD is an atypical A helix

Although HRV-14 SLD is unequivocally an A-form helix, its geometry nevertheless shows significant divergence from idealized A form. The defining features of an A helix, namely, the 3′-endo sugar pucker, off-axis positioning of the bases, and the inclination of the bases relative to the helical axis, are all present in HRV-14 SLD. A local cinching of the minor groove occurs in the pyrimidine-mismatch region, presumably as a result of the smaller base sizes. The minor groove width in this region decreases from the typical A-form distance of 11 Å to ∼8 Å. Beginning at the pyrimidine-mismatch region the major groove width progressively expands from 7 Å, which already exceeds the nominal value for the A-form helix, to ∼13 Å (Figs. 6A, 7A), which is in the typical range of a B helix commonly found in DNA. Such widening of the major groove is a recurrent feature of protein recognition sites, as the major groove of the idealized A helix is too narrow to allow the interrogation of the bases by protein alpha helices. Somewhat similar helical geometry is also evident in the NMR structures of tetraloop SLDs (Figs. 6B,C). In each case, part of the major groove is widened. In the case of the consensus SLD (Du et al. 2004), the presence of two bulged bases in stem I (see Fig. 1B) coincides with the initial expansion of the major groove, which widens to ∼12 Å as the cinch is approached, and maintains close to the B-form width thereafter. The coxsackievirus B3 SLD major groove (Ohlenschlager et al. 2004) approaches the B-form width in both stem regions but narrows to an intermediate value in the mismatch region. It is not immediately obvious to what extent groove width differences between these three structures are due to (1) the presence of a bulge in the consensus sequence as discussed above; (2) seemingly minor sequence variations within the stems; or (3) differences in technique, such as the inclusion of RDC data in the HRV-14 and consensus structures, but not the coxsackie structure. It is worth noting that Ohlenschlager et al. (2004) found chemical shift perturbations induced by 3CD binding in the distal end of the mismatch region, stem II, and the apical loop of coxsackievirus B3 SLD. Enhanced access to the major groove in this region is evident in all three SLD structures. Moreover, the U-rich pyrimidine-mismatch region would present an area of high electronegativity that could interact with the basic residues of the putative RNA binding site on 3Cpro.

FIGURE 6.

FIGURE 6.

Space-filling views of NMR-determined structures of SLD from (A) HRV-14, (B) consensus sequence (Du et al. 2004), and (C) Coxsackievirus B3 (CVB3) (Ohlenschlager et al. 2004). Widened major grooves are evident at the center of each part of the figure. Sequences and colors are as in Figure 1A–C, respectively.

FIGURE 7.

FIGURE 7.

(A) Plot of HRV-14 SLD major groove width, calculated as distance of closest phosphorous–phosphorous approach across the groove, minus 5.6 Å. Distances have been measured from the nucleotides listed directly below the x axis to the nucleotides listed below the error bars. Groove widths for the standard A and B forms are shown as horizontal lines within the figure. Base-paired nucleotides and coloring of the mismatch region have been added below the x axis for clarity. (B) A CPK rendering of HRV-14 SLD. Bases are again colored as in Figure 1A. Backbone phosphorous nuclei are shown in purple, while backbone oxygens not covalently bound to sugar carbons are shown in red. The widening of the major groove upon approach to the triloop is clearly evident.

Triloops and tetraloops

The mechanism of picornaviral replication is so well conserved that picornaviruses with chimeric genomes, where the cloverleafs have been swapped between picornavirus isoptypes, can produce viable virus particles (Rohll et al. 1994; Xiang et al. 1995; Johnson and Semler 1988). It is not surprising, therefore, that recombinant 3Cpro from picornaviruses with tetraloop SLDs can bind each others’ SLDs with near-normal affinity (Zell et al. 2002). However, attempts to identify key binding determinants via mutagenesis have not been completely successful, leading to the suggestion that conserved structural features, rather than conserved nucleotide sequences, are critical for the interaction (Zell et al. 2002; Rieder et al. 2003).

An insight into these conserved features comes from an examination of the tetraloop structures of the consensus (Du et al. 2004) and CVB3 (Du et al. 2003; Ohlenschlager et al. 2004) SLDs. These two stem–loops contain nearly identical tetraloop structures, with base stacking and hydrogen bonding very similar to that previously seen in cUUCGg tetraloops (e.g., Allain and Varani 1995). Tetraloops are common motifs for RNA–RNA and RNA–protein interactions. The importance of the SLD tetraloop in 3Cpro binding is reinforced by NMR chemical shift perturbation results as discussed above (Ohlenschlager et al. 2004).

HRV-14 is atypical of picornaviruses in that its SLD contains a triloop instead of the more common tetraloop. This difference has proven to be both useful and puzzling in investigations of 3Cpro–SLD interactions. The HRV-14 SLD has to this point not been shown capable of interacting with a 3Cpro from a virus whose SLD forms a tetraloop (Xiang et al. 1995; Blair et al. 1998; Zell et al. 2002; Rieder et al. 2003), nor can the corresponding chimeric picornavirus produce viable particles. Interestingly, the reverse is not true, as HRV-14 3Cpro can form a stable complex with the tetraloop SLDs from poliovirus (Walker et al. 1995). These observations strongly suggest that the loop region forms part of the 3Cpro binding site. On the other hand, the lack of binding reciprocity has been difficult to explain.

When the NMR structure of the HRV-14 c(U13A14U15)g triloop (Fig. 8A) is compared to that of the u(Y13A14C15.1G15.2)g tetraloop (Figs. 8B,C), a striking similarity is apparent. In each case, the A14 base (blue) lies on the minor groove side while a pyrimidine (green) lies in the major groove where it stacks upon paired bases (gray). These shared features could help account for the ability of HRV-14 3Cpro to bind the tetraloop poliovirus SLD (Walker et al. 1995). However, upon closer examination, this mimicry is achieved via dissimilar underlying architecture. The tetraloops form a noncanonical (Y13–G15.2) base pair (gray), with G15.2 in syn conformation and forming a hydrogen bond to the Y13 2′-hydroxyl (see Fig. 1B,C for numbering). Only two unpaired nucleotides are present: A14 (blue) and C15.1 (green). The latter of these is the pyrimidine lying in the major groove, where it stacks onto Y13. The HRV-14 triloop contains no such base pairing: instead of two unpaired bases, there are three. The terminal base pair of stem II (C12, G16) assumes a role similar to the tetraloop (Y13, G15.2) pair. The pyrimidine in the major groove is U13 rather than C15.1, and it stacks onto the C12 base instead of Y13. In effect, the triloop stem II is one base pair shorter.

FIGURE 8.

FIGURE 8.

(A) Top view of HRV-14 SLD showing sugars and bases only of the nucleotides from the triloop, the central mismatch (orange and yellow), and the closing base pair (C12–G16). In this view, the triloop is closest to the observer. U13 (green), which lies in the major groove side, stacks onto C12 (gray). A14 (blue) lies on the minor groove side. (B) Similar view of the consensus SLD structure (see Fig. 1B). The closing base pair (U12–G16) is omitted for clarity. NMR structure of HRV-14 stem–loop D. However, the tetraloop base pair formed by C13–G15.2 (gray) plays a somewhat analogous role. A14 (blue) again lies along the minor groove, while C15.1 (green) lies on the major groove side. There is no base analogous to U15 from HRV-14 (pink base in A). (C) The CVB3 SLD structure (see Fig. 1C) shows similarity to the consensus structure in B.

The third unpaired base in the triloop is U15. In constrast to U13 and A14, the position of this base has no homolog in the tetraloop structure. Comparatively few internucleotide NOE distance constraints were assigned for this nucleotide, and thus it is somewhat disordered. With the inclusion of RDCs its base is constrained to a position not approaching either groove, but tilted toward the backbone of A14 (Figs. 5A,B). No hydrogen-bonding interactions appear to stabilize this position. It is thus conceivable that U15 and perhaps other nucleotides of the triloop are, in fact, more disordered than the RDC-refined structures suggest, presumably on a rapid time scale, and that structures calculated using conformationally averaged RDCs and NOEs could provide a false impression of a well-defined conformation. The presence of partially resolved H1′–H2′ multiplets for the closing base-pair C12–G16 in the DQFCOSY spectrum lends some support to this interpretation. RDCs are sensitive to motions on a wide range of time scales, and should tend toward zero if the motions are sufficiently unrestricted. The mean magnitude of the RDCs in the triloop (5.9 ± 2.26 Hz) is indeed less than the mean for the rest of the molecule (10.5 ± 7.12 Hz). However, these differences may simply reflect the fact that these nucleotides are in a loop region where their orientations would be expected to differ from nucleotides in helical regions. No nucleotide has strongly attenuated RDCs, suggesting that the triloop bases do have preferred orientations. The NOE data are also well fit with single conformers: conformationally averaged models are not required to account for conflicting NOE-derived distances. Taken together, the data would seem to be inconsistent with large structural fluctuations that might lead to, for instance, interconversion of the loop bases between grooves. Nonetheless, until dynamics data are available, judgment should be reserved as to whether the relatively well-ordered RDC-refined structures (Fig. 3A), or the less-ordered structures generated without RDCs (Fig. 3B), provide a truer representation of the conformational space sampled by the triloop.

Regardless of the presence or absence of disorder at this base, U15 clearly does not fit into the structural pattern established by the tetraloops. This fact, together with the ability of HRV-14 3Cpro to bind tetraloop SLDs, could suggest that U15 is not a critical binding determinant in HRV-14. However, this additional base may account for the inability of 3C proteases from other picornaviruses to bind the HRV-14 SLD (Xiang et al. 1995; Blair et al. 1998; Zell et al. 2002; Rieder et al. 2003) by providing steric or other forms of interference. This contention is supported by point mutagenesis of a replication-incompetent HRV-14/poliovirus chimeric genome containing an HRV-14 SLD triloop sequence (Rieder et al. 2003). Creation of a tetraloop via insertion of a guanidine (cUAUGg) fails to rescue replication. However, the further U→C mutation (cUACGg) yields a replicating genome. In the tetraloop SLD structures, the position of this cytosine in the major groove is stabilized by a hydrogen bond from its amino group to the pro(R) phosphate oxygen of the preceding adenosine. Uracil lacks this amino group. Hence, it may be free to leave the major groove, perhaps adopting a conformation similar to U15 in the HRV-14 SLD, and thus interfering with 3Cpro binding.

In summary, comparisons between the NMR structure of the HRV-14 triloop and those of tetraloop SLDs have been made with a view to discerning the conserved features required for 3Cpro binding. The likely candidates are three structural features: (a) a widened major groove; (b) a base-stacked pyrimidine in the major groove; and (c) an adenosine in the minor groove.

MATERIALS AND METHODS

RNA preparation

Natural abundance 5′-GUACUCUGGUACUAUGUACCUUUGUAC-3′ corresponding to stem–loop D of HRV-14 was purchased from Dharmacon after chemical synthesis, gel electrophoresis, and HPLC purification. The 3′-O-ester protection was removed with 100 mM TEMED-acetate (tetramethylethylenediamine), pH 3.8, as described by the manufacturer. The oligomer was further purified with ethanol precipitation and gel filtration, and buffer exchanged to 90 mM NaCl, 10 mM sodium phosphate, 0.1 mM EDTA, pH 7.0. For NMR studies, the oligomer concentration was brought to 0.6 mM, and H2O was replaced by either 100% D2O (D2O sample) or 10% D2O/90% H2O (H2O sample).

15N,13C-labeled RNA of the identical sequence as above was synthesized enzymatically from a synthetic single-stranded DNA template using Escherichia coli His6-tagged RNA polymerase (Milligan et al. 1987; Wyatt et al. 1991) and 13C,15N-labeled rNTPs (Spectra Stable Isotopes). To maximize transcription yield, an 11 nt 5′-purine-rich sequence tag was included in the DNA template. The resulting 11 nt tag was later removed from the RNA via DNAzyme cleavage (Santoro and Joyce 1997; Cheong et al. 2004). The RNA was purified via 15% polyacrylamide gel electrophoresis with 6 M urea, identified by brief UV shadowing (254 nm), and excised from the gel. RNA was recovered by crushing the gel slab followed by a temperature shock (freeze [−80°C]/thaw [90°C]) treatment, overnight incubation at 37°C, and either electroelution or diffusion into buffer. Purified RNA samples were concentrated, desalted, and brought to the same buffer conditions as the natural abundance sample via centrifugal filtration.

NMR spectroscopy

NMR experiments were performed at 600 (Varian Inova), 700, and 800 MHz (Bruker Avance). Each spectrometer is equipped with four rf channels and gradient pulse capabilities. The 700 MHz spectrometer was equipped with a cryoprobe. In general, spectra of nonexchangeable protons were recorded at 30°C with a 99% D2O buffer sample using presaturation to suppress the residual water signal. Initial NOESY spectra were acquired at 600 MHz with mixing times of 50, 75, 100, 120, and 400 msec at 25°C. Typically, a total of 256 t 1 increments were acquired, with 4096 data points and 32 scans per fid and a recycle time of 3 sec. To further assist in resonance assignment, 400-msec mixing time NOESY spectra were acquired in D2O at a series of temperatures (0, 15, 25, 35°C), and NOESY spectra in H2O were acquired with 50 and 120 msec mixing times at 0 and 5°C with the H2O signal suppressed by a combination of soft 90° and z-gradient pulses; 700 MHz NOESY spectra were acquired in H2O at 5°C with 4096 data points, 200 t 1 increments, 576 scans per fid, and a mixing time of 50 msec with 3–9–19 gradient water suppression, and in D2O with 4096 data points, 464 t 1 increments, and 416 scans per increment with a 300-msec mixing time.

TOCSY spectra with mixing times of 70 and 200 msec and a DQF-COSY spectrum were recorded at 25°C to obtain sugar couplings and through-bond correlations. An 1H,13C-HMQC spectrum was acquired with spectral widths of 5000 and 10,560 Hz in the proton and carbon dimensions, respectively. A total of 86 increments were acquired, with 2048 data points, 512 scans per fid, and a recycle time of 3 sec. A 1H,31P HETCOR spectrum was acquired with spectral widths of 5000 Hz in 1H and 800 Hz in 31P. Data were processed with NMRPipe (Delaglio et al. 1995) and visualized and analyzed with NMRview (Johnson and Blevins 1994) or TOPSPIN 1.3 (Bruker). Chemical shift assignments for 13C were based on correlated 1H shifts in the 1H,13C-HMQC spectrum and the 1H,13C-HSQC experiments described below.

Dipolar couplings were measured at 700 MHz in the 13C dimension of a coupled constant-time HSQC utilizing composite chirp pulses for the inversion of 13C magnetization during the t 1 period. Where overlap occurred, dipolar couplings were instead obtained from the 1H dimension by removing 13C decoupling during acquisition. Analysis of couplings in both dimensions confirmed that similar measurements were obtained by each method. The measurement precision was estimated by repetition as ∼2 Hz. Alignment was achieved by the use of 13 mg/mL Pf1 phage (Profos), which caused an 11.6-Hz splitting in the deuterium spectrum, indicating a liquid-crystalline state had been obtained.

NMR-derived structural constraints

NOE-based estimates of the distance between nonexchangeable protons were derived from NOESY cross-peak volumes at mixing times of 50 msec, 75 msec, and 100 msec using an isolated spin pair approximation. Cross-peak volumes were integrated using Felix and build-up rates compared to H5–H6 (2.45 Å) cross-peaks and scaled by 1/r 6, where r represents the distance between the two protons. After preliminary distances were derived in this manner, the potential well minima and maxima were set to this distance ±30% up to a maximum of 6 Å. NOEs involving exchangeable protons were interpreted more conservatively as distance constraints with a similar +30% upper limit, but with a lower bound of 1.8 Å.

Backbone dihedral angles of nucleotides G1–C4, G8–C12, U18–C20, and G24–C27 were constrained to A-form helix values ±45° based on their 31P chemical shifts in the 1H,31P-HETCOR, and their participation in Watson–Crick pairing (Gorenstein 1984). A more cautious interpretation was made for the pyrimidine-mismatch region. The alpha and zeta dihedral angles of nucleotides 5–7 and 21–23 were constrained to gauche conformations (0° ± 100°) based on their 31P shifts (Varani et al. 1991), while other backbone angles in this region were left unconstrained. Backbone constraints were not applied in the triloop region, as the 31P resonances of U13, U15, G16, and U17 are shifted significantly downfield.

A total of 66 well-resolved one-bond RDCs were obtained including 23 JC1′–H1′, 5 JC2–H2, 15 JC5–H5, 16 JC6–H6, 7 JC8–H8 RDCs along with the JC2′–H2′ RDC of guanidine 16, which could be resolved on the basis of its unique 1H frequency. The alignment tensors were obtained using an iterative approach similar to that of Warren and Moore (2001). Briefly, a range of values for the rhombicity (R) was generated using the program PALES (Zweckstetter and Bax 2000) from a family of structures refined without RDCs. The R values were used to obtain an estimate of the axial component (Da) of the alignment tensor using the formula

graphic file with name 351equ1.jpg

where d max is the maximum positive value from the histogram of measured RDCs. Although the minima and highest populated values of the histogram can in theory be used to obtain Da, in practice these are not adequately sampled in RNA due to the nonrandom distribution of bond angles. However, the value of d max can be more reliably obtained as it corresponds to C–H bonds at an angle of 90° to the principle alignment tensor, an orientation well sampled by the planar bases of nucleic acids. The tensor values thus obtained were used to generate a set of structures using the RDC constraints. The lowest-energy structures were then used in a second round of alignment tensor prediction with PALES. Structure generation and alignment prediction were further iterated until the tensor values converged. The lowest-energy structures were obtained with Da = −15.75 Hz and R = 0.168.

Structure calculation

Structure calculations employed 358 NOE-derived interatomic distance constraints, 85 backbone torsion angle constraints, and 40 hydrogen bond constraints. Initially, hydrogen-bonding constraints were applied only to those base pairs with NOE patterns and observable imino protons indicating participation in standard Watson–Crick pairing (U2–U5, U7–C12, G16–U21, U23–A26). Weak base planarity constraints (10 kcal mol−1) were applied to the Watson–Crick regions and also for U5–U23 and U7–U21 base pairs for which imino–imino NOE peaks were observed, although no assumption was made about the geometry of the pairing. Structure calculations used simulated annealing with Cartesian molecular dynamics in CNS 1.1. HRV-14 SLD structures were generated from a set of helical stem–loops having Watson–Crick base-paired regions but with randomized triloop and pyrimidine-mismatch regions. A total of 1200 trial structures were generated, using 4000 steps at 20,000 K followed by 4000 cooling steps beginning at 1000 K, and then energy minimized using 15 cycles with 200 steps of steepest descent minimization. This produced 510 structures with acceptable geometry, as defined by CNS. The 20 lowest-energy structures unambiguously converged with the H3 of 5/21 paired with O4 of 23/7, respectively, and H3 of 7/23 paired with O2 of 21/5, respectively. Distance constraints reflecting these hydrogen bonds were therefore applied in subsequent rounds of structure calculation to reinforce appropriate hydrogen bond distances and to increase the efficiency of structure calculation. The first round structures were further refined with or without 66 1JCH RDC constraints with the same annealing procedure except that the high-temperature stage was reduced to 3000 K. The 10 lowest-energy structures refined with RDCs and having no constraint violations (no NOE or hydrogen bond violation >0.2 Å, no angle violation >5°, or dipolar coupling violation >2.0 Hz) were selected to represent the solution structure of HRV-14 SLD (Fig. 3A). The 10 lowest-energy structures calculated without RDCs and having no constraint violations (no NOE or hydrogen bond violation >0.2 Å, no angle violation >5°) are shown in Figure 3B. Structural figures were created with PyMOL (http://pymol.sourceforge.net/).

ACKNOWLEDGMENTS

The authors thank Thomas Krugh, Douglas Turner, Brent Znosko, Scott Kennedy, Francois Bissey, Gillian Norris, Andrew Sutherland-Smith, and Pat Edwards. Funding for this project was provided in part by the Marsden Fund of the Royal Society of New Zealand.

Footnotes

Article published online ahead of print. Article and publication date are at http://www.rnajournal.org/cgi/doi/10.1261/rna.313707.

References

  1. Allain, F.H., Varani, G. Structure of the P1 helix from group I self-splicing introns. J. Mol. Biol. 1995;250:333–353. doi: 10.1006/jmbi.1995.0381. [DOI] [PubMed] [Google Scholar]
  2. Amineva, S.P., Aminev, A.G., Palmenberg, A.C., Gern, J.E. Rhinovirus 3C protease precursors 3CD and 3CD′ localize to the nuclei of infected cells. J. Gen. Virol. 2004;85:2969–2979. doi: 10.1099/vir.0.80164-0. [DOI] [PubMed] [Google Scholar]
  3. Andino, R., Rieckhof, G.E., Trono, D., Baltimore, D. Substitutions in the protease (3Cpro) gene of poliovirus can suppress a mutation in the 5′ noncoding region. J. Virol. 1990;64:607–612. doi: 10.1128/jvi.64.2.607-612.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Blair, W.S., Parsley, T.B., Bogerd, H.P., Towner, J.S., Semler, B.L., Cullen, B.R. Utilization of a mammalian cell-based RNA binding assay to characterize the RNA binding properties of picornavirus 3C proteases. RNA. 1998;4:215–225. [PMC free article] [PubMed] [Google Scholar]
  5. Cheong, H.K., Hwang, E., Lee, C., Choi, B.S. Rapid preparation of RNA samples for NMR spectroscopy and X-ray crystallography. Nucleic Acids Res. 2004;32:e84. doi: 10.1093/nar/gnh081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Delaglio, F., Greziek, S., Vuister, G.W., Zhu, G., Pfeifer, J., Bax, A. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
  7. Du, Z., Yu, J., Andino, R., James, T.L. Extending the family of UNCG-like tetraloop motifs: NMR structure of a CACG tetraloop from coxsackievirus B3. Biochemistry. 2003;42:4373–4383. doi: 10.1021/bi027314e. [DOI] [PubMed] [Google Scholar]
  8. Du, Z., Yu, J., Ulyanov, N.B., Andino, R., James, T.L. Solution structure of a consensus stem–loop D RNA domain that plays important roles in regulating translation and replication in enteroviruses and rhinoviruses. Biochemistry. 2004;43:11959–11972. doi: 10.1021/bi048973p. [DOI] [PubMed] [Google Scholar]
  9. Gorenstein, D.G., Lai, K., Shah, D.O. 31P and two-dimensional 31P/1H correlated NMR spectra of Duplex d(Ap[17O]Gp[18O]Cp[16O]T) and assignment of 31P signals in d(ApGpCpT)2-actinomycin D complex. Biochemistry. 1984;23:6717–6723. doi: 10.1021/bi00321a067. [DOI] [PubMed] [Google Scholar]
  10. Greenberg, S.B. Respiratory consequences of rhinovirus infection. Arch. Intern. Med. 2003;163:278–284. doi: 10.1001/archinte.163.3.278. [DOI] [PubMed] [Google Scholar]
  11. Hayden, F.G. Rhinovirus and the lower respiratory tract. Rev. Med. Virol. 2004;14:17–31. doi: 10.1002/rmv.406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Herold, J., Andino, R. Poliovirus RNA replication requires genome circularization through a protein-protein bridge. Mol. Cell. 2001;7:581–591. doi: 10.1016/S1097-2765(01)00205-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Huang, H., Alexandrov, A., Chen, X., Barnes T.W., III, Zhang, H., Dutta, K., Pascal, S.M. Structure of an RNA hairpin from HRV-14. Biochemistry. 2001;40:8055–8064. doi: 10.1021/bi010572b. [DOI] [PubMed] [Google Scholar]
  14. Johnson, B.A., Blevins, R.A. NMRView: A computer program for the visualization and analysis of NMR data. J. Biomol. NMR. 1994;4:603–614. doi: 10.1007/BF00404272. [DOI] [PubMed] [Google Scholar]
  15. Johnson, V.A., Semler, B.L. Defined recombinants of poliovirus and coxsackievirus: Sequence-specific deletions and functional substitutions in the 5′ noncoding regions of viral RNAs. Virology. 1988;162:47–57. doi: 10.1016/0042-6822(88)90393-5. [DOI] [PubMed] [Google Scholar]
  16. Kuyumcu-Martinez, N.M., Van Eden, M.E., Younan, P., Lloyd, R.E. Cleavage of poly(A)-binding protein by poliovirus 3C protease inhibits host cell translation: A novel mechanism for host translation shutoff. Mol. Cell. Biol. 2004;24:1779–1790. doi: 10.1128/MCB.24.4.1779-1790.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Lawrence, C., Thatch, R. Identification of a viral protein involved in post-translational maturation of the encephalomyocarditis virus capsid precursor. J. Virol. 1975;15:918–928. doi: 10.1128/jvi.15.4.918-928.1975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Leong, L.E., Walker, P.A., Porter, A.G. Human rhinovirus-14 protease 3C (3Cpro) binds specifically to the 5′-noncoding region of the viral RNA. Evidence that 3Cpro has different domains for the RNA binding and proteolytic activities. J. Biol. Chem. 1993;268:25735–25739. [PubMed] [Google Scholar]
  19. Matthews, D.A., Smith, W.W., Ferre, R.A., Condon, B., Budahazi, G., Sisson, W., Villafranca, J.E., Janson, C.A., McElroy, H.E., Gribskov, C.L. Structure of human rhinovirus 3C protease reveals a trypsin-like polypeptide fold, RNA-binding site, and means for cleaving precursor polyprotein. Cell. 1994;77:761–771. doi: 10.1016/0092-8674(94)90059-0. [DOI] [PubMed] [Google Scholar]
  20. Mauffret, O., Tevanian, G., Fermandjian, S. Residual dipolar coupling constants and structure determination of large DNA duplexes. J. Biomol. NMR. 2002;24:317–328. doi: 10.1023/a:1021645131882. [DOI] [PubMed] [Google Scholar]
  21. Meredith, J.M., Rohll, J.B., Almond, J.W., Evans, D.J. Similar interactions of the poliovirus and rhinovirus 3D polymerase with the 3′ untranslated region of rhinovirus 14. J. Virol. 1999;73:9952–9958. doi: 10.1128/jvi.73.12.9952-9958.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Milligan, J.F., Groebe, D.R., Witherell, G.W., Uhlenbeck, O.C. Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates. Nucleic Acids Res. 1987;15:8783–8798. doi: 10.1093/nar/15.21.8783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ohlenschlager, O., Wohnert, J., Bucci, E., Seitz, S., Hafner, S., Ramachandran, R., Zell, R., Gorlach, M. The structure of the stem–loop D subdomain of coxsackievirus B3 cloverleaf RNA and its interaction with the proteinase 3C. Structure. 2004;12:237–248. doi: 10.1016/j.str.2004.01.014. [DOI] [PubMed] [Google Scholar]
  24. Pelham, H.R.B. Translation of encephalomyocarditis virus RNA in vitro yeilds an active proteolytic processing enzyme. Eur. J. Biochem. 1978;85:457–462. doi: 10.1111/j.1432-1033.1978.tb12260.x. [DOI] [PubMed] [Google Scholar]
  25. Rieder, E., Xiang, W., Paul, A., Wimmer, E. Analysis of the cloverleaf element in a human rhinovirus type 14/poliovirus chimera: Correlation of subdomain D structure, ternary protein complex formation and virus replication. J. Gen. Virol. 2003;84:2203–2216. doi: 10.1099/vir.0.19013-0. [DOI] [PubMed] [Google Scholar]
  26. Rohll, J.B., Percy, N., Ley, R., Evans, D.J., Almond, J.W., Barclay, W.S. The 5′-untranslated regions of picornavirus RNAs contain independent functional domains essential for RNA replication and translation. J. Virol. 1994;68:4384–4391. doi: 10.1128/jvi.68.7.4384-4391.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Santoro, S.W., Joyce, G.F. A general purpose RNA-cleaving DNA enzyme. Proc. Natl. Acad. Sci. 1997;94:4262–4266. doi: 10.1073/pnas.94.9.4262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Sarnow, P. Role of 3′-end sequence in infectivity of poliovirus transcripts made in vitro. J. Virol. 1989;60:1040–1049. doi: 10.1128/jvi.63.1.467-470.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Todd, S., Semler, B.L. Structure–infectivity analysis of the human rhinovirus genomic RNA 3′ non-coding region. Nucleic Acids Res. 1996;24:2133–2142. doi: 10.1093/nar/24.11.2133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Varani, G., Cheong, C., Tinoco, I. Structure of an unusually stable RNA hairpin. Biochemistry. 1991;30:3280–3289. doi: 10.1021/bi00227a016. [DOI] [PubMed] [Google Scholar]
  31. Walker, P.A., Leong, L.E., Porter, A.G. Sequence and structural determinants of the interaction between the 5′-noncoding region of picornavirus RNA and rhinovirus protease 3C. J. Biol. Chem. 1995;270:14510–14516. doi: 10.1074/jbc.270.24.14510. [DOI] [PubMed] [Google Scholar]
  32. Warren, J.J., Moore, P.B. Application of dipolar coupling data to the refinement of the solution structure of the Sarcin–Ricin loop RNA. J. Biomol. NMR. 2001;20:311–323. doi: 10.1023/a:1011214214552. [DOI] [PubMed] [Google Scholar]
  33. Wyatt, J.R., Chastain, M., Puglisi, J.D. Synthesis and purification of large amounts of RNA oligonucleotides. Biotechniques. 1991;11:764–769. [PubMed] [Google Scholar]
  34. Xiang, W., Harris, K.S., Alexander, L., Wimmer, E. Interaction between the 5′-terminal cloverleaf and 3AB/3CDpro of poliovirus is essential for RNA replication. J. Virol. 1995;69:3658–3667. doi: 10.1128/jvi.69.6.3658-3667.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Zell, R., Sidigi, K., Bucci, E., Stelzner, A., Gorlach, M. Determinants of the recognition of enteroviral cloverleaf RNA by coxsackievirus B3 proteinase 3C. RNA. 2002;8:188–201. doi: 10.1017/s1355838202012785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Zweckstetter, M., Bax, A. Prediction of sterically induced alignment in a dilute liquid crystalline phase: Aid to protein structure determination by NMR. J. Am. Chem. Soc. 2000;122:3791–3792. [Google Scholar]

Articles from RNA are provided here courtesy of The RNA Society

RESOURCES