Abstract
As a retrovirus, the human immunodeficiency virus (HIV-1) packages two copies of the RNA genome as a dimer in the infectious virion. Dimerization is initiated at the dimer initiation site (DIS) which encompasses stem-loop 1 (SL1) in the 5’-UTR of the genome. Study of genomic dimerization has been facilitated by the discovery that short RNA fragments containing SL1 can dimerize spontaneously without any protein factors. Based on the palindromic nature of SL1, a kissing loop model has been proposed. First, a metastable kissing dimer is formed via standard Watson-Crick base pairs and then converted into a more stable extended dimer by the viral nucleocapsid protein (NCp7). This dimer maturation in vitro is believed to mimic initial steps in the RNA maturation in vivo, which is correlated with viral infectivity. We previously discovered a small molecule activator, Lys-Ala-7-amido-4-methylcoumarin (KA-AMC), which facilitates dimer maturation in vitro, and determined aspects of its structure-activity relationship. In this report, we present measurements of the binding affinity of the activators and characterization of their interactions with the SL1 RNA. Guanidinium groups and increasing positive charge on the side chain enhance affinity and activity, but features in the aromatic ring at least partially decouple affinity from activity. Although KA-AMC can bind to multiple structural motifs, NMR study showed KA-AMC preferentially binds to unique structural motifs, such as the palindromic loop and the G-rich internal loop in the SL1 RNA. NCp7 binds to SL1 only an order of magnitude tighter than the best small molecule ligand tested. The study presented here provides guidelines for design of superior small molecule binders to the SL1 RNA that have the potential of being developed as an antiviral by either interfering with SL1-NCp7 interaction at the packaging and/or maturation stages.
As with other retroviruses, the human immunodeficiency virus type 1 (HIV-1) packages two homologous copies of its RNA genome into the virion. These RNA molecules form a non-covalent dimer in the 5’-untranslated region (5’-UTR) of the genome (1). The dimeric nature of the genome is important in many steps of the replication cycle including reverse transcription, recombination, and packaging, and consequently, for infectivity (reviewed in (2)); it also enhances the capacity of the virus for genetic variability (3). Molecular understanding of the genomic dimerization has been facilitated by the discovery of a short synthetic RNA fragment corresponding to the dimer initiation site (DIS) in the 5’-UTR that can dimerize spontaneously in vitro (4–8). DIS is located in the stem-loop 1 (SL1) of the HIV-1 5’UTR; it is the minimal RNA domain required to initiate dimerization in vitro (5, 7) and in vivo (9) SL1 (Fig. 1a) contains a 6-nucleotide (nt) palindromic sequence in its apex, which is usually flanked by two purines at the 5’-end and one purine at the 3’-end, especially in the A and B subtypes of HIV-1. An 11-base pair (bp) stem of SL1 contains a 1–3 G-rich internal loop (GRIL) G•AGG (5, 7). Although there are 64 possibilities for a 6-nt auto-complementary sequence, only two sequences are common: GCGCGC (e.g., in the Lai isolate) and GUGCAC (e.g., in the Mal isolate) (10), highlighting the importance of these particular palindromic sequences; furthermore, the palindrome is always six nucleotides. In addition, the flanking purines are also strongly conserved, and mutations or deletions in this region strongly impair dimerization and consequently infectivity(4, 11). Although the exact sequence of the stem does not appear as important, maintaining base pairing is essential (4), and the sequences of the upper 7-bp portion of the stem, CUUGCUG•CAGCAAG or CUUGCUG•CGGCAAG, also show strong conservation (12). Finally, the GRIL motif G•AGG and the lower 4-bp portion of the stem, CUCG•CGAC, are the most conserved features of SL1 among HIV-1 isolates (12).
Figure 1.
SL1 constructs and the kissing-loop model of dimer maturation. (a) Stem-loop SL1 of the HIV-1 packaging signal and its fragments used in this study. SL1-fl RNA corresponds to the full-length 35-nt SL1 RNA with two terminal base pairs (shown in low case) reversed in order to optimize the in vitro transcription yield. SL1-s RNA includes the palindromic loop and the upper stem of SL1 but not the G-rich internal loop; two G:C base pairs were added to the bottom of the lower stem to increase its stability. SL1-gril RNA consists of the stems and the G-rich internal loop of SL1 with an apical UACG loop added to reduce dimerization during folding and a dangling G residue added at the 5’ end to increase the transcription yield. Broken boxes enclose wild-type residues from SL1; non-wild-type residues are shown in low case and indicated with an asterisk. Two additional constructs were used in this study: SL1-yf consisted of the sequence SL1-fl with the three flanking purines (adenines) being replaced by pyrimidines (uracils), and SL1-yb consisted of SL1-fl with the GRIL purines all replaced with pyrimidines (two cytosines and two uracils). (b) A two-step model of SL1 dimerization. A metastable kissing dimer (top) is formed using the self-complementary sequence in the apical loops of SL1 monomers and it can be converted into a more stable extended dimer (bottom) by heat or viral NCp7 protein.
Based on the palindromic nature of the 9-nt loop, a kissing-loop model of dimerization has been proposed (5, 7) (Fig. 1b). The two RNA molecules first establish six Watson-Crick base pairs within the palindrome forming a metastable kissing dimer (KD). The resulting KD is stable at physiological temperature (37 °C), but it converts into a more stable extended dimer (ED) upon incubation at 55 °C (8, 13) or incubation at room temperature with the viral nucleocapsid protein (NCp7) (14), a proteolytic fragment of the Gag polyprotein. This in vitro NCp7-induced dimer conversion apparently mimics part of the maturation process that takes place in vivo, induced by the viral protease cleaving Gag, and is highly correlated with infectivity. Two HIV-1 genomic RNA molecules are selected in the cytoplasm for packaging into the budding virion (15), and then the RNA dimer undergoes a series of transformations during maturation inside the newly budded viral particles (16). RNA dimers extracted from immature (protease-deficient) viral particles are less thermostable than those from the mature virions (17). It is likely that general chaperone properties of NCp7 (18, 19) contribute to the dimer conversion; however, the detailed molecular mechanism of the conversion remains unclear.
In addition to the RNA dimerization, SL1 is also involved in the packaging of the genomic RNA into virions. SL1 and three other stem-loop structures, SL2, SL3, and SL4, comprise a so-called packaging Ψ site; SL1, SL3 SL4 and possibly SL2 together with the NCp7 domain within the Gag polyprotein are required for the packaging(9, 20). NCp7, either as a Gag domain or as a mature protein, binds each of the SL1–SL4 structures in vitro (21–23). Within SL1, there are two independent determinants specifically recognized by NCp7 in vitro: palindromic apical region with flanking purines and the G-rich internal loop; the apparent dissociation constants for the mature NCp7 binding to the two determinants are 100 and 140 nM, respectively (23).
During our experimental screening aimed to find a small-molecule to inhibit the NCp7-mediated SL1 dimer maturation that could serve as a novel HIV-1 therapeutic lead, we unexpectedly found a small-molecule activator, Lys-Ala-7-amido-4-methylcoumarin (KA-AMC, Fig. 2), that facilitated the in vitro dimer maturation instead with an effective concentration EC50 of 570 µM (24). One of the motivations of the present study was to investigate whether the activation effectivity of the SL1 maturation by small molecules is correlated with their binding affinity. We used NMR to probe structural features of KA-AMC binding to SL1 RNA and took advantage of fluorescent properties of coumarin to determine the apparent Kd values of the binding of KA-AMC and several related molecules to SL1. One of these molecules, Arg-Arg-7-amido-4-methylcoumarin (RR-AMC), binds SL1 with submicromolar affinity, less than one order of magnitude less tightly than NCp7. This study provides insights for discovering and optimizing small molecules inhibiting NCp7-SL1 interaction and thus targeting the HIV-1 RNA dimer maturation and/or HIV-1 RNA packaging.
Figure 2.
Small molecules used for the RNA binding: Lys-Ala-7-amido-4-methylcoumarin (KA-AMC), Arg-7-amido-4-methylcoumarin (R-AMC), Arg-Arg-7-amido-4-methylcoumarin (RR-AMC), and Arg-4-methyl-2-hydroxyquinoline (R-MHQ).
MATERIAL AND METHODS
RNA synthesis
All RNA oligonucleotides were synthesized by in vitro transcription (25) from a synthetic DNA template (Integrated DNA Technology, Inc) with T7 RNA polymerase prepared in-house (26). The oligonucleotides were then purified by 20% denaturing polyacrylamide gel electrophoresis (PAGE) followed by electroelution, dialyzed in sterile water, and stored at −20 °C. All RNAs were quantified using ultraviolet (UV) spectrophotometry with an extinction coefficient estimated with an online calculator to account for sequence (Ambion, Inc).
RNA sample preparation and dimerization assay
A 3 µM aliquot of SL1-fl RNA or its mutants or a 4 µM aliquot of SL1-s RNA (see Fig. 1a for nomenclature and sequences of SL1 constructs and Supplementary Data for sequences of SL1 mutants) in sterile water was heated to 90 °C for 5 min, followed by snap cooling on ice for 5 min. An equal volume of pre-chilled 2X dimerization buffer [20 mM potassium phosphate (pH 7.0), 100 mM NaCl, and 0.2 mM MgCl2] was added, and the sample was incubated on ice for an additional 30 minutes. Finally, the sample was equilibrated at room temperature for 2 h to fold into the KD conformation. The same RNA constructs were also folded into the ED conformation by heating to 90 °C followed by slow cooling to room temperature. SL1-yf consists of the sequence SL1-fl with the three flanking adenines being replaced by pyrimidines (uracils), and SL1-yb consists of SL1-fl with the GRIL purines all replaced with pyrimidines (C-UCU). SL1-yf and SL1–yb RNAs were folded into the KD conformation similarly, except for the 3 min heating and snap cooling periods. The native PAGE assay was used to distinguish KD and ED on the basis of their Mg2+ dependence (27–29). The metastable KD stays as a dimer during PAGE only in the presence of Mg2+ ions. It dissociates and migrates as a monomer when PAGE is performed in 1X Tris-borate-EDTA (TBE) buffer (89 mM Tris, 89 mM borate, pH 8.4, and 2 mM EDTA), with EDTA as a Mg2+ chelator. On the other hand, KD runs as a dimer in the presence of Mg+2 ions, such as in the 1X Tris-borate-MgCl2 (TBM) buffer (89 mM Tris, 89 mM borate, pH 8.4, and 3 mM MgCl2). The more stable ED does not need Mg2+ ions to stay as a dimer in the gel and runs as a dimer band in either electrophoresis conditions. By performing native PAGE in 1X TBE buffer, we can distinguish these two dimer forms and check the folding state of the RNA sample.
SL1-gril RNA in water was lyophilized and dissolved in 1X dimerization buffer (10mM potassium phosphate, pH 7.0, 50 mM NaCl, and 0.1 mM MgCl2) at a concentration of about 10 µM and then folded by heating to 90 °C for 5 min followed by snap cooling on ice for 30 min. The folding state of the sample was then checked by native PAGE. After folding, RNA samples were concentrated to a desired concentration with a centrifugal device (Amicon, 3,000 MWCO) for fluorescent emission quenching experiments or for NMR studies.
To study the specificity of small molecules binding, we also used the HIV-1 SL2 and SL3 stem-loops and a 18-nt duplex RNA (DUP RNA), consisting of residues 11–17 of the wild-type SL1 sequence preceded by two guanine residues with the remaining nine residues being self-complementary to the first nine (GGUGAAGCGCGCUUCACC) to be fully palindromic (see Supplementary Data). The SL2, SL3 and DUP RNA samples in water were lyophilized and dissolved in 1X dimerization buffer to a final concentration of 300–400 µM and folded by heating to 90 °C and slowly cooling to room temperature. SL2 and SL3 RNA constructs were shown to fold as a monomer previously (30, 31).
Native gel electrophoresis
Native PAGE experiments were carried out using a 10% bis:acrylamide (19:1) ratio of matrix at 25 °C in TBE buffer. A native dye, bromophenol blue, containing 10% glycerol was used in each case. Gels were run until the bromophenol blue ran to the bottom at 150 volts. Gels were then stained with ethidium bromide and visualized by UV light.
Apparent dissociation constant measurement using fluorescence emission quenching of coumarin and hydroxyquinoline derivatives
The following small molecules were used for the fluorescence quenching experiments (Fig. 2): Lys-Ala-7-amido-4-methylcoumarin (KA-AMC), Arg-7-amido-4-methylcoumarin (R-AMC), Arg-Arg-7-amido-4-methylcoumarin (RR-AMC), and Arg-4-methyl-2-hydroxyquinoline (R-MHQ). KA-AMC and R-AMC were purchased from Sigma, RR-AMC was purchased from Bachem, and R-MHQ was synthesized (24). A 300 µL aliquot of a small molecule at a concentration of 1 µM in 1X dimerization buffer was placed in a cylindrical micro-fluorescence cell (Horiba). The initial emission fluorescence of the coumarin derivatives and R-MHQ was measured at 390 nm and 364 nm, respectively, with FluoroMax3 fluorescence spectrometer (Horiba) using an excitation wavelength of 323 nm. Aliquots of RNA ranging from 300–400 µM were added to the sample for the “reverse” titration. The samples were mixed briefly and the fluorescence measured. The fluorescence quenching data were fitted to the following equation (32) by varying parameters θb and Kd with Microsoft Excel:
(Eq. 1) |
where θ = F/F0 is the ratio of the observed fluorescence intensity F to the initial intensity F0.θb = Fb/F0 is a similar ratio at the saturating binding, RT and LT are the total concentrations (bound plus unbound) of RNA and ligand, respectively, and Kd is the dissociation constant.
NMR experiments for monitoring KA-AMC binding to SL1 RNA
All NMR experiments were performed on a Varian INOVA 600 MHz NMR spectrometer. Spectra in D2O were acquired at either 25 or 30 °C, and spectra in 90/10% H2O/D2O were acquired at 10 °C. Samples of SL1-fl in either KD or LD forms, SL1-s in KD form and SL1-gril had concentration ranging from 250 to 400 µM in RNA strands and varying concentration of KA-AMC in 1X dimerization buffer. In the case of H2O samples, water suppression was achieved using a symmetrically-shifted shaped pulse (33) for the acquisition of two-dimensional (2D) NOESY spectra and using the “watergate" method (34) for the acquisition of one-dimensional (1D) spectra. Spectra for samples in D2O were acquired without any suppression of the residual HDO signal. Changes in RNA proton chemical shifts upon addition of varying concentrations of KA-AMC were monitored in 1D spectra (for imino protons), 1D traces of T1 inversion recovery experiment (for selective detection of slowly relaxing adenine H2 protons), and 2D NOESY and TOCSY spectra (for other aromatic and sugar protons). Partial proton chemical shift assignments of SL1-fl, SL1-s and SL1-gril alone and in complexes with KA-AMC were achieved based on homonuclear 2D NOESY (recorded with mixing times varying from 50 to 600 ms) and 2D TOCSY spectra with the help of published assignments for SL1-fl in the LD form (35). 1D data were processed and analyzed with vnmr (Varian); 2D data were processed with NMRpipe (36) and analyzed with SPARKY (37).
Molecular modeling of a KA-AMC:SL1 RNA complex
Molecular docking of KA-AMC to SL1 RNA was performed using the MORDOR software (38). For docking to the flanking A’s site in the KD form, the receptor structure was based on the NMR structure of the KD SL1 RNA lacking the G-rich internal loop, PDB 2D19 (39). The palindromic sequence GCGCGC was substituted for GUGCAC in 2D19, and the structure was energy-minimized with the miniCarlo program (40) before passing it to MORDOR. For docking to the GRIL motif, the receptor structure was based on the NMR structure of the full-length SL1 RNA in the ED form, PDB 2GM0 (35). The sequence of the lower stem CUCG•CGAG was substituted for GACG•CGUC in 2GM0, and the structure was energy-minimized. All calculations were done separately for models 1, 2, and 7 of the 2GM0 NMR ensemble, which represent distinct GRIL conformations.
The receptor and ligand preparation and docking procedures with flexible receptor and ligand were similar to those previously described (38, 41), except that the RNA receptor was allowed more flexibility. For this purpose, distance restraints between heavy atoms compatible with an envelope of A-RNA conformations were imposed on RNA stems on both sides of the binding pocket (ca. 7.4 restraints per residue with an average flat-well width of 0.5 Å), while the binding pocket itself was unrestrained. Compared to using an atomic root mean square deviation restraints imposed on the overall RNA conformation (38), this method allows hinge motions around the binding pocket and a greater flexibility inside the pocket. In addition, the generalized Born method (42) was used to account for solvation during some of the docking runs. Between 5,000 and 10,000 binding poses were generated for each RNA receptor and docking run as described previously (38). The docked RNA – KA-AMC complexes were scored according to their interaction energy consisting of van der Waals, electrostatic, and the Generalized Born solvation energy terms. About 1% of the best scoring poses were examined manually using the ViewDock option of UCSF Chimera (43) and then further assessed based on qualitative agreement with NMR observations.
RESULTS
Design of RNA constructs
SL1 folds into a stem-loop structure with an apical 9-nucleotide loop and a stem containing a 1–3 internal loop (5, 7). Both the apical and the internal loops are strongly conserved among HIV-1 isolates; they bind NCp7 in vitro, and mutations in these regions impair viral infectivity. The full-length SL1 (SL1-fl, Fig. 1a) is the same 35-nt RNA construct that we have used previously (12, 24, 35). It has the sequence corresponding to the wild-type Lai HIV-1 isolate, except for the inversions in the two terminal base pairs, which enabled optimization of the in vitro transcription yield. We have also designed two truncated constructs, each containing one of the important features of SL1, namely SL1-s (“stable” SL1) and SL1-gril (monomerized SL1 with GRIL motif) (Fig. 1a). The SL1-s construct lacks GRIL but contains the palindromic apical loop and the upper stem of the full-length construct with two additional GC base pairs at the terminus to increase the stem stability. Increasing the stem stability reduces a possibility for the KD-to-ED dimer conversion during experiments and simplifies NMR data analysis significantly.
SL1-gril contains the stem-G-rich internal loop-stem motif of the wild-type SL1 with the addition of an apical UACG tetraloop. The tetraloop increases the construct stability, reduces dimerization during folding and furthermore provides a unique signature in NMR (44). In addition, we used the SL1-yf and SL1-yb mutants of SL1-fl (12). In SL1-yf (see Supplementary Data), the adenines flanking the palindrome, i.e., A13, A14 and A21, were substituted with uracils. In SL1-yb, the 1–3 internal loop G-AGG was substituted with an all-pyrimidine 1–3 internal loop C-UCU. Finally, to study the specificity of the small molecules binding, some experiments were carried out for the HIV-1 SL2 and SL3 stem-loops and for an 18-bp duplex RNA. The self-complementary RNA sequence used to form the duplex, GGUGAAGCGCGCUUCACC, has the SL1 palindromic hexanucleotide in the middle (shown in bold). The wild-type sequence GGCGACUGGUGAGUACGCC was used for the SL2 construct (the tetraloop and the A-bulge are shown in bold). Three base pairs were added to the SL3 sequence ggaCUAGCGGAGGCUAGucc to optimize the yield of the in vitro transcription (the tetraloop is shown in bold and the non-wild-type residues are shown in lower case).
RNA constructs SL1-fl and SL1-s were folded as desired, either in the KD or ED form, and constructs SL1-yf and SL1-yb were folded in the KD form, which was then verified using the magnesium-dependent native PAGE assay as described in Materials and Methods (data not shown). Native PAGE was also used to verify that SL1-gril, SL2 and SL3 were folded as monomers (data not shown).
Binding affinity of small molecules to SL1 RNA by Fluorescence Emission Quenching
We used the fluorescence properties of the small molecules shown in Fig. 2 to measure their binding affinity to SL1 RNA and its mutants. The fluorescence emission intensity quenches when these small molecules bind to RNA. The apparent dissociation constants Kd were determined by fitting the fluorescence intensity binding curves to Eq. 1, which assumes a single binding site on the receptor, hence the apparent character of the Kd values. For monomeric constructs SL1-gril, SL2, and SL3, the total RNA concentration in Eq. 1 represents the concentration of the binding sites, and the resulting Kd values characterize the affinity to this site. In the case of dimeric RNA constructs, SL1-fl, SL1-s, SL1-yf and SL1-yb, each symmetric dimer carries at least two equivalent binding sites. Because the RNA concentration was expressed in terms of RNA strands (which is again equivalent to the binding site concentration), the resulting Kd values characterize the affinity to each binding site, assuming that the two symmetric binding sites do not interact with each other. There are, however, two pairs of distinct binding sites on the full-length SL1-fl RNA construct, one associated with the palindromic region of SL1 and another associated with the GRIL motif (see also NMR chemical shift mapping below). The palindromic and GRIL binding sites are dissociated from each other in the SL1-s and SL1-gril constructs. If all four (two pairs) binding sites had equal affinity and were non-interacting, then the apparent Kd values measured for the SL1-fl must be one half of those for SL1-s and SL1-gril, which is indeed approximately the case for R-AMC, RR-AMC and R-MHQ (Table 1). However, in the case of KA-AMC, the GRIL motif appears to be the dominant binding site with the apparent affinity to SL1-gril more than twice as strong as the affinity to SL1-s (Table 1).
Table 1.
Apparent dissociation constants Kd (µM) for the binding of small molecules to SL1-related RNA constructsa
RNA | KA-AMC | R-AMC | RR-AMC | R-MHQ |
---|---|---|---|---|
SL1-fl KD | 19.3 (4.2) | 5.2 (0.3) | 0.5 (0.1) | 3.5 (0.3) |
SL1-s KD | 50.0 (3.7) | 12.6 (0.1) | 1.4 (0.2) | 5.9 (1.0) |
SL1-gril | 20.9 (0.2) | 12.2 (0.6) | 0.9 | 7.1 (0.8) |
Duplex RNA | 122.1 | 82.2 | 16.5 | 98.5 |
Apparent dissociations constants were determined based on fluorescence quenching of small molecules upon RNA titration as explained in Materials and Methods. Standard deviations are shown in parentheses, determined in 2–4 independent experiments. The data on binding to duplex RNA and on binding of RR-AMC to SL1-gril are preliminary, determined based on a single experiment.
Matters are further complicated by the fact that each of the four small molecules studied here binds to the duplex RNA, albeit with lower affinities (Table 1). This was expected, of course, because of the negative charges on the RNA backbone and positive charges on small molecules; this further highlights the apparent character of the measured Kd values. We must note, however, that this is also the case for NCp7, which also binds non-specifically RNA and DNA stems (see, e.g., (19, 45)) in addition to specific binding to a number of structural RNA motifs; the measured affinities of NCp7 to such motifs must therefore also be considered as apparent (23). This parallel is important, as it allows a direct comparison of apparent affinities of the small molecules measured here and apparent NCp7 affinities (23).
Fig. 3 shows the results of fitting of the fluorescence binding curves measured in representative experiments for SL1-gril. Despite the fact that the small molecules bind to the stems in addition to the G-rich internal loop, the curves can be fit reasonably well to Eq. 1, probably because the continuing ligand binding at higher RNA concentrations is absorbed by the apparent value of θb = Fb/F0.
Figure 3.
Representative fluorescence quenching curves for small molecules binding to SL1-gril RNA: KA-AMC (diamonds), R-AMC (triangles), R-MHQ (squares), RR-AMC (circles). The lines show the results of the fitting to Eq. 1.
Each of the four small molecules binds SL1-related RNA constructs in the low micromolar or sub-micromolar range (Table 1), with the apparent dissociation constant decreasing in the following order: KA-AMC > R-AMC > R-MHQ > RR-AMC for each RNA. The arginine guanidinium group makes a ligand a tighter binder compared to the localized amino group on lysine, and increasing the ligand positive charge also improves the affinity. This trend parallels the order of effectivity of KA-AMC, R-AMC and RR-AMC in promoting the KD-to-ED conversion, but this order is reversed for R-AMC and R-MHQ (24). Substituting lysine for arginine and increasing the positive charge also leads to tighter non-specific binding to duplex RNA (ranging from 122 µM for KA-AMC to 16 µM for RR-AMC), which is expected due to the negatively charged RNA backbone. What was not expected is that the specificity of binding also increases in approximately the same order (defined as the ratio of the apparent Kd values for the binding to the duplex RNA compared to the binding site of interest, either palindromic loop or GRIL, Table 2). For the two tighter ligands, R-MHQ and RR-AMC, the specificity is about a factor of ten. At the same time, the apparent binding affinity of the tightest ligand, RR-AMC, approaches that of NCp7 binding to the same RNA sequences (23), with a factor of 14 difference for the palindromic loop (SL1-s) and a factor of 6 for the G-rich internal loop (SL1-gril). RR-AMC and KA-AMC bind somewhat better (by a factor of 1.6 and 2.4, respectively) to GRIL than to the palindromic loop, while R-AMC and R-MHQ bind both sites equally within experimental error (Table 1).
Table 2.
Specificity of small molecules binding to SL1-related RNA constructsa
RNA | KA-AMC | R-AMC | RR-AMC | R-MHQ |
---|---|---|---|---|
SL1-fl KD | 6.3 | 15.8 | 33.0 | 28.1 |
SL1-s KD | 2.4 | 6.5 | 11.8 | 16.7 |
SL1-gril | 5.8 | 6.7 | 18.3 | 13.9 |
Defined as the ratio of the apparent Kd for the binding to duplex RNA and apparent Kd for the binding to SL1-related RNA construct (Table 1).
KA-AMC binds the KD and ED isomers with similar affinities, with apparent Kd of 19-23 µM and 48–50 µM for SL1-fl and SL1-s, respectively (compare Tables 1 and 3). Under the experimental conditions used in this study, the four small molecules do not convert KD into ED; we verified this for RR-AMC (data not shown), which was shown to be the most efficient conversion activator of the four ligands (24). Indeed, such a conversion was not expected, because the KD-to-ED conversion requires much higher, sub-millimolar concentrations of the small molecules (24).
Table 3.
Apparent dissociation constants Kd (µM) for the binding of KA-AMC to various RNA constructsa.
SL1-fl ED | SL1-s ED | SL1-yf KD | SL1-yb KD | SL2 | SL3 |
---|---|---|---|---|---|
22.6 | 47.6 | 22.9 | 26.3 | 47.5 | 65.1 |
(2.5) | (2.1) | (1.1) | (1.9) | (2.4) | (0.4) |
Apparent dissociations constants were determined based on fluorescence quenching of small molecules upon RNA titration as explained in Materials and Methods. Standard deviations are shown in parentheses, determined in 2–4 independent experiments.
In addition to SL1 binding and non-specific binding to the RNA duplex, KA-AMC also binds to SL2 and SL3 HIV-1 stem-loop constructs (Table 3) with an apparent affinity similar to that of the palindromic loop (SL1-s). Interestingly, NCp7 has similar binding properties; in fact, it binds SL2 and SL3 tighter (Kd 23 and 28 nM, respectively) than it binds the palindromic loop and GRIL (Kd 100 and 140 nM, respectively) (23). KA-AMC also binds the KD forms of a number of SL1 mutants, including the ones with the flanking adenines of the palindromic loop (SL1-yf) or purines of the GRIL motif (SL1-yb) changed to pyrimidines, with an apparent affinity similar to that for SL1-fl (Table 3). While the NCp7 affinity to these mutants have not been measured, NCp7 is likely to bind them as well, because it promotes their conversion from KD to ED form as readily as it does for SL1-fl (12).
Chemical shifts and line width of KA-AMC protons upon binding to RNA
Proton NMR resonances of KA-AMC undergo significant shifts and line broadening upon binding to SL1-fl KD (24). Similar effects are also observed for the KA-AMC binding to SL1-s KD (Supplementary Data) and SL1-gril (Fig. 4). Resonances of protons on the aromatic ring of KA-AMC experience very strong upfield shifts, indicative of ring stacking with bases on RNA, with the effect being even more pronounced for binding to SL1-gril than to SL1-s KD (Supplementary Data). Resonances of the alanine side chain protons display a modest upfield shift, and lysine protons display a modest downfield shift upon binding. The line broadening of KA-AMC protons suggests an exchange process at the intermediate NMR time scale, either between bound and unbound states or between different binding modes. The line broadening is especially severe for the ring protons of KA-AMC, which is correlated with the large changes in chemical shifts for these protons. The observed moderate shift of the resonances of the ring KA-AMC protons back in the downfield direction with increasing concentration of the added ligand (Fig. 4 d through f) is most likely due to the simultaneous increase in the fraction of unbound ligand. At the same time, there is no detectable ligand line sharpening at higher ligand concentrations, which would be expected if the line broadening was entirely due to intermediate exchange between bound and free states. Further, the RNA proton signals continue to change with the continuing titration of the ligand (Fig. 4). In sum, these results strongly suggest the existence of exchange between multiple binding modes.
Figure 4.
1D proton NMR spectra at 25 °C of KA-AMC (0.5 mM) and SL1-gril RNA (280 µM). (a) KA-AMC alone, (b) SL1-gril alone, and the following SL1-gril – KA-AMC complexes: (c) 1:0.25, (d) 1:0.5, (e) 1:1, and (f) 1:2. The vertical scale in the traces in the right parts of (c) and (d) is increased compared with the rest of the traces.
RNA chemical shift mapping of KA-AMC interactions with SL1 constructs
To identify the KA-AMC binding sites and characterize ligand-RNA interactions, we monitored changes in proton NMR chemical shifts of SL1-fl KD and ED, SL1-s KD and SL1-gril during the KA-AMC titration. Chemical shifts of unbound SL1-fl ED have been assigned previously (35), and chemical shifts of other unbound RNAs were partially assigned using homonuclear NMR techniques. As expected, chemical shifts of residues of the palindromic loop and flanking adenines were similar in SL1-fl KD and SL1-s KD, and chemical shifts or residues in or near the GRIL motif were similar in SL1-fl KD and SL1-fl ED (data not shown). However, GRIL residues had somewhat different chemical shifts in the SL1-gril and SL1-fl constructs; G4 and G5 fingerprint cross-peaks could not be assigned in SL1-gril using homonuclear NMR due to the spectral overlap (data not shown). These differences are likely to reflect somewhat different structures and/or dynamics at this site and must be due to the different sequence of the lower 4-bp stem in these RNA constructs (Fig. 1a): the sequence is wild-type with the dangling G at the 5’-end in SL1-gril and it has a blunt end with two reversed base pairs in SL1-fl. The solution conformation of this site is known to be sensitive to relatively minor changes in the sequence (see (35)).
We followed the changes in proton chemical shifts of RNA constructs upon addition of KA-AMC using well-resolved regions of various NMR spectra: imino protons in 1D spectra in water, pyrimidine H5–H6 cross-peaks and occasional H2’-H1’ cross-peaks (for flexible residues and residues in C2’-endo conformation) in 2D TOCSY spectra. The adenine H2 resonances were monitored using the T1 inversion recovery 1D traces with an appropriate delay after the inversion 180 ° pulse, which allowed for the recovery of most aromatic protons but kept the H2 resonances negative due to a longer T1 relaxation time for these protons. In addition, we followed the H1’-H6/H8 fingerprint cross-peaks in 2D NOESY spectra for SL1-s and SL1-gril. Some of the NMR spectra are shown in Supplementary Data, and the results are summarized in Figure 5. Figure 5 shows maximum absolute values of changes in chemical shifts among protons in each RNA residue upon KA-AMC titration to SL1-flKD, SL1-sKD, and SL1-gril.
Figure 5.
Changes in chemical shifts of SL1 constructs upon binding of KA-AMC. The titration data for SL1-fl KD (a), SL1-s KD (b) and SL1-gril (c) are presented as the absolute values of the maximum chemical shift differences Max|ΔCS|. Data from the following NMR experiments were included in this index: 1D spectra in water, TOCSY and T1 inversion recovery spectra in D2O at 25 °C for SL1-fl; 1D spectra in water at 10 °C, TOCSY and T1 inversion recovery spectra in D2O at 30 °C for SL1-s; TOCSY in D2O at 25 °C for SL1-gril. Water 1D spectra provide chemical shifts of the imino protons H1 and H3, inversion recovery experiments provide adenine H2 chemical shifts, and TOCSY experiments provide chemical shifts of H5 and H6 protons in pyrimidines and H1’ and H2’ protons for riboses in the C2’-endo conformations. The SL1-fl and SL1-s data for the 2:1, 2:2 and 2:4 RNA-ligand complexes (using RNA strands concentration) are shown as black, grey, and open bars, respectively. The SL1-gril data for the 1:1 and 1:2 RNA-ligand complexes are shown as grey and open bars, respectively. Black, grey, and open triangles indicate residues undergoing resonance broadening observed starting with the 2:1, 2:2 and 2:4 complexes, respectively. “N” indicates that information about a residue is not available for comparison. RNA regions corresponding to the palindromic loop and the GRIL motif are indicated above the graph. Residues are numbered according to Fig. 1a.
It is clear that most RNA residues are affected at least to some extent by adding KA-AMC, even if it is only line-broadening for some residues. Part of this may relate to the capability of KA-AMC to bind even to the RNA duplex (Table 1). In particular, the RNA termini are affected by ligand binding (see, e.g., G1 H2’-H1’ cross-peak in Fig. 6a) consistent with the coumarin ring stacking to the stem end, but the interior of RNA stems are affected as well, especially at higher ligand ratios (Fig. 5a). Nevertheless, the maximum chemical shift changes occur for the residues at or near two distinct sites: the palindromic loop with flanking adenines (SL1-fl KD and ED and SL1-s KD) and the GRIL motif (SL1-fl KD and ED and SL1-gril), consistent with higher apparent affinity of KA-AMC to these sites compared with duplex RNA (Table 1). Indeed, the small shifts and broadening seen for residues in the interior of the stems may arise from the ligand binding to these two preferred sites. For the palindromic loop site in the KD form, SL1-fl KD and SL1-s KD, we observed KA-AMC-induced perturbation of the imino signals of U11, G23, G12, G17, and of overlapping imino signals in the palindrome GCGCGC itself, H5-H6 TOCSY cross-peaks of C16, C20, and C22, H1’-H2’ TOCSY cross-peaks of G12 and A14, A13 H8 proton and H2 protons in some of the flanking adenines resolved in the inversion recovery experiment (Supplementary Data). Additionally, we compared 2D NOESY spectra of the free RNA versus that of a 2:2.25 RNA:KA-AMC complex for SL1-s KD and observed a loss of intra-residue H1’-H6/H8 NOE cross-peaks in residues G12, A13, G15, A21, and C22 in the complex and a loss of sequential H1’-H6/H8 NOE cross-peaks for the U11 – G12, G12 –A13, A13 –A14, G15 –C16, C20 – A21, and A21 –C22 connectivities. All these changes are centered at the region of the flanking adenines next to the palindromic loop. Loss of the sequential H1’-H6/H8 NOE cross-peaks may reflect changes in RNA structure upon KA-AMC binding. However intra-residue H1’-H6/H8 NOE’s are expected for any RNA conformation, and their loss can only result from exchange broadening of proton resonances. We also observed broadening of many other NMR signals (shown with triangles in Fig. 5a, 5b), such as imino protons of G12, U11 and G23, overlapping imino signals in the palindrome, H5–H6 TOCSY cross-peaks in C16, C20, and C22, H1’–H2’ TOCSY cross-peak in G12, adenine H2 protons in the flanking A’s region, and A13 H8. Interestingly, KA-AMC binding to another dimeric form, SL1-fl ED, caused a distinctly different pattern of chemical shift perturbations around the flanking A’s site (data not shown), suggesting a distinct binding mode. Although line broadening was also observed for many residues in SL1-fl ED upon ligand binding, the imino protons in the palindromic GC pairs remained sharp (data not shown).
Figure 6.
Examples of NMR line broadening for RNA protons upon addition of KA-AMC showing superposition of TOCSY spectra for various RNA-ligand ratios. (a) SL1-s KD complexes with KA-AMC. The RNA strand concentration was 400 µM. Spectrum of RNA alone is shown in black, and spectra for the 2:0.5, 2:1, 2:2, and 2:4 RNA strands-to-ligand ratios are shown in magenta, blue, red and green colors, respectively. (b) SL1-gril with RNA concentration of 280 µM. Spectrum of RNA alone is shown in black, and spectra for the 1:0.5, 1:1, and 1:2 RNA-to-ligand ratios are shown in blue, red and green colors, respectively. Arrows show the direction of chemical shift changes for the A14 H2’-H1’ (a) and C6 H5-H6 (b) cross-peaks.
KA-AMC-induced perturbations in and near the GRIL site (in SL1-fl KD and ED, and in SL1-gril) include the H1’–H2’ cross-peaks in G30 and G31 and the H5–H6 cross-peaks in C3, C6, and C32. NMR signals of the tetraloop UACG in SL1-gril were not affected significantly during titration. In addition, we observed significant broadening of NMR signals for residues near the GRIL motif upon adding KA-AMC, including the H1’–H2’ TOCSY cross-peaks in G30 and G31, H5–H6 TOCSY cross-peaks in C6 and C32, and many intra-and inter-residue H1’–H6/H8 cross-peaks in A29, G30, G31, and C32 (shown with triangles in Fig. 5a and 5c). NMR line broadening at both the palindromic loop and GRIL sites suggests exchange processes at an intermediate NMR time scale, which could be associated either with bound and unbound conformations or with multiple binding modes of KA-AMC (see Discussion, below).
Docking
No reliable inter-molecular NOE cross-peaks were observed in any of the KA-AMC complexes with SL1 RNA constructs due to severe line broadening for RNA and especially ligand signals. This made impossible any NMR-based structure determination. Therefore, we carried out several computational docking experiments to explore possible KA-AMC binding poses to the flanking adenines and GRIL sites using the flexible docking program MORDOR (38). MORDOR permits both RNA and ligand flexibility, so they can bind to each other via induced fit while predicting complex structures that are energetically feasible. In most high-scoring (low energy) binding poses to the SL1-s receptor, the coumarin ring of KA-AMC forms stacking interactions with RNA bases, consistent with the strong upfield shift of the ring protons of the ligand as well as base protons. A number of stacking arrangements have resulted from the docking; the most prevalent motifs for binding to the flanking adenine region are shown in Fig. 7a and 7b. In both cases, A13 is displaced from its position in the free RNA (39) with the coumarin ring. In the first motif (Fig. 7a), the coumarin ring is stacked directly under A13, and the coumarin ring oxygen is forming a H-bond with the amino group of A21. The alanine residue of KA-AMC is located above G12, which can explain moderate upfield shifts of its proton resonances (see above). The amide and guanidinium groups of KA-AMC are making H-bonds with the phosphate and sugar groups on both strands of SL1-s. In the second motif (Fig. 7b), the coumarin ring is stacked between G12 of the one strand and G15 of the other strand of SL1-s, and the amide and guanidinium protons are making H-bonds with the phosphate and sugar groups from the lower half of SL1-s. It is interesting that RNA residues that are stacked directly against or hydrogen bonded with the coumarin ring in at least one of the binding poses, i.e., G12, A13, G15, A21, are not observed in NMR spectra of the SL1-s complex with KA-AMC (Fig. 5b), presumably due to a severe line broadening. While the absence of NMR signals cannot be considered as a proof of binding poses presented in Fig. 7a and 7b, it is certainly consistent with the existence of multiple modes of binding, which are exchanging on the intermediate time scale. Neighboring residues, such as A14, C20, C22, are still observed in NMR spectra (Supplementary Data), but they typically experience chemical shift perturbations and moderate line broadening upon KA-AMC binding (Fig. 5b and 6a), probably caused by the propagation of a structural perturbation due to ligand binding.
Figure 7.
Stereo views of binding poses of KA-AMC at the flanking adenines (a, b) and GRIL (c, d) binding sites calculated with MORDOR (38). RNA is shown schematically using ribbon representation of the backbone and filled representation for bases and sugars with UCSF Chimera (50). Adenines flanking the palindromic apical sequence (a, b) and purines of the GRIL motif are shown in light red. Carbon atoms of KA-AMC are shown in yellow. H-bonds between KA-AMC and RNA are shown with thin orange lines.
Although the GRIL motif is structurally very different from the adenines flanking the palindromic kissing loop, KA-AMC binding poses calculated with MORDOR share common features for the two binding sites. First, there are multiple low-energy binding poses for the KA-AMC docked with the GRIL motif. Second, the coumarin ring is stacked against RNA bases and the amide and guanidinium groups of KA-AMC are making H-bonds with sugars and phosphates of RNA in high-scoring poses for SL1-gril. Two typical motifs are shown in Fig. 7c and 7d. In the first pose (Fig. 7c), the coumarin ring is stacked between G4 and G5, while the carbonyl oxygen of coumarin is making a H-bond with the amino group of G30, and the 7-amido group and the lysine guanidinium group are making H-bonds with RNA phosphate oxygens. At the same time, the amide protons of alanine and lysine form H-bonds with the Hoogsteen edge of G4. In the second pose (Fig. 7d), the coumarin ring is stacked directly above G31 and partially (at an angle) below A29. The coumarin carbonyl oxygen is making bifurcating H-bonds with the amino group of G5 and the ribose hydroxyl group of G30. In addition, the alanine amide oxygen is making a H-bond with the amino group of A29, and the lysine guanidinium group is H-bonding with a phosphate oxygen. Comparison of the predicted binding poses with NMR data is complicated by the lack of assignments for residues G4 and G5. Nevertheless, the maximum chemical shift and line width perturbations are observed for the purines of the GRIL motif A29, G30, G31, and for the neighboring residues C6, A27 (Fig. 5c, 6b). An interesting spectral feature of the KA-AMC binding to SL1-gril is a significant downfield shift of H1’ and H2’ resonances in G30 and a simultaneous upfield shift of H1’ and H2’ in G31 (Supplementary Data). Based on ring current effect considerations, it seems that the G31 ribose must be directly above or below the coumarin ring while the ribose of G30 should be in the same plane with the coumarin ring in some of the binding poses. The ribose hydroxyl group of G30 is hydrogen-bonded to the coumarin carbonyl oxygen in the pose shown in Fig. 7d; this indeed brings the G30 H2’ proton approximately in the plane of the coumarin ring. However, these binding poses do not explain the upfield shifts of the G31 ribose. Another possible explanation may involve changes in the conformational equilibrium of the GRIL motif, which displayed conformational heterogeneity even without a ligand (35), and remained flexible upon KA-AMC binding.
DISCUSSION
Previously, we reported small molecules that facilitate the HIV-1 SL1 RNA dimer maturation in vitro and performed an SAR study to identify important features for its activity (24); however, it remained unclear whether the enhancement in activity correlates with RNA binding affinity or the specificity of binding. We utilized the intrinsic fluorescence emission of coumarin and hydroxyquinoline derivatives and their quenching by RNA to estimate binding affinity of these molecules to the SL1 RNA. We also probed structural features of KA-AMC – SL1 RNA interactions with NMR.
We observed our activator molecules bind to many structural motifs with varying affinity, similarly to the viral NCp7 protein. Although the total charge of a small molecule is an important determinant for binding affinity (compare R-AMC with RR-AMC in Table 1), a particular pattern of donor and acceptor atoms is also important (compare binding affinity of KA-AMC and R-AMC in Table 1). Although increasing positive charge facilitates both specific and unspecific binding to RNA, we found that the four small molecules studied have a binding preference to unique structural motifs (16-fold preferential for SL1-fl KD over a duplex RNA (apparently stacking with terminal base pair) for R-AMC and 33-fold for RR-AMC, Table 2). KA-AMC showed preferential binding to the GRIL motif of SL1 (Table 1) compared to other stem-loops in the HIV-1 psi locus, SL2 and SL3 (Table 3). Furthermore, the coumarin derivatives appear to have some preference for GRIL (in SL1-gril RNA) over the flanking A’s site (SL1-s), while R-MHQ binds slightly better to the flanking A’s (Table 1). Finally, there is no complete correlation between the SL1 maturation activation in vitro and SL1 binding. While both maturation activation and binding increase in the order KA-AMC < R-AMC < RR-AMC, this order is reversed for R-AMC and R-MHQ activation versus binding. The SAR study highlighted the importance of the coumarin ring oxygen as the H-bond acceptor for activity. When this oxygen in R-AMC is substituted by a H-bond donor NH in R-MHQ, the activity decreases 3-fold (24), while the apparent affinity is higher for R-MHQ both for binding to the full-length SL1-fl KD and for binding to the individual sites of SL1 (Table 1). This shows that the difference in the maturation activity does not solely depend on the apparent binding affinity.
It is significant that we discovered a strong SL1 RNA binder, RR-AMC, with an apparent binding constant in the high nanomolar range. SL1 ligands may potentially have an antiviral effect by competing with the SL1 – NCp7 interaction and thus either inhibiting RNA packaging or affecting RNA maturation. RR-AMC binds the flanking A’s site and GRIL 14-and 6-fold less tightly, respectively, than NCp7. At the same time, RR-AMC binds these two sites 12-and 18-fold tighter, respectively, than it binds duplex RNA. Most known clinically useful RNA ligands bind RNA with micromolar affinity and have a molecular weight greater than 500 daltons (46, 47). RR-AMC has a molecular weight 488 Da with a total charge of +3; it has the potential of being further optimized into an even tighter and more specific SL1 binder, although gaining specificity is expected to be a major difficulty.
We probed structural features of the KA-AMC – SL1 RNA interactions using NMR. Consistently with binding constant measurement by fluorescence, we observed that KA-AMC binds to many different RNA motifs; however, it binds preferentially to the flanking adenines and GRIL in the SL1-s and SL1-gril constructs, respectively. Upon binding, ligand and RNA residues in the binding site undergo conformational exchange in the intermediate NMR time range, indicated by the severe line broadening. Line broadening due to possible aggregation effects was excluded for the experimental conditions utilized by using NMR to determine diffusion coefficients for the RNA signals (data not shown). Broadening of NMR signals may be caused by chemical exchange either between bound and free RNA conformations or between multiple bound conformations (48). One can distinguish these possibilities by titrating great excess of ligand into the receptor. If exchange between free and bound species is the cause of broadening, adding ligand in large excess over that required to saturate the binding site should result in line narrowing for the RNA signals to a value reflecting the molecular tumbling of the RNA-ligand complex since the exchange term contributing to line broadening becomes vanishingly smaller. On the contrary, if the ligand has multiple binding modes of comparable energy, there would always be heterogeneity in RNA conformations, so the line shape would remain broad for RNA signals even in great excess of ligand. Our NMR data show no sharpening of any signals at higher KA-AMC ratios (see A14 H8 and A26 H2 peaks in Fig. 4 and Supplementary Data), implying that KA-AMC has multiple binding modes. On the other hand, these arguments are strictly speaking applicable to the situation with a single binding site. For example, assuming a single binding site for the GRIL motif with Kd of 21 µM, 76% of RNA and ligand is expected to be in the bound state for the 1:1 RNA – ligand complex (Fig. 4e). For the 1:2 RNA – ligand complex (Fig. 4f), 93% of RNA but only 47% of ligand are expected to be bound. Then, we would expect significant line sharpening and significant recovery of the chemical shift perturbation for the ligand signals, which clearly occurs for the ligand residues. However, there is no evident narrowing of signals from the RNA. The situation is complicated by a continuing non-specific ligand binding to other RNA residues, consistent with its ability to bind RNA duplex (Table 1). Therefore, we cannot completely exclude a contribution of intermediate exchange between bound and unbound states into the observed RNA lines broadening. Nevertheless, existence of multiple binding poses in each specific site, GRIL and the flanking adenines of the KD form, is consistent with our computational docking results (Fig. 7). We could not determine the solution structures of the complexes because of the absence of observable intermolecular NOE cross-peaks, also likely because of the conformational averaging. However, many features of predicted binding poses are in qualitative agreement with chemical shift perturbations. Furthermore, preliminary studies using molecular dynamics simulations of selected binding poses with explicit water using AMBER 9 (49) show that many of these features remain stable during the simulation time. For example, for the pose shown in Fig. 7a, the specific pattern of H-bonds has changed (the H-bond between the coumarin ring oxygen and the A21 amino group broke, and the one between the coumarin carbonyl oxygen and C20 amino group formed instead), but the coumarin ring remained stacked under A13 during 1 ns of the simulation (data not shown). Existence of multiple binding poses is likely to contribute entropically into the observed apparent ligand affinities.
Several coumarin and hydroxyquinoline derivatives conjugated with a short peptide can promote the conversion of the SL1 RNA from the KD to ED conformation (24), thus mimicking both the in vivo (17) and in vitro (14) NCp7 behavior. The similarities between NCp7 and small molecules studied here go further. NCp7 binds with high affinity to various structural motifs of HIV-1 RNA, such as the palindromic region and GRIL on SL1 hairpin and G-rich apical loops of SL2 and SL3 hairpins with apparent Kd values of 23–140 nM (23). KA-AMC binds the palindromic region and the GRIL motif of SL1 with apparent Kd of 50 and 21 µM, respectively (Table 1); it also binds SL2 and SL3 (Table 3). Further, NCp7 also binds fairly non-specifically to double-stranded nucleic acids (19). KA-AMC may bind the stem regions of SL1 as well, based on small proton chemical shift perturbations (Fig. 5), and it binds a duplex RNA sample, although less tightly than the two specific sites (Tables 1, 2). Here the similarities end. It appears that approximately stoichiometric amounts of NCp7 (2 NCp7:2 RNA strands) are required for the full KD-to-ED conversion of SL1wt (12). Assuming the apparent Kd of 100 nM (23), about 82% of SL1-fl RNA is bound by NCp7 at the experimental conditions used, and most of the RNA was converted to the ED conformation (12). With the apparent Kd of 19 µM (Table 1), 88% of SL1-fl KD would be bound by KA-AMC at the 100X ligand excess, and yet most of the RNA would not have converted to ED at these conditions, because the EC50 value, the effective concentration of KA-AMC to achieve 50% conversion, is 540–600 µM (24). Clearly, NCp7 not only binds better but, independent of that, is a better refolding chaperone. We plan to use structural insights obtained in this study to further optimize small molecules binding SL1 to inhibit the NCp7-SL1 interaction and thus target the HIV-1 RNA dimer maturation and/or HIV-1 RNA packaging.
Supplementary Material
ACKNOWLEGEMENTS
We thank Abram Calderon for assistance in fluorescence quenching experiments and Tuck C. Wong for help with diffusion coefficient measurements.
Abbreviations
- HIV-1
human immunodeficiency virus type 1
- SL1
stem-loop 1 RNA
- NCp7
proteolytic nucleocapsid protein product of the Gag protein Lys-Ala-7-amido-4-methylcoumarin
- R-AMC
Arg-7-amido-4-methylcoumarin
- RR-AMC
Arg-Arg-7-amido-4-methylcoumarin
- R-MHQ
Arg-7-amido-4-methyl-2-hydroxyquinoline
- 5’-UTR
5’-untranslated region
- DIS
dimer initiation site
- H-bond
hydrogen bond
- KD
kissing dimer
- ED
extended dimer
- SAR
structure-activity relationship
- NMR
nuclear magnetic resonance
- SL1-fl
ful-llength SL1 RNA
- PAGE
polyacrylamide gel electrophoresis
- UV
ultraviolet
- TBE
Tris-borate-EDTA buffer
- TBM
Tris-borate-MgCl2 buffer
- Kd
dissociation constant
- PDB
Protein Data Bank
- 1D
one-dimensional
- 2D
two-dimensional
- TOCSY
total correlation spectroscopy
- NOE
nuclear Overhauser effect
- NOESY
nuclear Overhauser effect spectroscopy
- T1
spin lattice relaxation time
Footnotes
This work was supported by Grant AI046967 from the National Institutes of Health to T.L.J.
SUPPORTING INFORMATION AVAILABLE
Additional experimental details. This material is available free of charge via the Internet at http://pubs.acs.org.
REFERENCES
- 1.Hoglund S, Ohagen A, Goncalves J, Panganiban AT, Gabuzda D. Ultrastructure of HIV-1 genomic RNA. Virology. 1997;233:271–279. doi: 10.1006/viro.1997.8585. [DOI] [PubMed] [Google Scholar]
- 2.Paillart JC, Shehu-Xhilaga M, Marquet R, Mak J. Dimerization of retroviral RNA genomes: an inseparable pair. Nat Rev Microbiol. 2004;2:461–472. doi: 10.1038/nrmicro903. [DOI] [PubMed] [Google Scholar]
- 3.Hu WS, Temin HM. Genetic consequences of packaging two RNA genomes in one retroviral particle: pseudodiploidy and high rate of genetic recombination. Proc Natl Acad Sci U S A. 1990;87:1556–1560. doi: 10.1073/pnas.87.4.1556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Clever JL, Wong ML, Parslow TG. Requirements for kissing-loop-mediated dimerization of human immunodeficiency virus RNA. J. Virol. 1996;70:5902–5908. doi: 10.1128/jvi.70.9.5902-5908.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Skripkin E, Paillart JC, Marquet R, Ehresmann B, Ehresmann C. Identification of the primary site of the human immunodeficiency virus type 1 RNA dimerization in vitro. Proc Natl Acad Sci U S A. 1994;91:4945–4949. doi: 10.1073/pnas.91.11.4945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Paillart JC, Marquet R, Skripkin E, Ehresmann B, Ehresmann C. Mutational Analysis of the Bipartite Dimer Linkage Structure of Human Immunodeficiency Virus Type 1 Genomic Rna. J. Biol. Chem. 1994;269:27486–27493. [PubMed] [Google Scholar]
- 7.Laughrea M, Jette L. A 19-Nucleotide Sequence Upstream of the 5' Major Splice Donor Is Part of the Dimerization Domain of Human Immunodeficiency Virus 1 Genomic RNA. Biochemistry. 1994;33:13464–13474. doi: 10.1021/bi00249a035. [DOI] [PubMed] [Google Scholar]
- 8.Muriaux D, Fosse P, Paoletti J. A kissing complex together with a stable dimer is involved in the HIV-1Lai RNA dimerization process in vitro. Biochemistry. 1996;35:5075–5082. doi: 10.1021/bi952822s. [DOI] [PubMed] [Google Scholar]
- 9.Sakuragi J, Sakuragi S, Shioda T. Minimal region sufficient for genome dimerization in the human immunodeficiency virus type 1 virion and its potential roles in the early stages of viral replication. J Virol. 2007;81:7985–7992. doi: 10.1128/JVI.00429-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lodmell JS, Ehresmann C, Ehresmann B, Marquet R. Convergence of natural and artificial evolution on an RNA loop-loop interaction: the HIV-1 dimerization initiation site. RNA. 2000;6:1267–1276. doi: 10.1017/s1355838200000698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Paillart JC, Westhof E, Ehresmann C, Ehresmann B, Marquet R. Non-canonical interactions in a kissing loop complex: the dimerization initiation site of HIV-1 genomic RNA. J Mol Biol. 1997;270:36–49. doi: 10.1006/jmbi.1997.1096. [DOI] [PubMed] [Google Scholar]
- 12.Mujeeb A, Ulyanov NB, Georgantis S, Smirnov I, Chung J, Parslow TG, James TL. Nucleocapsid protein-mediated maturation of dimer initiation complex of full-length SL1 stemloop of HIV-1: sequence effects and mechanism of RNA refolding. Nucleic Acids Res. 2007;35:2026–2034. doi: 10.1093/nar/gkm097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Laughrea M, Jette L. Kissing-loop model of HIV-1 genome dimerization: HIV-1 RNAs can assume alternative dimeric forms, and all sequences upstream or downstream of hairpin 248–271 are dispensable for dimer formation. Biochemistry. 1996;35:1589–1598. doi: 10.1021/bi951838f. [DOI] [PubMed] [Google Scholar]
- 14.Muriaux D, De Rocquigny H, Roques BP, Paoletti J. NCp7 activates HIV-1Lai RNA dimerization by converting a transient loop-loop complex into a stable dimer. J Biol Chem. 1996;271:33686–33692. doi: 10.1074/jbc.271.52.33686. [DOI] [PubMed] [Google Scholar]
- 15.Moore MD, Nikolaitchik OA, Chen J, Hammarskjold ML, Rekosh D, Hu WS. Probing the HIV-1 genomic RNA trafficking pathway and dimerization by genetic recombination and single virion analyses. PLoS Pathog. 2009;5:e1000627. doi: 10.1371/journal.ppat.1000627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Song R, Kafaie J, Yang L, Laughrea M. HIV-1 viral RNA is selected in the form of monomers that dimerize in a three-step protease-dependent process; the DIS of stem-loop 1 initiates viral RNA dimerization. J Mol Biol. 2007;371:1084–1098. doi: 10.1016/j.jmb.2007.06.010. [DOI] [PubMed] [Google Scholar]
- 17.Fu W, Gorelick RJ, Rein A. Characterization of human immunodeficiency virus type 1 dimeric RNA from wild-type and protease-defective virions. J Virol. 1994;68:5013–5018. doi: 10.1128/jvi.68.8.5013-5018.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Feng YX, Campbell S, Harvin D, Ehresmann B, Ehresmann C, Rein A. The human immunodeficiency virus type 1 Gag polyprotein has nucleic acid chaperone activity: possible role in dimerization of genomic RNA and placement of tRNA on the primer binding site. J Virol. 1999;73:4251–4256. doi: 10.1128/jvi.73.5.4251-4256.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Urbaneja MA, Wu M, Casas-Finet JR, Karpel RL. HIV-1 nucleocapsid protein as a nucleic acid chaperone: spectroscopic study of its helix-destabilizing properties, structural binding specificity, and annealing activity. J Mol Biol. 2002;318:749–764. doi: 10.1016/S0022-2836(02)00043-8. [DOI] [PubMed] [Google Scholar]
- 20.Marquet R, Paillart JC, Skripkin E, Ehresmann C, Ehresmann B. Dimerization of human immunodeficiency virus type 1 RNA involves sequences located upstream of the splice donor site. Nucleic Acids Res. 1994;22:145–151. doi: 10.1093/nar/22.2.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Berkhout B. Structure and function of the human immunodeficiency virus leader RNA. Prog Nucleic Acid Res Mol Biol. 1996;54:1–34. doi: 10.1016/s0079-6603(08)60359-1. [DOI] [PubMed] [Google Scholar]
- 22.Clever JL, Parslow TG. Mutant human immunodeficiency virus type 1 genomes with defects in RNA dimerization or encapsidation. J. Virol. 1997;71:3407–3414. doi: 10.1128/jvi.71.5.3407-3414.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Shubsda MF, Paoletti AC, Hudson BS, Borer PN. Affinities of packaging domain loops in HIV-1 RNA for the nucleocapsid protein. Biochemistry. 2002;41:5276–5282. doi: 10.1021/bi016045+. [DOI] [PubMed] [Google Scholar]
- 24.Chung J, Mujeeb A, Jiang Y, Guilbert C, Pendke M, Wu Y, James TL. A small molecule, Lys-Ala-7-amido-4-methylcoumarin, facilitates RNA dimer maturation of a stem-loop 1 transcript in vitro: structure-activity relationship of the activator. Biochemistry. 2008;47:8148–8156. doi: 10.1021/bi800230m. [DOI] [PubMed] [Google Scholar]
- 25.Milligan JF, Uhlenbeck OC. Synthesis of small RNAs using T7 RNA polymerase. In: Dahlberg JE, Abelson JN, editors. Meth. Enzym. New York: Academic Press; 1989. pp. 51–62. [DOI] [PubMed] [Google Scholar]
- 26.Grodberg J, Dunn JJ. ompT encodes the Escherichia coli outer membrane protease that cleaves T7 RNA polymerase during purification. J Bacteriol. 1988;170:1245–1253. doi: 10.1128/jb.170.3.1245-1253.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Marquet R, Baudin F, Gabus C, Darlix JL, Mougel M, Ehresmann C, Ehresmann B. Dimerization of human immunodeficiency virus (type 1) RNA: stimulation by cations and possible mechanism. Nucleic Acids Res. 1991;19:2349–2357. doi: 10.1093/nar/19.9.2349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Laughrea M, Jette L. HIV-1 genome dimerization: formation kinetics and thermal stability of dimeric HIV-1Lai RNAs are not improved by the 1-232 and 296–790 regions flanking the kissing-loop domain. Biochemistry. 1996;35:9366–9374. doi: 10.1021/bi960395s. [DOI] [PubMed] [Google Scholar]
- 29.Takahashi KI, Baba S, Chattopadhyay P, Koyanagi Y, Yamamoto N, Takaku H, Kawai G. Structural requirement for the two-step dimerization of human immunodeficiency virus type 1 genome. RNA. 2000;6:96–102. doi: 10.1017/s1355838200991635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Amarasinghe GK, De Guzman RN, Turner RB, Summers MF. NMR structure of stem-loop SL2 of the HIV-1 psi RNA packaging signal reveals a novel A-U-A base-triple platform. J Mol Biol. 2000;299:145–156. doi: 10.1006/jmbi.2000.3710. [DOI] [PubMed] [Google Scholar]
- 31.Pappalardo L, Kerwood DJ, Pelczer I, Borer PN. Three-dimensional folding of an RNA hairpin required for packaging HIV-1. J Mol Biol. 1998;282:801–818. doi: 10.1006/jmbi.1998.2046. [DOI] [PubMed] [Google Scholar]
- 32.Elalaoui A, Divita G, Maury G, Imbach JL, Goody RS. Intrinsic tryptophan fluorescence of bovine liver adenosine kinase, characterization of ligand binding sites and conformational changes. Eur J Biochem. 1994;221:839–846. doi: 10.1111/j.1432-1033.1994.tb18798.x. [DOI] [PubMed] [Google Scholar]
- 33.Smallcombe SH. Solvent suppression with symmetrically-shifted pulses. J. Am. Chem. Soc. 1993;115:4776–4785. [Google Scholar]
- 34.Piotto M, Saudek V, Sklenar V. Gradient-Tailored Excitation for Single-Quantum NMR Spectroscopy of Aqueous Solutions. J Biomol NMR. 1992;2:661–665. doi: 10.1007/BF02192855. [DOI] [PubMed] [Google Scholar]
- 35.Ulyanov NB, Mujeeb A, Du Z, Tonelli M, Parslow TG, James TL. NMR structure of the full-length linear dimer of stem-loop-1 RNA in the HIV-1 dimer initiation site. J Biol Chem. 2006;281:16168–16177. doi: 10.1074/jbc.M601711200. [DOI] [PubMed] [Google Scholar]
- 36.Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. NMRPipe: A Multidimensional Spectral Processing System Based on UNIX Pipes. J. Biomolec. NMR. 1995;6:277–293. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
- 37.Goddard TD, Kneller DG. SPARKY. 3.0 ed. San Francisco: University of California; 1998. [Google Scholar]
- 38.Guilbert C, James TL. Docking to RNA via root-mean-square-deviation-driven energy minimization with flexible ligands and flexible targets. J Chem Inf Model. 2008;48:1257–1268. doi: 10.1021/ci8000327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Baba S, Takahashi K, Noguchi S, Takaku H, Koyanagi Y, Yamamoto N, Kawai G. Solution RNA structures of the HIV-1 dimerization initiation site in the kissing-loop and extended-duplex dimers. J Biochem (Tokyo) 2005;138:583–592. doi: 10.1093/jb/mvi158. [DOI] [PubMed] [Google Scholar]
- 40.Zhurkin VB, Ulyanov NB, Gorin AA, Jernigan RL. Static and Statistical Bending of DNA Evaluated by Monte Carlo Simulations. Proc. Nat. Acad. Sci. USA. 1991;88:7046–7050. doi: 10.1073/pnas.88.16.7046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Gómez Pinto I, Guilbert C, Ulyanov NB, Stearns J, James TL. Discovery of Ligands for a Novel Target, the Human Telomerase RNA, Based on Flexible-Target Virtual Screening and NMR. J Med Chem. 2008;51:7205–7215. doi: 10.1021/jm800825n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Still WC, Tempczyk A, Hawley RC, Hendrickson T. Semianalytical Treatment of Solvation For Molecular Mechanics and Dynamics. J Amer Chem Soc. 1990;112:6127–6129. [Google Scholar]
- 43.Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE. UCSF Chimera--a visualization system for exploratory research and analysis. J Comput Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 44.Comolli LR, Ulyanov NB, Soto AM, Marky LA, James TL, Gmeiner WH. NMR Structure of the 3' stem-loop from human U4 snRNA. Nucl Acids Res. 2002;30:4371–4379. doi: 10.1093/nar/gkf560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Stewart-Maynard KM, Cruceanu M, Wang F, Vo MN, Gorelick RJ, Williams MC, Rouzina I, Musier-Forsyth K. Retroviral nucleocapsid proteins display nonequivalent levels of nucleic acid chaperone activity. J Virol. 2008;82:10129–10142. doi: 10.1128/JVI.01169-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Ryu DH, Rando RR. Aminoglycoside binding to human and bacterial A-Site rRNA decoding region constructs. Bioorg Med Chem. 2001;9:2601–2608. doi: 10.1016/s0968-0896(01)00034-7. [DOI] [PubMed] [Google Scholar]
- 47.Aboul-ela F. Strategies for the design of RNA-binding small molecules. Future Med Chem. 2010;2:93–119. doi: 10.4155/fmc.09.149. [DOI] [PubMed] [Google Scholar]
- 48.Reibarkh M, Malia TJ, Hopkins BT, Wagner G. Identification of individual protein-ligand NOEs in the limit of intermediate exchange. J Biomol NMR. 2006;36:1–11. doi: 10.1007/s10858-006-9028-7. [DOI] [PubMed] [Google Scholar]
- 49.Case DA, Darden TA, T E, Cheatham I, Simmerling CL, Wang J, Duke RE, Luo R, Merz KM, Pearlman DA, Crowley M, Walker RC, Zhang W, Wang B, Hayik S, Roitberg A, Seabra G, Wong KF, Paesani F, Wu X, Brozell S, Tsui V, Gohlke H, Yang L, Tan C, Mongan J, Hornak V, Cui G, Beroza P, Mathews DH, Schafmeister C, Ross WS, Kollman PA. AMBER 9. 9.0 ed. San Francisco: University of California; 2006. AMBER 9. [Google Scholar]
- 50.Couch GS, Hendrix DK, Ferrin TE. Nucleic acid visualization with UCSF Chimera. Nucleic Acids Res. 2006;34:e29. doi: 10.1093/nar/gnj031. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.