Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2015 Dec 14;291(7):3468–3482. doi: 10.1074/jbc.M115.708099

Structural Insights into the HIV-1 Minus-strand Strong-stop DNA*

Yingying Chen ‡,§,1,2, Ouerdia Maskri ‡,1,3, Françoise Chaminade , Brigitte René , Jessica Benkaroun , Julien Godet , Yves Mély , Olivier Mauffret , Philippe Fossé ‡,4
PMCID: PMC4751388  PMID: 26668324

Abstract

An essential step of human immunodeficiency virus type 1 (HIV-1) reverse transcription is the first strand transfer that requires base pairing of the R region at the 3′-end of the genomic RNA with the complementary r region at the 3′-end of minus-strand strong-stop DNA (ssDNA). HIV-1 nucleocapsid protein (NC) facilitates this annealing process. Determination of the ssDNA structure is needed to understand the molecular basis of NC-mediated genomic RNA-ssDNA annealing. For this purpose, we investigated ssDNA using structural probes (nucleases and potassium permanganate). This study is the first to determine the secondary structure of the full-length HIV-1 ssDNA in the absence or presence of NC. The probing data and phylogenetic analysis support the folding of ssDNA into three stem-loop structures and the presence of four high-affinity binding sites for NC. Our results support a model for the NC-mediated annealing process in which the preferential binding of NC to four sites triggers unfolding of the three-dimensional structure of ssDNA, thus facilitating interaction of the r sequence of ssDNA with the R sequence of the genomic RNA. In addition, using gel retardation assays and ssDNA mutants, we show that the NC-mediated annealing process does not rely on a single pathway (zipper intermediate or kissing complex).

Keywords: DNA structure, DNA-protein interaction, human immunodeficiency virus (HIV), nucleic acid structure, reverse transcription

Introduction

Two strand transfers are required for reverse transcription of the human immunodeficiency virus type 1 (HIV-1)5 genome (1). These strand transfers are enhanced by the HIV-1 nucleocapsid protein (NC) (26). During the first strand transfer, which involves reverse transcriptase (RT) with its RNase H activity to promote degradation of the 5′-copied genomic RNA (donor RNA), the 3′-end of the minus-strand strong-stop DNA (ssDNA) is transferred to the 3′-end of the genomic RNA (acceptor RNA). This transfer can occur in an intra- (i.e. transfer of ssDNA to the original RNA template) or intermolecular (i.e. transfer of ssDNA to the other genomic RNA copy in the virion) manner (1). The first requirement for synthesis of ssDNA is the annealing of the human primer tRNA3Lys onto the primer binding site of the genomic RNA (Fig. 1A). The model of reverse transcription presumes that the first strand transfer occurs after synthesis of the full-length ssDNA. More precisely, the transfer is mediated by base pairing of the R region at the 3′-end of the genomic RNA with the complementary r region at the 3′-end of ssDNA. The effect of shortening the length of complementarity has been studied using murine leukemia virus-based vectors capable of only a single round of replication (7). This study showed that removing more than 80% of the length of 3′ R, from 69 to 12 nucleotides, did not significantly affect the efficiency of the first strand transfer. These results cannot be extrapolated directly to HIV-1, because the R regions of murine leukemia virus and HIV-1 are different. The effect of shortening the length of 3′ R has also been studied in HIV-1 (8). This study showed that production of viral particles was barely affected by a 3′ R sequence of only 37 nucleotides. However, this result did not directly demonstrate that the first strand transfer was not altered, because multiple rounds of viral replications were allowed. Studies on Moloney murine leukemia virus (9), spleen necrosis virus (10), and HIV-1 (11) with mutations in R suggested that partial ssDNAs (“weak-stop” DNAs) can be transferred prior to complete reverse transcription of the R region. However, the transfer of weak-stop DNAs occurred at low frequency (10, 11). In addition, the results of a study performed with a series of mutations in the 5′, 3′, or both R sequences suggested that the great majority of first strand transfers in HIV-1 occur after the synthesis of the full-length ssDNA (12). Therefore, an anti-HIV-1 therapy based on inhibition of the first strand transfer requires knowledge of the structures of the full-length ssDNA and the complete 3′ R. Consistent with this notion, actinomycin D inhibits the first strand transfer through direct interaction with the ssDNA (13, 14).

FIGURE 1.

FIGURE 1.

Strong-stop DNA analysis. A, schematic diagram of events preceding the first strand transfer. R, terminal repeat; TAR, transactivator response element; poly(A), polyadenylation signal; U5, sequence unique at the 5′-end; PBS, primer binding site; U3, sequence unique at the 3′-end; ssDNA, minus-strand strong-stop DNA; r, complementary DNA sequence to R; cTAR, complementary DNA sequence to TAR; cpoly(A), complementary DNA sequence to poly(A); u5, complementary DNA sequence to U5. The 3′ 18 nucleotides of the tRNA3Lys primer are annealed to the PBS, a complementary sequence in the viral genome (gray line); numbering is relative to the genomic RNA cap site (+1). Once annealed, RT catalyzes extension of the primer to form ssDNA (black line). The 5′-end of the genomic RNA is degraded (broken line) by the action of the RNase H activity of RT. The first strand transfer is mediated by base pairing of the R and r regions that are complementary. B, ssDNA and RNA used in this study. ssDNA (178 nucleotides) is the extension product of PR primer, numbering is relative to the first nucleotide of ssDNA, which is complementary to nucleotide 178 of the genomic RNA; RNA Δ3′ (200 nucleotides) contains the 3′-end of the U3 sequence and the full-length R sequence. C, predicted secondary structures for the cTAR sequence in the wild-type and mutant ssDNAs. The DNA folding program of Zuker (43) was used to predict the most stable secondary structure for each cTAR sequence. The T·G base pair is supported by NMR analysis of the top half of cTAR (65). Mutations are shown as lowercase letters in boxes.

The R sequence contains the transactivator response element (TAR) and a portion of the polyadenylation signal (poly(A)) (15, 16). The 3′-TAR sequence folds into a hairpin in the complete HIV-1 RNA genome extracted from virions (17). The 3′-poly(A) sequence can form a hairpin (16, 18), but there is no data showing that this secondary structure is present in the complete HIV-1 RNA genome. The r sequence of ssDNA is predicted to fold into hairpins that are complementary to the TAR and poly(A) RNA sequences and are therefore named cTAR and cpoly(A), respectively (19, 20). To date, the secondary structure of the full-length ssDNA has not been determined. Thus, there is no data showing that the cTAR and cpoly(A) sequences form the predicted hairpins in the context of the full-length ssDNA.

NC promotes annealing of ssDNA to the 3′-end of the genomic RNA because it possesses nucleic acid chaperone activity that facilitates the rearrangement of nucleic acids into the most thermodynamically stable structures containing the maximum number of base pairs (21, 22). This chaperone function depends on three properties (2): 1) ability to aggregate nucleic acids; 2) weak duplex destabilization activity; and 3) rapid on-off binding kinetics. The ability of NC to increase the rate of the first strand transfer is due at least in part to its stimulatory effect on the rate of annealing of the complementary hairpins (23, 24). Unfolding of the complementary hairpins is thought to be rate-limiting in the annealing process leading to the first strand transfer (23). As mentioned above, mutational analysis of the HIV-1 R sequence suggests that the great majority of first strand transfers occur after completion of ssDNA synthesis, i.e. the complete cTAR sequence would be required for efficient strand transfer in vivo (12). Several studies performed with the cTAR sequence (55 to 59 nucleotides) support a dynamic structure of the cTAR hairpin, involving equilibrium between both the closed conformation and the partially open “Y” conformation (20, 2528). Recently, we showed that NC slightly destabilizes the lower stem that is adjacent to the internal loop and shifts the equilibrium toward the Y conformation exhibiting at least 12 unpaired nucleotides in its lower part (29). We also showed that the apical and internal loops of cTAR are weak and strong binding sites for NC, respectively (29). To date, the interactions between NC and the full-length ssDNA have not been studied.

Because mutations in the TAR apical loop decrease the first strand transfer in vitro, Berkhout et al. (19) suggested that this process involves a “kissing” complex formed by the apical loops of TAR and cTAR hairpins. Consistent with this hypothesis, we found that efficient annealing of cTAR DNA to the 3′-end of the genomic RNA relies on sequence complementarities between TAR and cTAR apical loops (18). Studies using TAR RNA and cTAR DNA hairpins suggest that both the apical loops and the 3′/5′ termini of complementary hairpins are the initiation sites for the annealing reaction under subsaturating concentrations of NC (30, 31). In contrast, cTAR DNA-TAR RNA annealing in the presence of saturated NC depends only on nucleation through the 3′/5′ termini, resulting in the formation of a “zipper” intermediate (30, 32). Note that the two annealing pathways have not been demonstrated for the annealing reaction between the full-length ssDNA and the 3′-end of the genomic RNA.

In this study, we investigated the annealing process using ssDNA and RNA Δ3′ representing the 3′-end of the genomic RNA (Fig. 1B). Using chemical and enzymatic probes, our study is the first and the only one, to our knowledge, which determined the secondary structure of the ssDNA and the NC binding sites within this DNA.

Experimental Procedures

NC and Oligonucleotides

Full-length NC (NC(1–55)) was synthesized by the Fmoc (N-(9-fluorenyl)methoxycarbonyl)/pentafluorophenyl ester chemical method and purified to homogeneity by HPLC (33). NC was dissolved at concentrations of 1 to 2 mg/ml in a buffer containing 25 mm HEPES (pH 6.5), 50 mm NaCl, and 2.2 mol of ZnCl2/mol of peptide. DNA oligonucleotides were purchased from Eurogentec. In the following oligonucleotide sequences, the uppercase letters indicate the bases that are complementary to the DNA sequence of the HIV-1 MAL isolate and the positions of the HIV-1 sequence targeted by the oligonucleotides are in parentheses: O1, gcagaattctggagtaatacgactcactataGGTCTCTCTTGTTAGAC (1–17); O2, gcagcggtcgacGGTATCACTTCTGGGC (876–861); O3, gcaatacGGTCTCTCTTGTTAGACCAGGTCGAGCaatttgGCTCTCTGGCTAGC (1–47); O4, ccgccgtgactaccgcgagacccac; O5, gctcgaaCCCGGGCTCGACCTGGTCTAACAAGttcttCCtatagtgagtcgtattac (27–7); O6, catccaCCCGGGAGCTCTCTGGCTAGCAAGtctctggCACTGCTTAAGCCTC (32–70); O7, gaagaattctggagtaatacgactcactataGGAGGCGTAACTTGGG (9055–9070); O8, atcgataccgtcgacc; O9, gcagaattctggagtaatacgactcactataGGAGCTTTTGATCTCAG (8635–9229); Oligonucleotides O1, O5, O7, and O9 contain the T7 RNA polymerase promoter. PR, CTGCTAGAGATTTTTACACCG (178–158); PU3b1M, TCCGGATGTTGTTCTC (8990–8975).

Construction of Plasmids

Standard procedures were used for restriction enzyme digestion and plasmid construction (34). Restriction endonucleases and T4 DNA ligase were purchased from New England Biolabs. The expand high fidelity PCR system was from Roche Applied Science. Cloned sequences and mutations were verified by DNA sequencing. Plasmids pHIVCG-4 and pHIVCG8.6 contain DNA fragments of the HIV-1 genome derived from the MAL isolate (35, 36). Plasmid pYC5′ was generated by PCR amplification of linearized pHIVCG-4 with EcoRI using oligonucleotides O1 and O2. The resulting PCR product was digested with EcoRI and SalI and ligated into pHIVCG-4 digested with the same enzymes. Using oligonucleotides O3 and O4 and pYC5′ linearized with EcoRI, plasmid pBCSL1 was generated by PCR amplification. The resulting PCR product was digested with BsaI and ligated into pYC5′ digested with the same enzyme. Using oligonucleotides O5 and O6 and pYC5′ linearized with AvaI, plasmid pCCinv was generated by PCR amplification. The resulting PCR product was then digested with AvaI and ligated intramolecularly. Plasmid pFC3′-2 was generated by PCR amplification of linearized pHIVCG8.6 with EcoRI using oligonucleotides O7 and O8. The resulting PCR product was digested with EcoRI and XhoI and ligated into pCG44 (37) digested with the same enzymes. Plasmid pFC3′UTR was generated by PCR amplification of linearized pHIVCG8.6 with EcoRI using oligonucleotides O8 and O9. The resulting PCR product was digested with EcoRI and XhoI and ligated into pCG44 digested with the same enzymes.

In Vitro RNA Synthesis and Purification

Plasmids pYC5′, pBCSL1, and pCCinv were digested with HaeIII to generate templates for in vitro synthesis of RNAs 1–415wt, 1–415SL1, and 1–415inv, respectively. These RNA transcripts start and end with authentic HIV sequences, i.e. they do not contain additional sequences resulting from DNA plasmid construction. Plasmid pFC3′-2 was digested with XhoI to generate the template for in vitro synthesis of RNA Δ3′ (200 nucleotides) that contains at the 3′-end a poly(A) tail (22 adenine residues) and five nucleotides corresponding to the XhoI site. Plasmid pFC3′UTR was digested with XhoI to generate the template for in vitro synthesis of RNA 3′ UTR (620 nucleotides). Five μg of the cleaved plasmids was transcribed with bacteriophage T7 RNA polymerase under the conditions stipulated by the RiboMAXTM large scale RNA production system (Promega). RNAs were purified by electrophoresis on a denaturing 5% polyacrylamide gel as described (34).

Synthesis, Labeling, and Purification of the ssDNAs and DNA Size Markers

The DNA oligonucleotides PR and PU3b1M were used as primers. For synthesis of 5′-end labeled ssDNAs and DNA size marker (M, 356 nucleotides), the primers were 5′-end labeled using T4 polynucleotide kinase (New England Biolabs) and [γ-32P]ATP (PerkinElmer Life Sciences). Ninety pmol of 5′-end labeled primer (3 × 105 cpm/pmol) and 60 pmol of RNAs 1–415 (templates for the wild-type and mutant ssDNAs), or 3′ UTR (template for the DNA size marker, M) in 150 μl of water were heated at 90 °C for 3 min and frozen for 5 min on dry ice/ethanol bath. Then, 138 μl of reaction buffer (final concentrations: 78 mm KCl, 1 mm DTT, and 52 mm Tris-HCl, pH 7.8) containing 30 units of HIV-1 reverse transcriptase (Worthington) was added and the sample was preincubated at 37 °C for 10 min before the primer extension reaction was initiated by 6 μl of MgCl2 and 6 μl of dNTPs. The final reaction contained 50 mm Tris-HCl (pH 7.8), 75 mm KCl, 2 mm MgCl2, 1 mm DTT, and 300 nm dNTPs. The reaction was incubated at 37 °C for 1 h and terminated by addition of 200 μl of 0.3 m NaOH, 0.05 m EDTA and heating at 90 °C for 15 min. Then, the sample was extracted by phenol/chloroform followed by ethanol precipitation and the dried pellet was resuspended in 40 μl of loading buffer A (7 m urea, 0.03% (w/v) bromphenol blue, and 0.03% (w/v) xylene cyanol). The 5′-end labeled ssDNAs and M were purified by electrophoresis on 6% denaturing polyacrylamide gels and isolated by elution followed by ethanol precipitation. The purified ssDNAs and M were dissolved in water and checked for purity and integrity on a 6% denaturing polyacrylamide gel.

Analysis of ssDNA Dimerization

The assays were carried out in a final volume of 10 μl. The 5′-end labeled ssDNA (0.15 pmol at 4 × 105 cpm/pmol) was dissolved in 7.2 μl of water, heated at 90 °C for 2 min, and chilled for 2 min on ice. Then 0.8 μl of the renaturation buffer was added (final concentrations: 75 mm KCl, 0.2 mm MgCl2, and 50 mm Tris-HCl, pH 7.8) and the sample was incubated at 37 °C for 45 min. The reaction mixtures were then incubated at 37 °C for 15 min in the absence or presence of NC at various concentrations. The incubations were stopped by extraction with phenol/chloroform and each aqueous phase was mixed with 4 μl of loading buffer B (50% (w/v) glycerol, 0.05% (w/v) bromphenol blue, 0.05% (w/v) xylene cyanol). The heat-denatured control of 5′-end labeled ssDNA (0.15 pmol at 4 × 105 cpm/pmol in 10 μl of water) was performed by heating at 90 °C for 2 min and chilling for 2 min on ice, and mixing with 4 μl of loading buffer B. Formation of homoduplexes was analyzed by electrophoresis on a 2% agarose (QA-AgaroseTM, Qbiogene) gel at 4 °C in 0.5 × TBM (45 mm Tris borate (pH 8.3), 0.1 mm MgCl2). After electrophoresis, the gel was fixed, dried, and autoradiographed as described (37).

Gel-shift Annealing Assay

The annealing assay was carried out in a final volume of 12 μl. The 5′-end labeled ssDNA (0.15 pmol at 4 × 105 cpm/pmol) in 4 μl of water was heated at 90 °C for 2 min and chilled for 2 min on ice. Then, 1.0 μl of renaturation buffer (final concentrations: 75 mm KCl, 0.2 mm MgCl2, and 50 mm Tris-HCl, pH 7.8) and 1 μl of NC (1.48 or 3.81 pmol) were added and the sample was incubated at 37 °C for 15 min. Unlabeled RNA Δ3′ (0.45 pmol) underwent the same renaturation treatment with 1 μl of NC (5 or 12.85 pmol) and then added to the refolded ssDNA. The protein to nucleotide molar ratios were 1:18 or 1:7. These ratios refer to total nucleotide concentration. The reaction mixture was then incubated at 37 °C for 1, 3, 9, 15, 30, 60, or 120 min. At the end of incubations, the assays were phenol/chloroform extracted and each aqueous phase was mixed with 4 μl of loading buffer B. The samples were analyzed by electrophoresis on a 2% agarose (QA-AgaroseTM, Qbiogene) gel at 25 °C in 0.5 × TBE (45 mm Tris borate (pH 8.3), 1 mm EDTA). After electrophoresis, the gel was fixed, dried, and autoradiographed as described (37). The monomeric (m) and heteroduplex (hd) forms of ssDNA, and the high molecular weight DNA-RNA complexes (hmw) were quantified using a TyphoonTM TRIO (GE Healthcare) and ImageQuant software. The percent of heteroduplex was determined as 100 × (hd/(hd + m + hmw)).

Structural Probing of ssDNAs

Potassium permanganate and piperidine were purchased from Sigma. Mung bean nuclease and DNase I were purchased from New England Biolabs and Promega, respectively. Structural probing of ssDNA was carried out in a final volume of 10 μl. The 5′-end labeled ssDNA (0.15 pmol at 4 × 105 cpm/pmol) in 7.2 μl of water was heated at 90 °C for 2 min and chilled for 2 min on ice. Then, 0.8 μl of renaturation buffer (final concentrations: 75 mm KCl, 0.2 mm MgCl2, and 50 mm Tris-HCl (pH 7.8) for probing with KMnO4 or DNase I; 75 mm KCl, 0.2 mm MgCl2, and 50 mm sodium cacodylate (pH 6.5) for probing with mung bean nuclease) was added and the sample was incubated at 37 °C for 45 min. The reaction mixtures were then incubated at 37 °C for 15 min in the absence or presence of NC at various concentrations. The samples were then incubated with 0.125, 0.25, or 0.5 units of mung bean nuclease for 15 min at 37 °C or with 0.1 or 0.15 units of DNase I for 7 min at 37 °C. These cleavage reactions were stopped by phenol/chloroform extraction followed by ethanol precipitation. The dried pellets were resuspended in 7 μl of loading buffer A. For potassium permanganate probing, ssDNA was treated with 0.25, 0.5, 0.75, or 1 mm KMnO4 for 1 min at 37 °C. The treatment was stopped by adding 40 μl of the termination buffer (0.7 m β-mercaptoethanol, 0.4 m NaOAc (pH 7.0), 10 mm EDTA, 25 μg/ml of tRNA). DNA was then extracted with phenol/chloroform, ethanol precipitated, and dried. DNA was subjected to piperidine cleavage by resuspension of the dried pellet in 100 μl of freshly diluted 1 m piperidine and heating at 90 °C for 30 min. The samples were lyophilized, resuspended in 20 μl of water, and lyophilized again. After a second lyophilization from 15 μl of water, the samples were resuspended in 7 μl of loading buffer A. Sequence markers of the labeled ssDNA were produced by the Maxam-Gilbert method (38). To identify all cleavage sites at the nucleotide level, the samples were analyzed by long and short migration times on denaturing 6–8 and 12–14% polyacrylamide gels, respectively.

Alignment of the ssDNA Sequences

The ssDNA sequence alignments were performed using the HIV sequence database from the Los Alamos National Lab HIV. A position-weighted matrix was then computed using the open-source statistical software R from the Comprehensive R Archive Network (59). The sequence logo (39) was plotted using the seqLogo package.

Results

Synthesis, Purification, and Analysis of ssDNA

Commercial chemical synthesis of ssDNA was unattainable due to substrate length. Therefore, we synthesized the full-length ssDNA derived from the MAL isolate using HIV-1 reverse transcriptase. The 5′-end labeled ssDNA was synthesized and purified as described under “Experimental Procedures” section. Previous studies showed that both loose and tight duplexes can be characterized by native agarose gel electrophoresis at 4 °C in the TBM buffer (4042). The labeled ssDNA was monomeric after incubation with or without NC and analyzed under the electrophoretic conditions described above (Fig. 2). Therefore, the annealing assays and the structural analyses were not complicated by the presence of dimeric forms of ssDNA.

FIGURE 2.

FIGURE 2.

Analysis of ssDNA dimerization. The 5′-end labeled ssDNA in the absence (lanes C1 and C2) or presence of NC (lanes 4–6) was incubated at 37 °C as described under ”Experimental Procedures.“ The samples were analyzed by 2% agarose gel electrophoresis at 4 °C in TBM buffer and visualized by autoradiography. Den, heat-denatured ssDNA. Lane C1, control without phenol/chloroform extraction. Lane C2, control with phenol/chloroform extraction. The protein to nucleotide molar ratios were 1:18 (lane 4), 1:12 (lane 5), and 1:7 (lane 6). Lane M, size marker (356 nucleotides). Monomeric form of ssDNA is indicated by m (178 nucleotides).

Annealing of ssDNA to the 3′-R Region of the Genome

As mentioned under the Introduction, the annealing reaction between the full-length ssDNA and the 3′-end of the genomic RNA has not yet been studied. Here, we investigated the annealing process using NC, the full-length ssDNA, and RNA Δ3′ representing the 3′-end of the genomic RNA (Fig. 1B). The annealing assays were performed as described under “Experimental Procedures.” Annealing of RNA Δ3′ to the wild-type ssDNA was barely detectable after 120 min incubation in the absence of NC (Fig. 3A, lane C1). Because the genomic RNA and ssDNA are probably folded and associated with NC during the first strand transfer in vivo, RNA Δ3′ and ssDNA were renatured in the presence of NC before being mixed together. Annealing time courses were performed at low (Fig. 3, A–C) and high (Fig. 3, D–F) NC concentrations. Annealing of RNA Δ3′ to the wild-type ssDNA was significantly increased in the presence of NC at a protein to nucleotide molar ratio of 1:18 (Fig. 3A). The average yield of heteroduplex product was 43% after 120 min incubation (Fig. 3G, black circles). NC at a protein to nucleotide molar ratio of 1:7 facilitated greatly the annealing of RNA Δ3′-wild-type ssDNA (Fig. 3D). The average yield of the heteroduplex product was 75% after 120 min incubation (Fig. 3H, black circles). These results show that the wild-type ssDNA synthesized and renatured in vitro is competent for the annealing reaction and the chaperone activity of NC is required for efficient annealing of ssDNA to the 3′-end of the genomic RNA.

FIGURE 3.

FIGURE 3.

Mutational analysis of ssDNA-RNA Δ3′ annealing in the presence of NC. The NC-mediated annealing assays were performed as described under ”Experimental Procedures.“ The samples were analyzed by 2% agarose gel electrophoresis at 25 °C in the TBE buffer. A–C, lanes C1, the 5′-end labeled ssDNA (ssDNA*) was incubated with RNA Δ3′ for 120 min in the absence of NC. Lanes 1–7, ssDNA* and RNA Δ3′ were incubated for 1, 3, 9, 15, 30, 60, or 120 min in the presence of NC at a protein to nucleotide molar ratio of 1:18. The monomeric and heteroduplex forms of ssDNA are indicated by m and hd, respectively. D-F, lanes C2, ssDNA* was incubated for 120 min in the presence of NC at a protein to nucleotide molar ratio of 1:7. Lanes 1–7, ssDNA* and RNA Δ3′ were incubated for 1, 3, 9, 15, 30, 60, or 120 min in the presence of NC at a protein to nucleotide molar ratio of 1:7. Lane M, size marker (356 nucleotides). High molecular weight complexes are indicated by hmw. G and H, percentage of heteroduplex in the presence of NC at a protein to nucleotide molar ratio of 1:18 (G) or 1:7 (H). All experiments were repeated at least three times and error bars show standard deviations. Black circle, wild-type; open triangle, SL1; open square, INV.

To determine the role of the apical loop and the lower stem of the cTAR hairpin in the NC-mediated annealing process, two mutants were investigated (Fig. 1C). The SL1 mutant was designed such that the cTAR domain in ssDNA can form the stem-loop structure, but its apical loop cannot base pair with the apical loop of the TAR RNA hairpin. The INV mutant was designed such that the cTAR domain in ssDNA can form only the Y conformation and its lower part cannot base pair with the lower part of the TAR RNA hairpin. A band of very weak intensity migrating between the monomer and heterodimer forms was observed with the two mutants (Fig. 3, B and C). The position of this band is consistent with a heteroduplex that is shorter than the full-length heteroduplex. A likely explanation is that spontaneous cleavage occurred in a very small fraction of RNA Δ3′ molecules at the level of mismatched sites during the annealing process, mutant ssDNAs were thus annealed to a truncated form of RNA Δ3′. Surprisingly, the rate of annealing in the presence of NC at a protein to nucleotide molar ratio of 1:18 was significantly increased with the mutants, compared with the wild-type (Fig. 3, A-C and G). The average yield of heteroduplex products was at least 65% after 120 min incubation with these mutants (Fig. 3G). The mfold program predicts that the cTAR hairpin is more stable in the INV mutant than in the wild-type (Fig. 1C). However, the results suggest that mutations in the apical loop and the lower part of the cTAR hairpin facilitated the annealing process by decreasing the stability of mutant ssDNAs. In other words, the ssDNA structure of mutants was more destabilized in the presence of a low concentration of NC than that of the wild-type. Consistent with this notion, the wild-type and mutants exhibited similar kinetic profiles in the presence of a high concentration of NC (Fig. 3H). High molecular weight complexes were clearly observed with the mutants (Fig. 3, E and F), suggesting formation of DNA-RNA multimers composed of at least three molecules. Taken together, the results show that NC-mediated annealing of ssDNA to RNA Δ3′ does not rely on a single pathway (kissing or zipper). The results are also consistent with the notion that the NC-mediated annealing process could be initiated through complementary sequences that do not involve the apical loop and the 3′/5′ termini of the cTAR and TAR hairpins.

Strategy for Secondary Structure Determination of ssDNA

To determine the secondary structure of the full-length wild-type ssDNA, we used structural probes and the mfold program (43). Thus, the full-length ssDNA was probed with DNase I, mung bean nuclease (MBN) and potassium permanganate (KMnO4). DNase I is a double-strand specific endonuclease that produces single-strand nicks (44). MBN is highly selective for single-stranded nucleic acids and single-stranded regions in double-stranded nucleic acids (45). Note that nucleotide mismatches in double-stranded DNA are poor substrates for MBN cleavage at 37 °C (45, 46). Due to the bulky size of DNase I and MBN, the absence of cuts in the folded DNA structure by these enzymes may also be a result of steric hindrance. In contrast, KMnO4 is a small probe that can be used to detect all regions in the folded DNA structure that are unpaired or distorted (47, 48): it is an oxidizing agent that preferentially attacks the 5,6-double bond of thymine. In B-DNA, this bond is shielded by base stacking interactions and, thus, the T residues in such DNA duplexes are relatively resistant to oxidation. After treatment of ssDNA with piperidine, the DNA backbone was cleaved at the site of the modified thymines. The cleavage fragments generated by the nucleases and the KMnO4/piperidine treatment were analyzed by electrophoresis on denaturing polyacrylamide gels as described under “Experimental Procedures.” Running Maxam-Gilbert sequence markers of ssDNA on the same gels in parallel allowed the identification of cleavage sites. Note that the nucleases cleave the phosphodiester bond and generate a 3′-hydroxyl terminus on 5′-end labeled DNA. In contrast, the KMnO4/piperidine treatment and Maxam-Gilbert reactions generate a 3′-phosphorylated terminus on 5′-end labeled DNA (38, 49). The electrophoretic mobility of Maxam-Gilbert sequence markers is therefore slightly greater than that of fragments produced by nucleases. This slight difference is observable only with short DNA fragments.

Analysis of the ssDNA Secondary Structure in the Absence of NC

Representative examples of probing experiments without NC are shown in Fig. 4. The results of a series of independent experiments are summarized in the secondary structure of ssDNA that is the most consistent with the probing data (Fig. 5A). In the absence of NC, the DNase I cleavage pattern (Fig. 4, A and B) is consistent with the secondary structure that we propose for ssDNA (Fig. 5A). Indeed, except moderate DNase I cleavage between G97-C98 occurring in an apical loop (Fig. 4B), all moderate and strong DNase I cleavages occurred within stems or at the ends of stems (Fig. 5A). Moderate DNase I cleavage between G97-C98 suggests the existence of transient base pairs in the apical loop. The ssDNA secondary structure is also supported by the MBN cleavage pattern (Fig. 4, C and D) because all moderate and strong MBN cleavages occurred within single-stranded regions and loops (Fig. 5A). Finally, the KMnO4 probing results are also consistent with our secondary structure model. Thus, except the weak sensitivity of T115 to KMnO4, all paired thymine bases within the stems were barely (T5, T41, T45, T92, T108, T114, T130, and T131) or not reactive (T43, T80, T83, T84, T122, T154, T160, and T165) to KMnO4 (Fig. 4, E and F). The weak sensitivity of T115 to KMnO4 suggests that the G85·T115 mismatched base pair is not involved in strong stacking interactions with the surrounding base pairs. Consistent with this notion, Gogos et al. (50) showed that KMnO4 interacts with the thymine residue of T·G mismatches. Residues T11, T32, T134, and T146, which are at the ends of stems or adjacent to a mismatch, exhibited moderate sensitivity to KMnO4 because at least one side of the plane of the heterocyclic ring of these thymine residues is exposed. As expected, the reactivity level of all unpaired thymine bases toward KMnO4 was high or moderate.

FIGURE 4.

FIGURE 4.

Analysis of the ssDNA secondary structure in the absence of NC. Probing experiments were performed and analyzed by electrophoresis on denaturing 14% (A, C, and E), 6% (B), and 8% (D and F) polyacrylamide gels as described under ”Experimental Procedures.“ A and B, the 5′-end labeled ssDNA was incubated with DNase I (0.1 and 0.15 units, lanes 1 and 2). C and D, the 5′-end labeled ssDNA was incubated with mung bean nuclease (MBN) (0.125, 0.25, and 0.5 units, lanes 1–3). E and F, the 5′-end labeled ssDNA was incubated with KMnO4 (0.25, 0.5, 0.75, and 1 mm, lanes 1–4). Lanes Ct are controls without any treatment. Lane Cpip is the control without KMnO4 treatment but with piperidine treatment. T+C and G+A refer to Maxam-Gilbert sequence markers. Arrows indicate the cleavage sites and the reactive thymine residues. Schematic of the ssDNA is shown alongside the gel, indicating the u5, cpoly(A), and cTAR domains.

FIGURE 5.

FIGURE 5.

Secondary structure for ssDNA in the absence (A) and presence (B) of NC. The color code used for the reactivities is indicated in the insets. ΔG values were predicted by mfold (43). The asterisks indicate protections induced by NC at the protein to nucleotide molar ratio of 1:18. The double asterisks indicate strong protections induced by NC at the protein to nucleotide molar ratio of 1:18. The stars indicate the cleavage sites and the thymine residues where the reactivity is increased by NC at the protein to nucleotide molar ratio of 1:18.

Analysis of the ssDNA Secondary Structure in the Presence of NC

To identify destabilized regions and protections induced by NC in ssDNA, we compared the enzymatic and KMnO4 probing patterns of ssDNA in the absence or presence of increasing concentrations of NC (Fig. 6). The results of a series of independent experiments are summarized in the secondary structure of ssDNA that is the most consistent with the probing data (Fig. 5B). The DNase I cleavage patterns in the absence and presence of NC were similar (Fig. 6, A and B), indicating that NC did not change deeply the secondary structure of ssDNA. However, DNase I cleavage at the level of almost all sites increased in the presence of NC at a protein to nucleotide molar ratio of 1:18. A likely explanation is that binding of NC to single-stranded regions changed the three-dimensional folding of ssDNA and therefore the accessibility of double-stranded regions. In the presence of NC at a protein to nucleotide molar ratio of 1:3.5, DNase I cleavage at the level of all sites decreased strongly (Fig. 6, A and B, lanes 5). These results are consistent with the notion that NC at high concentrations binds the double-stranded regions of nucleic acids nonspecifically through electrostatic interactions of the basic residues with the phosphodiester backbone (18, 36, 51). Interestingly, NC at a protein to nucleotide molar ratio of 1:18 induced specific protection against DNase I at the level of C129 (Fig. 6B, lane 2). This suggests that the nucleotides encompassing residue C129 constitute a strong binding site for NC or undergo a local rearrangement upon NC binding.

FIGURE 6.

FIGURE 6.

Analysis of the ssDNA secondary structure in the presence of NC. Probing experiments were performed and analyzed by electrophoresis on denaturing 14% (A, C, and E), 6% (B), and 8% (D and F) polyacrylamide gels as described under ”Experimental Procedures.“ A and B, the 5′-end labeled ssDNA was incubated with 0.1 unit of DNase I in the absence (lanes 1) or presence of NC (lanes 2–5). C and D, the 5′-end labeled ssDNA was incubated with 0.25 unit of mung bean nuclease (MBN) in the absence (lanes 1) or presence of NC (lanes 2–5). MBN cleavage sites between positions 166 and 170 were well characterized by long migration times on a denaturing 6% polyacrylamide gel. E and F, the 5′-end labeled ssDNA was incubated with 0.5 mm KMnO4 in the absence (lanes 1) or presence of NC (lanes 2–5). Lanes Ct are controls without any treatment. Lanes Cpip are controls without KMnO4 treatment but with piperidine treatment. The protein to nucleotide molar ratios were 1:18 (lanes 2), 1:12 (lanes 3), 1:7 (lanes 4), and 1:3.5 (lane 5). The asterisks indicate protections induced by NC at the protein to nucleotide molar ratio of 1:18. The double asterisks indicate strong protections induced by NC at the protein to nucleotide molar ratio of 1:18. The stars indicate the cleavage sites and the thymine residues where the reactivity is increased by NC at the protein to nucleotide molar ratio of 1:18. Schematic of the ssDNA is shown alongside the gel, indicating the u5, cpoly(A), and cTAR domains.

MBN cleavage at the level of 14 unpaired nucleotides (T15, A16, C17, T22, T24, A25, A27, A50, C52, A53, A59, A61, A63, and A64) did not change in the presence of NC at a protein to nucleotide molar ratio of 1:18 (Fig. 6, C and D, lanes 2), indicating that these unpaired nucleotides did not constitute strong binding sites for NC. In contrast, MBN cleavage at the level of 17 unpaired nucleotides (T34, A36, A66, A68, T69, C73, A74, A76, C89, A90, A99, A100, T105, A106, C128, A157, and C158) decreased in the presence of NC at a protein to nucleotide molar ratio of 1:18, showing that these unpaired nucleotides were less accessible to MBN. The strongest protections occurred at the level of T34, A68, T69, C73, A74, A76, A99, and C128, suggesting that these unpaired nucleotides constitute strong binding sites for NC. In the presence of NC at a protein to nucleotide molar ratio of 1:3.5, MBN cleavage at the level of all sites decreased strongly (Fig. 6, C and D, lanes 5), suggesting that the single-stranded regions were almost completely covered by NC. MBN cleavage at the level of G38, A94, A95, A141, C148, and C149 increased in the presence of NC at a protein to nucleotide molar ratio of 1:18. As mentioned above, it is likely that binding of NC to single-stranded regions changes the three-dimensional folding of ssDNA and therefore the accessibility of these nucleotides. MBN cleavage at the level of A40 was slightly increased with NC at a protein to nucleotide molar ratios of 1:18, suggesting that NC destabilized the T11-A40 base pair that is adjacent to the internal loop. Surprisingly, NC induced moderate MBN cleavages at the level of residues A116, C138, and A139 (Fig. 6D, lane 2) that are within stems (Fig. 5B). The nucleotides encompassing these MBN cleavage sites may adopt alternative conformations. However, DNase I and KMnO4 probing data do not support this hypothesis. A possible explanation is that binding of NC to the G85·T115 mismatched base pair and the internal loop (sequence between 156 and 158) increased the “breathing” process of adjacent base pairs. Note that MBN can cleave double-stranded DNA within AT-rich regions that exhibit “structural breathing” (45, 52). Interestingly, NC at a protein to nucleotide molar ratio of 1:18 induced a strong increase in MBN sensitivity for residues A166, A167, A169, and A170, indicating that these nucleotides were unpaired (Fig. 6D, lane 2).

As previously reported (29), KMnO4 is expected to interact with the nucleobases of unpaired thymine residues that interact with the N-terminal zinc finger of NC. Indeed, a close examination of the structures of the NC·DNA complexes (53, 54) shows that the 5,6-double bond of the nucleobase thymine in these complexes is accessible to KMnO4. Therefore, in the presence of NC at a protein to nucleotide molar ratio of 1:18, the sensitivity of thymine residues to KMnO4 did not decrease (Fig. 6, E and F, lanes 2). Unpaired residues T12, T13, T14, T15, T34, T48, T49, T69, T126, and T127 became more reactive to KMnO4 in the presence of NC (Fig. 5B), suggesting that these thymine residues were involved in transient and weak base stacking interactions that were destabilized by NC. Consistent with the formation of stable stems, T43, T45, T80, T83, T84, T92, T107, T108, T114, T122, T160, and T165 did not become more reactive to KMnO4 in the presence of NC (Fig. 5B). NC induced a slight increase in KMnO4 sensitivity for residues T5, T41, and T154, indicating that at least one side of the plane of the heterocyclic ring of each thymine residue was exposed. This suggests that NC increased the fraying of the adjacent base pairs that are located at the ends of stems (Fig. 5B). Consistent with this notion, T11, T29, and T32 became more reactive to KMnO4 in the presence of NC. Residue T115 became markedly reactive to KMnO4 in the presence of NC (Fig. 6F), indicating that NC increased the accessibility of the G85·T115 mismatched base pair. Interestingly, NC induced a strong increase in KMnO4 sensitivity for residues T130, T131, and T134 that are within the lower stem of the cTAR stem-loop (Figs. 5A and 6F). These results associated with the MBN probing data strongly suggest that the lower stem of the cTAR hairpin is open in the presence of NC (Fig. 5B).

Conservation of the ssDNA Sequence and Structure

Our study was performed with the ssDNA sequence of the MAL isolate. The HIV-1 NL4-3 isolate has been extensively used in studies dealing with the reverse transcription process (see Ref. 2 and references therein). Interestingly, ssDNA of the HIV-1 NL4-3 isolate can adopt a secondary structure that is close to that of the HIV-1 MAL isolate (Fig. 7). To provide phylogenetic support for the secondary structure of ssDNA in the presence of NC, we searched for its conservation among different HIV-1 groups. First, we determined a consensus sequence for each HIV-1 group, using the LANL HIV subtype reference database. The sequences retrieved from the QuickAlign procedure were used to calculate a position weight matrix from which a sequence logo was derived for each HIV-1 group (supplemental Fig. S1). A sequence logo is drawn such that the overall height of the stack indicates the information content at each position (derived from the Shannon entropy), whereas the height of the base symbols within the stack indicates their relative frequency (39, 55). The information content, which can reach a maximal theoretical value of 2 bits for nucleic acids, is usually interpreted as an indicator of evolutionary conservation. The ssDNA sequence is relatively well conserved among different isolates and groups of HIV-1 (supplemental Fig. S1). Nucleotides 15–30 (numbering is relative to HIV-1 group M) located in the u5 domain correspond to the less conserved part of ssDNA. However, this domain displays highly conserved sequences between positions 1–14, 33–48, and 73–80. There are also highly conserved sequences between positions 97–112 and 115–122 in the cpoly(A) domain. In the cTAR domain, the most highly conserved sequences correspond to nucleotides 144–151, 153–157, 163–166, and 177–181. Conservation of residues G36, G71, and G102 supports the hypothesis that these nucleotides are within the high-affinity binding sites for NC. The sequence logos were used to predict the ssDNA secondary structures of HIV-1 groups M, N, and O (Fig. 7). Only one full-length sequence was available to predict the ssDNA secondary structure of HIV-1 group P. Interestingly, the ssDNA sequences of four groups can fold into secondary structures that are close to that of the HIV-1 MAL isolate. In all secondary structures, there are three putative strong NC binding sites that are located in single-stranded regions containing at least a guanine residue. In HIV-1 groups M, N, and O, the putative site involving residue G36 is not in a loop. In HIV-1 group M, the base pairing interaction between nucleotides C18-A19 and T35-G36 is probably not stable because it is between an internal loop and a A-A mismatch. Therefore, the TG motif could be recognized by NC. In HIV-1 groups N and O, the strong NC binding site could be in the u5 apical loop containing a TG motif.

FIGURE 7.

FIGURE 7.

Proposed secondary structures for the ssDNAs of different isolates and groups of HIV-1. The mfold program (43) was used to predict the secondary structures. The ssDNA secondary structures of HIV-1 groups M, N, and O were generated from the sequence logos presented in supplemental Fig. S1. For HIV-1 group P, only one full-length ssDNA sequence is available (accession number GU111555). The putative high-affinity binding sites for NC are highlighted in purple.

Discussion

The first strand transfer is essential for HIV-1 replication (1). This transfer requires a base pairing interaction between the R sequence at the 3′-end of the genomic RNA and the r sequence at the 3′-end of the ssDNA (Fig. 1A). Therefore, the HIV-1 full-length ssDNA (176–181 nucleotides in length) is a key actor in the annealing process leading to the first strand transfer. The mechanism of annealing has been investigated using model nucleic acid substrates containing at least portions of the complementary R/r sequences. The first model systems were based on annealing of an RNA molecule (81 or 148 nucleotides representing a portion of the 3′-end of the genomic RNA) to a DNA molecule (81 or 131 nucleotides) corresponding to a portion of the 3′-end of the ssDNA (23, 56). Short DNAs representing the cTAR sequence (55 or 59 nucleotides) or the mini-cTAR sequence (26 or 27 nucleotides) were used in many studies investigating the annealing process (18, 20, 2832, 57). Only one study examined the secondary structure of ssDNA using truncated ssDNAs (50, 63, and 128 nucleotides) (58). Thus, all in vitro studies dealing with the ssDNA structure and the annealing process were performed with single-stranded DNAs representing truncated forms of ssDNA. Therefore, there is no data showing that the cTAR and cpoly(A) sequences form the predicted long stem-loops (19) in the context of the full-length ssDNA. Moreover, the mfold program (43) predicts that some of the most stable secondary structures of ssDNA (MAL and NL4-3 isolates) do not form the TAR stem-loop structure (data not shown). Our study is the first to investigate the secondary structure of the full-length ssDNA and its interaction with the 3′-end of the genomic RNA.

We used structural probes (DNase I, mung bean nuclease, and potassium permanganate) to directly determine the secondary structure of ssDNA in the absence or presence of NC. Our probing data support the folding of ssDNA into three stem-loop structures in the absence of NC (Fig. 5A). Note that a large part of the u5 sequence (29 consecutive nucleotides) is single-stranded. It is important to emphasize that the r region (sequences cpoly(A) and cTAR) does not fold into an independent domain, because six base pairs are formed between the u5 and cpoly(A) sequences (Fig. 8). Our results show that the cpoly(A) and the cTAR hairpins are formed in the full-length ssDNA. Previous studies showed that short DNAs corresponding to the cTAR sequence (55 or 59 nucleotides) adopt two alternative hairpin conformations in the absence of NC: the closed form and the partially open Y form (20, 27, 29). Interestingly, our probing data indicate that the cTAR sequence in ssDNA adopts the partially open Y form but not the closed form (conformer 1 in Fig. 8). Consistent with this finding, the mfold program predicts that the ssDNA containing the Y form is more stable than the ssDNA containing the closed form (ΔGY = −11.43 versus ΔGC = −7.54 kcal/mol). Furthermore, the three-dimensional folding of ssDNA could prevent the formation of the closed form. Taken together the data indicate that the cTAR DNA model adopts a secondary structure (closed form) that is not formed in ssDNA.

FIGURE 8.

FIGURE 8.

Proposed model for conformational switch of ssDNA induced by NC binding. ΔG values were predicted by the mfold program (43). The high-affinity binding sites for NC are highlighted in purple.

Interestingly, a low concentration of NC (protein to nucleotide molar ratio of 1:18) increases the accessibility of the cpoly(A) and cTAR domains to DNase I (Fig. 5B), suggesting that the three-dimensional folding of ssDNA is destabilized upon binding of NC to its strong sites. Destabilization of the three-dimensional structure of ssDNA could be a first step in the NC-mediated annealing process. In other words, NC could facilitate interaction of the r sequence of ssDNA with the R sequence of the genomic RNA by increasing the accessibility of these complementary sequences. Our probing data show that NC does not change deeply the secondary structure of ssDNA. Thus, the global architecture of the u5 and cpoly(A) stem-loops is not altered in the presence of NC. The KMnO4 and MBN reactivities indicate that NC increases the fraying of base pairs that are located at the ends of stems in the u5 stem-loop. There is no evidence that a population of ssDNA molecules is totally melted in the presence of NC at a high concentration (protein to nucleotide molar ratio of 1:3.5). Indeed, T43, T80, T83, T84, T122, T160, and T165 were unreactive to KMnO4 (Fig. 6, E and F). However, the lower stem of the cpoly(A) hairpin is probably destabilized during the NC-mediated annealing process, i.e. in the presence of the poly(A) RNA hairpin. Furthermore, the long stem (11 base pairs) of the cpoly(A) hairpin should be destabilized to allow synthesis of full-length plus-strand DNA. The G85·T115 mismatched base pair is probably involved in DNA helix destabilization by NC, because the protein increases the reactivity of residue T115 to KMnO4 (Fig. 5B). Godet et al. (59) showed that mismatched base pairs play an important role in DNA helix destabilization by NC.

Interestingly, the cTAR region undergoes a significant conformational change upon NC binding. Indeed, the lower part of the cTAR hairpin is more open in the presence of NC than in its absence (Fig. 8). This open form is supported by strong MBN cleavage sites between positions 166 and 171 and the reactivities of T126, T127, T130, T131, and T134 toward KMnO4 (Fig. 5B). These latter MBN data indicate that the 3′-end of cTAR is accessible and could therefore interact with TAR. Consistent with the loss of four base pairs in the most open form, C129 is a weak DNase I cleavage site in the presence of NC (Fig. 5B), whereas it is a moderate DNase I cleavage site in its absence (Fig. 5A). Our results are in agreement with the hypothesis that NC induces the O2 conformation of the cTAR hairpin exhibiting 22 unpaired nucleotides in its lower part (20). This conformation is consistent with the hypothesis that the 3′/5′ termini of the cTAR and TAR hairpins form a zipper intermediate that initiates the annealing reaction (30, 32). However, we showed that mutations preventing the formation of the zipper intermediate do not affect the annealing of ssDNA to the 3′-end of the genomic RNA (Fig. 3). Mutations within the first 10 nucleotides of the 5′ R sequence, i.e. the 10 last nucleotides of the cTAR sequence, produce virions that are markedly defective for reverse transcription (12). Our results suggest that this reverse transcription defect is not due to inhibition of the annealing reaction. A kissing complex formed by the apical loops of TAR and cTAR could initiate the annealing reaction (18, 19, 29, 30). The accessibility of the cTAR apical loop to MBN increases in the presence of NC (Fig. 5B), indicating that this loop should be more accessible to the TAR apical loop. However, we show that mutations preventing the loop-loop interaction do not affect the annealing process (Fig. 3). Finally, our study suggests that the NC-mediated annealing process does not rely on a single pathway (zipper intermediate or kissing complex). It is likely that the virus uses the two pathways and perhaps a third involving the poly(A) and cpoly(A) hairpins. Indeed, our results are consistent with the “acceptor invasion” model proposed by Bambara and co-workers (60, 61). In this model, RT pausing at the base of the 5′ TAR hairpin initiates RNases H cleavages in the 5′ poly(A) hairpin creating a gap for the invasion of the 3′ poly(A) hairpin (acceptor RNA) and interaction with the cpoly(A) hairpin. This initiates the strand exchange by a branch migration process until the 3′ terminus of ssDNA is fully transferred.

Identification of NC binding sites in the full-length ssDNA contributes to an understanding of the annealing mechanism. To identify the ssDNA nucleotides coated by NC, we investigated the NC-induced protection of ssDNA to enzymatic digestion. To this end, we performed a footprinting analysis, using MBN and DNase I (Fig. 6, A-D). The DNase I and MBN probing data indicate that ssDNA is almost completely covered by NC at a high concentration (protein to nucleotide molar ratio of 1:3.5). NC at a low concentration (protein to nucleotide molar ratio of 1:18) induced the strongest protections against MBN at the level of five sites (T34, A68-T69, 73C-A-C-A76, A99, and C128). The internal loop (nucleotides 156–158) of the cTAR domain was also protected by NC, but the protection was lower than the five protections mentioned above. In a previous report, we also showed that this internal loop is a binding site for NC in a short DNA (55 nucleotides) that folds into the cTAR hairpin (29). Protection at the level of the 73C-A-C-A76 sequence is probably not due to a direct interaction with NC, because this sequence does not display the features of a NC binding site. In contrast, four of the five protected sites are within single-stranded regions containing at least a guanine residue or the TG motif that are preferential binding sites for NC (6265). Note that the binding site size of NC is 5 to 8 nucleotides (24). Taken together, the data support the model presented in Fig. 8, in which NC interacts strongly with four sites at a protein to nucleotide molar ratio of 1:18. Our study is the first to identify the high-affinity binding sites for NC in the full-length ssDNA. Interestingly, a strong binding site for NC containing residue G132 is located between the cpoly(A) and cTAR hairpins. A recent study performed with a modified version of the cTAR sequence (57 nucleotides) also suggests that residue G132 is preferentially recognized by NC (59). In contrast to this study, we did not find that residue G171 is a strong binding site for NC. Therefore, our results show that the properties of the cTAR DNA model are not identical to those of the cTAR domain in the full-length ssDNA. Our probing data are consistent with the notion that the cTAR region is more open in the presence of NC than in its absence (Fig. 5, compare A and B). We propose that the great majority of ssDNA molecules in the absence of NC fold into conformer 1 and the minority into conformer 2 (Fig. 8). An attractive hypothesis is that NC binding to the 129C-T-T-G-C133 sequence in conformer 2 shifts the equilibrium toward this conformer (Fig. 8).

Phylogenetic analysis supports the notion that ssDNAs of different HIV-1 groups can adopt similar secondary structures containing three stem-loops and three single-stranded regions (Fig. 7). Because the base pairing interaction between nucleotides 4–11 and 41–58 (numbering is relative to HIV-1 group M) is completely conserved (supplemental Fig. S1), it may play a key role during synthesis of minus- and plus-strand DNAs. Note that the counterpart RNA of this base pairing interaction, the primer binding site 3 helix, is also conserved and may be required for efficient initiation of reverse transcription from the tRNA3Lys primer (66). Consistent with the hypothesis that the first strand transfer is facilitated by two loop-loop interactions (19), the upper parts of the cpoly(A) (nucleotides 92–112) and cTAR (139–165) hairpins are highly conserved (Fig. 7). The TAR and poly(A) RNA hairpins play important roles in HIV-1 transcription and polyadenylation at the 3′-end of HIV-1 mRNAs, respectively (15, 16). Therefore, we cannot exclude the possibility that conservation of the cTAR and cpoly(A) DNA hairpins represent conservation at the level of RNA structure, or both. Indeed, Berkhout (67) showed that the upper part of the TAR RNA hairpin is well conserved and its group also showed conservation of the poly(A) RNA hairpin (16). Consistent with these findings, there is a strong evolutionary pressure to restore proper folding of the TAR and poly(A) hairpins upon mutational disruption (68, 69). An attractive hypothesis, therefore, is that the TAR and poly(A) hairpins have been conserved at both levels, DNA and RNA, for optimal HIV-1 replication. NC is a weak duplex destabilizer that cannot destabilize long fully base paired nucleic acid helices, it can destabilize short base paired regions (4–8 base pairs) flanked by duplex ends, loops, bulges, or mismatches (24, 70). In agreement with the weak duplex destabilizing activity of NC, the stems of ssDNAs do not exceed eight consecutive Watson-Crick base pairs (Fig. 7). This suggests a coevolutionary relationship between the ssDNA sequence and the NC activity to promote the first strand transfer and synthesis of full-length plus-strand DNA. Interestingly, previous studies support the notion of a coevolutionary relationship between the cTAR structure and the NC activity (59, 71). Finally, our phylogenetic analysis suggests conservation of the four high-affinity binding sites for NC that we identified in ssDNA of the MAL isolate.

Our study is the first to determine the secondary structure of the full-length HIV-1 ssDNA in the absence or presence of NC. To our knowledge, the secondary structures of ssDNAs of other retroviruses have not been determined. Our results support a model for the NC-mediated annealing process in which the first step is the preferential binding of NC to four sites. These binding events trigger unfolding of the three-dimensional structure of ssDNA, thus facilitating interaction of the r sequence of ssDNA with the R sequence of the genomic RNA. The second step is simultaneous molecular crowding of nucleic acids and weak destabilization of secondary structures leading to the DNA-RNA heteroduplex. Finally, our results suggest that initiation of the annealing process does not rely on a single pathway (zipper intermediate or kissing complex).

Author Contributions

P. F. conceived and designed the experiments. Y. C. and O. M. performed all experiments presented in the paper. F. C. and J. B. were involved in plasmids and RNAs preparation. P. F., Y. C., and O. M. analyzed the results of experiments. P. F. and J. G. performed phylogenetic analysis. P. F., O. M., B. R., and Y. M. wrote the paper.

Supplementary Material

Supplemental Data
*

This work was supported in part by Centre National de la Recherche Scientifique LBPA UMR 8113, Agence Nationale de Recherche sur le Sida, and the European Project THINPAD “Targeting the HIV-1 Nucleocapsid Protein to fight Antiretroviral Drug Resistance” FP7 Grant 601969. The authors declare that they have no conflicts of interest with the contents of this article.

Inline graphic

This article contains supplemental Fig. S1.

5
The abbreviations used are:
HIV-1
human immunodeficiency virus type 1
KMnO4
potassium permanganate
MBN
mung bean nuclease
NC
nucleocapsid protein
poly(A)
polyadenylation signal
cpoly(A)
complementary DNA sequence to poly(A)
R
terminal repeat
r
complementary DNA sequence to R
ssDNA
minus-strand strong-stop DNA
TAR
transactivator response element
cTAR
complementary DNA sequence to TAR
U5
sequence unique at the 5′-end
u5
complementary DNA sequence to U5
U3
sequence unique at the 3′-end.

References

  • 1. Basu V. P., Song M., Gao L., Rigby S. T., Hanson M. N., and Bambara R. A. (2008) Strand transfer events during HIV-1 reverse transcription. Virus Res. 134, 19–38 [DOI] [PubMed] [Google Scholar]
  • 2. Levin J. G., Mitra M., Mascarenhas A., and Musier-Forsyth K. (2010) Role of HIV-1 nucleocapsid protein in HIV-1 reverse transcription. RNA Biol. 7, 754–774 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Darlix J. L., Vincent A., Gabus C., de Rocquigny H., and Roques B. (1993) Trans-activation of the 5′ to 3′ viral DNA strand transfer by nucleocapsid protein during reverse transcription of HIV1 RNA. C. R. Acad. Sci. III 316, 763–771 [PubMed] [Google Scholar]
  • 4. Peliska J. A., Balasubramanian S., Giedroc D. P., and Benkovic S. J. (1994) Recombinant HIV-1 nucleocapsid protein accelerates HIV-1 reverse transcriptase catalyzed DNA strand transfer reactions and modulates RNase H activity. Biochemistry 33, 13817–13823 [DOI] [PubMed] [Google Scholar]
  • 5. Guo J., Henderson L. E., Bess J., Kane B., and Levin J. G. (1997) Human immunodeficiency virus type 1 nucleocapsid protein promotes efficient strand transfer and specific viral DNA synthesis by inhibiting TAR-dependent self-priming from minus-strand strong-stop DNA. J. Virol. 71, 5178–5188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Chen Y., Balakrishnan M., Roques B. P., and Bambara R. A. (2003) Steps of the acceptor invasion mechanism for HIV-1 minus-strand strong stop transfer. J. Biol. Chem. 278, 38368–38375 [DOI] [PubMed] [Google Scholar]
  • 7. Dang Q., and Hu W. S. (2001) Effects of homology length in the repeat region on minus-strand DNA transfer and retroviral replication. J. Virol. 75, 809–820 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Berkhout B., van Wamel J., and Klaver B. (1995) Requirements for DNA strand transfer during reverse transcription in mutant HIV-1 virions. J. Mol. Biol. 252, 59–69 [DOI] [PubMed] [Google Scholar]
  • 9. Lobel L. I., and Goff S. P. (1985) Reverse transcription of retroviral genomes: mutations in the terminal repeat sequences. J. Virol. 53, 447–455 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Ramsey C. A., and Panganiban A. T. (1993) Replication of the retroviral terminal repeat sequence during in vivo reverse transcription. J. Virol. 67, 4114–4121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Klaver B., and Berkhout B. (1994) Premature strand transfer by the HIV-1 reverse transcriptase during strong-stop DNA synthesis. Nucleic Acids Res. 22, 137–144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Ohi Y., and Clever J. L. (2000) Sequences in the 5′ and 3′ R elements of human immunodeficiency virus type 1 critical for efficient reverse transcription. J. Virol. 74, 8324–8334 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Guo J., Wu T., Bess J., Henderson L. E., and Levin J. G. (1998) Actinomycin D inhibits human immunodeficiency virus type 1 minus-strand transfer in in vitro and endogenous reverse transcriptase assays. J. Virol. 72, 6716–6724 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Jeeninga R. E., Huthoff H. T., Gultyaev A. P., and Berkhout B. (1998) The mechanism of actinomycin D-mediated inhibition of HIV-1 reverse transcription. Nucleic Acids Res. 26, 5472–5479 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Feng S., and Holland E. C. (1988) HIV-1 tat trans-activation requires the loop sequence within tar. Nature 334, 165–167 [DOI] [PubMed] [Google Scholar]
  • 16. Berkhout B., Klaver B., and Das A. T. (1995) A conserved hairpin structure predicted for the poly(A) signal of human and simian immunodeficiency viruses. Virology 207, 276–281 [DOI] [PubMed] [Google Scholar]
  • 17. Watts J. M., Dang K. K., Gorelick R. J., Leonard C. W., Bess J. W. Jr., Swanstrom R., Burch C. L., and Weeks K. M. (2009) Architecture and secondary structure of an entire HIV-1 RNA genome. Nature 460, 711–716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Kanevsky I., Chaminade F., Ficheux D., Moumen A., Gorelick R., Negroni M., Darlix J. L., and Fossé P. (2005) Specific interactions between HIV-1 nucleocapsid protein and the TAR element. J. Mol. Biol. 348, 1059–1077 [DOI] [PubMed] [Google Scholar]
  • 19. Berkhout B., Vastenhouw N. L., Klasens B. I., and Huthoff H. (2001) Structural features in the HIV-1 repeat region facilitate strand transfer during reverse transcription. RNA 7, 1097–1114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Bernacchi S., Stoylov S., Piémont E., Ficheux D., Roques B. P., Darlix J. L., and Mély Y. (2002) HIV-1 nucleocapsid protein activates transient melting of least stable parts of the secondary structure of TAR and its complementary sequence. J. Mol. Biol. 317, 385–399 [DOI] [PubMed] [Google Scholar]
  • 21. Darlix J. L., Godet J., Ivanyi-Nagy R., Fossé P., Mauffret O., and Mély Y. (2011) Flexible nature and specific functions of the HIV-1 nucleocapsid protein. J. Mol. Biol. 410, 565–581 [DOI] [PubMed] [Google Scholar]
  • 22. Thomas J. A., and Gorelick R. J. (2008) Nucleocapsid protein function in early infection processes. Virus Res. 134, 39–63 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. You J. C., and McHenry C. S. (1994) Human immunodeficiency virus nucleocapsid protein accelerates strand transfer of the terminally redundant sequences involved in reverse transcription. J. Biol. Chem. 269, 31491–31495 [PubMed] [Google Scholar]
  • 24. Levin J. G., Guo J., Rouzina I., and Musier-Forsyth K. (2005) Nucleic acid chaperone activity of HIV-1 nucleocapsid protein: critical role in reverse transcription and molecular mechanism. Prog. Nucleic Acid Res. Mol. Biol. 80, 217–286 [DOI] [PubMed] [Google Scholar]
  • 25. Godet J., and Mély Y. (2010) Biophysical studies of the nucleic acid chaperone properties of the HIV-1 nucleocapsid protein. RNA Biol. 7, 687–699 [DOI] [PubMed] [Google Scholar]
  • 26. Azoulay J., Clamme J. P., Darlix J. L., Roques B. P., and Mély Y. (2003) Destabilization of the HIV-1 complementary sequence of TAR by the nucleocapsid protein through activation of conformational fluctuations. J. Mol. Biol. 326, 691–700 [DOI] [PubMed] [Google Scholar]
  • 27. Hong M. K., Harbron E. J., O'Connor D. B., Guo J., Barbara P. F., Levin J. G., and Musier-Forsyth K. (2003) Nucleic acid conformational changes essential for HIV-1 nucleocapsid protein-mediated inhibition of self-priming in minus-strand transfer. J. Mol. Biol. 325, 1–10 [DOI] [PubMed] [Google Scholar]
  • 28. Liu H. W., Zeng Y., Landes C. F., Kim Y. J., Zhu Y., Ma X., Vo M. N., Musier-Forsyth K., and Barbara P. F. (2007) Insights on the role of nucleic acid/protein interactions in chaperoned nucleic acid rearrangements of HIV-1 reverse transcription. Proc. Natl. Acad. Sci. U.S.A. 104, 5261–5267 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Kanevsky I., Chaminade F., Chen Y., Godet J., René B., Darlix J. L., Mély Y., Mauffret O., and Fossé P. (2011) Structural determinants of TAR RNA-DNA annealing in the absence and presence of HIV-1 nucleocapsid protein. Nucleic Acids Res. 39, 8148–8162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Vo M. N., Barany G., Rouzina I., and Musier-Forsyth K. (2009) HIV-1 nucleocapsid protein switches the pathway of transactivation response element RNA/DNA annealing from loop-loop “kissing” to “zipper”. J. Mol. Biol. 386, 789–801 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Liu H. W., Cosa G., Landes C. F., Zeng Y., Kovaleski B. J., Mullen D. G., Barany G., Musier-Forsyth K., and Barbara P. F. (2005) Single-molecule FRET studies of important intermediates in the nucleocapsid-protein-chaperoned minus-strand transfer step in HIV-1 reverse transcription. Biophys. J. 89, 3470–3479 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Godet J., de Rocquigny H., Raja C., Glasser N., Ficheux D., Darlix J. L., and Mély Y. (2006) During the early phase of HIV-1 DNA synthesis, nucleocapsid protein directs hybridization of the TAR complementary sequences via the ends of their double-stranded stem. J. Mol. Biol. 356, 1180–1192 [DOI] [PubMed] [Google Scholar]
  • 33. de Rocquigny H., Ficheux D., Gabus C., Fournié-Zaluski M. C., Darlix J. L., and Roques B. P. (1991) First large scale chemical synthesis of the 72 amino acid HIV-1 nucleocapsid protein NCp7 in an active form. Biochem. Biophys. Res. Commun. 180, 1010–1018 [DOI] [PubMed] [Google Scholar]
  • 34. Polge E., Darlix J. L., Paoletti J., and Fossé P. (2000) Characterization of loose and tight dimer forms of avian leukosis virus RNA. J. Mol. Biol. 300, 41–56 [DOI] [PubMed] [Google Scholar]
  • 35. Darlix J. L., Gabus C., Nugeyre M. T., Clavel F., and Barré-Sinoussi F. (1990) Cis elements and trans-acting factors involved in the RNA dimerization of the human immunodeficiency virus HIV-1. J. Mol. Biol. 216, 689–699 [DOI] [PubMed] [Google Scholar]
  • 36. Lapadat-Tapolsky M., De Rocquigny H., Van Gent D., Roques B., Plasterk R., and Darlix J. L. (1993) Interactions between HIV-1 nucleocapsid protein and viral DNA may have important functions in the viral life cycle. Nucleic Acids Res. 21, 831–839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Fossé P., Motté N., Roumier A., Gabus C., Muriaux D., Darlix J. L., and Paoletti J. (1996) A short autocomplementary sequence plays an essential role in avian sarcoma-leukosis virus RNA dimerization. Biochemistry 35, 16601–16609 [DOI] [PubMed] [Google Scholar]
  • 38. Maxam A. M., and Gilbert W. (1980) Sequencing end-labeled DNA with base-specific chemical cleavages. Methods Enzymol. 65, 499–560 [DOI] [PubMed] [Google Scholar]
  • 39. Schneider T. D., and Stephens R. M. (1990) Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 18, 6097–6100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Skripkin E., Paillart J. C., Marquet R., Ehresmann B., and Ehresmann C. (1994) Identification of the primary site of the human immunodeficiency virus type 1 RNA dimerization in vitro. Proc. Natl. Acad. Sci. U.S.A. 91, 4945–4949 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Laughrea M., and Jetté L. (1996) Kissing-loop model of HIV-1 genome dimerization: HIV-1 RNAs can assume alternative dimeric forms, and all sequences upstream or downstream of hairpin 248–271 are dispensable for dimer formation. Biochemistry 35, 1589–1598 [DOI] [PubMed] [Google Scholar]
  • 42. Ali M. B., Chaminade F., Kanevsky I., Ennifar E., Josset L., Ficheux D., Darlix J. L., and Fossé P. (2007) Structural requirements for nucleocapsid protein-mediated dimerization of avian leukosis virus RNA. J. Mol. Biol. 372, 1082–1096 [DOI] [PubMed] [Google Scholar]
  • 43. Zuker M. (2003) mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31, 3406–3415 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Hampshire A. J., Rusling D. A., Broughton-Head V. J., and Fox K. R. (2007) Footprinting: a method for determining the sequence selectivity, affinity and kinetics of DNA-binding ligands. Methods 42, 128–140 [DOI] [PubMed] [Google Scholar]
  • 45. Desai N. A., and Shankar V. (2003) Single-strand-specific nucleases. FEMS Microbiol. Rev. 26, 457–491 [DOI] [PubMed] [Google Scholar]
  • 46. Till B. J., Burtner C., Comai L., and Henikoff S. (2004) Mismatch cleavage by single-strand specific nucleases. Nucleic Acids Res. 32, 2632–2641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Holstege F. C., and Timmers H. T. (1997) Analysis of open complex formation during RNA polymerase II transcription initiation using heteroduplex templates and potassium permanganate probing. Methods 12, 203–211 [DOI] [PubMed] [Google Scholar]
  • 48. Craig M. L., Tsodikov O. V., McQuade K. L., Schlax P. E. Jr., Capp M. W., Saecker R. M., and Record M. T. Jr. (1998) DNA footprints of the two kinetically significant intermediates in formation of an RNA polymerase-promoter open complex: evidence that interactions with start site and downstream DNA induce sequential conformational changes in polymerase and DNA. J. Mol. Biol. 283, 741–756 [DOI] [PubMed] [Google Scholar]
  • 49. Rubin C. M., and Schmid C. W. (1980) Pyrimidine-specific chemical reactions useful for DNA sequencing. Nucleic Acids Res. 8, 4613–4619 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Gogos J. A., Karayiorgou M., Aburatani H., and Kafatos F. C. (1990) Detection of single base mismatches of thymine and cytosine residues by potassium permanganate and hydroxylamine in the presence of tetralkylammonium salts. Nucleic Acids Res. 18, 6807–6814 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Wang H., Yeh Y. S., and Barbara P. F. (2009) HIV-1 nucleocapsid protein bends double-stranded nucleic acids. J. Am. Chem. Soc. 131, 15534–15543 [DOI] [PubMed] [Google Scholar]
  • 52. Johnson P. H., and Laskowski M. Sr. (1970) Mung bean nuclease I: II. resistance of double stranded deoxyribonucleic acid and susceptibility of regions rich in adenosine and thymidine to enzymatic hydrolysis. J. Biol. Chem. 245, 891–898 [PubMed] [Google Scholar]
  • 53. Bourbigot S., Ramalanjaona N., Boudier C., Salgado G. F., Roques B. P., Mély Y., Bouaziz S., and Morellet N. (2008) How the HIV-1 nucleocapsid protein binds and destabilises the (−)primer binding site during reverse transcription. J. Mol. Biol. 383, 1112–1128 [DOI] [PubMed] [Google Scholar]
  • 54. Bazzi A., Zargarian L., Chaminade F., Boudier C., De Rocquigny H., René B., Mély Y., Fossé P., and Mauffret O. (2011) Structural insights into the cTAR DNA recognition by the HIV-1 nucleocapsid protein: role of sugar deoxyriboses in the binding polarity of NC. Nucleic Acids Res. 39, 3903–3916 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Schneider T. D. (2002) Consensus sequence Zen. Appl. Bioinformatics 1, 111–119 [PMC free article] [PubMed] [Google Scholar]
  • 56. Guo J., Wu T., Anderson J., Kane B. F., Johnson D. G., Gorelick R. J., Henderson L. E., and Levin J. G. (2000) Zinc finger structures in the human immunodeficiency virus type 1 nucleocapsid protein facilitate efficient minus- and plus-strand transfer. J. Virol. 74, 8980–8988 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Vo M. N., Barany G., Rouzina I., and Musier-Forsyth K. (2006) Mechanistic studies of mini-TAR RNA/DNA annealing in the absence and presence of HIV-1 nucleocapsid protein. J. Mol. Biol. 363, 244–261 [DOI] [PubMed] [Google Scholar]
  • 58. Heilman-Miller S. L., Wu T., and Levin J. G. (2004) Alteration of nucleic acid structure and stability modulates the efficiency of minus-strand transfer mediated by the HIV-1 nucleocapsid protein. J. Biol. Chem. 279, 44154–44165 [DOI] [PubMed] [Google Scholar]
  • 59. Godet J., Kenfack C., Przybilla F., Richert L., Duportail G., and Mély Y. (2013) Site-selective probing of cTAR destabilization highlights the necessary plasticity of the HIV-1 nucleocapsid protein to chaperone the first strand transfer. Nucleic Acids Res. 41, 5036–5048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Song M., Balakrishnan M., Chen Y., Roques B. P., and Bambara R. A. (2006) Stimulation of HIV-1 minus strand strong stop DNA transfer by genomic sequences 3′ of the primer binding site. J. Biol. Chem. 281, 24227–24235 [DOI] [PubMed] [Google Scholar]
  • 61. Piekna-Przybylska D., and Bambara R. A. (2011) Requirements for efficient minus strand strong-stop DNA transfer in human immunodeficiency virus 1. RNA Biol. 8, 230–236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Fisher R. J., Rein A., Fivash M., Urbaneja M. A., Casas-Finet J. R., Medaglia M., and Henderson L. E. (1998) Sequence-specific binding of human immunodeficiency virus type 1 nucleocapsid protein to short oligonucleotides. J. Virol. 72, 1902–1909 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Vuilleumier C., Bombarda E., Morellet N., Gérard D., Roques B. P., and Mély Y. (1999) Nucleic acid sequence discrimination by the HIV-1 nucleocapsid protein NCp7: a fluorescence study. Biochemistry 38, 16816–16825 [DOI] [PubMed] [Google Scholar]
  • 64. Avilov S. V., Godet J., Piémont E., and Mély Y. (2009) Site-specific characterization of HIV-1 nucleocapsid protein binding to oligonucleotides with two binding sites. Biochemistry 48, 2422–2430 [DOI] [PubMed] [Google Scholar]
  • 65. Bazzi A., Zargarian L., Chaminade F., De Rocquigny H., René B., Mély Y., Fossé P., and Mauffret O. (2012) Intrinsic nucleic acid dynamics modulates HIV-1 nucleocapsid protein binding to its targets. PLoS One 7, e38905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Damgaard C. K., Andersen E. S., Knudsen B., Gorodkin J., and Kjems J. (2004) RNA interactions in the 5′ region of the HIV-1 genome. J. Mol. Biol. 336, 369–379 [DOI] [PubMed] [Google Scholar]
  • 67. Berkhout B. (1992) Structural features in TAR RNA of human and simian immunodeficiency viruses: a phylogenetic analysis. Nucleic Acids Res. 20, 27–31 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Klaver B., and Berkhout B. (1994) Evolution of a disrupted TAR RNA hairpin structure in the HIV-1 virus. EMBO J. 13, 2650–2659 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Berkhout B., Klaver B., and Das A. T. (1997) Forced evolution of a regulatory RNA helix in the HIV-1 genome. Nucleic Acids Res. 25, 940–947 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Beltz H., Piémont E., Schaub E., Ficheux D., Roques B., Darlix J. L., and Mély Y. (2004) Role of the structure of the top half of HIV-1 cTAR DNA on the nucleic acid destabilizing activity of the nucleocapsid protein NCp7. J. Mol. Biol. 338, 711–723 [DOI] [PubMed] [Google Scholar]
  • 71. Beltz H., Azoulay J., Bernacchi S., Clamme J. P., Ficheux D., Roques B., Darlix J. L., and Mély Y. (2003) Impact of the terminal bulges of HIV-1 cTAR DNA on its stability and the destabilizing activity of the nucleocapsid protein NCp7. J. Mol. Biol. 328, 95–108 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES