Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Mar 19.
Published in final edited form as: J Mol Biol. 2010 Feb 1;397(1):161–178. doi: 10.1016/j.jmb.2010.01.059

Multiple nucleotide preferences determine cleavage site recognition by the HIV-1 and M-MuLV RNases H

Sharon J Schultz 1, Miaohua Zhang 1, James J Champoux 1,*
PMCID: PMC2830385  NIHMSID: NIHMS176149  PMID: 20122939

Abstract

The RNase H activity of reverse transcriptase is required during retroviral replication and represents a potential target in anti-viral drug therapies. Sequence features flanking a cleavage site influence the three types of retroviral RNase H activity, termed internal, DNA 3′ end-directed, and RNA 5′ end-directed. Using the reverse transcriptases of human immunodeficiency virus, type 1 (HIV-1) and Moloney murine leukemia virus (M-MuLV), this study evaluates how individual base preferences at a cleavage site direct retroviral RNase H specificity. Strong test cleavage sites (designated as between nucleotide positions −1 and +1) for the HIV-1 and M-MuLV enzymes were introduced into model hybrid substrates designed to assay internal or DNA 3′ end-directed cleavage, and base substitutions were tested at specific nucleotide positions. For internal cleavage, positions +1, −2, −4, −5, −10, and −14 for HIV-1 and positions +1, −2, −6, and −7 for M-MuLV significantly affected RNase H cleavage efficiency, while positions −7 and −12 for HIV-1, and positions −4, −9 and −11 for M-MuLV had more modest effects. DNA 3′ end-directed cleavage was influenced substantially by positions +1, −2, −4, and −5 for HIV-1 and positions +1, −2, −6, and −7 for M-MuLV. Cleavage site distance from the recessed end did not affect sequence preferences for M-MuLV reverse transcriptase. Based on the identified sequence preferences, a cleavage site recognized by both the HIV-1 and M-MuLV enzymes was introduced into a sequence that was otherwise resistant to RNase H. The isolated RNase H domain of M-MuLV reverse transcriptase retained sequence preferences at positions +1 and −2, despite prolific cleavage in the absence of the polymerase domain. The sequence preferences of retroviral RNase H likely reflect structural features in the substrate that favor cleavage, and represent a novel specificity determinant to consider in drug design.

Keywords: RNase H, reverse transcription, human immunodeficiency virus, type 1 (HIV-1), Moloney murine leukemia virus (M-MuLV), reverse transcriptase

Introduction

In reverse transcription, the single-stranded RNA genome of a retrovirus is copied into a linear double-stranded DNA that integrates into the host cell chromosome 1, 2. During synthesis, a host cell tRNA is used to initiate minus-strand DNA synthesis, while a sequence in the viral genome termed the polypurine tract (PPT)1 is cleaved and subsequently used as the primer to initiate plus-strand DNA synthesis. Unique sequences at the 5′ and 3′ ends of the viral genome are duplicated through template switches to generate long terminal repeats (LTR) at both ends of the unintegrated viral DNA. The virally-encoded reverse transcriptase contains an RNase H activity as well as an RNA- and DNA-dependent DNA polymerase activity, and both activities are required for viral replication 36. During reverse transcription, reverse transcriptase carries out DNA polymerization, strand transfers, and cleavage of the viral RNA genome once it has been used as a template to produce RNA/DNA hybrids.

The reverse transcriptase of human immunodeficiency virus, type 1 (HIV-1) is a heterodimer containing a 66 kDa (p66) and a 51 kDa (p51) subunit 7. The p66 subunit has an amino terminal DNA polymerase domain that is followed by the connection and RNase H domains, while the p51 subunit is a proteolytic fragment of p66 that lacks the carboxyl terminal RNase H domain. The reverse transcriptase of Moloney murine leukemia virus (M-MuLV) is a 76 kDa monomer that contains an amino terminal DNA polymerase domain, a connection domain, and a carboxyl terminal RNase H domain 8. Crystal structures of the human and murine reverse transcriptases indicate similar nucleic acid binding clefts and a comparable tertiary structure for the RNase H domains 913. In addition, co-crystal structures of the HIV-1 enzyme indicate that the distance between the active sites of the polymerase and RNase H domains is approximately 17–18 nucleotides as measured on a template strand, and that the nucleic acid binding cleft has numerous and varied contacts with the substrate, the majority of which are observed between the polymerase domain and the DNA region proximal to the 3′ primer terminus 11, 12, 14.

The RNase H activity of reverse transcriptase facilitates reverse transcription in several ways 15, 16. RNase H degrades the viral genome in the RNA/DNA hybrids formed during minus-strand synthesis. Genome degradation assists strand transfers and plus-strand synthesis, and represents the majority of RNase H cleavages required during reverse transcription. RNase H carries out a sequence-specific cleavage in the viral genome that generates the PPT primer required for plus-strand initiation. Also with sequence specificity, RNase H removes the tRNA and PPT primers used to initiate minus-strand and plus-strand DNA synthesis, respectively. Correct generation of the PPT primer as well as proper removal of the PPT and tRNA primers create the LTR ends required for integration. Despite these essential roles in reverse transcription, no clinically relevant drugs have yet been developed against the RNase H activity of reverse transcriptase (reviewed in Ref. 17) 17.

Based upon the interactions of reverse transcriptase with the substrate, RNase H cleavages have been classified into three distinct types (for review, see Refs. 15 and 18) 15, 18. Internal RNase H cleavages occur when reverse transcriptase binds an RNA/DNA hybrid without associating with the end of an RNA or DNA strand 1922. Interestingly, internal cleavage sites are not selected at random. Through an analysis of base frequencies found at the nucleotide positions surrounding internal cleavage sites, we showed that the HIV-1 and M-MuLV reverse transcriptases have a statistically significant preference for certain nucleotides at specific positions 23. With the scissile phosphate defined as occurring between the −1 and +1 positions, and strong or moderate preferences indicated by upper case or lower case letters, respectively, HIV-1 reverse transcriptase has nucleotide preferences at positions +1 (A/U), −2 (C/G), −4 (C/g/u), −7 (G/C), −12 (A/g/u), and −14 (A/g). The M-MuLV reverse transcriptase has similar preferences at +1 (A/U) and −2 (C/G), but shows unique preferences at positions −6 (C/G/u) and −11 (A).

The two other types of RNase H cleavage result from the preferential association of the polymerase domain of reverse transcriptase to recessed DNA 3′ or RNA 5′ ends in a hybrid 2427. Both the distance from and the type of recessed end are important determinants for end-directed cleavages. DNA 3′ end-directed cleavages occur ~15 – 20 nucleotides away from the recessed terminus for HIV-1 reverse transcriptase or ~17 – 20 nucleotides away for the M-MuLV enzyme, while RNA 5′ end-directed cleavages are ~13 – 20 nucleotides away from the recessed RNA 5′ end for both enzymes 28. Based on a statistical analysis, sequence is also an important determinant of end-directed cleavages, and DNA 3′ and RNA 5′ end-directed types of cleavage share the same nucleotide preferences at specific positions flanking a cleavage site 29,28. Both the HIV-1 and M-MuLV enzymes have strong preferences at positions +1 (A/U/C) and −2 (C/G), and additionally for the human enzyme, at position −4 (U/C/G).

At present, the relative importance of each preferred position as inferred from the statistical analyses is unknown. It is also possible that some positions that emerged from the statistical analyses reflect a chance association in the particular sequences chosen and were not related to enzyme specificity. To distinguish whether all of the positions identified by the statistical analyses affect cleavage equally or whether some positions are more important than others, we have tested the effects of base substitutions at several nucleotide positions near cleavage sites recognized by the HIV-1 and M-MuLV reverse transcriptases. Our analyses show some positions are more influential than others in promoting cleavage, and identify additional position preferences for each enzyme. Also, we tested if any sequence preferences are displayed by the isolated RNase H domain of M-MuLV, which retains enzymatic activity but is unable to perform sequence-specific cleavages like PPT primer generation or removal 30, 31. These data offer insights into the sequence preferences that influence general genome degradation by the HIV-1 and M-MuLV reverse transcriptases, and may suggest new opportunities for targeting drugs specific to the retroviral RNases H.

Results

To examine base preferences at individual positions flanking an RNase H cleavage site recognized by HIV-1 or M-MuLV reverse transcriptase, it was important to identify a strong and relatively isolated cleavage site that was recognized by each enzyme. From previous analyses of strong internal cleavage sites, we chose one RNA sequence with a strong site for the HIV-1 reverse transcriptase located 3 nucleotides downstream of a strong site for the M-MuLV reverse transcriptase 23, and generated the 49-mer RNA termed Rmzc that contained both sites (see Materials and Methods). From the 5′ end of Rmzc, the HIV-1 site is located between nucleotides 33 and 34 (site 33/34) and the M-MuLV site is between nucleotides 30 and 31 (site 30/31) (Fig. 1A and 4A). The convenient proximity of these sites allowed a single base substitution in Rmzc to test different nucleotide positions relative to each cleavage site. For example, changing the base at position −5 for site 33/34 also changes position −2 for site 30/31.

Fig. 1.

Fig. 1

Model substrates to analyze internal and DNA 3′ end-directed cleavage by HIV-1 reverse transcriptase. A. The sequence of 49-mer RNA Rmzc is shown from 5′ to 3′ and site 33/34, the strong cleavage site recognized by HIV-1 reverse transcriptase, is indicated with an arrow. For site 33/34, the positions of preferred nucleotides as determined from an earlier statistical analysis are indicated by the numbers in shaded boxes. B. Cleavage of site 33/34 in Rmzc hybrids designed to test internal (left panel) or DNA 3′ end-directed (right panel) cleavage. Above each panel is a schematic of the hybrid substrate containing Rmzc (gray) and a DNA strand (black). HIV-1 reverse transcriptase was incubated with hybrids containing 5′ end-labeled (5′ label; lanes 1–5, 11–15) or 3′ end-labeled Rmzc (3′ label; lanes 6–10, 16–20) in time course assays. Samples were either left untreated (0 min) or removed at 0.25, 1, 4, or 16 min, analyzed in denaturing 20% acrylamide gels, and visualized using a PhosphorImager. The positions of Rmzc and the products resulting from cleavage at site 33/34 are indicated with arrows, and the sizes of cleavage products are indicated in nucleotides.

Fig. 4.

Fig. 4

Model substrates to analyze internal and DNA 3′ end-directed cleavage by M-MuLV reverse transcriptase. A. The sequence of Rmzc is shown from 5′ to 3′ and site 30/31, the strong cleavage site recognized by M-MuLV reverse transcriptase, is indicated just upstream of site 33/34 (used for HIV-1 RNase H assays; see Fig. 1). The positions of preferred nucleotides as determined previously are indicated by the numbers in open boxes. B. Cleavage of site 30/31 in Rmzc hybrids designed to test internal (left panel) or DNA 3′ end-directed (right panel) cleavage. The hybrid substrates are shown above each panel and configured as described in Fig. 1B. Hybrids containing Rmzc labeled at the 5′ (lanes 1–5, 11–15) or 3′ (lanes 6–10, 16–20) ends were incubated with M-MuLV reverse transcriptase in time course assays, and samples were analyzed as described in Fig. 1B. The positions of Rmzc and the 30-mer product resulting from cleavage at site 30/31 are indicated with arrows and the sizes of cleavage products are indicated in nucleotides.

Substitutions that affect HIV-1 RNase H cleavage

The sequence surrounding site 33/34 in Rmzc matched the preferred nucleotides previously identified at positions +1, −2, −4, −7, −12, and −14 for internal cleavage by HIV-1 RNase H (Fig. 1A) 23. By choosing the appropriate length of a complementary DNA strand, hybrid substrates were created to promote internal or DNA 3′ end-directed cleavage at site 33/34 (Fig. 1B, top illustrations). In the internal cleavage hybrid, site 33/34 was located 33 nucleotides from the RNA 5′ end, while in the DNA 3′ end-directed cleavage hybrid, site 33/34 was positioned 18 nucleotides from the recessed DNA 3′ end.

To evaluate cleavage of site 33/34 in the internal and DNA 3′ end-directed cleavage substrates, both 5′ and 3′ end-labeled Rmzc RNAs were used. When the internal cleavage substrate containing 5′ end-labeled Rmzc was incubated with HIV-1 reverse transcriptase, internal cleavage at site 33/34 was exceptionally strong as the amount of 33-mer product was similar to that of the 14 to 17-mer products generated by RNA 5′ end-directed cleavages in the 0.25 and 1 min time points (Fig. 1B, compare lanes 1–5). At the later time points, additional cleavage at site 33/34 was masked by cleavages proximal to the labeled 5′ end. When the internal cleavage substrate contained 3′ end-labeled Rmzc, exceptionally strong cleavage at site 33/34 was again evident (Fig. 1B, lanes 6–10). Using the DNA 3′ end-directed cleavage substrate with 5′ end-labeled or 3′ end-labeled Rmzc, cleavage at site 33/34 primarily generated the 18-mer product in the 0.25 and 1 min time points (Fig. 1B, lanes 11–13 and 16–18, respectively). In the following experiments that test the effects of sequence changes on cleavage, the 1 min time points were used as an indicator of the initial cleavage events.

In the internal cleavage assay, changing the preferred +1A to a disfavored G significantly decreased cleavage (Fig. 2A, compare lanes 2 and 4). Similarly, changing the −2C to U or the −4U to A substantially decreased cleavage (Fig. 2A, lanes 6 and 10). While changing the −7C to U or −14A to C decreased cleavage, changing the −12G to C did not seem to affect internal cleavage (Fig. 2A, lanes 14, 18, and 16). Combining −7U or −12C with −14C decreased cleavage slightly more than the −14C substitution alone (Fig. 2A, compare lanes 22 and 20 with lane 18). Substitutions were also tested at some positions not previously associated with nucleotide preferences by the statistical tests. While a −3G did not alter cleavage, a −5U decreased internal cleavage (Fig. 2A, lanes 8 and 12). Also, replacing the −9U with A or G seemed to slightly reduce cleavage (Fig. 2A, lanes 24 and 26). To better evaluate the effects of these substitutions, identical time course assays were repeated in multiple experiments and the amount of the 33-mer product observed in the 1 min time points was quantified (Fig. 2B). The analyses showed that internal cleavage at site 33/34 was reduced by over 60% with +1G or −14C, by ~ 70% for −5U, and over 90% for −2U or −4A. It was also evident that the substitutions of −7U and −12C were comparable to the −9A and −9G changes that had no substantial effect on internal cleavage. However, the reduction in cleavage by −14C was greater when combined with −7U or −12C.

Fig. 2.

Fig. 2

Effects of nucleotide substitutions on internal RNase H cleavage by HIV-1 reverse transcriptase. A. 5′ end-labeled Rmzc or Rmzc RNAs with the indicated substitutions relative to site 33/34 were used to generate internal cleavage substrates as shown in Fig. 1B. The position(s) and change(s) introduced into each substrate relative to site 33/34 are indicated above each lane by the position number and base. These substrates were incubated with HIV-1 reverse transcriptase in time course experiments and the 1 min time points (even numbered lanes) or the untreated substrates (odd numbered lanes) were analyzed as described in Fig. 1B. The relevant portion of the resulting gel image is shown and filled circles indicate the position of the 33-mer product resulting from cleavage at site 33/34. B. Bar graph depicting the total amount of 33-mer product (% of total) generated by internal cleavage at site 33/34 using 1 min time point data from 5 separate experiments with the substrates described in A. C. Bar graphs depicting the total amount of 33-mer product (% of total) generated by internal cleavage at site 33/34 using substrates comparing cleavage of Rmzc (leftmost bar in each graph) with all three nucleotide substitutions at the indicated positions. Data from 1 min time points generated in 5 experiments are shown for all positions. For B and C, error bars represent plus and minus one standard deviation.

To further evaluate the contributions of select nucleotide positions to internal cleavage, the effects of all three base substitutions at a given position were compared (Fig. 2C). Consistent with the strong cleavage observed at site 33/34, the original Rmzc sequence typically generated the highest levels of internal cleavage (left bar in each graph). Substitutions at position −2 indicated that U and A were deleterious while C and G were very favorable. Substitutions at position −4 showed that all bases but A favored cleavage. This analysis also revealed new strong preferences at position −5 (G > A > C/U) and position −10 (A/U > G/C), and only slight preferences at positions −3 (favors G) and −9 (disfavors C).

To test how base substitutions affected DNA 3′ end-directed cleavage at site 33/34, substrates for DNA 3′ end-directed cleavage were used in RNase H assays with HIV-1 reverse transcriptase. As compared to the original Rmzc sequence, DNA 3′ end-directed cleavage was reduced by the substitutions +1G, −2U, −4A, and −5U (Fig. 3A, compare lanes 2, 4, 6, 8 and 10). The substitution −7U appeared to slightly reduce cleavage, and single or even double substitutions located further from site 33/34 did not decrease cleavage significantly (Fig. 3A, lanes 12, 14, 16, 18, and 20). DNA 3′ end-directed cleavage was substantially reduced when disfavored bases were combined at positions +1 and −2 (Fig. 3A, lane 24), and eliminated by additionally changing the −4 or −5 positions (Fig. 3A, lanes 26 and 28, respectively). Quantitation of results from repeated experiments showed that the individual substitutions at positions +1, −2, −4, or −5, and the combined substitutions −12C, −14C reduced DNA 3′ end-directed cleavage by ~50% or more (Fig. 3B). A slight reduction in cleavage occurred with the single substitutions at −7, −12, or −14, and the combined −7U, −14C substitution. DNA 3′ end-directed cleavages were decreased over 90% by the combined +1G, −2U change, and barely detectable with the additional −4A or −5U changes.

Fig. 3.

Fig. 3

Effects of nucleotide substitutions on DNA 3′ end-directed RNase H cleavage by HIV-1 reverse transcriptase. A. DNA 3′ end-directed cleavage substrates as shown in Fig. 1B were made using 5′ end-labeled Rmzc or Rmzc RNAs with the indicated nucleotide substitutions, and the resulting substrates were analyzed as described in Fig. 2A. B. A bar graph depicting the total amount of 33-mer product (% of total) generated by DNA 3′ end-directed cleavage at site 33/34 from 1 min time points generated in 5 separate experiments using the substrates shown in A. C. Bar graphs depicting the total amount of 33-mer product (% of total) generated by DNA 3′ end-directed cleavage at site 33/34 using substrates comparing cleavage of Rmzc (left) with substrates containing the three other nucleotides at the indicated positions. Data represent the 1 min time points generated in 5 experiments (positions −2, −9, −10) or 4 experiments (positions −3, −4, −5).

All three base substitutions were carried out at positions −2, −3, −4, −5, −9, and −10 using the DNA 3′ end-directed cleavage substrates (Fig. 3C). Cleavage at site 33/34 was highest in hybrids containing the original Rmzc sequence and, in general, the results were similar to those observed for internal cleavage (compare Figs. 2C and 3C). Notably, at position −4, a G was favored over C in DNA 3′ end-directed cleavage while the converse was true for internal cleavage. Consistent with the base preferences observed earlier, none of the substitutions at positions −9 or −10 significantly affected DNA 3′ end-directed cleavage.

Substitutions that affect M-MuLV RNase H cleavage

Site 30/31 in the RNA Rmzc is 3 bases upstream of site 33/34 recognized by the HIV-1 RNase H, and matches the preferred nucleotides previously identified at positions +1, −2, −6, and − 11 for internal cleavage by M-MuLV RNase H (Fig. 4A) 23. When an internal cleavage substrate containing 5′ end-labeled Rmzc was treated with M-MuLV reverse transcriptase, the amount of 30-mer product resulting from cleavage at site 30/31 was only slightly less at the 0.25 and 1 min time points than those generated by RNA 5′ end-directed cleavages (15 to 17-mer products) (Fig. 4B, left panel, lanes 1–5). Results with the substrate containing 3′ end-labeled Rmzc confirmed that site 30/31 was a strong and relatively isolated internal cleavage site (Fig. 4B, lanes 6–10). When positioned 18 nucleotides from the DNA 3′ primer terminus in the DNA 3′ end-directed cleavage substrate, site 30/31 was cleaved exceptionally well with either end-labeled substrate (Fig. 4B, right panel, lanes 11–15 and 16–20).

Next, several substitutions were introduced at positions around site 30/31 and tested using internal cleavage substrates. Consistent with nucleotide preferences previously identified for positions +1 (A/U), −2 (C/G), and −6 (C/G/u), internal cleavage at site 30/31 was reduced ~80–90% by the introduction of +1G, −2U, or −6A, but unaffected by the −6G change (Fig. 5A, compare lanes 2, 8, 12, 24, and 26; Fig. 5B). Changing the preferred −11A to C had a moderate effect, reducing cleavage by only ~30% (Fig. 5B). Substitutions at the non-preferred positions +4, +2, −1, and −4 did not significantly affect cleavage (Fig. 5A, lanes 4, 6, 10, 14; Fig. 5B). As compared to the original Rmzc sequence, the −9C change reduced cleavage almost equal to that of the −11C substitution, and the combined −9C, −11C changes reduced cleavage at site 30/31 by ~50% (Fig. 5B).

Fig. 5.

Fig. 5

Effects of nucleotide substitutions on internal RNase H cleavage by M-MuLV reverse transcriptase. A. 5′ end-labeled Rmzc or Rmzc RNAs with the indicated substitutions relative to site 30/31 were used to generate internal cleavage substrates as described in Fig. 2. The position(s) and change(s) introduced into each substrate relative to site 30/31 are indicated above each lane by the position number and base. These substrates were incubated with M-MuLV reverse transcriptase in time course experiments and the 0.25 min time points (even numbered lanes) or the untreated substrates (odd numbered lanes) were analyzed. Filled circles indicate the position of the 30-mer product resulting from cleavage at the 30/31 site. B. A bar graph depicting the amount of 30-mer product (% of total) generated by internal cleavage at site 30/31 from 0.25 min time points generated in 5 separate experiments using the substrates described in A. C. Bar graphs depicting the total amount of 30-mer product (% of total) generated by internal cleavage at site 30/31 using substrates comparing cleavage of Rmzc (left) with substrates containing the indicated substitutions. Data represent the 0.25 min time points generated in 5 experiments.

All three base substitutions were tested at several positions near site 30/31, including the preferred positions +1, −2, and −6 (Fig. 5C). Consistent with previously identified base preferences, internal cleavage was favored by A and U at position +1, decreased by U at position −2, and substantially reduced by A at position −6. Interestingly, a new preference was observed at position −7, where A was significantly favored cleavage over the other base choices. Also, a C was slightly preferred at position +2, and no bases were favored at position −1.

Similar assays with M-MuLV reverse transcriptase were used to evaluate how base substitutions near site 30/31 in Rmzc influenced DNA 3′ end-directed cleavage. In the course of this analysis, we also asked if the sequence preferences that promote cleavage were affected by the distance of a cleavage site from the DNA 3′ end by positioning site 30/31 at both 18 nucleotides and 15 nucleotides from the recessed 3′ end (Fig. 6). All three base substitutions were carried out at positions +1, −1, −2, −6, and −7. For the 18–nt substrate, strong preferences were observed at positions +1 (A/U), −2 (A/C/G), −6 (C/G/U), and −7 (A) but substitutions at position −1 were not significantly different from the control (Fig. 6A). Due to the optimal positioning of site 30/31 in the cleavage window 28, site 30/31 was cleaved much more efficiently in nearly all of the 18-nt substrates as compared to the 15-nt substrates (compare Figs. 6A and 6B). Most importantly, the base preferences for the 18-nt and 15-nt substrates were essentially identical, and were very similar to those observed for the internal cleavage substrates (compare Fig. 6 with Fig. 5C).

Fig. 6.

Fig. 6

Effects of nucleotide substitutions on DNA 3′ end-directed RNase H cleavage by M-MuLV reverse transcriptase. A. DNA 3′ end-directed cleavage substrates that positioned site 30/31 at 18 nucleotides from the recessed DNA 3′ end (18-nt substrate) were made using Rmzc or Rmzc RNAs with the indicated nucleotide substitutions, and the resulting substrates were analyzed as described in Fig. 3. The bar graphs depict the amount of 30-mer product (% of total) generated by DNA 3′ end-directed cleavage at site 30/31 as described in Fig. 5C. Data represent the 0.25 min time points generated in 5 experiments (positions −1, −6, −7), 4 experiments (position +1), or 3 experiments (positions −2). B. DNA 3′ end-directed cleavage substrates that positioned site 30/31 at 15 nucleotides from the recessed DNA 3′ end (15-nt substrate). Graphs are as described for A.

Creation of a site recognized by internal and DNA 3′ end-directed cleavage

We next asked if a cleavage site could be introduced into a sequence that was resistant to RNase H activity. We were especially curious to test the newly identified preferences of the HIV-1 enzyme at position −5 (G/a) and the M-MuLV enzyme at position −7 (A). An AU-rich sequence that is not cleaved by either the HIV-1 or the M-MuLV RNase H is found in the region upstream of the −1/+1 cleavage site that generates the PPT primer in the M-MuLV genome 21, 23, 32. This sequence was used to generate RNA R–54/–14, where the proposed Site T (for Target site) was within the resistant region between the 34th and 35th nucleotides from the RNA 5′ end (Fig. 7A). For Site T, preferred bases were already present at positions +1, −2 and −11 for M-MuLV, and at positions +1, −2, −12, and −14 for HIV-1, but disfavored bases were present at positions −4 through −7 for both enzymes. A variety of base substitutions were introduced at positions −4 to −7 and cleavage was assayed using internal and DNA 3′ end-directed cleavage substrates.

Fig. 7.

Fig. 7

Creation of Site T recognized by the HIV-1 and M-MuLV RNases H. A. The sequence of RNA R–54/–14 is shown with the targeted location of a new cleavage site (Site T for target site) in an RNase H resistant sequence. Site T has preferred nucleotides for HIV-1 at positions +1, −2, −12, and −14 (shadowed boxes) and for M-MuLV at positions +1, −2, and −11 (open boxes) but has disfavored nucleotides at positions −4 through −7. B. Internal cleavage substrates containing R–54/–14 or substituted R–54/–14 RNAs were analyzed in time course assays, and analyzed as described in Fig. 1. Graphs show the total amount of 34-mer product (% of total) generated by internal cleavage at Site T using representative data from 1 min time points for HIV-1 and 0.25 min time points for M-MuLV reverse transcriptase. C. DNA 3′ end-directed cleavage substrates containing R–54/–14 or substituted R–54/–14 RNAs with the DNA 3′ primer terminus located at 18 nucleotides from Site T were analyzed and the data are shown as described for B. For B and C, the data shown are from representative experiments.

Internal cleavage substrates containing RNA R–54/–14 or RNAs with the indicated substitutions were treated with HIV-1 reverse transcriptase, and the 1 min time points from time course assays were analyzed (Fig. 7B, HIV-1). A single substitution of −4A to C increased internal cleavage by 14-fold at Site T, while the combined −4C, −5G increased internal cleavage over 100-fold. Also for HIV-1, the combined −4C, −7C or −4C, −6G, −7C increased cleavage by 20-fold or 40-fold, respectively, but other changes had little or no effect on internal cleavage. For M-MuLV, internal cleavage at Site T increased by 7-fold with the combined −6G, −7A changes, but only two-fold or lower effects were observed with other substitutions.

These substitutions were also tested in a DNA 3′ end-directed cleavage assay with Site T positioned at 18 nucleotides from the DNA 3′ end (Fig. 7C). Like for internal cleavage, all changes that included the −4C increased DNA 3′ end-directed cleavage for HIV-1 reverse transcriptase by 8 to 9-fold, and a −6G caused a 2-fold increase. For M-MuLV reverse transcriptase, cleavage was increased 3 to 4-fold by all changes that included the −6G, and 2-fold by the combined −4C, −7C. Finally, the low but detectable level of DNA 3′ end-directed cleavage observed at Site T for both enzymes was eliminated by the disfavored combined +1G, −2A changes.

Nucleotide preferences of the isolated M-MuLV RNase H domain

In an earlier study, we showed that the isolated RNase H (RNH) domain derived from the M-MuLV reverse transcriptase has enzymatic activity but was unable to generate the PPT primer or remove the extended PPT and tRNA primers 30. Here, we asked if the isolated RNH domain retained any of the sequence preferences involved in cleavage site recognition.

First, multiple cleavage sites recognized by the isolated RNH domain were mapped. Five RNAs derived both from viral sequences and from in vitro transcription plasmids were used to generate RNA/DNA hybrids that were treated with the isolated RNH domain or the intact M-MuLV reverse transcriptase (see Fig. 9A and Materials and Methods). Using the substrate containing RNA T7/+17, the isolated RNH domain cleaved at multiple internal sites over the entire length of the RNA (Fig. 8A, lanes 5–8, cleavage products indicated by filled circles). Some products were clearly more abundant than others, suggesting some differences in cleavage preference by the isolated RNH domain. In contrast, the intact M-MuLV reverse transcriptase preferentially cleaved sites (positions 16–20) found within the RNA 5′ end-directed cleavage window and recognized only a few internal sites (Fig. 8A, lanes 1–4). Similar results were observed using substrates that contained RNAs PPT62/HET and hRppt57, as sites recognized by the RNH domain were abundant and fairly evenly distributed while those recognized by the M-MuLV reverse transcriptase were much less frequent (Fig. 8, B and C).

Fig. 9.

Fig. 9

Analysis of cleavage sites recognized by the RNH domain of M-MuLV reverse transcriptase. A. The sequences of the RNAs used in the hybrids of Figure 8 are shown with the cleavage sites recognized by the M-MuLV reverse transcriptase (arrows) and the RNH domain (filled circles). The RNA 5′ end-directed cleavage window is indicated by a shadowed box that falls from positions 13 through 20 in the aligned RNA 5′ ends. B. Determination of the base preferences at nucleotide positions surrounding the RNH cleavage sites. The chi-square values of the nucleotide distribution from positions −13 to +4 were determined by comparison to overall base frequencies and plotted as a function of nucleotide position relative to the scissile phosphate located between positions −1 and +1. This analysis is based on a total of 88 cleavage sites recognized by the isolated RNH domain. The chi-square value of 11.34, which has a p value of 0.01, is indicated by a dashed line. Nucleotide positions with scores greater than 11.34 are considered significant. The preferred nucleotides for each significant position are indicated above the corresponding bar in uppercase (strongly preferred) letters. Counting from the RNA 5′ end, the position of each cleavage site is indicated below the RNA sequence by the 5′ and 3′ nucleotides positions that border the scissile phosphate.

Fig. 8.

Fig. 8

Comparison of cleavage sites recognized by the isolated M-MuLV RNase H domain versus the M-MuLV reverse transcriptase. Substrates containing the indicated 5′ end-labeled RNAs were incubated with M-MuLV reverse transcriptase (M-MuLV RT) or the isolated RNase H domain, RNH, for the indicated times, and samples were analyzed as described in Fig. 1. As markers, single-strand 5′ end-labeled RNAs were treated as indicated with nuclease P1 (P1), sodium hydroxide (NaOH), or ribonuclease T1 (T1) to enable a correlation of the cleavage sites with the RNA sequence. Numbers on the side indicate the size in nucleotides of products generated by M-MuLV RT cleavages, filled circles indicate products resulting from RNH cleavage, and asterisks indicate exceptionally strong cleavage sites that are discussed in the text. Representative experiments are shown with substrates containing RNAs T7/+17 (A), PPT62/HET (B), and hRppt57 (C).

The isolated RNH domain strongly recognized some sites (Fig. 8, indicated with asterisks). One such site was between the 9th and 10th nucleotides from the RNA 5′ end of the PPT62/HET and hRppt57 RNAs, which share the same first 10 nt of 5′ sequence due to in vitro transcription (Fig. 8B, lanes 6–8 and 8C, lanes 2–5). We interpret this cleavage to represent a sequence preference at this site rather than an end-directed cleavage preference since no exceptionally strong cleavage sites were observed at the 5′ ends of other RNAs used in this analysis (Fig. 8A, T7/+17, asterisks, and data not shown). Similar examples of equally strong cleavage sites were found between the 22nd and 23rd nucleotides and the 49th and 50th nucleotides in PPT62/HET (Fig. 8B, lanes 2–5).

In a comparison of the cleavage sites recognized by the M-MuLV reverse transcriptase versus the isolated RNH domain, the intact enzyme recognized far fewer sites, and half of these were clustered in the RNA 5′ end-directed cleavage window (Fig. 9A). In contrast, the RNH domain cleaved at approximately four times as many positions and the sites were distributed more evenly throughout the RNAs. Notably, the isolated RNH domain recognized all sites cleaved by the intact M-MuLV reverse transcriptase, with the exception of sites 19/20 and 23/24 in the substrate containing RNA hRppt57. To determine if the isolated RNH domain displayed any base preferences near a cleavage site, the sequences from positions −13 to +4 for 88 cleavage sites in several RNAs, including those in Fig. 8, were aligned (see Materials and Methods). To determine the significance of any deviations from random nucleotide frequencies, the base distribution at each position was compared with the expected base distributions using the chi-square method, and the resulting chi-square values were plotted as a function of nucleotide positions (Fig. 9B). For the isolated RNH domain, two nucleotide positions showed strong deviations from random with p values less than 0.01. The first was position +1, which showed a strong preference for A or U, and the second was position −2, which strongly disfavored U.

Discussion

In previous work with the HIV-1 and M-MuLV reverse transcriptases, a statistical analysis of sequence alignments from RNase H cleavage sites revealed that internal and end-directed cleavage sites were flanked by base preferences at specific positions 23, 29. For an internal cleavage site, HIV-1 RNase H had 6 positions with nucleotide preferences, while M-MuLV had 4 positions. However, this analysis could not discern the relative contribution of any single position to cleavage and additional preferences could exist that were not revealed by the statistical study as a result of the particular distribution of bases in the chosen sequences. In the present study, a hybrid substrate containing a strong, isolated cleavage site recognized by each enzyme (site 30/31 for M-MuLV; site 33/34 for HIV-1) was used to ask if the preferred positions around a cleavage site are equally important, or if the preferred positions have distinct effects on the extent of cleavage. Our results indicate that the nucleotide preferences at most positions contribute individually to cleavage site recognition, and that there are large variations in the extent that each position affects cleavage. Some positions have very pronounced effects on cleavage, while other positions have moderate or slight effects. These observations are summarized in Figure 10 and discussed below.

Fig. 10.

Fig. 10

Summary of base preferences at nucleotide positions surrounding an RNase H cleavage site recognized by HIV-1 or M-MuLV reverse transcriptase. The scissile phosphate (indicated by an arrow) is designated between the −1 and +1 positions below an RNA strand (gray) that would be in an RNA/DNA hybrid. The nucleotide positions that have a significant (black box) or moderate (gray box) effect on the efficiency of internal cleavage by HIV-1 (A) or M-MuLV (B) RNase H are shown with the positions numbered (above) and the base preferences (below) indicated. Positions important for DNA 3′ end-directed cleavage are indicated with an asterisk. Upper or lower case letters indicate a strong or moderate preference, respectively, for specific nucleotides at the indicated positions.

Many of the preferred positions have a substantial and independent effect on retroviral RNase H cleavage (Fig. 10, black boxes). For HIV-1 RNase H, the previously identified preferences at positions +1 (A/U), −2 (C/G), −4 (C/G/U), and −14 (A/g) and newly identified preferences at positions −5 (G/a) and −10 (A/U) significantly affected internal cleavage (Fig. 10). The introduction of a −2U, −2A, or −4A had particularly negative consequences on cleavage. The newly identified preferences at positions −5 and −10 were not observed in our earlier studies, perhaps because the nucleotides favored at these positions relative to the cleavage sites were not present at high enough frequencies in the RNA sequences used. For M-MuLV RNase H, three of the previously identified positions +1 (A/U/c), −2 (A/C/G), and −6 (C/G/U), and a newly identified preference at position −7 (A) significantly affected internal cleavage. The loss of an A at position −7, or the introduction of a −2U or a −6A were especially deleterious for M-MuLV RNase H cleavage. Thus many preferred positions around an RNase H cleavage site can individually have a large effect on the cleavage efficiency for both the HIV-1 and M-MuLV enzymes.

Another class of preferred positions had only a moderate effect on cleavage when tested alone, but a more substantial effect when combined with a second change (Fig. 10, gray boxes). For HIV-1 RNase H, positions −7 (C/G) and −12 (A/G/u) moderately influenced internal cleavage when a disfavored nucleotide was introduced, but cleavage was substantially reduced with the combined changes of −7U, −14C or −12C, −14C. Similarly for M-MuLV RNase H, only a moderate reduction in cleavage was observed when a disfavored C was introduced at the preferred position −11. This moderate effect was somewhat surprising because the preference for an A at position −11 had the highest statistical significance of any of the preferred positions previously shown for M-MuLV internal cleavage 23. Also for M-MuLV, position −9 (G) was not identified earlier as a preferred position, but the −9C change reduced cleavage to a level comparable to that of the −11C change, and the combined changes −9C, −11C decreased cleavage more than either change independently. Another position with a moderate effect on cleavage for M-MuLV was position −4, since a −4U slightly increased cleavage, and when combined with a −11C, generated a cleavage level that was intermediate to either change alone. These observations suggest that positions −4, −9, and −11 for M-MuLV and positions −7 and −12 for HIV-1 have moderate effects on RNase H cleavage, and when combined with other weak or moderate preferences, the effects are additive.

Finally, there were some positions that had very modest effects on cleavage. Internal cleavage by HIV-1 RNase H had slight preferences at positions −3 (favors G) and −9 (disfavors C), while that of M-MuLV showed a slight preference at position +2 (favors C). Most likely, the bases favored at these positions contribute to an optimal combination for cleavage that is unique to the test cleavage sites. These slightly-favored bases are not indicated in Fig. 10, and further experiments are required to establish whether such subtle effects are important in cleavage preferences.

DNA 3′ end-directed cleavage was significantly affected by positions +1, −2, −4, and −5 for HIV-1 and by positions +1, −2, −6, and −7 for M-MuLV (Fig. 10, asterisks). While the total amount of product generated by DNA 3′ end-directed cleavage was significantly greater than by internal cleavage, most base preferences were the same for internal and DNA 3′ end-directed cleavage, and the effects of base substitutions were proportionally equivalent. For example, substitutions at position −2 affected both types of cleavage very similarly for HIV-1 and M-MuLV RNase H. Also, most single substitutions at positions more distal from the cleavage site did not significantly influence DNA 3′ end-directed cleavage. An interesting exception for HIV-1 was the single −14C change, which slightly reduced DNA 3′ end-directed cleavage, and when −14C was combined with a −7U or a −12C, end-directed cleavage was reduced further. This observation suggests that while one disfavored nucleotide at a position distal from the cleavage site appears unlikely to affect the DNA 3′ end-directed cleavage efficiency, two or more disfavored nucleotides can significantly affect such cleavage.

Previous studies have shown that the distance between the recessed DNA 3′ end and the cleavage site is also an important determinant of how efficiently a site is cleaved 26, 3336. DNA 3′ end-directed cleavage is greatest when this distance is ~15–20 nucleotides for HIV-1 RNase H and ~17–20 nucleotides for M-MuLV RNase H 28. In this study, we tested whether distance from the recessed primer terminus might influence the base preferences at several positions for M-MuLV RNase H. Despite severely reducing the cleavage efficiency by positioning the cleavage site at 15 nucleotides from the DNA 3′ end, the base preferences for cleavage were essentially the same as when the cleavage site was positioned at 18 nucleotides. We have observed similar results with other sites located at different distances from the recessed DNA end for both HIV-1 and M-MuLV RNases H (data not shown). These observations support the conclusion that the distance and sequence preferences of reverse transcriptase are independent determinants of RNase H cleavage.

In the co-crystal structure of HIV-1 reverse transcriptase with a PPT-containing RNA/DNA hybrid, contacts with the substrate appear to cluster in three regions of the protein 12. First, there are residues near the RNase H active site that contact the RNA strand from positions +2 to −2, including base-specific contacts between Gln475 and the −2G in the substrate, and between Arg448 and the +1A in the substrate. Second, there are multiple hydrogen-bonding contacts between the phosphate backbone in the DNA primer strand that is opposite to RNA positions −4 to −9, and a region of reverse transcriptase found close to the RNase H active site. This region has been termed the RNase H primer grip and is found in both the HIV-1 and M-MuLV RNases H 12, 37. The RNase H primer grip influences substrate binding and positioning, and point mutations in this region of HIV-1 RNase H can decrease RNase H activity and alter recognition of the PPT 3842. Third, residues closer to the polymerase active site have multiple contacts (including base-specific interactions) with both strands of the substrate beginning at position −11 and continuing further upstream on the RNA strand relative to the potential cleavage site. For the preferred positions that promote RNase H cleavage, positions +1 and −2 would be near the RNase H active site, positions −4 through −7 would be in the RNase H primer grip region, and the position −11 for M-MuLV and positions −12 and −14 for HIV-1 would be near the polymerase active site. At present, it remains ambiguous where positions −9 or −10 would interact with either the M-MuLV or HIV-1 reverse transcriptase.

While the isolated RNase H domain of HIV-1 is inactive 43, 44, the isolated M-MuLV RNase H domain retains enzymatic activity but is unable to carry out specific cleavages such as PPT primer generation or the removal of the extended tRNA or PPT primers 30, 31, 45. Here we tested if the isolated M-MuLV RNH domain cleaved randomly in an RNA/DNA hybrid or exhibited any sequence preferences near a cleavage site. After mapping multiple cleavage sites recognized by the M-MuLV RNH domain, an analysis of the base frequencies at the flanking nucleotide positions showed that the preferences of the full length enzyme at positions −2 (A/C/G) and +1 (A/U) were retained. Thus the isolated RNase H domain cleaves sites with limited selectivity based upon base preferences at positions −2 and +1. From the HIV-1 co-crystal structure 12, these preferences likely reflect base-specific contacts with residues near the RNase H active site. The lack of preferences for nucleotides at the more distal positions is consistent with the absence of the connection and polymerase domains in RNH.

It is informative to consider whether the sequence preferences identified for cleavage sites recognized by the retroviral RNases H extend to other members of the family of RNases H. Unlike retroviral reverse transcriptases, the prokaryotic and most eukaryotic RNases H typically lack long N-terminal or C-terminal extensions associated with the RNase H domain, and so far no sequence specificities have been identified near cleavage sites 46, 47. Thus, while interactions between the substrate and the protein in the vicinity of the active site can result in some sequence preferences for retroviral RNase H cleavage, the non-retroviral enzymes apparently lack this level of specificity. Higher eukaryotic type I RNases H typically have a short N-terminal hybrid binding domain (HBD) that promotes binding of hybrids in a non-sequence specific manner and is associated with enhancing the specific activity and processivity of the enzyme 47. Recent co-crystal structures of the HBD from human RNase H1 complexed with 12 base pair RNA/DNA hybrids have revealed that the HBD interacts with the backbones of the hybrid substrate along the minor groove, contacting 2′-OH groups on the RNA strand and both phosphate groups and deoxyribose moieties on the DNA strand 48. These non-specific contacts account for the enhancement of RNase H1 activity by the HBD and are consistent with the lack of any sequence specificity at cleavage sites. Thus sequence preferences at specific nucleotide positions around a cleavage site may be a determinant that is unique to the retroviral RNases H.

We were able to create a cleavage site in an RNase H-resistant 17 nt sequence that contained two stretches of (A)4 and one of (U)4 (Site T, Fig. 7). An r(A):d(T) tract of 4 or more bases has distinct structural features, including a straight trajectory and a narrow minor groove 49, and may influence the specificity of the PPT cleavage by RNase H 12. In the RNase H-resistant sequence, substitutions were introduced in the middle (A)4 tract, which corresponded to positions −4 through −7 relative to the target Site T. Interestingly, very few changes were needed to promote cleavage at Site T. Only a single change of −4C for HIV-1 or a −6G for M-MuLV substantially increased DNA 3′ end-directed cleavage, and internal cleavage required two changes for each enzyme, a −4C, −5C for HIV-1 and a −6G, −7A for M-MuLV. Our findings are consistent with the possibility that cleavage efficiency is affected by the substrate structure as determined by sequence. Also, the minimal changes required to introduce cleavage at a site within the RNase H resistant sequence are consistent with the base preferences presented in Fig. 10.

Reverse transcriptase is a crucial target in anti-retroviral therapy of HIV-1 infected patients. While the vast majority of clinical antivirals are currently specific for the polymerase domain, the RNase H activity of reverse transcriptase represents an excellent target for anti-viral therapy as well 17. Importantly, an anti-RNase H drug must be selectively directed against the viral activity without interfering with that of the endogenous human RNase H1. From the sequence preferences involved in cleavage site selection by the retroviral RNases H, there may be several ways to interfere with proper viral RNase H activity. For example, drugs that interfere with the interactions between the HIV-1 enzyme and positions −4 to −7 or further upstream at positions −10 and −14 positions would be predicted to substantially interfere with retroviral RNase H activity but would be unlikely to affect the endogenous RNase H1.

Materials and Methods

Enzymes and Reagents

DNA oligonucleotides were purchased from Eurofins MWG Operon, Invitrogen, and Bioneer, Inc. RNAs were made by in vitro transcription with the MEGAshortscript Kit from Ambion. Recombinant HIV-1 and M-MuLV reverse transcriptases were purchased from Worthington Biochemicals and Amersham Pharmacia Biotech, respectively. The recombinant RNase H domain of M-MuLV reverse transcriptase was prepared as described previously 30. T7 DNA polymerase and calf intestinal alkaline phosphatase were obtained from USB/Affymetrix. To generate RNA ladders for mapping cleavage sites, nuclease P1 (cleaves at all positions) and RNase T1 (cleaves 3′ of G positions) were obtained from New England Biolabs and BRL (now Invitrogen), respectively. Other DNA modifying and restriction enzymes were purchased from New England Biolabs.

Preparation of RNAs

RNAs Rmzc and R–54/–14 were prepared by in vitro transcription of annealed DNAs as described previously 29,28. For RNA Rmzc, a template DNA (5′-AGGTGGGGTCTTTCATTGTACCGAATTCCTCGAGTCTAGCAATTCGCCCTATAGTGAGTCGTATTAAATT-3′) was annealed to a primer DNA (5′-AATTTAATACGACTCACTATAGG-3′). Site 33/34 (used for HIV-1; Fig. 1) and site 30/31 (used for M-MuLV; Fig. 4) are located between the 16th and 17th and the 19th and 20th nucleotides, respectively, from the template DNA 5′ end (underlined). For RNA R–54/–14, a template DNA (5′-GGAGACTAAATAAAATCTTTTATTTTATCTATGGC TCGTACGAGCTCCCTATAGTGAGTCGTATTAAATT-3′) was annealed to a primer DNA (5′-AATTTAATACGACTCACTATAGG-3′). Site T (Fig. 7) is located between the 15th and 16th nucleotides from the template DNA 5′ end (underlined). Base substitutions were carried out for each site as described in the results by changing the appropriate bases in the template DNA. The 49-mer RNA R–63/–23 was prepared as described previously 28.

For RNA T7/+17, two 29-mer DNAs (5′-GTACAATGAAAGACCCCACCTCGAGACGA-3′ and 5′-AGCTTCGTCTCGAGGTGGGGTCTTTCATT-3′) were annealed and placed in Acc65I and HindIII-linearized pGEM7Zf(+), and the resulting plasmid was linearized with BsmBI and transcribed in vitro. For RNA hRppt57, two 50-mer DNAs (5′-AGCTTCGTCTCCG AATTAGCCCTTCCAGTCCCCCCTTTTCTTTTAAAATG-3′ and 5′-AATTCATTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAATTCGGAGACGA-3′) were annealed and placed in HindIII and EcoRI-linearized pGEM9Zf(−), and the resulting plasmid was linearized with BsmBI for in vitro transcription. For RNA PPT62/HET, a 95 bp DNA was prepared by PCR of template 5 22 using a 19-mer DNA (5′-TCGGGAGCTCGTACGAGCC-3′) and a 33-mer DNA (5′-GGAATTCGTCTCGGAACAGGGACTTGAAAGCCC-3′). After digesting with EcoRI and SacI, the PCR product was introduced into EcoRI and SacI-linearized pGEM9Zf(−), and the resulting plasmid was linearized with BsmBI for in vitro transcription.

Preparation of hybrid substrates

RNAs were labeled at the 5′ or 3′ ends and typically annealed to DNA strands using ratios of RNA:DNA at 1:1 to make internal cleavage substrates for Rmzc RNAs, at 1:10 to generate DNA 3′ end-directed cleavage substrates for R–54/–14 RNAs, and at 1:2 for all other RNAs as described previously 28. Hybrids with RNAs containing base substitutions were prepared as described for the original RNA, as detailed below.

Internal cleavage assays with Rmzc used a 49-mer DNA (5′-GTGGGGTCTTTCATT GTACCGAATTCCTCGAGTCTAGCAATTCGCCCAT-3′) and the resulting hybrid 5′ ends were recessed by 2 nucleotides. DNA 3′ end-directed cleavage assays with Rmzc used 30-mer DNAs to position site 33/34 and site 30/31 at 18 and 15 nucleotides, respectively, from the DNA 3′ end (5′-GGGGTCTTTCATTGTACCGAATTCCTCGAG-3′) or to position site 30/31 at 18 nucleotides from the DNA 3′ end (5′-GTCTTTCATTGTACCGAATTCCTCGAGTCT-3′). Internal cleavage assays with R–54/–14 similarly used a 49-mer DNA (5′-AGACTAAATAAAATCTTTTATTTTATCTATGGCTCGTACGAGCTCCCTA-3′) and DNA 3′ end-directed cleavage assays used a 30-mer DNA (5′-GACTAAATAAAATCTTTTATTTTATCTATG-3′). Hybrids were generated with T7/+17 using a 70-mer DNA (5′-GTGGGGTCTTTCATTGTACCGAATTCCTCGAGTCTAGAGGAGCATGCGACGTCGGGCCCAATTCGCCCGA-3′), with PPT62/HET using template 5 22, and with hRppt57 using a 57-mer DNA (5′-ATTAGCCCTTCCAGTCCCCCCTTTTCTTTTAAAATGAATTCGTCGACGAGCTCCCTA-3′).

RNase H Cleavage assays

10 nM hybrid substrates in 20 μl reactions were treated with 0.2 pmol of HIV-1 reverse transcriptase for all assays, and 6–10 pmol or 0.6–1 pmol of M-MuLV reverse transcriptase in internal or DNA 3′ end-directed assays, respectively, as described 28. Using denaturing 20% acrylamide gels, cleavage products were visualized by PhosphorImager analysis and analyzed using ImageQuant software (Molecular Dynamics) 28. Cleavage assays with the isolated RNH domain of M-MuLV reverse transcriptase were carried out essentially as previously described 30.

Statistical analysis for nucleotide preferences of the RNH domain

The chi-square one-dimensional test was used to calculate the deviation from the random distribution of bases at each position of the aligned sequences 23, 29. The expected frequency of each nucleotide was determined by summing the nucleotide frequencies in the unique sequences of Rmzc, T7/+17, PPT62/HET, hRppt57, and R+1/+29. The expected nucleotide frequencies were A=0.286, C=0.197, G=0.296, and U=0.220.

Acknowledgments

This work was funded by the National Institutes of Health grant CA51605.

Footnotes

1

Abbreviations used: HIV-1, human immunodeficiency virus, type 1; M-MuLV, Moloney murine leukemia virus; PPT, polypurine tract; LTR, long terminal repeat; RNH, isolated RNase H domain of M-MuLV reverse transcriptase; HBD, hybrid binding domain.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Arts EJ, LeGrice SFJ. Interaction of retroviral reverse transcriptase with template-primer duplexes during replication. Prog Nucleic Acid Res Mol Biol. 1998;58:339–393. doi: 10.1016/s0079-6603(08)60041-0. [DOI] [PubMed] [Google Scholar]
  • 2.Telesnitsky A, Goff SP. Strong-stop Strand Transfer during Reverse Transcription. In: Skalka AM, Goff SP, editors. Reverse Transcriptase. Cold Spring Harbor Laboratory Press; Plainview, New York: 1993. pp. 49–83. [Google Scholar]
  • 3.Tisdale M, Schulze T, Larder BA, Moelling K. Mutations within the RNase H domain of human immunodeficiency virus type 1 reverse transcriptase abolish virus infectivity. J Gen Virol. 1991;72 (Pt 1):59–66. doi: 10.1099/0022-1317-72-1-59. [DOI] [PubMed] [Google Scholar]
  • 4.Tanese N, Telesnitsky A, Goff SP. Abortive reverse transcription by mutants of Moloney murine leukemia virus deficient in the reverse transcriptase-associated RNase H function. J Virol. 1991;65:4387–4397. doi: 10.1128/jvi.65.8.4387-4397.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Repaske R, Hartley JW, Kavlick MF, O’Neill RR, Austin JB. Inhibition of RNase H activity and viral replication by single mutations in the 3′ region of Moloney murine leukemia virus reverse transcriptase. J Virol. 1989;63:1460–1464. doi: 10.1128/jvi.63.3.1460-1464.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Nikolenko GN, Svarovskaia ES, Delviks KA, Pathak VK. Antiretroviral drug resistance mutations in human immunodeficiency virus type 1 reverse transcriptase increase template-switching frequency. J Virol. 2004;78:8761–8770. doi: 10.1128/JVI.78.16.8761-8770.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.di Marzo Veronese F, Copeland TD, DeVico AL, Rahman R, Oroszlan S, Gallo RC, Sarngadharan MG. Characterization of highly immunogenic p66/p51 as the reverse transcriptase of HTLV-III/LAV. Science. 1986;231:1289–1291. doi: 10.1126/science.2418504. [DOI] [PubMed] [Google Scholar]
  • 8.Telesnitsky A, Goff SP. RNase H domain mutations affect the interaction between Moloney murine leukemia virus reverse transcriptase and its primer-template. Proc Natl Acad Sci USA. 1993;90:1276–1280. doi: 10.1073/pnas.90.4.1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kohlstaedt LA, Wang J, Friedman JM, Rice PA, Steitz TA. Crystal structure at 3.5 A resolution of HIV-1 reverse transcriptase complexed with an inhibitor. Science. 1992;256:1783–1790. doi: 10.1126/science.1377403. [DOI] [PubMed] [Google Scholar]
  • 10.Ding J, Das K, Hsiou Y, Sarafianos SG, Clark AD, Jr, Jacobo-Molina A, Tantillo C, Hughes SH, Arnold E. Structure and functional implications of the polymerase active site region in a complex of HIV-1 RT with a double-stranded DNA template-primer and an antibody Fab fragment at 2.8 A resolution. J Mol Biol. 1998;284:1095–1111. doi: 10.1006/jmbi.1998.2208. [DOI] [PubMed] [Google Scholar]
  • 11.Huang H, Chopra R, Verdine GL, Harrison SC. Structure of a covalently trapped catalytic complex of HIV-1 reverse transcriptase: implications for drug resistance. Science. 1998;282:1669–1675. doi: 10.1126/science.282.5394.1669. [DOI] [PubMed] [Google Scholar]
  • 12.Sarafianos SG, Das K, Tantillo C, Clark AD, Jr, Ding J, Whitcomb JM, Boyer PL, Hughes SH, Arnold E. Crystal structure of HIV-1 reverse transcriptase in complex with a polypurine tract RNA:DNA. EMBO J. 2001;20:1449–1461. doi: 10.1093/emboj/20.6.1449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Das D, Georgiadis MM. The crystal structure of the monomeric reverse transcriptase from Moloney murine leukemia virus. Structure (Camb) 2004;12:819–829. doi: 10.1016/j.str.2004.02.032. [DOI] [PubMed] [Google Scholar]
  • 14.Jacobo-Molina A, Ding J, Nanni RG, Clark AD, Jr, Lu X, Tantillo C, Williams RL, Kamer G, Ferris AL, Clark P, Hizi A, Hughes SH, Arnold E. Crystal structure of human immunodeficiency virus type 1 reverse transcriptase complexed with double-stranded DNA at 3.0 A resolution shows bent DNA. Proc Natl Acad Sci USA. 1993;90:6320–6324. doi: 10.1073/pnas.90.13.6320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Champoux JJ, Schultz SJ. Ribonuclease H: properties, substrate specificity and roles in retroviral reverse transcription. Febs J. 2009;276:1506–1516. doi: 10.1111/j.1742-4658.2009.06909.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Champoux JJ. Roles of Ribonuclease H in Reverse Transcription. In: Skalka AM, Goff SP, editors. Reverse Transcriptase. Cold Spring Harbor Press; Cold Spring Harbor, NY: 1993. pp. 103–118. [Google Scholar]
  • 17.Klumpp K, Mirzadegan T. Recent progress in the design of small molecule inhibitors of HIV RNase H. Curr Pharm Des. 2006;12:1909–1922. doi: 10.2174/138161206776873653. [DOI] [PubMed] [Google Scholar]
  • 18.Schultz SJ, Champoux JJ. RNase H activity: Structure, specificity, and function in reverse transcription. Virus Res. 2008;134:86–103. doi: 10.1016/j.virusres.2007.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Fuentes GM, Rodriguez-Rodriguez L, Fay PJ, Bambara RA. Use of an oligoribonucleotide containing the polypurine tract sequence as a primer by HIV reverse transcriptase. J Biol Chem. 1995;270:28169–28176. doi: 10.1074/jbc.270.47.28169. [DOI] [PubMed] [Google Scholar]
  • 20.Kelleher CD, Champoux JJ. RNA degradation and primer selection by Moloney murine leukemia virus reverse transcriptase contribute to the accuracy of plus strand initiation. J Biol Chem. 2000;275:13061–13070. doi: 10.1074/jbc.275.17.13061. [DOI] [PubMed] [Google Scholar]
  • 21.Schultz SJ, Zhang M, Kelleher CD, Champoux JJ. Analysis of plus-strand primer selection, removal, and reutilization by retroviral reverse transcriptases. J Biol Chem. 2000;275:32299–32309. doi: 10.1074/jbc.M000021200. [DOI] [PubMed] [Google Scholar]
  • 22.Schultz SJ, Zhang M, Champoux JJ. Specific cleavages by RNase H facilitate initiation of plus-strand RNA synthesis by Moloney murine leukemia virus. J Virol. 2003;77:5275–5285. doi: 10.1128/JVI.77.9.5275-5285.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Schultz SJ, Zhang M, Champoux JJ. Recognition of internal cleavage sites by retroviral RNases H. J Mol Biol. 2004;344:635–652. doi: 10.1016/j.jmb.2004.09.081. [DOI] [PubMed] [Google Scholar]
  • 24.DeStefano JJ, Cristofaro JV, Derebail S, Bohlayer WP, Fitzgerald-Heath MJ. Physical mapping of HIV reverse transcriptase to the 5′ end of RNA primers. J Biol Chem. 2001;276:32515–32521. doi: 10.1074/jbc.M103958200. [DOI] [PubMed] [Google Scholar]
  • 25.DeStefano JJ, Bambara RA, Fay PJ. Parameters that influence the binding of human immunodeficiency virus reverse transcriptase to nucleic acid structures. Biochemistry. 1993;32:6908–6915. doi: 10.1021/bi00078a014. [DOI] [PubMed] [Google Scholar]
  • 26.DeStefano JJ, Mallaber LM, Fay PJ, Bambara RA. Determinants of the RNase H cleavage specificity of human immunodeficiency virus reverse transcriptase. Nucleic Acids Res. 1993;21:4330–4338. doi: 10.1093/nar/21.18.4330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Palaniappan C, Fuentes GM, Rodriguez-Rodriguez L, Fay PJ, Bambara RA. Helix structure and ends of RNA/DNA hybrids direct the cleavage specificity of HIV-1 reverse transcriptase RNase H. J Biol Chem. 1996;271:2063–2070. [PubMed] [Google Scholar]
  • 28.Schultz SJ, Zhang M, Champoux JJ. Preferred Sequences within a Defined Cleavage Window Specify DNA 3′ End-Directed Cleavages by Retroviral RNases H. J Biol Chem. 2009;284:32225–32238. doi: 10.1074/jbc.M109.043158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Schultz SJ, Zhang M, Champoux JJ. Sequence, distance, and accessibility are determinants of 5′-end-directed cleavages by retroviral RNases H. J Biol Chem. 2006;281:1943–1955. doi: 10.1074/jbc.M510504200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Schultz SJ, Champoux JJ. RNase H domain of Moloney murine leukemia virus reverse transcriptase retains activity but requires the polymerase domain for specificity. J Virol. 1996;70:8630–8638. doi: 10.1128/jvi.70.12.8630-8638.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Zhan X, Crouch RJ. The isolated RNase H domain of murine leukemia virus reverse transcriptase. Retention of activity with concomitant loss of specificity. J Biol Chem. 1997;272:22023–22029. doi: 10.1074/jbc.272.35.22023. [DOI] [PubMed] [Google Scholar]
  • 32.Shinnick TM, Lerner RA, Sutcliffe JG. Nucleotide sequence of Moloney murine leukaemia virus. Nature. 1981;293:543–548. doi: 10.1038/293543a0. [DOI] [PubMed] [Google Scholar]
  • 33.DeStefano JJ. The orientation of binding of human immunodeficiency virus reverse transcriptase on nucleic acid hybrids. Nucleic Acids Res. 1995;23:3901–3908. doi: 10.1093/nar/23.19.3901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Furfine ES, Reardon JE. Reverse transcriptase. RNase H from the human immunodeficiency virus Relationship of the DNA polymerase and RNA hydrolysis activities. J Biol Chem. 1991;266:406–412. [PubMed] [Google Scholar]
  • 35.Gopalakrishnan V, Peliska JA, Benkovic SJ. Human immunodeficiency virus type 1 reverse transcriptase: spatial and temporal relationship between the polymerase and RNase H activities. Proc Natl Acad Sci USA. 1992;89:10763–10767. doi: 10.1073/pnas.89.22.10763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kati WM, Johnson KA, Jerva LF, Anderson KS. Mechanism and fidelity of HIV reverse transcriptase. J Biol Chem. 1992;267:25988–25997. [PubMed] [Google Scholar]
  • 37.Lim D, Gregorio GG, Bingman C, Martinez-Hackert E, Hendrickson WA, Goff SP. Crystal structure of the moloney murine leukemia virus RNase H domain. J Virol. 2006;80:8379–8389. doi: 10.1128/JVI.00750-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Julias JG, McWilliams MJ, Sarafianos SG, Arnold E, Hughes SH. Mutations in the RNase H domain of HIV-1 reverse transcriptase affect the initiation of DNA synthesis and the specificity of RNase H cleavage in vivo. Proc Natl Acad Sci USA. 2002;99:9515–9520. doi: 10.1073/pnas.142123199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Julias JG, McWilliams MJ, Sarafianos SG, Alvord WG, Arnold E, Hughes SH. Mutation of amino acids in the connection domain of human immunodeficiency virus type 1 reverse transcriptase that contact the template-primer affects RNase H activity. J Virol. 2003;77:8548–8554. doi: 10.1128/JVI.77.15.8548-8554.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Rausch JW, Lener D, Miller JT, Julias JG, Hughes SH, Le Grice SF. Altering the RNase H primer grip of human immunodeficiency virus reverse transcriptase modifies cleavage specificity. Biochemistry. 2002;41:4856–4865. doi: 10.1021/bi015970t. [DOI] [PubMed] [Google Scholar]
  • 41.McWilliams MJ, Julias JG, Sarafianos SG, Alvord WG, Arnold E, Hughes SH. Mutations in the 5′ end of the human immunodeficiency virus type 1 polypurine tract affect RNase H cleavage specificity and virus titer. J Virol. 2003;77:11150–11157. doi: 10.1128/JVI.77.20.11150-11157.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.McWilliams MJ, Julias JG, Sarafianos SG, Alvord WG, Arnold E, Hughes SH. Combining mutations in HIV-1 reverse transcriptase with mutations in the HIV-1 polypurine tract affects RNase H cleavages involved in PPT utilization. Virology. 2006;348:378–388. doi: 10.1016/j.virol.2005.12.042. [DOI] [PubMed] [Google Scholar]
  • 43.Hostomsky Z, Hostomska Z, Hudson GO, Moomaw EW, Nodes BR. Reconstitution in vitro of RNase H activity by using purified N-terminal and C-terminal domains of human immunodeficiency virus type 1 reverse transcriptase. Proc Natl Acad Sci USA. 1991;88:1148–1152. doi: 10.1073/pnas.88.4.1148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Smith JS, Roth MJ. Purification and characterization of an active human immunodeficiency virus type 1 RNase H domain. J Virol. 1993;67:4037–4049. doi: 10.1128/jvi.67.7.4037-4049.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tanese N, Goff SP. Domain structure of the Moloney murine leukemia virus reverse transcriptase: mutational analysis and separate expression of the DNA polymerase and RNase H activities. Proc Natl Acad Sci USA. 1988;85:1777–1781. doi: 10.1073/pnas.85.6.1777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tadokoro T, Kanaya S. Ribonuclease H: molecular diversities, substrate binding domains, and catalytic mechanism of the prokaryotic enzymes. Febs J. 2009;276:1482–1493. doi: 10.1111/j.1742-4658.2009.06907.x. [DOI] [PubMed] [Google Scholar]
  • 47.Cerritelli SM, Crouch RJ. Ribonuclease H: the enzymes in eukaryotes. Febs J. 2009;276:1494–1505. doi: 10.1111/j.1742-4658.2009.06908.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Nowotny M, Gaidamakov SA, Ghirlando R, Cerritelli SM, Crouch RJ, Yang W. Structure of human RNase H1 complexed with an RNA/DNA hybrid: insight into HIV reverse transcription. Mol Cell. 2007;28:264–276. doi: 10.1016/j.molcel.2007.08.015. [DOI] [PubMed] [Google Scholar]
  • 49.Han GW, Kopka ML, Cascio D, Grzeskowiak K, Dickerson RE. Structure of a DNA analog of the primer for HIV-1 RT second strand synthesis. J Mol Biol. 1997;269:811–826. doi: 10.1006/jmbi.1997.1085. [DOI] [PubMed] [Google Scholar]

RESOURCES