Skip to main content
Journal of Virology logoLink to Journal of Virology
. 1998 May;72(5):4297–4307. doi: 10.1128/jvi.72.5.4297-4307.1998

A Chimeric Ty3/Moloney Murine Leukemia Virus Integrase Protein Is Active In Vivo

Sandra L Dildine 1, James Respess 2, Doug Jolly 2, Suzanne B Sandmeyer 1,*
PMCID: PMC109660  PMID: 9557720

Abstract

This report describes the results of experiments to determine whether chimeras between a retrovirus and portions of Ty3 are active in vivo. A chimera between Ty3 and a Neor-marked Moloney murine leukemia virus (M-MuLV) was constructed. The C-terminal domain of M-MuLV integrase (IN) was replaced with the C-terminal domain of Ty3 IN. The chimeric retroviruses were expressed from an amphotrophic envelope packaging cell line. The virus generated was used to infect the human fibrosarcoma cell line HT1080, and cells in which integration had occurred were selected by G418 resistance. Three independently integrated viruses were rescued. In each case, the C-terminal Ty3 IN sequences were maintained and short direct repeats of the genomic DNA flanked the integration site. Sequence analysis of the genomic DNA flanking the insertion did not identify a tRNA gene; therefore, these integration events did not have Ty3 position specificity. This study showed that IN sequences from the yeast retrovirus-like element Ty3 can substitute for M-MuLV IN sequences in the C-terminal domain and contribute to IN function in vivo. It is also one of the first in vivo demonstrations of activity of a retrovirus encoding an integrase chimera. Studies of chimeras between IN species with distinctive integration patterns should complement previous work by expanding our understanding of the roles of nonconserved domains.


Efficient retroviral vectors have played a central role in the development of gene therapy. One of the limitations of retroviral vectors, however, is the relatively random selection of insertion sites. This can result in disruption of the target genome and cause expression of the therapeutic gene to be unpredictable. The yeast retrovirus-like element Ty3 inserts with position specificity at the site of transcription initiation by RNA polymerase III (pol III). Potential limitations of retrovirus-based vectors could be resolved by Moloney murine leukemia virus (M-MuLV)-based retroviral vectors with the position-specific integration properties of the yeast retrovirus-like element Ty3. In this study, we constructed a retroviral vector with a chimeric integrase (IN) and determined its ability to function in vivo.

Retroviruses integrate throughout the genomes of their hosts. Although integration is not completely random, the mechanisms that determine the positions of integration are poorly understood (16, 47, 56). Factors influencing the structure of DNA appear to play a role in target site selection. Assembly of DNA into nucleosomes created favored sites for integration at positions where the major groove is on the exposed face of the nucleosomal DNA helix (54). A more detailed analysis of integration sites in DNA assembled into chromatin showed that DNA that is most severely distorted and that has a wider major groove within the nucleosome is a preferential target for integration (52). Bending of the target DNA by different DNA binding proteins or by phased tracts of adenosine residues can create favored integration sites in the region where the DNA is distorted (6, 48, 53). Another factor in target site selection is sequence- or structure-specific DNA binding proteins. Fusion proteins have been created between the IN protein of human immunodeficiency virus (HIV) and the DNA binding domains of λ (8), Lex A repressor (27), Zif268 (10), or avian sarcoma virus (ASV) IN and the DNA binding domains of the Lex A repressor (35). These fusion proteins were able to target integrations to regions surrounding the protein recognition sequences in in vitro assays. In naturally occurring interactions, the DNA binding protein(s) itself may promote integration by interacting with the integration machinery. For example, integration was stimulated by a specific interaction between HIV IN and a putative transcription activator (33). The most compelling example of such an interaction is the position-specific integration of the yeast retrovirus-like element Ty3. Integration of Ty3 adjacent to tRNA genes requires RNA pol III transcription factors, suggesting that position-specific integration may be influenced by an interaction between the Ty3 integration machinery and pol III transcription factors (38).

The yeast retrovirus-like element Ty3 is more closely related to the Drosophila melanogaster gypsy-like retroviruses and to animal retroviruses than to the other yeast retrotransposons (29). Ty3 is composed of a 4.7-kb internal domain flanked by 340-bp long terminal repeats (LTRs) (14). It is distinguished from other retrotransposons and from retroviruses by its unique integration specificity. De novo insertions of Ty3 elements into yeast genomic DNA were shown to be integrated within 1 to 2 bp of the site of initiation of transcription of tRNA genes (12). Subsequently it was shown that the RNA pol III-transcribed genes, 5S and U6, can also serve as specific targets for Ty3 integration (13). The tRNA gene target must be transcriptionally competent since promoter mutations that abolish transcription prevent integration (13). Experiments using an in vitro integration assay and fractionated transcription extracts showed that transcription factors TFIIIB and TFIIIC are required for integration but that RNA pol III is not (15, 38). Ty3 integration does not significantly affect the expression of the adjacent tRNA gene (37), suggesting that Ty3 may have evolved naturally to insert in a nondetrimental position in the yeast genome. If the integration specificity of Ty3 could be adapted to retrovirus-based gene therapy vectors, this would result in a therapeutic vector that integrates into a predictable site in the genome. In addition, tRNA genes are redundant in the human genome and, because they are expressed constitutively, are likely to be located in accessible regions of chromatin in many cell types. Therefore, integration adjacent to a tRNA gene may lead to more predictable levels of expression of the therapeutic vectors than integration into random sites.

Studies on retroviral IN proteins suggest there are domains that can be separated to generate functional IN chimeras. Computer alignment of the amino acid sequence of retroviral and retrotransposon IN proteins show that there is a highly conserved region which includes seven invariant residues [an HHCC metal finger and a DD(35)E active-site motif], flanked by N-terminal and C-terminal domains (31, 36). The central core region beginning C terminal to the HHCC motif appears to constitute a domain by structural and functional criteria. Limited proteolysis of HIV IN showed that a core of about 120 amino acids (aa) including the DD(35)E motif was relatively resistant to proteolysis (22). Expression of recombinant subclones of HIV type 1 further showed that the region from aa 50 to 186 containing the same subset of conserved residues was sufficient to carry out the disintegration reaction (9, 65), indicating that this domain functions independently in polynucleotidyl transfer. The minimal domain for disintegration activity of M-MuLV IN includes the DD(35)E motif and most of the C-terminal region (32). The region containing the HHCC and DD(35)E motifs, conserved among all retrovirus IN proteins, shares approximately 25% amino acid identity between M-MuLV and Ty3 (data not shown). Amino acid substitutions in the DD(35)E catalytic triad block retroviral integration in vivo (11, 42, 60) and in vitro (21, 22, 40, 43, 64). Ty3 IN also requires the conserved DD(35)E motif, since amino acid substitutions in the active site blocked 3′-end processing in vivo (39) and virus-like particles containing the IN mutations were unable to catalyze integration in vitro (38). The N-terminal (to the HHCC) and C-terminal domains are poorly conserved among retroviruses. The C-terminal domain shows the greatest variability in size and sequence (31, 36). The C-terminal domains of HIV, M-MuLV, and Ty3 IN are about 100, 140, and 230 aa, respectively. The C terminus of retroviral IN proteins contains a domain that has been shown to have nonspecific DNA binding activity (24, 36, 49, 58, 65, 67), but the DNA binding domain is not required for catalytic activity. Its significantly larger size in Ty3 suggests that it could perform other functions. This domain in Ty3 is a candidate for targeting integration. We have replaced the C domain of M-MuLV IN with the C domain of Ty3 IN to determine whether portions of Ty3 IN could substitute for portions of M-MuLV IN and, if so, whether changes in patterns of M-MuLV integration result.

Three independently integrated chimeric retroviruses containing the Ty3 C-terminal domain (AMBMCT; see the legend to Fig. 1 for nomenclature) were identified. Sequence analysis of each of these chimeric viruses has revealed the maintenance of the Ty3 C-terminal sequences. Short direct repeats of the flanking genomic DNA were detected, indicating that these are true integrations. Searching the National Center for Biotechnology Information (NCBI) sequence database with the rescued flanking sequence did not reveal the presence of a tRNA gene. Therefore, these integrations do not appear to be position specific. These results show that the C-terminal domain of Ty3 IN can substitute for M-MuLV sequence and provide some IN activity.

FIG. 1.

FIG. 1

(A) Primary structure comparison of Ty3 and M-MuLV IN proteins. Open and filled boxes represent Ty3 and M-MuLV IN amino acids, respectively. IN proteins are divided into A (amino-terminal), B (central or core), and C (carboxyl-terminal) domains. Domains were chosen based on amino acid similarity and functional and structural similarities. The numbers in the boxes indicate the amino acid boundaries of the domains. H and C indicate conserved histidine and cysteine, respectively, amino acids which constitute a metal finger domain. D, D, and E indicate conserved aspartic acid and glutamic acid, respectively, amino acids which constitute the catalytic triad of the protein(s). (B) Construction of chimeric retroviral vectors. Single-stranded DNA from pMLV/Ty3IN was synthesized and served as the template for single-stranded oligonucleotide mutagenesis to produce pAMBMCT, where A, B, and C refer to the IN domains listed above, M refers to M-MuLV-derived sequences, and T refers to Ty3-derived sequences. Plasmid pRgpKan is a rescuable shuttle vector containing the M-MuLV LTR, gag and pol genes, and SV40 promoter (SV40) driving a Neor gene and a ColE1 origin of replication (ori). Plasmid pRgpAMBMCT contains the chimeric IN sequences in place of the wild-type M-MuLV IN sequences in a pRgpKan background.

MATERIALS AND METHODS

DNA constructs.

Wild-type M-MuLV retroviral vector plasmids used in this study are designated pRgpNeo and pRgpKan. pRgpNeo is a rescuable shuttle vector, derived from the BAG vector (51) in which the β-galactosidase sequences, bp 812 (AatII) to 4156 (SalI), were replaced with M-MuLV gag-pol sequences, bp 367 (AatII) to 5872 (ScaI) (61), from pMLV-K (46). The vector expresses G418 resistance, and gag and pol gene products encapsidate the vector RNA in mammalian cells. In a bacterial host, pRgpNeo exists as a plasmid containing the ColE1 origin of replication and expresses kanamycin resistance. pRgpKan is similar to pRgpNeo but contains a single LTR and no polyomavirus sequences. When pRgpKan is transiently expressed in mammalian cells along with an envelope gene, a retroviral vector identical to pRgpNeo is produced.

The chimeric retroviral vector pRgpAMBMCT was constructed via a multistep cloning process. M-MuLV nucleotides 3706 to 6538 from p2XMLV (50) encompassing IN were cloned into the pIBI-20 (17) phagemid vector cleaved in the polylinker with SalI and BamHI to create pMLVIN. Ty3 IN sequences were provided by pVB193 (4), which contains Ty3 sequences from 3132 to 5351 (30) encompassing IN in the pIBI-20 phagemid vector. A 4.4-kb ScaI fragment containing IN sequences from pMLVIN was ligated to a 4.5-kb SalI-ScaI fragment containing Ty3-1 IN sequences from pVB193 to generate plasmid pMLV/Ty3IN. The phagemid vector pMLV/Ty3IN contains the M-MuLV IN- and Ty3 IN-coding sequences in tandem. It served as a template for the synthesis of single-stranded DNA for mutagenesis. The 41-mer oligonucleotide, 5′-CCTAAATCAATTTCAAATGGGGTGAGGCCATGGGGGCCCGG-3′, complementary to M-MuLV nucleotides 5379 to 5399 and Ty3 nucleotides 4362 to 4381 was used to loop out the intervening sequences and join the M-MuLV B domain to the Ty3 C domain via single-stranded oligonucleotide mutagenesis (41). This generated plasmid pAMBMCT. The 44-mer oligonucleotide, 5′-GGATGTTTCGGGGGTTATAGTTAAAAGATACTCCTCCCATCTCC-3′, complementary to M-MuLV nucleotides 4597 to 4619 and Ty3 nucleotides 3450 to 3470 was used to loop out the intervening sequences and join the coding region for the first 4 aa of M-MuLV IN to the coding region for the N terminus of Ty3 IN. This generated plasmid pATBTCT. The 44-mer oligonucleotide, 5′-CTTCTAGGGAATAATTCTTTCGTTTTCCTTGGTAGACCCAATAC-3′, complementary to M-MuLV nucleotides 4712 to 4733 and Ty3 nucleotides 3633 to 3654 was used to loop out the intervening sequences and join the M-MuLV A domain to the Ty3 B domain. This generated plasmid pAMBTCT.

The chimeric IN sequences were cloned into the retroviral vector by ligation of a 4.1-kb NheI-SalI fragment from pRgpKan, which encompassed the M-MuLV LTR, gag, and part of pol; a 3-kb NheI-SalI fragment from BAGΔX (truncated version of the BAG vector in which the polyomavirus sequences have been deleted and there is only one LTR [51]), which provided the neomycin phosphotransferase resistance (Neor) gene from transposon Tn5 under control of the simian virus 40 (SV40) promoter and ColE1 origin of replication; and an approximately 2.7-kb SalI-XhoI fragment from pAMBMCT, pATBTCT, or pAMBTCT, which provided the chimeric IN sequences. In addition to Ty3 IN sequences, the 2.7-kb SalI-XhoI fragment contains the majority of the Ty3 LTR sequences to bp 5332 (30). Ligation products were transformed into Escherichia coli HB101 via electroporation. Transformants were selected by growth in the presence of kanamycin (50 μg/ml). This generated the chimeric retroviral vector plasmids pRgpAMBMCT (two independent clones, 3-7 and 4-11) and pRgpATBTCT and pRgpAMBTCT(−), which contained the chimeric IN sequences inserted in the opposite orientation. The presence of the chimeric IN sequences was verified by restriction enzyme digestion and sequence analysis in the region of the M-MuLV-Ty3 junctions.

Cell lines.

The cell line 293 2-3 (7) is derived from the human adenovirus type 5-transformed embryonal kidney cell line 293 (ATCC CRL1573) and expresses M-MuLV gag and pol genes. The packaging cell line NC10 is derived from the human fibrosarcoma cell line HT1080 (ATCC CCL121). NC10 cells express the M-MuLV 4070A amphotrophic envelope (5). All cells were grown in Dulbecco’s modification of Eagle’s medium supplemented with 10% fetal calf serum (Irvine Scientific).

Virus production and G418 selection for integration.

The retroviral vectors were independently cotransfected (28) into 293 2-3 cells along with pMLP-G at a 1:1 ratio (10 μg of each). Plasmid pMLP-G expresses vesicular stomatitis virus (VSV) G protein and was used to pseudotype retroviral vector particles (7). After 48 h, 10 ml of medium from transfected cell cultures was collected, filtered through 45-μm-pore-size cellulose acetate filters (Nalgene Inc.), and placed onto NC10 cells. After 24 h, the transduced NC10 culture was subjected to G418 (900 μg/ml) selection for 24 to 48 h. Selection was continued at a reduced level of G418 (600 μg/ml) until nontransduced control cultures no longer contained viable cells. The resulting NC10 culture produced retroviral vectors consisting of (i) the chimeric RNA genome and Gag and Gag-Pol from the chimeric construct and (ii) amphotropic envelope. NC10 producer cells were grown to confluency. Medium was changed and collected after 24 h. Ten milliliters of filtered medium was placed onto target HT1080 cells at approximately 50% confluency. After 24 h, medium was replaced with fresh Dulbecco modified Eagle medium plus 10% fetal calf serum. After an additional 24 h, the cells were placed under G418 selection at 900 μg/ml for 24 to 48 h. Selection continued at 600 μg/ml until nontransduced control cultures no longer contained viable cells. G418-resistant cell cultures were expanded and used to prepare genomic DNA.

Genomic DNA extraction, Southern analysis, and rescue of integrated retroviral vectors.

High-molecular-weight DNA was prepared from G418-resistant HT1080 cells infected with the chimeric RgpAMBMCT and wild-type RgpNeo retroviral vectors. Approximately 10 μg of each sample of genomic DNA was digested with the restriction enzyme PstI, and the DNA was electrophoresed on a 0.9% agarose gel. The DNA was transferred onto Zeta-Probe GT nylon membranes (Bio-Rad) by using a PosiBlot pressure blotter (Stratagene Inc.). The blots were hybridized at 42°C in the presence of 50% formamide with a fragment of M-MuLV (nucleotides 4643 to 5873) or Ty3 (nucleotides 3132 to 5332) which was specific for the respective IN-coding regions. The probe was synthesized by extending random primers in the presence of [α-32P]dATP with the Megaprime DNA labeling system (Amersham, Inc.). Integrated retroviral vectors were recovered by digestion of 10 μg of genomic DNA with ScaI, extraction of DNA with phenol-chloroform, and precipitation of DNA with ethanol. ScaI-digested genomic DNA was ligated with T4 DNA ligase (New England BioLabs, Inc.). DNA was extracted with phenol-chloroform and precipitated with ethanol precipitation in the presence of 10 μg of glycogen. DNA was resuspended in 7 μl of TE, (10 mM Tris base, 1 mM EDTA [pH 8.0]). One microliter was used to quantitate the DNA by fluorometry using a Mini TKO 100 DNA Fluorometer (Hoefer Scientific). The remainder of the DNA (6 μl) was used to transform E. coli DH12S (Life Technologies) via electroporation with a Gene Pulser (Bio-Rad). E. coli transformants were grown at 30°C and plated onto LB medium containing 50 μg of kanamycin per ml. Rescued retroviral vector plasmids were recovered by the alkaline lysis procedure (3). The retroviral vector and flanking genomic sequences were subcloned by standard DNA cloning techniques (3). In general, the rescued plasmids were digested with NheI and fragments were separated by electrophoresis in agarose gels. The NheI fragments representing the retroviral vectors were circularized. The NheI fragments representing the flanking genomic DNA were cloned into the XbaI site of the pIBI-20 vector.

Sequence analysis.

Dideoxynucleotide sequencing was performed by the method of Sanger et al. (57), using the Sequenase enzyme (U.S. Biochemicals). Oligonucleotide primers used for sequence analysis were as follows: 394, 5′-ATGCATCTCTATGCAC-3′ (complementary to Ty3 nucleotides 5307 to 5323); 175, 5′-CAGGGTGACGTATTGTC-3′ (complementary to Ty3 nucleotides 5042 to 5054); 194, 5′-ATGCATCTCTATGCAC-3′ (complementary to Ty3 nucleotides 4581 to 4596); 199, 5′-CAACTGGCTCTAGAC-3′ (M-MuLV nucleotides 5317 to 5331); 387, 5′-GTCTCGCTGTTCCTTGGGAG-3′ (M-MuLV nucleotides 80 to 99); universal primer, 5′-GTAAAACGACGGCCAGTG-3′ (complementary to pIBI-20 nucleotides 341 to 358); and reverse primer, 5′-CAGGAAACAGCTATGACC-3′ (pIBI-20 nucleotides 202 to 219). Sequencing reactions were fractionated by electrophoresis in 8% polyacrylamide/bisacrylamide (19:1, National Diagnostics)–8 M urea gels and visualized by autoradiography.

PCR amplification of HT1080 preintegration genomic DNA.

High-molecular-weight genomic DNA isolated from HT1080 cells was used as a substrate for asymmetric PCR using oligonucleotide primers 406 (5′-GAACCACTAAGTTTGCTTGTGG-3′) and 407 (5′-CCATGAGAAATACTAGGTGACTGC-3′). The sequences of oligonucleotide primers 406 and 407 represent opposite strands on the 5′ and 3′ flanks, respectively, of the rescued integration from the 4-11 retroviral chimeric clone 10 [4-11(10)]). One microgram of HT1080 genomic DNA was amplified with 50 and 0.5 pmol of oligonucleotide primers 406 and 407, respectively, 2 mM MgCl2, and 2.5 U of Taq polymerase (Perkin-Elmer, Inc.). PCR products were collected by ethanol precipitation in the presence of 2.5 M sodium acetate and annealed to oligonucleotide 407 prior to sequence analysis.

Northern analysis.

Total cytoplasmic RNA was extracted from NC10 producer cells as described previously (19). Approximately 10 μg of RNA from each sample was denatured by reaction with glyoxal as described previously (45) and subjected to electrophoresis in a 1.1% agarose gel in 10 mM NaP (pH 7) at ∼130 V for 4 h. The RNA was transferred in a PosiBlot pressure blotter (Stratagene) to a Duralon-UV membrane and cross-linked in a UV Stratalinker 1800 (Stratagene). Samples on identical membranes were hybridized with a fragment of M-MuLV (nucleotides 4643 to 5750) or Ty3 (nucleotides 3132 to 5332) which was specific for the respective IN-coding regions. Membranes were stripped of probes by the addition of boiling 0.1× SSC (1× SSC is 0.15 M NaCl plus 0.015 M sodium citrate)–1% sodium dodecyl sulfate and rehybridized with a fragment from M-MuLV gag-pol sequences (nucleotides 1906 to 4444). Membranes were stripped again and rehybridized with a fragment from the Neor gene (HindIII to EcoRI). The probes were synthesized by extending random primers in the presence of [α-32P]dATP with the Megaprime DNA labeling system (Amersham). Membranes were hybridized to probes and washed as described previously (14) except that the hybridization and second wash were done at 42°C.

RESULTS

Activity of retroviral vectors containing chimeric IN.

The C-terminal region of retroviral IN has been implicated in binding of target DNA. To test whether this region of the position-specific Ty3 IN could function in a retroviral context, we constructed chimeras based on a M-MuLV retroviral vector, pRgpKan (Fig. 1). pRgpKan contains a single LTR and gag and pol genes from M-MuLV, the Neor gene from transposon Tn5 driven by an SV40 promoter, and a bacterial origin of replication. The single LTR directs both initiation and termination of transcription so that an RNA genome is transcribed with LTR information at both ends. The wild-type M-MuLV retroviral vector used in these experiments was pRgpNeo, derived from the BAG vector (51). Plasmid pRgpNeo differs from pRgpKan by having two LTRs and sequences from the polyomavirus early region. Once pRgpKan is expressed in mammalian cells, an RNA genome which is identical to pRgpNeo is transcribed; therefore, viruses produced from these two retroviral vectors are identical. The retroviral vector pRgpAMBMCT, substituting the C domain of the Ty3 IN for the C domain of M-MuLV IN in the pRgpKan vector, was constructed as described above. To assay the function of the chimeric IN protein, retrovirus containing chimeric IN was generated via a three-step process in cell culture. The chimeric retroviral vectors were cotransfected into 293 2-3 cells (7) along with pMLP-G (expresses the VSV G protein) at a 1:1 ratio. In 293 2-3 cells, the retroviral vector RNA genome was encapsidated along with Gag and Gag-Pol into viral particles. Budded particles contained VSV G protein on the outer surface. Filtered supernatant fluid containing these particles was transferred onto NC10 cells. The transduced NC10 culture was placed under G418 selection. NC10 cells were maintained under G418 selection until nontransduced control cultures no longer contained viable cells. As judged by the relative number of G418-resistant cells, the chimeric genomes were packaged at the same rate as observed for the wild-type pRgpNeo vector (data not shown). The resulting culture then produced retrovirus consisting of the chimeric RNA genome, Gag and Gag-Pol expressed from the chimeric construct, and M-MuLV amphotrophic envelope.

Activity of the chimeric IN protein was assayed by placing filtered supernatants from the NC10 producer cells onto the target, HT1080 cells (∼106 cells) and selecting for G418-resistant transductants. Table 1 lists the approximate number of G418-resistant cell colonies per 10 ml of filtered producer supernatant fluid. Producer supernatants from the two independent clones, 3-7 and 4-11, of the pRgpAMBMCT chimeric construct were each assayed twice on target HT1080 cells. The RgpAMBMCT chimeric retrovirus yielded one to three G418-resistant cell colonies. In contrast, infection with the wild-type RgpNeo retrovirus yielded more than 104 G418-resistant cell colonies. The retroviral vector pRgpAMBTCT(−), containing chimeric IN sequences cloned into the retroviral vector in the reverse orientation, was used as a control. As expected, this construct, which did not have a functional IN protein, did not yield any G418-resistant cells. Supernatant from the NC10 producer cell line alone also did not yield any G418-resistant cells.

TABLE 1.

Transduction of HT1080 target cells with chimeric IN-containing retroviral vectors

Construct Isolate Titer on HT1080 cellsa
pRgpAMBMCT 3-7 0, ≥1
4-11 2, ≥3
pRgpNeo >104, >104
pRgpAMBTCT(−) 0, 0
No vector 0, 0
a

G418-resistant cell colonies surviving per 10 ml of filtered, NC10 producer cell supernatant. Numbers shown represent two separate transient transfections of 293 2-3 cells with the retroviral vectors and the establishment of two independent NC10 producer cell lines followed by titration on HT1080 cells. 

Maintenance of Ty3 IN sequences in integrated retroviral vector DNA.

To determine whether the chimeric IN sequences were maintained in the G418-resistant HT1080 cells, genomic DNA was isolated from HT1080 cells, digested with PstI, and subjected to Southern analysis as shown in Fig. 2. Pst1 cleaves within the retroviral vector on either side of the IN sequences to liberate a 6-kb (pRgpKan) or 6.5-kb (pRgpAMBMCT) fragment (Fig. 2C). Identical Southern blots were hybridized with a M-MuLV or Ty3 IN-coding sequence specific α-32P-labeled probes (Fig. 2A and B). Genomic DNA from RgpNeo-infected, G418-resistant HT1080 cells hybridized with the M-MuLV probe (Fig. 2A, lane 3) but not the Ty3 probe (Fig. 2B, lane 3). Genomic DNAs from RgpAMBMCT-infected, G418-resistant HT1080 cells from clone 3-7 (Fig. 2A and B, lanes 4) and clone 4-11 (Fig. 2A and B, lanes 5) hybridized with both the M-MuLV- and Ty3-specific probes, as expected. However, the size of the hybridizing fragment from genomic DNA of clone 3-7-infected cells was larger than expected. PstI-digested genomic DNA from control HT1080 cells (lanes 2) did not hybridize with either probe. The expected size of the fragments was determined by PstI digestion of plasmids pRgpKan (lanes 6), pRgpATBTCT (lanes 7), and pRgpAMBMCT (lanes 8).

FIG. 2.

FIG. 2

Maintenance of chimeric IN sequences in genomic DNA from chimeric retroviral vector-infected HT1080 cells. (A and B) Southern analysis of genomic DNA isolated from G418-resistant HT1080 cells infected with pRgpNeo (lanes 3), pRgpAMBMCT clone 3-7 (lanes 4), or pRgpAMBMCT clone 4-11 (lanes 5). HT1080 genomic DNA is represented in lanes 2. Lanes 2 to 5 contain 10 μg of high-molecular-weight genomic DNA digested with restriction enzyme PstI. PstI-digested plasmids pRgpKan (lanes 6), pRgpATBTCT (lanes 7), and pRgpAMBMCT (lanes 8) are included as molecular weight controls. Duplicate samples were analyzed via Southern hybridization with an α-32P-labeled M-MuLV IN-specific probe (A) and an α-32P-labeled Ty3 IN-specific probe (B). Lanes 1 contain λ HindIII as a size marker. (C) Schematic diagram of PstI restriction enzyme sites on circular plasmid pRgpKan (left) and integrated pRgpNeo (right). The IN-specific probes should detect a 6-kb (pRgpNeo) or a 6.5-kb (pRgpAMBMCT) PstI fragment encompassing the IN sequences.

Rescue of integrated chimeric retroviral vectors.

To verify the maintenance of the Ty3 IN-coding sequences and determine the sequence of the genomic DNA at the insertion site, integrated retroviral vector DNA was isolated from the genomic DNA. Figure 3 outlines the procedure used to rescue the integrated retroviral vector DNA and flanking genomic DNA. This procedure had the advantage of recovering both the 5′ and 3′ flanking genomic DNA; thus, whether a target site repeat exists, which is indicative of an integration event, can be determined. Genomic DNA was isolated from G418-resistant cells containing integrated chimeric retroviral vectors. The genomic DNA was digested with the restriction enzyme ScaI, which does not cut within the retroviral vector sequences. It was ligated at low plasmid concentrations, which favor self-ligation, and transformed into E. coli. Transformants were selected by kanamycin resistance. To verify the rescue of a full-length retroviral vector, the rescued plasmids were digested with NheI, which cuts once in each LTR. Rescued plasmids containing full-length retroviral vectors should yield a 9.2-kb (pRgpNeo) or a 9.7-kb (pRgpAMBMCT) fragment representing the retroviral vector sequence. Other fragments(s) should contain junction and flanking genomic DNA sequences. Table 2 lists the rescued plasmids which contain either the chimeric pRgpAMBMCT or wild-type pRgpNeo retroviral vector. Three independent integrations were rescued from cells infected with the 4-11 chimeric retroviral vector: 4-11(3), 4-11(9), and 4-11(10). The recovered plasmids contained the expected 9.7-kb retroviral vector NheI fragment and additional fragment(s) representing flanking genomic DNA. All rescued plasmids from the 3-7 chimeric clone were similar to 3-7(19), which contained the 9.7-kb NheI fragment but no fragments representing flanking genomic DNA. If the rescued plasmid is too large for maintenance in E. coli, the genomic sequences may be lost by recombination between the LTRs. In that event, only the retroviral vector sequence would be rescued. Three independent integrations were rescued from the cells infected with the wild-type RgpNeo retrovirus. Each of these contained the expected 9.2-kb retroviral vector NheI fragment and additional fragment(s) representing flanking genomic DNA. To compare the rescued retroviral vector sequences to the input retroviral vector sequences and to facilitate sequence analysis of the flanking genomic DNA, the rescued retroviral vector sequences and flanking genomic DNA were subcloned.

FIG. 3.

FIG. 3

Strategy to rescue integrated retroviral vector DNA along with target genomic DNA. High-molecular-weight genomic DNA was isolated from G418-resistant HT1080 cells infected with the retroviral vectors. (A) Schematic representation of retroviral vector DNA integrated into the genomic DNA. The open boxes represent the retroviral LTR sequences, the solid black line represents the internal retroviral vector sequence, and the stippled boxes represent the genomic DNA. Genomic DNA was digested with ScaI, circularized, and transformed into E. coli DH12S. Transformants were selected by kanamycin resistance. (B) The rescued plasmid. To facilitate further analysis, the genomic flanking sequence was subcloned into the pIBI-20 vector, and the retroviral vector sequence was reconstructed by NheI digestion of the rescued plasmid and circularization.

TABLE 2.

Rescued chimeric retroviral vectors and subclones

Input chimera Rescued chimera Size (kb)
Reconstructed retroviral vector Subcloned flanking sequence
4-11 4-11(9) 9.7 11
4-11(10) 9.7 12
4-11(3) 9.7 1.0
5.0
3-7 3-7(19) 9.7 a
pRgpNeo 2 9.2 5.0
3 9.2 5.5
0.45
10 9.2 1.8
a

Clone contained no flanking sequence. 

To subclone the rescued retroviral vector sequences and flanking DNA, the recovered plasmids were digested with NheI. The 9.2- or 9.7-kb fragment was isolated, circularized by ligation and transformed into E. coli. The reconstructed retroviral vectors are listed in Table 2. BglII cuts within the M-MuLV IN- and Ty3 IN-coding sequences to yield a distinct pattern when hybridized with either a M-MuLV IN- or Ty3 IN-coding sequence-specific probe (Fig. 4C). Rescued and input retroviral vector plasmids were digested with BglII and subjected to Southern analysis. Figure 4 shows the BglII digestion pattern of input and rescued retroviral vectors pRgpNeo, 3-7, and 4-11 when hybridized with either an M-MuLV IN (Fig. 4A)- or Ty3 IN (Fig. 4B)-coding sequence-specific probe. The rescued wild-type pRgpNeo vector yielded a hybridization pattern identical to that of the input plasmid (Fig. 4A and B; compare lanes 9 to 11 with lane 1). Each rescued plasmid from the 3-7 and 4-11 chimeric clones yielded a hybridization pattern identical to that of the input chimeric plasmids (Fig. 4A and B; compare lanes 5 to 8 with lanes 2 and 3). These results indicated that no gross deletion or rearrangements of the chimeric IN coding sequences had occurred. Sequence analysis of the entire Ty3 IN C domain in the three 4-11 chimeric clones verified the presence of the Ty3 IN-coding sequences, with the exception of single nucleotide changes identified in the 4-11(3) and 4-11(10) rescued vectors. The single nucleotide changes would result in an aa substitution of aspartic acid for asparagine at position 382 in 4-11(3) and lysine for glutamic acid at position 486 in 4-11(10) in the Ty3 IN C domain. The significance of these second-site mutations remains to be determined. Nonetheless, the maintenance of Ty3 IN sequences in the rescued retroviral vectors indicates that these integrations were mediated by a chimeric IN protein containing the C-terminal domain of Ty3 IN.

FIG. 4.

FIG. 4

Maintenance of chimeric IN sequences in rescued retroviral vectors. (A and B) Southern hybridization of subcloned, rescued retroviral vectors with an α-32P-labeled M-MuLV (A) or Ty3 (B) IN-coding sequence specific probe. The subcloned retroviral vectors rescued from HT1080 cells infected with pRgpNeo (lanes 9 to 11), pRgpAMBMCT clone 3-7 (lanes 5), and clone 4-11 (lanes 6 to 8) were digested with restriction enzyme BglII, and restriction patterns were compared to the BglII digestion patterns of the input retroviral vector plasmids pRgpKan (lanes 1), pRgpAMBMCT clone 3-7 (lanes 2), and clone 4-11 (lanes 3). Lambda DNA digested with HindIII and DNA size markers (no. VIII; Boehringer Mannheim Inc.) were used as molecular weight markers (M; lanes 4). The size, in base pairs, of the expected BglII fragments are indicated at the left. (C) BglII restriction enzyme sites in pRgpNeo and M-MuLV and Ty3 IN sequences are indicated by the vertical arrows. BglII cuts within the pRgpKan and pRgpAMBMCT IN sequences to yield fragments of the indicated sizes.

Sequence analysis of the flanking genomic DNA.

A hallmark of retroviral integration into genomic DNA is the direct duplication of the target genomic DNA. The size of the target site repeat is determined by the distance between the positions of IN nicking on the two strands. The separation of nicks in the two strands varies among retroviruses; for M-MuLV it is 4 bp (62), and for Ty3 it is 5 bp (14). Flanking genomic DNA rescued from the three wild-type RgpNeo and three RgpAMBMCT chimeric retroviral vector integrations was subcloned into the pIBI-20 vector (Table 2) for sequence analysis. Flanking sequence from the pRgpNeo retroviral vector, as expected, revealed the presence of 4-bp target site repeats for all three rescued integrations (Fig. 5). Flanking sequence from each of the three integrations from the RgpAMBMCT chimeric retroviral vector contained target site repeats, indicating that the incorporation of chimeric retroviral sequences into genomic DNA occurred by integration. Interestingly, two of the integrations produced a 4-bp target site repeat, similar to that made by M-MuLV IN, while the other integration produced a 5-bp target site repeat, similar to that made by Ty3 IN (Fig. 5). To verify the 5-bp repeat, the preintegration genomic DNA was amplified and sequenced by asymmetric PCR (Fig. 6). The nucleotide sequence on the left represents the 3′ LTR, and the flanking DNA with the 5-bp repeat, AGGGT, is indicated. The nucleotide sequence on the right represents the preintegration genomic DNA containing the target AGGGT and flanking sequence. Below the sequencing gel, the nucleotide sequence of the preintegration genomic DNA is shown with the positions of the staggered cuts inferred to be made by the chimeric IN protein indicated by the arrows. Joining of 3′ ends of viral DNA at 5′-overhanging positions in target DNA followed by repair, presumably by host enzymes, results in a 5-bp duplication of the target site (underlined). The substitution of Ty3 sequences in the C-terminal domain of M-MuLV IN may result in an IN protein with hybrid strand transfer activity, or it may simply make strand transfer less precise.

FIG. 5.

FIG. 5

Genomic target DNA sequence and identification of target site repeats. Target genomic DNA flanking the integrated retroviral vectors was subcloned into the pIBI-20 vector. Flanking genomic DNA was identified by sequence analysis with an oligonucleotide complementary to M-MuLV U5 LTR sequences (3′ target DNA) or pIBI-20 vector sequences (5′ target DNA). The three independent integrations rescued from HT1080 cells infected with either the chimeric RgpAMBMCT or the wild-type RgpNeo retroviral vector are illustrated. The nucleotide sequence of the genomic DNA duplicated upon integration is underlined. The lengths of flanking genomic DNA sequences obtained on the 5′ and 3′ sides of the integrated retroviral vector are indicated in parentheses. The boxes encompassing the LTRs flank the retroviral vector sequence that is represented by the solid line.

FIG. 6.

FIG. 6

Verification of the 5-bp target site repeat. (A) Preintegration genomic DNA from HT1080 cells was amplified by asymmetric PCR using oligonucleotide primers 406 and 407, complementary to the 5′ and 3′, respectively, genomic sequences flanking the rescued integration from the chimeric retroviral vector clone 4-11(10). The sequence of the PCR product was determined by using oligonucleotide 407 as a primer. The nucleotide sequence shown at the left is from the subcloned, rescued genomic DNA, representing the 3′ LTR and the adjacent flanking genomic DNA, with the 5-bp AGGGT repeat indicated. The nucleotide sequence shown at the right is from the amplified preintegration genomic DNA with one copy of the 5-bp AGGGT sequence. (B) Nucleotide sequence of the preintegration genomic DNA, with the 5-bp AGGGT sequence underlined and the positions of the staggered nicks made by the chimeric IN protein indicated by the arrows. (C) Diagram of this region after integration of the retroviral vector DNA and repair by cellular enzymes has occurred.

Integration of the RgpAMBMCT chimeric retroviral vector is not adjacent to a tRNA gene.

To determine whether the chimeric IN protein with the altered strand transfer activity was targeting integration to a tRNA gene, the flanking genomic DNA sequence was determined for 100 to 250 nucleotides in the 5′ and 3′ directions. The length of 5′ and 3′ nucleotide sequence obtained for each rescued integration event is indicated in Fig. 5. These sequences were used to search the NCBI sequence database by using the BLAST program (1). No matches were found between these sequences and any tRNA genes. The flanking genomic DNA sequences were also analyzed with the tRNA SCAN program (25), which searches sequences for tRNA structures. The tRNA SCAN program failed to find any tRNA genes in flanking sequences. Therefore, the three independent integrations mediated by the M-MuLV V/Ty3 chimeric IN protein did not have the position specificity of Ty3.

Analysis of viral proteins and RNA.

To determine the basis of low titers of G418-resistant HT1080 cells infected with the RgpAMBMCT chimeric retroviral vector, viral protein, and RNA levels of the RgpNeo wild-type and the RgpAMBMCT chimera were compared. The NC10 producer cells were used as the source of viral proteins and RNA. Western analysis of viral proteins from concentrated supernatants and cell lysates revealed high levels of mature capsid protein (30 kDa) from the RgpNeo producer cells and much (10- to 100-fold) lower levels of mature capsid protein from the RgpAMBMCT NC10 producer cells (data not shown). Western analysis of these same protein samples with a polyclonal anti-M-MuLV IN antibody detected protein only from the pRgpNeo producer cells, while an anti-Ty3 IN antibody could not distinguish a unique protein from any protein sample (data not shown). Similar results were obtained from immunoprecipitation of radiolabeled NC10 producer cells (data not shown). NC10 producer cell supernatants were also tested for reverse transcriptase activity. The positive control supernatants from RgpNeo-infected cells yielded significant reverse transcriptase activity, whereas the activity in chimera infected cell supernatants could not be distinguished from negative control supernatants of uninfected cells (data not shown). These results are consistent with low levels of viral proteins expressed from the NC10 chimera producer cells. Low levels of proteins could be attributable to poor expression of the chimeras or to unstable proteins.

To determine whether the low levels of viral proteins expressed in chimeric retroviral vector-producing NC10 cells were due to low levels of viral gene expression, vector RNA levels were determined. Equivalent amounts of cytoplasmic RNA (10 μg) from NC10 cells (lanes 1), RgpNeo-producing NC10 cells (lanes 2 and a lower exposure in lanes 4), and RgpAMBMCT-producing NC10 cells (lanes 3) were subjected to Northern analysis using α-32P-labeled M-MuLV gag-pol, M-MuLV IN, Ty3 IN, and Neor probes (Fig. 7). The membrane hybridized with the M-MuLV IN probe was stripped and rehybridized with the M-MuLV gag-pol probe and then restripped and rehybridized with the Neor probe. In general, the level of viral RNA expression is significantly lower from the RgpAMBMCT chimera-producing cells than from the RgpNeo wild-type virus-producing cells (compare lanes 2 to lanes 3). The RNAs expressed from each vector were not further investigated. The longest transcript is assumed to represent the full-length vector transcript because each of the probes hybridizes to this transcript. The shortest transcript hybridizing to the Neor probe is assumed to be the transcript derived from the SV40 promoter upstream of the Neor gene. The significantly (10- to 100-fold) lower level of vector RNA expression would be sufficient to explain the low levels of viral proteins detected from the RgpAMBMCT chimera-producing cells.

FIG. 7.

FIG. 7

Vector RNA levels in NC10 producer cells. Total cytoplasmic RNA harvested from NC10 cells (lanes 1), RgpNeo NC10 producer cells (lanes 2), and RgpAMBMCT NC10 producer cells (lanes 3) was denatured by glyoxylation, fractionated by electrophoresis, and transferred to a nylon membrane. Ten micrograms of RNA from each of the three samples was loaded in three separate sets on the same agarose gel. One set was stained with ethidium bromide to test for equivalent loading (data not shown). The other two sets were transferred onto nylon membranes. One set was hybridized with a Ty3 IN-coding region-specific probe, and the other set was hybridized with an M-MuLV IN-coding region-specific probe. The membrane hybridized with the M-MuLV IN probe was stripped and rehybridized with the gag-pol-specific probe and then stripped again and rehybridized with the Neor-specific probe. Lanes 1 to 3 represent the same exposure of the membranes to compare RNA levels, while lane 4 is a shorter exposure of samples in lane 2 to distinguish the hybridizing species.

DISCUSSION

Integration of retroviruses and retrotransposons into the host DNA displays various degrees of target site specificity. A number of factors, including the local DNA structure and the proteins bound to the DNA, influence target specificity. Specific proteins can affect insertion site selection through changes in the DNA structure or by interactions directly with the integration machinery. M-MuLV and HIV IN proteins act on the exposed major groove of DNA assembled into chromatin (54). The favored sites in this context are positions which are distorted the most by the bending of the DNA around the nucleosome (52). Targeting integration of the yeast retrovirus-like element Ty3 to tRNA genes requires the presence of pol III transcription factors on a transcriptionally competent tRNA gene template (38). This requirement suggests that integration may be targeted to tRNA genes via a protein-protein interaction between the Ty3 integration machinery and the pol III transcription factors. Although bending of the DNA in this region due to transcription factor binding occurs (44) and may enhance integration, it probably does not explain the exclusive use of tRNA genes as targets for Ty3 integration. In the work reported here, we have tested whether a retrovirus with a substitution of the C-terminal domain of M-MuLV IN with the C-terminal domain of Ty3 IN retains IN activity and whether the chimeric IN protein possess an altered target site preference. Several observations suggested that the RgpAMBMCT chimeric retroviral vector retained IN activity. (i) Infection of target HT1080 cells with the Neor-marked RgpAMBMCT chimeric retroviral vector yielded G418-resistant cells. (ii) The G418-resistant cells contained retrovirus insertions with intact chimeric IN-coding sequence. (iii) Each chimeric retrovirus insertion was flanked by short direct repeats, indicating that it occurred by integration. (iv) The target site repeats were 4 bp (similar to M-MuLV) and 5 bp (similar to Ty3), suggesting that the chimeric IN protein may possess a hybrid target site nicking activity. This is the first report of a chimeric IN protein retaining activity in vivo.

Ty3 IN contains the conserved sequences and 3′-end processing and strand transfer activities of retroviral IN proteins. The central core domains of M-MuLV and Ty3 IN encompassing the HHCC and DD(35)E motifs show 25% amino acid identity (4). The DD(35)E motif is absolutely required for the catalytic activity of IN since mutations in any of these residues will abolish IN activity in vitro (21, 22, 38, 40, 43, 64) and in vivo (11, 23, 39, 60, 63, 66). The conservation of functional motifs in this region suggests that it is responsible for functions of IN conserved in both Ty3 and M-MuLV IN. It therefore seems unlikely to mediate the position-specific integration activity unique to Ty3. HIV and FIV (feline immunodeficiency virus) IN display strand transfer patterns in vitro into naked DNA that are distinct from each other (59). A recent report describing the integration patterns generated in vitro by chimeric IN proteins between HIV and FIV indicated that the central core region was responsible for the distinct sequence preferences for strand transfer by these two IN proteins (59). Ty3 position-specific insertion, however, has not been demonstrated in vitro in the absence of target binding proteins and is relatively sequence independent. It therefore seems unlikely to have a distinct structural basis from the sequence preferences exhibited by HIV and FIV IN proteins.

In contrast to the central region, the N- and C-terminal domains of retroviruses are poorly conserved. Significant domain differences between retroviruses and Ty3 also exist in these regions. The domain N terminal to the HHCC motif is very small in some retroviruses and is poorly conserved among retroviral IN proteins generally. Its function is not known. This domain is 90 aa in Ty3, 55 aa in M-MuLV, and only 11 aa in HIV. Therefore, it could encode a function unique to Ty3. The C-terminal domain of retrovirus IN has been shown to have nonspecific DNA binding activity and therefore is believed to interact with the target DNA (24, 36, 49, 58, 65, 67). This domain also shows great variability among retroviral IN proteins (31, 36). The weak conservation of this domain would be consistent with the evolution of different targeting capabilities. The differences between Ty3 and retroviruses in integration patterns are extreme. However, more subtle differences are observed among retroviruses. For example, the distribution of M-MuLV IN and HIV IN insertion sites was not identical in minichromosome targets (54). In Ty3, the C-terminal domain is significantly larger than in retroviruses: 230 aa for Ty3 versus 140 aa for M-MuLV and 100 aa for HIV. The significantly larger size and sequence dissimilarity suggest that this region could mediate disparate functions in Ty3 and retroviruses. If, as the in vitro data suggest, Ty3 IN targets integration to tRNA genes via protein-protein interactions with pol III transcription factors, then the C-terminal region of Ty3 IN may be involved in this interaction.

Although the AMBMCT chimeric IN protein retained activity in vivo, the virus encoding this protein was much less infectious than the wild-type RgpNeo. Infection of HT1080 cells with the RgpAMBMCT chimeric vector resulted in the generation of only three G418-resistant HT1080 cell clones, compared to more than 104 cell clones for RgpNeo. There are a number of possible reasons that the RgpAMBMCT chimeric vector would have lower titers than the wild-type RgpNeo. These possibilities were investigated by analysis of the chimeric retroviral proteins and RNA. Hybridization with the M-MuLV IN-specific probe to the RNA from the pRgpNeo producer cells yielded bands with 10- to 100-fold-greater intensity than the bands seen with RNA from the RgpAMBMCT producer cells. The observed difference in RNA levels between cells expressing the wild-type RgpNeo and the RgpAMBMCT chimera is comparable to the difference in viral protein levels observed. The low viral protein levels seen in producer cell supernatants would explain the low titers of G418-resistant HT1080 target cells produced upon infection with these supernatants, although a reduced efficiency of integration due to additional instability or poor activity of the chimeric IN protein could also contribute. The low level of RNA expression from the RgpAMBMCT chimera producer cells could be explained by a genomic location or context effect on expression. However, this explanation is unlikely because the RgpAMBMCT chimera producer cells should represent a mixed population of chimeric proviruses integrated in different positions and therefore different genomic contexts as do the wild-type RgpNeo producer cells. In addition, wild-type M-MuLV IN mediated the integration of wild-type RgpNeo and the RgpAMBMCT chimeric vector into the NC10 genomic DNA at roughly the same efficiency, since similar numbers of G418-resistant NC10 cells were generated from each vector. The possibility that mutations that lowered transcription levels were introduced at some step into the LTR of the pRgpAMBMCT chimeric vector was investigated. The entire LTR regions of the input and rescued plasmids from the wild-type and pRgpAMBMCT chimeric vectors were sequenced. No sequence differences were detected between the input or the rescued plasmids from the pRgpAMBMCT chimeric vector and the wild-type vector. Thus, mutations in the LTR do not cause differential expression of the chimeric vector.

Substitution of the C-terminal domain of Ty3 IN for M-MuLV IN apparently provides a required function to the IN. Deletion analysis in the C-terminal domain of M-MuLV IN showed that deletions of more than 28 aa resulted in the loss of IN activity in vitro (32) and virus viability in vivo (55). Two different short linker insertions at aa 322 in M-MuLV resulted in either nonviable virus (20) or a virus with severely delayed growth (32). The ability of the Ty3 IN C-terminal domain to substitute at some level for the C-terminal domain of M-MuLV IN suggests that the function of this domain is conserved.

No gross deletions or rearrangements of the Ty3 IN sequences occurred in the three RgpAMBMCT chimeric virus insertions recovered. Sequence analysis verified the maintenance of the Ty3 IN sequences in this region, with the exception of single nucleotide changes identified in the 4-11(3) and 4-11(10) rescued vectors. The single nucleotide changes would result in an amino acid substitution of aspartic acid for asparagine at position 382 in 4-11(3) and lysine for glutamic acid at position 486 in 4-11(10) in the Ty3 IN C domain. The step in production of the chimeric retrovirus in which these mutations occurred would have determined whether the chimeric IN protein that actually mediated integration contained this substitution. If it was generated during reverse transcription prior to integration into NC10 cells, then the chimeric IN protein would have contained the amino acid substitution. However, if it was not generated until the reverse transcription step prior to integration in HT1080 cells, then the chimeric IN protein produced in NC10 cells would not have contained this substitution. These two possibilities could be distinguished by reverse transcription-PCR analysis of viral RNAs isolated at each stage and comparison of the nucleotide sequences in this region. If the mutation occurred at the first reverse transcription step and the chimeric protein did contain the amino acid substitution, the rescued chimeric retroviral vector plasmid could be used to generate virus and tested for IN activity.

Retrovirus and retrotransposon IN proteins make a staggered cut in the target DNA. The size of this cut is characteristic of each IN protein and determines the size of the target site repeat. The identification of both 4- and 5-bp target site repeats produced by the RgpAMBMCT chimeric virus suggests that the chimeric IN protein may possess a target site cutting activity which is a hybrid between M-MuLV (4 bp) and Ty3 (5 bp) IN activities. Comparison of the data from HIV and ASV IN crystal structures suggests that the distance between active-site residues in IN dimers may determine the size of the staggered cut (2). Therefore, one possible explanation for our results is that the chimeric IN protein does not form as stable a dimer as either the intact M-MuLV or Ty3 IN protein. The chimeric nature of the IN protein could affect interactions between the chimeric IN monomers or affect interactions with other viral or cellular proteins which are required in the integration complex. This less stable, chimeric IN dimer would then be less consistent in the relative positions of the staggered nicks in the target DNA. Variation in target site repeats has also been detected in products of in vitro integrations in studies using IN purified from avian myeloblastosis virions (26) or IN purified from avian sarcoma-leukosis virions or bacterial expression systems (34). In vivo, IN functions in a complex with other viral and perhaps cellular proteins; therefore, something could be missing in the in vitro reactions which contribute to the stability of the IN dimers and, in turn, the fidelity of the reaction in vivo.

Analysis of the flanking genomic DNA from three integrations mediated by the RgpAMBMCT chimeric viruses did not reveal tRNA genes. Therefore, these integrations did not exhibit Ty3 position specificity. There are several potential explanations for the apparent lack of specificity. First, the C-terminal domain of Ty3 IN does not independently mediate the position-specific integration of Ty3. Second, specific integrations occurred, but these were not recovered efficiently. Third, this domain mediates specificity but is inactive in the chimeric context. Fourth, the C-terminal domain of Ty3 interacts with yeast target proteins but not the human homologs. It has recently been shown that Ty3 can target integration to a human tRNA gene in yeast (18). Experiments using in vitro integration assays are under way to test whether human extracts can satisfy the Ty3 integration requirement for pol III transcription factors and to determine which, if any, of several chimeric retroviral vectors have Ty3 position specificity in vitro.

Fusion of heterologous DNA binding domains to retroviral IN proteins has been successfully used in vitro to target strand transfer to regions adjacent to the binding sites (8, 10, 27, 35). The LexA DNA binding domain fused to the C terminus of ASV IN was incorporated into viral particles. However, the resulting virus displayed delayed growth kinetics (35). In this study, we have introduced an alternative novel approach to targeting integration to a specific site. A chimeric retrovirus substituting the C domain of Ty3 IN for the C domain of M-MuLV IN was generated in an attempt to confer the position-specific integration property of Ty3 on the M-MuLV retroviral vector. This strategy would exploit the existence of IN homologs that are position specific to produce predictable insertions of retrovirus vectors into preexisting genomic targets.

ACKNOWLEDGMENTS

We thank H. Fan for the gift of plasmid p2XMLV. We thank Harry Mangalam and Virginia Bilanchone for computer analysis. We thank Marielle Reyes and Dat Hoang for excellent technical assistance.

This work was supported by Public Health Service grant STTR 95-1 to Viagene, Inc., San Diego, Calif., Chiron Technologies Center for Gene Therapy, and UC-STAR grant S96-45.

REFERENCES

  • 1.Altschul S F, Gish W, Miller W, Myers E W, Lipman D J. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 2.Andrake M D, Skalka A M. Retroviral integrase, putting the pieces together. J Biol Chem. 1996;271:19633–19636. doi: 10.1074/jbc.271.33.19633. [DOI] [PubMed] [Google Scholar]
  • 3.Ausubel F M, Brent R, Kingston R E, Moore D D, Seidman J G, Smith J A, Struhl K, editors. Current protocols in molecular biology. New York, N.Y: Greene Publishing Associates/Wiley-Interscience; 1992. [Google Scholar]
  • 4.Bilanchone, V. B., and S. B. Sandmeyer. Unpublished data.
  • 5.Bodner, M. Personal communication.
  • 6.Bor Y C, Bushman F D, Orgel L E. In vitro integration of human immunodeficiency virus type 1 cDNA into targets containing protein-induced bends. Proc Natl Acad Sci USA. 1995;92:10334–10338. doi: 10.1073/pnas.92.22.10334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Burns J C, Friedmann T, Driever W, Burrascano M, Yee J-K. Vesicular stomatitis virus G glycoprotein pseudotyped retroviral vectors: concentration to a very high titer and efficient gene transfer into mammalian and nonmammalian cells. Proc Natl Acad Sci USA. 1990;90:8033–8037. doi: 10.1073/pnas.90.17.8033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bushman F. Tethering human immunodeficiency virus 1 integrase to a DNA site directs integration to nearby sequences. Proc Natl Acad Sci USA. 1994;91:9233–9237. doi: 10.1073/pnas.91.20.9233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bushman F D, Engelman A, Palmer I, Wingfield P, Craigie R. Domains of the integrase protein of human immunodeficiency virus type 1 responsible for polynucleotidyl transfer and zinc binding. Proc Natl Acad Sci USA. 1993;90:3428–3432. doi: 10.1073/pnas.90.8.3428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bushman F D, Miller M D. Tethering human immunodeficiency virus type 1 preintegration complexes to target DNA promotes integration at nearby sites. J Virol. 1997;71:458–464. doi: 10.1128/jvi.71.1.458-464.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Cannon P M, Wilson W, Byles E, Kingsman S M, Kingsman A J. Human immunodeficiency virus type 1 integrase: effect on viral replication of mutations at highly conserved residues. J Virol. 1994;68:4768–4775. doi: 10.1128/jvi.68.8.4768-4775.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chalker D L, Sandmeyer S B. Transfer RNA genes are genomic targets for de novo transposition of the yeast retrotransposon Ty3. Genetics. 1990;126:837–850. doi: 10.1093/genetics/126.4.837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chalker D L, Sandmeyer S B. Ty3 integrates within the region of RNA polymerase III transcription initiation. Genes Dev. 1992;6:117–128. doi: 10.1101/gad.6.1.117. [DOI] [PubMed] [Google Scholar]
  • 14.Clark D J, Bilanchone V W, Haywood L J, Dildine S L, Sandmeyer S B. A yeast sigma composite element, Ty3, has properties of a retrotransposon. J Biol Chem. 1988;263:1413–1423. [PubMed] [Google Scholar]
  • 15.Connolly C M, Sandmeyer S B. RNA polymerase III interferes with Ty3 integration. FEBS Lett. 1997;405:305–311. doi: 10.1016/s0014-5793(97)00200-7. [DOI] [PubMed] [Google Scholar]
  • 16.Craigie R. Hotspots and warm spots: integration specificity of retroelements. Trends Genet. 1992;8:187–190. doi: 10.1016/0168-9525(92)90223-q. [DOI] [PubMed] [Google Scholar]
  • 17.Dente L, Cesareni G, Cortese R. pEMBL: a new family of single stranded plasmids. Nucleic Acids Res. 1983;11:1645–1655. doi: 10.1093/nar/11.6.1645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Dildine S L, Sandmeyer S B. Integration of the yeast retrovirus-like element Ty3 upstream of a human tRNA gene expressed in yeast. Gene. 1997;194:227–233. doi: 10.1016/s0378-1119(97)00167-4. [DOI] [PubMed] [Google Scholar]
  • 19.Dildine S L, Semler B L. The deletion of 41 proximal nucleotides reverts a poliovirus mutant containing a temperature-sensitive lesion in the 5′ noncoding region of genomic RNA. J Virol. 1989;63:847–862. doi: 10.1128/jvi.63.2.847-862.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Donehower L A. Analysis of mutant Moloney murine leukemia viruses containing linker insertion mutations in the 3′ region of pol. J Virol. 1988;62:3958–3964. doi: 10.1128/jvi.62.11.3958-3964.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Drelich M, Wilhelm R, Mous J. Identification of amino acid residues critical for endonuclease and integrase activities of HIV-1 IN protein in vitro. Viology. 1992;188:459–468. doi: 10.1016/0042-6822(92)90499-f. [DOI] [PubMed] [Google Scholar]
  • 22.Engelman A, Craigie R. Identification of conserved amino acid residues critical for human immunodeficiency virus type 1 integrase function in vitro. J Virol. 1992;66:6361–6369. doi: 10.1128/jvi.66.11.6361-6369.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Engelman A, Englund G, Orenstein J M, Martin M A, Craigie R. Multiple effects of mutations in human immunodeficiency virus type 1 integrase on viral replication. J Virol. 1995;69:2729–2736. doi: 10.1128/jvi.69.5.2729-2736.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Engelman A, Hickman A B, Craigie R. The core and carboxyl-terminal domains of the integrase protein of human immunodeficiency virus type 1 each contribute to nonspecific DNA binding. J Virol. 1994;68:5911–5917. doi: 10.1128/jvi.68.9.5911-5917.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Fichant G A, Burks C. Identifying potential tRNA genes in genomic DNA sequences. J Mol Biol. 1991;220:659–671. doi: 10.1016/0022-2836(91)90108-i. [DOI] [PubMed] [Google Scholar]
  • 26.Fitzgerald M L, Vora A C, Zeh W G, Grandgenett D P. Concerted integration of viral DNA termini by purified avian myeloblastosis virus integrase. J Virol. 1992;66:6257–6263. doi: 10.1128/jvi.66.11.6257-6263.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Goulaouic H, Chow S A. Directed integration of viral DNA mediated by fusion proteins consisting of human immunodeficiency virus type 1 integrase and Escherichia coli LexA protein. J Virol. 1996;70:37–46. doi: 10.1128/jvi.70.1.37-46.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Graham F L, van der Eb A J. A new technique for the assay of infectivity of human adenovirus 5 DNA. Virology. 1973;52:456–467. doi: 10.1016/0042-6822(73)90341-3. [DOI] [PubMed] [Google Scholar]
  • 29.Hansen L J, Chalker D L, Sandmeyer S B. Ty3, a yeast retrotransposon associated with tRNA genes, has homology to animal retroviruses. Mol Cell Biol. 1988;8:5245–5256. doi: 10.1128/mcb.8.12.5245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hansen L J, Sandmeyer S B. Characterization of a transpositionally active Ty3 element and identification of the Ty3 integrase protein. J Virol. 1990;64:2599–2607. doi: 10.1128/jvi.64.6.2599-2607.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Johnson M S, McClure M A, Feng D-F, Gray J, Doolittle R F. Computer analysis of retroviral pol genes: assignment of enzymatic functions to specific sequences and homologies with non-viral enzymes. Proc Natl Acad Sci USA. 1986;83:7648–7652. doi: 10.1073/pnas.83.20.7648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jonsson C B, Donzella G A, Gaucan E, Smith C M, Roth M J. Functional domains of Moloney murine leukemia virus integrase defined by mutation and complementation analysis. J Virol. 1996;70:4585–4597. doi: 10.1128/jvi.70.7.4585-4597.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Kalpana G V, Marmon S, Wang W, Crabtree G R, Goff S P. Binding and stimulation of HIV-1 integrase by a human homolog of yeast transcription factor SNF5. Science. 1994;266:2002–2006. doi: 10.1126/science.7801128. [DOI] [PubMed] [Google Scholar]
  • 34.Katz R A, Merkel G, Kulkosky J, Leis J, Skalka A M. The avian retroviral IN protein is both necessary and sufficient for integrative recombination in vitro. Cell. 1990;63:87–95. doi: 10.1016/0092-8674(90)90290-u. [DOI] [PubMed] [Google Scholar]
  • 35.Katz R A, Merkel G, Skalka A M. Targeting of retroviral integrase by fusion to a heterologous DNA binding domain: in vitro activities and incorporation of a fusion protein into viral particles. Virology. 1996;217:178–190. doi: 10.1006/viro.1996.0105. [DOI] [PubMed] [Google Scholar]
  • 36.Khan E, Mack J P, Katz R A, Kulkosky J, Skalka A M. Retroviral integrase domains: DNA binding and the recognition of LTR sequences. Nucleic Acids Res. 1991;19:851–860. doi: 10.1093/nar/19.4.851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kinsey P T, Sandmeyer S B. Adjacent pol II and pol III promoters: transcription of the yeast retrotransposon Ty3 and a target tRNA gene. Nucleic Acids Res. 1991;19:1317–1324. doi: 10.1093/nar/19.6.1317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kirchner J, Connolly C M, Sandmeyer S B. Requirement of RNA polymerase III transcription factors for in vitro position-specific integration of a retroviruslike element. Science. 1995;267:1488–1491. doi: 10.1126/science.7878467. [DOI] [PubMed] [Google Scholar]
  • 39.Kirchner J, Sandmeyer S B. Ty3 integrase mutants defective in reverse transcription or 3′ end processing of extrachromosomal Ty3 DNA. J Virol. 1996;70:4737–4747. doi: 10.1128/jvi.70.7.4737-4747.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kulkosky J, Jones K S, Katz R A, Mack J P G, Skalka A M. Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases. Mol Cell Biol. 1992;12:2331–2338. doi: 10.1128/mcb.12.5.2331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Kunkel T A. Rapid and efficient site-specific mutagenesis without phenotypic selection. Proc Natl Acad Sci USA. 1985;82:488–492. doi: 10.1073/pnas.82.2.488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.LaFemina R L, Schneider C L, Robbins H L, Callahan P L, LeGrow K, Roth E, Schleif W A, Emini E A. Requirement of active human immunodeficiency virus type 1 integrase enzyme for productive infection of human T-lymphoid cells. J Virol. 1992;66:7414–7419. doi: 10.1128/jvi.66.12.7414-7419.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Leavitt A D, Shiue L, Varmus H E. Site-directed mutagenesis of HIV-1 integrase demonstrates differential effects on integrase functions in vitro. J Biol Chem. 1993;268:2113–2119. [PubMed] [Google Scholar]
  • 44.Leveillard T, Kassavetis G A, Geiduschek E P. Saccharomyces cerevisiae transcription factors IIIB and IIIC bend DNA of the tRNAGln gene. J Biol Chem. 1991;266:5162–5168. [PubMed] [Google Scholar]
  • 45.McMaster G K, Carmichael G G. Analysis of single- and double-stranded nucleic acids on polyacrylamide and agarose gels by using glyoxal and acridine orange. Proc Natl Acad Sci USA. 1977;74:4835–4838. doi: 10.1073/pnas.74.11.4835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Miller A D, Verma I M. Two base changes restore infectivity to a noninfectious molecular clone of Moloney murine leukemia virus (pMLV-1) J Virol. 1984;49:214–222. doi: 10.1128/jvi.49.1.214-222.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Miller M D, Bushman F D. HIV integration. Ini1 for integration? Curr Biol. 1995;5:368–370. doi: 10.1016/s0960-9822(95)00074-1. [DOI] [PubMed] [Google Scholar]
  • 48.Muller H-P, Varmus H E. DNA bending creates favored sites for retroviral integration: an explanation for preferred insertion sites in nucleosomes. EMBO J. 1994;13:4704–4714. doi: 10.1002/j.1460-2075.1994.tb06794.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Mumm S R, Grandgenett D P. Defining nucleic acid-binding properties of avian retrovirus integrase by deletion analysis. J Virol. 1991;65:1160–1167. doi: 10.1128/jvi.65.3.1160-1167.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Overhauser J, Fan H. Generation of glucocorticoid-responsive Moloney murine leukemia virus by insertion of regulatory sequences from murine mammary tumor virus into the long terminal repeat. J Virol. 1996;54:133–144. doi: 10.1128/jvi.54.1.133-144.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Price J, Turner D, Cepko C. Lineage analysis in the vertebrate nervous system by retrovirus-mediated gene transfer. Proc Natl Acad Sci USA. 1987;84:156–160. doi: 10.1073/pnas.84.1.156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Pruss D, Bushman F D, Wolffe A P. Human immunodeficiency virus integrase directs integration to sites of severe DNA distortion within the nucleosome core. Proc Natl Acad Sci USA. 1994;91:5913–5917. doi: 10.1073/pnas.91.13.5913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Pruss D, Reeves R, Bushman F D, Wolffe A P. The influence of DNA and nucleosome structure on integration events directed by HIV integrase. J Biol Chem. 1994;269:25031–25041. [PubMed] [Google Scholar]
  • 54.Pryciak P M, Varmus H E. Nucleosomes, DNA-binding proteins, and DNA sequence modulate retroviral integration target site selection. Cell. 1992;69:769–780. doi: 10.1016/0092-8674(92)90289-o. [DOI] [PubMed] [Google Scholar]
  • 55.Roth M J. Mutational analysis of the carboxyl terminus of the Moloney murine leukemia virus integration protein. J Virol. 1991;65:2141–2145. doi: 10.1128/jvi.65.4.2141-2145.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Sandmeyer S B, Hansen L J, Chalker D L. Integration specificity of retrotransposons and retroviruses. Annu Rev Genet. 1990;24:491–518. doi: 10.1146/annurev.ge.24.120190.002423. [DOI] [PubMed] [Google Scholar]
  • 57.Sanger F, Nicklen S, Coulson A R. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci USA. 1977;74:5463–5467. doi: 10.1073/pnas.74.12.5463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Schauer M, Billich A. The N-terminal region of HIV-1 integrase is required for integration activity, but not for DNA binding. Biochem Biophys Res Commun. 1992;185:874–880. doi: 10.1016/0006-291x(92)91708-x. [DOI] [PubMed] [Google Scholar]
  • 59.Shibagaki Y, Chow S A. Central core domain of retroviral interase is responsible for target site selection. J Biol Chem. 1997;272:8361–8369. doi: 10.1074/jbc.272.13.8361. [DOI] [PubMed] [Google Scholar]
  • 60.Shin C-G, Taddeo B, Haseltine W A, Farnet C M. Genetic analysis of the human immunodeficiency virus type 1 integrase protein. J Virol. 1994;68:1633–1642. doi: 10.1128/jvi.68.3.1633-1642.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Shinnick R M, Lerner R A, Sutcliffe J G. Nucleotide sequence of Moloney murine leukemia virus. Nature. 1981;293:543–548. doi: 10.1038/293543a0. [DOI] [PubMed] [Google Scholar]
  • 62.Shoemaker C, Goff S, Gilboa E, Paskind M, Mitra S W, Baltimore D. Structure of a cloned circular Moloney murine leukemia virus DNA molecule containing an inverted segment: implications for retrovirus integration. Proc Natl Acad Sci USA. 1980;77:3932–3936. doi: 10.1073/pnas.77.7.3932. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Taddeo B, Haseltine W A, Farnet C M. Integrase mutants of human immunodeficiency virus type 1 with a specific defect in integration. J Virol. 1994;68:8401–8405. doi: 10.1128/jvi.68.12.8401-8405.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.van Gent D C, Oude Groeneger A A M, Plasterk R H A. Mutational analysis of the integrase protein of human immunodeficiency virus type 2. Proc Natl Acad Sci USA. 1992;89:9598–9602. doi: 10.1073/pnas.89.20.9598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Vink C, Oude Groeneger A A M, Plasterk R H A. Identification of the catalytic and DNA-binding region of the human immunodeficiency virus type 1 integrase protein. Nucleic Acids Res. 1993;21:1419–1425. doi: 10.1093/nar/21.6.1419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Wiskerchen M, Muesing M A. Human immunodeficiency virus type 1 integrase: effects of mutations on viral ability to integrate, direct viral gene expression from unintegrated viral DNA templates, and sustain viral propagation in primary cells. J Virol. 1995;69:376–386. doi: 10.1128/jvi.69.1.376-386.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Woerner A M, Marcus-Sekura C J. Characterization of a DNA binding domain in the C-terminus of HIV-1 integrase by deletion mutagenesis. Nucleic Acids Res. 1993;21:3507–3511. doi: 10.1093/nar/21.15.3507. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES