Abstract
The DNA sequence motifs which direct adeno-associated virus type 2 site-specific integration are being investigated using a shuttle vector, propagated as a stable episome in cultured cell lines, as the target for integration. Previously, we reported that the minimum episomal targeting elements comprise a 16-bp binding motif (Rep binding site [RBS]) for a viral regulatory protein (Rep) separated by a short DNA spacer from a sequence (terminal resolution site [TRS]) that can serve as a substrate for Rep-mediated nicking activity (R. M. Linden, P. Ward, C. Giraud, E. Winocour, and K. I. Berns, Proc. Natl. Acad. Sci. USA 93:11288–11294, 1996; R. M. Linden, E. Winocour, and K. I. Berns, Proc. Natl. Acad. Sci. USA 93:7966–7972, 1996). We now report that episomal integration depends upon both the sequence and the position of the spacer DNA separating the RBS and TRS motifs. The spacer thus constitutes a third element required for site-specific episomal integration.
Among the animal viruses, the human parvovirus adeno-associated virus type 2 (AAV2) is unique in its ability to establish latent infection in cell culture by integrating at a specific locus on the q arm of chromosome 19 (the reported specificity ranges from 70 to 100%) (14–16, 22). The AAV2 life cycle is biphasic. When the cell is coinfected with a helper virus, usually an adenovirus or a herpesvirus, AAV2 undergoes a highly efficient cycle of productive, lytic infection. In addition, a low level of helper-independent AAV2 replication can occur in some cells exposed to genotoxic stress (2, 28–30). When no helper virus is present, and the cells are not exposed to genotoxic chemicals or irradiation, AAV2 takes advantage of site-specific chromosomal integration to establish latency. The integrated state persists stably over many cell generations until viral rescue and replication are triggered by an infecting helper virus or other cellular stress conditions (2).
The 4.7-kb linear, single-stranded DNA genome of AAV2 contains two major open reading frames bracketed by 145-nucleotide inverted terminal repeats (ITRs). Due to the presence of palindromic sequences, the ITRs fold into hydrogen-bonded T-shaped hairpinned structures which serve as self-priming origins of replication. The ITRs contain the cis-acting replication elements. The trans-acting replication proteins (the Rep proteins) are encoded in the left-hand open reading frame, which gives rise to four overlapping polypeptides derived from two promoters by differential splicing (2). The larger Rep 68/78 proteins control major phases of the viral life cycle (2). They resolve an early replication intermediate by binding to the hairpinned stem of the ITR (at the Rep binding site [RBS]) and by introducing a strand- and site-specific nick (at the terminal resolution site [TRS]) (11, 12). The resolution of the hairpinned structure is required for viral DNA synthesis to proceed (21). As discussed below, the Rep 68/78 proteins are also intimately involved in site-specific chromosomal integration (1, 23, 24).
The site of AAV2 integration in chromosome 19 (the preintegration locus is known as AAVS1) has been cloned and sequenced (14). To identify the chromosomal DNA sequences which direct AAV2 to its preferential integration site, AAVS1 DNA has been subcloned in the Epstein-Barr virus-based shuttle vector p220.2 (9). This vector propagates as a stable extrachromosomal episome, at a copy number of 50 to 100, in human 293 cells expressing the Epstein-Barr virus EBNA1 protein (19, 31). p220.2 contains a hygromycin resistance gene for selection of cells propagating the episome and an Escherichia coli replicon and ampicillin resistance gene for rescue in bacteria. After the episome-propagating cell line is established, it is infected with AAV2 and integration into the episome carrying AAVS1 DNA is assessed by rescue in E. coli. Hybridization is used to identify ampicillin-resistant colonies containing plasmids with inserts of AAV2 DNA (9). Sequencing of the plasmid inserts has defined junctions with AAVS1 DNA and some organizational features reminiscent of those identified when integration occurred at the chromosomal level (10).
Using the above-described assay for site-directed integration, the minimum chromosome 19 targeting cassette has been identified as a 16-bp canonical RBS motif, similar to that in the viral ITR, separated by a short spacer DNA from a 6-bp sequence that resembles the viral ITR TRS sequence cleaved by Rep 68/78 (18) (Table 1). Data from cell-free reactions have indicated that Rep 68/78 can form a bound complex comprising both the AAVS1 and viral RBS and TRS sequences (26) and that Rep 68/78 initiates DNA synthesis on a plasmid carrying AAVS1 DNA, suggesting that the bound Rep proteins cleave the chromosomal TRS sequence (25). Recent data from a cell-free integration assay have also highlighted the essential roles of the chromosomal and viral RBS and TRS motifs in the formation of AAV2/AAVS1 recombinant junctions (8).
TABLE 1.
The present model for site-specific integration (6, 17, 18) proposes that a multimeric form of Rep 68/78, by binding to the RBS motifs on the chromosomal and viral DNAs, positions the incoming infecting genome at the AAVS1 site. Rep-mediated nicking at the adjacent chromosomal TRS sequence, possibly aided by cellular accessory proteins of the HMG1 family (7, 8), mobilizes a polymerase-Rep complex that initiates displacement DNA synthesis. As DNA synthesis at the AAVS1 site proceeds, a series of DNA template switches (copy choice) links the viral DNA to the chromosomal DNA. Virus-cell junctions in chromosome 19 are clustered at various distances downstream of the targeting RBS and TRS motifs, and integration is accompanied by rearrangements of the AAVS1 site (13, 16, 20, 22). These features of chromosomal integration can be accounted for in the above-described model if we assume that the order and timing of the template switches (from cellular DNA to viral DNA and back) differ in each integration event.
Although related, the AAVS1 RBS and TRS sequence motifs are not identical to those in the viral ITR (Table 1). Furthermore, the intervening sequence between the AAVS1 RBS and TRS motifs (henceforth called the spacer) is different both in sequence and in size from that of the viral origin. In this study, we have used the functional episome integration test to access the importance of the spacer. We show that integration depends upon both spacer sequence and position.
Oligonucleotides containing the same RBS and TRS motifs derived from the chromosome 19 AAVS1 site, but differing in the intervening spacer sequences, were synthesized and inserted into the p220.2 shuttle vector. Cell lines stably propagating the shuttle vector as an episome were infected with AAV, and integration into the episome was assessed by rescue in E. coli. The proportion of colonies hybridizing with an AAV2 DNA probe provided a measure of the episomal integration frequency (9). Each episomal target for AAV2 integration is designated in Tables 2 and 3 by the name of the cell line carrying that episome.
TABLE 2.
Episome carried by cell line | Sequence between TRS and RBS in p220.2 inserts | AAV-positive colonies (integration)
|
|
---|---|---|---|
No./total | % | ||
83 | TRS-CCA CTC CCTCTCT -RBS | 127/36,558 | 0.35 |
102 | TRS- GG CTC GGC -RBS | 119/36,300 | 0.33 |
80 | TRS- G CTC GGC -RBS | 209/33,630 | 0.62 |
81 | TRS- G CTC GC -RBS | 202/46,000 | 0.44 |
82 | TRS- CA TTA GTC -RBS | 17/56,950 | 0.03 |
86 | TRS- GG GGG GGC -RBS | 12/42,000 | 0.03 |
P-54/55 | TRS- GG GGG GGC -RBS | 17/34,728 | 0.05 |
87 | TRS- CA CTC GTC -RBS | 32/57,000 | 0.06 |
P-58/59 | TRS- CA CTC GTC -RBS | 16/24,354 | 0.07 |
Cell lines stably propagating p220.2 episomes were established by hygromycin selection (9). The inserts in p220.2 contain the spacer DNA sequence shown (5′ to 3′) linked on one side to the AAVS1 TRS (GGTTGG) and on the other side to the AAVS1 RBS (GCTC GCTC GCTC GCTG) (Table 1). All inserts contain the sequence CTAGAGGC 5′ to the TRS. Episome 83 contains the spacer sequence of the viral ITR. Episome 102 contains the spacer sequence of AAVSI. Forty-eight hours after the cell lines were infected with AAV (multiplicity of 50 infectious units/cell), viral integration into the episomes was assessed by rescue in E. coli and colony hybridization with an AAV DNA probe. The integration data are derived from two infections of the same cell line at different passage levels. In independent infections of the same cell line but at different passage levels, the integration frequency varied by a maximum factor of two. The frequencies shown were calculated from the total number of colonies screened in both trials. Independently generated cell lines carrying the same episomes are 86 and P-54/55, 87, and P-58/59. The nature of the episome in each cell line was confirmed by rescue of the shuttle vector in E. coli and sequencing of the inserts.
TABLE 3.
Episome carried by cell line | Sequence between TRS and RBS in p220.2 inserts | AAV-positive colonies (integration)
|
|
---|---|---|---|
No./total | % | ||
102 | TRS- GG CTC GGC-RBS | 56/40,000 | 0.140 |
201 | TRS- -RBS | 4/90,500 | 0.004 |
202 | TRS-50n+GG CTC GGC-RBS | 2/74,700 | 0.003 |
203 | TRS-GG CTC GGC+50n-RBS | 1/69,240 | 0.001 |
In episome 102, the spacer sequence is that of AAVS1; in episome 201, the spacer is deleted; in episome 202, the spacer sequence is separated from the TRS by 50 random nucleotides (50n); and in episome 203, the spacer sequence is separated from the RBS by 50 random nucleotides. Remaining details are as described in Table 2, footnote a.
The insert in episome 83 (Table 2) contains the 13-nucleotide sequence which bridges the RBS and TRS elements in the viral ITRs. In episome 102, the 8-nucleotide spacer is that present in the chromosome 19 AAVS1 locus. Episomes 83 and 102 target AAV integration to the same extent (0.35 and 0.33% AAV-positive colonies, respectively). Episomes 80 and 81 also target AAV2 integration to similar extents, indicating that a reduction in spacer length from 13 to 6 nucleotides does not radically affect the frequency of integration.
Since the central CTC triplet was common to the spacer sequence in episomes 80, 81, 83, and 102, we next turned our attention to this potential motif. Replacement of CTC by the triplet TTA (episome 82) or GGG (episomes 86 and P-54/55) reduced the frequency of integration 10-fold. Retention of the CTC triplet but alteration of the adjacent 5′ and 3′ dinucleotides resulted in a fivefold decrease in targeting activity (compare the activity of episome 102 to that of 87 or P-58/59). From these data, it appears that the central CTC triplet in the spacer, although necessary, is not by itself sufficient for full targeting activity and may thus be part of a larger motif. One candidate would be the GCTC motif, which is present in the spacers of the efficiently targeting episomes 102, 80, and 81 and which is also the repeating motif in the RBS site (5). However, the GCTC motif is not present in the spacer of episome 83, whose targeting activity is comparable to those of episomes 80, 81, and 102 and whose spacer sequence is that of the AAV2 ITR. Conceivably, it is the two CTC motifs embedded in a pyrimidine-rich tract in the episome 83 spacer which is responsible for full targeting activity. A pyrimidine-rich sequence between the TRS and RBS elements is a common feature of AAV serotypes, including AAV5, whose TRS cleavage specificity differs from that of AAV2 (4).
The importance of the GCTC motif in the AAVS1 spacer is also highlighted by the results with episomes 201, 202, and 203 (Table 3). Deletion of the entire GG CTC GGC spacer sequence reduces episomal integration 35-fold (episome 201). Functionality also depends upon the distance of the GG CTC GGC sequence from both the TRS and RBS sites: insertion of 50 random nucleotides at either end essentially abolishes integration (episomes 202 and 203). (The absolute distance between TRS and RBS sites may also be a factor.)
The above-described experiments show that the spacer sequence separating the TRS from the RBS is critical for episomal integration. Either a GCTC motif or two CTC triplets embedded in a pyrimidine-rich sequence were found to provide an optimum integration frequency. It is noteworthy that the GCTC motif is also the main repeat of the RBS consensus. Positioning also plays a role; increasing the distance between the AAVS1 spacer and the TRS or RBS motifs dramatically reduces integration. The AAVS1 spacer may thus be viewed as an extension of the canonical RBS consensus in that it adds an extra GCTC motif. The minimum targeting sequence for episomal integration can now be reduced to 30 bp: the 16-bp RBS separated by GCTCGC from the 6-bp TRS (Table 2, episome 81).
The step in integration blocked by changes in the spacer is unknown. Four of the spacers used were checked for the ability to bind Rep and to be nicked. These were spacers in episomes 102, 81, 83, and P-54/55 (Table 2). AAV was able to integrate into plasmids containing the first three but not into the last. All four of the spacers bound Rep, and all four were nicked to an extent which was within a factor of two of that observed for the AAVS1 sequence (spacer 102). The actual ratios were 0.64 for episome 81, 0.45 for episome P-54/55, and 1.93 for episome 83, which contained the spacer in the AAV ITR. Thus, Rep nicking showed greater tolerance with respect to spacer sequence than did integration in the model system.
The human genome contains numerous binding sites for the AAV2 Rep proteins, as judged by data bank inspection and by biochemical tests (27). However, when an appropriately positioned AAV2 TRS motif is included in the search parameters, the available data bank reveals only a single site for targeted AAV2 integration, consistent with biological experiments at the chromosomal level (15, 16). The episomal integration assay can provide additional parameters to search for other potential integration sites that might be exploited by different AAV serotypes in different hosts. The present data indicate that the spacer sequence and position should be included in these parameters.
Acknowledgments
We thank Nenita Cortez and Bernard Danovitch for excellent technical assistance.
This work was supported by Public Health Service grants AI 122251 (“Molecular Biology of Adeno-Associated Virus”) and GM50032 (“Regulation of AAV DNA Replication”).
REFERENCES
- 1.Balague C, Kalla M, Zhang W-W. Adeno-associated virus Rep78 protein and terminal repeats enhance integration of DNA sequences into the cellular genome. J Virol. 1997;71:3299–3306. doi: 10.1128/jvi.71.4.3299-3306.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Berns K I. Parvoviridae: the viruses and their replication. In: Fields B N, editor. Virology. 3rd ed. New York, N.Y: Lippincot-Raven; 1996. pp. 2173–2198. [Google Scholar]
- 3.Brister J R, Muzyczka N. Rep-mediated nicking of the adeno-associated virus origin requires two biochemical activities, DNA helicase activity and transesterification. J Virol. 1999;73:9325–9336. doi: 10.1128/jvi.73.11.9325-9336.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chiorini J A, Afione S, Kotin R M. Adeno-associated virus (AAV) type 5 Rep protein cleaves a unique terminal resolution site compared with other AAV serotypes. J Virol. 1999;73:4293–4298. doi: 10.1128/jvi.73.5.4293-4298.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chiorini J A, Yang L, Safer B, Kotin R M. Determination of adeno-associated virus Rep68 and Rep78 binding sites by random sequence oligonucleotide selection. J Virol. 1995;68:7334–7338. doi: 10.1128/jvi.69.11.7334-7338.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chiorini J A, Wiener S M, Yang L, Smith R H, Safer B, Kilcoin N P, Liu Y, Urcelay E, Kotin R M. The role of AAV Rep proteins in gene expression and targeted integration. Curr Top Microbiol Immunol. 1996;218:25–33. doi: 10.1007/978-3-642-80207-2_2. [DOI] [PubMed] [Google Scholar]
- 7.Costello E, Saudan P, Winocour E, Pizer L, Beard P. High mobility group protein 1 binds to the adeno-associated virus replication protein (Rep) and promotes Rep-mediated site-specific cleavage of DNA, ATPase activity and transcriptional repression. EMBO J. 1997;16:5843–5954. doi: 10.1093/emboj/16.19.5943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dyall J, Szabo P, Berns K I. Adeno-associated virus (AAV) site-specific integration: formation of AAV-AAVS1 junctions in an in vitro system. Proc Natl Acad Sci USA. 1999;92:12849–12854. doi: 10.1073/pnas.96.22.12849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Giraud C, Winocour E, Berns K I. Site specific integration by adeno-associated virus is detected by a cellular sequence. Proc Natl Acad Sci USA. 1994;91:10039–10043. doi: 10.1073/pnas.91.21.10039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Giraud C, Winocour E, Berns K I. Recombinant junctions formed by site-specific integration of adeno-associated virus into an episome. J Virol. 1995;69:6917–6924. doi: 10.1128/jvi.69.11.6917-6924.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Im D-S, Muzyczka N. The AAV origin binding protein Rep68 is an ATP dependent site-specific endonuclease with DNA helicase activity. Cell. 1990;61:447–457. doi: 10.1016/0092-8674(90)90526-k. [DOI] [PubMed] [Google Scholar]
- 12.Im D-S, Muzyczka N. Partial purification of adeno-associated virus Rep78, Rep68, Rep52, and Rep40 and their biochemical characterization. J Virol. 1992;66:1119–1128. doi: 10.1128/jvi.66.2.1119-1128.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kotin R M, Berns K I. Organization of adeno-associated virus DNA in latently infected Detroit 6 cells. Virology. 1989;170:460–467. doi: 10.1016/0042-6822(89)90437-6. [DOI] [PubMed] [Google Scholar]
- 14.Kotin R M, Linden R M, Berns K I. Characterization of a preferred site on chromosome 19q for integration of adeno-associated virus DNA by non-homologous recombination. EMBO J. 1992;11:5071–5078. doi: 10.1002/j.1460-2075.1992.tb05614.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kotin R M, Menninger J C, Ward D C, Berns K I. Mapping and direct visualization of a region-specific viral DNA integration site on chromosome 19q13-qter. Genomics. 1991;10:831–834. doi: 10.1016/0888-7543(91)90470-y. [DOI] [PubMed] [Google Scholar]
- 16.Kotin R M, Siniscalco M, Samulski R J, Zhu X D, Hunter L, Laughlin C A, McLaughlin S, Muzyczka N, Rocchi M, Berns K I. Site-specific integration by adeno-associated virus. Proc Natl Acad Sci USA. 1990;87:2211–2215. doi: 10.1073/pnas.87.6.2211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Linden R M, Ward P, Giraud C, Winocour E, Berns K I. Site-specific integration by adeno-associated virus. Proc Natl Acad Sci USA. 1996;93:11288–11294. doi: 10.1073/pnas.93.21.11288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Linden R M, Winocour E, Berns K I. The recombination signals for adeno-associated virus site-specific integration. Proc Natl Acad Sci USA. 1996;93:7966–7972. doi: 10.1073/pnas.93.15.7966. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Margolskee R F. Epstein-Barr virus based expression vectors. Curr Top Microbiol Immunol. 1992;158:67–95. doi: 10.1007/978-3-642-75608-5_4. [DOI] [PubMed] [Google Scholar]
- 20.McLaughlin S K, Collis P, Hermonat P L, Muzyczka N. Adeno-associated virus general transduction vectors: analysis of proviral structures. J Virol. 1988;62:1963–1973. doi: 10.1128/jvi.62.6.1963-1973.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Muzyczka N. In vitro replication of adeno-associated virus DNA. Semin Virol. 1991;2:281–290. doi: 10.1128/jvi.68.2.1128-1138.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Samulski R J, Zhu X, Xiao X, Brook J D, Housman D E, Epstein N, Hunter L A. Targeted integration of adeno-associated virus (AAV) into human chromosome 19. EMBO J. 1991;10:3941–3950. doi: 10.1002/j.1460-2075.1991.tb04964.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Shelling A N, Smith M G. Targeted integration of transfected and infected adeno-associated virus vectors containing the neomycin resistance gene. Gene Ther. 1994;1:165–169. [PubMed] [Google Scholar]
- 24.Surosky R T, Urabe M, Godwin S K, McQuiston S A, Kurtzman G J, Ozawa K, Natsoulis G. Adeno-associated virus Rep proteins target DNA sequences to a unique locus in the human genome. J Virol. 1997;71:7951–7959. doi: 10.1128/jvi.71.10.7951-7959.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Urcelay E, Ward P, Wiener S M, Safer B, Kotin R M. Asymmetric replication in vitro from a human sequence element is dependent on adeno-associated virus Rep protein. J Virol. 1995;69:2038–2046. doi: 10.1128/jvi.69.4.2038-2046.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Weitzman M D, Kyostio S R M, Kotin R M, Owens R A. Adeno-associated virus (AAV) Rep proteins mediate complex formation between AAV DNA and its integration site in human DNA. Proc Natl Acad Sci USA. 1994;91:5808–5812. doi: 10.1073/pnas.91.13.5808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wonderling R S, Owens R A. Binding sites for adeno-associated virus Rep proteins within the human genome. J Virol. 1997;71:2528–2534. doi: 10.1128/jvi.71.3.2528-2534.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yakobson B, Koch T, Winocour E. Replication of adeno-associated virus in synchronized cells without the addition of a helper virus. J Virol. 1987;61:972–981. doi: 10.1128/jvi.61.4.972-981.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Yakobson B, Hrynko T A, Peak M J, Winocour E. Replication of adeno-associated virus in cells irradiated with UV light at 254 nm. J Virol. 1989;63:1023–1030. doi: 10.1128/jvi.63.3.1023-1030.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Yalkinoglu A O, Heilbronn R, Burkle A, Schlehhofer J R, zur Hausen H. DNA amplification of adeno-associated virus as a response to cellular genotoxic stress. Cancer Res. 1988;48:3123–3129. [PubMed] [Google Scholar]
- 31.Yates J L, Warren N, Sugden B. Stable replication of plasmids derived from Epstein-Barr virus in various mammalian cells. Nature (London) 1985;313:812–815. doi: 10.1038/313812a0. [DOI] [PubMed] [Google Scholar]