Summary
Integration site analysis was performed on six dogs with canine leukocyte adhesion deficiency (CLAD) that survived greater than one-year after infusion of autologous CD34+ bone marrow cells transduced with a gammaretroviral vector expressing canine CD18. A total of 387 retroviral insertion sites (RIS) were identified in the peripheral blood leukocytes from the six dogs at 1 year post-infusion. A total of 129 RIS were identified in CD3+ T-lymphocytes and 102 RIS in neutrophils from two dogs at 3 years post-infusion. RIS occurred preferentially within 30 kb of transcription start sites, including 40 near oncogenes and 52 near genes active in hematopoietic stem cells. Integrations clustered around common insertion sites more frequently than random. Despite potential genotoxicity from RIS, to date there has been no progression to oligoclonal hematopoiesis and no evidence that vector integration sites influenced cell survival or proliferation. Continued follow-up in disease-specific animal models such as CLAD will be required to provide an accurate estimate of the genotoxicity using gammaretroviral vectors for hematopoietic stem cell gene therapy.
Keywords: hematopoietic stem cells, transplantation, retrovirus, LAM-PCR, integration site
Although the theoretical risk of retroviral-mediated oncogene activation through insertional mutagenesis had been considered to be low, this risk has been shown to be significant in recent human gene therapy clinical trials using gammaretroviral vectors.1–3 The importance of the individual variables responsible for genotoxicity in hematopoietic stem cell gene therapy trials using gammaretroviral vectors remains uncertain. Human clinical trials, as well as small and large animal studies, using gammaretroviral vectors have implicated the vector, transgene, and retroviral insertion sites (RIS) in the development of serious adverse events (SAEs). In each case, the gammaretroviral vector integration itself has been thought to contribute to the genotoxicity observed, but may not be the only factor responsible for clonal outgrowth.
Successful hematopoietic stem cell gene therapy in humans using gammaretroviral vectors have generally relied upon an in vivo selective growth advantage for the transduced cells provided by a therapeutic transgene, such as the interleukin-2 receptor gamma chain (IL2Rg) in X-linked severe combined immunodeficiency disease (SCID-X1) and adenosine deaminase (ADA) in SCID-ADA.4–6 Initial excitement in the early reports from these trials was subsequently tempered by SAEs which occurred in four of the eight children with therapeutic levels of in vivo transgene expression in the French SCID-X1 trial2 and in one of ten children in the UK SCID-X1 trial.7 The selective advantage conferred by the transgene was thought to not only contribute to the observed clinical success but also to genotoxic clonal outgrowth.6
In the recent non-myeloablative gene therapy trial for chronic granulomatous disease (CGD), in which no selective advantage was provided by the CYBB (gp91phox) transgene, two patients developed clonal dominance of retrovirally marked cells, accompanied by multiple RV insertions in EVI1-MDS1, PRDM16, and SETBP1.3 Thus, factors other than the transgene, such as RIS, appear to play a major role in clonal dominance.
Previous animal studies, including mice, dogs, and non-human primates, also point to genotoxic risk from gammaretroviral vectors. Insertional activation of Evi-1 by gammaretroviral vectors was first described in mice.8 A study of 14 non-human primates had integrations in EVI-1 after gammaretroviral gene transfer, with one animal developing a granulocytic sarcoma.9 A recent study examining RIS in dogs marked with a gammaretroviral vector also revealed a bias of RIS occurring in and near proto-oncogenes.10
In the current study we conducted integration site analysis in six dogs with canine leukocyte adhesion deficiency (CLAD) that had been successfully treated with gammaretroviral vector-mediated ex vivo gene therapy.11 CLAD is analogous to leukocyte adhesion deficiency (LAD) in children, and thus represents a preclinical large-animal model for LAD. The characteristics of the six treated dogs with CLAD, including age at infusion, conditioning regimen, and transduced cell dose infused, are shown (Supplementary Table 1). In this study, the therapeutic CD18 transgene did not confer a strong selective growth advantage to the transduced cells, and a clinically applicable non-myeloablative conditioning regimen was used to facilitate engraftment of the transduced cells. The percentages of CD18+ leukocytes in the peripheral blood in the 6 dogs ranged from 0.7 to 11.2% at 12–36 months follow-up and represent therapeutic levels (Supplementary Table 1).
To evaluate the diversity of clones contributing to long-term hematopoiesis after autologous transplantation of CD34+ BM transduced with a gammaretroviral vector carrying canine CD18, we identified integration sites by linear-amplification-mediated (LAM) PCR from all six treated animals (Figure 1). LAM-PCR, performed at 6 month intervals using DNA from peripheral blood leukocytes demonstrated multiple banding patterns indicative of polyclonal hematopoiesis in all six dogs (Figure 1a). Similar polyclonal hematopoiesis was evident for both myeloid and lymphoid lineages in dogs B1 and C1 examined at 36 and 31 months post-infusion, respectively (Figure 1b).
To confirm the polyclonal nature of the samples, linker-mediated (LM)-PCR was used to search for potentially over-represented clones or RIS within genes and gene regions. When LM-PCR amplicons were sequenced, the integration sites in all six dogs were polyclonally derived (complete list of integration sites in Supplementary Table 2) with no predominant clone(s) observed. We obtained a total of 618 retroviral insertion sites (RIS) from the treated CLAD dogs by sequence analysis of LM-PCR amplicons: 387 RIS from peripheral blood leukocytes from all six dogs at 5–21 months post-infusion (36 to 88 RIS per dog), 129 RIS from sorted CD3+ lymphocytes and 102 RIS from sorted neutrophils from dogs B1 and C1 at 31–36 months post-infusion. We identified 535 unique RIS after adjustment of 83 RIS that overlapped between the cell types and timepoints examined (see Supplementary Table 2). RIS isolated separately from lymphocytes and neutrophils confirmed polyclonal insertion sites in each lineage. In addition, 28 RIS were shared between the sorted lymphocytes and neutrophils RIS, indicating that a common hematopoietic precursor was transduced. The positions corresponding to human sequences listed in the Reference Sequence database (RefSeq) that mapped to the dog genome were examined for RIS that occurred within 30 kilobases (kb) upstream or downstream of the RefSeq genes. Comparison of the RIS to a computer-generated dataset of 1200 random positions in the dog genome revealed a modest propensity of RIS to be located within the RefSeq genes (43.4%), although this was not significant compared to the random dataset (39.3%).
Further examination of the RIS showed a tendency to occur at common insertion sites (CIS).12 CIS may represent either insertional hotspots or loci that confer some selective advantage. A large number of CIS were present among the RIS, when compared to that expected from a random dataset: 39 CIS with two insertions within 30 kb (expected: 3.7 CIS), 13 CIS with three insertions within 50 kb (expected: 0.035 CIS) and 6 CIS with four insertions within 100 kb (expected: 0.001 CIS) (Supplementary Table 3). Several genes were found near CIS, including PTP4A2, ABHD1, TCF23, NID1, NACA, and HSD17B6. Overall, 71 RIS (13.3%) of the 535 unique RIS were found within CIS, similar to the percentage observed by other investigators with gammaretroviral vectors.1,7
RIS located near oncogenes were determined by comparison to a list of human cancer genes compiled from the University of New South Wales (UNSW) Embryology DNA-Tumor Suppressor and Oncogene Database (http://embryology.med.unsw.edu.au/DNA/DNA10.htm) and from the Cancer Gene Consensus database at the Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/genetics/CGP/Census/). Forty RIS (7.5%, expected: 3.2%, P < 0.001) were located in, or within, 30kb of 32 oncogenes (Figure 2). None of the RIS were near the genes implicated in clonal proliferation in the gene therapy trial for CGD (EVI1-MDS1, PRDM16 and SETBP1). Although no RIS in our study were present near the major gene implicated in leukemia in several SCID-X1 children treated by gene therapy, LMO2,2 several of the RIS were near other prominent cancer-related genes such as ABL1, BCL2, ETV6, FLI1, and NF1, including some that were also associated with a CIS (Supplementary Table 3).
To examine the propensity of the retrovirus to integrate near genes active in early hematopoiesis, we compared the genes near RIS to a database of genes expressed in HSCs.13 We found 52 RIS (9.7%; expected: 4.7%; P < 0.001) located near 47 HSC genes, indicating preferential retroviral integration (Figure 2). This increased propensity of RIS near HSC genes was not unexpected given that canine CD34+ cells were used and several HSC genes were likely activated by the G-CSF, SCF, and Flt3L growth factors used.
The non-myeloablative conditioning regimen used in both the CGD trial and our study, in conjunction with the fact that both diseases involve a transgene without a strong selective advantage, prompted us to compare our RIS profile with that of the CGD trial. Genes within 30 kb of RIS in treated CLAD dogs were compared to the genes within 30 kb of RIS found in the two CGD gene therapy patients (Figure 2).3 There was a significant overlap between the RIS profiles in both studies: 51 RIS in our study (9.5%) were close to 61 genes associated with RIS in the CGD trial (expected: 3.9%, P < 0.001). Of note, seven of these 51 (13.7%) genes shared between the two studies were also oncogenes (CIITA, FLI1, LYN, MAF, MLLT6, NF1 and SEPT9).
To determine whether RIS occurred close to genes with particular functions that may influence in vivo selection, we used gene ontology classification to identify gene families among genes within 30kb of the RIS. We observed overrepresentation (P < 0.05) of seven categories when compared to the expected distribution, however, none of these categories involved cell proliferation or differentiation (Figure 3a). Similarly, a comparison of gene families at either early (≤ 21 months) or late (> 21 months) timepoints (Figure 3b), or in either lymphocytes or neutrophils (Figure 3c) in dogs B1 and C1, revealed overrepresented families, but none appeared to be related to cell proliferation or survival.
Despite the increased frequency of retroviral insertions near oncogenes in both the CGD and CLAD studies, to-date the animals in this CLAD study show no evidence of clonal expansion up to 3 years post-treatment, a time period representing approximately 25 percent of the canine lifespan. In comparison, clonal expansion in two CGD patients occurred relatively rapidly, with clonal dominance occurring within 5 months after gene therapy. In the SCID-X1 clinical trials, clonal expansion led to leukemia in five patients at 24 to 68 months post-infusion. In the CLAD studies, it would be expected that leukemia or lymphoproliferation would occur much earlier because dogs have a faster HSC replication rate (assuming greater similarity to cats) than humans.14
The difference in outcomes between the CGD trial and our study may have been influenced by several factors, including vector design (the highly active spleen focus forming virus LTR used in the CGD trial versus the murine stem cell virus LTR used in this study), the source of target cells (G-CSF mobilized peripheral blood CD34+ cells used in the CGD trial versus bone marrow CD34+ cells in this study), and length of time in culture (5 days in the CGD trial versus 3 days in our trial). There was also a higher transduction efficiency with subsequent increased vector MOI levels in the cells infused into the CGD patients. However, the multiple insertions near or within oncogenes that we identified raises concerns that clonal dominance remains a distinct possibility, and that long-term follow-up of these animals will be necessary to accurately estimate the risk from the gammaretroviral vector used in this study.
These results also emphasize the value of disease-specific, pre-clinical animal models in assessing risks from the complex interaction of vector, transgene, insertion site, and disease in gene therapy studies. In particular, the CLAD model enables one to identify potential interactions between the transgene and the integration site, such as the one that occurred in the SCID-X1 trial.2
Supplementary Material
Acknowledgements
We thank Dr. William Telford and Veena Kapoor for assistance with flow cytometry. This research was supported by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research.
Footnotes
Presented in part in abstract form at the 47th and 48th annual meetings of the American Society of Hematology, Atlanta, GA, December 12, 2005 and Orlando, FL, December 9, 2006, and at the 9th and 10th annual meetings of the American Society of Gene Therapy, Baltimore, MD, May 31, 2006 and Seattle, WA, May 30, 2007.
References
- 1.Deichmann A, Hacein-Bey-Abina S, Schmidt M, Garrigue A, Brugman MH, Hu J et al. Vector integration is nonrandom and clustered and influences the fate of lymphopoiesis in SCID-X1 gene therapy. J Clin Invest 2007; 117: 2225–2232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hacein-Bey-Abina S, Von Kalle C, Schmidt M, McCormack MP, Wulffraat N, Leboulch P et al. LMO2-associated clonal T cell proliferation in two patients after gene therapy for SCID-X1. Science 2003; 302: 415–419. [DOI] [PubMed] [Google Scholar]
- 3.Ott MG, Schmidt M, Schwarzwaelder K, Stein S, Siler U, Koehl U et al. Correction of X-linked chronic granulomatous disease by gene therapy, augmented by insertional activation of MDS1-EVI1, PRDM16 or SETBP1. Nat Med 2006; 12: 401–409. [DOI] [PubMed] [Google Scholar]
- 4.Aiuti A, Slavin S, Aker M, Ficara F, Deola S, Mortellaro A et al. Correction of ADA-SCID by stem cell gene therapy combined with nonmyeloablative conditioning. Science 2002; 296: 2410–2413. [DOI] [PubMed] [Google Scholar]
- 5.Gaspar HB, Bjorkegren E, Parsley K, Gilmour KC, King D, Sinclair J et al. Successful reconstitution of immunity in ADA-SCID by stem cell gene therapy following cessation of PEG-ADA and use of mild preconditioning. Mol Ther 2006; 14: 505–513. [DOI] [PubMed] [Google Scholar]
- 6.Hacein-Bey-Abina S, Le Deist F, Carlier F, Bouneaud C, Hue C, De Villartay JP et al. Sustained correction of X-linked severe combined immunodeficiency by ex vivo gene therapy. N Engl J Med 2002; 346: 1185–1193. [DOI] [PubMed] [Google Scholar]
- 7.Schwarzwaelder K, Howe SJ, Schmidt M, Brugman MH, Deichmann A, Glimm H et al. Gammaretrovirus-mediated correction of SCID-X1 is associated with skewed vector integration site distribution in vivo. J Clin Invest 2007; 117: 2241–2249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li Z, Dullmann J, Schiedlmeier B, Schmidt M, von Kalle C, Meyer J et al. Murine leukemia induced by retroviral gene marking. Science 2002; 296: 497. [DOI] [PubMed] [Google Scholar]
- 9.Seggewiss R, Pittaluga S, Adler RL, Guenaga FJ, Ferguson C, Pilz IH et al. Acute myeloid leukemia is associated with retroviral gene transfer to hematopoietic progenitor cells in a rhesus macaque. Blood 2006; 107: 3865–3867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Beard BC, Keyser KA, Trobridge GD, Peterson LJ, Miller DG, Jacobs M et al. Unique integration profiles in a canine model of long-term repopulating cells transduced with gammaretrovirus, lentivirus, or foamy virus. Hum Gene Ther 2007; 18: 423–434. [DOI] [PubMed] [Google Scholar]
- 11.Bauer TR Jr, Hai M, Tuschong LM, Burkholder TH, Gu Y-C, Sokolic RA et al. Correction of the disease phenotype in canine leukocyte adhesion deficiency using ex-vivo hematopoietic stem cell gene therapy. Blood 2006; 108: 1767–1769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Suzuki T, Shen H, Akagi K, Morse HC, Malley JD, Naiman DQ et al. New genes involved in cancer identified by retroviral tagging. Nat Genet 2002; 32: 166–174. [DOI] [PubMed] [Google Scholar]
- 13.Ivanova NB, Dimos JT, Schaniel C, Hackney JA, Moore KA, Lemischka IR. A stem cell molecular signature. Science 2002; 298: 601–604. [DOI] [PubMed] [Google Scholar]
- 14.Shepherd BE, Guttorp P, Lansdorp PM, Abkowitz JL. Estimating human hematopoietic stem cell kinetics using granulocyte telomere lengths. Exp Hematol 2004; 32: 1040–1050. [DOI] [PubMed] [Google Scholar]
- 15.Wu X, Li Y, Crise B, Burgess SM. Transcription start regions in the human genome are favored targets for MLV integration. Science 2003; 300: 1749–1751. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.