Abstract
The inability to insert large DNA constructs into the genome efficiently and precisely is a key challenge in genomic engineering. Random transgenesis, which is widely used, lacks precision, and comes with a slew of drawbacks. Lentiviral and adeno-associated viral methods are plagued by, respectively, DNA toxicity and a payload capacity of less than 5 kb. Homology-directed repair (HDR) techniques based on CRISPR-Cas9 can be effective, but only in the 1–5 kb range. In addition, long homology arms—DNA sequences that permit construct insertion—of lengths ranging from 0.5 to 5 kb are required by currently known HDR-based techniques. A potential new method that uses Cas9-guided transposases to insert DNA structures up to 10 kb in length works well in bacteria, but only in bacteria. Surmounting these roadblocks, a new toolkit has recently been developed that combines RNA-guided Cas9 and the site-specific integrase Bxb1 to integrate DNA constructs ranging in length from 5 to 43 kb into mouse zygotes with germline transmission and into human cells. This ground-breaking toolkit will give researchers a valuable resource for developing novel, urgently needed mouse and human induced pluripotent stem cell (hiPSC) models of cancer and other genetic diseases, as well as therapeutic gene integration and biopharmaceutical applications, such as the development of stable cell lines to produce therapeutic protein products.
Keywords: transgenesis, Cas9, Bxb1 integrase, recombinase, prime editing
Significance of Large DNA Transgenesis
Researchers have discovered numerous links or correlations between putative structural variations (SVs) and gene expression and between SVs and phenotype, via long-read sequencing analysis of clinical samples and via large-scale collaboration programs such as ENCODE (The ENCODE Project Consortium, 2012). Some of the most common causes of cancer, including disruption of tumor suppressor genes (TSG) and activation of oncogenes (ONC), originate at least in part from large tandem duplications (TD) (Menghi et al., 2016; Willis et al., 2017; Menghi et al., 2018), as well as from kilobase-sized clusters of transcriptional enhancers, known as super-enhancers (SE) (Hnisz et al., 2013; Hnisz et al., 2015; Dave et al., 2017; Wang et al., 2019; Tang et al., 2020). However, researchers’ ability to study the role of large SVs in disease via model generation or synthetic biology is hindered by the inability to efficiently integrate large DNA constructs (greater than 10 kb) in mouse zygotes (Wang et al., 2013; Yang et al., 2013; Wang et al., 2015; Liu et al., 2019), mouse embryonic stem (mES) cells (Wang et al., 2015; Liu et al., 2019), or human cells (Chu et al., 2015; Jasin and Haber, 2016; Yang et al., 2020). Studies hindered by this barrier include: 1) generation of particular, potentially powerful mouse models [e.g., replacing a 100-kb region with its human equivalent requires at least three to four years (Wallace et al., 2007)], 2) identification of novel gene function when the gene is large, and 3) humanization of large regions of the mouse genome to study infectious diseases such as SARS-CoV-2, Ebola, and Hepatitis C virus. The current lack of an approach for effective modification of genome-wide DNA structures highlights the need for a method that inserts large DNA constructs into the genome.
To meet the urgent requirement for approaches enabling large-DNA-construct integration, three distinct groups have collaborated to create a versatile and creative targeted transgenesis toolkit based on the Cas9-Bxb1 integrase system for accurate insertion of kb-sized DNA into the mammalian genome. Bxb1 integrase catalyzes effective and precise transgenesis by using DNA attachment sites (attP in the host genome and attB in the donor DNA, or attB in the host genome and attP in the donor DNA) as substrates (see Box 1). While alternative integrases such as phiC31, A118, SPBc, W, phiBT1, and phi370.1 are available, Bxb1 is the integrase of choice based on evidence that Bxb1, unlike these other integrases, has high efficiency and lacks pseudo sites in the mammalian genome. The new Cas9-Bxb1 toolkit will be useful for making important mouse models with large DNA inserts, such as potential new models for the Somatic Cell Genome Editing Consortium and humanized models of infectious illnesses, and will also be valuable for cell engineering and therapeutic gene therapy.
BOX 1. Site-specific serine recombinases.
Site-specific recombinases are enzymes that cut and rejoin DNA strands at precise genomic locations. Large serine and tyrosine integrases are two classes of bacteriophage recombinases with a nucleophilic active site amino acid residue (serine and tyrosine, respectively) that cleaves DNA by attacking phosphodiester linkages (Grindley et al., 2006). Although the functions of tyrosine and serine integrases are similar, their mechanisms of action are very different. Tyrosine recombinases (e.g., Cre and Flp) mediate bidirectional DNA integration into the bacterial genome. In contrast, serine recombinases mediate DNA integration into the bacterial genome by catalyzing unidirectional recombination between two attachment sites, attP (Phage) and attB (Bacterium), resulting in the production of recombinant attachment left (attL) and right (attR) sites (Stark, 2017). Furthermore, compared to tyrosine recombinase attachment sites, the attachment sites of serine recombinases are quite small: 50 bp for attP and 40 bp for attB. Each of these att sites (attP and attB) binds an integrase dimer, and recombination takes place in a complex containing an integrase tetramer that holds the two att sites together, yielding attL and attR. Notably, in conjunction with a serine recombinase, the phage-encoded protein recombination directionality factor (RDF) is required for initiating the reverse reaction, i.e., recombination between recombinant attL and attR sites in the opposite direction. Because serine recombinases facilitate precise DNA integration, excision, inversion, and translocation, they have been used in a range of applications, including molecular genetics, gene therapy, biotechnology, and synthetic biology (Yan et al., 2013; Merrick et al., 2018).
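The half-site exchange described in Box 1 can be illustrated with a minimal sketch. It assumes each att site can be treated as a left arm, a central dinucleotide, and a right arm, and that attL carries the left arm of attB and the right arm of attP (a common naming convention, stated here as an assumption). The class name, function name, and placeholder arm sequences are hypothetical and are not real Bxb1 sequences.

```python
from typing import NamedTuple, Tuple


class AttSite(NamedTuple):
    """An attachment site split around its central dinucleotide."""
    left: str
    core: str   # central dinucleotide where the strands are exchanged
    right: str

    def sequence(self) -> str:
        return self.left + self.core + self.right


def integrate(att_p: AttSite, att_b: AttSite) -> Tuple[AttSite, AttSite]:
    """Model attP x attB recombination as a swap of half-sites at the core.

    Returns (attL, attR). The model requires matching central dinucleotides.
    """
    if att_p.core != att_b.core:
        raise ValueError("central dinucleotides differ; no recombination in this model")
    att_l = AttSite(att_b.left, att_b.core, att_p.right)
    att_r = AttSite(att_p.left, att_p.core, att_b.right)
    return att_l, att_r


# Placeholder arms ("p" and "b" are not nucleotides) sized to give the
# 50-bp attP and 40-bp attB lengths quoted in the text.
ATT_P = AttSite("p" * 24, "GT", "p" * 24)
ATT_B = AttSite("b" * 19, "GT", "b" * 19)
```

In this toy model the products are hybrid sites that match neither attP nor attB, which is one way to picture why the reverse reaction needs an additional factor such as the RDF.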
Available Tools: Strengths and Weaknesses
Several approaches have been proposed to address some of the current genetic-engineering gaps, including random transgenesis, CRISPR/Cas9-mediated HDR, lentiviral and adeno-associated virus (AAV) vectors, and DNA transposases (e.g., Sleeping Beauty and piggyBac). However, none of these approaches addresses all of the gaps.
Random integration of transgenes is quick but ineffective (Haruyama et al., 2009; Laboulaye et al., 2018; Goodwin et al., 2019). It is also plagued by inherent variability as a result of complicated integrations at multiple locations, which can lead to segregation, position effects, and, in some cases, disruption of endogenous coding sequences, all of which complicate and hinder strain characterization. Furthermore, undocumented cassettes or even contaminating DNA fragments might be found at random insertion sites. Together, these haphazard insertions and modifications can result in phenotypic changes unrelated to the function of the transgene.
While CRISPR/Cas9-mediated HDR of kb-sized DNA has been used to create transgenic mice and cell lines, this approach is unreliable. It has a variable and low success rate, especially when large (>10 kb) donor constructs are required. Furthermore, while recent CRISPR/Cas9-based approaches, such as Easi (Efficient additions with ssDNA inserts)-CRISPR (using long single-stranded DNA donors) (Quadros et al., 2017), SPRINT-CRISPR (S-phase pronuclear injection of large DNA) (Abe et al., 2020), and 2C-HR (two-cell homologous recombination)-CRISPR (knock-in of large transgenes in two-cell stage embryos) (Gu et al., 2018), are highly efficient in the range of 1–6 kb, the efficiency of precise introduction of fragments greater than ∼7 kb in length is still not robust. The payload capacity of lentiviral-based techniques is superior (18 kb) (Chaudhari et al., 2020), but these techniques are linked to serious side effects such as genotoxicity (Montini et al., 2006) and immunogenicity (Nayak and Herzog, 2010). Although AAVs have few side effects, their payload capacity is less than 5 kb (Lai et al., 2010).
DNA transposases coupled with catalytically dead Cas9 for RNA-guided site-specificity have recently been designed to perform targeted transgenesis of up to 10 kb of DNA in bacteria (Munoz-Lopez and Garcia-Perez, 2010; Peters et al., 2017; Strecker et al., 2019; Vo et al., 2020). However, to date this system can be used only with prokaryotes. Furthermore, the significant off-target activity of transposases necessitates extensive engineering to accomplish efficient transgenesis. In addition, Sleeping Beauty and piggyBac, two of the most promising DNA transposons, are also linked to severe DNA toxicity (Wang et al., 2008; Tipanee et al., 2017).
In addition to the limitations discussed above, current approaches lack several other capabilities critical for efficient, versatile genetic engineering, including the ability to modify the genome sequentially to allow integration of additional kb-sized DNA constructs at a pre-existing locus (Brosh et al., 2021); control over copy numbers; control over integration orientation; DNA insertions without kilobase-long homology arms; and DNA insertions without traces of extraneous prokaryotic vector DNA. In sum, a key obstacle in genetic engineering remains the lack of sophisticated and economical techniques for precise integration of kb-sized DNA in human cells and animals. To overcome these limitations, a new toolkit, an RNA-guided Cas9-Bxb1 toolbox (Anzalone et al., 2021; Grandela et al., 2021; Ioannidi et al., 2021; Low et al., 2021), has recently been developed that expands the breadth of precision transgenesis, genetic engineering, cell-based treatments, and synthetic biology.
CRISPR/Cas9-Endonuclease With the Site-Specific Integrase Bxb1 in Mouse Zygotes
We recently developed an approach using the Bxb1 integrase to induce precise and efficient integration of large DNA constructs into the mouse genome (Low et al., 2021). Bxb1 uses attP and attB attachment sites to accomplish this DNA insertion. In a key innovative component of developing our approach, we pre-positioned an attP attachment site in the ROSA26 safe harbor locus of several mouse strains, using CRISPR/Cas9-mediated HDR. We were then able to efficiently integrate large DNA constructs (∼5–∼43 kb) and generate single-copy transgenic mice. Below we describe two different versions of this recently developed approach for integrating large DNA constructs into a mouse safe harbor locus—version 1, or RMKI (recombinase mediated knock-in) (Figure 1A), for DNA constructs of up to 10 kb; and version 2, or recombinase-mediated cassette exchange (RMCE) (Figure 1B), for DNA constructs of up to ∼43 kb.
One-Step Approach (One-Cell Stage, Electroporation and Two-Cell Stage, Microinjection) for rapid targeted transgenesis of large DNA constructs
Conventional transgenesis approaches require 18–24 months for generation of transgenic mice using the Bxb1 integrase; the first step involves insertion of an attP site in the mouse genome using CRISPR/Cas9-mediated HDR. After characterizing founder animals carrying that attP site, the mice must be backcrossed and then intercrossed to generate homozygous attP mice (9–12 months). Embryos from the homozygous attP mice are then microinjected with a large donor DNA construct containing the transgene and a cognate attB site, followed by backcrosses and intercrosses to generate homozygous transgenic mice (another 9–12 months). Our new approach, summarized below, eliminates these protracted steps by accomplishing both insertion of the attP site and microinjection of the DNA construct in the same embryo, thereby eliminating the generation of attP homozygous mice that is required in existing approaches and creating transgenic mice in one generation (9–12 months) instead of 18–24 months.
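The timeline comparison above reduces to simple arithmetic, sketched below. The sketch assumes that each round of founder characterization, backcrossing, and intercrossing takes the 9–12 months quoted in the text; the function and constant names are hypothetical.

```python
from typing import Tuple

# Duration of one breeding round in months (range quoted in the text).
ROUND_MONTHS: Tuple[int, int] = (9, 12)


def generation_time(rounds: int,
                    per_round: Tuple[int, int] = ROUND_MONTHS) -> Tuple[int, int]:
    """Return the (min, max) months needed for a given number of breeding rounds."""
    if rounds < 1:
        raise ValueError("at least one breeding round is required")
    low, high = per_round
    return rounds * low, rounds * high


conventional = generation_time(2)  # attP line first, then the transgenic line
one_step = generation_time(1)      # attP insertion and donor delivery in the same embryo
saved = (conventional[0] - one_step[0], conventional[1] - one_step[1])
```

Under these assumptions the conventional route takes 18–24 months and the one-step route 9–12 months, a saving of one full breeding round.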
In the first step of our new approach, CRISPR/Cas9-mediated HDR in mouse zygotes via electroporation is used to insert, at the one-cell zygote stage, an attP site immediately downstream of the translation initiation codon ATG (Figure 2A). Then, in step 2, using the same embryos but at the two-cell stage, microinjection is used to introduce, in the presence of Bxb1 mRNA, large donor DNA constructs carrying the transgene and the cognate attB site, allowing efficient, precise recombination of the donor DNA into the genomic attP site in a single generation (Figures 2B,C). Notably, electroporation is advantageous for both embryo survival and efficiency of HDR-mediated insertion of short oligos, whereas microinjection at the two-cell stage is essential for delivery of large DNA payloads. This novel one-step approach [one-cell stage, Electroporation (EP); and two-cell stage, Microinjection (MIJ)] not only significantly reduces the generation time but, more importantly, also enables a “plug-and-play” approach, i.e., one-step insertion of a large DNA construct in multiple mouse strains with no pre-placed attachment sites.
We used this innovative approach to successfully generate Cd68-cas9 transgenic mice in one generation on the NSG background with 13% Bxb1-mediated integration efficiency. We sequence-confirmed that the cas9 transgene was correctly integrated into the mouse Cd68 endogenous locus. Homozygous mice are viable with no gross abnormalities.
Targeted Nanopore Long-Read Sequencing for Efficient Validation of Correct Insertion of Transgenic Loci
To confirm correct insertion of transgenes, we have used two approaches: classical genotyping (PCR, Sanger sequencing) from DNA isolated from tail tissues, and Oxford Nanopore Technologies’ Cas9-mediated amplification-free enrichment approach (Gilpatrick et al., 2020), a targeted sequencing approach. The Nanopore approach is relatively low-cost and can be applied to various starting materials, all while enriching regions of interest over native sequences. Importantly, neither of our two validation approaches requires sacrificing animals. Briefly, our workflow for the Nanopore targeted sequencing involves four steps: 1) high molecular weight genomic DNA (gDNA) is extracted from mice carrying the transgene by using tissue from ear notches, 2) the region of interest is targeted by the Cas9-single guide RNA (sgRNA) complex and excised from the gDNA, 3) the resulting fragment is used to construct a Nanopore sequencing library without the need for amplification, and 4) upon sequencing, the region of interest is greatly enriched compared to the background gDNA (Figure 3). We have already validated the Nanopore targeted sequencing approach for transgenic inserts ranging from 5 to 43 kb in length (Low et al., 2021). Moreover, by designing sgRNAs 2 kb upstream and downstream of the integration site, we enabled validation of not only the inserted transgene but also the regions bordering the two ends of the Bxb1 integration site (attP-GT and attP-GA) in the ROSA26 locus.
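As a rough planning aid for the sgRNA placement described above, the following sketch estimates the size of the Cas9-excised fragment and checks whether a single read covers both insert junctions. It assumes cuts exactly 2 kb on either side of the integration site and ignores the short att sequences; the function names and the `min_anchor` threshold are hypothetical.

```python
FLANK_BP = 2_000  # sgRNAs placed 2 kb upstream and downstream (from the text)


def expected_fragment_bp(insert_bp: int, flank_bp: int = FLANK_BP) -> int:
    """Approximate length of the excised fragment: insert plus both flanks."""
    if insert_bp < 0 or flank_bp < 0:
        raise ValueError("lengths must be non-negative")
    return insert_bp + 2 * flank_bp


def spans_both_junctions(read_start: int, read_end: int,
                         insert_start: int, insert_end: int,
                         min_anchor: int = 200) -> bool:
    """True if a read covers the insert plus min_anchor bp of flank on each side.

    Coordinates are positions on the excised fragment; min_anchor is an
    arbitrary illustrative threshold, not a value from the source.
    """
    return (read_start <= insert_start - min_anchor
            and read_end >= insert_end + min_anchor)
```

For the 5-kb and 43-kb inserts mentioned in the text, this estimate gives fragments of about 9 kb and 47 kb, respectively.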
Twin Prime Editing + Bxb1
Twin Prime Editing (TwinPE) + Bxb1 enables targeted integration of large DNA plasmids (∼5.6 kb) at various safe-harbor loci in human cells (Anzalone et al., 2021) (see Box 2). Simultaneous delivery of a twinPE + Bxb1 complex with large donor plasmids enables integration at multiple genomic loci, permitting multiplexing capabilities. Notably, twinPE + Bxb1 mediates precise donor integration at the targeted site without causing inappropriate donor integration into the human genome (Figure 4A). Furthermore, because the modeling of large structural variants, including DNA inversions and rearrangements, involved in cancer and other diseases can be challenging, Anzalone et al. demonstrated that TwinPE + Bxb1 can facilitate inversions in human cells. The authors examined a ∼40-kb inversion between IDS and its pseudogene IDS2 to test the twinPE-Bxb1 inversion method on a therapeutically relevant locus. Inversions between these locations have been found in 13% of Hunter syndrome patients, and identification of the breakpoints in pathogenic alleles has shown that the inversion frequently occurs inside a recombination hotspot seen in both IDS and IDS2. By flanking the recombination hotspots with attB and attP sequences, the authors demonstrated unidirectional inversion by Bxb1 resulting in attL and attR sites, suggesting successful repair of the pathogenic allele.
BOX 2. Prime Editing (PE) and Twin Prime Editing (TwinPE).
Most existing technologies, including those using the CRISPR-Cas9 system, leverage induction of double-strand breaks (DSBs), a process that, as a result of genome damage, results in undesirable outcomes. More recent technologies, including base editing (Komor et al., 2016) and prime editing (Anzalone et al., 2019), circumvent the need for DSBs, but are associated with significant drawbacks. Base editing involves exchanging one nucleotide for another but can be used to make only a small number of nucleotide changes. Prime editing involves cutting only one strand of DNA, followed by the use of reverse transcriptase to “prime,” or initiate, the transfer of new genetic information encoded in an engineered guide RNA termed “prime editing guide RNA; pegRNA,” followed by reconstruction of the other DNA strand so that it corresponds to the new genetic material. While PE has been shown to have the capacity for efficient genome modification, it can generate only small insertions (<∼50 bp).
To enable insertion of DNA sequences larger than ∼50 bp, Anzalone et al. (2021) recently developed a novel twin prime editing (twinPE) strategy. This method uses two pegRNAs: one directs nicking of one of the two DNA target strands, and the other directs nicking of the other strand, with each pegRNA directing the synthesis of a 3′ flap complementary to the 3′ flap produced by the other pegRNA. The 3′ flaps hybridize with each other to form an intermediate with annealed 3′ overhangs of the new DNA sequence and annealed 5′ overhangs of the original DNA sequence. After the original DNA sequence is excised, the gap is filled by the reverse transcriptase enzyme, and a nick site ligation is performed between the two nicks, with a 3′ flap sequence replacing the endogenous sequence between the nicks. The edit results in the introduction of a new DNA sequence, either by deleting or by modifying a portion of the original DNA sequence. In human cells, twinPE has been shown to successfully generate larger insertions (∼100 bp) than can be generated with the PE strategy. Nevertheless, twinPE falls short of integrating large kilobase-sized DNA fragments into the genome.
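The complementarity requirement between the two 3′ flaps can be checked with a short sketch. It assumes both flaps are written 5′→3′ and that they anneal through their 3′-terminal regions; the function names and the example sequences are hypothetical and do not come from Anzalone et al. (2021).

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def three_prime_overlap(flap_a: str, flap_b: str) -> int:
    """Longest k such that the last k bases of flap_a pair with the last k of flap_b.

    Both flaps are given 5'->3'. Returns 0 if the 3' ends cannot anneal at all.
    """
    best = 0
    for k in range(1, min(len(flap_a), len(flap_b)) + 1):
        if flap_a[-k:].upper() == reverse_complement(flap_b[-k:]):
            best = k
    return best
```

For example, the toy flaps `TTTTACGTAC` and `GGGGGTACGT` share a 6-nt complementary region at their 3′ ends, so `three_prime_overlap` returns 6 for that pair.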
Programmable Addition Through Site-Specific Targeting Elements
Programmable Addition through Site-specific Targeting Elements (PASTE), developed by Ioannidi et al. (2021), is simply Bxb1 fused to the PE2 prime editor protein (Figure 4B). PASTE achieves integration of DNA segments ranging in size from 779 bp to ∼36 kb at efficiencies up to ∼55% in multiple cell types, including both dividing and non-dividing cells (Ioannidi et al., 2021). The authors observed no off-target activity with PASTE. The range of segment sizes that can be efficiently inserted would enable insertion of more than 99.7% of human cDNAs, illustrating the research potential of the approach. The construct used in PASTE includes a Cas9 protein fused to a reverse transcriptase, a pegRNA, and a large serine integrase Bxb1. Importantly, in a novel step, the construct also incorporates the key elements used for efficient DNA insertion via serine integrases, which typically insert sequences containing an attP attachment site into a target containing the related attB attachment site, or “landing site.” While existing attP/attB insertion approaches use a two-step method for DNA insertion, i.e., integration of the attP site and associated donor DNA into an attB landing site previously incorporated into the genome, the PASTE construct includes both the attP site and the attB site, the latter of which is incorporated into the pegRNA design and is copied into the genome via reverse transcription and flap repair. Because the construct includes both 1) a circular double-strand DNA template containing the donor DNA and the attP site, and 2) the attB landing site incorporated within the pegRNA design (collectively termed the “attachment site-containing guide RNA; atgRNA”), the DNA cargo can be integrated at the target site in a single reaction. Specifically, the attB landing site is inserted via a Cas9-directed reverse transcriptase, followed by attP-mediated landing site recognition, and integration of the DNA cargo via a Cas9-directed integrase.
The authors demonstrated the power and versatility of PASTE by showing that PASTE can be used for: 1) the tagging of genes, i.e., genes were tagged with GFP using PASTE, and results showed that GFP co-localized with the tagged gene product as expected; 2) multiplexed gene integration, i.e., the authors simultaneously integrated three different genes at three genomic loci; 3) direct insertion of DNA templates carried by AAV or adenoviral vectors; and 4) integration of therapeutic genes with subsequent expression of therapeutic protein products. For example, alpha-1 antitrypsin (encoded by SERPINA1) and carbamoyl phosphate synthetase I (encoded by CPS1) are involved in human Alpha-1 antitrypsin deficiency and CPS1 deficiency, respectively. To test protein production of these two proteins, Ioannidi et al. (2021) used PASTE to deliver SERPINA1 or CPS1 cargo and found effective integration at the ACTB locus in human cells. Furthermore, the authors provided evidence for protein expression, intracellular accumulation of the transgenic products, and secretion of proteins into the medium. Lastly, the authors also took steps to optimize PASTE by using metagenomic mining to discover thousands of putative integrase and attachment site combinations, and to engineer multiple novel integrase orthologs with improved activity and reduced attachment-site requirements. Importantly, a different study used PASTE with human cells to achieve precise integration of templates as large as ∼36 kb with ∼10%–20% integration efficiency (Ioannidi et al., 2021).
Assembly and Delivery of 100’s of kb of DNA
Synthetic genomics—the design and synthesis of genomes, or key regions of them—is an emerging field in the study of genome function and biological processes. While synthetic genomics has been applied to investigation of viral and microbial genomes, further advances are required to apply this approach to the study of larger mammalian genomes. Mitchell et al. (2021) developed a strategy that surmounts two major challenges in synthetic genomics: the assembly and delivery of long DNA sequences. Specifically, the investigators developed a workflow termed “eSwAP-IN” (extrachromosomal Switching Auxotrophies Progressively by Integration) that enables de novo assembly of DNA sequences of interest in yeast; leveraged a previously described gene-trap-based system termed “ICE” (Inducible Cassette Exchange) to deliver large, assembled DNA constructs to mouse embryonic stem cells (mESCs) (Figure 5A); and tested for payload integration and expression of the integrated gene via PCR and immunoblot, respectively. The eSwAP-IN workflow harnesses the inherent capacity of the yeast Saccharomyces cerevisiae to perform homologous recombination; S. cerevisiae can stitch multiple DNA sequences together with high fidelity, given a minimum of 40 bp of terminal sequence homology encoded by adjacent parts. The eSwAP-IN workflow is a modification of the previously described SwAP-IN method. For “in yeasto” DNA assembly, both eSwAP-IN and SwAP-IN incorporate, in a step-wise fashion, DNA segments into a progressively longer construct termed an assemblon. The major modification in eSwAP-IN relative to SwAP-IN is that the assemblon is assembled extrachromosomally in a circular format, and thus replicates and segregates independently of the native yeast chromosomes. Another important advantage of the circular format is that the assemblon can theoretically be directly transferred into E. coli for preparation of large quantities of purified DNA for delivery to the organism of choice. 
Accordingly, the vector used for insertion of the DNA assemblon in the organism of choice encodes features to support replication, segregation, and selection in both yeast and E. coli.
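The step-wise stitching of parts by terminal homology can be sketched in silico. The sketch below assumes exact sequence identity over the shared ends and uses the 40-bp minimum quoted in the text; it models only the joining logic, not yeast biology or marker switching, and the function names are hypothetical.

```python
from typing import Sequence

MIN_HOMOLOGY_BP = 40  # minimum terminal homology quoted in the text


def terminal_overlap(left: str, right: str, min_len: int = MIN_HOMOLOGY_BP) -> int:
    """Length of the longest suffix of `left` equal to a prefix of `right`.

    Returns 0 if no shared end of at least `min_len` bp exists.
    """
    for k in range(min(len(left), len(right)), min_len - 1, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


def assemble(parts: Sequence[str]) -> str:
    """Join parts in order into one progressively longer construct."""
    if not parts:
        raise ValueError("no parts given")
    assemblon = parts[0]
    for index, part in enumerate(parts[1:], start=1):
        k = terminal_overlap(assemblon, part)
        if k == 0:
            raise ValueError(f"part {index} lacks >= {MIN_HOMOLOGY_BP} bp of terminal homology")
        assemblon += part[k:]  # keep the shared region only once
    return assemblon
```

A check of this kind could be run on designed parts before ordering synthesis, to confirm that every adjacent pair encodes the required terminal homology.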
To demonstrate the capacity of eSwAP-IN for efficient assembly of large DNA segments, the investigators assembled the 101-kb human HPRT1 (hHPRT1) gene and then delivered the assemblon to mESCs using the ICE system. This system employs an mESC line with a landing pad on the X chromosome that includes a doxycycline-inducible CRE transgene. Induction of Cre expression renders the cells recombination-competent, and delivery of an appropriately designed DNA construct results in cassette exchange recombination and replacement of CRE with the incoming DNA. The investigators demonstrated the use of ICE for successful delivery of a 114-kb construct to mESCs and precise integration of the custom-built 101-kb hHPRT1 locus into the ICE landing pad.
Although the ICE approach enables delivery of ∼100-kb DNA payloads, it leaves scars flanking the integrated DNA in the mammalian genome. Recently, Brosh et al. (2021) developed an alternative platform, termed Big-IN, for efficient, repeated targeted integration of large DNA segments into mammalian cells, and a scalable pipeline for validation of the engineered cells. They demonstrated use of Big-IN for integration of DNA up to 143 kb in length in human embryonic stem cells (hESCs) and mouse ESCs (Figure 5B).
In brief, a short landing pad is targeted to replace a genomic locus of interest using CRISPR/Cas9-mediated HDR, followed by single-step payload integration via Cre recombinase-mediated cassette exchange (RMCE). To accomplish this, cells are transfected with two plasmids: 1) a pCas9 plasmid expressing guide RNAs (gRNAs) targeting the region of interest, and 2) a short landing pad that includes a promoter driving expression of a puromycin-resistance gene, a thymidine kinase gene, and a Cre gene; a mutant lox site (lox2272) and a loxP site, to permit Cre-mediated RMCE; and, flanking the two lox sites, homology arms (HA) corresponding to the genomic sequences that flank the gRNA target sites at the targeted genomic locus. Fixed insertion of the transiently transfected plasmid is accomplished by inducing its linearization via cloning of the same gRNA target sequences and protospacer adjacent motifs into the vector backbone just outside the HAs. Insertion is validated using PCR genotyping with primers targeting the novel junctions between the landing pad and the genomic sequences beyond the HAs; Sanger sequencing for base-pair resolution of correct landing pad integration; and quantitative real-time PCR for loss of target-gene expression and gain of Cre expression. The investigators also developed a modular next-generation sequencing pipeline, including use of hybridization capture sequencing, for validation of loss of the target locus, gain of the landing pad, and absence of the vector backbone and pCas9.
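The exchange logic of RMCE between two heterologous lox sites can be pictured with a feature-level sketch. It assumes a locus and a payload plasmid can each be represented as an ordered list of named features, with lox2272 upstream of loxP in both; the feature order inside the landing pad and all names are illustrative assumptions. The sketch captures only the swap and makes no claim about what remains at the junctions in Big-IN.

```python
from typing import List, Tuple


def _site_positions(features: List[str], site_a: str, site_b: str) -> Tuple[int, int]:
    """Return the indices of the two sites, requiring site_a before site_b."""
    i, j = features.index(site_a), features.index(site_b)
    if i >= j:
        raise ValueError(f"{site_a} must lie upstream of {site_b}")
    return i, j


def cassette_exchange(locus: List[str], payload: List[str],
                      site_a: str = "lox2272", site_b: str = "loxP") -> List[str]:
    """Swap the segment between the two sites in `locus` for that in `payload`."""
    li, lj = _site_positions(locus, site_a, site_b)
    pi, pj = _site_positions(payload, site_a, site_b)
    return locus[:li + 1] + payload[pi + 1:pj] + locus[lj:]


# Hypothetical layouts for illustration only.
landing_pad_locus = ["HA_left", "lox2272", "PuroR", "TK", "Cre", "loxP", "HA_right"]
payload_plasmid = ["backbone_5p", "lox2272", "payload_DNA", "loxP", "backbone_3p"]
edited_locus = cassette_exchange(landing_pad_locus, payload_plasmid)
```

In this model the selection and Cre genes of the landing pad are lost on exchange, which is consistent with the thymidine kinase gene being usable for counter-selection, and the plasmid backbone outside the lox sites is never carried into the locus.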
Big-IN is an advance over previous genetic-engineering approaches in that it facilitates one-step scarless delivery of large DNA payloads. Furthermore, the capacity of Big-IN for single-step construct integration enables repeated deliveries to the same allele, thereby minimizing technical factors that can hinder efficient integration, and thus is ideal for comprehensive examination of a given locus. In addition, the approach is designed to be scalable across multiple loci and cell lines, with delivery and selection methods that can be employed in a modular fashion to address problems associated with specific loci and cell types. Further, the validation strategy is designed to enable early validation of construct integration.
Grandela et al. (2021) recently presented STRAIGHT-IN (Serine and Tyrosine Recombinase Assisted Integration of Genes for High-Throughput INvestigation), a novel approach for integrating large DNA payloads into hiPSCs that combines the benefits of both serine (Bxb1) and tyrosine recombinases (Cre or Flp) (Figure 5C). The authors first generated hiPSCs with a landing pad cassette containing attachment sites (attP) for Bxb1 at the safe harbor locus AAVS1 using CRISPR/Cas9-mediated HDR. Next, using a series of donor DNA constructs ranging in size from 2 to 50 kb with cognate attachment sites (attB), the authors examined Bxb1-mediated recombination in hiPSC-landing pad cells and observed efficient (up to 50% with antibiotic selection) integration of donor DNA independent of size constraints. Additionally, to examine the upper limit of DNA payload integration, the authors tested integration of a large 170-kb BAC construct with cognate attB sites and observed successful site-specific integration of the large construct into the landing pad site, as examined via PCR amplification across attR and attL sites and via droplet digital PCR (ddPCR) to determine the copy number of the integrated DNA. Notably, the landing pad attachment site (attP) is flanked by heterologous loxP/loxP* sites, which enables Cre-mediated excision of the vector backbone components, avoiding any vector-related adverse effects. Together, these findings suggest that the Cas9-Bxb1 toolkit can help with the precise integration of large DNA payloads into hiPSCs as well.
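Copy-number estimation by ddPCR is commonly done by comparing the measured concentration of the integrated target with that of a reference locus of known copy number. The sketch below shows that ratio calculation under the assumption of a diploid reference (two copies per genome); the function name and the example concentrations are hypothetical and are not data from Grandela et al. (2021).

```python
def copy_number(target_copies_per_ul: float,
                reference_copies_per_ul: float,
                reference_copies_per_genome: int = 2) -> float:
    """Estimate target copies per genome from target and reference concentrations."""
    if reference_copies_per_ul <= 0:
        raise ValueError("reference concentration must be positive")
    if target_copies_per_ul < 0:
        raise ValueError("target concentration cannot be negative")
    return reference_copies_per_genome * target_copies_per_ul / reference_copies_per_ul
```

With illustrative inputs of 500 copies/µL for the target and 1,000 copies/µL for the reference, the estimate is 1.0 copy per genome, the value expected for a single-copy integration on one allele.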
While the aforementioned recombinase-based approaches can theoretically be applied to any gene locus of any length, success depends on the ability to source the required DNA, by PCR or commercial synthesis; the degree to which yeast tolerates the sequence composition of the lengthening assemblon; and the upper limit for chromosome stability in yeast. Mitchell et al. expect that the upper length limit of bacterial or mammalian constructs assembled and maintained in yeast is well over 1 Mb, and that a set of particular technical advances including laboratory automation may facilitate realization of large-scale genome writing in mammalian cells.
Conclusion and Future Directions
Targeted serine-integrase-based genome insertion is a key component of biomedical research and therapy development. However, until recently, available approaches lacked the capacity for efficient insertion of large (>5 kb) DNA segments. Recent studies have developed, implemented, and validated multiple versions of a Cas9-Bxb1-targeted integrase system that enable diverse novel genetic-engineering approaches based on efficient insertion of large DNA constructs. Future studies can explore the capacity for integration of DNA constructs up to 100 kb in length and further test a “plug-and-play” approach for rapid generation of transgenic mice.
Because serine integrase-based recombination systems exhibit high specificity for site-directed unidirectional recombination and high orthogonality to the mammalian genome, with no significant off-target activity, these systems have recently received widespread interest. From prophage genomes, using bioinformatics tools, Yang et al. uncovered 34 phage integrases and their predicted attB and attP recognition sites with no detectable off-target activity. Recently, Durrant et al. developed a systematic computational method for identifying thousands of novel serine integrases and their cognate attachment sites for insertion of large DNA segments. This technique has resulted in identification of three types of serine integrases: 1) integrases that can insert DNA into pre-positioned attachment sites (e.g., Pa01, Si74, and Nm60 integrases showed enhanced recombination of attP-donor plasmid DNA into attB landing-pad cells with minimal off-target activity compared with either Bxb1 or PhiC31); 2) integrases that can insert DNA into predicted pseudosites (e.g., Sp56, Pf80, and Enc3 can target the human genome at predicted target sites without pre-placed attachment sites); and 3) multi-targeting integrases that can insert DNA into multiple sites simultaneously (e.g., Cp36 integrated DNA into multiple loci with greater than 40% efficiencies in HEK293FT and K562 human cell lines with no pre-positioned landing-pad sites). Furthermore, experimental analyses of these integrases in human cells suggested sevenfold higher integration efficiencies than Bxb1, and genome insertion efficiencies of 40%–70% with DNA insert sizes of 7 kb. Notably, since the computational analyses identified both the integrases and the target sites, this would enable identification of off-targets in the human genome for effective genome therapy.
It will be exciting to apply this technology in cancer genomics, human genetics, systems biology, and cellular engineering to integrate large DNA constructs to model human diseases. Lastly, because the approaches extend beyond the integration of large kb-sized DNA segments to the generation of DNA inversions, it will be interesting to study the efficiency of Bxb1 integrase in generating DNA rearrangements and achieving conditional mutagenesis. In sum, this exciting new technology, which has been validated both in vitro and in vivo, has significant clinical implications for treating various human genetic disorders as well as for generating and studying next-generation mouse models of human disease.
Acknowledgments
We are grateful to Drs. David R. Liu and Stephen Sampson for critically reading the manuscript. The authors thank Zoe Reifsnyder for assistance with preparation of the figures.
Author Contributions
VH wrote and reviewed the manuscript. BL and MW reviewed the manuscript. All authors read and approved the final manuscript.
Funding
This work was supported in part by The Jackson Laboratory. We acknowledge support from the National Institutes of Health grants R01 CA265978-01A1, CA034196, and R21 OD027052.
Conflict of Interest
BL and MW are co-inventors of the patent application “High Frequency Targeted Animal Transgenesis,” International Application No. PCT/US2020/054745, published as WO 2021/072049.
The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
- Abe T., Inoue K.-i., Furuta Y., Kiyonari H. (2020). Pronuclear Microinjection During S-Phase Increases the Efficiency of CRISPR-Cas9-Assisted Knockin of Large DNA Donors in Mouse Zygotes. Cell Rep. 31 (7), 107653. Epub 2020/05/21. PubMed PMID: 32433962. doi:10.1016/j.celrep.2020.107653
- Anzalone A., Gao X., Podracky C., Nelson A., Koblan L., Raguram A., et al. (2021). Programmable Large DNA Deletion, Replacement, Integration, and Inversion with Twin Prime Editing and Site-Specific Recombinases. bioRxiv. 2021.11.01.466790. doi:10.1101/2021.11.01.466790
- Anzalone A. V., Randolph P. B., Davis J. R., Sousa A. A., Koblan L. W., Levy J. M., et al. (2019). Search-and-Replace Genome Editing Without Double-Strand Breaks or Donor DNA. Nature 576 (7785), 149–157. Epub 2019/10/22. PubMed PMID: 31634902; PMCID: PMC6907074. doi:10.1038/s41586-019-1711-4
- Brosh R., Laurent J. M., Ordoñez R., Huang E., Hogan M. S., Hitchcock A. M., et al. (2021). A Versatile Platform for Locus-Scale Genome Rewriting and Verification. Proc. Natl. Acad. Sci. U.S.A. 118 (10). Epub 2021/03/03. PubMed PMID: 33649239; PMCID: PMC7958457. doi:10.1073/pnas.2023952118
- Chaudhari N., Rickard A. M., Roy S., Dröge P., Makhija H. (2020). A Non-Viral Genome Editing Platform for Site-Specific Insertion of Large Transgenes. Stem Cell Res. Ther. 11 (1), 380. Epub 2020/09/05. PubMed PMID: 32883366; PMCID: PMC7650303. doi:10.1186/s13287-020-01890-6
- Chu V. T., Weber T., Wefers B., Wurst W., Sander S., Rajewsky K., et al. (2015). Increasing the Efficiency of Homology-Directed Repair for CRISPR-Cas9-Induced Precise Gene Editing in Mammalian Cells. Nat. Biotechnol. 33 (5), 543–548. Epub 2015/03/25. PubMed PMID: 25803306. doi:10.1038/nbt.3198
- Dave K., Sur I., Yan J., Zhang J., Kaasinen E., Zhong F., et al. (2017). Mice Deficient of Myc Super-Enhancer Region Reveal Differential Control Mechanism Between Normal and Pathological Growth. Elife 6. Epub 2017/06/07. PubMed PMID: 28583252; PMCID: PMC5461110. doi:10.7554/eLife.23382
- Gilpatrick T., Lee I., Graham J. E., Raimondeau E., Bowen R., Heron A., et al. (2020). Targeted Nanopore Sequencing with Cas9-Guided Adapter Ligation. Nat. Biotechnol. 38 (4), 433–438. Epub 2020/02/12. PubMed PMID: 32042167; PMCID: PMC7145730. doi:10.1038/s41587-020-0407-5
- Goodwin L. O., Splinter E., Davis T. L., Urban R., He H., Braun R. E., et al. (2019). Large-Scale Discovery of Mouse Transgenic Integration Sites Reveals Frequent Structural Variation and Insertional Mutagenesis. Genome Res. 29 (3), 494–505. Epub 2019/01/20. PubMed PMID: 30659012; PMCID: PMC6396414. doi:10.1101/gr.233866.117
- Grandela C., Blanch-Asensio A., Brandão K. O., de Korte T., Yiangou L., Mol M. P. H., et al. (2021). STRAIGHT-IN: A Platform for High-Throughput Targeting of Large DNA Payloads into Human Pluripotent Stem Cells. bioRxiv. 2021.12.08.471715. doi:10.1101/2021.12.08.471715
- Grindley N. D. F., Whiteson K. L., Rice P. A. (2006). Mechanisms of Site-Specific Recombination. Annu. Rev. Biochem. 75, 567–605. Epub 2006/06/08. PubMed PMID: 16756503. doi:10.1146/annurev.biochem.73.011303.073908
- Gu B., Posfai E., Rossant J. (2018). Efficient Generation of Targeted Large Insertions by Microinjection into Two-Cell-Stage Mouse Embryos. Nat. Biotechnol. 36 (7), 632–637. Epub 2018/06/12. PubMed PMID: 29889212. doi:10.1038/nbt.4166
- Haruyama N., Cho A., Kulkarni A. B. (2009). Overview: Engineering Transgenic Constructs and Mice. Curr. Protoc. Cell Biol. 42. Chapter 19:Unit 19.0. Epub 2009/03/14. PubMed PMID: 19283728; PMCID: PMC2743315. doi:10.1002/0471143030.cb1910s42
- Hnisz D., Abraham B. J., Lee T. I., Lau A., Saint-André V., Sigova A. A., et al. (2013). Super-Enhancers in the Control of Cell Identity and Disease. Cell 155 (4), 934–947. Epub 2013/10/15. PubMed PMID: 24119843. doi:10.1016/j.cell.2013.09.053
- Hnisz D., Schuijers J., Lin C. Y., Weintraub A. S., Abraham B. J., Lee T. I., et al. (2015). Convergence of Developmental and Oncogenic Signaling Pathways at Transcriptional Super-Enhancers. Mol. Cell 58 (2), 362–370. Epub 2015/03/25. PubMed PMID: 25801169; PMCID: PMC4402134. doi:10.1016/j.molcel.2015.02.014
- Iacovino M., Bosnakovski D., Fey H., Rux D., Bajwa G., Mahen E., et al. (2011). Inducible Cassette Exchange: A Rapid and Efficient System Enabling Conditional Gene Expression in Embryonic Stem and Primary Cells. Stem Cells 29 (10), 1580–1588. Epub 2011/11/01. PubMed PMID: 22039605; PMCID: PMC3622722. doi:10.1002/stem.715
- Ioannidi E. I., Yarnall M. T. N., Schmitt-Ulms C., Krajeski R. N., Lim J., Villiger L., et al. (2021). Drag-and-Drop Genome Insertion Without DNA Cleavage with CRISPR-Directed Integrases. bioRxiv. 2021.11.01.466786. doi:10.1101/2021.11.01.466786
- Jasin M., Haber J. E. (2016). The Democratization of Gene Editing: Insights from Site-Specific Cleavage and Double-Strand Break Repair. DNA Repair 44, 6–16. Epub 2016/05/12. PubMed PMID: 27261202. doi:10.1016/j.dnarep.2016.05.001
- Komor A. C., Kim Y. B., Packer M. S., Zuris J. A., Liu D. R. (2016). Programmable Editing of a Target Base in Genomic DNA Without Double-Stranded DNA Cleavage. Nature 533 (7603), 420–424. Epub 2016/04/21. PubMed PMID: 27096365; PMCID: PMC4873371. doi:10.1038/nature17946
- Krupke D. M., Begley D. A., Sundberg J. P., Richardson J. E., Neuhauser S. B., Bult C. J. (2017). The Mouse Tumor Biology Database: A Comprehensive Resource for Mouse Models of Human Cancer. Cancer Res. 77 (21), e67–e70. Epub 2017/11/03. PubMed PMID: 29092943; PMCID: PMC5679300. doi:10.1158/0008-5472.Can-17-0584
- Laboulaye M. A., Duan X., Qiao M., Whitney I. E., Sanes J. R. (2018). Mapping Transgene Insertion Sites Reveals Complex Interactions Between Mouse Transgenes and Neighboring Endogenous Genes. Front. Mol. Neurosci. 11, 385. Epub 2018/11/09. PubMed PMID: 30405348; PMCID: PMC6206269. doi:10.3389/fnmol.2018.00385
- Lai Y., Yue Y., Duan D. (2010). Evidence for the Failure of Adeno-Associated Virus Serotype 5 to Package a Viral Genome ≥8.2 Kb. Mol. Ther. 18 (1), 75–79. Epub 2009/11/12. PubMed PMID: 19904238; PMCID: PMC2839223. doi:10.1038/mt.2009.256
- Liu M., Rehman S., Tang X., Gu K., Fan Q., Chen D., et al. (2019). Methodologies for Improving HDR Efficiency. Front. Genet. 9, 691. PubMed PMID: 30687381. doi:10.3389/fgene.2018.00691
- Low B. E., Hosur V., Lesbirel S., Wiles M. V. (2021). Efficient Targeted Transgenesis of Large Donor DNA into Multiple Mouse Genetic Backgrounds Using Bacteriophage Bxb1 Integrase. Sci. Rep. 12 (1), 5424. doi:10.1038/s41598-022-09445-w
- Menghi F., Barthel F. P., Yadav V., Tang M., Ji B., Tang Z., et al. (2018). The Tandem Duplicator Phenotype Is a Prevalent Genome-Wide Cancer Configuration Driven by Distinct Gene Mutations. Cancer Cell 34 (2), 197–210.e5. Epub 2018/07/19. PubMed PMID: 30017478. doi:10.1016/j.ccell.2018.06.008
- Menghi F., Inaki K., Woo X., Kumar P. A., Grzeda K. R., Malhotra A., et al. (2016). The Tandem Duplicator Phenotype as a Distinct Genomic Configuration in Cancer. Proc. Natl. Acad. Sci. U.S.A. 113 (17), E2373–E2382. Epub 2016/04/14. PubMed PMID: 27071093; PMCID: PMC4855596. doi:10.1073/pnas.1520010113
- Merrick C. A., Zhao J., Rosser S. J. (2018). Serine Integrases: Advancing Synthetic Biology. ACS Synth. Biol. 7 (2), 299–310. Epub 2018/01/11. PubMed PMID: 29316791. doi:10.1021/acssynbio.7b00308
- Mitchell L. A., McCulloch L. H., Pinglay S., Berger H., Bosco N., Brosh R., et al. (2021). De Novo Assembly and Delivery to Mouse Cells of a 101 Kb Functional Human Gene. Genetics 218 (1). Epub 2021/03/21. PubMed PMID: 33742653; PMCID: PMC8128383. doi:10.1093/genetics/iyab038
- Montini E., Cesana D., Schmidt M., Sanvito F., Ponzoni M., Bartholomae C., et al. (2006). Hematopoietic Stem Cell Gene Transfer in a Tumor-Prone Mouse Model Uncovers Low Genotoxicity of Lentiviral Vector Integration. Nat. Biotechnol. 24 (6), 687–696. Epub 2006/05/30. PubMed PMID: 16732270. doi:10.1038/nbt1216
- Munoz-Lopez M., Garcia-Perez J. (2010). DNA Transposons: Nature and Applications in Genomics. Curr. Genomics 11 (2), 115–128. Epub 2010/10/05. PubMed PMID: 20885819; PMCID: PMC2874221. doi:10.2174/138920210790886871
- Nayak S., Herzog R. W. (2010). Progress and Prospects: Immune Responses to Viral Vectors. Gene Ther. 17 (3), 295–304. Epub 2009/11/13. PubMed PMID: 19907498; PMCID: PMC3044498. doi:10.1038/gt.2009.148
- Peters J. E., Makarova K. S., Shmakov S., Koonin E. V. (2017). Recruitment of CRISPR-Cas Systems by Tn7-Like Transposons. Proc. Natl. Acad. Sci. U.S.A. 114 (35), E7358–E7366. Epub 2017/08/16. PubMed PMID: 28811374; PMCID: PMC5584455. doi:10.1073/pnas.1709035114
- Quadros R. M., Miura H., Harms D. W., Akatsuka H., Sato T., Aida T., et al. (2017). Easi-CRISPR: A Robust Method for One-Step Generation of Mice Carrying Conditional and Insertion Alleles Using Long ssDNA Donors and CRISPR Ribonucleoproteins. Genome Biol. 18 (1), 92. Epub 2017/05/18. PubMed PMID: 28511701; PMCID: PMC5434640. doi:10.1186/s13059-017-1220-4
- Reilly K. M. (2016). The Effects of Genetic Background of Mouse Models of Cancer: Friend or Foe? Cold Spring Harb. Protoc. 2016 (3), pdb.top076273. Epub 2016/03/05. PubMed PMID: 26933251; PMCID: PMC6703156. doi:10.1101/pdb.top076273
- Rivera J., Tessarollo L. (2008). Genetic Background and the Dilemma of Translating Mouse Studies to Humans. Immunity 28 (1), 1–4. Epub 2008/01/18. PubMed PMID: 18199409. doi:10.1016/j.immuni.2007.12.008
- Stark W. M. (2017). Making Serine Integrases Work for Us. Curr. Opin. Microbiol. 38, 130–136. Epub 2017/06/10. PubMed PMID: 28599144. doi:10.1016/j.mib.2017.04.006
- Strecker J., Ladha A., Gardner Z., Schmid-Burgk J. L., Makarova K. S., Koonin E. V., et al. (2019). RNA-Guided DNA Insertion with CRISPR-Associated Transposases. Science 365 (6448), 48–53. Epub 2019/06/07. PubMed PMID: 31171706; PMCID: PMC6659118. doi:10.1126/science.aax9181
- Tang F., Yang Z., Tan Y., Li Y. (2020). Super-Enhancer Function and its Application in Cancer Targeted Therapy. npj Precis. Onc. 4, 2. Epub 2020/03/05. PubMed PMID: 32128448; PMCID: PMC7016125. doi:10.1038/s41698-020-0108-z
- The ENCODE Project Consortium (2012). An Integrated Encyclopedia of DNA Elements in the Human Genome. Nature 489 (7414), 57–74. PubMed PMID: 22955616. doi:10.1038/nature11247
- Tipanee J., VandenDriessche T., Chuah M. K. (2017). Transposons: Moving Forward from Preclinical Studies to Clinical Trials. Hum. Gene Ther. 28 (11), 1087–1104. Epub 2017/09/19. PubMed PMID: 28920716. doi:10.1089/hum.2017.128
- Vo P. L. H., Ronda C., Klompe S. E., Chen E. E., Acree C., Wang H. H., et al. (2020). CRISPR RNA-Guided Integrases for High-Efficiency, Multiplexed Bacterial Genome Engineering. Nat. Biotechnol. 39, 480–489. Epub 2020/11/25. PubMed PMID: 33230293. doi:10.1038/s41587-020-00745-y
- Wallace H. A. C., Marques-Kranc F., Richardson M., Luna-Crespo F., Sharpe J. A., Hughes J., et al. (2007). Manipulating the Mouse Genome to Engineer Precise Functional Syntenic Replacements with Human Sequence. Cell 128 (1), 197–209. Epub 2007/01/16. PubMed PMID: 17218265. doi:10.1016/j.cell.2006.11.044
- Wang B., Li K., Wang A., Reiser M., Saunders T., Lockey R. F., et al. (2015). Highly Efficient CRISPR/HDR-Mediated Knock-In for Mouse Embryonic Stem Cells and Zygotes. Biotechniques 59 (4), 201–208. Epub 2015/10/16. PubMed PMID: 26458548. doi:10.2144/000114339
- Wang H., Yang H., Shivalila C. S., Dawlaty M. M., Cheng A. W., Zhang F., et al. (2013). One-Step Generation of Mice Carrying Mutations in Multiple Genes by CRISPR/Cas-Mediated Genome Engineering. Cell 153 (4), 910–918. Epub 2013/05/02. PubMed PMID: 23643243. doi:10.1016/j.cell.2013.04.025
- Wang W., Lin C., Lu D., Ning Z., Cox T., Melvin D., et al. (2008). Chromosomal Transposition of PiggyBac in Mouse Embryonic Stem Cells. Proc. Natl. Acad. Sci. U.S.A. 105 (27), 9290–9295. Epub 2008/06/27. PubMed PMID: 18579772; PMCID: PMC2440425. doi:10.1073/pnas.0801017105
- Wang X., Cairns M. J., Yan J. (2019). Super-Enhancers in Transcriptional Regulation and Genome Organization. Nucleic Acids Res. 47 (22), 11481–11496. Epub 2019/11/15. PubMed PMID: 31724731; PMCID: PMC7145697. doi:10.1093/nar/gkz1038
- Willis N. A., Frock R. L., Menghi F., Duffey E. E., Panday A., Camacho V., et al. (2017). Mechanism of Tandem Duplication Formation in BRCA1-Mutant Cells. Nature 551 (7682), 590–595. Epub 2017/11/24. PubMed PMID: 29168504; PMCID: PMC5728692. doi:10.1038/nature24477
- Yan B.-W., Zhao Y.-F., Cao W.-G., Li N., Gou K.-M. (2013). Mechanism of Random Integration of Foreign DNA in Transgenic Mice. Transgenic Res. 22 (5), 983–992. Epub 2013/03/14. PubMed PMID: 23483296. doi:10.1007/s11248-013-9701-z
- Yang H., Ren S., Yu S., Pan H., Li T., Ge S., et al. (2020). Methods Favoring Homology-Directed Repair Choice in Response to CRISPR/Cas9 Induced-Double Strand Breaks. Int. J. Mol. Sci. 21 (18), 6461. PubMed PMID: 32899704. doi:10.3390/ijms21186461
- Yang H., Wang H., Shivalila C. S., Cheng A. W., Shi L., Jaenisch R. (2013). One-Step Generation of Mice Carrying Reporter and Conditional Alleles by CRISPR/Cas-Mediated Genome Engineering. Cell 154 (6), 1370–1379. Epub 2013/08/29. PubMed PMID: 23992847. doi:10.1016/j.cell.2013.08.022