Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2022 Sep 21;62(24):3521–3532. doi: 10.1021/acs.biochem.2c00379

Transposons and CRISPR: Rewiring Gene Editing

Francisco Tenjo-Castaño 1, Guillermo Montoya 1,*, Arturo Carabias 1
PMCID: PMC10734217  PMID: 36130724

Abstract

graphic file with name bi2c00379_0006.jpg

CRISPR-Cas is driving a gene editing revolution because of its simple reprogramming. However, off-target effects and dependence on the double-strand break repair pathways impose important limitations. Because homology-directed repair acts primarily in actively dividing cells, many of the current gene correction/replacement approaches are restricted to a minority of cell types. Furthermore, current approaches display low efficiency upon insertion of large DNA cargos (e.g., sequences containing multiple gene circuits with tunable functionalities). Recent research has revealed new links between CRISPR-Cas systems and transposons providing new scaffolds that might overcome some of these limitations. Here, we comment on two new transposon-associated RNA-guided mechanisms considering their potential as new gene editing solutions. Initially, we focus on a group of small RNA-guided endonucleases of the IS200/IS605 family of transposons, which likely evolved into class 2 CRISPR effector nucleases (Cas9s and Cas12s). We explore the diversity of these nucleases (named OMEGA, obligate mobile element-guided activity) and analyze their similarities with class 2 gene editors. OMEGA nucleases can perform gene editing in human cells and constitute promising candidates for the design of new compact RNA-guided platforms. Then, we address the co-option of the RNA-guided activity of different CRISPR effector nucleases by a specialized group of Tn7-like transposons to target transposon integration. We describe the various mechanisms used by these RNA-guided transposons for target site selection and integration. Finally, we assess the potential of these new systems to circumvent some of the current gene editing challenges.


Gene editing refers to the modification of an organism’s genetic material (addition, deletion, or alteration). Since the discovery that nuclease-induced DNA breaks can be used to promote genome editing,1 various nuclease-based technologies have been developed (ZFNs, meganucleases, TALENs, and CRISPR-Cas). Among them, CRISPR-Cas (clustered regularly interspaced short palindromic repeats and CRISPR-associated proteins) has led a revolution that has accelerated the applications of gene editing in different disciplines. CRISPR-Cas has been adapted from a microbial defense system that provides adaptive immunity against genetic parasites.2,3 After the first encounter with the invader, short pieces of the foreign genetic material are incorporated into CRISPR arrays. Later, these arrays are transcribed into RNAs, which associate with effector nucleases (e.g., Cas9 or Cas12), acting as a guide to cleave complementary DNAs or RNAs, thereby digesting the invader’s genetic material and stopping a second infection in a process known as interference. An additional determinant for target DNA selection is the presence of a short motif adjacent to the target site or PAM (protospacer adjacent motif). PAM sequences are essential for avoiding autoimmunity and are thought to act as “labels” to minimize the search space along the genome.47

CRISPR-Cas systems are widespread among microorganisms, and their variety is categorized into two classes and multiple types on the basis of the presence of the different protein and RNA components and their architecture. Typically, class 1 systems (including types I, III, and IV) use effector multisubunit complexes. For instance, type I effector or Cascade complexes comprise a gRNA and Cas3 and Cas5–Cas8 proteins, wherein Cas3 has nuclease activity. Likewise, type III-A and -B systems encode RNA-targeting multiprotein complexes (known as Csm for type III-A or Cmr for type III-B) containing Cas5 (Csm4/Cmr5), Cas7 (Csm3/Cmr4), Cas7-like (Csm5/Cmr1 and Cmr6), Cas10 (Csm1 and Cmr2), and Cas11 homologues (Csm2 and Cmr3), with Cas10 frequently displaying nuclease activity. In turn, class 2 effectors (type II or Cas9s, type V or Cas12s, and type VI or Cas13s) are composed of single-multidomain protein nucleases that have been extensively used in gene editing applications.810

The strength of CRISPR-Cas is based on its ease of reprogramming simply by altering the sequence of the guide RNA. Therefore, Cas nucleases can be redirected to specific sites in a genome adjacent to a PAM sequence to generate double-strand breaks (DSBs). This procedure exploits the host DSB repair pathways (DRPs) to induce gene editing, creating knockouts [through nonhomologous end joining (NHEJ) or microhomology-mediated end joining] or precisely correcting a lesion [through homology-directed repair (HDR) by providing an exogenous homologous repair template].11 Gene editing with CRISPR-Cas nucleases is currently used in biotechnology, agriculture, basic research, and even human medicine.12 However, CRISPR-Cas editing still faces substantial challenges in terms of efficacy (mainly due to the efficiency of the enzymes) and safety (associated with their accuracy and our lack of control over the host DRPs).13 In the past several years, the development of more efficient and precise CRISPR-nucleases,14,15 CRISPR-based platforms that bypass the host DRPs,16 and strategies for controlling the DRPs17 has alleviated some of these limitations. Autonomous DNA transposons have also been investigated as alternative gene editing tools. These transposons are mobile genetic elements (MGEs) comprising their own target DNA recognition system and a DNA integrase in charge of DRP-independent transposon insertion. However, transposon usage is either too specific (not easily programmable) as is the case for Tn718 or too unspecific as in the case for Tn519 and PiggyBac.20

This review focuses on new findings regarding the relationship between CRISPR interference nucleases and transposons. The potential advantages of these new RNA-guided tools for gene editing are also assessed.

IS200/IS605 Transposons: The Evolutionary Origin of Cas9 and Cas12 Nucleases

Mechanism of TnpA-Based IS200/IS605 Transposition

Insertion sequence (IS) transposons were identified in the 1970s due to their capacity to produce mutations in model systems during genome translocation.2124 The IS200/IS605 family is an unusual group within the IS transposons that use obligatory single-strand (ss) DNA intermediates for transposition. The transposition mechanism of this family has been reviewed extensively.25 In brief, transposon excision and integration are intricately coordinated with genome replication and have a strong bias toward the lagging strand template.26 The transposon cargo is typically ∼600–2000 nucleotides and encodes only one or two genes, TnpA and TnpB (Figure 1A). Transposon excision and integration are catalyzed by TnpA (or Y1), a small transposase (∼200 amino acids) member of the “HUH” (histidine–bulky hydrophobic residue–histidine) enzyme superfamily.27 TnpA binds to subterminal palindromic structures at the flanking ends of the transposon DNA called the left and right ends [isLE and isRE, respectively (Figure 1A)]. Upon binding to the isLE and isRE, TnpA first catalyzes the transposon’s excision and then its integration into a DNA adjacent to a target site, a conserved tetra- or pentanucleotide ssDNA that is not duplicated after insertion (Figure 1A). Structural and biochemical data of different steps of the transposition reaction are available for members of two families, TnpAs from Helicobacter pylori and Deinococcus radiodurans from the IS608 and ISDra2 transposons (HpyTnpA and DraTnpA), supporting a trans/cis rotational model.2832 Overall, transposon mobilization is thought to take place through a “peel and paste” mechanism, in which the transposon excises (“peel”) as a circular single-strand DNA from the lagging strand (donor DNA) and is then inserted (“paste”) into a new genomic position next to the target site (preferentially at the lagging strand)25 (Figure 1A).

Figure 1.

Figure 1

IS200/IS605 family of transposons that encodes Cas9 and Cas12 evolutionary ancestors. (A) Architecture of the D. radiodurans IS608 transposon and the proposed TnpA-mediated “peel and paste” transposition mechanism. Modified from refs (40) and (25). (B) Domain organization and comparison between DraTnpB and FnoCas12a [Protein Data Bank (PDB) entry 5NG6] and between OgeuIscB (PDB entry 8CTL) and SpyCas9 (PDB entry 4ZT0). The WED domain, RuvC domain, recognition lobe (REC), bridge helices (BH), NUC domain, zinc finger domain (ZF), PAM or TAM interacting domain (PI/TI), HNH domain, P1 interacting domain (P1D), and C-terminal domain are depicted.

IscBs and TnpBs Are RNA-Guided Endonucleases

TnpB proteins are not required for transposition28,31,33 and seem to play a regulatory function.3436 On the basis of sequence homology analyses, TnpBs are likely the evolutionary ancestors of Cas12 proteins.9,3739 The predicted structural model of TnpB from D. radiodurans (DraTnpB) is most similar to Cas12f and consists of a recognition lobe (REC), a WED domain, a RuvC domain, two bridge helices (BHs), and C-terminal zinc finger motive (ZF).40 A group of TnpB-related proteins also encode an HNH domain. Therefore, they are thought to have evolved into Cas9 proteins and are named IscBs (insertion sequences Cas9-like similar to TnpBs).37 IscBs also display an N-terminal domain without clear homology to other known domains containing a “PLMP” motif (PLMP), while they lack the characteristic REC domain38 (Figure 1B).

Noncoding RNAs (ncRNAs) have been linked to TnpB-related transposons.41,42 Recently, transcriptomics data from different TnpB transposons have revealed the expression of a ncRNA overlapping with the TnpB gene that extends beyond the isRE of the transposon, denoted as right end RNA [reRNA (Figure 2A)].38,40 Similarly, several IscB transposons transcribe ncRNAs associated with the isLE [leRNAs (Figure 2B)] containing predicted structural hairpins that partially match the HEARO (HNH Endonuclease-Associated RNA and ORF) RNA,38 a noncoding RNA usually associated with HNH containing loci, together thought to form a mobile genetic element.43 Both reRNAs and leRNAs are also denoted ωRNAs (from OMEGA-associated RNAs, see Diversity of TnpB and IscB Proteins).38 Some IscB transposons are associated with CRISPR arrays. The similarities between the structural hairpins of these CRISPR-RNAs (crRNAs) and the ωRNA suggest an evolutionary relationship, in which the isLE/isRE derived into the handle of the crRNA.38 ωRNAs also have a sequence that probably evolved to form the guide of the crRNA, corresponding to the flanking region of the transposon (Figure 2A,B). This sequence changes after transposition, showing conceptual parallelisms with the update of the guide sequences during CRISPR immunity. Nevertheless, the guide update mechanism is not conserved in both systems. In CRISPR, it is accomplished by Cas1–Cas2 adaptation complexes that integrate pieces of the invader genetic material into the CRISPR arrays,44 while in ωRNAs, it is mediated by transposon mobilization (Figure 1A).

Figure 2.

Figure 2

TnpBs and IscBs are RNA-guided endonucleases. (A) DraTnpB RNA-guided dsDNA cleavage. (B) OgeuIscB RNA-guided dsDNA cleavage.

IscBs and TnpBs are RNA-guided endonucleases that use ωRNAs as guides to target a guide RNA complementary dsDNA.38,40 They also specify a target or transposon-associated motive (TAM) adjacent to the guide RNA target, which is analogous to the recognition of PAM sequences by CRISPR nucleases to initiate dsDNA unwinding. TAM discovery experiments have revealed that different members of the TnpB and IscB families recognize different TAMs.38,40 Intriguingly, DraTnpB recognizes a TAM that partially matches the transposon’s target site, suggesting that the PAM might have originated from this sequence (Figure 2A). TnpBs cleave both strands of the DNA using a single catalytic RuvC domain, like Cas12 proteins38,40 (Figure 2A). On the contrary, IscB proteins use their RuvC and HNH domains to cleave specifically the nontarget and target strand, resembling Cas9 cleavage (Figure 2B).38 High-resolution cryo-electron microscopy structures of an IscB isolated from a human gut stool metagenome (OgeuIscB) in complex with the ωRNA and a nonscissile target DNA have revealed the molecular basis of TAM recognition, R-loop formation, and DNA cleavage, showing that IscB shares a similar interference mechanism with Cas9.45 OgeouIscB structures show that the function of the REC domain is substituted by the ωRNA lobe formed by the HEARO RNA, which contacts the RNA/DNA heteroduplex. This supports an evolutionary trend in which protein domains (such as REC) have substituted ωRNA structural motifs.45 Despite the structures of OgeuIscB showing unambiguous density for the PLMP domain, its function is still unclear. The PLMP domain interacts with the last loop of the ωRNA (P5), suggesting that it could play a role in RNP biogenesis by regulating the termination of transcription. The structure-assisted removal of this domain had only a slight effect on RNA-guided activity,45 while its removal from other members has a more drastic effect.38 Further research is needed to uncover the role of the PLMP domain.

Diversity of TnpB and IscB Proteins

Intriguingly, several groups of TnpB- and IscB-related proteins have been identified. The family of these nucleases, named OMEGA (obligate mobile element-guided activity), includes members that represent steps of the evolutionary history of Cas9 and Cas12 proteins. For instance, a group of smaller IscB-like proteins (∼350 amino acids) named IsrB (insertion sequence RuvC-like OrfB) contain PLMP and RuvC domains but lack the HNH domain and consequently are nickases that cleave the nontarget strand.38 On the contrary, IshB nucleases (insertion sequence HNH-like OrfB, ∼180 amino acids) display PLMP and HNH domains but lack the RuvC domain and are predicted to nick the target strand.38 Furthermore, a group of IscBs display putative ancestral REC domains and probably represent a later stage in the evolution closer to Cas9s.38 Additionally, two new groups of Cas9 have been discovered, one of them containing relatively small Cas9s (II-D, ∼700 amino acids).38

The TnpB family is more diverse than the IscB family. A bioinformatic analysis supports the idea that TnpB is one of the most common genes in prokaryotes, suggesting that RNA-guided mechanisms are more broadly spread than previously thought.38 Remarkably, TnpBs are possibly ancestors of Fanzors, larger proteins (∼2000 amino acids) present in diverse eukaryotic transposons.46 Further phylogenetic and functional investigation of the TnpB family will help to accurately establish the evolutionary history of Cas12 proteins.

Biological Function of TnpB and IscB Proteins

The role of TnpB and IscB proteins remains enigmatic. Initial observations show that TnpB is dispensable for transposition,28,31,33 and genetic and functional experiments point to a possible regulatory role.3436 On the basis of the current evidence and similarities with CRISPR immunity, the guide sequences of the OMEGA systems are updated after the migration of the transposon to a new place in the genome (Figure 1A). Therefore, the target site for TnpB and IscB proteins, defined by TAM sequence directly adjacent to the guide RNA complementary sequence, is present in only one of the daughter dsDNAs after transposon excision (Figure 3). In this framework, two main hypotheses for the function of TnpB and IscB proteins during the transposon life cycle have been suggested. A “toxin–antitoxin” model supports the idea that the cleavage of the transposon-less daughter DNA would be toxic for the cells, while DNAs containing the transposon would behave as an antitoxin at the population level.38 An alternative model of “peel, paste, and copy” for transposon homing proposes that the cleavage of the transposon-less daughter dsDNA would stimulate its recombination with the sister dsDNA containing the transposon.40 It is important to note that the “peel, paste, and copy” model might allow us to explain the increase in the transposon copy number observed in some D. radiodurans strains.47 The outcome of both scenarios would be similar at the population level, ensuring the propagation of the transposon. Further research is needed to uncover the biological role of the RNA-guided activity of OMEGA nucleases during the transposon life cycle.

Figure 3.

Figure 3

Possible functions of TnpB and IscB’s RNA-guided activity in the transposon life cycle. The proposed model is depicted for the D. radiodurans IS608 transposon. After transposon excision and replication of the DNA, one of the resulting dsDNAs presents the target site for DraTnpB, defined by the presence of the TAM next to the guide complementary sequence. DraTnpB RNA-guided cleavage of this strand would stimulate the HDR using the sister DNA, containing the transposon, as a template (transposon homing by the “peel, paste, and copy” mechanism). Alternatively, the generated DSB might have toxic effects on the cell, the transposon working as an antitoxin. Schematic inspired by ref (40).

Potential of OMEGA Systems in Gene Editing

OMEGA constitutes a broad collection of architecturally and functionally diverse RNA-guided endonucleases.38,40 Two OMEGA nucleases have been proven to induce genome edits in mammalian cells, establishing a proof of principle of OMEGA system’s potential in gene editing. OgeuIscB-ωRNA (496 amino acids) produced insertions and deletions (indels) at the target site, next to a 3′ CTAGAA TAM, in HEK293FT with a frequency of ∼5% when co-expressed with 16-nucleotide guides. Similarly, transient transfection of DraTnpB (408 amino acids) together with reRNAs containing 20-nucleotide guides produced indels with a frequency of ∼10–20% next to a 5′ TTGAT TAM, and the observed mutational profile resembles that of Cas12a-based editing.40

The development of compact CRISPR enzymes is of current interest due to their advantage when they are packed into mature delivery vehicles, such as adenoviral-associated vectors (AAVs).13,4851 OMEGA nucleases are substantially smaller (∼400 amino acids on average) than type II (Cas9s, 900–1300 amino acids) and type V (Cas12s, 500–1300 amino acids) enzymes, and still smaller than the most compact CRISPR-Cas gene editors (Cas12fs, ∼500 amino acids, and Cas12js, ∼800 amino acids).48 The miniaturization of enzymes usually comes at a cost of efficiency or specificity, because of the simplification of accessory regulatory protein domains. For example, Cas12js constitute a minimal protein scaffold that can perform RNA-guided DNA cleavage.49,50 However, their efficiency is lower than those of other gene editors,48 and attempts to improve their efficiency have resulted in a parallel increase in RNA/DNA mismatch tolerance and reduced RNA/DNA specificity.50 A recent study with Cas9 suggests that the tolerance to RNA/DNA mismatches inversely correlates with the number of structural contacts with the RNA/DNA hybrid.15 In this regard, OgeuIscB intertwines an extensive protein and ωRNA contact network with the DNA/RNA hybrid, suggesting a possible high DNA/RNA substrate specificity despite its small size.45

OgeuIscB and DraTnpB are less efficacious than Cas9 or Cas12a when used in mammalian cells. However, OMEGA nucleases represent diverse new starting points for the design of compact enzymes for gene editing. Further mechanistic and functional studies of the different IscB and TnpB families will be essential for fully uncovering the similarities between OMEGA and CRISPR nucleases and for the efficient development of TnpB- and IscB-based gene editing tools.

CRISPR-Cas Recruitment by TN7-like Transposons

The intricate evolutionary history of transposons and both CRISPR-Cas classes is further intertwined by the appearance of specialized transposons that have recruited an inactivated CRISPR-Cas interference effector to direct the transposase to its target (Figure 4).5255 These CRISPR-Cas-associated transposons (CAST) have lost their target cleavage activity by eliminating Cas3 in class 1 systems and mutating Cas12’s catalytic amino acids in class 2 systems. Both complexes retain the ability to recognize DNA sequences in a PAM-dependent manner, targeting transposon integration. Convergent evolution has produced a great diversity of these elements, resulting in different combinations of CRISPR-Cas subtypes (e.g., I-F, I-B, and V-K) and transposon families [e.g., Tn7 and Tn5053 (Figure 4)].5659

Figure 4.

Figure 4

Gene architectures of Tn7, CAST, and Tn5053 systems. Each system contains different combinations of transposon (tnsA, tnsB, tnsC, or tnsD/tniQ) and cas (Cascade or cas12k) proteins. The general structure of the CRISPR arrays (shown on the right of each locus) and the RE and LE sequences of each transposon (shown beneath each transposon end) also vary between systems. Canonical CRISPR array repeats are colored pale red, while divergent repeats are colored dark red. Subtype I-F and I-B CASTs resemble the Tn7 family of transposons, including tnsA, tnsB, tnsC, and tnsD/tniQ, while type V-K resembles the Tn5053 family that includes only tniA (tnsB homologue), tniB (tnsC homologue), and tniQ.

A general model explaining the new system’s transposition mechanism has been proposed on the basis of previous knowledge of Tn718,60,61 and the recent molecular, biochemical, and structural data generated for class 1 (subtypes I-B and I-F)54,6265 and class 2 (subtype V-K)6668 transposons (Figure 5). Tn7’s locus comprises five genes, namely, TnsA–TnsE, which are used in two different combinations (Figures 4 and 5). The first combination, TnsABCD, serves as a homing mechanism to insert Tn7 into the host’s chromosome. The second combination, TnsABCE, is used to insert Tn7 into replicating plasmids, opening the possibility of horizontal gene transfer to other hosts by conjugation.

Figure 5.

Figure 5

Tn7 and CAST transposition mechanisms. Each system relies on different machinery to recognize its target and catalyze transposition. Tn7 uses TnsD. Subtype I-B CAST uses both TnsD and Cascade. Subtype I-F CAST uses only Cascade. Subtype V-K CAST uses Cas12k. Transposon excision and integration are catalyzed by TnsA and TnsB in Tn7, I-B, and I-F CAST, while V-K CAST uses only TnsB. TS, target strand; NTS, nontarget strand.

During the homing phase or TnsABCD transposition, TnsD recognizes the target, the glucosamine 6-phosphate synthase gene (glmS), creating a distortion in the DNA double helix18 (Figure 5). TnsC, a transposition regulator belonging to the AAA+ family, recognizes both TnsD and this DNA distortion and forms a ring around the DNA.60,69 This process leads to the activation of the transposase complex comprising TnsA and TnsB.6972 TnsA is an endonuclease that cleaves the 5′ ends of the transposon. TnsB belongs to the DDE tranposase family and binds to a series of DNA repeats present at the left and right ends (LE and RE, respectively) of the transposon (Figures 4 and 5). Upon activation, TnsB catalyzes the cleavage of the 3′ ends of the transposon leaving free hydroxyl groups, later used to perform a nucleophilic attack downstream of the target sequence. Together, TnsA and TnsB excise the transposon from the donor DNA and TnsB inserts it into the attachment site located 25 bp downstream of the glmS transcriptional terminator (Figure 5).18 In TnsABCE transposition, TnsE is responsible for sequence-independent insertion into mobile plasmids by recognizing 3′ recessed ends and the sliding clamp protein during DNA replication.73

With respect to CAST, target recognition relies on the CRISPR-Cas-inactivated effector, instead of TnsD or TnsE, except in subtype I-B, in which both CRISPR-Cas and TnsD target recognition are used.5254 Generally, the inactivated CRISPR-Cas effector recognizes the target sequence based on the presence of a PAM and complementarity to the guide RNA. Subsequently, TniQ and TnsC are recruited by the DNA-CRISPR-Cas effector complex. Activation of TnsA and TnsB by TnsC is predicted to occur in a manner similar to that of Tn7. This results in transposon insertion a certain distance from the target sequence previously recognized by the effector complex (Figure 5).

Even though the three systems, CAST subtypes I-B, I-F, and V-K, share similar components, the molecular mechanisms leading to DNA integration differ. Target recognition is the most evident difference due not only to the divergence between the CRISPR-Cas subtypes but also the distinct functions of the TnsD/TniQ homologues and the usage of gRNA variants.54,65 Moreover, donor DNA integration and resolution also differ depending on the presence or absence of TnsA in class 1 and class 2 systems.74,75

CAST Targeting Mechanisms

Homing transposition (transposon integration in the host’s chromosome) in the subtype I-B system is supposed to work in the same way as in Tn7. In this system, the targeting function of TnsD has not been completely replaced by the CRISPR-Cas effector. Instead, the tnsD gene was duplicated and each copy performs a specialized function. Homing transposition uses one TnsD copy to recruit TnsC, TnsA, and TnsB to the target site in the bacterial genome. The other copy, TniQ, is used together with Cascade to target mobile genetic elements54 (Figure 5).

In subtype I-F, the TnsD/TnsE targeting function has been completely replaced by the CRISPR-Cas effector. Homing transposition and mobile genetic element targeting are differentially achieved by categorizing the gRNA sequences encoded in the array65 (Figure 4). New spacer acquisition results in new spacers being added to the leader-proximal end of the CRISPR array and leader-proximal repeat duplication. Given errors in replication, leader-distal repeats (“older repeats”) are increasingly more divergent. As a result, the crRNA sequences can be divided between a highly divergent crRNA encoded in the leader-distal end and the more recently acquired sequences with less degenerate repeats. Petassi et al.65 found that the spacers from the divergent repeats were used for homing while the spacers from the less degenerate repeats were used for targeting MGEs. This categorization system is especially advantageous because I-F CAST gRNAs meant for transposon homing could potentially be used by the host’s canonical I-F CRISR-Cas system (i.e., a I-F Cascade complex comprising a catalytically active Cas3 nuclease). Because a gRNA coming from I-F CAST would target a DNA sequence in the host’s genome, its presence in a catalytically active Cascade complex would result in a DSB on the host’s target DNA. However, repeat divergence in I-F CAST renders the processed gRNA unusable by catalytically active I-F systems and therefore prevents target DNA cleavage.65 As an additional advantage of gRNA categorization, repeat divergence can even increase transposition efficiency in specific cases with a divergent homing gRNA having integration frequencies higher than those of gRNAs containing typical repeats.65

Subtype V-K also categorizes crRNA sequences albeit not in the same manner as subtype I-F.54 This system comprises crRNAs encoded in a regular CRISPR array and an ectopic crRNA coding sequence not included in it (Figure 4). The difference between both types of crRNA lies in not only its position within the transposon locus but also the length of the repeat. The ectopic crRNA presents a shortened repeat and is used for homing transposition targeting tRNAs encoding sequences, while the regular array is used for transposition into MGEs. The evolutionary cause for subtype V-K gRNA categorization is, however, still not clear. According to Saito et al.,54 avoidance of autoimmunity by another type V system is unlikely because no other type V systems co-occur in the current available genomes. Nonetheless, this might have been the case assuming the presence of an ancestral Cas12 nuclease during the first steps of Cas12k’s recruitment by the transposon.54

Transposon Excision and Integration

Subtypes I-B and I-F rely on TnsA and TnsB to carry out transposon integration (Figure 5).53,54 It is expected that the reaction mechanism for excision and insertion proceeds in the same way as in Tn7 because this system also comprises TnsA and TnsB as explained above. In contrast to subtypes I-B and I-F, TnsA is conspicuously absent in subtype V-K. This fact has implications for integration intermediate resolution given that no enzyme would cleave the 5′ end of the transposon DNA, leading to cointegrate transposition products.74,75

Transposons encoding only TnsB or TnsB homologues have different strategies for dealing with this problem.7678 In the case of Tn5053, the transposon contains resolvase TniR that uses res sites, also present in the transposon, to resolve the cointegrate79,80 (Figure 4). Other transposases can generate hairpins to excise the remaining DNA and later insert it into the target DNA.81 Moreover, DNA replication and RecA-dependent recombination can also solve the cointegrate in the case of replicative transposition, where the transposon increases its copy number as opposed to “cut-and-paste” transposition in which there is only one copy of the transposon.78,81

The cointegrate resolution mechanism in the subtype V-K system is still unknown. However, there are two points that suggest the V-K system might involve a specific resolvase present only in the natural host. First, even though the V-K transposon is present as a single copy in its natural host, when transposition is carried out in a heterologous system, 20% of insertion products are present as cointegrates.75 Second, upon comparison of the V-K system components (TnsB, TnsC, and TniQ) to other transposons, this combination is most similar to Tn5053, which contains TniA, TniB (TnsB and TnsC homologues), and TniQ.75 This could point to the existence of a resolvase not encoded in the V-K transposon but present in the natural host that would fill in the function of TniR. So far, this putative resolvase is unknown. Obviously, the understanding of this important stage during integration is key for the possible development of subtype V-K genome editors.

CAST Gene Editing

Notwithstanding the colossal advances in genome editing, most insertion techniques are still limited by sequence length, efficiency, DNA targeting/programmability, and homologous recombination dependency. For instance, current CRISPR-Cas techniques are programmable but depend on homologous recombination so they can be used on only actively dividing cells.82 Recombineering using the RecET or gamma-red system is also programmable but has a low efficiency.83 Transposase usage is either too specific (not easily programmable) as is the case for Tn718 or too unspecific as is the case for Tn519 and PiggyBac.20 CAST has been proposed as a promising strategy for overcoming the limitations of these systems. Because targeting depends on naturally inactivated RNA-guided nucleases, it can be programmed by changing the spacer in the crRNA/gRNA. Moreover, given that CAST has its own DNA integration system, it does not rely on homologous recombination and could therefore be used in virtually any cell type.52,53

To date, the capabilities of CAST to perform gene editing have been tested only in different species of bacteria. In the first of these studies, Vo et al.84 optimized subtype I-F achieving highly accurate and efficient DNA integration in bacteria, as well as multiplexed insertions and deletions. They called this system INTEGRATE (insertion of transposable elements by guide RNA-assisted targeting). INTEGRATE is composed of a single vector comprising all genes necessary for transposition (cas6cas8, tnsAtnsC, and tniQ), a crRNA array, and a donor DNA. INTEGRATE can transpose up to 10 kb cargo sequences with ∼100% efficiency, although multiple insertions occurring 5 kb apart have an only ∼20% efficiency due to transposition immunity (see CAST Gene Editing Challenges). Multiple insertions were also explored by means of an orthogonal system comprising subtype I-F and V-K transposons. The results of these experiments showed that these transposon subtypes do not display cross-reactivity; i.e., the transposase from subtype I-F does not mobilize the donor DNA from subtype V-K and vice versa. However, subtype V-K displayed a larger number of off-target integration events.

INTEGRATE can be also used to create deletions in combination with Cre recombinase. Briefly, Vo et al.84 used two target sites to perform a double insertion of a transposon with a LoxP site within its cargo. Afterward, the sequence comprised between the two LoxP sites was excised by Cre recombinase. Between 2.4 and 20 kb could be excised using this methodology proving INTEGRATE’s versatility as a genome editing tool.

Genome Editing in Bacterial Communities

Functionality in a broad range of species is a desirable feature for a genome editing tool. However, many traditional techniques fail in this regard because most microorganisms cannot be cultured using current protocols. Vo et al.84 and Rubin et al.85 tried to circumvent this problem by using INTEGRATE and DART (DNA-editing all-in-one RNA-guided CRISPR-Cas transposase), a system similar to INTEGRATE but comprising barcodes for environmental-transformation sequencing (ET-seq), in bacterial communities. Using either electroporation or conjugation, INTEGRATE and DART successfully inserted a cargo DNA into a bacterial consortium with no off-targets when using subtype I-F. However, when using subtype V-K, a vast majority of insertions were off-target. According to these results, INTEGRATE and DART have the potential to shed light on the function and fitness of specific genes in microbial communities while eliminating the limitations of isolating unculturable bacteria.84,85

Protein Engineering

Protein engineering has been used in two ways to optimize subtype V-K. First, additional components have been added to the excision and integration system to avoid cointegrates, and second, protein fusions have been designed to simplify the targeting system.86

In the first approach, given that subtype V-K lacks TnsA, a high percentage of its transposition products are cointegrates. In an effort to supply TnsA’s function, Tou et al.86 fused TnsB to a homing endonuclease mutant with specific ssDNA nickase activity (nAniI). These combinations led to a dramatic decrease in the level of cointegrate products from 20% to <1% when using target sequences present in plasmids and 3% when using genomic target sequences. However, only fusions of nAniI to the N-terminus of TnsB supported transposition while C-terminal fusions did not. This could be explained by pull-down assays performed by Querques et al.67 in which interaction between TnsC and TnsB was mediated by the C-terminus of the latter, meaning that nAniI could block TnsB:TnsC interaction.

In the second approach, Cas12k has been fused to either TniQ or TnsC and the insertion efficiency was measured. However, none of the constructs tested yielded an efficiency higher than that of the separate components.86 This could be due to the wrong orientation of the proteins imposed by the design of the construct, misfolding, steric impediment, etc. Further experimentation is needed to design simpler systems with efficiencies equal to or higher than those of the natural ones. Structural data are particularly important to guide CAST engineering efforts, as artificial intelligence programs have not yet developed reliable modules for predicting protein–nucleic acid interactions.

So far, the structures of the individual components of subtype V-K as well as the Cas12k-TnsC-TniQ complex and the I-F Cascade-TniQ complex are known.62,6668,87 These complexes reflect different steps in transposon integration particularly related to target recognition. Undoubtedly, this information will be fundamental to the redesign of CAST protein interactions and DNA binding. For instance, complexes comprising Cas and transposon proteins with affinities higher than those of the natural ones could be obtained by mutating the amino acids present in the interfaces between the proteins. This could potentially lead to complexes with higher stabilities or to simplified complexes where regions not necessary for protein interactions are removed. Obtaining the structure of complexes comprising TnsA and TnsB reflecting the different states in transposon excision and integration is the next step to understanding CAST’s molecular mechanism and consequently facilitating the design of optimized CAST variants suitable for genome editing.

CAST Gene Editing Challenges

Despite the recent advances in the determination of the integration mechanism of CASTs, a number of challenges are yet to be surpassed if this system is to be used for genome editing. Because TnsB activity relies on the sequence of LE and RE, CAST genome editing is not a scarless editing method. Transposon integration both leads to insertion of six to eight DNA repeats corresponding to the LE and RE and causes target site duplications in the insertion site (Figures 4 and 5).52,53 Additionally, integration directionality needs to be addressed. Despite certain insertion bias in the subtype V-K system, i.e., LE being inserted proximal to the target site and RE being inserted distal to the target site,52 transposition in subtype I-F still presents products with LE and RE being inserted either way.53 This phenomenon represents a problem for applications such as fixing exons or regulatory sequences. Generating a CAST system with integration unidirectionality would therefore be ideal. Accordingly, further studies of the TnsB insertion mechanism are still needed to be able to reduce the donor sequence requirements while ensuring the integration of the transposition product in one sense only.

With respect to TnsC, it constitutes an interesting challenge when CAST simplification is the aim. In the case of Tn7, it controls transposition immunity; i.e., no further transposon copies are inserted into the target locus after one copy has already been inserted.18 This mechanism involves TnsC binding and unbinding to the target DNA. Prior to transposition, TnsC binds to the target DNA. However, TnsB promotes TnsC’s ATP hydrolytic activity, which in turn leads to the disassembly of the TnsC ring/filament.67 Because TnsB remains bound to the transposon LE and RE in the post-transposition complex, TnsC cannot bind the DNA in the vicinity of the integrated transposon and consequently cannot recruit additional transposase complexes carrying another copy of the transposon.60 This means that trying to eliminate TnsC from the system by engineering a Cas12k or Cascade complex that interacts directly with TnsB might lead to multiple insertions of the donor DNA given that no component would regulate the transposase activity, only its targeting. However, if TnsC is to be kept, it might cause gene expression dysregulation because in the case of the V-K system, it wraps around DNA not only downstream of the targe sequence but also nonspecifically to any other DNA sequence.67,68 This might not be the case for the class 1 systems that are more similar to Tn7 and whose TnsC requires the presence of TnsD or a D-loop to interact with DNA.

Finally, even though the natural hosts of the V-K systems display only single on-target copies of the transposon, when they are used in other species, most insertions are off-target, in contrast to the I-F system.84,85 This could indicate that the I-F system would be more amenable to be used for biotechnological purposes.

Future Directions

The natural diversity of nucleoproteins is an unlimited source of new systems waiting to be harnessed for biotechnological applications. Chief among them are CRISPR-Cas and transposons, which have developed sophisticated mechanisms for DNA excision and integration during their evolutionary history, and from which countless gene editing applications have been produced. The recently described OMEGA and CAST transposons constitute new avenues for circumventing current gene editing challenges. The former has the potential to minimize the size of programmable nucleases, while the latter facilitates targeted DNA insertions of large payloads without relying on homologous recombination.

However, as prokaryotic systems, both face the challenge of accessing the chromatin-packed DNA when used in eukaryotic cells. Experimental evidence has shown that nucleosomes directly impede DNA binding and cleavage, while chromatin remodeling can restore editing by Cas9 and Cas12a.8890 How the new editors reviewed here could behave when facing chromatin is unknown. In the case of the OMEGA systems, while they have been shown to be able to perform editing in eukaryotic cells, their editing levels are low when compared to those of the classical Cas9 and Cas12a tools. This is similar for other RNA-guided nucleases, such as Cas12j,48 which share with the OMEGA family members a reduced protein moiety. A possible explanation for this lower editing level is that the force produced by conformational changes in these small protein subunits might not be enough to displace DNA-bound proteins preventing access to the target sequence in a chromatin environment.

In the case of CAST, their behavior in a chromatin context is unknown. It is possible that CAST might encounter difficulties in accessing close chromatin based on data from other Cas proteins showing how chromatin accessibility decreases gene editing efficiency.91 A similar phenomenon is observed with Tn7, which can use gfpt-1 and gfpt-2 sequences (the human analogues of glmS) as target sequences in vitro but displays impaired transposition when target DNA is reconstituted in nucleosomes.92 In contrast, an engineered Tn7 element has been used to generate a mutant library in 45% of the yeast genome in vivo.93 Taken together, these results suggest that CAST systems could have the potential to overcome the chromatin context, albeit with variable efficiency.

Nevertheless, further investigations addressing the behavior of OMEGA and CAST systems in a chromatin context need to be addressed in depth to evaluate their properties in genome editing. We expect that the current research on both systems will certainly accelerate their redesign as new gene editing tools in basic research and in biomedical applications.

Acknowledgments

G.M. is member of the Integrative Structural Biology Cluster (ISBUC) at the University of Copenhagen. The authors thank the members of the Montoya group for comments and discussions.

Glossary

Abbreviations

CRISPR

clustered regularly interspaced short palindromic repeats

Cas

CRISPR-associated proteins

OMEGA

obligate mobile element-guided activity

CAST

CRISPR-Cas-associated transposon

ZFN

zinc finger nuclease

TALEN

transcription activator-like effector nuclease

gRNA

guide RNA

DSB

double-strand break

DRP

DSB repair pathways

NHEJ

nonhomologous end joining

HDR

homology-directed repair

isLE

left elements from IS transposons

isRE

right elements from IS transposons

LE

left end

RE

right end

IscB

insertion sequences Cas9-like similar to TnpBs

ncRNA

noncoding RNA

crRNA

CRISPR RNA

leRNA

left element RNA

reRNA

right element RNA

IsrB

insertion sequence RuvC-like OrfB

IshB

insertion sequence HNH-like OrfB

glms

glucosamine 6-phosphate synthase gene

INTEGRATE

insertion of transposable elements by guide RNA-assisted targeting

DART

DNA-editing all-in-one RNA-guided CRISPR-Cas transposase

Author Contributions

F.T.-C. and A.C. contributed equally to this work. The manuscript was written through contributions of all authors. A.C. wrote the draft of the first part (OMEGA systems). F.T.-C. wrote the draft of the second part (CAST systems). A.C. and F.T.-C. prepared the figures. G.M. provided feedback on the initial drafts and contributed to the writing of the final manuscript. All authors read and corrected the manuscript and approved the final version of the manuscript.

The Novo Nordisk Foundation Center for Protein Research is supported financially by the Novo Nordisk Foundation (NNF14CC0001), a Distinguished Investigator award (NNF18OC0055061 to G.M.), and the Copenhagen Bioscience PhD Programme (NNF19SA0035440 to F.T.-C.).

The authors declare the following competing financial interest(s): G.M. is co-founder and member of the advisory board of TwelveBio. The University of Copenhagen has patents issued for the development of Cas12j CRISPR tools in which G.M. and A.C. are named as inventors. F.T.-C. declares no competing interests.

References

  1. Rouet P.; Smih F.; Jasin M. Expression of a site-specific endonuclease stimulates homologous recombination in mammalian cells. Proc. Natl. Acad. Sci. U. S. A. 1994, 91, 6064–6068. 10.1073/pnas.91.13.6064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Gasiunas G.; Barrangou R.; Horvath P.; Siksnys V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl. Acad. Sci. U. S. A. 2012, 109, E2579–E2586. 10.1073/pnas.1208507109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Jinek M.; Chylinski K.; Fonfara I.; Hauer M.; Doudna J. A.; Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 2012, 337, 816–821. 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Sternberg S. H.; Redding S.; Jinek M.; Greene E. C.; Doudna J. A. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature 2014, 507, 62–67. 10.1038/nature13011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Szczelkun M. D.; Tikhomirova M. S.; Sinkunas T.; Gasiunas G.; Karvelis T.; Pschera P.; Siksnys V.; Seidel R. Direct observation of R-loop formation by single RNA-guided Cas9 and Cascade effector complexes. Proc. Natl. Acad. Sci. U. S. A. 2014, 111, 9798–9803. 10.1073/pnas.1402597111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Globyte V.; Lee S. H.; Bae T.; Kim J. S.; Joo C. CRISPR/Cas9 searches for a protospacer adjacent motif by lateral diffusion. EMBO J. 2019, 38, e99466. 10.15252/embj.201899466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Paul B.; Chaubet L.; Verver D. E.; Montoya G. Mechanics of CRISPR-Cas12a and engineered variants on lambda-DNA. Nucleic Acids Res. 2022, 50, 5208–5225. 10.1093/nar/gkab1272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Koonin E. V.; Makarova K. S. Origins and evolution of CRISPR-Cas systems. Philos. Trans. R. Soc. B 2019, 374, 20180087. 10.1098/rstb.2018.0087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Makarova K. S.; Wolf Y. I.; Iranzo J.; Shmakov S. A.; Alkhnbashi O. S.; Brouns S. J. J.; Charpentier E.; Cheng D.; Haft D. H.; Horvath P.; Moineau S.; Mojica F. J. M.; Scott D.; Shah S. A.; Siksnys V.; Terns M. P.; Venclovas C.; White M. F.; Yakunin A. F.; Yan W.; Zhang F.; Garrett R. A.; Backofen R.; van der Oost J.; Barrangou R.; Koonin E. V. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat. Rev. Microbiol 2020, 18, 67–83. 10.1038/s41579-019-0299-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Koonin E. V.; Makarova K. S. Evolutionary plasticity and functional versatility of CRISPR systems. PLoS Biology 2022, 20, e3001481 10.1371/journal.pbio.3001481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Porteus M. Genome Editing: A New Approach to Human Therapeutics. Annu. Rev. Pharmacol Toxicol 2016, 56, 163–190. 10.1146/annurev-pharmtox-010814-124454. [DOI] [PubMed] [Google Scholar]
  12. Wright A. V.; Nunez J. K.; Doudna J. A. Biology and Applications of CRISPR Systems: Harnessing Nature’s Toolbox for Genome Engineering. Cell 2016, 164, 29–44. 10.1016/j.cell.2015.12.035. [DOI] [PubMed] [Google Scholar]
  13. Saha K.; Sontheimer E. J.; Brooks P. J.; Dwinell M. R.; Gersbach C. A.; Liu D. R.; Murray S. A.; Tsai S. Q.; Wilson R. C.; Anderson D. G.; Asokan A.; Banfield J. F.; Bankiewicz K. S.; Bao G.; Bulte J. W. M.; Bursac N.; Campbell J. M.; Carlson D. F.; Chaikof E. L.; Chen Z.-Y.; Cheng R. H.; Clark K. J.; Curiel D. T.; Dahlman J. E.; Deverman B. E.; Dickinson M. E.; Doudna J. A.; Ekker S. C.; Emborg M. E.; Feng G.; Freedman B. S.; Gamm D. M.; Gao G.; Ghiran I. C.; Glazer P. M.; Gong S.; Heaney J. D.; Hennebold J. D.; Hinson J. T.; Khvorova A.; Kiani S.; Lagor W. R.; Lam K. S.; Leong K. W.; Levine J. E.; Lewis J. A.; Lutz C. M.; Ly D. H.; Maragh S.; McCray P. B.; McDevitt T. C.; Mirochnitchenko O.; Morizane R.; Murthy N.; Prather R. S.; Ronald J. A.; Roy S.; Roy S.; Sabbisetti V.; Saltzman W. M.; Santangelo P. J.; Segal D. J.; Shimoyama M.; Skala M. C.; Tarantal A. F.; Tilton J. C.; Truskey G. A.; Vandsburger M.; Watts J. K.; Wells K. D.; Wolfe S. A.; Xu Q.; Xue W.; Yi G.; Zhou J. The NIH Somatic Cell Genome Editing program. Nature 2021, 592, 195–204. 10.1038/s41586-021-03191-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Slaymaker I. M.; Gaudelli N. M. Engineering Cas9 for human genome editing. Curr. Opin Struct Biol. 2021, 69, 86–98. 10.1016/j.sbi.2021.03.004. [DOI] [PubMed] [Google Scholar]
  15. Bravo J. P. K.; Liu M. S.; Hibshman G. N.; Dangerfield T. L.; Jung K.; McCool R. S.; Johnson K. A.; Taylor D. W. Structural basis for mismatch surveillance by CRISPR-Cas9. Nature 2022, 603, 343–347. 10.1038/s41586-022-04470-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Anzalone A. V.; Koblan L. W.; Liu D. R. Genome editing with CRISPR-Cas nucleases, base editors, transposases and prime editors. Nat. Biotechnol. 2020, 38, 824–844. 10.1038/s41587-020-0561-9. [DOI] [PubMed] [Google Scholar]
  17. Yeh C. D.; Richardson C. D.; Corn J. E. Advances in genome editing through control of DNA repair pathways. Nat. Cell Biol. 2019, 21, 1468–1478. 10.1038/s41556-019-0425-z. [DOI] [PubMed] [Google Scholar]
  18. Peters J. E. Tn7. Microbiol. Spectrum 2014, 10.1128/microbiolspec.MDNA3-0010-2014. [DOI] [PubMed] [Google Scholar]
  19. Haniford D. B.; Ellis M. J. Transposons Tn10 and Tn5. Microbiol. Spectrum 2015, 10.1128/microbiolspec.MDNA3-0002-2014. [DOI] [PubMed] [Google Scholar]
  20. Yusa K. piggyBac transposon. Microbiol. Spectrum 2015, 10.1128/microbiolspec.MDNA3-0028-2014. [DOI] [PubMed] [Google Scholar]
  21. Malamy M. H.; Fiandt M.; Szybalski W. Electron microscopy of polar insertions in the lac operon of Escherichia coli. Mol. Gen Genet 1972, 119, 207–222. 10.1007/BF00333859. [DOI] [PubMed] [Google Scholar]
  22. Hirsch H. J.; Saedler H.; Starlinger P. Insertion mutations in the control region of the galactose operon of E. coli. II. Physical characterization of the mutations. Mol. Gen Genet 1972, 115, 266–276. 10.1007/BF00268890. [DOI] [PubMed] [Google Scholar]
  23. Fiandt M.; Szybalski W.; Malamy M. H. Polar mutations in lac, gal and phage lambda consist of a few IS-DNA sequences inserted with either orientation. Mol. Gen Genet 1972, 119, 223–231. 10.1007/BF00333860. [DOI] [PubMed] [Google Scholar]
  24. Hirsch H. J.; Starlinger P.; Brachet P. Two kinds of insertions in bacterial genes. Mol. Gen Genet 1972, 119, 191–206. 10.1007/BF00333858. [DOI] [PubMed] [Google Scholar]
  25. He S.; Corneloup A.; Guynet C.; Lavatine L.; Caumont-Sarcos A.; Siguier P.; Marty B.; Dyda F.; Chandler M.; Ton Hoang B. The IS200/IS605 Family and ″Peel and Paste″ Single-strand Transposition Mechanism. Microbiol. Spectrum 2015, 10.1128/microbiolspec.MDNA3-0039-2014. [DOI] [PubMed] [Google Scholar]
  26. Ton-Hoang B.; Pasternak C.; Siguier P.; Guynet C.; Hickman A. B.; Dyda F.; Sommer S.; Chandler M. Single-stranded DNA transposition is coupled to host replication. Cell 2010, 142, 398–408. 10.1016/j.cell.2010.06.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Koonin E. V.; Ilyina T. V. Computer-assisted dissection of rolling circle DNA replication. Biosystems 1993, 30, 241–268. 10.1016/0303-2647(93)90074-M. [DOI] [PubMed] [Google Scholar]
  28. Ton-Hoang B.; Guynet C.; Ronning D. R.; Cointin-Marty B.; Dyda F.; Chandler M. Transposition of ISHp608, member of an unusual family of bacterial insertion sequences. EMBO J. 2005, 24, 3325–3338. 10.1038/sj.emboj.7600787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guynet C.; Hickman A. B.; Barabas O.; Dyda F.; Chandler M.; Ton-Hoang B. In vitro reconstitution of a single-stranded transposition mechanism of IS608. Mol. Cell 2008, 29, 302–312. 10.1016/j.molcel.2007.12.008. [DOI] [PubMed] [Google Scholar]
  30. Barabas O.; Ronning D. R.; Guynet C.; Hickman A. B.; Ton-Hoang B.; Chandler M.; Dyda F. Mechanism of IS200/IS605 family DNA transposases: activation and transposon-directed target site selection. Cell 2008, 132, 208–220. 10.1016/j.cell.2007.12.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Pasternak C.; Ton-Hoang B.; Coste G.; Bailone A.; Chandler M.; Sommer S. Irradiation-induced Deinococcus radiodurans genome fragmentation triggers transposition of a single resident insertion sequence. PLoS Genet 2010, 6, e1000799 10.1371/journal.pgen.1000799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hickman A. B.; James J. A.; Barabas O.; Pasternak C.; Ton-Hoang B.; Chandler M.; Sommer S.; Dyda F. DNA recognition and the precleavage state during single-stranded DNA transposition in D. radiodurans. EMBO J. 2010, 29, 3840–3852. 10.1038/emboj.2010.241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kersulyte D.; Akopyants N. S.; Clifton S. W.; Roe B. A.; Berg D. E. Novel sequence organization and insertion specificity of IS605 and IS606: chimaeric transposable elements of Helicobacter pylori. Gene 1998, 223, 175–186. 10.1016/S0378-1119(98)00164-4. [DOI] [PubMed] [Google Scholar]
  34. Kersulyte D.; Mukhopadhyay A. K.; Shirai M.; Nakazawa T.; Berg D. E. Functional organization and insertion specificity of IS607, a chimeric element of Helicobacter pylori. J. Bacteriol. 2000, 182, 5300–5308. 10.1128/JB.182.19.5300-5308.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kersulyte D.; Velapatino B.; Dailide G.; Mukhopadhyay A. K.; Ito Y.; Cahuayme L.; Parkinson A. J.; Gilman R. H.; Berg D. E. Transposable element ISHp608 of Helicobacter pylori: nonrandom geographic distribution, functional organization, and insertion specificity. J. Bacteriol. 2002, 184, 992–1002. 10.1128/jb.184.4.992-1002.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Pasternak C.; Dulermo R.; Ton-Hoang B.; Debuchy R.; Siguier P.; Coste G.; Chandler M.; Sommer S. ISDra2 transposition in Deinococcus radiodurans is downregulated by TnpB. Mol. Microbiol. 2013, 88, 443–455. 10.1111/mmi.12194. [DOI] [PubMed] [Google Scholar]
  37. Kapitonov V. V.; Makarova K. S.; Koonin E. V. ISC, a Novel Group of Bacterial and Archaeal DNA Transposons That Encode Cas9 Homologs. J. Bacteriol. 2016, 198, 797–807. 10.1128/JB.00783-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Altae-Tran H.; Kannan S.; Demircioglu F. E.; Oshiro R.; Nety S. P.; McKay L. J.; Dlakic M.; Inskeep W. P.; Makarova K. S.; Macrae R. K.; Koonin E. V.; Zhang F. The widespread IS200/IS605 transposon family encodes diverse programmable RNA-guided endonucleases. Science 2021, 374, 57–65. 10.1126/science.abj6856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Shmakov S.; Smargon A.; Scott D.; Cox D.; Pyzocha N.; Yan W.; Abudayyeh O. O.; Gootenberg J. S.; Makarova K. S.; Wolf Y. I.; Severinov K.; Zhang F.; Koonin E. V. Diversity and evolution of class 2 CRISPR-Cas systems. Nat. Rev. Microbiol 2017, 15, 169–182. 10.1038/nrmicro.2016.184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Karvelis T.; Druteika G.; Bigelyte G.; Budre K.; Zedaveinyte R.; Silanskas A.; Kazlauskas D.; Venclovas C.; Siksnys V. Transposon-associated TnpB is a programmable RNA-guided DNA endonuclease. Nature 2021, 599, 692–696. 10.1038/s41586-021-04058-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Gomes-Filho J. V.; Zaramela L. S.; Italiani V. C.; Baliga N. S.; Vencio R. Z.; Koide T. Sense overlapping transcripts in IS1341-type transposase genes are functional non-coding RNAs in archaea. RNA Biol. 2015, 12, 490–500. 10.1080/15476286.2015.1019998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Jager D.; Forstner K. U.; Sharma C. M.; Santangelo T. J.; Reeve J. N. Primary transcriptome map of the hyperthermophilic archaeon Thermococcus kodakarensis. BMC Genomics 2014, 15, 684. 10.1186/1471-2164-15-684. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Weinberg Z.; Perreault J.; Meyer M. M.; Breaker R. R. Exceptional structured noncoding RNAs revealed by bacterial metagenome analysis. Nature 2009, 462, 656–659. 10.1038/nature08586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sasnauskas G.; Siksnys V. CRISPR adaptation from a structural perspective. Curr. Opin Struct Biol. 2020, 65, 17–25. 10.1016/j.sbi.2020.05.015. [DOI] [PubMed] [Google Scholar]
  45. Schuler G.; Hu C.; Ke A. Structural basis for RNA-guided DNA cleavage by IscB-omegaRNA and mechanistic comparison with Cas9. Science 2022, eabq7220 10.1126/science.abq7220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Bao W.; Jurka J. Homologues of bacterial TnpB_IS605 are widespread in diverse eukaryotic transposable elements. Mob DNA 2013, 4, 12. 10.1186/1759-8753-4-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Islam S. M.; Hua Y.; Ohba H.; Satoh K.; Kikuchi M.; Yanagisawa T.; Narumi I. Characterization and distribution of IS8301 in the radioresistant bacterium Deinococcus radiodurans. Genes Genet Syst 2003, 78, 319–327. 10.1266/ggs.78.319. [DOI] [PubMed] [Google Scholar]
  48. Pausch P.; Al-Shayeb B.; Bisom-Rapp E.; Tsuchida C. A.; Li Z.; Cress B. F.; Knott G. J.; Jacobsen S. E.; Banfield J. F.; Doudna J. A. CRISPR-CasPhi from huge phages is a hypercompact genome editor. Science 2020, 369, 333–337. 10.1126/science.abb1400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Carabias A.; Fuglsang A.; Temperini P.; Pape T.; Sofos N.; Stella S.; Erlendsson S.; Montoya G. Structure of the mini-RNA-guided endonuclease CRISPR-Cas12j3. Nat. Commun. 2021, 12, 4476. 10.1038/s41467-021-24707-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Pausch P.; Soczek K. M.; Herbst D. A.; Tsuchida C. A.; Al-Shayeb B.; Banfield J. F.; Nogales E.; Doudna J. A. DNA interference states of the hypercompact CRISPR-CasPhi effector. Nat. Struct Mol. Biol. 2021, 28, 652–661. 10.1038/s41594-021-00632-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Schmidt M. J.; Gupta A.; Bednarski C.; Gehrig-Giannini S.; Richter F.; Pitzler C.; Gamalinda M.; Galonska C.; Takeuchi R.; Wang K.; Reiss C.; Dehne K.; Lukason M. J.; Noma A.; Park-Windhol C.; Allocca M.; Kantardzhieva A.; Sane S.; Kosakowska K.; Cafferty B.; Tebbe J.; Spencer S. J.; Munzer S.; Cheng C. J.; Scaria A.; Scharenberg A. M.; Cohnen A.; Coco W. M. Improved CRISPR genome editing using small highly active and specific engineered RNA-guided nucleases. Nat. Commun. 2021, 12, 4219. 10.1038/s41467-021-24454-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Strecker J.; Ladha A.; Gardner Z.; Schmid-Burgk J. L.; Makarova K. S.; Koonin E. V.; Zhang F. RNA-guided DNA insertion with CRISPR-associated transposases. Science 2019, 365, 48–53. 10.1126/science.aax9181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Klompe S. E.; Vo P. L. H.; Halpin-Healy T. S.; Sternberg S. H. Transposon-encoded CRISPR–Cas systems direct RNA-guided DNA integration. Nature 2019, 571, 219–225. 10.1038/s41586-019-1323-z. [DOI] [PubMed] [Google Scholar]
  54. Saito M.; Ladha A.; Strecker J.; Faure G.; Neumann E.; Altae-Tran H.; Macrae R. K.; Zhang F. Dual modes of CRISPR-associated transposon homing. Cell 2021, 184, 2441–2453.e18. 10.1016/j.cell.2021.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Peters J. E. Targeted transposition with Tn7 elements: safe sites, mobile plasmids, CRISPR/Cas and beyond. Mol. Microbiol. 2019, 112, 1635–1644. 10.1111/mmi.14383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Shmakov S.; Smargon A.; Scott D.; Cox D.; Pyzocha N.; Yan W.; Abudayyeh O. O.; Gootenberg J. S.; Makarova K. S.; Wolf Y. I.; Severinov K.; Zhang F.; Koonin E. V. Diversity and evolution of class 2 CRISPR-Cas systems. Nature Reviews Microbiology 2017, 15, 169–182. 10.1038/nrmicro.2016.184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Faure G.; Shmakov S. A.; Yan W. X.; Cheng D. R.; Scott D. A.; Peters J. E.; Makarova K. S.; Koonin E. V. CRISPR–Cas in mobile genetic elements: counter-defence and beyond. Nature Reviews Microbiology 2019, 17, 513–525. 10.1038/s41579-019-0204-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Peters J. E.; Makarova K. S.; Shmakov S.; Koonin E. V. Recruitment of CRISPR-Cas systems by Tn7-like transposons. Proc. Natl. Acad. Sci. U. S. A. 2017, 114, E7358–E7366. 10.1073/pnas.1709035114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Rybarski J. R.; Hu K.; Hill A. M.; Wilke C. O.; Finkelstein I. J. Metagenomic discovery of CRISPR-associated transposons. Proc. Natl. Acad. Sci. U. S. A. 2021, 118, e2112279118 10.1073/pnas.2112279118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Shen Y.; Gomez-Blanco J.; Petassi M. T.; Peters J. E.; Ortega J.; Guarné A. Structural basis for DNA targeting by the Tn7 transposon. Nat. Struct. Mol. Biol. 2022, 29, 143–141. 10.1038/s41594-022-00724-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kaczmarska Z.; Czarnocki-Cieciura M.; Górecka-Minakowska K. M.; Wingo R. J.; Jackiewicz J.; Zajko W.; Poznan ski J. T.; Rawski M.; Grant T.; Peters J. E.; Nowotny M. Structural basis of transposon end recognition explains central features of Tn7 transposition systems. Mol. Cell 2022, 82, 2618–2632.e7. 10.1016/j.molcel.2022.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Halpin-Healy T. S.; Klompe S. E.; Sternberg S. H.; Fernández I. S. Structural basis of DNA targeting by a transposon-encoded CRISPR–Cas system. Nature 2020, 577, 271–274. 10.1038/s41586-019-1849-0. [DOI] [PubMed] [Google Scholar]
  63. Wang B.; Xu W.; Yang H. Structural basis of a Tn7-like transposase recruitment and DNA loading to CRISPR-Cas surveillance complex. Cell Research 2020, 30, 185–187. 10.1038/s41422-020-0274-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Jia N.; Xie W.; de la Cruz M. J.; Eng E. T.; Patel D. J. Structure–function insights into the initial step of DNA integration by a CRISPR–Cas–Transposon complex. Cell Research 2020, 30, 182–184. 10.1038/s41422-019-0272-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Petassi M. T.; Hsieh S.-C.; Peters J. E. Guide RNA categorization enables target site choice in Tn7-CRISPR-Cas transposons. Cell 2020, 183, 1757–1771. 10.1016/j.cell.2020.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Xiao R.; Wang S.; Han R.; Li Z.; Gabel C.; Mukherjee I. A.; Chang L. Structural basis of target DNA recognition by CRISPR-Cas12k for RNA-guided DNA transposition. Mol. Cell 2021, 81, 4457–4466.e5. 10.1016/j.molcel.2021.07.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Querques I.; Schmitz M.; Oberli S.; Chanez C.; Jinek M. Target site selection and remodelling by type V CRISPR-transposon systems. Nature 2021, 599, 497–502. 10.1038/s41586-021-04030-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Park J.-U.; Tsai A. W.; Mehrotra E.; Petassi M. T.; Hsieh S.-C.; Ke A.; Peters J. E.; Kellogg E. H. Structural basis for target site selection in RNA-guided DNA transposition systems. Science 2021, 373, 768–774. 10.1126/science.abi8976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Choi K. Y.; Spencer J. M.; Craig N. L. The Tn7 transposition regulator TnsC interacts with the transposase subunit TnsB and target selector TnsD. Proc. Natl. Acad. Sci. U. S. A. 2014, 111, 2858–2865. 10.1073/pnas.1409869111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Choi K. Y.; Li Y.; Sarnovsky R.; Craig N. L. Direct interaction between the TnsA and TnsB subunits controls the heteromeric Tn7 transposase. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, E2038 10.1073/pnas.1305716110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Ronning D. R.; Li Y.; Perez Z. N.; Ross P. D.; Hickman A. B.; Craig N. L.; Dyda F. The carboxy-terminal portion of TnsC activates the Tn7 transposase through a specific interaction with TnsA. EMBO Journal 2004, 23, 2972–2981. 10.1038/sj.emboj.7600311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. May E. W.; Craig N. L. Switching from Cut-and-Paste to Replicative Tn7 Transposition. Science 1996, 272, 401–404. 10.1126/science.272.5260.401. [DOI] [PubMed] [Google Scholar]
  73. Parks A. R.; Li Z.; Shi Q.; Owens R. M.; Jin M. M.; Peters J. E. Transposition into Replicating DNA Occurs through Interaction with the Processivity Factor. Cell 2009, 138, 685–695. 10.1016/j.cell.2009.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Rice P. A.; Craig N. L.; Dyda F. Comment on “RNA-guided DNA insertion with CRISPR- associated transposases. Science 2020, 368, abb2022. 10.1126/science.abb2022. [DOI] [PubMed] [Google Scholar]
  75. Strecker J.; Ladha A.; Makarova K. S.; Koonin E. V.; Zhang F. Response to Comment on “RNA-guided DNA insertion with CRISPR-associated transposases. Science 2020, 368, abb2920. 10.1126/science.abb2920. [DOI] [PubMed] [Google Scholar]
  76. Kholodii G. Y.; Mindlin S. Z.; Bass I. A.; Yurieva O. V.; Minakhina S. V.; Nikiforov V. G. Four genes, two ends, a res region are involved in transposition of Tn5053: a paradigm for a novel family of transposons carrying either a mer operon or an integron. Mol. Microbiol. 1995, 17, 1189. 10.1111/j.1365-2958.1995.mmi_17061189.x. [DOI] [PubMed] [Google Scholar]
  77. Nicolas E.; Lambin M.; Dandoy D.; Galloy C.; Nguyen N.; Oger C. A.; Hallet B. The Tn3-family of replicative transposons. Microbiol. Spectrum 2014, 10.1128/microbiolspec.MDNA3-0060-2014. [DOI] [PubMed] [Google Scholar]
  78. Siguier P.; Gourbeyre E.; Chandler M. Known knowns, known unknowns and unknown unknowns in prokaryotic transposition. Curr. Opin. Microbiol. 2017, 38, 171–180. 10.1016/j.mib.2017.06.005. [DOI] [PubMed] [Google Scholar]
  79. Petrovski S.; Stanisich V. A. Tn502 and Tn512 are res site hunters that provide evidence of resolvase-independent transposition to random sites. Journal of bacteriology 2010, 192, 1865–1874. 10.1128/JB.01322-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Minakhina S.; Kholodii G.; Mindlin S.; Yurieva O.; Nikiforov V. Tn5053 family transposons are res sites hunters sensing plasmidal res sites occupied by cognate resolvases. Mol. Microbiol. 1999, 33, 1059. 10.1046/j.1365-2958.1999.01548.x. [DOI] [PubMed] [Google Scholar]
  81. Chen Q.; Luo W.; Veach R. A.; Hickman A. B.; Wilson M. H.; Dyda F. Structural basis of seamless excision and specific targeting by piggyBac transposase. Nat. Commun. 2020, 11, 3446. 10.1038/s41467-020-17128-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Arnoult N.; Correia A.; Ma J.; Merlo A.; Garcia-Gomez S.; Maric M.; Tognetti M.; Benner C. W.; Boulton S. J.; Saghatelian A.; Karlseder J. Regulation of DNA repair pathway choice in S and G2 phases by the NHEJ inhibitor CYREN. Nature 2017, 549, 548–552. 10.1038/nature24023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wang H. H.; Isaacs F. J.; Carr P. A.; Sun Z. Z.; Xu G.; Forest C. R.; Church G. M. Programming cells by multiplex genome engineering and accelerated evolution. Nature 2009, 460, 894–898. 10.1038/nature08187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Vo P. L. H.; Ronda C.; Klompe S. E.; Chen E. E.; Acree C.; Wang H. H.; Sternberg S. H. CRISPR RNA-guided integrases for high-efficiency, multiplexed bacterial genome engineering. Nat. Biotechnol. 2021, 39, 480–489. 10.1038/s41587-020-00745-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Rubin B. E.; Diamond S.; Cress B. F.; Crits-Christoph A.; Lou Y. C.; Borges A. L.; Shivram H.; He C.; Xu M.; Zhou Z.; Smith S. J.; Rovinsky R.; Smock D. C. J.; Tang K.; Owens T. K.; Krishnappa N.; Sachdeva R.; Barrangou R.; Deutschbauer A. M.; Banfield J. F.; Doudna J. A. Species- and site-specific genome editing in complex bacterial communities. Nature Microbiology 2022, 7, 34–47. 10.1038/s41564-021-01014-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Tou C. J.; Orr B.; Kleinstiver P. Cut-and-Paste DNA Insertion with Engineered Type V-K CRISPR-associated Transposases. bioRxiv 2022, 10.1101/2022.01.07.475005. [DOI] [PubMed] [Google Scholar]
  87. Schmitz M.; Querques I.; Oberli S.; Chanez C.; Jinek M. Structural basis for RNA-mediated assembly of type V CRISPR-associated transposons. bioRxiv 2022, 10.1101/2022.06.17.496590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Horlbeck M. A.; Witkowsky L. B.; Guglielmi B.; Replogle J. M.; Gilbert L. A.; Villalta J. E.; Torigoe S. E.; Tjian R.; Weissman J. S. Nucleosomes impede Cas9 access to DNA in vivo and in vitro. eLife 2016, 5, e12677. 10.7554/eLife.12677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Strohkendl I.; Saifuddin F. A.; Gibson B. A.; Rosen M. K.; Russell R.; Finkelstein I. J. Inhibition of CRISPR-Cas12a DNA targeting by nucleosomes and chromatin. Sci. Adv. 2021, 7, eabd6030 10.1126/sciadv.abd6030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Yarrington R. M.; Verma S.; Schwartz S.; Trautman J. K.; Carroll D. Nucleosomes inhibit target cleavage by CRISPR-Cas9 in vivo. Proc. Natl. Acad. Sci. U. S. A. 2018, 115, 9351–9358. 10.1073/pnas.1810062115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Jensen K. T.; Fløe L.; Petersen T. S.; Huang J.; Xu F.; Bolund L.; Luo Y.; Lin L. Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency. FEBS Lett. 2017, 591, 1892–1901. 10.1002/1873-3468.12707. [DOI] [PubMed] [Google Scholar]
  92. Kuduvalli P. N.; Mitra R.; Craig N. L. Site-specific Tn7 transposition into the human genome. Nucleic Acids Res. 2005, 33, 857–863. 10.1093/nar/gki227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Kumar A.; Seringhaus M.; Biery M. C.; Sarnovsky R. J.; Umansky L.; Piccirillo S.; Heidtman M.; Cheung K. H.; Dobry C. J.; Gerstein M. B.; Craig N. L.; Snyder M. Large-scale mutagenesis of the yeast genome using a Tn7-derived multipurpose transposon. Genome Res. 2004, 14, 1975–1986. 10.1101/gr.2875304. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biochemistry are provided here courtesy of American Chemical Society

RESOURCES