Skip to main content
Genome Biology logoLink to Genome Biology
. 2025 Sep 23;26:297. doi: 10.1186/s13059-025-03756-7

Fueling chromosomal gene diversification and artificial evolution with CRISPR

Ruiying Zhu 1,2,3,#, Chuanhong Ren 1,2,3,#, Zehua Bao 1,2,3,4,
PMCID: PMC12455791  PMID: 40988083

Abstract

Gene diversification is an effective approach to massively dissecting variant functions and evolving sequences when paired with an appropriate assay. In vitro mutagenesis and ectopic gene expression, however, fail to simulate the endogenous regulatory environment of the variants. The development of clustered, regularly interspaced short palindromic repeats (CRISPR) systems has greatly boosted the efficiency of targeted gene diversification in various species. Here, we review recent CRISPR-assisted methods for chromosomal gene diversification and artificial evolution, focusing on the advantages and limitations of each approach, and propose possible strategies to overcome current limitations and directions in future technology development.

Supplementary Information

The online version contains supplementary material available at 10.1186/s13059-025-03756-7.

Keywords: Gene diversification, Evolution, CRISPR, Functional genomics, Organism engineering

Introduction

Gene diversification, the process of generating a large number of distinct genetic variants, facilitates the screening and analysis of functional sequences. In addition, iterative diversification and selection improves gene and protein functions for various biotechnological innovations. Error-prone PCR [1], site saturation mutagenesis [2, 3], and DNA shuffling [4, 5] are just a few methods in the well-developed in vitro gene diversification toolbox. A common limitation is that the generated mutant libraries need to be introduced into host organisms and expressed ectopically to conduct phenotypic screening, which is cumbersome and limited by host-dependent transformation efficiencies [6, 7]. In vivo continuous evolution diversifies genes expressed from artificial replication/transcription systems [8]. Although these in vivo orthogonal systems avoid iterative cloning and transformation/transfection steps, they still require the genes of interest to be expressed on plasmids or at a non-native genomic site. Such ectopic expression is often subject to the influences of plasmid instability and copy number effects, or of chromosomal position effects if integrated [9]. Therefore, these methods fail to recapitulate the endogenous regulatory environments, which can be essential for analyzing variant functions. In addition, ectopic expression systems are yet to be established in some species, such as plants. As a result, heterologous gene diversification methods have been historically applied to studying protein fitness landscapes and evolving individual proteins in more permissive model organisms. Their applications in functional genomics and organism engineering have thus far been limited. Lastly, large structural variants at the genome level, including large deletions, duplications, inversions, and translocations, have been impractical to engineer in vitro without costly whole genome synthesis [10].

Chromosomal gene diversification, which we refer to as the direct generation of genetic variants at the native chromosome position, permits reliable functional profiling in situ [11]. Traditionally, this has been attempted with spontaneous, chemical, or physical mutageneses. However, these approaches are often inefficient with low gene mutation frequency [12, 13]. In addition, such mutations are randomly distributed genome-wide, leaving the interested genomic region rarely perturbed. Oligonucleotide-mediated recombineering is an important progress towards chromosomal gene diversification with customized point mutations [14]. However, to date it is only successfully applied to bacterial species [1518] and transgenes in yeast [1921]. In recent years, traditional recombineering has been upgraded by using the clustered, regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated proteins (Cas) system to generate a targeted DNA double-strand break (DSB) to boost mutagenesis frequency. This has led to the development of homology-directed repair (HDR) based variant engineering methods in bacteria, yeast, and human cell lines. Other recombination-independent methods have also been developed to enable gene diversification in species where homologous recombination is inefficient. These methods include using a dead or nicking Cas protein with deaminases, reverse transcriptases, and error-prone DNA polymerases. With the support of CRISPR technologies, large-scale targeted chromosomal gene diversification has become available by simply generating single-guide RNA (sgRNA) libraries (Fig. 1), enabling the rapid interrogation of various cellular processes in multiple species and aiding in the development of useful products ranging from therapeutics to bio-based products. CRISPR-based methods, in combination with other genome editing agents such as recombinases and transposases, are also enabling us to engineer targeted large structural variants. In this review, we summarize the various methods of CRISPR-assisted chromosomal gene diversification, compare their advantages and limitations, and discuss emerging applications and future directions for technology improvement.

Fig. 1.

Fig. 1

Workflow of CRISPR-assisted chromosomal gene diversification methods. sgRNAs and/or templates are designed as oligonucleotide libraries to cover the target chromosomal region. After cloning as a plasmid library, these reagents can be introduced into host organisms using transformation methods tailored to specific species. Under conditions of interest, specific phenotypes are enriched, allowing researchers to link these phenotypes to genotypes through sequencing

CRISPR systems facilitate programmable genomic targeting

CRISPR/Cas systems are bacterial innate immune systems capable of recognizing, binding, and cleaving foreign nucleic acids. Based on their effector complex architecture, CRISPR/Cas systems are classified into two major classes and six types. Class 1 systems employ multi-subunit effector complexes and include Type I (signature protein: Cas3), Type III (Cas10), and Type IV systems. Class 2 systems utilize single-protein effectors and include Type II (Cas9), Type V (Cas12), and Type VI (Cas13). Notably, Cas9 and Cas12 primarily target DNA, while Cas13 is specialized for RNA cleavage [22]. Since their discovery, various CRISPR/Cas systems have been engineered and reprogrammed as precision genome editing tools. The most widely used system to date is the Class II, type II CRISPR/Cas9 system from Streptococcus pyogenes [23]. In this system, the Cas9 protein is targeted by the sgRNA to the genomic locus referred to as a protospacer, whose sequence is adjacent to a protospacer adjacent motif (PAM) and matches the guide sequence of the sgRNA (Fig. 2a) [23]. It subsequently cleaves the target DNA to generate a DSB, which is repaired through non-homologous end-joining (NHEJ) or HDR pathways of the host cell [24]. dCas9 (dead Cas9) [25] and nCas9 (nickase Cas9) [26] are mutants derived from Cas9 by inactivating both nuclease domains (HNH and RuvC) in the case of dCas9 or a single nuclease domain (HNH or RuvC) in the case of nCas9. dCas9 lacks DNA cleavage activity but retains the ability to accurately target and bind to DNA, making it a recruiting platform for additional effector proteins [27]. nCas9 retains the ability to cleave one of the two DNA strands, which facilitates advanced genome editing techniques such as base editing and prime editing [28, 29].

Fig. 2.

Fig. 2

CRISPR-stimulated homology-directed repair and its applications. a CRISPR-HDR involves Cas9 recognizing the PAM sequence, forming a R-loop, and cleaving the chromosomal gene to generate the DSB. Repair templates containing mutated sequences are subsequently integrated into the target site. SpCas9, the commonly used Cas9 from Streptococcus pyogenes. PID, PAM interaction domain. PAM, protospacer-adjacent motif. sgRNA, single guide RNA. b Saturation genome editing (SGE) in human cells involves the cutting of a specific genomic region (usually an exon around 100 bp in length) by one sgRNA, followed by the use of HDR to introduce SNVs. Multiple exons can be diversified with each SNV library targeting one exon in one experiment. High-throughput sequencing quantifies the frequency of SNVs before and after screen, allowing for the assessment of variant impacts based on SNV abundance changes. c CasPER (Cas9-mediated Protein Evolution Reaction) combines error-prone PCR with Cas9-mediated HDR for in situ integration of large diversified DNA fragments (300 ~ 600 bp) in yeast. CasPER enables multiplex engineering of metabolic pathway genes for identifying high-yield strains. d The guide + donor strategy pairs each sgRNA with a short mutagenic DNA donor for sequence diversification in E. coli and yeast. It accelerates laboratory evolution through a targeted and trackable approach. e In the CRAIDE system, an error-prone T7 RNA polymerase (epT7RNAP) continuously generates mutated chimeric donor gRNAs (cgRNAs). These cgRNAs are then copied into the target genomic site under the guidance of Cas9 via homologous recombination. This approach can be applied to evolve endogenous metabolic pathways

CRISPR-assisted gene diversification

Chromosomal gene diversification often necessitates the use of a library of genome editing agents with targeting specificities covering the regions of interest. Earlier generations of genome editing tools, i.e., zinc finger nucleases (ZFNs) [30] and transcription activator-like effector nucleases (TALENs) [31], suffer from the complexity of protein design, the high cost of molecular cloning, and cleavage-only effector function, preventing the facile generation of gene diversifying libraries. Consequently, TALEN libraries were mainly constructed for gene knockouts [32, 33]. And only until recently, the challenges of designing specific zinc finger arrays have been tackled using advanced machine learning models [34]. In contrast, high-efficiency, easy programmability, and cost-effectiveness drove the wide adoption of CRISPR/Cas systems in different species. Simply by constructing sgRNA libraries, the targeting range of CRISPR readily encompasses a full gene or long genomic regions (Fig. 1).

CRISPR-stimulated homology-directed repair

It is not surprising that HDR was first used to introduce a library of user-defined precision mutations due to decades of research on its highly conserved mechanisms and well-developed experimental protocols [35]. The efficiency of spontaneous homologous recombination in vivo is generally low [36]. A landmark study reported that an artificially introduced DSB greatly boosted the efficiency of targeted integration of foreign DNA carrying homologous sequences, through triggering the homologous recombination repair pathway [37]. This boosted efficiency permits the scaling up of genomic integration using massive in vitro synthesized HDR templates incorporating rationally designed or saturated mutations, following DNA cleavage by CRISPR/Cas9 (referred to in this review as the CRISPR-HDR strategy, Fig. 2a–d).

It was first demonstrated in human cell lines that chromosomal genes can be diversified using saturation editing, which involves a single genomic cut by a sgRNA and subsequent recombination with a library of in vitro synthesized mutagenic donor plasmids [38]. Since then, multiple studies have reported using CRISPR in combination with homologous recombination for saturation editing and functional profiling of exonic regions in various disease-associated genes (Fig. 2b) [3845]. Unlike traditional methods, which are limited by the slow curation of clinical and population genomics data, CRISPR-HDR enables high-throughput generation and analysis of genetic variants, including those of uncertain significance. By introducing targeted mutations and tracking variant abundance before and after selection (e.g., drug treatments), researchers can systematically assess the functional impact of these variants more rapidly (Fig. 2b). However, due to the generally low efficiency of HDR in mammalian cells, HDR templates targeting each exon typically require homologous arms of 600 to 1000 nucleotides in length on both sides to support efficient repair [3840, 45]. In yeast, an analogous method termed Cas9-mediated Protein Evolution Reaction (CasPER) was developed [46]. The CasPER method is a robust directed evolution technique that employs error-prone PCR (epPCR) to create large mutant libraries and integrates these variants into genomic contexts using Cas9-mediated genome editing (Fig. 2c). CasPER allows for the integration of donor sequences ranging from 300 to 600 bp with over 98% efficiency (Table 1). This method has proven useful for diversifying endogenous non-essential yeast genes, such as those of the mevalonate pathway, enabling the generation of diverse mutants and an 11-fold increase in isoprenoid production after selection.

Table 1.

Features of chromosomal gene diversification methods

Method Species Cell line/strain Efficiency Efficiency Note Applications Year Reference
CRISPR-stimulated homology-directed repair Saturation genome editing Mammalian cells HEK293T, HAP1 1.02%–3.33% Range of reported efficiencies Saturation editing of exon 18 of BRCA1 and a region of DBR1 2014 [38]
Mammalian cells HAP1 0.2–0.8 From Fig. 2a of [39] Saturation editing of 13 exons of BRCA1 2018 [39]
Mammalian cells HAP1 Not reported N/A Saturation editing of all 17 exons of DDX3X 2023 [40]
Mammalian cells Humanized mESC 0.6%–2.3% Percentage of HDR (figure S6 of [41]) Saturation editing of exon 13 of BRCA1 2023 [41]
Mammalian cells HAP1-A5 99% From Fig. 2c of [42] Saturation editing of BAP1 2024 [42]
Mammalian cells HAP1-A5 50% From figure S1b of [43] Saturation editing of all 9 exons of RAD51C 2024 [43]
Mammalian cells HAP1  > 0.3 HDR rate (from Extended Data Fig. 3a of [44]) Saturation editing of VHL 2024 [44]
Mammalian cells HAP1 Not reported N/A Saturation editing of exons 15 to 26 of BRCA2 2025 [45]
CasPER Saccharomyces cerevisiae CEN.PK2-1C 98%–99% Cas9-mediated integration of large mutagenized DNA fragments Directed evolution of BTS1 and HMG2 2018 [46]
CREATE Escherichia coli MG1655 75% Predicted from single guide + donor efficiencies

Saturation mutagenesis of the folA gene;

Screening drug-resistant mutations of the AcrB multidrug efflux pump;

Laboratory evolution of E.coli thermotolerance;

Selection of stress resistance to rifampicin, erythromycin, and furfural

2017 [47]
CHAnGE Saccharomyces cerevisiae BY4741 47% Average editing efficiency of guide + donor cassettes with a PAM–codon distance within 20 bp Saturation editing of a region of SIZ1 2018 [49]
MAGESTIC Saccharomyces cerevisiae DHY214  > 60% Editing efficiency of ADE2 (from Fig. 3b of [50]) Saturation editing of a region of SEC14 2018 [50]

Guide + donor

strategy

Saccharomyces cerevisiae BY4741 85%–95% Library editing efficiency (from table S7 of [51]) 20 bp tiling deletion of SGS1 2018 [51]
CHASE Saccharomyces cerevisiae BY4741, ER 30%–40% Library editing efficiency (from figure S6 of [53]) Saturation editing of a global transcription factor gene SPT15 2024 [53]
CRAIDE Saccharomyces cerevisiae CEN.PK2-1C 3.26 × 10^-6 Per viable cell per generation per base

ADE2 disruption;

Directed evolution of CAN1

2021 [56]
CRISPR-guided deaminases TAM Mammalian cells K562  > 20% Achieved with dCas9-AIDx and pooled sgRNAs against GFP Identification of BCR-ABL imatinib resistance variants 2016 [67]
CRISPR-X Mammalian cells K562  > 20% Achieved with AID*∆ and the most efficient sgRNA

Evolution of GFP to EGFP;

Identification of PSMB5 bortezomib resistance variants

2016 [68]
STEMEs Rice Japonica 13.18% Average editing efficiency in each group Directed evolution of the OsACC herbicide resistance 2020 [69]
BEMGE Rice Kitaake 15.2%–23.8% Agrobacterium or particle bombardment Directed evolution of the OsACC herbicide resistance 2020 [95]
AGBE Mammalian cells HEK293 Varied A-to-G and C-to-G (0.01%–32.46%), A-to-G and C-to-T (0.18%–23.75%), or A-to-G and C-to-A (0.01%–2.59%) conversions at the same DNA strand Saturation mutagenesis of hDTR 2022 [76]
MoBE Rice Japonica variety Nipponbare 37.60% Highest efficiency Directed evolution of OsACC herbicide resistance 2023 [77]
BE screen Mammalian cells HAP1  < 25% Substitution frequency (from supplementary Fig. 1b of [80]) Base editing of all exons in BRCA1 2020 [80]
Mammalian cells eHAP Varied

AncBE4max: 9.6%

ABEmax: 4.6%

(median fraction of edited reads)

Base editing of all exons of BRCA1

and BRCA2

2021 [81]
Mammalian cells A375, MELJUSO, HAP1, HT29, OVCAR8, HA1E 46.6% Median C-to-T editing in the target window in HAP1

Identification of known loss-of-function variants in BRCA1 and BRCA2;

Identification of MCL1 and BCL2L1 drug-sensitivity or resistance mutations;

Identification of PARP1 drug-sensitivity or resistance mutations;

Functional screens of 52,034 variants in ClinVar

2021 [82]
Mammalian cells MDA-MB-231, PC9, PDECs, NIH3T3, KPT1  > 60% Median C-to-T by FNLS at NGG PAMs Identification of uncharacterized TP53 variants 2022 [83]
Mammalian cells HAP1, MELJUSO Varied Samples with an intermediate or active PAM averaged 41.7% C-to-T editing and 37.4% A-to-G editing in the predicted edit window of 4 − 8 nucleotides Screening for loss-of-function mutations in BRCA1 and Venetoclax-resistant mutations in BCL2 2022 [84]
Mammalian cells HBEC30KT Varied

C-to-T: 37%–61%

A-to-G: 16%–68%

Evaluated the functional effects of 29,060 cancer-related transition mutations 2022 [85]
Mammalian cells K562  ~ 60% Median C-to-T efficiencies (from Extended Data Fig. 5d of [86]) In situ mutational scanning of DNMT3A 2022 [86]
Mammalian cells K562 Varied

C-to-T: averaging 34.1%

A-to-G: averaging 31.6%

Saturated base editing screens of a GWAS locus 2023 [87]
Mammalian cells MCF10A  > 95% For both CBE and ABE Identification of EGFR pathogenic variants 2024 [89]
0.5%–66% Prime editing efficiencies from Fig. 4a of [89]
CoMuTER Saccharomyces cerevisiae S288c N/A Average of 0.3 mutations per kilobase Evolution of a heterologous lycopene synthesis pathway 2023 [99]
HACE Mammalian cells HEK293FT N/A  ~ 2 per kbp per 3 days (from table S13 of [100])

Identification of MEK1 inhibitor–resistance mutations;

Identification of SF3B1 variants;

Mutagenesis of noncoding regulatory elements

2024 [100]
CRISPR coupled with reverse transcriptase PLSM Rice Japonica variety Nipponbare Not reported N/A Identification of OsACC1 herbicide resistance mutations 2021 [109]
SPE Mammalian cells Haploidized HEK293T 9.95%–80.01% Data from the 16 libraries targeting NPC1 Screening of NPC1 and BRCA2 variants 2022 [110]
Mammalian cells MCF7 20%–40% At two loci (from Fig. 1D of [111])

Identification of essential nucleotides in the MYC enhancer;

Functional characterization of breast cancer-associated variants;

Evaluation of clinically identified variants

2023 [111]
Mammalian cells PC-9 0.0014%–0.36% Per base, from Fig. 1d of [112]

Screening of drug-resistant SNVs in EGFR;

Evaluation of SNVs in RPL15 and BRCA1

2024 [112]
Mammalian cells A549  ~ 8% Peaking average editing efficiency Screening cancer-associated variants of TP53 2024 [113]
Mammalian cells Jurkat, THP-1 Varied

28%–68% individual editing rates;

39% total editing rate of the pool

Tiling mutagenesis screens of PPIF and the promoter of IL2RA 2025 [114]
CRISPR coupled with error-prone DNA polymerase EvolvR Escherichia coli TG1 0.5%–1% Substitution rates at a base Directed evolution of rpsE and rpsL 2018 [116]
yEvolvR Saccharomyces cerevisiae yCT01 10^-5 Mutations/nucleotide/generation Diversification of URA3 and CAN1 2020 [117]

While the above studies demonstrated the feasibility of integrating donor libraries with a single sgRNA, the efficiency of mutant incorporation tends to drop at a distance from the Cas9 cleavage site. It is therefore advantageous to use nearby sgRNAs for each intended mutation. This approach is feasible in organisms with efficient homologous recombination systems, such as in certain engineered bacteria or yeast, where shorter homology arms (typically 40–100 bp) are sufficient and thus allow the massively parallel synthesis of the donor and a nearby guide (Fig. 2d). With this concept, a gene diversification technology named CRISPR-enabled trackable genome engineering (CREATE) was developed in E. coli [47]. CREATE integrates CRISPR/Cas9 with HDR to achieve saturation mutagenesis of chromosomal genes using libraries of guide and donor pairs (referred to as the guide + donor strategy hereafter). In this strategy, it is crucial that the donor is supplied on the sgRNA expression plasmid so that the guide + donor pair is physically linked for delivery into the same cell. In yeast, we also showed that the donors can be supplied on the sgRNA expression plasmid to achieve efficient gene editing [48]. Using similar guide + donor strategies, we and others achieved gene diversification in yeast through CRISPR-HDR [4952]. Our strategy, CHAnGE (CRISPR-Cas9 and homology-directed repair-assisted genome-scale engineering), enabled chromosomal gene diversification and genome-wide gene knockouts using plasmid libraries [49]. Recently, we expanded its targeting range by employing a PAM-relaxed Cas9 variant and achieved CRISPR-Cas9 and homology-directed repair-assisted saturation editing (dubbed as CHASE) [53]. Using a conceptually similar approach, Roy et al. developed Multiplexed Accurate Genome Editing with Short, Trackable, Integrated Cellular barcodes (MAGESTIC) [50]. In this system, editing efficiency was increased by more than five-fold to reach 60% on average through the active recruitment of donor DNA to the Cas9 cleavage site. Furthermore, the mutant tracking barcodes were genomically integrated by MAGESTIC to improve the fidelity of mutant tracing during downstream functional selection. Other variations of the same guide + donor strategy are also capable of performing chromosomal gene diversification with minor adaptations [51, 52].

These CRISPR-HDR platforms capable of large-scale diversification (CREATE, CHAnGE, MAGESTIC, etc.) are particularly transformative for microbial strain engineering (Fig. 2d)—a field requiring precise genetic modifications coupled with rapid phenotypic characterization for industrial applications. While traditional adaptive laboratory evolution [54] remains widely employed for industrial strain improvement, CRISPR-assisted diversification now offers a targeted alternative that significantly accelerates this process. The versatility of CRISPR-HDR has been demonstrated through successful diversification of various targets including metabolic genes [47], transcription factors [53], and promoters [52], facilitating the evolution of critical industrial phenotypes such as stress tolerance. By enabling simultaneous and trackable mutagenesis of metabolic pathways and regulatory elements, these systems overcome challenges previously solved by months of untargeted evolution.

While DNA-based donor templates are commonly used in CRISPR-HDR applications, RNA was also shown to serve as the donor for DSB repair in yeast [55]. In light of this, CRISPR- and RNA-assisted in vivo directed evolution (CRAIDE) utilized a chimeric donor gRNA (cgRNA) for both Cas9 targeting and DSB repair [56]. cgRNAs were continuously produced by an error-prone T7 RNA Polymerase (epT7RNAP), which introduced random mutations into the RNA template for integration into the target site (Fig. 2e). CRAIDE was used to evolve drug resistance by targeting an arginine transporter. However, this approach exhibits a mutation rate 2–3 orders of magnitude lower than other ectopic in vivo evolution methods [57, 58]. Its application for evolving complex and industrially relevant traits has yet to be explored.

Although the CRISPR-HDR approach approves versatile in mutant design, Cas9 cleavage can generate random insertions and deletions at the target site, especially in mammalian cell lines that prefer NHEJ. Additionally, both on-target and off-target cleavage can lead to structural abnormalities in chromosomes, such as unexpected translocations and the loss of large chromosomal segments [24, 5961], potentially confounding the interpretation of genetic variant functions. To circumvent this problem, researchers have taken advantage of studying essential genes in haploid cell lines or strains (Table 1) [50], where lethal indels and structural abnormalities are depleted and only successful HDR events are retained. Beyond the challenges of indels and chromosomal instability, CRISPR-HDR is limited to HR-competent cells or during specific stages of the cell cycle [62, 63]. DSB-free and HDR-independent methods are thus desired in certain applications and species.

CRISPR-guided deaminases

Here we first illustrate the use of one of the HDR-independent methods, namely CRISPR-guided deaminases, for chromosomal gene diversification, where the associated deaminases directly covert nucleotide types without the need of a DSB [28, 64].

Cas9-deaminase fusions/complexes

Deaminases, such as the activation-induced cytidine deaminase (AID), have been utilized to generate genetic diversity of heterologous genes before the CRISPR era, by hijacking the natural somatic hypermutation system in Ramos B cells [65]. Base editing (BE) technology further renders deaminases with target programmability and a much wider adoption in different host cells. The first-generation base editor fused a cytidine deaminase (CDA) to dCas9, which induced the deamination of cytosine in the non-template strand to uracil. During DNA replication, this uracil was misidentified as a thymine, resulting in a C:G to T:A transition [28]. To expand the type of mutations, the cytidine deaminase was replaced with adenosine deaminase (ADA), resulting in the invention of an adenine base editor (ABE). This editor mediated the deamination of adenine to hypoxanthine, which is recognized as guanine during DNA replication, achieving A:T to G:C transitions (Fig. 3a) [66].

Fig. 3.

Fig. 3

CRISPR-guided deaminases and its applications. a In base editing, a nickase version of Cas9 (nCas9) is fused with a cytidine deaminase (CDA), an adenosine deaminase (ADA), or both to induce base deaminations. This process converts cytosine (C) to thymine (T) and adenine (A) to guanine (G). The efficiency of C to T conversion can be enhanced by fusing nCas9 with uracil-DNA glycosylase inhibitor (UGI). In contrast, fusing with uracil-DNA N-glycosylase (UNG) generates apurinic/apyrimidinic (AP) sites, which leads to random mutations. BE is widely used in human cells for functional variant screening with high efficiency. In bacteria, BE facilitates the evolution of traits such as improved carbon source utilization through targeted mutagenesis of regulatory sequences of metabolic pathway genes. RBS, ribosome binding site. Additionally, in plants, BE is employed for evolving herbicide resistance and other agronomically important traits. b The CoMuTER system uses a crRNA-guided Cascade complex to recognize genomic targets and recruit a Cas3-CDA fusion, thus converting C to T during processing, which can be utilized to evolve a complete metabolic pathway. c In the HACE system, a nCas9-sgRNA complex binds to the target site and generates ssDNA. Then a helicase is loaded onto the ssDNA to continuously unwind the DNA in the 3'→ 5' direction, allowing the helicase-fused CDA to randomly deaminate cytosines on both strands

Two base editing tools, namely targeted AID-mediated mutagenesis (TAM) and CRISPR-X, demonstrated the ability of cytidine base editors (CBEs) to rapidly perform genetic diversification at specified chromosomal loci. Both tools incorporate a dCas9 that is complexed with AID, enabling the generation of a broad range of single-nucleotide mutations in the target region under the guidance of sgRNAs [67, 68]. The tools utilize different strategies of cytidine deaminase recruitment. TAM fuses AIDx (an AID variant) to the C-terminus of dCas9 [67], while CRISPR-X utilizes a modified sgRNA scaffold with a stem-loop structure containing two MS2 binding sites to recruit an MS2 protein fused with AID or a highly active AID variant (AID*∆) [68].

To enhance the diversity of mutation types, several dual base editing tools have been developed [6976]. The first of these tools, saturated targeted endogenous mutagenesis editors (STEMEs), fused both a cytidine deaminase (APOBEC3) and an adenosine deaminase (ecTadA-ecTadA7.10) to the C-terminus of nCas9 in different configurations. This allowed for simultaneous C to T and A to G conversions within the same protospacer. By further employing a PAM-flexible Cas9 that recognizes the NG PAM sequence, STEME-NG achieved near-saturation mutagenesis across a 56-amino acid sequence [69]. In addition to the common C:G to T:A and A:T to G:C conversions, STEME-NG also facilitated less frequent C to G/A conversions. Targeting 400 amino acids of the carboxyl terminal domain of the rice acetyl-CoA carboxylase (encoded by the OsACC gene), STEMEs successfully induced mutants with enhanced herbicide resistance and generated multiplex mutations [69]. Additionally, instead of using fusion proteins, multiplexed orthogonal base editing (MoBE) also achieved random dual base editing using an sgRNA scaffold that concurrently recruits an ADA (TadA9) and a CDA (CDA1) using orthogonal stem loops, which seems to be an upgrade of CRISPR-X in plants [77].

To improve editing efficiency and accuracy for gene therapy, CBEs were initially designed with a fused uracil glycosylase inhibitor (UGI) to prevent the formation of apurinic/apyrimidinic (AP) sites after the removal of uracil through mismatch repair. For gene diversification, eliminating UGI or replacing it with uracil DNA N-glycosylase (UNG) enhances the formation of AP sites and promotes more diversified editing of the target sequence (Fig. 3a). Using this strategy, C to G base editors (CGBEs) achieved efficient C to G transversions by omitting UGI and incorporating eUNG (a variant of UNG from E. coli), while still retaining less frequent C to A/T conversions [78, 79]. Building on this, miniAGBE combines ABEs with CGBE to simultaneously generate A to G and C to G/T/A conversions at the target site, with editing windows of A4-A8 and C3-C13 [76]. During resistance screening, over 50,000 variants were assessed through the saturation editing of the critical region of the human diphtheria toxin receptor gene (hDTR) using miniAGBE, with 54.9% of the mutant gene displaying both A to G and C to G/T/A conversions [76].

BE exhibited significantly higher editing efficiencies in mammalian cells as compared to CRISPR-HDR (Table 1). With the rapid advancement of BE, recent studies have increasingly employed this technique to characterize genetic variations in both coding and non-coding regions [8087]. In contrast to CRISPR-HDR-mediated functional screening, which uses complex donor libraries with substantially larger library sizes, BE enables direct catalytic conversion of target bases without donor templates. By simply transfecting a pooled sgRNA library, BE allows simultaneous screening of hundreds to thousands of sites in a single experiment. Its high editing efficiency and streamlined workflow confer superior practical throughput compared to HDR-based approaches. However, BE is inherently limited to mainly C:G to T:A and A:T to G:C conversions. Although emerging editors such as AGBE [76], CGBE [78, 79], and AYBE [88] expand the scope of base conversions, they still do not match the versatility of CRISPR-HDR, which introduces any desired single-nucleotide variant (SNV) or codon mutation. Thus, BE may be suited for an initial “coarse-grained” screening of functional variants at C/G or A/T sites, followed by a finer screen of non-BE-targetable mutations through CRISPR-HDR or other templated methods. Recent studies have begun integrating BE with prime editing (discussed in the section “CRISPR coupled with reverse transcriptase”) for comprehensive SNV analysis [89], demonstrating the potential of hybrid approaches.

Beyond its utility in mammalian cells, BE has emerged as a powerful tool for engineering prokaryotic systems. BE-based evolution of a Sec-translocase in Bacillus subtilis generated strains with higher protein secretion [90]. BE-assisted in situ diversification was also used to diversify combinations of regulatory sequences, enabling the engineering of Corynebacterium glutamicum and Bacillus subtilis for more efficient carbon source utilization and higher chemical production [91]. Furthermore, by developing a new BE system guided by dCas12b, the E. coli chassis was evolved to obtain a higher protein secretion efficiency [92].

BE is also widely adopted in plant breeding. Due to a low HDR efficiency in plants and the technical challenges associated with delivering HDR donor libraries [93, 94], BE has become a preferred strategy of high-throughput plant genome editing. The evolution of herbicide resistance demonstrated the potential of BE for crop improvement. Targeting the full-length coding region of the rice acetolactate synthase gene OsALS1, distinct variants conferring herbicide tolerance were identified by employing CBE and ABE [95]. Herbicide-resistant variants were also identified by screening other genes such as EPSPS, ALS, and HPPD in Arabidopsis through BE [96]. These variants in model plants may serve as a reference for enhancing traits in other crops.

Processive protein-deaminase fusions

Cas9-dependent deaminases are restricted to a narrow editing window when using a single sgRNA. While this can be remedied by using high-throughput sgRNA libraries, the library size can be a practical limit. In addition, the targeting density is restricted by the PAM distribution within the target region. Cas3 from the type I CRISPR/Cas systems, which harbors both helicase and 3'→ 5'nuclease activities, works in conjunction with the CRISPR-associated complex for antiviral defense (Cascade) to unwind and degrade DNA regions of at least 10 kb upstream of the PAM site [97, 98]. Confined Mutagenesis using a Type I-E CRISPR-Cas system (CoMuTER) fuses Cas3 with cytidine deaminase, enabling random deamination across a genomic region of up to 55 kb (Fig. 3b). This approach was successfully applied to evolve a lycopene biosynthesis pathway, resulting in a two-fold increase in the lycopene yield in yeast [99]. In another study, helicase-assisted continuous editing (HACE) was developed, which consists of a fusion of a DNA helicase and a deaminase. HACE is guided by CRISPR/Cas9 to become target specific and programmable (Fig. 3c) [100]. In this system, the helicase binds to the single-stranded DNA (ssDNA) within the Cas9-generated R-loop, and subsequently unwinds the DNA in the 3'→ 5'direction [100]. Meanwhile, the deaminase introduces random mutations along the ssDNA substrate. HACE generated mutations in both coding and noncoding strands. By employing three sgRNAs, HACE was programmed for the diversification of multiple genomic regions. Of these two examples, CoMuTER should be readily transferable to different hosts, given that type I CRISPR-Cas systems have been validated in multiple species including bacteria [101103], mammalian cells [104106], and plants [107]. The induction of genome-wide random mutations through helicase-deaminase fusion proteins was also reported in S. cerevisiae [108], indicating that HACE is also species-agnostic. Nonetheless, like with BE, the mutation types of CoMuTER and HACE are inherently limited by the chemistry enabled by the fused deaminases. Additionally, although the extended editing window permits the investigation of combinatorial effects of distal mutations, CoMuTER and HACE lack the ability to precisely control where to terminate the continuous mutagenesis. Overall, these two examples represent exciting developments that combine the programmability of CRISPR systems and the processivity of certain enzymes.

CRISPR coupled with reverse transcriptase

The other DSB-free and HDR-independent gene diversification method uses prime editors (PEs). PE consists of a protein complex and a prime editing guide RNA (pegRNA) (Fig. 4). The protein complex consists of a reverse transcriptase domain fused to the C-terminus of nCas9 (H840A). The pegRNA is a modified version of sgRNA that includes a 20-nucleotide guide sequence to direct nCas9 to the target site, along with an extended sequence at the 3'end of the sgRNA scaffold that contains a primer binding sequence (PBS) and a reverse transcriptase template (RTT). The RTT, which encodes the desired mutations, is reverse transcribed into DNA by the reverse transcriptase using the nicked ssDNA as a primer, allowing for the introduction of new genetic variations. On par with HDR, the PE system enables all 12 types of point mutations [29].

Fig. 4.

Fig. 4

CRISPR coupled with reverse transcriptase and its applications. Prime editing utilizes a fusion protein that includes a nCas9 and a reverse transcriptase (RT), along with a prime editing guide RNA (pegRNA) that features a primer-binding site (PBS) and an RT template (RTT). When nCas9 nicks the non-target DNA strand, the PBS hybridizes with the exposed ssDNA, permitting the RT to reverse transcribe the RTT into DNA, enabling targeted genetic modifications. PE enables saturation mutagenesis in mammalian cells for functional variant analysis and facilitates crop improvement, such as herbicide resistance evolution

To date, several studies have demonstrated chromosomal gene diversification with high-throughput prime editing using a pegRNA library [89, 109114]. This approach was first demonstrated in rice through the development of prime editing-library-mediated saturation mutagenesis (PLSM) [109]. PLSM targeted six conserved residues in the rice OsACC gene known to be associated with herbicide resistance, with RTTs covering all possible codon substitutions for each residue. Sixteen herbicide-tolerant mutations across the six sites were identified [109]. Another strategy that combines prime editing with site haploidization expands the application of saturation prime editing (SPE) to mammalian cellular models [110]. Enabled by SPE, this study analyzed approximately 1400 SNVs in selected regions of two disease-related genes [110]. Later developments utilized even larger pegRNA libraries. For instance, the pooled prime-editing screen method (PRIME) improved editing efficiency to 40% by optimizing lentiviral delivery and created a library of 6252 pairs of pegRNA and nicking sgRNA that targeted a 716-bp enhancer region, generating 2127 single-nucleotide substitutions [111]. Prime editing and endogenous region sequencing (PEER-seq) used an elegant approach to enhance both the accuracy of SNV detection and editing efficiency [112]. PEER-seq simultaneously introduced a 1-bp synonymous mutation adjacent to the intended mutation. This synonymous mutation serves as a reliable editing marker and distracts the mismatch repair system from repairing the intended edits. As a result, false-positive reads were reduced by 625-fold, and overall editing efficiency increased by 3.4-fold. Additionally, PEER-seq revealed that knocking out MLH1 significantly improved prime editing efficiency compared to using a dominant negative variant (MLH1dn), serving as a reference for future PE optimization [112]. In another study, PE was combined with BE to realize multimodal precision gene editing screens, allowing for a more comprehensive analysis of genetic variants by overcoming the limitations inherent to each method when used alone [89]. This multimodal approach enhanced the modeling of broader genetic diversity and supported clinical diagnosis and drug development for populations with rare mutations. Similarly, Prime Editing Sensor Libraries [113], akin to Base Editing Sensor Libraries [83], link sensor targets to pegRNAs, thereby providing a more accurate quantitative assessment of editing efficiency and a more comprehensive functional evaluation of genetic variants. This strategy contributed to a deeper understanding of pathogenic variants.

Compared with CRISPR-HDR in mammalian cells, PE-based approaches induce the same full-spectrum mutations without generating DSBs yet with much higher editing efficiencies (Table 1). However, the PE system is more complex, and its larger size often raise delivery challenges [115]. Further optimization of pegRNA design is also required to predict efficient pegRNAs to ensure success in massively parallel screening. In terms of applicability in different host species, PE allows gene diversification in plants, which is challenging to accomplish using CRISPR-HDR. However, the utility of PE-based screening in microbes has yet to be explored.

CRISPR coupled with error-prone DNA polymerase

Another DSB-free strategy called EvolvR fuses a nCas9 (D10A) with an error-prone DNA polymerase I (PolI3M) (Fig. 5) and was initially developed in E. coli [116]. Mechanistically, PolI3M introduces random mutations while extending from the 3'-OH end generated by nCas9. The editing window of EvolvR spans 17 base pairs from the incision, which is limited by the processivity of PolI3M. To extend the editing window, the thioredoxin-binding domain from the bacteriophage T7 DNA polymerase was inserted into the thumb domain of PolI3M, increasing the window to 56 base pairs. However, the mutation rate decreased rapidly as the distance from the nicking site increased. Attempts to replace PolI3M with a more processive Phi29 DNA polymerase successfully extended the editing window to 347 bp, but resulted in a significant reduction in mutation rate. EvolvR was later adapted for use in S. cerevisiae, namely the yEvolvR system. yEvolvR utilized polymerases with varying fidelity to achieve tunability of the mutation rate, and realized the diversification of the endogenous CAN1 gene [117]. The EvolvR technologies are unique in that both error-prone DNA synthesis and processivity can be accomplished by the DNA polymerase, which is not achievable with Cas3 or helicases. It also achieved more diverse base conversions than deaminases. However, the randomness of mutagenesis also makes it more suited for directed evolution rather than the functional profiling of user-defined SNVs.

Fig. 5.

Fig. 5

CRISPR coupled with error-prone DNA polymerase. The EvolvR system integrates nCas9 (D10A) with an error-prone DNA polymerase I (PolI3M), which introduces random mutations while processing from the 3' end of the nCas9-generated nick

CRISPR-assisted structural variation

While CRISPR systems have been mainly applied to perform gene diversification with point mutations, several recent studies highlighted its potential to also induce diverse structural variations (SVs). SVs, including large deletions, duplications, inversions, and translocations, are key drivers of genome diversity and natural evolution. Our understanding of SVs has largely come from observations of naturally occurring variations. The emergence of CRISPR tools now allows the de novo creation of SVs (Fig. 6), permitting a more comprehensive interrogation of their impacts on genome function. For example, by employing CRISPR/Cas9 to target the repetitive retrotransposon element Ty1 in the yeast genome, DSBs formed by Cas9 cleavage within Ty1 sequences induced a wide range of structural variations, including deletions, duplications, and translocations [118]. This artificial induction of SVs helped reveal the importance of retrotransposon lesions in driving genome evolution. Alternative strategies coupling CRISPR with site-specific recombinases have also been developed. In recent examples, multiple LoxP sites were inserted into the human genome using PE targeting repetitive LINE-1 elements, and random structural variations including deletions, inversions, extrachromosomal DNAs, and translocations were generated through Cre recombinase-induced genome shuffling [119]. Analysis of survival cell clones revealed some interesting findings such as the relative locations of genes did not influence their expression levels in their experimental cell line. This method is further extended to induce SVs within large regulatory regions, such as super-enhancers, enabling the generation of randomized regulatory configurations. It allowed the resolution of functional minimal elements within the super-enhancer [120], revealing a surprising result that many sequences are dispensable for super-enhancer function. In another study, up to 30 LoxP sites were introduced into Vibrio natriegens chromosomes using CRISPR-associated transposases (CASTs, reviewed elsewhere [121]) [122]. Cre recombinase-induced chromosome rearrangement led to the identification of a genomic region underpinning the rapid growth phenotype of V. natriegens, a characteristic highly relevant for industrial and molecular biology applications. While the above methods took advantage of repetitive elements, innovative PE strategies enabled the generation of precise SVs. For example, PRIME-Del utilizes dual pegRNAs to achieve large genomic deletions with user-defined boundaries, which are specified by the two reverse-complement pegRNA templates [123]. Such strategies may be further scaled up to generate a library of precise SVs for functional genomics studies. Although these CRISPR-assisted structural variation methods are only emerging, they promise to further our understandings of genome evolution, speciation, and may facilitate the generation of next-generation biotechnological chassis with minimal and customized genomes (Fig. 6).

Fig. 6.

Fig. 6

CRISPR-assisted structural variation and its applications. Structural variations can be induced by CRISPR/Cas9 targeting high-copy genomic sites, such as the Ty1 retrotransposon elements in yeast (upper panel). Alternatively, recombinase recognition sites like LoxP can be inserted into the genome using prime editing (PE) or CRISPR-associated transposases (CAST), with structural variations subsequently generated through recombinase induction (lower panel). The generation of diverse artificial SVs could facilitate studies of genome evolution principles, enable the engineering of large regulatory elements, and support the development of next-generation chassis organisms with minimal genomes

Data analysis

After phenotypic selection of a diversified library, selecting the appropriate method to analyze the outcomes is crucial for measuring mutation effects or identifying the desired variants. For targeted diversification of a genomic region, which can be achieved through CRISPR-HDR, BE, and PE, high-throughput amplicon sequencing of the target gene is typically used (Fig. 1). By aligning sequencing reads to the reference, mutations are called and counted. Enrichment or depletion of designed mutations can be calculated by comparing samples from untreated and treated conditions. This change in mutant abundance serves as a functional score indicating the mutation effect size or fitness (Fig. 2b). In some microbial CRISPR-HDR approaches (Fig. 2d), where a DNA barcode or the guide is uniquely paired with the introduced mutation, the abundance of such barcodes can be used as an estimate of the abundance of the mutations [47, 50]. In such cases, well developed algorithms and computational pipelines for analyzing traditional CRISPR knockout screens can be adapted for data analysis [124]. In directed evolution methods such as CasPER (Fig. 2c), CRAIDE (Fig. 2e), CoMuTER (Fig. 3b), HACE (Fig. 3c), and EvolvR (Fig. 5), where typically only “winners” are of interest, several surviving clones will be sequenced to identify common mutations that indicate functional importance. Otherwise, long-read sequencing is needed for phasing multiple introduced mutations. For SVs, where sequence alterations occur across an entire genome or within large genomic regions, whole-genome sequencing is often necessary to determine genotypes. Recently, a barcoding strategy has also been validated in analyzing SVs, greatly reducing the cost and labor [125]. Generally, most data analysis codes and pipelines have been developed ad hoc in each study. Developing a universal software or webtool integrating library design and data analysis with easy-to-use interfaces, at least for some of the CRISPR-based methods reviewed here, will greatly facilitate the adoption of these advanced gene diversification techniques.

Future directions

Further expanding the mutation spectrum and editing window

For CRISPR-guided deaminases, dual base editors have been used to increase the mutation spectrum [6972]. However, it remains less diverse comparing to HDR and PE (Fig. 7). Achieving A to N editing in BE and efficient dual deamination in processive systems will be the next milestones. Progresses are already there. For example, AYBE, developed by fusing ABE with an N-methylpurine DNA glycosylase (MPG), excises hypoxanthine generated from adenine deamination, enabling A to C and A to T transversions [88]. In theory, combining UNG and MPG with a dual BE system could allow for concurrent C to N and A to N conversions and possibly in high-throughput. Such strategy may also be tested on the CoMuTER and HACE systems.

Fig. 7.

Fig. 7

Editing window, library size, and mutation types of representative chromosomal gene diversification approaches. N indicates mutation of a single base to any base; NNN indicates the mutation of an entire codon; HDR, homology-directed repair; PE, prime editing. The data used for plotting and detailed explanation of the library size and editing window of each approach can be found in Additional file 1: Table S1

The range of targeted gene diversification, or the editing window, determines the size of the genomic region that can be perturbed in a single experiment. The editing window varies depending on the method used (Fig. 7). High-throughput libraries allow targeting of multiple sites simultaneously, expanding the editing window up to thousands of base pairs. Expanding this editing window is ongoing. Studies have utilized PAM-relaxed Cas variants for this purpose [53, 87, 126]. Sangree et al. have shown that PAM-relaxed Cas9 variants, when used in BE, increased the editable residues in BRCA1 from 24.2 to 75.3% [84]. However, these Cas mutants are associated with improved off-target effects [127, 128]. In addition, they may suffer from reduced activities. Alternatively, the editing window of BE may be expanded through protein architecture engineering. In one example, multiple copies of a deaminase were recruited to dCas9 through the SunTag scaffold, which expanded the editing window [129]. The editing window was also increased by inserting the deaminase into the PI domain or replacing the HNH domain with the deaminase [130, 131]. These engineered systems expanded the editing window without affecting the Cas specificity. Importantly, a smaller sgRNA library can cover the same editing window as compared to those PAM-relaxed Cas variants. In addition, by fusing the deaminase with other Cas proteins, such as Cas12b, the editing window may also be increased [92].

An alternative strategy for expanding the editing window may be to use large sequence genome editors. For example, CASTs enable the targeted integration of large DNA fragments up to 10 kb in length [132, 133]. In these systems, the R-loop formed by CRISPR/Cas near the target site serves as a binding platform for the transposase complex. Subsequently, mediated by the transposase, fragments of donor DNA can be inserted without the need for DSBs. However, the insertion typically occurs within a few dozen base pairs downstream of the sgRNA target site and lacks precision. To achieve a more precise integration, the programmable addition via site-specific targeting elements (PASTE) system leverages prime editing to pre-install an attB recognition site, followed by phage-derived serine integrase-mediated integration of a circular double-stranded DNA template containing an attP attachment site into the attB locus [134]. Potentially, both CAST and PASTE systems may be used to insert long fragments derived from epPCR, analogous to CasPER, but in mammalian cells.

Multiplex gene diversification

Most efforts so far focused on sequence diversification of individual genes or regions. With smart library design, multiplex gene diversification may enable us to tackle many interesting problems or achieve more ambitious engineering goals, for example to probe protein–protein interactions or to concurrently optimize multiple regulatory sequences. A representative example is the use of base editors to simultaneously target the ribosomal binding site (RBS) regions of three genes involved in xylose metabolism in bacteria. The optimized RBS combinations significantly improved the xylose utilization rate. The same approach was applied to screen multiple high-yield RBS combinations of lycopene biosynthesis pathway genes [91]. Recently, multiplex gene editing using BE and PE, guided by a single sgRNA for each gene, were demonstrated in more species including mammalian cells and plants [135138], suggesting that multiplex gene diversification is feasible in diverse organisms. However, several technical challenges must be addressed to achieve large-scale multiplex gene diversification. One of them is the exponentially growing library size. Due to the library cost and transformation limits, it is wise to prioritize key regions for diversification. Alternatively, if the mutation spectrum of CoMuTER and HACE can be further improved, transforming a few sgRNAs instead of a huge library may suffice multiplex gene diversification. In some niche applications, there may be a need to diversify sequence regions with extreme AT or GC content, such as promoters or particular gene bodies. In such scenarios, it becomes necessary to develop orthogonal targeting systems, such as Cas12a and SpCas9.

Automated iteration of workflows

At present, the creation of large-scale variant libraries and high-throughput screening mainly rely on manual workflows. With the projected increase in library scale and evolution rounds, automation rapidly becomes a necessity. The significance of automation platforms in genetic engineering has been demonstrated. For example, CRISPR-HDR-mediated gene integration was automated to realize overexpression and knockdown of more than 90% of the genes in S. cerevisiae, leading to the development of acetic acid-resistant strains after three rounds of evolution [139]. Such established workflows could be easily adapted for iterating gene diversifications, enabling long-term and even high-throughput strain evolution beyond the current scope achieved, in which typically only one or two rounds of evolution were demonstrated as a proof-of-concept. Although mostly developed for microbes, such automation can be easily reconfigured to facilitate mammalian gene diversification as well. For example, cellular disease models with pathogenic SNVs were recently generated on an automated high-throughput platform [140]. This platform streamlined the large-scale cloning of sgRNA libraries containing thousands of plasmids, cell transfections, and high-throughput screening. It is anticipated that iterating this workflow for gene diversification will enable mammalian directed evolution beyond functional genomics.

In conclusion, we believe that CRISPR and its derivative technologies will continue to evolve, fueling chromosomal gene diversification in ever-expanding species and applications. Looking ahead, with the rapid advancement of modern artificial intelligence algorithms and automation platforms, the vast amount of data generated can be used to build powerful predictive models, aiding in library design and variant discovery. Eventually, the limiting factor will not be the technology itself, but where in the vast genome space we want to investigate.

Supplementary Information

13059_2025_3756_MOESM1_ESM.xlsx (14.5KB, xlsx)

Additional file 1: Table S1. Library size, editing window, and mutation type of chromosomal gene diversification approaches. This table contains the source data used for plotting Fig. 7.

Acknowledgements

We thank Bao Laboratory members for their valuable inputs during the preparation of this article. We also thank iBioFoundry and Core Facility at ZJU‐Hangzhou Global Scientific and Technological Innovation Center for their technical support in our gene diversification research.

Peer review information

Claudia Feng was the primary editor of this article and managed its editorial process and peer review in collaboration with the rest of the editorial team. The peer-review history is available in the online version of this article.

Authors’ contributions

ZB conceived of the manuscript. RZ and CR wrote the manuscript. RZ, CR, and ZB made the figures. ZB reviewed and edited the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by the National Key R&D Program of China (grant no. 2023YFF1204500 to ZB), the National Natural Science Foundation of China (grant no. 22308316 to ZB), and the Fundamental Research Funds for the Central Universities (grant nos. 226–2025-00043 and 226–2022-00214 to ZB).

Data availability

No datasets were generated or analysed during the current study.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Ruiying Zhu and Chuanhong Ren contributed equally to this work.

References

  • 1.Wilson DS, Keefe AD. Random mutagenesis by PCR. Curr Protoc Mol Biol. 2001;51:8–3. [DOI] [PubMed] [Google Scholar]
  • 2.Reetz MT, Wu S. Greatly reduced amino acid alphabets in directed evolution: making the right choice for saturation mutagenesis at homologous enzyme positions. Chem Commun (Camb). 2008;43:5499–501. [DOI] [PubMed] [Google Scholar]
  • 3.Yu H, Ye C, Wang Y, Wang Z, Fang S, Jin H, et al. Enhancing substrate preference of iridoid synthase via focused polarity-steric mutagenesis scanning. Chem Bio Eng. 2024;1:826–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stemmer WP. DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. Proc Natl Acad Sci U S A. 1994;91:10747–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Stemmer WP, Crameri A, Ha KD, Brennan TM, Heyneker HL. Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. Gene. 1995;164:49–53. [DOI] [PubMed] [Google Scholar]
  • 6.Packer MS, Liu DR. Methods for the directed evolution of proteins. Nat Rev Genet. 2015;16:379–94. [DOI] [PubMed] [Google Scholar]
  • 7.Fryer T, Wolff DS, Overath MD, Schäfer E, Laustsen AH, Jenkins TP, et al. Post-assembly plasmid amplification for increased transformation yields in E. coli and S. cerevisiae. Chem Bio Eng. 2025;2:87–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Molina RS, Rix G, Mengiste AA, Alvarez B, Seo D, Chen H, et al. In vivo hypermutation and continuous evolution. Nat Rev Methods Primers. 2022;2:36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chen X, Zhang J. The genomic landscape of position effects on protein expression level and noise in yeast. Cell Syst. 2016;2:347–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.James JS, Dai J, Chew WL, Cai Y. The design and engineering of synthetic genomes. Nat Rev Genet. 2025;26:298–319. [DOI] [PubMed] [Google Scholar]
  • 11.Erdogan M, Fabritius A, Basquin J, Griesbeck O. Targeted In Situ Protein Diversification and Intra-organelle Validation in Mammalian Cells. Cell Chem Biol. 2020;27:610-621.e615. [DOI] [PubMed] [Google Scholar]
  • 12.Lynch M, Ackerman MS, Gout JF, Long H, Sung W, Thomas WK, et al. Genetic drift, selection and the evolution of the mutation rate. Nat Rev Genet. 2016;17:704–14. [DOI] [PubMed] [Google Scholar]
  • 13.Lewis JA, Morran LT. Advantages of laboratory natural selection in the applied sciences. J Evol Biol. 2022;35:5–22. [DOI] [PubMed] [Google Scholar]
  • 14.Wannier TM, Ciaccia PN, Ellington AD, Filsinger GT, Isaacs FJ, Javanmardi K, et al. Recombineering and MAGE. Nat Rev Methods Primers. 2021;1:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wang HH, Isaacs FJ, Carr PA, Sun ZZ, Xu G, Forest CR, et al. Programming cells by multiplex genome engineering and accelerated evolution. Nature. 2009;460:894–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wang HH, Kim H, Cong L, Jeong J, Bang D, Church GM. Genome-scale promoter engineering by coselection MAGE. Nat Methods. 2012;9:591–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Carr PA, Wang HH, Sterling B, Isaacs FJ, Lajoie MJ, Xu G, et al. Enhanced multiplex genome engineering through co-operative oligonucleotide co-selection. Nucleic Acids Res. 2012;40:e132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Nyerges Á, Csörgő B, Nagy I, Bálint B, Bihari P, Lázár V, et al. A highly precise and portable genome engineering method allows comparison of mutational effects across bacterial species. Proc Natl Acad Sci U S A. 2016;113:2502–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Barbieri EM, Muir P, Akhuetie-Oni BO, Yellman CM, Isaacs FJ. Precise Editing at DNA Replication Forks Enables Multiplex Genome Engineering in Eukaryotes. Cell. 2017;171:1453-1467.e1413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.DiCarlo JE, Conley AJ, Penttilä M, Jäntti J, Wang HH, Church GM. Yeast oligo-mediated genome engineering (YOGE). ACS Synth Biol. 2013;2:741–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ciaccia PN, Liang Z, Schweitzer AY, Metzner E, Isaacs FJ. Enhanced eMAGE applied to identify genetic factors of nuclear hormone receptor dysfunction via combinatorial gene editing. Nat Commun. 2024;15:5218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Makarova KS, Wolf YI, Iranzo J, Shmakov SA, Alkhnbashi OS, Brouns SJJ, et al. Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants. Nat Rev Microbiol. 2020;18:67–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell. 2013;152:1173–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Mali P, Aach J, Stranges PB, Esvelt KM, Moosburner M, Kosuri S, et al. CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering. Nat Biotechnol. 2013;31:833–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Xu X, Qi LS. A CRISPR-dCas toolbox for genetic engineering and synthetic biology. J Mol Biol. 2019;431:34–47. [DOI] [PubMed] [Google Scholar]
  • 28.Komor AC, Kim YB, Packer MS, Zuris JA, Liu DR. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature. 2016;533:420–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Anzalone AV, Randolph PB, Davis JR, Sousa AA, Koblan LW, Levy JM, et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature. 2019;576:149–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kim YG, Cha J, Chandrasegaran S. Hybrid restriction enzymes: zinc finger fusions to Fok I cleavage domain. Proc Natl Acad Sci U S A. 1996;93:1156–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Li T, Huang S, Jiang WZ, Wright D, Spalding MH, Weeks DP, et al. TAL nucleases (TALNs): hybrid proteins composed of TAL effectors and FokI DNA-cleavage domain. Nucleic Acids Res. 2011;39:359–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kim Y, Kweon J, Kim A, Chon JK, Yoo JY, Kim HJ, et al. A library of TAL effector nucleases spanning the human genome. Nat Biotechnol. 2013;31:251–8. [DOI] [PubMed] [Google Scholar]
  • 33.Kim YK, Wee G, Park J, Kim J, Baek D, Kim JS, et al. TALEN-based knockout library for human microRNAs. Nat Struct Mol Biol. 2013;20:1458–64. [DOI] [PubMed] [Google Scholar]
  • 34.Ichikawa DM, Abdin O, Alerasool N, Kogenaru M, Mueller AL, Wen H, et al. A universal deep-learning model for zinc finger design enables transcription factor reprogramming. Nat Biotechnol. 2023;41:1117–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Scherer S, Davis RW. Replacement of chromosome segments with altered DNA sequences constructed in vitro. Proc Natl Acad Sci U S A. 1979;76:4951–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Brown AD, Claybon AB, Bishop AJ. A conditional mouse model for measuring the frequency of homologous recombination events in vivo in the absence of essential genes. Mol Cell Biol. 2011;31:3593–602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Rouet P, Smih F, Jasin M. Introduction of double-strand breaks into the genome of mouse cells by expression of a rare-cutting endonuclease. Mol Cell Biol. 1994;14:8096–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Findlay GM, Boyle EA, Hause RJ, Klein JC, Shendure J. Saturation editing of genomic regions by multiplex homology-directed repair. Nature. 2014;513:120–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Findlay GM, Daza RM, Martin B, Zhang MD, Leith AP, Gasperini M, et al. Accurate classification of BRCA1 variants with saturation genome editing. Nature. 2018;562:217–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Radford EJ, Tan HK, Andersson MHL, Stephenson JD, Gardner EJ, Ironfield H, et al. Saturation genome editing of DDX3X clarifies pathogenicity of germline and somatic variation. Nat Commun. 2023;14:7702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Sahu S, Sullivan TL, Mitrophanov AY, Galloux M, Nousome D, Southon E, et al. Saturation genome editing of 11 codons and exon 13 of BRCA2 coupled with chemotherapeutic drug response accurately determines pathogenicity of variants. PLoS Genet. 2023;19:e1010940. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Waters AJ, Brendler-Spaeth T, Smith D, Offord V, Tan HK, Zhao Y, et al. Saturation genome editing of BAP1 functionally classifies somatic and germline variants. Nat Genet. 2024;56:1434–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Olvera-León R, Zhang F, Offord V, Zhao Y, Tan HK, Gupta P, et al. High-resolution functional mapping of RAD51C by saturation genome editing. Cell. 2024;187:5719-5734.e5719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Buckley M, Terwagne C, Ganner A, Cubitt L, Brewer R, Kim DK, et al. Saturation genome editing maps the functional spectrum of pathogenic VHL alleles. Nat Genet. 2024;56:1446–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Huang H, Hu C, Na J, Hart SN, Gnanaolivu RD, Abozaid M, et al. Functional evaluation and clinical classification of BRCA2 variants. Nature. 2025;638:528–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Jakočiūnas T, Pedersen LE, Lis AV, Jensen MK, Keasling JD. CasPER, a method for directed evolution in genomic contexts using mutagenesis and CRISPR/Cas9. Metab Eng. 2018;48:288–96. [DOI] [PubMed] [Google Scholar]
  • 47.Garst AD, Bassalo MC, Pines G, Lynch SA, Halweg-Edwards AL, Liu R, et al. Genome-wide mapping of mutations at single-nucleotide resolution for protein, metabolic and genome engineering. Nat Biotechnol. 2017;35:48–55. [DOI] [PubMed] [Google Scholar]
  • 48.Bao Z, Xiao H, Liang J, Zhang L, Xiong X, Sun N, et al. Homology-integrated CRISPR-Cas (HI-CRISPR) system for one-step multigene disruption in Saccharomyces cerevisiae. ACS Synth Biol. 2015;4:585–94. [DOI] [PubMed] [Google Scholar]
  • 49.Bao Z, HamediRad M, Xue P, Xiao H, Tasan I, Chao R, et al. Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision. Nat Biotechnol. 2018;36:505–8. [DOI] [PubMed] [Google Scholar]
  • 50.Roy KR, Smith JD, Vonesch SC, Lin G, Tu CS, Lederer AR, et al. Multiplexed precision genome editing with trackable genomic barcodes in yeast. Nat Biotechnol. 2018;36:512–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Guo X, Chavez A, Tung A, Chan Y, Kaas C, Yin Y, et al. High-throughput creation and functional profiling of DNA sequence variant libraries using CRISPR-Cas9 in yeast. Nat Biotechnol. 2018;36:540–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sharon E, Chen SA, Khosla NM, Smith JD, Pritchard JK, Fraser HB. Functional Genetic Variants Revealed by Massively Parallel Precise Genome Editing. Cell. 2018;175:544-557.e516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Deng L, Zhou YL, Cai Z, Zhu J, Li Z, Bao Z. Massively parallel CRISPR-assisted homologous recombination enables saturation editing of full-length endogenous genes in yeast. Sci Adv. 2024;10:eadj9382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Mavrommati M, Daskalaki A, Papanikolaou S, Aggelis G. Adaptive laboratory evolution principles and applications in industrial biotechnology. Biotechnol Adv. 2022;54:107795. [DOI] [PubMed] [Google Scholar]
  • 55.Storici F, Bebenek K, Kunkel TA, Gordenin DA, Resnick MA. RNA-templated DNA repair. Nature. 2007;447:338–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Jensen ED, Laloux M, Lehka BJ, Pedersen LE, Jakočiūnas T, Jensen MK, et al. A synthetic RNA-mediated evolution system in yeast. Nucleic Acids Res. 2021;49:e88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Crook N, Abatemarco J, Sun J, Wagner JM, Schmitz A, Alper HS. In vivo continuous evolution of genes and pathways in yeast. Nat Commun. 2016;7:13051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Ravikumar A, Arzumanyan GA, Obadi MKA, Javanpour AA, Liu CC. Scalable, Continuous Evolution of Genes at Mutation Rates above Genomic Error Thresholds. Cell. 2018;175:1946-1957.e1913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kosicki M, Tomberg K, Bradley A. Repair of double-strand breaks induced by CRISPR-Cas9 leads to large deletions and complex rearrangements. Nat Biotechnol. 2018;36:765–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Cullot G, Boutin J, Toutain J, Prat F, Pennamen P, Rooryck C, et al. CRISPR-Cas9 genome editing induces megabase-scale chromosomal truncations. Nat Commun. 2019;10:1136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Cullot G, Aird EJ, Schlapansky MF, Yeh CD, van de Venn L, Vykhlyantseva I, Kreutzer S, Mailänder D, Lewków B, Klermund J, et al. Genome editing with the HDR-enhancing DNA-PKcs inhibitor AZD7648 causes large-scale genomic alterations. Nat Biotechnol. 2024. 10.1038/s41587-024-02488-6. [DOI] [PMC free article] [PubMed]
  • 62.Karanam K, Kafri R, Loewer A, Lahav G. Quantitative live cell imaging reveals a gradual shift between DNA repair mechanisms and a maximal use of HR in mid S phase. Mol Cell. 2012;47:320–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Ploessl D, Zhao Y, Cao M, Ghosh S, Lopez C, Sayadi M, et al. A repackaged CRISPR platform increases homology-directed repair for yeast engineering. Nat Chem Biol. 2022;18:38–46. [DOI] [PubMed] [Google Scholar]
  • 64.Nishida K, Arazoe T, Yachie N, Banno S, Kakimoto M, Tabata M, et al. Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems. Science. 2016;353:aaf8729. [DOI] [PubMed]
  • 65.Wang L, Jackson WC, Steinbach PA, Tsien RY. Evolution of new nonantibody proteins via iterative somatic hypermutation. Proc Natl Acad Sci U S A. 2004;101:16745–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Gaudelli NM, Komor AC, Rees HA, Packer MS, Badran AH, Bryson DI, et al. Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage. Nature. 2017;551:464–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Ma Y, Zhang J, Yin W, Zhang Z, Song Y, Chang X. Targeted AID-mediated mutagenesis (TAM) enables efficient genomic diversification in mammalian cells. Nat Methods. 2016;13:1029–35. [DOI] [PubMed] [Google Scholar]
  • 68.Hess GT, Frésard L, Han K, Lee CH, Li A, Cimprich KA, et al. Directed evolution using dCas9-targeted somatic hypermutation in mammalian cells. Nat Methods. 2016;13:1036–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Li C, Zhang R, Meng X, Chen S, Zong Y, Lu C, et al. Targeted, random mutagenesis of plant genes with dual cytosine and adenine base editors. Nat Biotechnol. 2020;38:875–82. [DOI] [PubMed] [Google Scholar]
  • 70.Zhang X, Zhu B, Chen L, Xie L, Yu W, Wang Y, et al. Dual base editor catalyzes both cytosine and adenine base conversions in human cells. Nat Biotechnol. 2020;38:856–60. [DOI] [PubMed] [Google Scholar]
  • 71.Grünewald J, Zhou R, Lareau CA, Garcia SP, Iyer S, Miller BR, et al. A dual-deaminase CRISPR base editor enables concurrent adenine and cytosine editing. Nat Biotechnol. 2020;38:861–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Sakata RC, Ishiguro S, Mori H, Tanaka M, Tatsuno K, Ueda H, et al. Base editors for simultaneous introduction of C-to-T and A-to-G mutations. Nat Biotechnol. 2020;38:865–9. [DOI] [PubMed] [Google Scholar]
  • 73.Tao W, Liu Q, Huang S, Wang X, Qu S, Guo J, et al. CABE-RY: a PAM-flexible dual-mutation base editor for reliable modeling of multi-nucleotide variants. Mol Ther. 2021;26:114–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Neugebauer ME, Hsu A, Arbab M, Krasnow NA, McElroy AN, Pandey S, et al. Evolution of an adenine base editor into a small, efficient cytosine base editor with low off-target activity. Nat Biotechnol. 2023;41:673–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Lam DK, Feliciano PR, Arif A, Bohnuud T, Fernandez TP, Gehrke JM, et al. Improved cytosine base editors generated from TadA variants. Nat Biotechnol. 2023;41:686–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Liang Y, Xie J, Zhang Q, Wang X, Gou S, Lin L, et al. AGBE: a dual deaminase-mediated base editor by fusing CGBE with ABE for creating a saturated mutant population with multiple editing patterns. Nucleic Acids Res. 2022;50:5384–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Zhang A, Shan T, Sun Y, Chen Z, Hu J, Hu Z, et al. Directed evolution rice genes with randomly multiplexed sgRNAs assembly of base editors. Plant Biotechnol J. 2023;21:2597–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Kurt IC, Zhou R, Iyer S, Garcia SP, Miller BR, Langner LM, et al. CRISPR c-to-g base editors for inducing targeted DNA transversions in human cells. Nat Biotechnol. 2021;39:41–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Zhao D, Li J, Li S, Xin X, Hu M, Price MA, et al. Glycosylase base editors enable C-to-A and C-to-G base changes. Nat Biotechnol. 2021;39:35–40. [DOI] [PubMed] [Google Scholar]
  • 80.Kweon J, Jang A-H, Shin HR, See J-E, Lee W, Lee JW, et al. A CRISPR-based base-editing screen for the functional assessment of BRCA1 variants. Oncogene. 2020;39:30–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Huang C, Li G, Wu J, Liang J, Wang X. Identification of pathogenic variants in cancer genes using base editing screens with editing efficiency correction. Genome Biol. 2021;22:80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Hanna RE, Hegde M, Fagre CR, DeWeirdt PC, Sangree AK, Szegletes Z, et al. Massively parallel assessment of human variants with base editor screens. Cell. 2021;184:1064-1080.e1020. [DOI] [PubMed] [Google Scholar]
  • 83.Sánchez-Rivera FJ, Diaz BJ, Kastenhuber ER, Schmidt H, Katti A, Kennedy M, et al. Base editing sensor libraries for high-throughput engineering and functional analysis of cancer-associated single nucleotide variants. Nat Biotechnol. 2022;40:862–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Sangree AK, Griffith AL, Szegletes ZM, Roy P, DeWeirdt PC, Hegde M, et al. Benchmarking of SpCas9 variants enables deeper base editor screens of BRCA1 and BCL2. Nat Commun. 2022;13:1318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Kim Y, Lee S, Cho S, Park J, Chae D, Park T, et al. High-throughput functional evaluation of human cancer-associated mutations using base editors. Nat Biotechnol. 2022;40:874–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Lue NZ, Garcia EM, Ngan KC, Lee C, Doench JG, Liau BB. Base editor scanning charts the DNMT3A activity landscape. Nat Chem Biol. 2023;19:176–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Yao Y, Zhou Z, Wang X, Liu Z, Zhai Y, Chi X, et al. SpRY-mediated screens facilitate functional dissection of non-coding sequences at single-base resolution. Cell Genom. 2024;4:100583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Tong H, Wang X, Liu Y, Liu N, Li Y, Luo J, et al. Programmable a-to-Y base editing by fusing an adenine base editor with an N-methylpurine DNA glycosylase. Nat Biotechnol. 2023;41:1080–4. [DOI] [PubMed] [Google Scholar]
  • 89.Belli O, Karava K, Farouni R, Platt RJ. Multimodal scanning of genetic variants with base and prime editing. Nat Biotechnol. 2024. 10.1038/s41587-024-02439-1. [DOI] [PMC free article] [PubMed]
  • 90.Hao W, Cui W, Cheng Z, Han L, Suo F, Liu Z, et al. Development of a base editor for protein evolution via in situ mutation in vivo. Nucleic Acids Res. 2021;49:9594–605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Wang Y, Cheng H, Liu Y, Liu Y, Wen X, Zhang K, et al. In-situ generation of large numbers of genetic combinations for metabolic reprogramming via CRISPR-guided base editing. Nat Commun. 2021;12:678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Hao W, Cui W, Liu Z, Suo F, Wu Y, Han L, et al. A new-generation base editor with an expanded editing window for microbial cell evolution in vivo based on CRISPR-Cas12b engineering. Adv Sci. 2024;11:2309767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Svitashev S, Young JK, Schwartz C, Gao H, Falco SC, Cigan AM. Targeted mutagenesis, precise gene editing, and site-specific gene insertion in maize using Cas9 and guide RNA. Plant Physiol. 2015;169:931–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Ran Y, Liang Z, Gao C. Current and future editing reagent delivery systems for plant genome editing. Sci China Life Sci. 2017;60:490–505. [DOI] [PubMed] [Google Scholar]
  • 95.Kuang Y, Li S, Ren B, Yan F, Spetz C, Li X, et al. Base-editing-mediated artificial evolution of OsALS1 in planta to develop novel herbicide-tolerant rice germplasms. Mol Plant. 2020;13:565–72. [DOI] [PubMed] [Google Scholar]
  • 96.Wang X, Pan W, Sun C, Yang H, Cheng Z, Yan F, et al. Creating large-scale genetic diversity in Arabidopsis via base editing-mediated deep artificial evolution. Genome Biol. 2024;25:215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Makarova KS, Wolf YI, Alkhnbashi OS, Costa F, Shah SA, Saunders SJ, et al. An updated evolutionary classification of CRISPR-Cas systems. Nat Rev Microbiol. 2015;13:722–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Morisaka H, Yoshimi K, Okuzaki Y, Gee P, Kunihiro Y, Sonpho E, et al. CRISPR-Cas3 induces broad and unidirectional genome editing in human cells. Nat Commun. 2019;10:5302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Zimmermann A, Prieto-Vivas JE, Cautereels C, Gorkovskiy A, Steensels J, Van de Peer Y, et al. A Cas3-base editing tool for targetable in vivo mutagenesis. Nat Commun. 2023;14:3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Chen XD, Chen Z, Wythes G, Zhang Y, Orr BC, Sun G, et al. Helicase-assisted continuous editing for programmable mutagenesis of endogenous genomes. Science. 2024;386:eadn5876. [DOI] [PubMed] [Google Scholar]
  • 101.Csörgő B, León LM, Chau-Ly IJ, Vasquez-Rifo A, Berry JD, Mahendra C, et al. A compact cascade-Cas3 system for targeted genome engineering. Nat Methods. 2020;17:1183–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Whitford CM, Gockel P, Faurdal D, Gren T, Sigrist R, Weber T. Cascade-Cas3 enables highly efficient genome engineering in Streptomyces species. Nucleic Acids Res. 2025. 10.1093/nar/gkaf214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Zhou Q, Zhao Y, Ke C, Wang H, Gao S, Li H, et al. Repurposing endogenous type I-E CRISPR-Cas systems for natural product discovery in Streptomyces. Nat Commun. 2024;15:9833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Dolan AE, Hou Z, Xiao Y, Gramelspacher MJ, Heo J, Howden SE, et al. Introducing a Spectrum of Long-Range Genomic Deletions in Human Embryonic Stem Cells Using Type I CRISPR-Cas. Mol Cell. 2019;74:936-950.e935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Tan R, Krueger RK, Gramelspacher MJ, Zhou X, Xiao Y, Ke A, et al. Cas11 enables genome engineering in human cells with compact CRISPR-Cas3 systems. Mol Cell. 2022;82:852-867.e855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Guo J, Gong L, Yu H, Li M, An Q, Liu Z, et al. Engineered minimal type I CRISPR-Cas system for transcriptional activation and base editing in human cells. Nat Commun. 2024;15:7277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Li Y, Huang B, Chen J, Huang L, Xu J, Wang Y, et al. Targeted large fragment deletion in plants using paired crRNAs with type I CRISPR system. Plant Biotechnol J. 2023;21:2196–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Wang J, Zhao D, Li J, Hu M, Xin X, Price MA, et al. Helicase-AID: a novel molecular device for base editing at random genomic loci. Metab Eng. 2021;67:396–402. [DOI] [PubMed] [Google Scholar]
  • 109.Xu R, Liu X, Li J, Qin R, Wei P. Identification of herbicide resistance OsACC1 mutations via in planta prime-editing-library screening in rice. Nat Plants. 2021;7:888–92. [DOI] [PubMed] [Google Scholar]
  • 110.Erwood S, Bily TMI, Lequyer J, Yan J, Gulati N, Brewer RA, et al. Saturation variant interpretation using CRISPR prime editing. Nat Biotechnol. 2022;40:885–95. [DOI] [PubMed] [Google Scholar]
  • 111.Ren X, Yang H, Nierenberg JL, Sun Y, Chen J, Beaman C, et al. High-throughput PRIME-editing screens identify functional DNA variants in the human genome. Mol Cell. 2023;83:4633-4645.e4639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Kim Y, Oh HC, Lee S, Kim HH. Saturation profiling of drug-resistant genetic variants using prime editing. Nat Biotechnol. 2024. 10.1038/s41587-024-02465-z [DOI] [PubMed]
  • 113.Gould SI, Wuest AN, Dong K, Johnson GA, Hsu A, Narendra VK, Atwa O, Levine SS, Liu DR, Sánchez Rivera FJ. High-throughput evaluation of genetic variants with prime editing sensor libraries. Nat Biotechnol. 2024. 10.1038/s41587-024-02172-9 [DOI] [PMC free article] [PubMed]
  • 114.Martyn GE, Montgomery MT, Jones H, Guo K, Doughty BR, Linder J, et al. Rewriting regulatory DNA to dissect and reprogram gene expression. Cell. 2025. 10.1016/j.cell.2025.03.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Xie J, Xiang J, Shen Y, Shao S. Mechanistic insights into the tools for intracellular protein delivery. Chem Bio Eng. 2025;2:132–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Halperin SO, Tou CJ, Wong EB, Modavi C, Schaffer DV, Dueber JE. CRISPR-guided DNA polymerases enable diversification of all nucleotides in a tunable window. Nature. 2018;560:248–52. [DOI] [PubMed] [Google Scholar]
  • 117.Tou CJ, Schaffer DV, Dueber JE. Targeted diversification in the S. cerevisiae genome with CRISPR-guided DNA polymerase I. ACS Synth Biol. 2020;9:1911–6. [DOI] [PubMed] [Google Scholar]
  • 118.Qi L, Sui Y, Tang XX, McGinty RJ, Liang XZ, Dominska M, et al. Shuffling the yeast genome using CRISPR/Cas9-generated DSBs that target the transposable Ty1 elements. PLoS Genet. 2023;19:e1010590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Koeppel J, Ferreira R, Vanderstichele T, Riedmayr LM, Peets EM, Girling G, et al. Randomizing the human genome by engineering recombination between repeat elements. Science. 2025;387:eado3979. [DOI] [PubMed] [Google Scholar]
  • 120.Koeppel J, Murat P, Girling G, Peets EM, Gouley M, Rebernig V, Maheshwari A, Hepkema J, Weller J, Johnkingsly Jebaraj JH, et al. Resolution of a human super-enhancer by targeted genome randomisation. bioRxiv 2025:2025.2001.2014.632548.
  • 121.Hwang J, Ye DY, Jung GY, Jang S. Mobile genetic element-based gene editing and genome engineering: recent advances and applications. Biotechnol Adv. 2024;72:108343. [DOI] [PubMed] [Google Scholar]
  • 122.Xu J, Sun Y, Wu J, Yang S, Yang L. Chromosome recombination and modification by LoxP-mediated evolution in Vibrio natriegens using CRISPR-associated transposases. Biotechnol Bioeng. 2024;121(3):1163–72. [DOI] [PubMed] [Google Scholar]
  • 123.Choi J, Chen W, Suiter CC, Lee C, Chardon FM, Yang W, et al. Precise genomic deletions using paired prime editing. Nat Biotechnol. 2022;40:218–26. [DOI] [PMC free article] [PubMed]
  • 124.Bock C, Datlinger P, Chardon F, Coelho MA, Dong MB, Lawson KA, et al. High-content CRISPR screening. Nat Rev Methods Primers. 2022;2:9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Pinglay S, Lalanne J-B, Daza RM, Kottapalli S, Quaisar F, Koeppel J, et al. Multiplex generation and single-cell analysis of structural variants in mammalian genomes. Science. 2025;387:eado5978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Cai Z, Xie W, Bao Z. Broadening the targetable space: engineering and discovery of PAM-flexible Cas proteins. Trends Microbiol. 2024;32:728–31. [DOI] [PubMed] [Google Scholar]
  • 127.Kim HK, Lee S, Kim Y, Park J, Min S, Choi JW, et al. High-throughput analysis of the activities of xCas9, SpCas9-NG and SpCas9 at matched and mismatched target sequences in human cells. Nat Biomed Eng. 2020;4:111–24. [DOI] [PubMed] [Google Scholar]
  • 128.Yang C, Zhou Z, Sun X, Ju H, Yue X, Rao S, et al. PAMless spRY exhibits a preference for the seed region for efficient targeting. Cell Rep. 2024;43:114225. [DOI] [PubMed] [Google Scholar]
  • 129.Jiang W, Feng S, Huang S, Yu W, Li G, Yang G, et al. BE-PLUS: a new base editing tool with broadened editing window and enhanced fidelity. Cell Res. 2018;28:855–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Wang Y, Zhou L, Liu N, Yao S. BE-PIGS: a base-editing tool with deaminases inlaid into Cas9 PI domain significantly expanded the editing scope. Signal Transduct Target Ther. 2019;4:36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Villiger L, Schmidheini L, Mathis N, Rothgangl T, Marquart K, Schwank G. Replacing the SpCas9 HNH domain by deaminases generates compact base editors with an alternative targeting scope. Mol Ther Nucleic Acids. 2021;26:502–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Strecker J, Ladha A, Gardner Z, Schmid-Burgk JL, Makarova KS, Koonin EV, et al. RNA-guided DNA insertion with CRISPR-associated transposases. Science. 2019;365:48–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Klompe SE, Vo PLH, Halpin-Healy TS, Sternberg SH. Transposon-encoded CRISPR-cas systems direct RNA-guided DNA integration. Nature. 2019;571:219–25. [DOI] [PubMed] [Google Scholar]
  • 134.Yarnall MTN, Ioannidi EI, Schmitt-Ulms C, Krajeski RN, Lim J, Villiger L, et al. Drag-and-drop genome insertion of large sequences without double-strand DNA cleavage using CRISPR-directed integrases. Nat Biotechnol. 2023;41:500–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Chen F, Lian M, Ma B, Gou S, Luo X, Yang K, et al. Multiplexed base editing through Cas12a variant-mediated cytosine and adenine base editors. Commun Biol. 2022;5:1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Geurts MH, Gandhi S, Boretto MG, Akkerman N, Derks LLM, van Son G, et al. One-step generation of tumor models by base editor multiplexing in adult stem cell-derived organoids. Nat Commun. 2023;14:4998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Wu Y, Li Y, Liu Y, Xiu X, Liu J, Zhang L, et al. Multiplexed in-situ mutagenesis driven by a dCas12a-based dual-function base editor. Nucleic Acids Res. 2024;52:4739–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Gupta A, Liu B, Raza S, Chen QJ, Yang B. Modularly assembled multiplex prime editors for simultaneous editing of agronomically important genes in rice. Plant Commun. 2024;5:100741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Si T, Chao R, Min Y, Wu Y, Ren W, Zhao H. Automated multiplex genome-scale engineering in yeast. Nat Commun. 2017;8:15187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Li S, An J, Li Y, Zhu X, Zhao D, Wang L, et al. Automated high-throughput genome editing platform with an AI learning in situ prediction model. Nat Commun. 2022;13:7386. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

13059_2025_3756_MOESM1_ESM.xlsx (14.5KB, xlsx)

Additional file 1: Table S1. Library size, editing window, and mutation type of chromosomal gene diversification approaches. This table contains the source data used for plotting Fig. 7.

Data Availability Statement

No datasets were generated or analysed during the current study.


Articles from Genome Biology are provided here courtesy of BMC

RESOURCES