Skip to main content
National Science Review logoLink to National Science Review
. 2023 Mar 17;10(5):nwad073. doi: 10.1093/nsr/nwad073

Dynamics of synthetic yeast chromosome evolution shaped by hierarchical chromatin organization

Sijie Zhou 1,2,b, Yi Wu 3,4,b, Yu Zhao 5,b, Zhen Zhang 6,7, Limin Jiang 8, Lin Liu 9, Yan Zhang 10, Jijun Tang 11,12, Ying-Jin Yuan 13,14,
PMCID: PMC10202648  PMID: 37223244

ABSTRACT

Synthetic genome evolution provides a dynamic approach for systematically and straightforwardly exploring evolutionary processes. Synthetic Chromosome Rearrangement and Modification by LoxP-mediated Evolution (SCRaMbLE) is an evolutionary system intrinsic to the synthetic yeast genome that can rapidly drive structural variations. Here, we detect over 260000 rearrangement events after the SCRaMbLEing of a yeast strain harboring 5.5 synthetic yeast chromosomes (synII, synIII, synV, circular synVI, synIXR and synX). Remarkably, we find that the rearrangement events exhibit a specific landscape of frequency. We further reveal that the landscape is shaped by the combined effects of chromatin accessibility and spatial contact probability. The rearrangements tend to occur in 3D spatially proximal and chromatin-accessible regions. The enormous numbers of rearrangements mediated by SCRaMbLE provide a driving force to potentiate directed genome evolution, and the investigation of the rearrangement landscape offers mechanistic insights into the dynamics of genome evolution.

Keywords: synthetic biology, genome rearrangement, hierarchical chromatin organization, synthetic yeast genome, SCRaMbLE, Saccharomyces cerevisiae


A specific landscape of over 260,000 rearrangements in SCRaMbLEd yeast harboring 5.5 synthetic chromosomes was uncovered and found to be a result of combinatorial effects of chromatin accessibility and 3D spatial contact probability.

INTRODUCTION

Studying the processes and mechanisms of genome evolution is critical to understanding genetics and species diversity at the genomic level [1–4]. Saccharomyces cerevisiae, a powerful model organism for studying eukaryotic genome evolution [5–7], has been subjected to comparative genomic studies that have provided mechanistic insights [8–12]. These studies relied on relatively static genomic sequences and may have missed many details of process dynamics. Synthetic genomes, assembled from scratch and incorporated with a variety of designer features, have greatly facilitated the study of genome evolution and engineering, such as genome minimization [13], genetic codon recoding [14,15], the introduction of synthetic components [16] and data storage [17,18]. Synthetic Chromosome Rearrangement and Modification by LoxP-mediated Evolution (SCRaMbLE), with symmetrical loxP sites (loxPsym) positioned in the 3′ untranslated regions (3′ UTRs) of almost all non-essential genes in the Synthetic Yeast Genome Project (Sc2.0), has also recently been used in the study of genome evolution [16,19]. Induced Cre recombinase activity quickly triggers recombination between loxPsym sites and generates structural variations, including deletions, inversions, duplications and translocations [20–27].

In this study, we consolidated 5.5 synthetic chromosomes (synII, synIII, synV, circular synVI, synIXR and synX) in a single haploid strain with an orthogonal site-specific recombination system, which enabled the eliminations of counterpart native chromosomes [19,28–32]. In this strain with nearly a quarter of a synthetic genome, we induced SCRaMbLE and generated massive rearrangement events involving both intra- and inter-chromosomal recombination. We comprehensively sequenced this SCRaMbLEd pool and detected over 260000 rearrangement events via a loxPsym junction analysis method. By analyzing each rearrangement event, we revealed a stable rearrangement landscape among synthetic chromosomes that was correlated with local chromatin structures and the three-dimensional genome architecture revealed via assay for transposase-accessible chromatin sequencing (ATAC-seq) and genome-wide chromosome conformation capture (Hi-C). Our study provides insight into the combinatorial effect of hierarchical chromatin organization on the dynamics of genome evolution.

RESULTS

Consolidation and SCRaMbLE of six synthetic chromosomes

To build the strain with multiple synthetic chromosomes, we used a stepwise method, starting with two haploid strains of opposite mating types, yYW169 (synV, synX) [33] and yZY192 (synII, synIII, synVI and synIXR) [16]. To accelerate consolidation, we developed a new strategy using chromosome elimination by Vika/vox, a site-specific recombination system orthogonal to Cre/loxP (Fig. 1A) [34,35]. First, we built a heterozygous diploid strain yYW268 by mating yYW169 and yZY192 (Fig. S1A). The centromeres of native chromosomes (II, III, V, VI, IX and X) were successively excised by Vika/vox recombination, leading to elimination of the chromosomes (Fig. S1B). We then obtained the yYW393 strain, whose ploidy was further tested by Next-Generation Sequencing (NGS) and flow cytometry. We confirmed that yYW393 was a diploid strain (Figs S2 and S3), indicating that endoreduplication of the counterpart synthetic chromosomes occurred after targeted elimination of native chromosomes by Vika/vox. We characterized the growth phenotypes of yYW393 and found that the yYW393 strain grew slightly slower on rich medium at a low temperature (25°C) (Fig. S4). Following sporulation and tetrad dissection, one haploid strain (yYW394), in which the six aforementioned native chromosomes were replaced with synthetic chromosomes, was obtained, comprising ∼2.61 Mb (∼22.0% of the yeast genome) (Table S1). Its karyotype and genome sequence were confirmed by NGS, flow cytometry and pulsed-field gel electrophoresis (Figs S2, S3, S5B and Table 1). The yYW394 strain grew robustly at 30°C but exhibited a growth defect at 37°C (Fig. S5C). To recover its fitness to wild-type levels, the adaptive laboratory evolution of yYW394 was performed, generating an evolved strain yZSJ025, which was used for further analysis (Fig. S5). We then performed spotting assays for yZSJ025 under 30 various conditions. Although growth of yZSJ025 on the Yeast Peptone Dextrose (YPD) agar plate at 37ºC has been largely recovered, it is obviously still slower-growing under some other tested conditions (Fig. S6). We speculated that the removal of tRNA genes from their native loci may be the primary reason for the growth defects. A recent investigation revealed that addition of a tRNA array can recover growth defects of a synthetic yeast strain [36].

Figure 1.

Figure 1.

Consolidation and SCRaMbLE of six synthetic chromosomes. (A) Schematic diagram showing the construction of the synthetic yeast strain yZSJ025 and the analysis of rearrangement events following SCRaMbLE. (B) Circos plot of intra-chromosomal rearrangement events in yZSJ025. (C) Circos plot of inter-chromosomal rearrangement events in yZSJ025. Rearrangement events between two loxPsym sites are connected with a line and the color intensity represents the frequency of the event. (D) Classifications of rearrangement events with percentages of different groups calculated by event numbers and read numbers respectively. Different groups of events are labeled in different colors as indicated. (E) Correlation analyses between the rearrangement frequency and the number of loxPsym sites on each chromosome. Pearson correlation analysis was applied to determine the correlation coefficient and associated P values. SynVI is a ring chromosome.

Table 1.

Overview of sequencing data.

Sample name BioSample Accession number Type
yZY192 SAMN31499054 PRJNA895231 Genome_sequencing
yYW393 SAMN31499055 PRJNA895231 Genome_sequencing
yYW394 SRX10191928 PRJNA705059 Genome_sequencing
yZSJ025 SRX10191929 PRJNA705059 Genome_sequencing
yZSJ364 SRX10191930 PRJNA705059 Genome_sequencing
yZSJ365 SRX10191931 PRJNA705059 Genome_sequencing
yZSJ400 SRX10191933 PRJNA705059 Genome_sequencing
yZSJ490 SRX10191932 PRJNA705059 Genome_sequencing
yZSJ025-ATACseq-rep1 SAMN18129009 GSE168182 ATAC sequencing
yZSJ025-ATACseq-rep2 SAMN18129008 GSE168182 ATAC sequencing
yZSJ025-HiC-rep1 SAMN18129007 GSE168182 HiC sequencing
yZSJ025-HiC-rep2 SAMN18129006 GSE168182 HiC sequencing
yZSJ025 GSE219048 RNA sequencing

The evolved strain yZSJ025, comprising 894 loxPsym sites, was transformed with the Cre recombinase expression plasmid pYW085 (pRS413-pCLB2-Cre-EBD) (Fig. S7). SCRaMbLE was induced by the addition of β-estradiol. The SCRaMbLEd cells were then diluted in fresh YPD liquid medium without β-estradiol (Fig. 1A), and subjected to deep sequencing (∼600000×). All reads (150 bp each) were screened for the presence of loxPsym sequences and then aligned to the reference sequence of the original synthetic chromosomes. Those with flanking sequences differing from the references, hereafter referred to as rearrangement reads, were then used to identify and classify rearrangement events [25] (Fig. S8). Identical reads were considered to represent a single rearrangement event. In total, 263520 rearrangement events, including 124499 (47.24%) intra- and 139021 (52.76%) inter-chromosomal events, were detected (Figs 1B and C). We further analyzed the intra-chromosomal rearrangement events and found 62106 (23.57%) inversions, 22526 (8.55%) deletions and 39867 (15.13%) complex rearrangement events, such as duplications and circularization events (Figs 1D and S8). We also counted the reads corresponding to each rearrangement. The numbers of identical reads represent the frequencies of corresponding rearrangement events. Inter-chromosomal recombination accounted for only 9.87% of the total read numbers, indicating that these are relatively low frequency events compared to intra-chromosomal rearrangements. We next investigated whether there is any chromosome preference in rearrangements. To answer this question, we plotted the number of rearrangements involving each chromosome against the number of loxPsym sites on that chromosome. As expected, the number of rearrangements was correlated with the number of loxPsym sites per chromosome (r = 0.96, P = 0.002) (Fig. 1E).

A specific rearrangement pattern of the synthetic yeast chromosomes

In theory, SCRaMbLE can generate rearrangements between any two loxPsym sites on synthetic chromosomes. However, different chromosomal loci showed high or low rearrangement frequencies (Figs 1B and C). To estimate the local rearrangement frequencies along the synthetic chromosomes in our SCRaMbLEd pool, we counted the number of rearrangement reads from each loxPsym site, generating an observable landscape of rearrangement frequencies (Fig. 2A). Seventeen loxPsym sites were inserted in synthetic telomere regions, where the flanking sequences were not distinguishable from each other; these sites were excluded from the identification of rearrangement reads. Notably, rearrangement reads were identified from all 877 loxPsym sites, indicating that the sequencing coverage was sufficient to include all loci. The loxPsym sites differed from each other in their rearrangement frequencies, ranging from 1027 to 44353 per site. These 877 loxPsym sites were divided into 10% steps according to rearrangement frequencies (Fig. 2B). We defined 88 loxPsym sites with the highest rearrangement frequencies as hotspots and 88 loxPsym sites with the lowest rearrangement frequencies as coldspots. The cut-offs for hotspots and coldspots were >17800 and <3400 rearrangement events per site respectively. These rearrangement frequencies were significantly different from the average frequency of all loxPsym sites (Fig. 2B). Interestingly, the rearrangement frequency landscapes of independent SCRaMbLE experiments were highly reproducible for all the synthetic chromosomes, indicating that certain areas of the genome have the potential to undergo rearrangement at higher frequencies (Figs S9 and S10).

Figure 2.

Figure 2.

Investigating rearrangement frequencies revealed a specific rearrangement pattern among synthetic yeast chromosomes. (A) Landscape of rearrangement frequencies along synII, synIII, synV, synVI, synIXR and synX. Seventeen loxPsym sites were inserted in the synthetic telomere regions, where flanking sequences were not distinguishable from each other; these sites were excluded from the identification of rearrangement reads. Dark dots indicate centromeres. (B) Comparison of rearrangement frequencies at 877 loxPsym sites. These 877 loxPsym sites were divided into 10% steps according to rearrangement frequencies. The points represent the rearrangement frequencies at each loxPsym site. Horizontal lines indicate weighted means. P values were calculated using two-tailed paired two-sample t-tests. (C) Intra-chromosomal rearrangement patterns of synV following SCRaMbLE in three different yeast strains, yXZX846 (synV), yYW169 (synV, synX) and yZSJ025 respectively. Pearson correlation analysis was applied to determine the correlation coefficients and associated P values. The dotted line indicates centromeres. SynVI is a ring chromosome.

We also compared the rearrangement patterns of the same synthetic chromosome in separate strains containing different numbers of synthetic chromosomes. Similar intra-chromosomal rearrangement patterns were observed for synV in the three synV-containing strains (yXZX846, yYW169 and yZSJ025) (Figs 2C and S11A–C). The same result was observed for synX in the two synX-containing strains (yYW169 and yZSJ025) (Figs S11D and E). Overall, our results demonstrate that the SCRaMbLE patterns followed the same biological reason on each synthetic chromosome with reproducible rearrangement hotspots and coldspots. In addition, we also carried out RNA-seq to investigate the correlation of gene transcription and adjacent recombination frequency of yZSJ025, which showed no obvious correlation, with an R value of 0.16 (Fig. S12).

Rearrangement frequency correlates with chromatin accessibility

To determine whether the specific rearrangement patterns we observed were due to an underlying biological mechanism, we further investigated the genomic landscape. SCRaMbLE requires physical interaction between Cre recombinase and loxPsym sites, suggesting the importance of chromatin accessibility in determining the rearrangement frequency in SCRaMbLEd cells. To test this hypothesis, we measured genome-wide chromatin accessibility in the yZSJ025 strain by ATAC-seq [37]. The ATAC-seq signals from windows 400 bp upstream and downstream of each loxPsym site were collected and processed. The signals from each loxPsym site were normalized with the average signals of all 877 loxPsym sites. We analyzed the ATAC-seq signals of 877 loxPsym sites in 20% steps based on rearrangement frequencies. Interestingly, ATAC signals are well correlated with the rearrangement frequencies (Fig. S13). We further analyzed the defined rearrangement hotspots and coldspots. Statistical analysis showed that the average ATAC-seq signals of hotspots were significantly higher, while coldspot signals were significantly weaker (Fig. 3A). The rearrangement frequencies and ATAC-seq signals of a typical loxPsym hotspot and coldspot from synX are shown in Fig. 3B. The presented coldspot, the loxPsym site in the SET4 3’ UTR, showed a weak ATAC-seq signal, while as a hotspot, the loxPsym site in the PRY3 3′ UTR showed a strong ATAC-seq signal. In addition, we observed that nucleosome occupancy was relatively low in hotspots and high in coldspots (Fig. S14). Both ATAC-seq and nucleosome occupancy analyses indicated that chromatin accessibility is critical for determining the patterns of rearrangement frequencies.

Figure 3.

Figure 3.

Correlation of rearrangement frequency with chromatin accessibility. (A) Average ATAC-seq signals of the hotspot- and coldspot-centered 800 bp regions. (B) ATAC-seq signals for a typical rearrangement hotspot and a coldspot in the 186–300 kb region of synX.

We then aligned the hotspots and coldspots to the corresponding regions of native chromosomes. Using the same approach applied previously, we extracted and analyzed pre-published ATAC-seq data from wild-type yeast [37] (Fig. S15). ATAC-seq signals peaked at the positions of hotspots, and remained weak at coldspots on all six chromosomes, suggesting that sequence modifications of Sc2.0 do not perturb chromatin accessibility.

Rearrangement frequency also correlates with 3D chromosome conformation

Next, we aimed to explore whether the rearrangement frequency was correlated with the spatial proximity of loxPsym sites. Our SCRaMbLE system provided a unique platform for statistically evaluating the role of spatial proximity in chromosomal rearrangements. A Hi-C analysis was thus carried out for yZSJ025, generating a contact map with the frequencies of spatial contacts between any two genomic loci (Fig. S16). We extracted contact frequencies from the synthetic chromosomes (Fig. 4A), and plotted rearrangement frequencies in a similar heatmap to facilitate direct comparison (Fig. 4B). The regions near the diagonal represent proximal loci on each chromosome. Accordingly, these intra-chromosomal regions were the ‘hottest’ in the Hi-C map (Fig. 4A), and they also exhibited the highest rearrangement frequency (Fig. 4B). Rearrangements tended to occur most frequently between adjacent loxPsym sites for all synthetic chromosomes, as shown by the statistical analysis of all rearrangement reads (Fig. S17). Although inclusion of a circular chromosome in our system was not originally designed, it served as a great internal control and gave the opportunity to explore the difference between the rearrangement of the ring chromosome and that of other linear chromosomes in a cell. Compared to other linear chromosomes, the ring chromosome exhibited higher frequencies of SCRaMbLE events between two peritelomeric regions that are now adjacent to each other (Figs 4C and D, and S18). Indeed, a similar observation on a different yeast circular chromosome, synII, agreed with our conclusion that circular chromosomes exhibit stronger frequencies of SCRaMbLE events between two peritelomeric regions [38].

Figure 4.

Figure 4.

Figure 4.

Correlation of the rearrangement frequency with 3D chromosome conformation. (A) Hi-C heatmap. The heatmap value for a site (i, j) is the contact probability (log2) between genomic loci i (horizontal axis) and j (vertical axis). Both axes are displayed at a 1 kb resolution. Spots with different heatmap values from low to high are colored from light yellow to red as indicated. (B) Rearrangement frequency heatmap. The heatmap value for a site (i, j) is the rearrangement frequency (log2) of the event occurring at loci i (vertical axis) and j (horizontal axis). Spots with different heatmap values are labeled in red with different intensities as indicated. (C) Hi-C heatmap and rearrangement frequency heatmap of synV. (D) Hi-C heatmap and rearrangement frequency heatmap of ring synVI. (E) 3D structures of synthetic chromosomes inferred from the Hi-C contact map displayed in panel (A). Synthetic chromosomes are labeled with different colors and the native chromosomes are shown in gray. Centromeres and telomeres are represented by dark and red spheres respectively. Dark and red dashed circles represent centromere and telomere regions respectively. (F) Comparisons of the contact probability and frequency of rearrangement events at loci in pericentromeric regions (CEN) and in other regions (Others). CENs are centromere-centered 10 kb regions. (G) Comparisons of the contact probability and frequency of rearrangement events at loci in peritelomeric regions (TEL) and in other regions (Others). TELs are ≤5 kb from telomeres. For (D) and (E), each data point represents a 1 kb bin in regions as indicated. Horizontal lines indicate weighted means. P values were calculated using two-tailed paired two-sample t-tests. SynVI is a ring chromosome.

Notably, the inter-chromosomal contact probability in the regions of centromeres and telomeres was obviously higher than that in other regions (Fig. 4A), consistent with centromeres being clustered around the spindle pole body and telomeres being clustered with the nuclear envelope in both wild-type [39] and synthetic chromosomes [33] (Fig. 4E). The 3D conformations showed the circularity of synVI. Compared to other regions, pericentromeric regions exhibited a higher inter-chromosomal rearrangement frequency (Fig. 4B and F). Similar results were found for the peritelomeric regions of the synthetic chromosomes (Fig. 4B and G). Taken together, our results suggested that rearrangement events are generally more likely to occur between genomic loci in closer spatial proximity to each other.

Combinatorial effects of chromatin accessibility and chromosome conformation

As both chromatin accessibility and spatial proximity influence rearrangement frequency, we hypothesized that they have combinatorial effects on yeast chromosome organization. We tested our proposed model based on individual rearrangement events. The results indicated that if two loxPsym sites showed similar chromatin accessibility (Fig. 5A), the frequency of rearrangement events was determined by their spatial proximity (Fig. 5B). For loxPsym sites with similar spatial proximity (Fig. 5C), the rearrangement frequency was mainly affected by their chromatin accessibility (Fig. 5D). Most interestingly, if two loxPsym sites presented an open chromatin conformation but were spatially distant, they presented a similar recombination frequency to other pairs of loxPsym sites that were proximal but hidden within closed chromatin structures (Figs 5E and F). In other words, the effect of chromatin accessibility on the rearrangement frequency could be compensated by the difference in spatial proximity, and vice versa. More generally, the chromatin architecture and spatial conformation in the yeast genome affected the rearrangement frequency in a combinatorial manner during SCRaMbLE.

Figure 5.

Figure 5.

Mechanistic models of the effects of hierarchical chromatin organization on chromosomal rearrangement. (A) A schematic model interpreting the differences in the frequencies of two rearrangement events with different contact probabilities. (B) The loxPsym positions of pair 1: synX 183 kb and synX 184 kb; pair 2: synX 184 kb and synX 342 kb. (C) A schematic model interpreting the difference in the frequencies of two rearrangement events under differences in chromatin accessibility. (D) The loxPsym positions of pair 3: synII 186 kb and synII 204 kb; pair 4: synV 346 kb and synV 384 kb. (E) A schematic model interpreting two rearrangement events with similar frequencies as a result of counter-effects according to contact probability and chromatin accessibility. (F) The loxPsym positions of pair 5: synVI 125 kb and synVI 202 kb; pair 6: synII 504 kb and synII 507 kb.

DISCUSSION

In this study, we induced SCRaMbLE in a novel strain with multiple synthetic chromosomes. Over 260000 rearrangement events were detected in the SCRaMbLEd pool, covering all 877 loxPsym sites. This finding revealed the tremendous plasticity of the yeast genome. Previous studies involving SCRaMbLE have mainly focused on the correlation of chromosomal rearrangement with nucleotide sequences [21,23,25,27,40]. The biochemical essence of SCRaMbLE is the recombination between pairs of different loxPsym sites on synthetic chromosomes catalyzed by Cre recombinase [41]. We speculate that local chromatin structure affects rearrangement through the accessibility of loxPsym sites to Cre and that 3D chromosome conformation affects the contact probability at any two loxPsym sites. In this study, we discovered that the rearrangement frequency landscape was indeed molded by chromatin structure and that the recombination hotspots tended to occur in 3D spatially proximal and chromatin-accessible regions. Similarly, it was observed in non-synthetic chromosomes that the frequency of double strand break (DSB), a main trigger of chromosomal recombination, is also associated with chromosome accessibility [42,43]. Large chromosome structural variations, including simple Structure Variantions (SVs) (e.g. deletions, duplications and inversions), chromosome fusion, copy-number alterations and other complex SVs, have been found to be associated with certain types of cancers [44,45]. Recently, researches focused on the occurrence of such variations that lead to cancers highlighted the importance of spatial contact probability within the 3D organization in this process [46,47].

The use of synthetic chromosomes and SCRaMbLE allowed us to statistically demonstrate that inter-chromosomal rearrangement hotspots strikingly clustered at the peritelomeric and pericentromeric regions. Our results are consistent with previous reports showing that increased inter-chromosomal reshuffling has occurred in the peritelomeric regions of domesticated and wild yeast isolates during genome evolution [8,11]. Peritelomeric regions are functionally enriched for genes involved in secondary metabolism and stress responses that contribute to environmental adaptation [48–51]. A high frequency of rearrangement in these regions could thus be an important driving force for evolution. However, others were surprisingly found located in the pericentromeric regions. Centromeric rearrangements may result in the formation of acentric or dicentric chromosomes, and thus lead to cell death, and could not be detected as rearrangement hotspots in natural living cells [52]. In addition, centromeric rearrangements result in the swapping of chromosomal arms between two chromosomes, which can cause reproductive isolation and promote incipient speciation [53–55]. Taken together, recombination events in the pericentromeric regions are rarely observed in nature, and sequences in these regions are believed to be relatively stable across species. Nevertheless, the sequencing data of our SCRaMbLEd pool may reveal the hidden fact that the pericentromeric regions of S. cerevisiae are prone to rearrange at the beginning, but the rearrangements are evolutionarily eliminated in the later adaptive process. Taking the combinatorial effect of chromatin accessibility and chromosome conformation into consideration, we speculated that chromatin structures might play an important role in genome evolution in terms of the effects on rearrangements in peritelomeric and pericentromeric regions.

Consolidation of all 16 yeast synthetic chromosomes and novel tRNA neochromosome in the final Sc2.0 strain is likely to happen soon [16,36,56]. The flexible and controllable synthetic yeast genome provides a unique model to systematically interrogate and explore the dynamics of eukaryotic genome evolution. The investigation of a large number of rearrangements indicated the tremendous plasticity of the yeast genome and the importance of hierarchical chromatin organization to the regional rates of chromosomal variation during genome evolution. These findings provide crucial insights into the variability and complexity of synthetic genome design. Additionally, the influence of chromatin organization needs to be considered in the design and engineering of higher organism genomes.

Data Availability

Genome sequencing data have been submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) under accession numbers PRJNA705059 and PRJNA895231. The ATAC-seq and Hi-C sequencing data have been submitted to the NCBI Gene Expression Omnibus (GEO) under accession number GSE168182. The ATAC-seq data of wild-type S. cerevisiae are available from GEO under accession number GSE66386.

Supplementary Material

nwad073_Supplemental_File

ACKNOWLEDGEMENTS

We thank Jef D. Boeke from NYU Langone Health for discussions and advice. We also thank Stephanie Lauer for helpful comments during manuscript preparation. We are grateful to Jef D. Boeke and Leslie A. Mitchell from NYU Langone Health, Huanming Yang, Yue Shen, Yun Wang and Tai Chen from BGI-Shenzhen, and Yizhi Cai from the University of Manchester, for strain sharing and technical support. Figure 5 was created with BioRender.com.

Contributor Information

Sijie Zhou, Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin 300072, China; Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.

Yi Wu, Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin 300072, China; Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.

Yu Zhao, Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.

Zhen Zhang, Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin 300072, China; Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.

Limin Jiang, School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China.

Lin Liu, Epigenetic Group, Frasergen Bioinformatics Co., Ltd., Wuhan 430000, China.

Yan Zhang, Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin 300072, China.

Jijun Tang, School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin 300350, China; Department of Computer Science, University of South Carolina, Columbia, SC 29208, USA.

Ying-Jin Yuan, Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin University, Tianjin 300072, China; Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.

FUNDING

This work was supported by the National Key Research and Development Program of China (2021YFC2100800) and the National Natural Science Foundation of China (21621004, 31861143017 and 31971351). This work is part of the Synthetic Yeast Genome Project (http://syntheticyeast.org/sc2-0/).

AUTHOR CONTRIBUTIONS

Conceptualization: Y.J.Y., Y.W.; experiments: S.Z., Y.Z., Z.Z.; analysis: Y.J.Y., Y.W., S.Z., Z.Z., L.J., L.L.; writing: Y.J.Y., Y.W., S.Z., Y.Z., J.T.

Conflict of interest statement

None declared.

REFERENCES

  • 1. Good BH, McDonald MJ, Barrick JEet al. The dynamics of molecular evolution over 60,000 generations. Nature 2017; 551: 45–50. 10.1038/nature24287 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Jasinska W, Manhart M, Lerner Jet al. Chromosomal barcoding of E. Coli populations reveals lineage diversity dynamics at high resolution. Nat Ecol Evol 2020; 4: 437–52. 10.1038/s41559-020-1103-z [DOI] [PubMed] [Google Scholar]
  • 3. Johnson MS, Martsul A, Kryazhimskiy Set al. Higher-fitness yeast genotypes are less robust to deleterious mutations. Science 2019; 366: 490–3. 10.1126/science.aay4199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Nguyen Ba AN, Cvijović I, Rojas Echenique JIet al. High-resolution lineage tracking reveals travelling wave of adaptation in laboratory yeast. Nature 2019; 575: 494–9. 10.1038/s41586-019-1749-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Peter J, Schacherer J. Population genomics of yeasts: towards a comprehensive view across a broad evolutionary scale. Yeast 2016; 33: 73–81. 10.1002/yea.3142 [DOI] [PubMed] [Google Scholar]
  • 6. Gallone B, Steensels J, Prahl Tet al. Domestication and divergence of Saccharomyces cerevisiae beer yeasts. Cell 2016; 166: 1397–410.e16. 10.1016/j.cell.2016.08.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Botstein D, Fink GR. Yeast: an experimental organism for 21st century biology. Genetics 2011; 189: 695–704. 10.1534/genetics.111.130765 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Peter J, De Chiara M, Friedrich Aet al. Genome evolution across 1,011 Saccharomyces cerevisiae isolates. Nature 2018; 556: 339–44. 10.1038/s41586-018-0030-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Shen X, Opulente DA, Kominek Jet al. Tempo and mode of genome evolution in the budding yeast subphylum. Cell 2018; 175: 1533–45.e20. 10.1016/j.cell.2018.10.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Marsit S, Leducq J, Durand Éet al. Evolutionary biology through the lens of budding yeast comparative genomics. Nat Rev Genet 2017; 18: 581–98. 10.1038/nrg.2017.49 [DOI] [PubMed] [Google Scholar]
  • 11. Yue J, Li J, Aigrain Let al. Contrasting evolutionary genome dynamics between domesticated and wild yeasts. Nat Genet 2017; 49: 913–24. 10.1038/ng.3847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Dujon B. Yeast evolutionary genomics. Nat Rev Genet 2010; 11: 512–24. 10.1038/nrg2811 [DOI] [PubMed] [Google Scholar]
  • 13. Hutchison CA, Chuang RY, Noskov VNet al. Design and synthesis of a minimal bacterial genome. Science 2016; 351: d6253. 10.1126/science.aad6253 [DOI] [PubMed] [Google Scholar]
  • 14. Ostrov N, Landon M, Guell Met al. Design, synthesis, and testing toward a 57-codon genome. Science 2016; 353: 819–22. 10.1126/science.aaf3639 [DOI] [PubMed] [Google Scholar]
  • 15. Fredens J, Wang K, de la Torre Det al. Total synthesis of Escherichia coli with a recoded genome. Nature 2019; 569: 514–8. 10.1038/s41586-019-1192-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Richardson SM, Mitchell LA, Stracquadanio Get al. Design of a synthetic yeast genome. Science 2017; 355: 1040–4. 10.1126/science.aaf4557 [DOI] [PubMed] [Google Scholar]
  • 17. Chen W, Han M, Zhou Jet al. An artificial chromosome for data storage. Natl Sci Rev 2021; 8: nwab028. 10.1093/nsr/nwab028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Lu X, Ellis T. Self-replicating digital data storage with synthetic chromosomes. Natl Sci Rev 2021; 8: nwab086. 10.1093/nsr/nwab086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Dymond JS, Richardson SM, Coombes CEet al. Synthetic chromosome arms function in yeast and generate phenotypic diversity by design. Nature 2011; 477: 471–6. 10.1038/nature10403 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Blount BA, Gowers GF, Ho JCHet al. Rapid host strain improvement by in vivo rearrangement of a synthetic yeast chromosome. Nat Commun 2018; 9: 1932. 10.1038/s41467-018-03143-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Jia B, Wu Y, Li Bet al. Precise control of SCRaMbLE in synthetic haploid and diploid yeast. Nat Commun 2018; 9: 1933. 10.1038/s41467-018-03084-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Liu W, Luo Z, Wang Yet al. Rapid pathway prototyping and engineering using in vitro and in vivo synthetic genome scramble-in methods. Nat Commun 2018; 9: 1936. 10.1038/s41467-018-04254-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Luo Z, Wang L, Wang Yet al. Identifying and characterizing SCRaMbLEd synthetic yeast using ReSCuES. Nat Commun 2018; 9: 1930. 10.1038/s41467-017-00806-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Shen MJ, Wu Y, Yang Ket al. Heterozygous diploid and interspecies SCRaMbLEing. Nat Commun 2018; 9: 1934. 10.1038/s41467-018-04157-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Shen Y, Stracquadanio G, Wang Yet al. SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes. Genome Res 2016; 26: 36–49. 10.1101/gr.193433.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Wang J, Xie ZX, Ma Yet al. Ring synthetic chromosome V SCRaMbLE. Nat Commun 2018; 9: 3783. 10.1038/s41467-018-06216-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Wu Y, Zhu R, Mitchell LAet al. In vitro DNA SCRaMbLE. Nat Commun 2018; 9: 1935. 10.1038/s41467-018-03743-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Mitchell LA, Wang A, Stracquadanio Get al. Synthesis, debugging, and effects of synthetic chromosome consolidation: synVI and beyond. Science 2017; 355: f4831. 10.1126/science.aaf4831 [DOI] [PubMed] [Google Scholar]
  • 29. Shen Y, Wang Y, Chen Tet al. Deep functional analysis of synII, a 770-kilobase synthetic yeast chromosome. Science 2017; 355: f4791. 10.1126/science.aaf4791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Wu Y, Li B, Zhao Met al. Bug mapping and fitness testing of chemically synthesized chromosome X. Science 2017; 355: f4706. 10.1126/science.aaf4706 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Xie Z, Li B, Mitchell LAet al. “Perfect” designer chromosome V and behavior of a ring derivative. Science 2017; 355: f4704. 10.1126/science.aaf4704 [DOI] [PubMed] [Google Scholar]
  • 32. Annaluru N, Muller H, Mitchell LAet al. Total synthesis of a functional designer eukaryotic chromosome. Science 2014; 344: 55–8. 10.1126/science.1249252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Mercy G, Mozziconacci J, Scolari VFet al. 3D organization of synthetic and SCRaMbLEd chromosomes. Science 2017; 355: f4597. 10.1126/science.aaf4597 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Lin Q, Qi H, Wu Yet al. Robust orthogonal recombination system for versatile genomic elements rearrangement in yeast Saccharomyces cerevisiae. Sci Rep 2015; 5: 15249. 10.1038/srep15249 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Karimova M, Abi-Ghanem J, Berger Net al. Vika/vox, a novel efficient and specific Cre/loxp-like site-specific recombination system. Nucleic Acids Res 2013; 41: e37. 10.1093/nar/gks1037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Zhao Y, Coelho C, Hughes ALet al. Debugging and consolidating multiple synthetic chromosomes reveals combinatorial genetic interactions. Biorxiv 2022; 2022–4 10.1101/2022.04.11.486913. [DOI] [PubMed] [Google Scholar]
  • 37. Schep AN, Buenrostro JD, Denny SKet al. Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions. Genome Res 2015; 25: 1757–70. 10.1101/gr.192294.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Zhang H, Fu X, Gong Xet al. Systematic dissection of key factors governing recombination outcomes by GCE-SCRaMbLE. Nat Commun 2022; 13: 5836. 10.1038/s41467-022-33606-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Duan Z, Andronescu M, Schutz Ket al. A three-dimensional model of the yeast genome. Nature 2010; 465: 363–7. 10.1038/nature08973 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Wang P, Xu H, Li Het al. SCRaMbLEing of a synthetic yeast chromosome with clustered essential genes reveals synthetic lethal interactions. ACS Synth Biol 2020; 9: 1181–9. 10.1021/acssynbio.0c00059 [DOI] [PubMed] [Google Scholar]
  • 41. Guo F, Gopaul DN, Van Duyne GD. Structure of Cre recombinase complexed with DNA in a site-specific recombination synapse. Nature 1997; 389: 40–6. 10.1038/37925 [DOI] [PubMed] [Google Scholar]
  • 42. Pan J, Sasaki M, Kniewel Ret al. A hierarchical combination of factors shapes the genome-wide topography of yeast meiotic recombination initiation. Cell 2011; 144: 719–31. 10.1016/j.cell.2011.02.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Tock AJ, Henderson IR. Hotspots for initiation of meiotic recombination. Front Genet 2018; 9: 521. 10.3389/fgene.2018.00521 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Hadi K, Yao X, Behr JMet al. Distinct classes of complex structural variation uncovered across thousands of cancer genome graphs. Cell 2020; 183: 197–210.e32. 10.1016/j.cell.2020.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Beroukhim R, Mermel CH, Porter Det al. The landscape of somatic copy-number alteration across human cancers. Nature 2010; 463: 899–905. 10.1038/nature08822 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Zhang Y, McCord RP, Ho Yet al. Spatial organization of the mouse genome and its role in recurrent chromosomal translocations. Cell 2012; 148: 908–21. 10.1016/j.cell.2012.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Fudenberg G, Getz G, Meyerson Met al. High order chromatin architecture shapes the landscape of chromosomal alterations in cancer. Nat Biotechnol 2011; 29: 1109–13. 10.1038/nbt.2049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Ames RM, Rash BM, Hentges KEet al. Gene duplication and environmental adaptation within yeast populations. Genome Biol Evol 2010; 2: 591–601. 10.1093/gbe/evq043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Bergström A, Simpson JT, Salinas Fet al. A high-definition view of functional genetic variation from natural yeast genomes. Mol Biol Evol 2014; 31: 872–88. 10.1093/molbev/msu037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Brown CA, Murray AW, Verstrepen KJ. Rapid expansion and functional divergence of subtelomeric gene families in yeasts. Curr Biol 2010; 20: 895–903. 10.1016/j.cub.2010.04.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Parker R. RNA degradation in Saccharomyces cerevisiae. Genetics 2012; 191: 671–702. 10.1534/genetics.111.137265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Paek AL, Jones H, Kaochar Set al. The role of replication bypass pathways in dicentric chromosome formation in budding yeast. Genetics 2010; 186: 1161–73. 10.1534/genetics.110.122663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Yadav V, Sun S, Coelho MAet al. Centromere scission drives chromosome shuffling and reproductive isolation. Proc Natl Acad Sci USA 2020; 117: 7917–28. 10.1073/pnas.1918659117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Guin K, Chen Y, Mishra Ret al. Spatial inter-centromeric interactions facilitated the emergence of evolutionary new centromeres. eLife 2020; 9: e58556. 10.7554/eLife.58556 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Sankaranarayanan SR, Ianiri G, Coelho MAet al. Loss of centromere function drives karyotype evolution in closely related malassezia species. eLife 2020; 9: e53944. 10.7554/eLife.53944 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Zhou S, Wu Y, Xie Zet al. Directed genome evolution driven by structural rearrangement techniques. Chem Soc Rev 2021; 50: 12788–807. 10.1039/D1CS00722J [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

nwad073_Supplemental_File

Data Availability Statement

Genome sequencing data have been submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA) under accession numbers PRJNA705059 and PRJNA895231. The ATAC-seq and Hi-C sequencing data have been submitted to the NCBI Gene Expression Omnibus (GEO) under accession number GSE168182. The ATAC-seq data of wild-type S. cerevisiae are available from GEO under accession number GSE66386.


Articles from National Science Review are provided here courtesy of Oxford University Press

RESOURCES