Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2016 May 24;44(14):e124. doi: 10.1093/nar/gkw475

CasHRA (Cas9-facilitated Homologous Recombination Assembly) method of constructing megabase-sized DNA

Jianting Zhou 1,2, Ronghai Wu 1,2, Xiaoli Xue 1,*, Zhongjun Qin 1,*
PMCID: PMC5001600  PMID: 27220470

Abstract

Current DNA assembly methods for preparing highly purified linear subassemblies require complex and time-consuming in vitro manipulations that hinder their ability to construct megabase-sized DNAs (e.g. synthetic genomes). We have developed a new method designated ‘CasHRA (Cas9-facilitated Homologous Recombination Assembly)’ that directly uses large circular DNAs in a one-step in vivo assembly process. The large circular DNAs are co-introduced into Saccharomyces cerevisiae by protoplast fusion, and they are cleaved by RNA-guided Cas9 nuclease to release the linear DNA segments for subsequent assembly by the endogenous homologous recombination system. The CasHRA method allows efficient assembly of multiple large DNA segments in vivo; thus, this approach should be useful in the last stage of genome construction. As a proof of concept, we combined CasHRA with an upstream assembly method (Gibson procedure of genome assembly) and successfully constructed a 1.03 Mb MGE-syn1.0 (Minimal Genome of Escherichia coli) that contained 449 essential genes and 267 important growth genes. We expect that CasHRA will be widely used in megabase-sized genome constructions.

INTRODUCTION

The assembly of small DNA fragments into large constructs, such as the genes involved in biochemical pathways, biological machinery, and even entire genomes, is one of the most fundamental requirements of synthetic biology. State-of-the-art in vitro DNA assembly methods have been developed to fulfil different purposes (1,2). Restriction endonuclease assembly methods (including BioBricks (3), BglBricks (4) and Golden gate (5)) are widely used to assemble standardized biological components, and in vitro overlap assembly methods (including InFusion (6), SLIC (7) and Gibson one-step isothermal assembly (8)) are useful for large DNA constructions. The efficient homologous recombination observed in Saccharomyces cerevisiae has greatly simplified the construction of DNA molecules up to several hundred kilobases (9,10).

The techniques required to synthesize a complete genome are important for synthetic biology. The ongoing Synthetic Yeast Genome Project (Sc2.0) has demonstrated the plasticity of the yeast genome by synthesizing chromosome arms (11). As a milestone achievement, the first heavily edited yeast chromosome, synIII (273 kb), was synthesized by 11 successive rounds of DNA assembly in vivo and used to replace the native chromosome III (12). This method avoided the direct assembly of large DNAs and allowed for step-by-step testing of the functionality of the newly synthesized regions. However, a similar strategy would not be suitable for genome synthesis in other organisms without an efficient endogenous homologous recombination system. Because the smallest free-living organisms have genomes larger than 1 Mb (13), the ability to synthesize a megabase-sized genome is of great importance. The first complete 1.08 Mb Mycoplasma mycoides genome was synthesized by three rounds of assembly in yeast following the Gibson procedure, and its functionality was demonstrated in another closely related species, M. capricolum (14). The Gibson procedure of genome assembly allows for the one-step assembly of more than 10 DNA fragments in yeast and has an efficiency of 10–100%. However, the assembly efficiency decreases drastically down to 1/48 at the final stage of genome assembly because this method requires a sufficient quantity of high-quality large subassembly DNA segments from complex in vitro manipulation procedures. All eleven 100 kb assembly intermediates were isolated from yeast, and then two steps of purification and one step of enrichment were performed to remove the host chromosomal DNA contamination (14). These difficult and time-consuming in vitro manipulations resulted in low efficiency.

In recent years, the RNA-guided nuclease Cas9 has become widely used in prokaryotic and eukaryotic genome editing (1520). Guided by designed RNAs, Cas9 can introduce double-stranded DNA breakage at the defined position, which dramatically increases the efficiency of targeted genome deletions, mutations, and insertions (15). Cas9 has also been reported to facilitate direct multiplex integrations of DNA parts assembled in vivo in yeast (21).

To avoid the complex and time-consuming in vitro manipulations of large linear DNAs during genome assembly, we have developed a new method in which circular DNAs are used directly in a one-step in vivo assembly. We designated this method CasHRA (Cas9-facilitated Homologous Recombination Assembly). The large circular DNAs were cut by target RNA-guided Cas9 in vivo to release the linear DNAs for direct assembly by overlapping sequences and homologous recombination in yeast. By eliminating the guide RNAs, the assembled circular DNAs could be directly used for the next round of DNA assembly. We tested the efficiency of CasHRA in assemblies with both small and large circular DNAs. To demonstrate the utility of CasHRA for genome assembly, we combined this method with the Gibson procedure of genome assembly as an upstream assembly method to construct a 1.03 Mb MGE-syn1.0 (Minimal Genome of E. coli).

MATERIALS AND METHODS

Strains, plasmids, media and reagents

The yeast strain used in this study was S. cerevisiae VL6-48 (MATα, trp1-Δ1, ura3-Δ1, ade2-101, his3-Δ200, lys2, met14) (22). The yeast cells were cultured in YPAD medium (1% [w/v] Bacto yeast extract, 2% [w/v] Bacto peptone, 2% [w/v] glucose, and 80 mg/L adenine hemisulfate) or synthetic complete medium (SC) (Sigma-Aldrich, St. Louis, MO, USA) lacking the nutrients required by the corresponding auxotrophic strains. The E. coli strain DH10B (F, λ, endA1, glnV44, thi-1, recA1, relA1, gyrA96, deoR, nupG, Φ80dlacZΔM15, Δ(lacZYA-argF)U169, hsdR17(rK mK+), mcrA, mcrBC, mrr, galE) was used for the routine genetic modifications.

The plasmids and primers used in this study are listed in Supplementary Tables S1 and S2. The primers were synthesized by Genescript (Nanjing, Jiangsu, China) or JIE LI Biology (Shanghai, China). Phanta Super-Fidelity DNA Polymerase (Vazyme, Nanjing, Jiangsu, China) and KOD-Plus-Neo DNA Polymerase (Toyobo, Osaka, Japan) were used for the standard PCR amplification. KOD-FX DNA Polymerase (Toyobo, Osaka, Japan) was used for the colony PCR confirmation. The Wizard SV Gel and PCR Clean-up System (Promega, Fitchburg, WI, USA) was used for DNA purification. The restriction endonucleases and DNA size markers were purchased from New England BioLabs (Ipswich, MA, USA). All of the chemicals were obtained from Sigma-Aldrich unless otherwise specified.

Construction of plasmids

Plasmids p426-SNR52p-gRNA.CAN1.Y-SUP4t and p415-GalL-Cas9-CYC1t were gifts from George Church (15) (Addgene plasmids #43803 and #43804). To construct the plasmid pMetcas9-0 for the constitutive expression of Cas9, the two fragments released by HpaI/SpeI digestion of p415-GalL-Cas9-CYC1t were gel purified. The selection marker MET14 was PCR amplified from the S. cerevisiae S288c genome using the primers Met14-F/Met14-R. The strong constitutive promoter TEF1 was amplified by PCR using the template pAG36 (23) and primers Tef1p-F/Tef1p-R. The PCR primers were designed to contain 40 bp overlapping sequences with the adjacent DNA segments. The four DNA segments described were co-introduced into yeast and assembled into pMetcas9-0. To eliminate the guide RNA expression plasmid, the guide RNA targeting the replication origin of pTrp at the S3 site was synthesized by PCR amplification using the template p426-SNR52p-gRNA.CAN1.Y-SUP4t and primers Cas-S3-F/Cas-S3-R. The galactose inducible GAL1 promoter was PCR amplified from p415-GalL-Cas9-CYC1t using GalL-F/GalL-R. The two DNA segments were assembled by fusion PCR and then cloned into pMetcas9-0 at the XbaI site to construct pMet-Cas9 (Supplementary Figure S1).

To construct the vector backbone pTrp to facilitate the cloning of constitutively expressed guide RNAs, the auxotrophic marker TRP1 was PCR amplified from the S. cerevisiae S288c genome using the primers Trp1-F/Trp1-R. The 2 μm origin, pBR322 origin and ampicillin resistance gene were PCR amplified from p426-SNR52p-gRNA.CAN1.Y-SUP4t using the primer pairs 2μ ori-F/2μ ori-R, pBR322-F/pBR322-R, and f1ori-F/f1ori-R separately. The four DNA fragments were co-introduced into yeast by transformation and then assembled into pTrp. For the first round of DNA assembly, the Target1 and Target2 DNAs, which constitutively express the guide RNAs targeting the common vector backbone of all of the circular DNAs at the S1 (5′-ATAGTGTCACCTAAATAGCT-3′) and S2 (5′-CGTAGCAACCAGGCGTTTAA-3′) sites, were synthesized (for the detailed sequences, please refer to Supplementary Figure S2). The synthesized DNA segments were released from the cloning vectors by BglII/SpeI and SpeI/NcoI digestion and ligated together with the BglII/NcoI digested vector pTrp to construct pTrp-gRNA (Supplementary Figure S2). For the second round of DNA assembly, the DNA segments that targeted the common vector backbone of all of the circular DNAs at the S4 (5′-CGGCCAACGCGAACCCTTAG-3′) and S5 (5′-TGATGAACCTGAATCGCCAG-3′) sites were synthesized and similarly cloned into pTrp to construct pTrp-gRNA2.

The assembly vector pCriv0 (Supplementary Figure S3) was PCR amplified using the primers pCriv0-VF/pCriv0-VR and the DNA template pZQ233 (constructed by our lab, unpublished) to contain the replication origin of chromosome II (24) from Vibrio cholerae N16961 (25), the chloramphenicol resistance marker (Cm), and the yeast origin CEN6 ARS4. The yeast selection marker ADE2 was amplified from the S. cerevisiae S288c genome using the primers ade2-F/ade2-R. The two DNA fragments were introduced into yeast and then assembled into pCriv0 through 40 bp overlaps at the termini.

The assembly vector pZJ231 (Supplementary Figure S4) for EMG-syn1.0 was constructed by assembling the following four DNA segments. The replication origin of V. cholerae chromosome II and its partition system as well as the yeast origin CEN6 ARS4 and the selection marker HIS3 were PCR amplified from pZQ233. The kanamycin resistance gene was PCR amplified using pET28 as a template. The 500 bp sequences that overlapped with the large DNA segments Criv7 and TP5 were PCR amplified from the E. coli MDS42 genome (26).

Co-introduction of multiple small or large circular DNAs

The circular DNAs constructed from our previous work (27) were used for the assembly process in this study (Table 1). Two or three small circular DNAs to be assembled were co-introduced into the yeast cells harbouring pMet-Cas9 by using the lithium acetate (LiAc) transformation procedure (28). Circular DNAs larger than 100 kb were co-introduced into the yeast cells by using the protoplast fusion procedure (29,30). Cells harbouring two and three circular DNAs in addition to pMet-Cas9 were selected on the synthetic drop-out media SC-Met-His-Ura (omitting methionine, histidine, and uracil) and SC-Met-His-Ura-Lys (omitting methionine, histidine, uracil, and lysine), respectively. The cells were allowed to grow for 3 days at 30°C until colonies appeared.

Table 1. Efficiency of CasHRA in the assembly of both small and large circular DNAs.

Circular DNAs joined in assembly Length of assembled DNA (kb)a Name of assembly No. of transformantsb in three independent experiments Assembly efficiencyc
Name Length of DNA (kb)a
Two small circular DNAs assembly
pAEEG5 10 25 pCriv4 27, 232, 173 87 ± 15%
pAEEG6 15
Three small circular DNAs assembly
pAEEG4 14
pAEEG5 10 39 pCriv5 77, 244, 116 60 ± 31%
pAEEG6 15
Two large circular DNAs assembly
pSP5 117 302 pCriv6 28, 74, 27 80 ± 17%
pTP3-U 185
Three large circular DNAs assembly
pTP1 177
pTP2 298 660 pCriv7 7, 10, 6 73 ± 9%
pTP3-L 185
Assembly of the 1.03 Mb MGE-syn1.0 (Minimal Genome of E. coli)
pCriv7 660
pTP4 185 1028 MGE-syn1.0 4, 7, 11 65 ± 13%
pTP5 185

aThe DNA length was calculated without the vector backbone.

bTransformants of pCriv4–pCriv7 were plated on triple-drop-out medium SC-Met-Trp-Ade (omitting methionine, tryptophan, and adenine) for the selection of the assembly vector (auxotrophic marker ADE2) in addition to the complete Cas9 system. The auxotrophic marker in the MGE-syn1.0 assembly vector was HIS3; thus, the corresponding transformants were plated on triple-drop-out medium SC-Met-Trp-His (omitting methionine, tryptophan, and histidine).

cThe assembly efficiency was evaluated by PCR positive rates from three independent experiments.

Assembly booting

The unique linear assembly vector was PCR amplified using pCriv0 as the template and primers that included 60 bp sequences that overlapped the adjacent DNA segments. Approximately 1 μg pTrp-gRNA and 1 μg linear assembly vector (except for the first assembly experiment of pCriv4, for which only half the amount of each DNA was used) were co-introduced into the yeast cells containing circular DNAs to be assembled and pMet-Cas9 by using LiAc transformation. Five percent of the transformants were selected on double-drop-out medium SC-Met-Trp (omitting methionine and tryptophan) for selection of the complete Cas9 system (pMet-Cas9 and pTrp-gRNA). The remaining transformants were plated on the triple-drop-out medium SC-Met-Trp-Ade (omitting methionine, tryptophan, and adenine) for selection of the assembly vector as well as the complete Cas9 system. The transformants were allowed to grow for 2 days at 30°C until colonies appeared.

MGE-syn1.0 assembly

The guide RNA expression plasmid pTrp-gRNA was eliminated using a previously described protocol (15). After the elimination of pTrp-gRNA, the yeast cells containing pMet-Cas9 (auxotrophic marker Met) and pCriv7 (auxotrophic marker Ade) were protoplast fused with a yeast cell derivative that harboured pTP4 (auxotrophic marker Ura) and a yeast cell derivative that harboured pTP5 (auxotrophic marker Lys), respectively. Cells containing all three large circular DNAs and pMet-Cas9 were selected on SC-Met-Ade-Ura-Lys (omitting methionine, adenine, uracil, and lysine) medium.

The linear assembly vector was generated by PCR amplification using the template pZJ231 and the primers pCriv8-VF/ pCriv8-VR. Approximately 1 μg pTrp-gRNA2 and 1 μg linear assembly vector were co-introduced into the yeast cells by LiAc transformation. Five percent of the transformants were selected on the double-drop-out medium SC-Met-Trp. The remaining transformants were plated on the triple-drop-out medium SC-Met-Trp-His (omitting methionine, tryptophan, and histidine). The transformants were allowed to grow for 2 days at 30°C until colonies appeared.

Confirmation of the correct assemblies

Correct assemblies were confirmed by colony PCR using the primers listed in Supplementary Table S2. For each assembly, PCR products from five to eight randomly selected positive colonies were sent for sequencing using the same primers used for the PCR assay. The positive assemblies were further subjected to enzymatic digestion and electrophoresis analysis. Small assemblies (pCriv4 and pCriv5) were isolated from the yeast and introduced into E. coli DH10B by electroporation. The assemblies re-isolated from E. coli were subjected to enzymatic digestion and an electrophoresis analysis. It was difficult to introduce DNAs of several hundred kilobases into E. coli; therefore, the large assemblies pCriv6, pCriv7 and MGE-syn1.0 were isolated directly from the yeast and analysed. Approximately 5 × 108 yeast cells were collected and embedded in agarose plugs using a previously described protocol (14). The linear yeast chromosomal DNAs were separated out of the agarose plugs by pulsed-field gel electrophoresis (PFGE) performed under the following conditions: 6 V/cm, 10–60 s switch time, and 14°C for 20 h. The large circular assembly trapped in the agarose plugs was subjected to restriction enzymatic digestion and then a PFGE analysis performed under the following conditions: 6 V/cm, 1–25 s switch time, and 14°C for 20 h.

RESULTS

Design of the CasHRA method

CasHRA was developed to facilitate the direct use of large circular DNAs for one-step assemblies in vivo and avoid the substantial manipulation of linear DNA segments in vitro. As illustrated in Figure 1, three large circular DNAs were co-introduced into yeast cells harbouring the Cas9 expression plasmid pMet-Cas9 via protoplast fusion (step 1). The assembly booting was achieved by co-introducing pTrp-gRNA and the linear assembly vector into the cells by transformation (step 2). The plasmid pTrp-gRNA constitutively expressed guide RNAs that targeted the common vector backbone of all of the large circular DNAs at the S1 and S2 sites. Once pTrp-gRNA entered the cell, a functional Cas9 system rapidly developed and cleaved the large circular DNAs to release the linear DNA segments with exposed overlaps at the termini. Taking advantage of the highly efficient homologous recombination system of S. cerevisiae, these large DNA segments were then properly assembled together with the linear assembly vector. The plasmid pMet-Cas9 contained a galactose-inducible guide RNA that targets the replication origin of pTrp-gRNA at the S3 site. Transferring the yeast cells into the galactose medium efficiently eliminated pTrp-gRNA (step 3) and allowed the yeast cells containing the assembled circular DNA to be directly used for the next round of DNA assembly. The entire CasHRA process requires 9 days, including 5 days to introduce all of the circular DNAs into the yeast cells, 3 days to obtain the yeast cells with the assembled product, and 1 day to eliminate pTrp-gRNA.

Figure 1.

Figure 1.

Schematic diagram of the CasHRA method. Large circular DNAs were co-introduced into individual yeast cells harbouring the Cas9 expression plasmid pMet-Cas9 by protoplast fusion (step 1). Then, the guide RNA expression plasmid pTrp-gRNA and the linear assembly vector were co-introduced into the cells by transformation. The RNA-guided Cas9 cut the vector backbone of all of the large circular DNAs at the S1 and S2 sites. The released linear DNA segments were joined together with the linear vector through overlaps by taking advantage of the efficient homologous recombination system in yeast. The plasmid pMet-Cas9 contained a galactose-inducible guide RNA targeting the replication origin of pTrp-gRNA at the S3 site.

Efficiency of CasHRA for circular DNA assembly

We validated the efficiency of CasHRA for the assembly of both small and large circular DNAs as listed in Table 1. These circular DNAs contained 10–298 kb of essential and important growth genes for E. coli and were constructed in our previous work (27). The overlaps in the adjacent DNA segments were designed to be approximately 500 bp. Shorter overlap sequences could also be used; however, longer overlaps tended to increase the assembly efficiency (10). The common vector backbone of these circular DNAs contained CEN6 and ARS4 for replication and single-copy maintenance in yeast and were used to avoid any discrepant assembly in the yeast (10). Additional yeast replication origins were included in every ∼100 kb of E. coli DNA to improve the stability of these large DNA segments in yeast (31). To increase the stability of the large foreign DNAs in the yeast, different auxotrophic selection markers were used to ensure the co-occurrence of the subassemblies in the yeast cells.

To properly validate the success rate, all of the assembly reactions were repeated three times. As shown in Table 1, the success rates of pCriv4 and pCriv5, which were assembled from two and three small circular DNAs, were 87 ± 15% and 60 ± 31%, respectively. Similar efficiencies were obtained for the assembly of circular DNAs larger than 100 kb. The success rates of pCriv6 and pCriv7, which were assembled from two and three large circular DNAs, were 80 ± 17% and 73 ± 9%, respectively. Sequencing 82 overlaps from 23 assembly DNA molecules revealed 10 errors, all of which occurred in the 60 bp overlaps between the vector and the leftmost or rightmost subassemblies. The 60 bp overlaps were introduced by the 3′ ends of the primers during PCR amplification of the assembly vector, and errors could have resulted from impure primers. Detailed information on the enzymatic analysis and sequencing of the assemblies of pCriv4∼7 is presented in Supplementary Figures S5–S8 and Supplementary Table S3.

Assembly of a 1.03 Mb MGE-syn1.0 genome via CasHRA

To gain a deeper understanding of the basic processes required to create synthetic life, we designed a 1.03 Mb MGE-syn1.0 (Minimal Genome of E. coli) that contained 449 essential genes from published experimental and computational studies (27,3236) and 267 important growth genes, including 151 genes that affect growth (34) and 76 genes that have not been assigned to central metabolism pathways and regulation networks. These genes were PCR amplified from the E. coli MDS42 genome (26), sequenced, and then assembled into five large circular DNAs pTP1–pTP5 (Figure 2) by three rounds of assembly following the Gibson procedure (14). The Gibson procedure of genome assembly method can assemble >10 DNA segments at one time with high efficiency; thus, it is a useful method for the early stages of genome assembly. Our previous attempts to sequentially assemble pTP1–pTP3 (177–298 kb) into pCriv7 (660 kb) using the Gibson procedure of genome assembly failed. However, the application of CasHRA successfully assembled pCriv7, which contained all 449 E. coli genes. Because of the simplicity and efficiency of CasHRA, we applied this method to assemble the 1.03 Mb MGE-syn1.0. To directly use yeast cells containing pCriv7 for the next round of DNA assembly, the guide RNA expression vector pTrp-gRNA was eliminated by simply transferring the cells into galactose medium, and the elimination frequency was usually >90% in one round of induction. The 1.03 Mb MGE-syn1.0 was assembled from pCriv7 (660 kb), pTP4 (185 kb) and pTP5 (185 kb) (Figure 2). The assembly vector for MGE-syn1.0, pZJ231, was constructed to contain approximately 500 bp overlaps with the left terminus of Criv7 and the right terminus of TP5. The success rate of the MGE-syn1.0 assembly was 65 ± 13% (Table 1), and the sequences of all of the overlaps were 100% correct. The circular MGE-syn1.0 DNA was separated from the yeast linear chromosomes and digested with SpeI and XbaI separately. The digestion patterns were consistent with the theoretical calculation, and DNA bands larger than 50 kb were clearly shown in the pulsed-field gel electrophoresis (PFGE) image (Figure 3), although smaller DNA bands were not separated from the short fragments of the yeast genome under the experimental conditions.

Figure 2.

Figure 2.

Schematic diagram of the MGE-syn1.0 (Minimal Genome of E. coli) assembly. Four hundred forty nine essential genes and 267 important growth genes were PCR amplified from the E. coli MDS42 genome, sequenced and assembled into five large, circular DNAs pTP1–pTP5 (177–298 kb via three rounds of assembly following the Gibson procedure. The TP1–TP5 were assembled into the 1.03 Mb MGE-syn1.0 via two rounds of CasHRA. Please note that the DNA length was calculated without the vector backbone.

Figure 3.

Figure 3.

Analysis of the assembled MGE-syn1.0. (A) Map of MGE-syn1.0, assembled from three large DNA segments, Criv7 (660 kb, marked in red), TP4 (298 kb, marked in yellow), and TP5 (184 kb, marked in green). The total size of MGE-syn1.0 (1.03 Mb) and the assembly vector (10 kb) was 1.04 Mb. The SpeI cutting sites are marked by greenish brown lines in the inner circle. The XbaI cutting sites are marked by black lines on the outer circle. To separate out the linear yeast chromosomal DNA, agarose plugs were subjected to PFGE under the following conditions: 6 V/cm, 10–60 s switch time, and 14°C for 20 h. The circular MGE-syn1.0 that was trapped inside the plug was digested with SpeI and XbaI, separately, followed by another round of PFGE under the following conditions: 6 V/cm, 1–25 s switch time, and 14°C for 16 h. For the control, the assembly host VL6-48 was subjected to the same treatment. (B) The SpeI digestion of MGE-syn1.0 released ten DNA fragments. The five larger DNA fragments with sizes of 262, 200, 182, 157 and 98 kb are indicated by red arrows. Another five smaller bands (39, 34, 27, 19 and 18 kb) could not be clearly separated from the short fragments of yeast chromosomes under the experimental conditions. (C) The XbaI digestion of MGE-syn1.0 released nine DNA fragments. The DNA fragments with sizes of 248, 205, 173, 143, 117, 64 and 56 kb are indicated by red arrows. The smaller bands (20 and 11 kb) could not be clearly separated from the yeast chromosomal fragments under the experimental conditions.

We have attempted to transplant MGE-syn1.0 from yeast to E. coli by two different methods to test its functionality. First, we attempted to directly isolate and purify MGE-syn1.0 from yeast and introduce it into E. coli DH10B by electroporation. Our pilot study showed that pTP1 (the total size of TP1 and the vector backbone was 187 kb) could be successfully introduced into E. coli and stably maintained. However, the introduction of pTP2 (the total size of TP2 and the vector backbone was 308 kb) into E. coli failed. The size of TP2 (308 kb) exceeded the maximum size reported for E. coli electroporation (37), which might have caused the failure of the TP2 introduction. Second, we attempted to use protoplast fusion to transplant MGE-syn1.0 from yeast to E. coli. We prepared the protoplasts of E. coli (38) and yeast (29) according to reported protocols and mixed them in different ratios. However, successfully fused E. coli cells were not observed in our pilot study. A new method of transplanting giant DNA (>1 Mb) from yeast to E. coli awaits further development.

DISCUSSION

Conventional DNA assembly methods produce purified linear subassembly DNAs by complex manipulations in vitro, which can be burdensome in the last stage of megabase-sized DNA (e.g., synthetic genome) assembly. CasHRA was developed to directly use large circular DNAs (>100 kb) in a one-step DNA assembly process in vivo that avoids the difficult manipulations in vitro. All of the critical steps of CasHRA have been developed in vivo, including the co-introduction of large circular DNAs into individual cells by protoplast fusion, the release of large linear DNA segments by RNA-guided Cas9, and the subsequent DNA assembly using the yeast homologous recombination system. The time (9 days) and effort required to assemble DNAs with sizes from hundreds of kilobases (pCriv6 and pCriv7) to one megabase (MGE-syn1.0) were similar. The positive rates of the large assemblies (60–80%) were high enough to allow for easy detection of the correct constructs. Moreover, the efficient elimination of the guide RNA expression vector allowed for the direct use of the yeast cells containing the correct assembly in the next round of DNA assembly, which is convenient for large DNA constructions. Because the activities of CasHRA in vivo do not have strict limitations on the size of the subassemblies, we speculate that this method could be used to assemble DNA constructs larger than one megabase.

In our experiments, the subassemblies or the fully assembled molecules could be stably maintained in yeast by selection with different auxotrophic markers. Six auxotrophic markers are available for the S. cerevisiae VL6-48 strain. In this study, three auxotrophic markers were replaced by the Cas9 expression plasmid, the gRNA expression plasmid, and the linear assembly vector. The three remaining auxotrophic markers were used for the subassemblies. However, when combined with the commonly used selection markers (e.g. G418, bleomycin and hygromycin), additional subassemblies could be stably maintained in the individual yeast cells. Because of the high performance of Cas9 and the yeast homologous recombination, additional subassemblies could be simultaneously assembled by CasHRA.

To ensure the efficiency of CasHRA, the sequence and length of the assembly overlaps must be considered. In this study, the overlap sequences originated from different regions of the E. coli genome (GC content, 51%) to reduce the possible effect of sequence variation in the homology overlaps on the assembly efficiency. Similar efficiencies were obtained for five assembled DNAs with different overlaps, indicating the limited effect of overlap sequence variations on the assembly efficiency in this study. It is important to realize that introducing short overlaps (e.g. in the overlaps between the vector and subassemblies) at the 3′ ends of PCR primers would result in errors in the overlap sequence because of the impurity of the primers. Therefore, pre-cloning the overlaps into the assembly vector will be required if the sequence accuracy of the overlaps are important for the functionality of the assembled DNA. When the assembly efficiencies for both small and large circular DNAs are similar, fewer transformants are obtained from the assembly of large circular DNAs. This result may be caused by a decreased probability of large DNAs finding their counterparts with approximately 500 bp overlaps for homologous recombination. Whether longer overlaps result in more transformants from large DNA assemblies awaits further exploration.

In summary, CasHRA is a promising method for performing the in vivo assembly of multiple large DNA segments, and it could be useful in the construction of megabase-sized genomes.

Supplementary Material

SUPPLEMENTARY DATA

Acknowledgments

We are grateful to Prof. Biao Kan from the Chinese Center for Disease Control and Prevention for kindly providing the genomic DNA of Vibrio cholerae N16961.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Key Basic Research Program of China (973 Program) [2011CBA00801, 2012CB721102]; National Natural Science Foundation of China [31421061]. Funding for open access charge: National Key Basic Research Program of China (973 Program) [2011CBA00801, 2012CB721102]; National Natural Science Foundation of China [31421061].

Conflict of interest statement. None declared.

REFERENCES

  • 1.Ellis T., Adie T., Baldwin G.S. DNA assembly for synthetic biology: from parts to pathways and beyond. Integr. Biol. 2011;3:109–118. doi: 10.1039/c0ib00070a. [DOI] [PubMed] [Google Scholar]
  • 2.Liu W., Yuan J.S., Stewart C.N., Jr Advanced genetic tools for plant biotechnology. Nat. Rev. Genet. 2013;14:781–793. doi: 10.1038/nrg3583. [DOI] [PubMed] [Google Scholar]
  • 3.Shetty R.P., Endy D., Knight T.F., Jr Engineering BioBrick vectors from BioBrick parts. J. Biol. Eng. 2008;2:5. doi: 10.1186/1754-1611-2-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Anderson J.C., Dueber J.E., Leguia M., Wu G.C., Goler J.A., Arkin A.P., Keasling J.D. BglBricks: a flexible standard for biological part assembly. J. Biol. Eng. 2010;4:1. doi: 10.1186/1754-1611-4-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Engler C., Kandzia R., Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLoS One. 2008;3:e3647. doi: 10.1371/journal.pone.0003647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sleight S.C., Bartley B.A., Lieviant J.A., Sauro H.M. In-Fusion BioBrick assembly and re-engineering. Nucleic Acids Res. 2010;38:2624–2636. doi: 10.1093/nar/gkq179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Li M.Z., Elledge S.J. Harnessing homologous recombination in vitro to generate recombinant DNA via SLIC. Nat. Methods. 2007;4:251–256. doi: 10.1038/nmeth1010. [DOI] [PubMed] [Google Scholar]
  • 8.Gibson D.G., Young L., Chuang R.Y., Venter J.C., Hutchison C.A., 3rd, Smith H.O. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods. 2009;6:343–345. doi: 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  • 9.Gibson D.G., Benders G.A., Axelrod K.C., Zaveri J., Algire M.A., Moodie M., Montague M.G., Venter J.C., Smith H.O., Hutchison C.A., 3rd One-step assembly in yeast of 25 overlapping DNA fragments to form a complete synthetic Mycoplasma genitalium genome. Proc. Natl. Acad. Sci. U.S.A. 2008;105:20404–20409. doi: 10.1073/pnas.0811011106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Shao Z., Zhao H., Zhao H. DNA assembler, an in vivo genetic method for rapid construction of biochemical pathways. Nucleic Acids Res. 2009;37:e16. doi: 10.1093/nar/gkn991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dymond J.S., Richardson S.M., Coombes C.E., Babatz T., Muller H., Annaluru N., Blake W.J., Schwerzmann J.W., Dai J., Lindstrom D.L., et al. Synthetic chromosome arms function in yeast and generate phenotypic diversity by design. Nature. 2011;477:471–476. doi: 10.1038/nature10403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Annaluru N., Muller H., Mitchell L.A., Ramalingam S., Stracquadanio G., Richardson S.M., Dymond J.S., Kuang Z., Scheifele L.Z., Cooper E.M., et al. Total synthesis of a functional designer eukaryotic chromosome. Science. 2014;344:55–58. doi: 10.1126/science.1249252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Moya A., Gil R., Latorre A., Pereto J., Pilar Garcillan-Barcia M., de la Cruz F. Toward minimal bacterial cells: evolution vs. design. FEMS microbiology reviews. 2009;33:225–235. doi: 10.1111/j.1574-6976.2008.00151.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gibson D.G., Glass J.I., Lartigue C., Noskov V.N., Chuang R.Y., Algire M.A., Benders G.A., Montague M.G., Ma L., Moodie M.M., et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science. 2010;329:52–56. doi: 10.1126/science.1190719. [DOI] [PubMed] [Google Scholar]
  • 15.DiCarlo J.E., Norville J.E., Mali P., Rios X., Aach J., Church G.M. Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems. Nucleic Acids Res. 2013;41:4336–4343. doi: 10.1093/nar/gkt135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jiang W., Bikard D., Cox D., Zhang F., Marraffini L.A. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat. Biotechnol. 2013;31:233–239. doi: 10.1038/nbt.2508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Mali P., Yang L., Esvelt K.M., Aach J., Guell M., DiCarlo J.E., Norville J.E., Church G.M. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–826. doi: 10.1126/science.1232033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Qi L.S., Larson M.H., Gilbert L.A., Doudna J.A., Weissman J.S., Arkin A.P., Lim W.A. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell. 2013;152:1173–1183. doi: 10.1016/j.cell.2013.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ran F.A., Hsu P.D., Lin C.Y., Gootenberg J.S., Konermann S., Trevino A.E., Scott D.A., Inoue A., Matoba S., Zhang Y., et al. Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity. Cell. 2013;154:1380–1389. doi: 10.1016/j.cell.2013.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang H., Yang H., Shivalila C.S., Dawlaty M.M., Cheng A.W., Zhang F., Jaenisch R. One-step generation of mice carrying mutations in multiple genes by CRISPR/Cas-mediated genome engineering. Cell. 2013;153:910–918. doi: 10.1016/j.cell.2013.04.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Jakociunas T., Rajkumar A.S., Zhang J., Arsovska D., Rodriguez A., Jendresen C.B., Skjodt M.L., Nielsen A.T., Borodina I., Jensen M.K., et al. CasEMBLR: Cas9-facilitated multiloci genomic integration of in vivo assembled DNA parts in Saccharomyces cerevisiae. ACS Synth. Biol. 2015;4:1226–1234. doi: 10.1021/acssynbio.5b00007. [DOI] [PubMed] [Google Scholar]
  • 22.Larionov V., Kouprina N., Solomon G., Barrett J.C., Resnick M.A. Direct isolation of human BRCA2 gene by transformation-associated recombination in yeast. Proc. Natl. Acad. Sci. U.S.A. 1997;94:7384–7387. doi: 10.1073/pnas.94.14.7384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Goldstein A.L., McCusker J.H. Three new dominant drug resistance cassettes for gene disruption in Saccharomyces cerevisiae. Yeast. 1999;15:1541–1553. doi: 10.1002/(SICI)1097-0061(199910)15:14<1541::AID-YEA476>3.0.CO;2-K. [DOI] [PubMed] [Google Scholar]
  • 24.Egan E.S., Waldor M.K. Distinct replication requirements for the two Vibrio cholerae chromosomes. Cell. 2003;114:521–530. doi: 10.1016/s0092-8674(03)00611-1. [DOI] [PubMed] [Google Scholar]
  • 25.Heidelberg J.F., Eisen J.A., Nelson W.C., Clayton R.A., Gwinn M.L., Dodson R.J., Haft D.H., Hickey E.K., Peterson J.D., Umayam L., et al. DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature. 2000;406:477–483. doi: 10.1038/35020000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Posfai G., Plunkett G., 3rd, Feher T., Frisch D., Keil G.M., Umenhoffer K., Kolisnychenko V., Stahl B., Sharma S.S., de Arruda M., et al. Emergent properties of reduced-genome Escherichia coli. Science. 2006;312:1044–1046. doi: 10.1126/science.1126439. [DOI] [PubMed] [Google Scholar]
  • 27.Xue X., Wang T., Jiang P., Shao Y., Zhou M., Zhong L., Wu R., Zhou J., Xia H., Zhao G., et al. MEGA (Multiple Essential Genes Assembling) deletion and replacement method for genome reduction in Escherichia coli. ACS Synth. Biol. 2015;4:700–706. doi: 10.1021/sb500324p. [DOI] [PubMed] [Google Scholar]
  • 28.Gietz R.D., Schiestl R.H. High-efficiency yeast transformation using the LiAc/SS carrier DNA/PEG method. Nat. Protoc. 2007;2:31–34. doi: 10.1038/nprot.2007.13. [DOI] [PubMed] [Google Scholar]
  • 29.Curran B.P., Bugeja V.C. Protoplast fusion in Saccharomyces cerevisiae. Methods Mol. Biol. 1996;53:45–49. doi: 10.1385/0-89603-319-8:45. [DOI] [PubMed] [Google Scholar]
  • 30.Kouprina N., Larionov V. Selective isolation of genomic loci from complex genomes by transformation-associated recombination cloning in the yeast Saccharomyces cerevisiae. Nat. Protoc. 2008;3:371–377. doi: 10.1038/nprot.2008.5. [DOI] [PubMed] [Google Scholar]
  • 31.Noskov V.N., Karas B.J., Young L., Chuang R.Y., Gibson D.G., Lin Y.C., Stam J., Yonemoto I.T., Suzuki Y., Andrews-Pfannkoch C., et al. Assembly of large, high G+C bacterial DNA fragments in yeast. ACS Synth. Biol. 2012;1:267–273. doi: 10.1021/sb3000194. [DOI] [PubMed] [Google Scholar]
  • 32.Gil R., Silva F.J., Pereto J., Moya A. Determination of the core of a minimal bacterial gene set. Microbiol. Mol. Biol. Rev.: MMBR. 2004;68:518–537. doi: 10.1128/MMBR.68.3.518-537.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hashimoto M., Ichimura T., Mizoguchi H., Tanaka K., Fujimitsu K., Keyamura K., Ote T., Yamakawa T., Yamazaki Y., Mori H., et al. Cell size and nucleoid organization of engineered Escherichia coli cells with a reduced genome. Mol. Microbiol. 2005;55:137–149. doi: 10.1111/j.1365-2958.2004.04386.x. [DOI] [PubMed] [Google Scholar]
  • 34.Baba T., Ara T., Hasegawa M., Takai Y., Okumura Y., Baba M., Datsenko K.A., Tomita M., Wanner B.L., Mori H. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2006;2 doi: 10.1038/msb4100050. 2006.0008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Forster A.C., Church G.M. Towards synthesis of a minimal cell. Mol. Syst. Biol. 2006;2:45. doi: 10.1038/msb4100090. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Keseler I.M., Mackie A., Peralta-Gil M., Santos-Zavaleta A., Gama-Castro S., Bonavides-Martinez C., Fulcher C., Huerta A.M., Kothari A., Krummenacker M., et al. EcoCyc: fusing model organism databases with systems biology. Nucleic Acids Res. 2013;41:D605–D612. doi: 10.1093/nar/gks1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sheng Y., Mancino V., Birren B. Transformation of Escherichia coli with large DNA molecules by electroporation. Nucleic Acids Res. 1995;23:1990–1996. doi: 10.1093/nar/23.11.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Weiss R.L. Protoplast formation in Escherichia coli. J. Bacteriol. 1976;128:668–670. doi: 10.1128/jb.128.2.668-670.1976. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SUPPLEMENTARY DATA

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES