Skip to main content
microPublication Biology logoLink to microPublication Biology
. 2022 Oct 26;2022:10.17912/micropub.biology.000657. doi: 10.17912/micropub.biology.000657

A new combinatorial megaplasmid library assembly method designed to screen for minimal pathways by using SCRaMbLE

Wei Sheng Yap 1,2, Guillaume Thibault 1,3,4,§
Reviewed by: Anonymous
PMCID: PMC9647409  PMID: 36389121

Abstract

Human proteins expressed in yeast are common to enhance protein production while the expression of functional human pathways remain challenging. Here, we propose a simple and economical high-throughput gene assembly method to create a yeast megaplasmid library from human cDNA to screen for minimal human functional pathways. We introduced artificial promoters followed by symmetric loxP sites into the megaplasmids using Golden Gate assembly coupled with streptavidin-bead-based purification. The isolated high molecular weight, randomly assembled cDNA megaplasmid library may be useful for high-throughput directed evolution experiments and may be adapted for use in other model organisms.


Figure 1. A new combinatorial megaplasmid human cDNA library assembly method.


Figure 1.
A new combinatorial megaplasmid human cDNA library assembly method.

A , The potential use of the combinatorial megaplasmid library for directed evolution experiments using SCRaMbLE. B , An overview of the construction of the combinatorial megaplasmid library. C , Generation of synthetic minimal promoter-regulated cDNA expression library flanked by halves of loxPsym site and PaqCI restriction sites. TS RT, template-switching reverse transcription; TSO, template-switching oligos; dT, deoxythymidine; TSS, transcription start site. D , Validation of the synthetic minimal promoter (56 bp) activity in yeast. The GFP transcriptional fusion reporter of the synthetic minimal promoter (P) was quantified by flow cytometry. Fluorescence signals were normalized to autofluorescence (AF). E , Electrophoretic analysis of the cDNA PCR products. Partial cDNA synthesis is observed in first lane due to dNTP exhaustion. High-quality full-length cDNA was generated by reducing PCR cycles to 20 and followed by size fractionation (last lane). Size fractionated (closed circles) or non-fractionated (opened circles) samples are indicated. F , Proof of concept of concatemerization using Golden Gate assembly with PaqCI. GG, Golden Gate assembly. The presence (closed circles) or absence (opened circles) of the enzymes required for GG are indicated. G, Assembly of the biotinylated vector and the combinatorial cDNA library. Small aliquot of respective fractions was used for visualization on 0.6% agarose gel. Assembled megaplasmids were immobilized using streptavidin magnetic beads, followed by elution and SgrDI digestion. UB, unbound; W, wash; B, bound fractions. H , The assembled megaplasmids were self-circularized, electroeluted, and then transformed into E. coli by electroporation. The presence of the ligated inserts was verified by gel electrophoresis of colony PCR products (left panel) and/or extracted plasmids (right panel).

Description

The laboratory model organism S. cerevisiae strain S288C has been used since the early nineteenth century and has become a powerful genetic tool ( Mortimer and Johnston, 1986 ). Despite being highly divergent, human and yeast share many conserved genes and pathways. Thus, it is unsurprising that many scientific breakthroughs are directly attributed to S . cerevisiae , including Nobel Prizes related to cell cycle regulators, telomeres, and autophagy. Subsequently, these discoveries were expanded in human and other higher eukaryotes. However, the complexity of the human genetic architecture remains a roadblock to new discoveries, including parallel pathways and a large proportion of unknown gene functions. Humanized yeast, a concept whereby human genes are expressed in yeast for further characterizations, has been widely practised in different degrees to study the complexity of the human system ( Kachroo et al., 2015 , Laurent et al., 2020 , Laurent et al., 2016 ). Many studies have expressed human orthologs to assess gene complementarity. Recently, with the advent of synthetic biology tools and pathway engineering, the entire human glycosylation pathway was engineered in yeast, signifying the modifiability and adaptability of yeast in modelling higher-order eukaryotes ( Hamilton et al., 2003 , Hamilton et al., 2006 ). However, this is possible only when a pathway is relatively well characterized ( Laurent et al., 2016 ). To this end, we propose a method combining bottom-up and top-down approaches to screen for the minimal human genes required to form a functional pathway in yeast.

We developed a method inspired by the recently ongoing synthetic yeast genome Sc2.0 project. The Sc2.0 project started with a bottom-up approach to build a synthetic chromosome containing only the nonredundant genome. In addition, a symmetric loxP site (loxPsym) was incorporated into the non-essential genomic regions between each coding sequence. This loxPsym feature is designed for the downstream directed evolution experiment via synthetic chromosome rearrangement and modification by loxPsym-mediated evolution (SCRaMbLE). SCRaMbLE is a powerful tool in the Sc2.0 project for many applications such as the creation of minimal cells and strains with desired phenotypes. Many studies that utilized SCRaMbLE have had high impact especially in the biotechnology field such as the generation of alkali and ethanol tolerance strains ( Ma et al., 2019 , Luo et al., 2018 ). SCRaMbLE was also used to expedite the identification of synthetic lethal interactions ( Wang et al., 2020 ). While the Sc2.0 project is indeed impressive, it is very costly due to the excessive reliance on gene synthesis and the tedious stepwise gene integration in yeast. Instead of building a whole chromosome out of synthesized gene fragments, we propose to apply SCRaMbLE to a yeast expression megaplasmid of randomly assembled human cDNA to identify the members required for a minimal pathway of desired phenotypes. This system is heterologous and therefore prevents the compensatory mechanisms that otherwise would occur in a same-species expression system. The megaplasmid can be maintained separately from the yeast genome and therefore eliminating the need for in vivo gene editing. We expect that upon completion of the megaplasmid library, it can be characterized genetically and phenotypically, and to be subjected to directed evolution to isolate strains with desired phenotypes (Fig. 1A).

We developed and optimized a blueprint of the megaplasmid library in which the vector backbone is a yeast centromeric shuttle vector and the inserts consist of the combinatorial human cDNA library (Fig. 1B). To construct the megaplasmid library, we first performed the template-switching reverse transcription for directional first strand cDNA synthesis which allowed the incorporation of the synthetic minimal promoter near the 5’ UTR of the human cDNA (Fig. 1C). The minimal promoter sequence was modified from a previously reported sequence ( Redden and Alper, 2015 ). We validated the promoter activity through the detection of a GFP reporter (Fig. 1D). To generate a representative and full-length double-stranded cDNA library, we amplified the first strand cDNA using low number of PCR cycles to avoid dNTP exhaustion. Next, we removed short cDNA fragments and other impurities by separating the PCR products on 1% agarose gel (Fig. 1E). To assemble the cDNA fragments into megachunks, we used Golden Gate assembly with PaqCI restriction enzyme which minimizes domestication issues seen in other commonly used enzymes that recognize hexanucleotide sequences. We first tested the assembly method using PCR-amplified fragments of a specific size, a 300 bp CYC1 terminator sequence. We amplified the fragments using two different primer pairs in separate reactions so that the sticky ends produced by PaqCI digestion during Golden Gate assembly were four random nucleotides (NNNN) or TCGA overhangs, respectively. The rationale was to assess if the heterogeneity of sticky ends prevents self-circularization and promotes greater concatemerization. However, the same degree of concatemer formation was observed in both reactions, suggesting that the concatemerization ability was insignificantly influenced by the overhang mismatch (Fig. 1F, left panel). Given this insight, we proceeded with Golden Gate assembly of the cDNA library fragments using the “TCGA” overhang design. This approach should also generate the symmetric spacer sequence of the loxPsym site upon restriction digestion and ligation. As expected, high molecular weight concatemers of the cDNA library fragments were observed after Golden Gate assembly (Fig. 1F, right panel). Next, we prepared the vector backbone by linearizing the centromeric plasmid pGT731 by PCR where one of the primers was 5’-end-biotinylated for subsequent immobilization steps (Fig. 1B). The non-biotinylated end of the backbone vector was cleaved by the restriction enzyme XhoI to produce a sticky end compatible to the “TCGA” overhang produced by the PaqCI digestion. Next, we performed Golden Gate assembly of the vector and the loxPsym-engineered cDNA library, followed by pulldown of the assembled megaplasmid using streptavidin magnetic beads. A distinctive band of the immobilized vector at ~5 kbp was observed in the bound fraction of the negative control (without PaqCI and T4 Ligase), whereas the band corresponding to the vector in the Golden Gate assembly reaction smeared and shifted upwards, indicating successful ligation of the heterogenous cDNA library fragments with the vector (Fig. 1G). The assembled and immobilized DNA products were digested with SgrDI to cut near the biotinylated ends and to generate compatible sticky ends. The released DNA products were self-circularized by T4 ligase and separated on gel electrophoresis for extraction of high molecular weight megaplasmids by electroelution. We transformed the megaplasmids into E. coli for propagation. Probably owing to the cryptic sites and the repetitive sequences within the cDNA library, the colonies harbouring the megaplasmids were relatively smaller in size. Using a pair of vector-specific primers, we amplified the cDNA library inserts by colony PCR to validate the ligation product. The gel electrophoresis revealed the presence (colony 2, 3 and 4) or absence (colony 1) of ligated inserts (Fig. 1H). Intriguingly, colony 5 showed neither the empty vector band nor an upshifted band indicating the presence of an insert. We hypothesized that it could be due to the presence of a huge chunk of insert which is beyond the amplification capability of a conventional PCR. Therefore, we extracted the plasmids from colony 1 and 5 to compare their sizes using gel electrophoresis. Indeed, we noticed the presence of a megaplasmid in colony 5. It is noteworthy that the megaplasmid is in circular and possibly supercoiled form, suggesting that the megaplasmid may be well over 50 kbp in size.

Historically, DNA shuffling techniques were employed for the evolution of single known protein coding sequences, de novo chimeric protein construction, and peptide aptamer screening ( Stemmer, 1994b , Stemmer, 1994a , Fujishima et al., 2015 ). Here, we applied the DNA shuffling technique to express a pool of human proteins in yeast for screening of minimal pathways. Our work demonstrates a streamlined method to create a randomized combinatorial megaplasmid library. In synthetic biology, creating a synthetic chromosome is technically challenging. The Sc2.0 project created the synthetic chromosomes by replicating the essential sequences from the yeast genome. In our system, replicating the human genome to be used directly in yeast is not feasible due to the incompatibility of the transcriptional machinery. Instead, we designed a synthetic minimal promoter for heterologous expression to minimize genome instability as a result of repetitive sequences. A better approach is to diversify the short promoter sequences, particularly the spacer regions, although it may incur a higher cost from gene synthesis ( Kotopka and Smolke, 2020 ). The heterologous expression of human proteins in yeast may be toxic especially in high concentration. Therefore, the addition of a controllable cis-regulatory transcriptional silencer may be advantageous.

The assembly of a megaplasmid is extremely challenging considering the scale of the molecular construction. Golden Gate assembly was chosen for seamless ligation among the cDNA fragments and between the ligated cDNA fragments and the destination vector. The linearized vector was biotinylated at one end and restriction digested on the other to prevent self-circularization and promote concatenation. Moreover, the biotin conjugated to the vector allowed bead-based purification which removed most of the unligated cDNA fragments that interfere with the electrophoretic analysis, facilitating the extraction of the high molecular weight megaplasmid during the next step. In conclusion, this bottom-up assembly method presented a cheap alternative for the construction of combinatorial library of high complexity. Moreover, the complexity of the library can be easily manipulated, for instance, by transforming S. cerevisiae with multiple megaplasmids of different markers. This method can possibly be modified and adapted for use in other model organisms.

Methods

Strains. S. cerevisiae wild-type (WT) strain background BY4741 ( his3Δ1 leu2Δ0 met15Δ0 ura3Δ0 ) was used in this study. In Figure 1D, WT strain was transformed with empty plasmid pGT562 or GFP reporter plasmid pGT784, respectively, using standard lithium acetate transformation protocols.

Plasmids used in this study. Plasmids used in this study are listed in Table 1. Plasmids were constructed either by restriction or Gibson cloning. The pGT731 plasmid was cloned by inserting an additional SgrDI restriction site into a SacI/XbaI double digested pRS416 vector. The GFP reporter plasmid, pGT784, was cloned by the assembly of the pGT570 backbone and the synthetic minimal promoter insert. The backbone was amplified from pGT570 using WS170 primer pair to remove the promoter, whereas the synthetic minimal promoter insert was amplified from the cDNA library using TATA-TSS_F and TSS_R primer pair.

Flow cytometry. Yeast cells expressing either the synthetic-minimal-promoter-driven GFP or the empty vector control were grown to mid-log phase at 30ºC. The fluorescence intensity was measured using the LSRFortessa X-20 (BD) flow cytometer by exciting at 488 nm and collecting through a 505 nm longpass filter and a 530/30 bandpass filter. A median readout from 10,000 cells was acquired via the software FACSDiVA v 8.0. Data was analysed with the software FlowJo 10.8.0. Reported GFP fluorescence levels were normalized to autofluorescence from the empty vector control.

Human cDNA yeast expression library preparation. Total human mRNA was extracted from RPE-1 cells using NEB Magnetic mRNA Isolation Kit (NEB S1550S). Extracted mRNA was used for cDNA synthesis by template-switching reverse transcription (NEB M0466). The oligo (dT)18 primer, WS220, used during the reverse transcription contains an adaptor region of half a loxPsym site and a PaqCI restriction site whereas the template-switching primer, TSO contains the full sequence of the synthetic minimal promoter (Fig. 1C). The cDNA products were treated with Exonuclease I to remove residual primers, and then amplified using WS219 and WS220 primers which added half of a loxPsym site and a PaqCI restriction site to the 5’ end of the products. The amplified cDNA library was column purified (Biobasic BS664) and ran through the size exclusion spin-column (Takara Bio Inc. 636079) for size fractionation.

Megaplasmid assembly and purification. To prepare the vector backbone, pGT731 was amplified with WS165 primer pair and column purified. The purified backbone was then incubated with XhoI, DpnI, and CIP for restriction digestion and dephosphorylation. The digested backbone was column-purified again. The backbone was combined with the cDNA library at a 1:5 mass ratio for Golden Gate assembly with PaqCI restriction enzyme (NEB R0745). The Golden Gate assembly reaction was set up according to the manufacturer’s protocol. After the assembly, the linear DNA plasmid was captured by the streptavidin-conjugated magnetic beads (Invitrogen 11206D). The beads were reconstituted with the binding and washing (BW) buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl) prior to DNA coupling. The DNA-bound beads were washed twice with BW buffer before reconstituting in buffer R (Thermo Scientific BR5). After equilibrating in buffer R, the coupled plasmid DNA was released by SgrDI restriction digestion (Thermo Scientific ER2031). The released plasmid DNA was self-circularized by the addition of T4 DNA ligase (NEB M0202) and ATP (NEB P0756) according to the manufacturer’s protocol. The circular plasmid DNA was run on a 0.6% agarose gel electrophoresis for extraction of high molecular weight megaplasmids by electroelution (G-Biosciences 786-001).

Bacterial transformation by electroporation. The electroeluted megaplasmids were transformed into Stable Competent E. coli (NEB C3040H) by electroporation. In brief, 1 µl of the megaplasmids was added to 50 µl of electrocompetent cells and mixed on ice. The mixture was transferred to a pre-chilled 0.2 cm gap width electroporation cuvette (Bio-Rad 1652086) and pulsed at 2.5 kV with the Bio-Rad Gene Pulser Xcell Electroporation System. The electroporated cells were resuspended in 1 ml of outgrowth medium and recovered for 1 h at 37ºC. Cells were then plated on ampicillin selection plates and incubated at room temperature until the appearance of colonies.

Colony PCR. The E. coli transformants of relatively smaller colony size were picked and transferred to individual PCR reaction mix. The vector-specific universal primers of T3 and T7 promoters were used for the PCR amplification. The PCR products were then run on a 0.8% agarose gel electrophoresis and visualized afterwards. The presence of a ~170 bp band indicates an empty vector, whereas the presence of an upshifted band indicates the presence of ligated inserts.

Megaplasmid extraction from E. coli. E. coli transformants were each inoculated in a 3 ml selective medium and grown at room temperature for 2 days. Alkaline lysis was performed using the reagents provided by the plasmid DNA miniprep kit (Biobasic BS614) according to the manufacturer’s protocol. After the neutralization step, the lysate was clarified by centrifugation and the removal of the pellet fraction. The clarified lysate containing the plasmid DNA was precipitated by the addition of an equal volume of pure isopropanol and centrifugation. The pellet was washed twice and air-dried. Finally, the pellet was resuspended gently in elution buffer using a cut pipette tip.

Table 1. Plasmids used in this study

Plasmid

Encoded protein

Promoter

Vector

Source

pGT562

-

-

pRS415

This study

pGT570

GFP

GCN4

pRS415

( Yap and Thibault, 2022 )

pGT731

-

-

pRS416

This study

pGT784

GFP

TATA-TSS

pRS415

This study

Table 2. Oligonucleotide Primers used in this study

Primer

Sequence (5’ to 3’)

TSO

Biotin-TATAAAAGAACGATCTACCGACTGTTTCGCAGAGGGCCAAAAAAGCATCGAAAAAAGGGG

WS219

GCCTCACCTGCAAGGTCGAATTATACGAAGTTATTATAAAAGAACGATCTACCG

WS220

GCCTCACCTGCAGGGTCGAATTATACGAAGTTATTTTTTTTTTTTTTTTTT

PaqCI(TCGA)-CYC1t_F

GCCTCACCTGCAAGTTCGAACAGGCCCCTTTTCCTTTGT

PaqCI(TCGA)-CYC1t_R

GCCTCACCTGCAGGTTCGAGATGAGAGTGTAAACTGCGA

PaqCI(NNNN)-CYC1t_F

GCCTCACCTGCAAGTNNNNACAGGCCCCTTTTCCTTTGT

PaqCI(NNNN)-CYC1t_R

GCCTCACCTGCAGGTNNNNGATGAGAGTGTAAACTGCGA

WS165F

Biotin-GACTTCTAGAGCGGCCGCCACCG

WS165R

GCCTACTAGTGGATCCCCCGGGCT

WS170F

ATGAGTAAAGGAGAAGAACTTTTCACTGG

WS170R

CTGCAGGAATTCGATATCAAGCTTATCG

TSS_R

TCCTTTACTCATTTTTTTCGATGCTTTTTTGGC

TATA-TSS_F

CGAATTCCTGCAGTATAAAAGAACGATCTACCGACTG

T3 promoter

GCAATTAACCCTCACTAAAGG

T7 promoter

TAATACGACTCACTATAGGG

Acknowledgments

Acknowledgments

We thank members of Thibault lab for critical reading of the manuscript.

Funding

This work was supported by the National Research Foundation, Singapore, under its NRF-NSFC joint research grant call (NRF2018NRFNSFC003SB-006 to G.T.). The authors declare no competing financial interests.

References

  1. Fujishima K, Venter C, Wang K, Ferreira R, Rothschild LJ. An overhang-based DNA block shuffling method for creating a customized random library. Sci Rep. 2015 May 26;5:9740–9740. doi: 10.1038/srep09740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Hamilton SR, Bobrowicz P, Bobrowicz B, Davidson RC, Li H, Mitchell T, Nett JH, Rausch S, Stadheim TA, Wischnewski H, Wildt S, Gerngross TU. Production of complex human glycoproteins in yeast. Science. 2003 Aug 29;301(5637):1244–1246. doi: 10.1126/science.1088166. [DOI] [PubMed] [Google Scholar]
  3. Kachroo AH, Laurent JM, Yellman CM, Meyer AG, Wilke CO, Marcotte EM. Evolution. Systematic humanization of yeast genes reveals conserved functions and genetic modularity. Science. 2015 May 22;348(6237):921–925. doi: 10.1126/science.aaa0769. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Kotopka BJ, Smolke CD. Model-driven generation of artificial yeast promoters. Nat Commun. 2020 Apr 30;11(1):2113–2113. doi: 10.1038/s41467-020-15977-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Laurent JM, Garge RK, Teufel AI, Wilke CO, Kachroo AH, Marcotte EM. Humanization of yeast genes with multiple human orthologs reveals functional divergence between paralogs. PLoS Biol. 2020 May 18;18(5):e3000627–e3000627. doi: 10.1371/journal.pbio.3000627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Laurent JM, Young JH, Kachroo AH, Marcotte EM. Efforts to make and apply humanized yeast. Brief Funct Genomics. 2015 Oct 13;15(2):155–163. doi: 10.1093/bfgp/elv041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Luo Z, Wang L, Wang Y, Zhang W, Guo Y, Shen Y, Jiang L, Wu Q, Zhang C, Cai Y, Dai J. Identifying and characterizing SCRaMbLEd synthetic yeast using ReSCuES. Nat Commun. 2018 May 22;9(1):1930–1930. doi: 10.1038/s41467-017-00806-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Ma L, Li Y, Chen X, Ding M, Wu Y, Yuan YJ. SCRaMbLE generates evolved yeasts with increased alkali tolerance. Microb Cell Fact. 2019 Mar 11;18(1):52–52. doi: 10.1186/s12934-019-1102-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Mortimer RK, Johnston JR. Genealogy of principal strains of the yeast genetic stock center. Genetics. 1986 May 1;113(1):35–43. doi: 10.1093/genetics/113.1.35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Redden H, Alper HS. The development and characterization of synthetic minimal yeast promoters. Nat Commun. 2015 Jul 17;6:7810–7810. doi: 10.1038/ncomms8810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Stemmer WP. DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. Proc Natl Acad Sci U S A. 1994 Oct 25;91(22):10747–10751. doi: 10.1073/pnas.91.22.10747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Stemmer WP. Rapid evolution of a protein in vitro by DNA shuffling. Nature. 1994 Aug 4;370(6488):389–391. doi: 10.1038/370389a0. [DOI] [PubMed] [Google Scholar]
  13. Wang P, Xu H, Li H, Chen H, Zhou S, Tian F, Li BZ, Bo X, Wu Y, Yuan YJ. SCRaMbLEing of a Synthetic Yeast Chromosome with Clustered Essential Genes Reveals Synthetic Lethal Interactions. ACS Synth Biol. 2020 Apr 17;9(5):1181–1189. doi: 10.1021/acssynbio.0c00059. [DOI] [PubMed] [Google Scholar]
  14. Yap WS, Thibault G. Human PERK rescues unfolded protein response-deficient yeast cells. MicroPubl Biol. 2022 Jun 28;2022 doi: 10.17912/micropub.biology.000592. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from microPublication Biology are provided here courtesy of California Institute of Technology

RESOURCES