Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2024 Mar 18;52(7):4098–4107. doi: 10.1093/nar/gkae189

Engineering an Escherichia coli strain for production of long single-stranded DNA

Konlin Shen 1, Jake J Flood 2, Zhihuizi Zhang 3, Alvin Ha 4,5,6, Brian R Shy 7,8,9, John E Dueber 10,11, Shawn M Douglas 12,
PMCID: PMC11040142  PMID: 38499480

Abstract

Long single-stranded DNA (ssDNA) is a versatile molecular reagent with applications including RNA-guided genome engineering and DNA nanotechnology, yet its production is typically resource-intensive. We introduce a novel method utilizing an engineered Escherichia coli ‘helper’ strain and phagemid system that simplifies long ssDNA generation to a straightforward transformation and purification procedure. Our method obviates the need for helper plasmids and their associated contamination by integrating M13mp18 genes directly into the E. coli chromosome. We achieved ssDNA lengths ranging from 504 to 20 724 nt with titers up to 250 μg/l following alkaline lysis purification. The efficacy of our system was confirmed through its application in primary T-cell genome modifications and DNA origami folding. The reliability, scalability and ease of our approach promise to unlock new experimental applications requiring large quantities of long ssDNA.

Graphical Abstract

Graphical Abstract.

Graphical Abstract

Introduction

Single-stranded DNA (ssDNA) plays a crucial role in biotechnology, particularly in DNA nanotechnology and gene editing (1,2). Synthesis of long ssDNA exceeding 5000 nt is challenging, and significant barriers prevent scalable production. Direct chemical synthesis via phosphoramidite chemistry is limited to lengths of 300–400 nt due to incorporation errors and depurination (3). To obtain longer ssDNA strands, current practices employ double-stranded DNA (dsDNA) as a template. For instance, asymmetric polymerase chain reaction (PCR) can generate ssDNA up to 15 000 nt in length (4). Other approaches include PCR amplification using differentially modified primers: phosphorylated and unphosphorylated for lambda exonuclease digestion (5), or biotinylated and nonbiotinylated for streptavidin bead separation (6)—resulting in isolation of long ssDNA strands. However, these techniques typically yield <1 μg of ssDNA per 50 μl reaction, making the production of milligram quantities costly and inefficient due to the extensive labor and high reagent consumption. This underscores the necessity for more scalable and economical ssDNA production methods.

Bioproduction of ssDNA in Escherichia coli offers a scalable alternative. By engineering the M13 bacteriophage to include specific ssDNA sequences, up to 590 mg ssDNA per liter of culture can be recovered once the viral particles are processed (7). However, the resulting ssDNA products include ∼6 kb of the M13 genome, and issues with sequence stability and titer can arise for custom insertions longer than 2 kb (8,9). Additionally, utilizing phages carries a contamination risk for other bacterial cultures.

Improvements to M13-based ssDNA production can be made through use of helper phages (10) or helper plasmids (11,12). In this approach, the phage genes are encoded on the helper entity and the desired sequence is cloned into a phagemid, a separate plasmid bearing an M13 replication origin and packaging sequence. This separation prevents the inclusion of the M13 genome in the final ssDNA product. Phagemid utilization can yield ssDNA lengths up to 51 000 nt (13) and, with optimized E. coli growth conditions, titers can reach up to 80 mg/l of culture (14). Nevertheless, the consistency of ssDNA production using helper elements is variable, with outcomes ranging from off-target by-products to a lack of ssDNA (15).

Here, we describe our novel phagemid-based method that improves the quality of bacteriologically produced ssDNA (Figure 1). Our method relies on an engineered ‘helper strain’ called ‘eScaf’. We posited that inconsistent target ssDNA yields may result from high or variable expression of M13 genes, which, in turn, may arise from fluctuations in helper plasmid copy number. To address this, we integrated the M13 genes directly into the bacterial chromosome. We assessed the performance of eScaf, considering the chromosomal integration site and target ssDNA length and composition. We demonstrate that eScaf consistently produces high-quality ssDNA at concentrations of 100–200 μg per liter of bacterial culture using shake flasks. We further validated our method by successfully employing eScaf-generated ssDNA in DNA origami fabrication and in CRISPR-mediated gene editing applications, highlighting its utility in advanced biotechnological applications.

Figure 1.

Figure 1.

Overview of ssDNA production using the eScaf helper strain. Conventional phagemid-based ssDNA synthesis involves co-transforming an E. coli host strain with a helper plasmid and a phagemid, which leads to the production of phage-like particles harboring the desired ssDNA (top). The helper plasmid mediates the expression of the phage genes necessary for this process. In contrast, our eScaf method (bottom) streamlines this by integrating the phage genes directly into the chromosome, establishing a ‘helper strain’ that requires only the phagemid for transformation. For both strategies, phage-like particles containing ssDNA are released into the culture medium and can be isolated through polyethylene glycol (PEG) fractionation. The target ssDNA is subsequently extracted from these particles using alkaline lysis followed by ethanol precipitation for purification.

Materials and methods

Helper strain construction

We integrated the M13mp18 and M13KO7 genomes into the chromosome of the TG1 E. coli strain using a CRISPR RNA-guided transposase system (16). The M13mp18 and M13KO7 genomes were cloned into ‘donor’ plasmids carrying mini-transposons. Guide RNA (gRNA) targeting locations within the genome were cloned into a separate ‘effector’ plasmid also carrying the transposition and CRISPR machinery. Transposition was achieved by dual transformation of the donor and effector plasmids into TG1 and growth at 30°C. Colonies were then screened for successful integration using colony PCR (cPCR). Colonies carrying the insert were then cured of their plasmids by overnight growth in Luria-Bertani (LB) broth without selection and subsequently plated onto regular LB agar plates without selection to use as a master plate. Replica plating was used to transfer those colonies onto plates selecting for either the donor plasmid or the effector plasmid. Colonies that did not grow on either selection plate were then picked from the master plate and integration of M13 genes was once again verified through cPCR. Glycerol stocks and chemically competent cells were made from colonies showing successful integration and curing for banking and ssDNA production, respectively.

Phagemid construction

Phagemids were created by cloning inserts into the pScaf (11) backbone using Gibson assembly or golden gate cloning. Phagemids designed for testing sequence length between 504 and 10 080 nt were constructed from 1 to 10 gBlocks (IDT), 1008 bp in length, using Gibson assembly. To construct the 20 874-nt phagemid, a 9-kb fragment from the dynein gene and a red fluorescent protein (RFP) expression operon were cloned into the 10 080-nt phagemid using Gibson assembly. The phagemid for the DNA origami tile was constructed using golden gate cloning to insert a 500-bp gBlock into the phagemid backbone. The phagemid designed for the B-cell maturation antigen chimeric antigen receptor (BCMA-CAR) CRISPR knock-in was constructed via PCR by amplifying the template from plasmids bearing the construct and cloning them into the pScaf backbone using Gibson assembly.

Amplification and purification of ssDNA from helper strains

We first transformed the strains with a phagemid of interest, and then picked a colony and subsequent growth in liquid culture with antibiotic selection. Cultures were grown in standard shaker flasks at 30°C for 12–16 h. To harvest ssDNA, cultures were first chilled on ice for 35 min, and then bacteria and phage-like particles were separated by pelleting the bacteria using centrifugation for 15 min at 7 kRCF and 4°C. The supernatant was collected and the phage-like particles were purified by PEG precipitation. PEG-8000 (40 g/l) and NaCl (30 g/l) were added to the mixture and incubated on ice for 30 min. Then, the phage-like particles were pelleted by spinning the mixture at 9 kRCF for 15 min at 4°C, discarding the supernatant and resuspending the pellet in Tris-EDTA (TE) buffer. To prevent possible contamination of the ssDNA products by bacterial DNA, any residual bacteria in the phage-like particle solutions were pelleted by spinning the phage-like particle solutions at 15 kRCF and 4°C for 15 min, transferring the supernatant to clean tubes and spinning the solution again at 15 kRCF and 4°C for 15 min to ensure that no bacteria remained in the solution. The phage coat proteins were subsequently stripped via alkaline lysis. Two volumes of P2 buffer (Qiagen) was added to the phage solution for lysis and 1.5 volumes of P3 buffer (Qiagen) was immediately added to neutralize the reaction. The mixture was then incubated on ice for 15 min and the phage debris was pelleted by centrifugation at 15 kRCF and 4°C for 15 min. The supernatant of the mixture was then moved to a clean tube and three volumes of cold 100% EtOH was added to the mixture and the ssDNA precipitated at −20°C overnight. Finally, the ssDNA was pelleted by centrifugation at 16 kRCF and 4°C for 15 min. The supernatant was aspirated off and the pellet washed in cold 70% EtOH by a second centrifugation step at 15 kRCF and 4°C for 15 min. The supernatant was aspirated off again, and the pellet was dried at room temperature by leaving the tube inverted on the bench. Finally, the pellet was resuspended in TE buffer.

ssDNA concentrations were measured using an NP80 nanophotometer (Implen) on the ssDNA setting and purity was evaluated using agarose gel electrophoresis (2% Tris-Borate-EDTA agarose gel with 11 mM Mg added).

Fabrication of DNA origami tile

We designed a small DNA origami tile using Cadnano (17) and optimized the staple routing path using a custom software developed by the Douglas lab (manuscript in preparation). A custom 3024-nt origami scaffold was produced using the methods described above for phagemid cloning and scaffold purification. Staples were ordered from IDT. The DNA origami tile was folded by mixing the custom scaffold with a 10× molar excess of staples and subjecting the mixture to the following thermal ramp:

  • Incubate at 65°C for 15 min.

  • Incubate at 60°C for 1 h, decreasing 1°C per cycle.

  • Go to step 2 and repeat an additional 40 times.

DNA origami nanostructures were purified via PEG precipitation by adding an equivalent volume of a 15% PEG-8000 solution [15% (w/v) PEG-8000, 5 mM Tris, 1 mM ethylenediaminetetraacetic acid (EDTA), 5 mM NaCl, 20 mM MgCl2] and centrifuging for 25 min at 16 kRCF. The supernatant was aspirated, and the pellet was resuspended in 1xFOB8 (5 mM Tris, 1 mM EDTA, 8 mM MgCl2) and allowed to resuspend for 30 min at room temperature. The PEG precipitation step was repeated one more time.

DNA origami tiles were evaluated using gel agarose electrophoresis (2% TBE with 11 mM Mg) and negative-stain transmission electron microscopy (TEM). Briefly, formvar-coated copper grids (Ted Pella) were plasma treated with a Pelco glow discharge unit (30 s hold/30 s glow discharge). One microliter of DNA origami at a concentration of 6.55 nM was incubated on the grid for 3 min, and then the excess sample was wicked away using filter paper. Grids were washed twice in 1xFOB8 and washed once in freshly prepared 2% uranyl formate. A droplet of 2% uranyl formate was then placed on the grid and incubated for 1 min at room temperature for staining and then wicked away, and the grid was dried by a vacuum aspirator. Micrographs of the negatively stained grids were collected on a Tecnai T12 (FEI). 2D class averages were made using EMAN2.

BCMA-CAR knock-in T cells using ssDNA products generated by helper strains

Cas9 ribonucleoproteins (RNPs) targeting the TRAC locus were produced by complexing gRNA to Cas9 with the addition of an single-stranded oligodeoxynucleotide (ssODN) electroporation enhancer. To create the gRNA, synthetic CRISPR RNA (crRNA) and trans-activating crRNA were chemically synthesized individually (Dharmacon) and each strand was resuspended in IDT duplex buffer at a concentration of 160 μM. The two strands were mixed 1:1 (v/v) and annealed by incubation at 37°C for 30 min to form an 80 μM gRNA solution. ssODN enhancer was mixed into the gRNA solution at a 0.8:1 volume ratio followed by 40 μM Cas9-NLS (UC Berkeley QB3 MacroLab) at 1:1 (v/v). The entire mixture was incubated for 15–30 min.

Primary adult blood cells were obtained from healthy human donors as a leukapheresis pack purchased from StemCell Technologies, Inc. Primary human CD3+ T cells were isolated by negative selection using magnetic cell isolation kits (StemCell Technologies, Inc., catalog no. 17951). The isolated cells were then activated with magnetic anti-human CD3/CD28 Dynabeads (CTS, Thermo Fisher). Beads were mixed at a 1:1 ratio with cells and cells were cultured at a density of 1 × 106 cells/ml in complete X-Vivo 15 culture media (Lonza, 5% fetal bovine serum, 10 nM N-acetyl-l-cysteine, 50 μM 2-mercaptoethanol) supplemented with 5 ng/ml of IL-7 and IL-15 for 48 h.

Activated primary T cells were debeaded 48 h post-activation using an EasySep magnet (StemCell Technologies, Inc.) and electroporated using the Lonza 4D 96-well electroporation system. Immediately before electroporation, cells were centrifuged at 90 × g for 10 min and then resuspended at 0.5 × 106 cells per 20 μl of Lonza P3 electroporation buffer. Circularized ssDNA (cssDNA) encoding the BCMA-CAR and RNPs were mixed and incubated for at least 5 min, and then combined with cells and transferred to the Lonza 96-well electroporation shuttle. Fifty picomoles of RNP was used for each electroporation. Primary T cells were electroporated using pulse code EH-115, immediately rescued with prewarmed culture media and incubated at 37°C for at least 15 min. Cells were then transferred to 96-well plates and maintained at 1.0 × 106 cells/ml in complete X-Vivo 15 culture media. Fresh cytokines and media were added every 2–3 days and split using identical dilution factors for all samples to prevent cell overgrowth and maintain comparable cell counts.

Knock-in percentage and cell counts were measured on day 10 of culturing (day 8 post-electroporation) by flow cytometry using the Attune NxT flow cytometer with a 96-well autosampler (Thermo Fisher Scientific). Briefly, cells were resuspended in EasySep buffer (phosphate-buffered saline, 2% fetal bovine serum, 1 mM EDTA), and then incubated with biotinlyated recombinant BCMA protein (Acro Biosystems) for 20 min at 4°C. Cells were washed and stained with GhostDye 780 (Tonbo), TCRab-BV421 (BioLegend) and Streptavidin-APC (BioLegend) for 20 min at 4°C. Events were recorded from a fixed volume for all samples to obtain comparable live cell counts between conditions.

Results

eScaf successfully generated custom ssDNA from 504 to 20 724 nt in length

We constructed two eScaf helper strains by inserting the genomes of M13mp18 and M13KO7 into the chromosome of the JM101 E. coli derivative, TG1. These strains were transformed with phagemids carrying ssDNA sequences varying from 504 to 20 724 nt. The transformed strains were cultured in LB media, after which we isolated phage-like particles from the culture supernatant and extracted the ssDNA by alkaline lysis. While both the M13mp18 and M13KO7 strains successfully produced ssDNA across a range of lengths, the M13mp18 strain achieved notably higher ssDNA yields (Figure 2A). For a 3396-nt ssDNA strand, M13mp18 helper strain produced average titers of 206 μg/l, whereas M13KO7 strains produced average titers of 11 μg/l (Figure 2B). While both integrated strains showed target band purities above 90%, the M13mp18 strain produced fewer off-target species than the M13KO7 strain (Figure 2C). Both strains showed limited efficiency in producing the 504-nt ssDNA (Supplementary Figure S2), consistent with previous reports that short ssDNA is difficult to produce using phagemid approaches (12). Below we summarize specific factors that affected titer and purity of ssDNA using our new method.

Figure 2.

Figure 2.

Comparison of ssDNA production by M13mp18 and M13KO7 helper strains. (A) Both M13mp18- and M13KO7-based helper strains can produce ssDNA within a size range of 504–20 724 nucleotides. The M13KO7-based strains exhibit a higher presence of off-target species and background than the M13mp18-based strains. (B) The total ssDNA yield is greater when using M13mp18-based helper strains compared to M13KO7-based strains. (C) While ssDNA purity exceeds 90% for both helper strains, M13mp18 strains produce ssDNA that is marginally purer than that from M13KO7 strains. Purity percentages were determined by quantifying the intensity of the target band within the dotted box and normalizing this value against the total intensity of the full lane. All ladders are 1-kb ladders from New England Biolabs.

Phagemid–helper strain compatibility affects ssDNA yield

Contrary to earlier reports suggesting superior ssDNA purity from M13KO7 plasmid systems (12), our findings indicate a decrease in purity when employing M13KO7 helper strains. We hypothesized that this discrepancy could be linked to the compatibility between the M13 packaging sequences in the phagemids and the helper strains. Our phagemids, based on the pScaf backbone (11), incorporate a modified packaging sequence, which we suspected might influence the final ssDNA product’s quality. To test this hypothesis, we conducted experiments with our M13KO7 and M13mp18 helper strains using two distinct phagemids: the pScaf-derived phagemid and another that was previously paired with the M13KO7 helper plasmid (12) (referred to as the ‘KO7 phagemid’).

Our data revealed a clear dependence of ssDNA yield on the phagemid used (Figure 3A). Optimal yields were achieved when pairing the M13mp18 helper strain with the pScaf phagemid and the M13KO7 helper strain with the KO7 phagemid. In contrast, using the KO7 phagemid with the M13mp18 helper strain or the pScaf phagemid with the M13KO7 helper strain resulted in significantly lower ssDNA production. These results indicate that both M13KO7 and M13mp18 helper strains can be effective when paired with the appropriate phagemid. Given the slight edge in ssDNA purity and our primary use of the pScaf backbone in previous phagemid cloning, we performed all subsequent work with the M13mp18 helper strain.

Figure 3.

Figure 3.

ssDNA production requires phagemid–host strain compatibility and is time dependent. (A) Optimal titer results are achieved when both the helper strain and phagemid are suitably matched. M13KO7-integrated helper strains fail to package ssDNA derived from the pScaf phagemid but work well with the KO7 phagemid. Conversely, M13mp18-integrated helper strains show high-compatibility pScaf-based phagemids and are less effective with the KO7 phagemid. (B) Time series of ssDNA production for a 4032-nt ssDNA strand in the M13mp18 helper strain. Measurable ssDNA production begins 8 h after inoculation and ceases following 16 h of growth, with the culture reaching saturation at ∼12 h. (C) Agarose gel analysis of temporal ssDNA production by the M13mp18 helper strain shows predominant desired on-target ssDNA bands with a minor presence of an off-target band. Ladders are 1-kb ladders from New England Biolabs.

Production of ssDNA peaked after 16 h of growth at 30°C

We observed that ssDNA production reached its maximum after 16 h of growth at 30°C, which occurs roughly 4 h after culture saturation (Figure 3B). Notably, ssDNA titer exhibited a reproducible decline following its peak, followed by a marginal increase later in the growth cycle. The final increase did not appear to be due to cellular lysis, which would be indicated by additional gel bands typically associated with extraneous nucleic acid contamination (Figure 3C).

Integration location affects ssDNA titer, but does not affect purity

We found that integrating the M13mp18 genome in different genomic loci of TG1 affected ssDNA titer, but not its purity (Figure 4A and B). We tested five different insertion sites, both intergenic and intragenic (Table 1), but did not find that insertion into intergenic or intragenic regions correlated with ssDNA titer. In fact, we found the location with the highest titer was within the UlaD gene (155 μg/l). On the other hand, insertion into an intergenic region close to the origin of replication resulted in no ssDNA production. In all integration sites, ssDNA purity was >90% (Supplementary Figure S3).

Figure 4.

Figure 4.

M13mp18 helper strain characterization. (A) Integration location of M13mp18 genome in the TG1 genome affects ssDNA yield. (B) Integration location does not affect ssDNA purity. M13mp18 helper strains with integration into site 1 yielded no ssDNA products, so gel results were not included. (C) ssDNA yield from the M13mp18 helper strain does not show strong dependence on ssDNA length. The 1008-nt ssDNA product does show lower yields than other ssDNA products with lengths ranging from 2008 to 10 080 nt. (D) Production of 20 724-nt ssDNA is not as consistent as shorter length ssDNA products. Of four tested preparations, all preparations showed off-target contamination and we found that only two produced the correct product. Preparations that produced the correct product did show a red pellet, corresponding to the RFP gene encoded in the ssDNA product. (E) Sequence composition influences ssDNA titer. (F) Sequence composition does not have a strong effect on ssDNA purity. All ladders are 1-kb ladders from New England Biolabs.

Table 1.

Integration locations

Site Disrupted gene gRNA target
1 Intergenic (near OriC) tctggcgcagttgatatgtaaggcaggtttat
2 dcm attattgcgatccggcgacgcatcattttaat
3 YaaJ gactttagcaaaaaatgagaatgagttgatcg
4 Intergenic (between YaaJ and ThrC) taatctctaagttaccgcatttattgagaatt
5 UlaD aacgcggcagcttaagaaggaattgaacatgt

The length of ssDNA has minimal impact on production titer

Our investigation revealed no substantial correlation between the titer and the length of ssDNA within the range of 2016–10 080 nt. Notably, the ssDNA strand of 1008 nt consistently showed lower titers (refer to Figure 4C). We designed a series of phagemids encoding ssDNA sequences, increasing in length by increments of 1008 nt. Each subsequent phagemid contained the entirety of the preceding sequence plus an additional 1008 nt. Although the yield for the 1008-nt product was lower, it was producible, unlike the 504-nt product, indicating that a minimum threshold for efficient ssDNA synthesis exists between 1008 and 2016 nt.

To explore the upper limits of ssDNA production, we cloned a 20 784-nt target into the pScaf vector. This extended ssDNA sequence included all sequences from the 10 080-nt construct with an RFP operon incorporated within the additional 10 704 nt. The resulting ssDNA products using this phagemid showed increased variability and the presence of off-target by-products (Figure 4D), with only half of the tested colonies displaying the expected band size and all colonies exhibiting additional bands corresponding to the vector backbone. However, the expression of RFP in bacterial cells was correlated with the successful production of the target ssDNA. This outcome hints at the potential instability of long ssDNA sequences within phagemids and suggests that incorporating a secondary selection marker might enhance the quality of ssDNA, particularly for constructs exceeding 10 080 nt.

Influence of ssDNA sequence on yield and purity

Our investigation into the effects of ssDNA sequence on production yield and purity involved four constructs with lengths ranging from 3024 to 3396 nt. These constructs included a randomly generated sequence (p3024), a segment from the dynein gene’s coding region (p3031) and two sequences constituting complete operons for a BCMA-CAR (p3384 and p3396).

Notably, the sequence composition significantly influenced yield; for instance, p3396 resulted in the highest yield at 245 μg/l, while p3024 had the lowest yield at 49 μg/l. Contrary to initial expectations, complete operon sequences led to higher yields compared to the coding sequence or a random sequence. Despite a high degree of similarity (93%) between the two operon sequences with the highest yields, their yield differed by nearly 50%, underscoring the critical role of sequence in ssDNA production efficiency.

Purity, however, did not exhibit significant variations across the constructs, as demonstrated in Figure 4E and F. Base composition was relatively consistent among the sequences, with each having a base content ranging from 22% to 27% (detailed in Supplementary Figure S4).

Utilization of eScaf-derived ssDNA in DNA nanotechnology applications

To validate the practical applications of our eScaf-derived ssDNA, we employed it in the construction of DNA origami nanostructures. DNA origami is a technique where a long ssDNA scaffold is intricately folded into predetermined shapes with the help of numerous short ssDNA staple strands. We used eScaf to produce a custom 3024-nt scaffold and folded it into a 24 nm × 43 nm × 6 nm rectangular tile using 68 commercially purchased staple strands (Figure 5A and Supplementary Figure S5). Agarose gel electrophoresis and negative-stain TEM confirmed the integrity and accuracy of the folded structures (Figure 5B). These data demonstrate that eScaf-derived ssDNA can be used to make DNA nanostructures.

Figure 5.

Figure 5.

Applications of ssDNA products in biotechnology. (A, B) ssDNA products can be used for DNA origami. (A) Schematic of DNA origami tile made from custom 3024-nt ssDNA scaffold. (B) (Left) Gel data of a rectangular DNA origami tile folded from a custom 3024-nt scaffold made using the eScaf system. Lane 3 shows the tile immediately after folding with a well-defined origami band. Lane 4 shows the tile after staple purification. L, NEB 1-kb ladder; S, 3024-nt scaffold; O, original origami without purification; P, purified origami. (Right) Negative-stain TEM of the purified 3024-nt tile shows well-formed tiles. ‘Narrow’ tiles show when the tile lands on its side rather than on its face. Box: 2D class average of the tile. (C, D) eScaf-derived ssDNA products can be used for CRISPR-mediated knock-ins. (C) Schematic of the knock-in procedure using eScaf produced BCMA-CAR homology-directed repair (HDR) template for human primary T-cell knock-ins. Knock-in efficiency and cell yield were evaluated by flow cytometry. (D) (Left) Cells using eScaf-derived templates can successfully be knocked in at all template doses. However, the efficiency of knock-in decreases substantially with template dose. At nontoxic doses, knock-in efficiency using eScaf-derived ssDNA is comparable to the previously published literature (18). (Right) Cell yield decreases as increasing amounts of eScaf-derived templates are electroporated. The toxicity of eScaf-derived products may be due to endotoxins from bacterial culture.

Application of eScaf-derived ssDNA in CRISPR knock-ins for primary T cells

We synthesized a single-stranded CRISPR HDR template for a BCMA-CAR knock-in, which includes a 1600-nt coding sequence, homology arms ∼600 nt in length and additional 500-nt noncoding regions comprising the M13 packaging sequence, and confirmed successful integration via flow cytometry (Figure 5C).

Flow cytometry analysis revealed that, at low doses (0.125 and 0.25 pmol) of repair template, eScaf-derived ssDNA produced knock-in efficiencies approaching 20% (Figure 5D), consistent with prior knock-in rates using DNA templates for primary T-cell engineering (18). At higher doses, cell yield decreased dramatically for cells electroporated with eScaf-derived templates (Figure 5D). Although we purify the ssDNA from the phage proteins using ethanol purification, endotoxins can be retained with DNA in alcohol-based DNA precipitation methods (19). We suspect that additional residual endotoxins from bacterial production of the templates may cause cell death, contributing to the decreased knock-in efficiency. Despite this drawback, our results demonstrate substantial promise for using eScaf as a cost-effective method to create cssDNA templates for use in gene editing.

Discussion

Our eScaf helper strain consistently produces long ssDNA with higher purity compared to conventional techniques utilizing helper plasmids or helper phages. Our helper plasmid ssDNA products typically yielded <60% of the desired ssDNA product, while ssDNA products made by our helper strain typically achieved >90% purity (Supplementary Figure S1). For ssDNA lengths ranging from 1008 to 10 080 nt synthesized via our process, even though occasional faint off-target bands were observed, the purity was sufficient to forgo additional purification steps.

The eScaf system enables the production of ssDNA within a 5-day period, including transformation, culture growth, phage-like particle harvesting and purification. Using standard shake flasks and LB media, we consistently generated BCMA-CAR templates at a concentration of 100 μg per liter of bacterial culture. This translates to an estimated cost of $0.04 per microgram of ssDNA, representing a notable cost efficiency. Further cost reductions may be achieved by optimizing bacterial media and growth conditions, or by employing bioreactors for high-density cultures.

Commercial kits for ssDNA synthesis are currently constrained by their limited production length (typically <5000 nt) and high costs compared to our eScaf method. While commercial vendors can synthesize longer strands, the associated expenses and lengthy production times are substantially more burdensome than using eScaf, which offers a faster and more cost-effective solution for long ssDNA synthesis. Additionally, the higher purity of eScaf products compared to those generated by phage or helper entities reduces the amount of labor needed to generate target ssDNA. Our system’s reliance on phagemids and a helper strain—rather than on self-replicating phages—eliminates the risk of contaminating other bacterial cultures, enhancing biosafety and reducing the potential for cross-contamination. The requirement for just a single phagemid avoids the dual antibiotic selection required by helper plasmid systems and thereby reduces reagent cost.

It is important to note, however, that phagemid-based ssDNA production, including eScaf, does not export completely customized ssDNA, as the target sequence must contain the M13 packaging sequences. Recent advances have shortened the size of these packaging sequences to 234 nt, significantly less than the original 2000–3000 nt (12,20). Moreover, self-cleaving DNAzymes can then be used to remove the M13 sequences post-harvest, leaving <10 nt of residual sequence (21,22). While eScaf also does not efficiently produce 1000-nt or shorter ssDNA strands, the use of self-cleaving DNAzymes can facilitate the generation of shorter ssDNA from longer ones.

By enabling the inexpensive production of pure ssDNA, our eScaf method holds significant promise for the field of DNA nanotechnology. The DNA origami method depends on ssDNA for constructing nanoscale structures, where precise control over sequence length and composition can be crucial. Considering the potential need for hundreds of unique strands per nanostructure, the affordability of our approach is particularly advantageous for those seeking to explore various design variants, assemble complex structures or scale up DNA nanodevice production for clinical or industrial applications. Additionally, our method has the potential to revolutionize the assembly of DNA origami by producing long ssDNA strands up to 20 000 nt, enabling the creation of larger nanostructures on a single phagemid. Combining ssDNA staples and scaffolds on a single construct enhances folding reaction efficiency and reduces cost (21). This could dramatically expand the size and complexity of structures that can be synthesized.

Our method to economically produce custom long ssDNA will also benefit gene editing applications. First, the efficiency of CRISPR-mediated HDR in mice is significantly higher when using ssDNA templates than when using dsDNA templates (23). Long ssDNA templates also are less toxic and have increased knock-in efficiency compared to dsDNA templates in human T-cell genome editing (2,18). A secondary benefit from phagemid-based ssDNA production is that the ssDNA produced is naturally circularized due to the intrinsic rolling-circle amplification and ligation steps employed in filamentous phage propagation. cssDNA has been shown to improve knock-in efficiency over linear ssDNA (24,25). The increased efficiency of gene editing using cssDNA HDR templates will facilitate CAR T-cell manufacturing and gene therapies and allow insertion of larger gene circuits for new synthetic biology constructs. Applying well-established methods to remove endotoxins from purified DNA may further improve cell viability and knock-in efficiency.

Conventional flask-based growth optimizations have yielded several milligrams of ssDNA per liter of phage culture, suggesting that further improvements to our eScaf system are possible. To increase titer, future studies could investigate additional chromosomal integration sites (26), explore ssDNA sequence variations and even consider alternative E. coli strains that may offer superior performance (27). Genetic modifications to the host strain, such as knocking out RecA, may also improve titer and stabilize very large (10 000-nt+) ssDNA production.

Modifications to the M13 genome could also boost ssDNA titer, as evidenced by our observation of improved ssDNA yield by using M13mp18 over M13KO7 and literature indicating that downregulation of protein V increases ssDNA yield in standard phage preparations (28). Changes must be made judiciously, due to the delicate balance of the M13 life cycle in which modifications to the expression of any protein may have significant downstream effects (29). For example, we designed and built an inducible version of the helper strain by replacing two native M13 promoters with T7lac promoters to test the hypothesis that constitutive phage gene expression resulted in toxicity. Although we observed suppression of ssDNA production prior to the addition of the isopropyl ß-D-1-thiogalactopyranoside (IPTG) inducer, the purity and yield of the exported ssDNA were poor (Supplementary Figure S6). Our experiments with inducible helper strains indicate that directed evolution of the M13 genes may be a more effective approach to optimize ssDNA production.

In conclusion, our eScaf method presents a scalable and economical solution akin to bacterial protein manufacturing. By integrating M13 genes into the E. coli chromosome, we have markedly enhanced the purity of ssDNA, overcoming a major hurdle toward the production and utilization of ssDNA-dependent biotechnological tools. We anticipate that the high-purity ssDNA amplification methods we have developed will be instrumental in advancing research in gene editing and DNA nanotechnology, among other fields.

Supplementary Material

gkae189_Supplemental_File

Acknowledgements

Plasmids for the Cas-guided transposon system (pSL1777 and pSL1119) were gifts from Samuel H. Sternberg (Addgene plasmid #160731, http://n2t.net/addgene:160731, RRID:Addgene_160731; Addgene plasmid #160732, http://n2t.net/addgene:160732, RRID:Addgene_160732). The phagemid used for the phagemid–helper strain compatibility study (pre_IV_CS6_4323_v1) was a gift from Hendrik Dietz (Addgene plasmid #126859, http://n2t.net/addgene:126859, RRID:Addgene_126859). We thank Leslie Biennen of C3 Science Communication for edits and comments on the manuscript.

Contributor Information

Konlin Shen, Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA, 94158 USA.

Jake J Flood, Department of Bioengineering, University of California, Berkeley, Berkeley, CA, 94720 USA.

Zhihuizi Zhang, Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA, 94158 USA.

Alvin Ha, Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, 94143 USA; Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA, 94158 USA; Department of Medicine, University of California, San Francisco, San Francisco, CA, 94143 USA.

Brian R Shy, Department of Laboratory Medicine, University of California, San Francisco, San Francisco, CA, 94143 USA; Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA, 94158 USA; Department of Medicine, University of California, San Francisco, San Francisco, CA, 94143 USA.

John E Dueber, Department of Bioengineering, University of California, Berkeley, Berkeley, CA, 94720 USA; Biological Systems & Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720 USA.

Shawn M Douglas, Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA, 94158 USA.

Data availability

The data underlying this article are available at doi.org/10.6084/m9.figshare.25301443.

Supplementary data

Supplementary Data are available at NAR Online.

Funding

National Science Foundation [ECCS-2036865, DBI-1548297]; National Institutes of Health [R35GM125027, 1F32GM147967 to K.S.]. Funding for open access charge: National Science Foundation [DBI-1548297].

Conflict of interest statement. None declared.

References

  • 1. Rothemund  P.W.  Folding DNA to create nanoscale shapes and patterns. Nature. 2006; 440:297–302. [DOI] [PubMed] [Google Scholar]
  • 2. Shy  B.R., Vykunta  V.S., Ha  A., Talbot  A., Roth  T.L., Nguyen  D.N., Pfeifer  W.G., Chen  Y.Y., Blaeschke  F., Shifrut  E.  et al.  High-yield genome engineering in primary cells using a hybrid ssDNA repair template and small-molecule cocktails. Nat. Biotechnol.  2023; 41:521–531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Caruthers  M.H.  The chemical synthesis of DNA/RNA: our gift to science. J. Biol. Chem.  2013; 288:1420–1427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Veneziano  R., Shepherd  T.R., Ratanalert  S., Bellou  L., Tao  C., Bathe  M.  In vitro synthesis of gene-length single-stranded DNA. Sci. Rep.  2018; 8:6548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Zhang  H., Chao  J., Pan  D., Liu  H., Huang  Q., Fan  C.  Folding super-sized DNA origami with scaffold strands from long-range PCR. Chem. Commun.  2012; 48:6405–6407. [DOI] [PubMed] [Google Scholar]
  • 6. Wakimoto  Y., Jiang  J., Wakimoto  H.  Isolation of single-stranded DNA. Curr. Protoc. Mol. Biol.  2014; 107:2–15. [DOI] [PubMed] [Google Scholar]
  • 7. Kick  B., Hensler  S., Praetorius  F., Dietz  H., Weuster-Botz  D.  Specific growth rate and multiplicity of infection affect high-cell-density fermentation with bacteriophage M13 for ssDNA production. Biotechnol. Bioeng.  2017; 114:777–784. [DOI] [PubMed] [Google Scholar]
  • 8. Hines  J.C., Ray  D.S.  Construction and characterization of new coliphage M13 cloning vectors. Gene. 1980; 11:207–218. [DOI] [PubMed] [Google Scholar]
  • 9. Yanisch-Perron  C., Vieira  J., Messing  J.  Improved M13 phage cloning vectors and host strains: nucleotide sequences of the M13mpl8 and pUC19 vectors. Gene. 1985; 33:103–119. [DOI] [PubMed] [Google Scholar]
  • 10. Chen  X., Wang  Q., Peng  J., Long  Q., Yu  H., Li  Z.  Self-assembly of large DNA origami with custom-designed scaffolds. ACS Appl. Mater. Interfaces. 2018; 10:24344–24348. [DOI] [PubMed] [Google Scholar]
  • 11. Nafisi  P.M., Aksel  T., Douglas  S.M.  Construction of a novel phagemid to produce custom DNA origami scaffolds. Synth. Biol.  2018; 3:ysy015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Engelhardt  F.A., Praetorius  F., Wachauf  C.H., Brüggenthies  G., Kohler  F., Kick  B., Kadletz  K.L., Pham  P.N., Behler  K.L., Gerling  T.  et al.  Custom-size, functional, and durable DNA origami with design-specific scaffolds. ACS Nano. 2019; 13:5015–5027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Marchi  A.N., Saaem  I., Vogen  B.N., Brown  S., LaBean  T.H.  Toward larger DNA origami. Nano Lett.  2014; 14:5740–5747. [DOI] [PubMed] [Google Scholar]
  • 14. Behler  K.L., Honemann  M.N., Silva-Santos  A.R., Dietz  H., Weuster-Botz  D.  Phage-free production of artificial ssDNA with Escherichia coli. Biotechnol. Bioeng.  2022; 119:2878–2889. [DOI] [PubMed] [Google Scholar]
  • 15. Zhou  B., Dong  Q., Ma  R., Chen  Y., Yang  J., Sun  L.Z., Huang  C.  Rapid isolation of highly pure single-stranded DNA from phagemids. Anal. Biochem.  2009; 389:177–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Vo  P.L.H., Ronda  C., Klompe  S.E., Chen  E.E., Acree  C., Wang  H.H., Sternberg  S.H.  CRISPR RNA-guided integrases for high-efficiency, multiplexed bacterial genome engineering. Nat. Biotechnol.  2021; 39:480–489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Douglas  S.M., Marblestone  A.H., Teerapittayanon  S., Vazquez  A., Church  G.M., Shih  W.M.  Rapid prototyping of 3D DNA-origami shapes with caDNAno. Nucleic Acids Res.  2009; 37:5001–5006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Roth  T.L., Puig-Saus  C., Yu  R., Shifrut  E., Carnevale  J., Li  P.J., Hiatt  J., Saco  J., Krystofinski  P., Li  H.  et al.  Reprogramming human T cell function and specificity with non-viral genome targeting. Nature. 2018; 559:405–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Montbriand  P.M., Malone  R.W.  Improved method for the removal of endotoxin from DNA. J. Biotechnol.  1996; 44:43–46. [DOI] [PubMed] [Google Scholar]
  • 20. Specthrie  L., Bullitt  E., Horiuchi  K., Model  P., Russel  M., Makowski  L.  Construction of a microphage variant of filamentous bacteriophage. J. Mol. Biol.  1992; 228:720–724. [DOI] [PubMed] [Google Scholar]
  • 21. Praetorius  F., Kick  B., Behler  K.L., Honemann  M.N., Weuster-Botz  D., Dietz  H.  Biotechnological mass production of DNA origami. Nature. 2017; 552:84–87. [DOI] [PubMed] [Google Scholar]
  • 22. Zhang  Q., Xia  K., Jiang  M., Li  Q., Chen  W., Han  M., Li  W., Ke  R., Wang  F., Zhao  Y.  et al.  Catalytic DNA-assisted mass production of arbitrary single-stranded DNA. Angew. Chem. Int. Ed.  2023; 62:e202212011. [DOI] [PubMed] [Google Scholar]
  • 23. Quadros  R.M., Miura  H., Harms  D.W., Akatsuka  H., Sato  T., Aida  T., Redder  R., Richardson  G.P., Inagaki  Y., Sakai  D.  et al.  Easi-CRISPR: a robust method for one-step generation of mice carrying conditional and insertion alleles using long ssDNA donors and CRISPR ribonucleoproteins. Genome Biol.  2017; 18:92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Iyer  S., Mir  A., Vega-Badillo  J., Roscoe  B.P., Ibraheim  R., Zhu  L.J., Lee  J., Liu  P., Luk  K., Mintzer  E.  et al.  Efficient homology-directed repair with circular single-stranded DNA donors. CRISPR J.  2022; 5:685–701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Lin-Shiao  E., Pfeifer  W.G., Shy  B.R., Saffari Doost  M., Chen  E., Vykunta  V.S., Hamilton  J.R., Stahl  E.C., Lopez  D.M., Sandoval Espinoza  C.R.  et al.  CRISPR–Cas9-mediated nuclear transport and genomic integration of nanostructured genes in human primary cells. Nucleic Acids Res.  2022; 50:1256–1268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Bassalo  M.C., Garst  A.D., Halweg-Edwards  A.L., Grau  W.C., Domaille  D.W., Mutalik  V.K., Arkin  A.P., Gill  R.T.  Rapid and efficient one-step metabolic pathway integration in E. coli. ACS Synth. Biol.  2016; 5:561–568. [DOI] [PubMed] [Google Scholar]
  • 27. Clackson  T., Hoogenboom  H.R., Griffiths  A.D., Winter  G.  Making antibody fragments using phage display libraries. Nature. 1991; 352:624–628. [DOI] [PubMed] [Google Scholar]
  • 28. Lee  B.Y., Lee  J., Ahn  D.J., Lee  S., Oh  M.K.  Optimizing protein V untranslated region sequence in M13 phage for increased production of single-stranded DNA for origami. Nucleic Acids Res.  2021; 49:6596–6603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Smeal  S.W., Schmitt  M.A., Pereira  R.R., Prasad  A., Fisk  J.D.  Simulation of the M13 life cycle II: investigation of the control mechanisms of M13 infection and establishment of the carrier state. Virology. 2017; 500:275–284. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkae189_Supplemental_File

Data Availability Statement

The data underlying this article are available at doi.org/10.6084/m9.figshare.25301443.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES