Abstract
We have developed a highly sensitive single-molecule clonal amplification method called dual primer emulsion PCR (DPePCR) for next-generation DNA sequencing. The approach is similar in concept to standard emulsion PCR; however, in DPePCR both primers are attached to the beads, therefore following PCR amplification, both strands of the PCR products are attached to the beads. The ends of each strand can be freed for analysis by restriction digestion of the bridged PCR fragments, which allows efficient paired-end sequencing of fragment libraries.
Keywords: next-generation sequencing, emulsion PCR, bridge PCR
The concept of studying single molecules in high throughput using emulsions is not new and has been used for single-molecule evaluation for a number of years (1) before the approach was first applied to next-generation sequencing (2). However, for emulsion technology to be applicable to next-generation sequencing, a method for capturing the contents from the droplets is required. To accomplish this, Dressman el al. (3) described an approach to amplify single DNA molecules onto beads for detection and enumeration of genetic variation. They named their technology BEAMing, for “beads, emulsion, amplification, and magnetics.” This approach has now evolved to emulsion PCR (ePCR). The DNA bound to beads generated from ePCR provides an excellent template for high-throughput sequencing, because PCR can amplify a single molecule of DNA into many clonal molecules per bead (2). Due to that fact, a number of next-generation sequencing approaches utilize emulsions and beads for DNA amplification prior to sequencing (4–8).
An alternative approach to amplify DNA for next-generation sequencing is the bridge amplification strategy, used by Illumina and originally reported by Bing et al. (www.promega.com/geneticidproc/ussymp7proc/0726.html). Bridge amplification is a technology that uses a single aqueous compartment; however, the individual amplicons are constrained by primers bound to a solid phase that are extended and amplified (9). As the name implies, the extension product from one primer forms a bridge to the other primer.
In this manuscript, we describe a novel approach called dual primer emulsion PCR (DPePCR), which combines concepts from both emulsion PCR and bridge amplification for the generation of simple fragment libraries for paired-end next-generation sequencing. The DPePCR strategy can amplify short DNA fragments (less than ~300 bp, including genome fragment and primers) and enables sequencing of both ends of a DNA fragment. This effectively shortens library preparation time, increases the library complexity (2) when compared with the construction of a mate-paired DNA library, and will contribute to the $1,000-genome goal.
Normal ePCR has been extensively applied to next-generation DNA sequencing (10). Most of the next-generation sequencing approaches are restricted to short read lengths (5), and therefore to optimize resequencing of a human genome, mate-paired or paired-end sequencing is important. However, the construction of mate-paired libraries for next-generation sequencing is difficult and time-consuming (10). Fortunately, fragment library construction is quite simple, and the described DPePCR strategy (Figure 1) enables paired-end sequencing of fragment libraries with essentially no modification of existing sequencing approaches.
Figure 1. Dual primer emulsion PCR approach.
(A) The DPePCR library. 70–130-bp DNA fragments were ligated to forward and reverse adaptors FDV2-BceAI-Primer-F/R and RDV2-AcuI-Primer-F/R. Restriction enzymes recognition sites were designed for BceAI and AcuI at the ends of the two primers for digestion of the DPePCR product. (B) Dual primer beads. Equal amounts of FDV2-dual biotin and RDV2-dual biotin were attached to MyOne strepavidin beads. (C) Dual primer beads, diluted DNA fragment, ePCR reagents, and both free primers were mixed with silicone oil phase. The goal was for each microdroplet to contain a single bead, single DNA molecule, and enough PCR reagents for clonal amplification. After 120 cycles, amplified DNA bridges formed. (D) Digestion and sequencing of the DPePCR beads. After the emulsion amplification, the DPePCR beads were digested with BceAI and AcuI. The digestion left 13 bases on the forward (BceAI) strand and 15 bases on the reverse (AcuI) strand of fragment DNA for sequencing. Specific anchor primers were hybridized at the end of the forward and reverse strands (individually), followed by sequencing by ligation.
To perform DPePCR, both forward and reverse primers are attached to 1-µm beads (Figure 1B) (see Supplementary Materials) that are included in a modified ePCR protocol (10). Additionally, since the amplicons are confined to the droplets, the amplification efficiency is increased by including free primers in the aqueous phase (Figure 1C). After 120 PCR cycles, a single DNA fragment in an emulsion drop can be amplified effectively. After amplification, we have found the DNA to be highly stable in the double-stranded state. Denaturing conditions will denature the DNA; however, since both strands are present on the bead, the double-stranded state immediately reforms, which inhibits the ability to sequence the DNA. To overcome this issue, type IIs recognition enzyme sites (i.e., BceAI and AcuI) were placed at the ends of the amplicons being amplified immediately adjacent to the unknown sequence during library construction (Figure 1A). The DPePCR product can then be digested with restriction enzymes (i.e., BceAI and AcuI), and capping adaptors are ligated to the free end of the dsDNA (Figure 1D). This provides the ability to sequence from both strands of the DPePCR using standard sequencing by ligation (SBL) (Figure 1D) (2).
The SBL sequencing strategy for DPePCR beads is identical to sequencing from standard ePCR beads (Figure 1D). The difference is that since there are two paired-end fragments, both ends can be sequenced independently from both the 3′→5′ and 5′→3′ directions using four different anchor primers (Figure 1D).
To validate the formation of strong double-stranded DPePCR product, the beads (following DPePCR) were treated with 0.1 M NaOH (without restriction enzyme digestion). After denaturing, a Cy5-labeled oligo (Cy5-labeled FDV2-PM; see Supplementary Material) was annealed to the bead-bound DNA fragment (Supplementary Material). The results indicated that the Cy5-oligo could not hybridize to the DPePCR product (data not shown), which suggests the formation of double-stranded DPePCR product was in a bridged conformation. Following restriction enzyme digestion, we were able to anneal the anchor primers (Supplementary Material) and successfully sequence the DNA on the beads from both ends of the fragment.
Theoretically, the DNA fragment on each bead arose from a single molecule in an emulsion and should therefore be clonal, which is the critical requirement for the DPePCR. To validate the clonal amplification, we used DPePCR and SBL in the G.007 Polonator (Westborough, MA, USA) to sequence a Streptococcus pyogenes genome fragment library; when sequenced, the clonal amplified beads showed a single color during each SBL cycle (Figure 2, A and B), which indicated the beads were clonal. Additionally, a random sampling of ~106 reads were mapped to the S. pyogenes genome (AE014074) to ensure we were sequencing paired ends. The reads were NNNNNNN from the forward primer site, and NNNNNNN(NN)NNNNNN from the reverse priming site [note that the (NN) bases was not sequenced]. To map to the S. pyogenes genome, the reverse priming site sequence was first mapped (to both strands of the genome) and then the forward priming site sequence was mapped to the opposite strand on the genome. Roughly, 25% of the reads mapped to the genome and the average separation between the paired ends in the genome was found to be ~100 bases (Figure 2C); this is consistent with the size range that selected in the gel purification step of the library preparation (Supplementary Material).
Figure 2. Validation of Polonator DNA sequencing image.
(A)The DPePCR beads were sequenced using sequencing by ligation. Green, red, and blue beads relate to A, T, and C, respectively, in the image shown. Note that images are displayed in RGB for visualization, and the forth base is present but not shown. For example, the sequence from bead 1 is CTT; bead 2 is TAC from the three cycles shown. (B) The tetrahedral plot is used for the base calling. Each dot on the plot corresponds to a single bead at one cycle in the sequencing. The normalized fluorescence intensity in all four channels is plotted in 4-D and displayed here in a 3-D plot. The base is called by the highest intensity value. The quality of each call is defined by the distance from each bead to the centroid of the beads that have been called the same base. Since each of the beads is a clonal amplicon, the data points are clustered into four groups. The top 50% of the data from a single frame is shown in the figure. (C) The histogram shows that the average separation in the genome is ~100 bases, of a random sampling of ~20,000 reads that mapped uniquely to the Streptococcus pyogenes genome (AE014074). Thus, the beads are clonal amplicons of ~100-bp fragments from the S. pyogenes genome.
Additionally, as a final validation for the approach, we measured the library complexity. The complexity of the library is defined as the diversity of DNA molecules that are sequenced. For example, when a sequencing library is prepared and subjected to extensive amplification (PCR or rolling circle amplification) the complexity is reduced. The fragment library was found to have high complexity, with >99% of the reads being unique, thus improving upon the traditional mate-pair library production protocol (2,10).
In conclusion, we have presented DPePCR, an efficient emulsion PCR strategy for paired-end next-generation sequencing. The DPePCR concept will be applicable to any next-generation sequencing platform using emulsion PCR.
Supplementary Material
Acknowledgments
This work was supported by the National Institutes of Health and the National Human Genome Research Initiative (NIH/NHGRI $1,000 Genome grant no.R21HG004350, to J.S.E.). This paper is subject to the NIH Public Access Policy.
Footnotes
Supplementary material for this article is available at www.BioTechniques.com/article/113423.
Competing interests
The authors declare no competing interests.
References
- 1.Tawfik DS, Griffiths AD. Man-made cell-like compartments for molecular evolution. Nat. Biotechnol. 1998;16:652–656. doi: 10.1038/nbt0798-652. [DOI] [PubMed] [Google Scholar]
- 2.Shendure J, Porreca GJ, Reppas NB, Lin X, McCutcheon JP, Rosenbaum AM, Wang MD, Zhang K, et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science. 2005;309:1728–1732. doi: 10.1126/science.1117389. [DOI] [PubMed] [Google Scholar]
- 3.Dressman D, Yan H, Traverso G, Kinzler KW, Vogelstein B. Transforming single DNA molecules into fluorescent magnetic particles for detection and enumeration of genetic variations. Proc. Natl. Acad. Sci. USA. 2003;100:8817–8822. doi: 10.1073/pnas.1133470100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Shendure J, Ji H. Next-generation DNA sequencing. Nat. Biotechnol. 2008;26:1135–1145. doi: 10.1038/nbt1486. [DOI] [PubMed] [Google Scholar]
- 5.Metzker ML. Sequencing technologies - the next generation. Nat. Rev. Genet. 2010;11:31–46. doi: 10.1038/nrg2626. [DOI] [PubMed] [Google Scholar]
- 6.Ansorge WJ. Next-generation DNA sequencing techniques. N. Biotechnol. 2009;25:195–203. doi: 10.1016/j.nbt.2008.12.009. [DOI] [PubMed] [Google Scholar]
- 7.Fullwood MJ, Wei CL, Liu ET, Ruan Y. Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. Genome Res. 2009;19:521–532. doi: 10.1101/gr.074906.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mardis ER. Next-generation DNA sequencing methods. Annu. Rev. Genomics Hum. Genet. 2008;9:387–402. doi: 10.1146/annurev.genom.9.081307.164359. [DOI] [PubMed] [Google Scholar]
- 9.Pemov A, Modi H, Chandler DP, Bavykin S. DNA analysis with multiplex microarray-enhanced PCR. Nucleic Acids Res. 2005;33:e11. doi: 10.1093/nar/gnh184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Porreca GJ, Shendure J, Church GM. Polony DNA sequencing. Curr. Protoc. Mol. Biol. 2006;Chapter 7(Unit 7.8) doi: 10.1002/0471142727.mb0708s76. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


