Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2016 Nov 10;4(6):e01245-16. doi: 10.1128/genomeA.01245-16

Complete Genome Sequence of NEB 5-alpha, a Derivative of Escherichia coli K-12 DH5α

Brian P Anton 1,, Elisabeth A Raleigh 1
PMCID: PMC5105096  PMID: 27834703

Abstract

Escherichia coli K-12 DH5α is one of the most popular and widely available laboratory strains, but, surprisingly, no complete genome sequence has been publicly available. Here, we report the complete, finished sequence of NEB 5-alpha (DH5α fhuA2). It should serve as a useful reference for researchers working with DH5α.

GENOME ANNOUNCEMENT

DH5α, constructed by Douglas Hanahan, is one of the most commonly used Escherichia coli K-12 laboratory strains. It is widely commercially available and has several properties that make it well suited for cloning applications: it can be transformed with high efficiency; yields high-quality plasmid DNA due to the absence of nonspecific endonuclease I (endA1) (1); maintains plasmids stably due to low levels of homologous recombination (recA1); can be transformed efficiently with unmethylated DNA due to the disruption of endonuclease EcoKI (hsdR17); supports blue/white selection due to the alpha-complementable ΔlacZ58(M15) allele; and is deficient in periplasmic alkaline phosphatase (PhoA), making it useful for studying membrane proteins expressed as phoA fusions (2). Finally, it is the only commercially available M.EcoKI+ strain, so DNA passaged through it will efficiently transform classical EcoKI+ E. coli strains.

Several DH5α markers have been characterized previously: deoR, originally thought to be defective and therefore responsible for the high transformability of DH5α (3), is in fact a wild type (4); luxS is defective (5); and the rfbC1 allele is actually a frameshift in rfbD (6). Numerous other features can be deduced from the genome sequence of its ancestor, DH1 (7). A whole-genome shotgun assembly (WGA) of DH5α, based on short reads and comprising 89 contigs, revealed additional variants (6). Although the characteristic deletion Δ(argF-lac)169 was originally determined to be 97 kb in length (8), the WGA study (erroneously, we find) inferred a shorter length of 85 kb, with the cynS-mhpC region not part of the deletion (6).

We have sequenced the genome of NEB 5-alpha (New England Biolabs), an immediate fhuA2 derivative of DH5α, using the Pacific Biosciences RSII platform with P6 chemistry. A 10-kb SMRTbell library was prepared from total DNA using the manufacturer’s instructions, size-selected (4 to 50 kb) using the BluePippin instrument (Sage Science), and sequenced on 1 SMRT cell with a 360-min movie, obtaining 314× mean coverage. The genome was assembled using RS_HGAP_Assembly.3 followed by manual refinement and reassembly using RS_BridgeMapper.

The closed and finished genome of NEB 5-alpha is 4,583,637 bp in length. Our assembly shows the deletion at Δ(argF-lac)169 to be 97,240 bp in length, extending from mmuP to mhpD, which is consistent with the original report (8) and contrary to the WGA study (6). The cynS-mhpC region, including the ΔlacZ58(M15) allele, is present on the 47,357-bp ϕ80d[ΔlacZ58(M15)] insertion at attϕ80 (as in DH10β [9]) but is deleted as part of Δ(argF-lac)169, which we confirmed by PCR. The phoA8 allele is a 723-bp internal, in-frame deletion.

In addition to those genes disrupted in DH1, fhuA, crl, and phoE are disrupted in NEB 5-alpha, and ylbE contains a frameshift. We confirmed all of the nonsynonymous changes identified previously (6), except for abgR and yicJ, which appear to be wild type. We identified four additional nonsynonymous mutations: flgJ (P254S), insH20 (W140stop), msbB (M33I), and ppsA (A50T). We suggest the following genotype: fhuA2::IS2 Δ(mmuP-mhpD)169 ΔphoA8 glnX44 ϕ80d[ΔlacZ58(M15)] rfbD1 gyrA96 luxS11 recA1 endA1 rphWT thiE1 hsdR17.

Accession number(s).

This sequence has been deposited at DDBJ/ENA/GenBank under the accession number CP017100 and is also available at New England Biolabs.

ACKNOWLEDGMENTS

We are grateful to Lixin An and Tamas Vincze for excellent technical assistance and to Don Comb for support.

Footnotes

Citation Anton BP, Raleigh EA. 2016. Complete genome sequence of NEB 5-alpha, a derivative of Escherichia coli K-12 DH5α. Genome Announc 4(6):e01245-16. doi:10.1128/genomeA.01245-16.

REFERENCES

  • 1.Taylor RG, Walker DC, Mclnnes RR. 1993. E. coli host strains significantly affect the quality of small scale plasmid DNA preparations used for sequencing. Nucleic Acids Res 21:1677–1678. doi: 10.1093/nar/21.7.1677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Rodriguez-Quinones F, Benedi VJ. 1993. Escherichia coli strain DH5α is a suitable host for the study of phoA insertions. Focus 15:110–112. [Google Scholar]
  • 3.Hanahan D, Jessee J, Bloom FR. 1991. Plasmid transformation of Escherichia coli and other bacteria. Methods Enzymol 204:63–113. [DOI] [PubMed] [Google Scholar]
  • 4.Xia XX, Qian ZG, Lee SY. 2011. Comparative proteomic and genetic analyses reveal unidentified mutations in Escherichia coli XL1-blue and DH5α. FEMS Microbiol Lett 314:119–124. doi: 10.1111/j.1574-6968.2010.02157.x. [DOI] [PubMed] [Google Scholar]
  • 5.Surette MG, Miller MB, Bassler BL. 1999. Quorum sensing in Escherichia coli, Salmonella typhimurium, and Vibrio harveyi: a new family of genes responsible for autoinducer production. Proc Natl Acad Sci U S A 96:1639–1644. doi: 10.1073/pnas.96.4.1639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Song Y, Lee BR, Cho S, Cho YB, Kim SW, Kang TJ, Kim SC, Cho BK. 2015. Determination of single nucleotide variants in Escherichia coli DH5α by using short-read sequencing. FEMS Microbiol Lett 362:fnv073. doi: 10.1093/femsle/fnv073. [DOI] [PubMed] [Google Scholar]
  • 7.Suzuki S, Ono N, Furusawa C, Ying BW, Yomo T. 2011. Comparison of sequence reads obtained from three next-generation sequencing platforms. PLoS One 6:e19534. doi: 10.1371/journal.pone.0019534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Peters JE, Thate TE, Craig NL. 2003. Definition of the Escherichia coli MC4100 genome by use of a DNA array. J Bacteriol 185:2017–2021. doi: 10.1128/JB.185.6.2017-2021.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Durfee T, Nelson R, Baldwin S, Plunkett G III, Burland V, Mau B, Petrosino JF, Qin X, Muzny DM, Ayele M, Gibbs RA, Csörgo B, Pósfai G, Weinstock GM, Blattner FR. 2008. The complete genome sequence of Escherichia coli DH10B: insights into the biology of a laboratory workhorse. J Bacteriol 190:2597–2606. doi: 10.1128/JB.01695-07. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES