ABSTRACT
This paper describes the complete genome sequences and methylome analysis of six psychrotrophic strains isolated from perennially ice-covered Lake Untersee in Antarctica.
GENOME ANNOUNCEMENT
In this study, we performed a complete genome and methylome analysis of four bacterial isolates from concentrated water samples taken from a depth of 40 to 70 m in the aerobic zone of perennially ice-covered Lake Untersee (East Antarctica, 71°20′S 13°30′E) (1). The heterotrophic aerobic microbial cultures were isolated on LB agar plates under growth conditions of 8°C and 24°C for 1 to 2 weeks. DNA from the four isolated cultures, U13-I, U41, U17-1, and U14-5, were purified using a modified Qiagen method, and their genomes were sequenced using the Pacific Biosciences (PacBio) RSII sequencing platform. Briefly, SMRTbell libraries were constructed from a genomic DNA sample sheared to an average size of ~10 to 20 kb using the G-tubes protocol (Covaris, Woburn, MA, USA), end repaired, and ligated to hairpin adapters. Incompletely formed SMRTbell templates were digested with a combination of exonuclease III and exonuclease VII (New England BioLabs, Ipswich, MA, USA). Genomic DNA fragments and SMRTbell libraries qualification and quantification were performed using the Qubit fluorimeter (Invitrogen, Eugene, OR) and 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA). Four 20-kb SMRTbell libraries were prepared according to the modified PacBio sample preparation protocols, including additional separation on a BluePippin, and sequenced using C4 chemistry on 16 single-molecule real-time (SMRT) cells with a 180- to 240-min collection time. Sequencing reads were processed, mapped, and assembled by the Pacific Biosciences SMRT Analysis pipeline using the HGAP3 protocol and polished using Quiver (2) to yield six fully closed genomes, including nine plasmids (Table 1). The assembled sequences were annotated with Rapid Annotations using Subsystems Technology (RAST) (3) and the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) and have been deposited at DDBJ/EMBL/GenBank.
TABLE 1.
Isolate | Species | Genome size (bp) | Accession no. | Motif in genomea | Methylation type(s) | Assigned methylase(s) |
---|---|---|---|---|---|---|
U13-1 | Exiguobacterium sp. | 3,208,635 | CP015731 | CGGCSBYRb | m5C type II | NDc |
CGCCGVNYb | m5C type II | ND | ||||
U17-1 | Bacillus safensis | 3,741,806 | CP015611 | ND | ND | ND |
U14-5 | B. safensis | 4,070,483 | CP015607 | CCWGG | m5C type II | M.Bsa145I |
58,536 | CP015608 | |||||
22,301 | CP015609 | |||||
Roseomonas gilardi | 4,328,147 | CP015583 | GAACN7TCGC, GANTC | m6A type I, m6A type II | S.Rgi145I, M.Rgi145II | |
702,334 | CP015584 | |||||
265,182 | CP015585 | |||||
64,339 | CP015586 | |||||
52,461 | CP015587 | |||||
U41 | B. safensis | 3,735,180 | CP015610 | ND | ND | ND |
Arthrobacter sp. | 4,386,370 | CP015732 | TTAA | m6A type II | M.AspU41I | |
178,576 | CP015733 | GGCAN6TGA | m6A type I | S.AspU41III | ||
166,446 | CP015734 | GGATCC | m6A type II | M.AspU41ORF21335P | ||
61,247 | CP015735 | CCWGG | m5C type I | M.AspU41II |
Modified bases are highlighted in bold.
The m5C motif cannot be deduced unambiguously by the PacBio software. The motifs are unlikely to be correct as called.
ND, no motif detected, although the same m5C methylase gene is present in both strains.
The advantage of the PacBio sequencing platform is its ability to detect the epigenetic state of sequenced DNA, which allows for the identification of modified nucleotides and their corresponding motifs. Epigenetic modification at each nucleotide position was measured as kinetic variations (KVs) in the nucleotide incorporation rates, and methylated motifs were deduced from the KV data (4–6). Nine DNA methyltransferase recognition motifs were detected corresponding to N6-methyladenine (m6A) and probable 5-methylcytosine (m5C) modifications by direct SMRT sequencing. Two isolates, U14-5 and U41, each represented two organisms with different modification patterns. This allowed us to assign plasmids and chromosomes to the appropriate host based on their modification profiles. The motifs were then matched with methyltransferase genes, and the results are shown in Table 1. They have also been deposited in REBASE (7).
Accession number(s).
Sequences have been deposited in GenBank under the accession numbers listed in Table 1.
ACKNOWLEDGMENTS
This project was supported by the Small Business Innovation Research Program (NIGMS) of the National Institutes of Health under award R44GM105125 to R.J.R.
R.J.R., A.F., and T.V. work for New England BioLabs, a company that sells research reagents, including restriction enzymes and DNA methylases to the scientific community. Samples were collected in Antarctica with support from the Tawani Foundation of Chicago, the Trottier Family Foundation, NASA’s Exobiology and Astrobiology Programs, and the Arctic and Antarctic Research Institute/Russian Antarctic Expedition’s Subprogram “Study and Research of the Antarctic.”
Footnotes
Citation Fomenkov A, Akimov VN, Vasilyeva LV, Andersen DT, Vincze T, Roberts RJ. 2017. Complete genome and methylome analysis of psychrotrophic bacterial isolates from Lake Untersee in Antarctica. Genome Announc 5:e01753-16. https://doi.org/10.1128/genomeA.01753-16.
REFERENCES
- 1.Andersen DT, Sumner DY, Hawes I, Webster-Brown J, McKay CP. 2011. Discovery of large conical stromatolites in Lake Untersee, Antarctica. Geobiology 9:280–293. doi: 10.1111/j.1472-4669.2011.00279.x. [DOI] [PubMed] [Google Scholar]
- 2.Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
- 3.Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Flusberg BA, Webster DR, Lee JH, Travers KJ, Olivares EC, Clark TA, Korlach J, Turner SW. 2010. Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods 7:461–465. doi: 10.1038/nmeth.1459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Clark TA, Murray IA, Morgan RD, Kislyuk AO, Spittle KE, Boitano M, Fomenkov A, Roberts RJ, Korlach J. 2012. Characterization of DNA methyltransferase specificities using single-molecule, real-time DNA sequencing. Nucleic Acids Res 40:e29. doi: 10.1093/nar/gkr1146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Korlach J, Turner SW. 2012. Going beyond five bases in DNA sequencing. Curr Opin Struct Biol 22:251–261. doi: 10.1016/j.sbi.2012.04.002. [DOI] [PubMed] [Google Scholar]
- 7.Roberts RJ, Vincze T, Posfai J, Macelis D. 2015. Rebase–a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res 43:D298–D299. doi: 10.1093/nar/gku1046. [DOI] [PMC free article] [PubMed] [Google Scholar]