ABSTRACT
Basidiomycetous yeast Mrakia gelida MGH-2 has been reported from Surikogi Ike in the Skarvsnes ice-free area, East Antarctica. Here, we report on the high-quality genome sequence of the Mrakia gelida MGH-2 strain analyzed by PacBio Sequel and HiSeq 2500 instruments.
ANNOUNCEMENT
Basidiomycetous yeast Mrakia gelida MGH-2 was isolated from lake sediment of Surikogi Ike in the Skarvsnes ice-free area, East Antarctica, by the 48th Japanese Antarctic Research Expedition (JARE-48) (1). This strain has high growth and alcohol fermentation abilities even at 4°C (2).
The MGH-2 strain culture and DNA extraction methods are the same as those we reported previously (3, 4). Briefly, yeast cells were cultured in yeast extract-peptone-dextrose (YPD) liquid medium (Difco) at 15°C for 5 days. The cells from cultures were collected by centrifugation at 3,500 × g for 5 min at 4°C. The cell pellets were washed with sterile distilled water and were precipitated by centrifugation again. The cell pellets were dried by a vacuum freeze dryer. Then, the lyophilized cells were powdered in a mortar and used for DNA extraction. The genomic DNA of MGH-2 was extracted and purified using a NucleoBond AXG100 column with a Nucleo buffer set III (TaKaRa Bio) according to the manufacturer’s protocol. Concentration and purification of the genomic DNA were determined by NanoDrop (Thermo Scientific) and the Qubit double-stranded DNA (dsDNA) broad-range (BR) assay kits (Thermo Scientific). Then, the genomic DNA was divided into two parts, namely, one for the PacBio Sequel platform and the other for the HiSeq 2500 platform (Illumina, CA) for use in sequencing. For the PacBio Sequel platform, a single-molecule real-time (SMRT) sequence library was constructed using a SMRTbell express template prep kit v1.0 (Pacific Bioscience, CA) according to the manufacturer’s protocol. Two sequencing libraries were size selected using the BluePippin system (Saga Science, MA) with a minimum fragment length cutoff of 20 kbp and 30 kbp. Each library was sequenced on one SMRT cell with binding kit 1.0/sequencing kit 1.2.1 and binding kit 2.0/sequencing kit 2.0, respectively. The run time for the movies was 360 min and 600 min, yielding a total of 1,133,939 polymerase reads. In this study, default parameters were used except where otherwise noted. In the PacBio Sequel sequencing, read quality control was performed prior to assembly by measuring the average length and concordance of spike-in controls and the distribution of read lengths in the sample. PacBio reads were assembled using the FALCON-Unzip assembler program v0.7 (5) and were used to polish the assembly sequence with SMRT Link v4.0.0.190159 (Application: Resequencing). For Illumina HiSeq 2500 sequencing, genomic DNA was fragmented to an average size of 600 bp with the DNA shearing system M220 (Covaris Inc., MA). A paired-end library was constructed with a TruSeq DNA PCR-free library prep kit (Illumina) and was size selected on an agarose gel using a Zymoclean large fragment DNA recovery kit (Zymo Research, CA). The final paired-end library was sequenced on the Illumina HiSeq 2500 sequencer with a read length of 250 bp. In the HiSeq sequencing, read quality control was performed prior to assembly by the average quality value (QV) of the reads during each cycle. Illumina HiSeq reads were mapped against the assembled PacBio contigs with BWA-MEM, and single nucleotide polymorphisms (SNPs) and indels were identified and corrected using Pilon v1.22 (6).
A summary of the genomic analysis results of the M. gelida MGH-2 is shown in Table 1.
TABLE 1.
Assembly summary for genomic data of Mrakia gelida MGH-2
| Characteristic | Data |
|---|---|
| Sampling site | Surikogi Ike, Skarvsnes ice-free area |
| No. of scaffolds | 25 |
| No. of reads by HiSeq 2500 instrument | 6,344,609,500 |
| Coverage (×) by HiSeq 2500 instrument | 175.9 |
| No. of reads by PacBio Sequel instrument | 8,042,485,732 |
| Coverage (×) by PacBio Sequel instrument | 230.5 |
| Total length (Mb) | 34.9 |
| GC content (%) | 55.7 |
| Scaffold N50 value (kb) | 1,793 |
| Avg length (kb) | 1,395 |
| Gen Bank accession no. | BRON00000000.1 |
| DRA accession no. | DRA014407 |
Antarctica is one of the last frontiers for bioprospecting, along with deep sea and space. We believe that genomic information on M. gelida MGH-2 will be useful in the search for new bioactive substances and new pharmaceutical raw materials.
Data availability.
This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under BioProject no. PRJDB13796 with the accession no. BRON00000000.1. The raw reads are available under the DRA accession no. DRA014407.
ACKNOWLEDGMENTS
This work was supported by the National Institute of Polar Research (NIPR) Research Project (KP-309), General Collaboration Project no. 2-29, and “Strategic Research Projects” grant from Research Organization of Information and Systems (ROIS). This work was also supported by a JSPS Grant-in-Aid for Young Scientists (A) no. 16H06211, JSPS KAKENHI grant number 16H06279 (PAGS), and Institution for Fermentation, Osaka for General Research grant no. G-2022-1-007.
Contributor Information
Masaharu Tsuji, Email: spindletuber@gmail.com.
Antonis Rokas, Vanderbilt University.
REFERENCES
- 1.Tsuji M, Fujiu S, Xiao N, Hanada Y, Kudoh S, Kondo H, Tsuda S, Hoshino T. 2013. Cold adaptation of fungi obtained from soil and lake sediment in the Skarvsnes ice-free area, Antarctica. FEMS Microbiol Lett 346:121–130. doi: 10.1111/1574-6968.12217. [DOI] [PubMed] [Google Scholar]
- 2.Tsuji M, Kudoh S, Hoshino T. 2016. Ethanol productivity of cryophilic basidiomycetous yeast Mrakia spp. correlates with ethanol tolerance. Mycoscience 57:42–50. doi: 10.1016/j.myc.2015.08.002. [DOI] [Google Scholar]
- 3.Tsuji M, Ishihara J, Gotoh Y, Hayashi T, Takahashi H. 2022. Draft genome sequences of five Cystobasidium ongulense strains isolated from areas near Syowa Station, East Antarctica. Microbiol Resour Announc 11:e00224-22. doi: 10.1128/mra.00224-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tsuji M, Ishihara J, Toyoda A, Takahashi H. 2022. High-quality genome sequence of Cystobasidium tubakii JCM 31526T isolated from East Ongul Island, Antarctica. Microbiol Resour Announc 11:e0074122. doi: 10.1128/mra.00741-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chin CS, Peluso P, Sedlazeck FJ, Nattestad M, Concepcion GT, Clum A, Dunn C, O'Malley R, Figueroa-Balderas R, Morales-Cruz A, Cramer GR, Delledonne M, Luo C, Ecker JR, Cantu D, Rank DR, Schatz MC. 2016. Phased diploid genome assembly with single-molecule real-time sequencing. Nat Methods 13:1050–1054. doi: 10.1038/nmeth.4035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under BioProject no. PRJDB13796 with the accession no. BRON00000000.1. The raw reads are available under the DRA accession no. DRA014407.
