Abstract
Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.
Keywords: Thraustochytrid, Aurantiochytrium, Whole genome sequencing, Docosahexaenoic acid
Specifications
Organism/cell line/tissue | Aurantiochytrium sp. T66 |
Sex | N/A |
Sequencer or array type | Illumina HiSeq 2 × 100 bp paired-end and Roche 454 FLX ++ |
Data format | Analyzed |
Experimental factors | DNA extracted from a pure strain, no treatment |
Experimental features | Draft genome sequencing |
Consent | N/A |
Sample source location | The coast of Madeira, Portugal |
1. Direct link to deposited data
2. Introduction
Eicosapentaenoic acid (EPA; 20:5n − 3), docosapentaenoic acid (DPA; 22:5n − 3), and docosahexaenoic acid (DHA; 22:6n − 3) are the main long chain polyunsaturated omega-3 fatty acids (ω-3 LC-PUFA). Over the past decades, the importance of ω-3 LC-PUFA in many aspects of human health, including brain and neural development, cardiovascular function and immune system regulation has been uncovered [1], [2], [3], [4]. At present, marine fish and fish oils are the main sources of EPA and DHA. However, the need for new sustainable ω-3 LC-PUFA sources has attracted increasing attention in recent years [5], [6].
Aurantiochytrium spp. belong to the thraustochytrids, which are unicellular, heterotrophic, marine protists, abundant in marine water, and able to grow on various carbon sources [7]. Recent studies have demonstrated that Aurantiochytrium spp. can be cultivated to high cell densities and produce biomass with up to 70% lipids of the dry cell weight, and up to 70% of the lipids may be DHA [8]. In a previous study of Aurantiochytrium sp. strain T66 (ATCC PRA-276), we demonstrated that its lipid content and fatty acid profile can be manipulated by changing the growth conditions [9]. The genetic, regulatory, and biochemical basis of DHA biosynthesis in thraustochytrids are largely unknown due to the scarcity of genome sequences. The genome sequences of the two thraustochytrids Schizochytrium sp. CCTCC M209059 [10] and Quahog Parasite Unknown (QPX) [11] as well as transcriptomes of QPX [12] and Aurantiochytrium sp. SD116 [13] were recently published. Here, we report the draft genome sequence of Aurantiochytrium sp. strain T66.
3. DNA extraction, library construction and sequencing
Aurantiochytrium sp. strain T66 was isolated from a mixture of marine sediment and seawater sampled from the coast of Madeira, Portugal [14]. Total genomic DNA was isolated with the Blood & Cell culture DNA kits (Qiagen, Hilden, Germany). DNA quality was assessed by gel electrophoresis, and the purity and quantity were determined by the NanoDrop 1000 UV–Vis spectrophotometer (Thermo Scientific) and Qubit 2.0 fluorometer using the Qubit® dsDNA BR Assay Kit (ThermoFisher Scientific). The genome of Aurantiochytrium sp. T66 was sequenced using the Illumina HiSeq 2 × 100 bp paired-end and Roche 454 FLX ++ platforms at Eurofins Genomics GmbH, Ebersberg, Germany. Three libraries of Illumina HiSeq were prepared (shotgun, 8 kbp and 20 kbp long jumping distance libraries). Following quality clipping and adapter trimming by Trimmomatic version 0.30 [15], genome sequence assembly was achieved by the combination of Velvet version 1.2.10 [16], Newbler v2.9 (454 Life Sciences), and Convey GraphConstructor (Cnykmer, Cnygc v2.2.5526, http://www.conveycomputer.com/). Gene prediction was done by a combination of homology-based approaches and de novo predictions [17], [18], [19], [20], [21].
4. Data analysis and results
The cleaned data output of Illumina HiSeq were 4.8 Gbp, 5.0 Gbp, and 3.6 Gbp, which represent estimated genome coverages of 102-fold, 115-fold and 82-fold, respectively. Roche 454 FLX ++ sequencing resulted in 179 Mbp of cleaned data corresponding to an estimated genome coverage of 4.1-fold. The draft genome of Aurantiochytrium sp. T66 is 43 Mbp, with a G + C content of 62.8%, distributed on 495 large scaffolds (≥ 1000 bp) with an N50 length of 1,342,793 bp, L50 count of 3, N75 length of 594,063 bp, and a N90 length of 115,579 bp. A total of 11,683 putative protein-coding genes, 112 tRNA genes, 20 rRNA genes and 4 snRNA genes were predicted. Repetitive regions were estimated to comprise 7.1% of the genome. The Aurantiochytrium sp. T66 draft genome sequence generated in this study represents a new source of knowledge which can be used as a reference to study thraustochytrids and it will help to further understand the genetic mechanisms of DHA biosynthesis and regulation. It will also be valuable in comparative genomic studies of other Aurantiochytrium sp. strains as well as for metabolic engineering of thraustochytrids.
5. Nucleotide accession number
This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession LNGJ00000000.
Conflict of interest
The authors declare that there is no conflict of interests on the work published in this paper.
Acknowledgments
This work was supported by a grant from the Research Council of Norway.
References
- 1.Li D., Hu X. Fish and its multiple human health effects in times of threat to sustainability and affordability: are there alternatives? Asia Pac. J. Clin. Nutr. 2009;18:553–563. [PubMed] [Google Scholar]
- 2.Birch E.E., Carlson S.E., Hoffman D.R., Fitzgerald-Gustafson K.M., Fu V.L., Drover J.R., Castaneda Y.S., Minns L., Wheaton D.K., Mundy D., Marunycz J., Diersen-Schade D.A. The DIAMOND (DHA Intake And Measurement Of Neural Development) Study: a double-masked, randomized controlled clinical trial of the maturation of infant visual acuity as a function of the dietary level of docosahexaenoic acid. Am. J. Clin. Nutr. 2010;91:848–859. doi: 10.3945/ajcn.2009.28557. [DOI] [PubMed] [Google Scholar]
- 3.Cottin S.C., Sanders T.A., Hall W.L. The differential effects of EPA and DHA on cardiovascular risk factors. Proc. Nutr. Soc. 2011;70:215–231. doi: 10.1017/S0029665111000061. [DOI] [PubMed] [Google Scholar]
- 4.Robinson L.E., Buchholz A.C., Mazurak V.C. Inflammation, obesity, and fatty acid metabolism: influence of n − 3 polyunsaturated fatty acids on factors contributing to metabolic syndrome. Appl. Physiol. Nutr. Metab. 2007;32:1008–1024. doi: 10.1139/H07-087. [DOI] [PubMed] [Google Scholar]
- 5.Martins D.A., Custodio L., Barreira L., Pereira H., Ben-Hamadou R., Varela J., Abu-Salah K.M. Alternative sources of n − 3 long-chain polyunsaturated fatty acids in marine microalgae. Mar. Drugs. 2013;11:2259–2281. doi: 10.3390/md11072259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Salem N.J., Eggersdorfer M. Is the world supply of omega-3 fatty acids adequate for optimal human nutrition? Curr. Opin. Clin. Nutr. Metab. Care. 2015;18:147–154. doi: 10.1097/MCO.0000000000000145. [DOI] [PubMed] [Google Scholar]
- 7.Yu X.J., Yu Z.Q., Liu Y.L., Sun J., Zheng J.Y., Wang Z. Utilization of high-fructose corn syrup for biomass production containing high levels of docosahexaenoic acid by a newly isolated Aurantiochytrium sp. YLH70. Appl. Biochem. Biotechnol. 2015;177:1229–1240. doi: 10.1007/s12010-015-1809-6. [DOI] [PubMed] [Google Scholar]
- 8.Aasen I.M., Ertesvåg H., Heggeset T.M., Liu B., Brautaset T., Vadstein O., Ellingsen T.E. Thraustochytrids as production organisms for docosahexaenoic acid (DHA), squalene, and carotenoids. Appl. Microbiol. Biotechnol. 2016;100:4309–4321. doi: 10.1007/s00253-016-7498-4. [DOI] [PubMed] [Google Scholar]
- 9.Jakobsen A.N., Aasen I.M., Josefsen K.D., Strøm A.R. Accumulation of docosahexaenoic acid-rich lipid in thraustochytrid Aurantiochytrium sp. strain T66: effects of N and P starvation and O2 limitation. Appl. Microbiol. Biotechnol. 2008;80:297–306. doi: 10.1007/s00253-008-1537-8. [DOI] [PubMed] [Google Scholar]
- 10.Ji X.J., Mo K.Q., Ren L.J., Li G.L., Huang J.Z., Huang H. Genome sequence of Schizochytrium sp. CCTCC M209059, an effective producer of docosahexaenoic acid-rich lipids. Genome Announc. 2015;3 doi: 10.1128/genomeA.00819-15. (e00819–00815) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Garcia-Vedrenne A.E., Groner M., Page-Karjian A., Siegmund G.F., Singhal S., Sziklay J., Roberts S. Development of genomic resources for a thraustochytrid pathogen and investigation of temperature influences on gene expression. PLoS One. 2013;8 doi: 10.1371/journal.pone.0074196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rubin E., Tanguy A., Perrigault M., Pales Espinosa E., Allam B. Characterization of the transcriptome and temperature-induced differential gene expression in QPX, the thraustochytrid parasite of hard clams. BMC Genomics. 2014;15:245. doi: 10.1186/1471-2164-15-245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ma Z., Tan Y., Cui G., Feng Y., Cui Q., Song X. Transcriptome and gene expression analysis of DHA producer Aurantiochytrium under low temperature conditions. Sci. Rep. 2015;5:14446. doi: 10.1038/srep14446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jakobsen A.N., Aasen I.M., Strøm A.R. Endogenously synthesized (−)-proto-quercitol and glycine betaine are principal compatible solutes of Schizochytrium sp. strain S8 (ATCC 20889) and three new isolates of phylogenetically related thraustochytrids. Appl. Environ. Microbiol. 2007;73:5848–5856. doi: 10.1128/AEM.00610-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zerbino D.R., Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Altschul S.F., Gish W., Miller W., Myers E.W., Lipman D.J. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 18.Birney E., Clamp M., Durbin R. GeneWise and Genomewise. Genome Res. 2004;14:988–995. doi: 10.1101/gr.1865504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Stanke M., Keller O., Gunduz I., Hayes A., Waack S., Morgenstern B. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 2006;34:W435–W439. doi: 10.1093/nar/gkl200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Korf I. Gene finding in novel genomes. BMC Bioinf. 2004;5:59. doi: 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Majoros W.H., Pertea M., Salzberg S.L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics. 2004;20:2878–2879. doi: 10.1093/bioinformatics/bth315. [DOI] [PubMed] [Google Scholar]