Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2022 Nov 30;12(1):e00566-22. doi: 10.1128/mra.00566-22

Draft Genome Sequence of the Sophorolipid-Producing Yeast Pseudohyphozyma bogoriensis ATCC 18809

Joseph Msanne a,, Jonathan Shao b, Richard Ashby a, Philip Campos b,c, Yanhong Liu a, Daniel Solaiman a
Editor: Jason E Stajichd
PMCID: PMC9872583  PMID: 36448832

ABSTRACT

Pseudohyphozyma bogoriensis is gaining attention as a microbial source of high-value sophorolipids. We report here on its genomic sequence, which will improve our understanding of its metabolic pathways and allow the development of genome manipulation systems. PacBio sequencing was performed, yielding a 26-Mbp genome with 57% GC content and encoding 7,847 predicted proteins.

ANNOUNCEMENT

Pseudohyphozyma bogoriensis, with the homotypic synonyms Rhodotorula bogoriensis and Vanrijia bogoriensis (1, 2) and the basionym Candida bogoriensis (3), is an aerobic yeast in the family Chrysozymaceae, subphylum Pucciniomycotina, and phylum Basidiomycota. P. bogoriensis produces a structurally unique sophorolipid composed of 13-hydroxydocosanoic acid (48), with increasing industrial applications (5, 9). These molecules have antimicrobial properties and have been suggested as anti-inflammation and anticancer compounds (1014). Achieving substantial improvements through genetic manipulations necessitates an understanding of the gene content. To define the yeast metabolic capabilities, a draft genome sequence was generated.

P. bogoriensis ATCC 18809, acquired from the American Type Culture Collection, was isolated from the surface of a Randia malleifera leaf (15). The culture was maintained on yeast and malt (0.3% yeast extract, 0.3% malt extract, 0.5% peptone, and 1% glucose) agar plates; individual colonies were inoculated into liquid medium and then incubated for 48 h at 25°C on a rotary shaker at 225 rpm. Prior to genomic DNA (gDNA) extraction, the freshly harvested cells were subjected to enzymatic digestion with Zymolyase (Zymo; E1005-A), according to the manufacturer’s instructions. DNA was extracted using the YeaStar kit (Zymo; D2002), and the quality and quantity were initially assessed using the DS-11 spectrophotometer (DeNovix Inc., Wilmington, DE).

The genome was sequenced and assembled using the Pacific Biosciences Sequel System (PacBio, Menlo Park, CA) at the Genomics Core Facility of Drexel University (Philadelphia, PA). The PacBio 10-kb library preparation protocol was applied without Covaris g-TUBE shearing. Size selection of the gDNA was performed using the Blue Pippin kit (Sage Science, Beverly, MA), followed by DNA damage repair using PacBio reagents. The gDNA was then converted into a SMRTbell library using the SMRTbell template prep kit version 1.0-SPv3 Sequel, according to the manufacturer’s instructions. For quality assessment, the size distribution of the library was determined using an Agilent Bioanalyzer 12000 chip (Agilent Technologies Inc., Santa Clara, CA). The size-selected library was sequenced on the PacBio Sequel 1 platform using single-molecule real-time (SMRT) cell technology (4 cells). Read quality control was performed using the SMRTLink version 5.1.0 workflow. There were 2,383,328 subread sequences, with 180× genome coverage, and the N50 read value was 6,803 bp. The postfiltered reads were used for genome assembly using Hierarchical Genome Assembly (HGAP) version 4 software (PacBio). Default parameters were used for all tools except where otherwise noted. The final assembly resulted in 304 contigs, with an overall GC content of 57.01%. The average contig N50 read value was 159,754 bp, and the maximum size was 672,149 bp. The total assembly length was 26,004,456 bp, and the mitochondrial genome identified from the assembly totaled 145,115 bp. The yeast genome contained 7,847 predicted protein-encoding genes, annotated using Augustus version 3.3.1 (16) with Cryptococcus neoformans as the model in Funannotate version 1.5.3 (17). The mitochondrial genome had 37 protein-encoding genes, annotated using FGENESH with C. neoformans as the model and mold/mitochondrial genetic code (18). A total of 176 tRNA genes (108 nuclear and 68 mitochondrial) were identified using tRNAscan version 2.0 (19).

P. bogoriensis is a promising platform for commercial biosurfactant production. Recently, a sophorolipid gene cluster was reported for Starmerella bombicola (7, 9). We report here on the assembly and annotation of P. bogoriensis genomes, which should provide functional genomics tools for efficient genome manipulation.

Data availability.

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number JALBUQ000000000. The version described in this paper is version JALBUQ010000000; the BioSample accession number is SAMN26542470, and the BioProject accession number is PRJNA814273. The raw data can be found in the Sequence Read Archive under the accession number SRR20187078.

ACKNOWLEDGMENTS

This research was supported by Agricultural Research Service (ARS)-appropriated funds, through the U.S. Department of Agriculture (USDA).

Mention of trade names or commercial products is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture (USDA).

Contributor Information

Joseph Msanne, Email: Joseph.Msanne@usda.gov.

Jason E. Stajich, University of California, Riverside

REFERENCES

  • 1.Arx JA, Weijman ACM. 1979. Conidation and carbohydrate composition in some Candida and Torulopsis species. Antonie Van Leeuwenhoek 45:547–555. doi: 10.1007/BF00403654. [DOI] [PubMed] [Google Scholar]
  • 2.Wang QM, Yurkov AM, Goker M, Lumbsch HT, Leavitt SD, Groenewald M, Theelen B, Liu X-Z, Boekhout T, Bai F-Y. 2015. Phylogenetic classification of yeasts and related taxa within Pucciniomycotina. Stud Mycol 81:149–189. doi: 10.1016/j.simyco.2015.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Deinema MH. 1961. Intra- and extra-cellular lipid production by yeasts. Meded Landbouwhogesch Wageningen 61:1–54. [Google Scholar]
  • 4.Kitamoto D, Isoda H, Nakahara T. 2002. Functions and potential applications of glycolipid biosurfactants—from energy-saving materials to gene delivery carriers. J Biosci Bioeng 94:187–201. doi: 10.1016/S1389-1723(02)80149-9. [DOI] [PubMed] [Google Scholar]
  • 5.Ribeiro IA, Bronze MR, Castro MF, Ribeiro MHL. 2012. Design of selective production of sophorolipids by Rhodotorula bogoriensis through nutritional requirements. J Mol Recognit 25:630–640. doi: 10.1002/jmr.2188. [DOI] [PubMed] [Google Scholar]
  • 6.Sen S, Borah SN, Bora A, Deka S. 2017. Production, characterization, and antifungal activity of a biosurfactant produced by Rhodotorula babjevae YS3. Microb Cell Fact 16:95. doi: 10.1186/s12934-017-0711-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Geys R, De Graeve M, Lodens S, Van Malderen J, Lemmens C, De Smet M, Mincke S, Van Bogaert INA, Stevens C, De Maeseneire SL, Roelants SLKW, Soetaert WKG. 2018. Increasing uniformity of biosurfactant production in Starmerella bombicola via the expression of chimeric cytochrome P450s. Colloids Interfaces 2:42. doi: 10.3390/colloids2040042. [DOI] [Google Scholar]
  • 8.Leyton A, Araya M, Tala F, Flores L, Lienqueo ME, Shene C. 2021. Macrocystis pyrifera extract residual as nutrient source for the production of sophorolipids compounds by marine yeast Rhodotorula rubra. Molecules 26:2355. doi: 10.3390/molecules26082355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Van Bogaert INA, Holvoet K, Roelants SLKW, Li B, Lin Y-C, Van de Peer Y, Soetaert W. 2013. The biosynthetic gene cluster for sophorolipids: a biotechnological interesting biosurfactant produced by Starmerella bombicola. Mol Microbiol 88:501–509. doi: 10.1111/mmi.12200. [DOI] [PubMed] [Google Scholar]
  • 10.Shah V, Doncel GF, Seyoum T, Eaton KM, Zalenskaya I, Hagver R, Azim A, Gross R. 2005. Sophorolipids, microbial glycolipids with anti-human immunodeficiency virus and sperm-immobilizing activities. Antimicrob Agents Chemother 49:4093–4100. doi: 10.1128/AAC.49.10.4093-4100.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Shah V, Jurjevic M, Badia D. 2007. Utilization of restaurant waste oil as a precursor for sophorolipid production. Biotechnol Prog 23:512–515. doi: 10.1021/bp0602909. [DOI] [PubMed] [Google Scholar]
  • 12.Chen J, Song X, Zhang H, Qu Y-B, Miao J-Y. 2006. Sophorolipid produced from the new yeast strain Wickerhamiella domercqiae induces apoptosis in H7402 human liver cancer cells. Appl Microbiol Biotechnol 72:52–59. doi: 10.1007/s00253-005-0243-z. [DOI] [PubMed] [Google Scholar]
  • 13.Van Bogaert INA, Saerens K, De Muynck C, Develter D, Soetaer W, Vandamme EJ. 2007. Microbial production and application of sophorolipids. Appl Microbiol Biotechnol 76:23–34. doi: 10.1007/s00253-007-0988-7. [DOI] [PubMed] [Google Scholar]
  • 14.Solaiman DKY, Ashby RD, Nuñez A, Crocker N. 2020. Low-temperature crystallization for separating monoacetylated long-chain sophorolipids: characterization of their surface-active and antimicrobial properties. J Surfactants Deterg 23:553–563. doi: 10.1002/jsde.12396. [DOI] [Google Scholar]
  • 15.Garay LA, Sitepu IR, Cajka T, Chandra I, Shi S, Lin T, German JB, Fiehn O, Boundy-Mills KL. 2016. Eighteen new oleaginous yeast species. J Ind Microbiol Biotechnol 43:887–900. doi: 10.1007/s10295-016-1765-3. [DOI] [PubMed] [Google Scholar]
  • 16.Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B. 2006. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res 34:W435–W439. doi: 10.1093/nar/gkl200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Palmer JM, Stajich J. 2019. nextgenusfs/funannotate: funannotate v1.5.3. Zenodo doi: 10.5281/zenodo.2604804. [DOI] [Google Scholar]
  • 18.Solovyev V, Kosarev P, Seledsov I, Vorobyev D. 2006. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol 7:S10.1–S10.12. doi: 10.1186/gb-2006-7-s1-s10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chan PP, Lin BY, Mak AJ, Lowe TM. 2021. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res 49:9077–9096. doi: 10.1093/nar/gkab688. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number JALBUQ000000000. The version described in this paper is version JALBUQ010000000; the BioSample accession number is SAMN26542470, and the BioProject accession number is PRJNA814273. The raw data can be found in the Sequence Read Archive under the accession number SRR20187078.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES