Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2017 Sep 14;5(37):e00823-17. doi: 10.1128/genomeA.00823-17

First Whole-Genome Shotgun Sequence of a Promising Cellulase Secretor, Trichoderma koningiopsis Strain POS7

María Lorena Castrillo a,c,, Gustavo Ángel Bich a,c, Carlos Modenutti b,c, Adrián Turjanski b,c, Pedro Darío Zapata a,c, Laura Lidia Villalba a
PMCID: PMC5597750  PMID: 28912309

ABSTRACT

Trichoderma koningiopsis strain POS7 produces significantly large amounts of cellulase enzymes in solid-state fermentation. The Illumina-based sequence analysis reveals an approximate genome size of 36.6 Mbp, with a G+C content of 48.82% for T. koningiopsis POS7. Based on ab initio prediction, 12,661 coding genes were annotated.

GENOME ANNOUNCEMENT

Cellulolytic fungi, such as many Trichoderma species, play an important role in natural ecosystems, participating in the transformation of cellulose (1). Therefore, many efforts have been made toward the isolation and mass multiplication of microorganisms of this genus to produce cellulase enzymes with higher specific activity and outstanding efficiency (2, 3). The Trichoderma koningiopsis POS7 strain of this study was found to produce significantly large amounts of cellulase enzymes in solid-state fermentation, with high enzymatic stability (4). Looking at these prospects, we aimed to sequence the genome of a promising cellulase secretor strain and to gain detailed insights into its genomic features. We report for the first time, to our knowledge, the genome sequence of the T. koningiopsis species.

T. koningiopsis strain POS7 was collected from a forest environment of Misiones (Argentina) (27°24′31.8″S, 55°53′48.5″W). Genomic DNA from T. koningiopsis POS7 was extracted according to the protocol reported previously (5). Genomic DNA library construction and draft genome sequencing were performed by Macrogen Co. (South Korea) using the Illumina MiSeq system. A genomic DNA library was prepared using a rapid shotgun library. Quality control procedures removed DNA spike-in, artifacts, and ambiguous or low-quality reads. Paired ends having at least 90% of bases with a quality score greater than or equal to Q20 were filtered before assembly. These sequences were assembled de novo using the IDBA software package (6), and SSPACE was used to assemble the sequences in scaffolds (7). To predict genes in the T. koningiopsis POS7 genome, we used an ab initio gene predictor, geneID (8), which is specifically trained for this genome, and one homology-based gene predictor, Exonerate (9). Using a heuristic approach implemented in a homemade pipeline, we combined all predicted gene models to produce a nonredundant set of genes, in which a single best-gene model per locus was selected on the basis of sequence similarity to known proteins. We annotated and classified genes according to Gene Ontology (GO), EC numbers, and eukaryotic orthologous groups (KOGs).

The resulting genome sequence of T. koningiopsis POS7 has an estimated size of 36.6 Mb and a GC content of 48.82%. To accomplish the next-generation sequencing libraries, a rapid shotgun library was sequenced, resulting in 7,773,936 paired reads (36,586,254 bp), with an approximate insert size of 100 bp.

The reads were assembled in 147 scaffolds, with a scaffold N50 value of 1.75 Mb. This representative set included 12,661 protein-coding genes. The majority (68%) of the predicted genes contained multiple exons, with an average of 2.59 exons per gene. The average gene density, similar to that of most of the larger scaffolds, was 2.9 kb per gene. The average protein length was 450 amino acids. In total, 12,642 (99%) gene models were predicted to be complete. Approximately 83.5% of the predicted proteins showed sequence similarity to other proteins, primarily from fungi. We assigned GO terms to 7,509 (59.3%) of the predicted Trichoderma proteins. We also assigned 3,554 (28%) proteins to KOG clusters and 1,146 distinct EC numbers to 3,204 (25%) proteins.

Accession number(s).

This whole-genome shotgun project has been deposited at GenBank under the accession number MRBD00000000. The version described in this paper is the first version, MRBD01000000.

ACKNOWLEDGMENTS

M. L. Castrillo, G. Á. Bich, and C. Modenutti are postgraduate CONICET fellowship holders. A. Turjanski and P. D. Zapata are CONICET independent researchers.

Footnotes

Citation Castrillo ML, Bich GÁ, Modenutti C, Turjanski A, Zapata PD, Villalba LL. 2017. First whole-genome shotgun sequence of a promising cellulase secretor, Trichoderma koningiopsis strain POS7. Genome Announc 5:e00823-17. https://doi.org/10.1128/genomeA.00823-17.

REFERENCES

  • 1.Doolotkeldieva T, Bobusheva S. 2011. Screening of wild-type fungal isolates for cellulolytic activity. Microbiol Insights 4:1–10. [Google Scholar]
  • 2.Johnvesly B, Virupakshi S, Patil GN, Ramalingam Naik GR. 2002. Cellulase-free thermostable alkaline xylanase from thermophilic and alkalophillic Bacillus sp. JB-99. J Microbiol Biotechnol 12:153–156. [Google Scholar]
  • 3.Zhou J, Wang YH, Chu J, Zhuang YP, Zhang SL, Yin P. 2008. Identification and purification of the main components of cellulase from a mutant strain of Trichoderma viride T 100–14. Bioresour Technol 99:6826–6833. doi: 10.1016/j.biortech.2008.01.077. [DOI] [PubMed] [Google Scholar]
  • 4.Castrillo ML. 2015. Caracterización de celulasas secretadas por aislamientos de Trichoderma, nativos de la provincia de Misiones (Argentina) aplicables en la etapa de sacarificación. Doctoral thesis Universidad Nacional de Misiones, Misiones, Argentina. [Google Scholar]
  • 5.Castrillo ML, Fonseca MI, Bich GA, Jerke G, Horianski MA, Zapata PD. 2012. Taxonomy and phylogenetic analysis of Aspergillus section nigri isolated from yerba mate in Misiones (Argentina). J Bas Appl Genet 23:19–27. [Google Scholar]
  • 6.Peng Y, Leung HCM, Yiu SM, Chin FYL. 2012. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. doi: 10.1093/bioinformatics/bts174. [DOI] [PubMed] [Google Scholar]
  • 7.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27:578–579. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
  • 8.Blanco E, Parra G, Guigó R. 2007. Using geneid to identify genes. Curr Protoc Bioinformatics Chapter 4:Unit 4.3. doi: 10.1002/0471250953.bi0403s18. [DOI] [PubMed] [Google Scholar]
  • 9.Slater GS, Birney E. 2005. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics 6:31. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES