Skip to main content
Microorganisms logoLink to Microorganisms
. 2021 Jul 23;9(8):1564. doi: 10.3390/microorganisms9081564

An Update on Trichoderma Mitogenomes: Complete De Novo Mitochondrial Genome of the Fungal Biocontrol Agent Trichoderma harzianum (Hypocreales, Sordariomycetes), an Ex-Neotype Strain CBS 226.95, and Tracing the Evolutionary Divergences of Mitogenomes in Trichoderma

Yunyoung Kwak 1,2,3
Editor: Johannes Wöstemeyer
PMCID: PMC8401334  PMID: 34442643

Abstract

Members of the genus Trichoderma (Hypocreales), widely used as biofungicides, biofertilizers, and as model fungi for the industrial production of CAZymes, have actively been studied for the applications of their biological functions. Recently, the study of the nuclear genomes of Trichoderma has expanded in the directions of adaptation and evolution to gain a better understanding of their ecological traits. However, Trichoderma’s mitochondria have received much less attention despite mitochondria being the most necessary element for sustaining cell life. In this study, a mitogenome of the fungus Trichoderma harzianum CBS 226.95 was assembled de novo. A 27,632 bp circular DNA molecule was revealed with specific features, such as the intronless of all core PCGs, one homing endonuclease, and a putative overlapping tRNA, on a closer phylogenetic relationship with T. reesei among hypocrealean fungi. Interestingly, the mitogenome of T. harzianum CBS 226.95 was predicted to have evolved earlier than those of other Trichoderma species and also assumed with a selection pressure in the cox3. Considering the bioavailability, both for the ex-neotype strain of the T. harzianum species complex and the most globally representative commercial fungal biocontrol agent, our results on the T. harzianum CBS 226.95 mitogenome provide crucial information which will be helpful criteria in future studies on Trichoderma.

Keywords: divergence, divergence time, evolution, fungal mitochondrial genome, fungal mitogenome, selective pressure, Trichoderma

1. Introduction

Mitochondria are essential double-membrane bound subcellular compartments responsible for producing cellular energy, adenosine triphosphate (ATP), in eukaryotes [1]. Typically, their genetic materials exist in the form of a single chromosome (i.e., mitochondrial genome or mitogenome) [2] and consist of (1) conserved core protein-coding genes (PCGs) for the functional proteins involved in oxidative phosphorylation (OXPHOS), (2) genes encoding small- and large subunits of ribosomal RNA (rRNAs), and (3) transfer RNAs (tRNAs) [3]. However, mitogenomic features (e.g., lengths, gene orders, intergenic-/repetitive regions, mobile elements, and tRNA distribution) have been characterized differently in various eukaryotic lineages [4].

Mitogenomes are ideal tools for the evolutionary analysis of eukaryotes due to their (1) small size, (2) high copy numbers, (3) limited recombination, and (4) high mutation rates leading to independent evolutions of nuclear genomes [2,5]. When considering the ecological features of fungi, such as a broad diversity of lineages and lifestyles in nature, fungal mitogenomes can provide crucial information on evolutionary adaptations and divergences of the mitochondria in different environments [1,6]. According to the recent rapid developments of high-throughput sequencing technologies, fungal organelle genome databases have increasingly accumulated, however, these are still less abundant than those of animals and plants [4,7]. Thus far, most studies of fungal mitogenomes have focused on the (1) sequencing and characterization of individuals and (2) the elucidation of their phylogenetic positions within its relatives to support the potentials for one of the significant molecular markers identifying fungal species. Until recently, few studies have reported on comparative analyses of mitogenomes, ranging from a taxa level for specific fungal groups, under evolutionary viewpoints [8,9,10,11].

Trichoderma spp. (Hypocreales, Sordariomycetes, Ascomycota) are widely known as representative fungal biocontrol agents of plant pathogens due to their diverse nutritional modes (e.g., for being as saprotrophic, endophytic, and mycoparasitic) [12,13,14]. Since the genome sequencing of T. reesei QM6a (GenBank accession no. AAIL00000000) was first presented [15], Trichoderma sequencing studies have actively increased to gain a better understanding of their biological and ecological roles for the more improved applications (e.g., biofungicide, biofertilizer, and a producer of plant biomass hydrolyzing enzymes (CAZymes)); however, they were mainly focused on nuclear genomes [16,17,18,19,20]. To date (as of January 2021), only five Trichoderma mitogenomes (T. asperellum B05 (GenBank accession no. NC_037075), T. atroviride ATCC 26799 (GenBank accession no. MN125601), T. gamsii KUC1747 (GenBank accession no. KU687109), T. hamatum (GenBank accession no. MF287973), and T. reesei QM9414 (GenBank accession no. AF447590)) are available as a verified dataset in GenBank. In our previous report of the T. atroviride ATCC 26799 mitogenome, all of these five complete sequences were compared and reviewed for the first time [11].

When considering the species complex of T. harzianum, initially it was suggested to be an aggregate species including several cryptic species [21], genomic studies of T. harzianum can provide more knowledge of their adaptive evolution and functional mechanisms in particular niches with both of systematic- and ecological perspectives [19,20]. In terms of the T. harzianum CBS 226.95, an ex-neotype strain of the T. harzianum species complex and the most globally representative commercial fungal biocontrol agent, the nuclear genome of this fungus has already been sequenced (GenBank accession no. GCA_003025095) [19], but the mitogenome has not. Here, we present a complete de novo mitogenome of the fungus T. harzianum CBS 226.95 with genomic characterization, comparative analyses, and the tracing of time-scaled evolutionary divergences.

2. Materials and Methods

2.1. DNA Extraction and Genome Sequencing

The ex-neotype strain T. harzianum CBS 226.95 [22] was obtained from the Westerdijk Fungal Biodiversity Institute (formerly the Centraalbureau voor Schimmelcultures (CBS) Fungal Biodiversity Centre, The Netherlands). The fungus was cultured on DifcoTM potato dextrose medium (Difco Laboratories Inc., Detroit, MI, USA) at 25 °C, and the obtained mycelia were used as a source of genomic DNA. Total genomic DNA was extracted using a Plant/Fungi DNA Isolation Kit (Sigma-Aldrich Co. Ltd., St. Louis, MO, USA) according to the manufacturer’s instructions and followed by further purification using Phenol-Chloroform (Sigma-Aldrich Co. Ltd., USA) [23]. A quality/quantity of the extracted DNA was measured using the Qubit assay in a QubitTM 3.0 Fluorometer (Thermo Fisher Scientific Inc., Waltham, MA, USA).

The genomic DNA was used to construct a 20 kb insert SMRTbell® DNA library in a BluePippinTM size-selection system (Pacific Biosciences, Menlo Park, CA, USA). Then, it was sequenced on a single molecule real-time (SMRT) sequencing platform by a PacBio RS-II DNA sequencer with P6 polymerase-C4 sequencing chemistry (Pacific Biosciences, USA) [24] at the Génome Québec Innovation Centre (Montréal, QC, Canada).

2.2. De Novo Assembly and Annotation

Only raw reads with a read-quality higher than 0.85 were used for de novo mitogenome assembly using the hierarchical genome-assembly process (HGAP) [25] in the SMRTTM pipeline (Pacific Biosciences, USA). The de novo mitogenome assembly’s accuracy was verified using the P-mapping module [26], and the circularity was confirmed using Gepard [27].

The mitogenome was primarily annotated on webservers for Mfannot (Mfannot, http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl (accessed on 12 January 2021) [28] and MITOS (Mitos, http://mitos.bioinf.uni-leipzig.de/index.py (accessed on 12 January 2021)) [29], under the Genetic Code 4 (the Mold, Protozoan, and Coelenterate Mitochondrial Code). Then, these annotations were modified by BLAST searching (default parameters) on the NCBI non-redundant (nr) database [30] and NCBI Open Reading Frame (ORF) Finder [31]. All annotations relating to gene boundaries were also manually curated to avoid artificial frameshifts in the ORFs. The tRNAs were annotated using the webserver for RNAweasel (RNAweasel, http://megasun.bch.umontreal.ca/cgi-bin/RNAweasel/RNAweaselInterface.pl (accessed on 12 January 2021)) [32] and tRNAScan-SE (v.2.0) [33], under the Genetic Code 4. A circular map of the mitogenome was illustrated using Circos [34]. Finally, the complete de novo mitogenome sequence of T. harzianum CBS 226.95 was deposited in GenBank under the accession number MN564945.

2.3. Genomic Analysis

The base composition and codon usage were analyzed using MEGA (v.7.0) [35] and the EMBOSS package [36]. The secondary structure of tRNAs was predicted by tRNAScan-SE (v.2.0) [33], and the asymmetric bias of nucleotide composition was calculated using the following formulas [37]: AT-skew = (A − T)/(A + T) and GC-skew = (G − C)/(G + C). The interspersed-/tandem repeats were analyzed using BLAST [38] via the self-comparison of mitogenome itself (parameter: E-values < 10−10) and Tandem Repeats Finder (v.4.0; default parameters) [39], respectively. All intron loci and intron types were investigated on the webserver for RNAweasel (RNAweasel, http://megasun.bch.umontreal.ca/cgi-bin/RNAweasel/RNAweaselInterface.pl (accessed on 12 January 2021)) [32] under the Genetic Code 4, and intronic ORFs were identified by BLAST searching on the NCBI-nr database [30] and NCBI-ORF Finder [31]. Finally, the evolutionary pairwise genetic distance between the nucleotide sequences was measured using the proportional (p) distance method in MEGA (v.7.0) [35], and the obtained values were plotted using ggplot2 [40] in R (v.3.4) [41].

2.4. Phylogenetic Analyses

A total of 47 Sordariomycetes mitogenomes were downloaded from the GenBank database on NCBI (GenBank, https://www.ncbi.nlm.nih.gov/genbank/ (accessed on 12 January 2021)) [42]. Full nucleotide sequences of the 13 core PCGs (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6; in this study, the atp9 gene was excluded because it has not been reported in the GenBank dataset of the T. gamsii KUC1747 mitogenome (GenBank accession no. KU687109)) were individually aligned using MAFFT (v.7.4; parameter: mafft -auto) [43], and these multiple alignments were concatenated in SequenceMatrix (v.1.8) [44]. The best-fit evolutionary model was determined using jModelTest (v.2.1) under the Information Criterion of Akaike (AIC) and Bayesian (BIC) [45], and the GTR + Gamma + I substitution model was selected as the best-fit model from all criteria.

Maximum likelihood (ML) [46] analysis was conducted in RAxML (v.8.2) with 1000 bootstrap replicates [47], and the bootstrap (BS) values were displayed on the nodes of the constructed phylogenetic tree. Bayesian inference (BI) [48,49] analysis was conducted in MrBayes (v.3.2) [50] under the Markov chains Monte Carlo (MCMC) algorithm (three heated and one cold chain, with a heating coefficient 0.1) for 2 × 106 generations (sampling for every 400 gen.). After a burn-in for the first 25% of the sampled trees, the remaining trees were used to construct the consensus BI tree with values of Bayesian posterior probabilities (BPPs). The stationarity of BI analysis was determined by calculating the average standard deviation for split frequencies (<0.01).

2.5. Time-Scaled Bayesian Phylogeny

Using the target mitogenomes of Hypocreales species, only exon sequences of the 13 core PCGs (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6; the atp9 gene was excluded because it has not been reported in the GenBank dataset of the T. gamsii KUC1747 mitogenome (GenBank accession no. KU687109)) were individually aligned using MAFFT (v.7.4; parameter: mafft -auto) [43] and concatenated in SequenceMatrix (v.1.8) [44].

Under a best-fit of the GTR + Gamma substitution model determined by jModelTest (v.2.1) [45], MCMC analysis was counted for 4 × 107 generations (sampling for every 1000 gen.) with a lognormal relaxed clock in BEAST2 (v.2.5) [51]. Calibration nodes were defined by five ancestral nodes (i.e., set clade ages for covering a central 95% probability range at the common ancestral node for (1) all species (272–288 Mya); (2) the order Hypocreales (185–201 Mya); (3) the family Nectriaceae (117–133 Mya); (4) the family Ophiocordycipitacea (109–125 Mya); and (5) the family Hypocreaceae (112–128 Mya)—based on the Bayesian chronogram previously reported using 638 core orthologous proteins by Hypocreales genomes [20], and species within the clades for each calibration node were constrained to be a monophyletic group. All parameters of MCMC output were verified to meet the effective sample size (ESS; as above 200) in Tracer (v.1.7) [52]. After a burn-in for the first 25% of the sampled trees, only remaining trees were used to construct the consensus tree in TreeAnnotator (v.2.5; in BEAST2 package), and all estimated divergence times (million years ago, Mya) were visualized on the time-scaled BI tree for node ages in FigTree (FigTree, http://tree.bio.ed.ac.uk/software/figtree/ (accessed on 12 January 2021)) (v.1.4).

2.6. Selective Pressures Analysis

The exon sequences of the 13 core PCGs (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6; the atp9 gene was excluded because it has not been reported in the GenBank dataset of the T. gamsii KUC1747 mitogenome (GenBank accession no. KU687109)) were individually aligned using ClustalW (parameter: -codon option) in MEGA (v.7.0) [35], and these in-frame codon alignments were used to explore evolutionary pressures.

The non-synonymous substitution rate (Ka), synonymous substitution rate (Ks), and Ka/Ks ratio were calculated on the pairwise alignments of each target gene between species using DnaSP (v.6.12) [53], and obtained values were plotted using ggplot2 [40] in R (v.3.4) [41]. The codon sites evolving under the episodic diversifying positive selection were detected using the mixed effects model of evolution (MEME) method [54] in HyPhy (v.2.1) [55], and sites under the p-value threshold of 0.1 (p < 0.1) were considered.

3. Results

3.1. Genomic Features of the T. harzianum CBS 226.95 Mitogenome

After filtering low-quality reads, a total of 1,806,923,812 read bases were used as seeds for the assembling process, and a complete mitogenome of T. harzianum CBS 226.95 was assembled de novo with a length of 27,632 bp (coverage depth, 573.59×; GenBank accession no. MN564945).

A closed-circular DNA molecule of the T. harzianum CBS 226.95 mitogenome was composed of A (36.1%), T (36.4%), G (15.1%), and C (12.4%), with the AT content (72.45%) approximately 2.63 times higher than the GC content. The asymmetric biases of these nucleotide compositions were inferred from the skewness, each for the negative value of AT-skew and the positive value of GC-skew, it showed more frequency of thymine (T) and guanine (G) than adenine (A) and cytosine (C) in the forward strand. The coding and non-coding regions of the entire length of the mitogenome accounted for approximately 79.25% and 20.75%, respectively (Table S1). All genes constituting the coding regions were predicted to be transcribed in a clockwise direction on the same forward strand (Figure 1).

Figure 1.

Figure 1

Circular map of the complete mitochondrial genome of T. harzianum CBS 226.95. Gene components were indicated with different color blocks, and the circular map was plotted using Circos.

All 14 conserved core PCGs were composed of three ATP synthases (atp6, atp8 and atp9), one apocytochrome b (cob), three cytochrome c oxidases (cox1, cox2 and cox3), and seven NADH dehydrogenases (nad1, nad2, nad3, nad4, nad4L, nad5 and nad6), without any introns capable of splitting each of their coding sequences (CDS) to the exon–intron structure. Two ORFs that were predicted without assigned functions, orf408 and orf170, were also found as a form of non-freestanding ORF. The orf408 gene was entirely located within a coding region of the rps3 gene for ribosomal protein S3 and rrnL gene for a large subunit ribosomal RNA, and the orf170 gene was identified completely within a conserved domain for homing endonuclease (HE). All of the core PCGs and ORFs were initiated with the standard ATG start codon and terminated with the TAA stop codon, except for three genes: the nad4 gene with the GTG start codon, and atp9 and orf170 with the TAG stop codon. An overlap of one nucleotide base was only found between the TAA stop codon of the nad4L gene and the ATG start codon of the nad5 gene (−1 bp, corresponding to the codon of A at nucleotide position 897) (Table S1).

All 24 tRNA genes (trn genes), accounting for approximately 6.49% of the entire mitogenome, were shown with average lengths ranging from 71 bp to 87 bp. These trn genes were found as single copies, but several were found in multiple copies with different anticodons (two copies for trnRArg (trnRArg[ACG] and trnRArg[TCT]), two copies for trnSSer (trnSSer[GCT] and trnSSer[TGA]), and two copies for trnLLeu (trnLLeu[TAA] and trnLLeu[TAG])). A trnMMet gene coding for methionine was also found in three copies with the same anticodon ((trnMMet[CAT]) (Tables S1 and S2)). Most of the tRNA genes were predicted to have a typical cloverleaf secondary structure. However, some trn genes (positions: trnYTyr at 13,598, trnSSer at 13,854 and 16,306, and trnLLeu at 21,911 and 22,462) were found with an extra variable arm (V-arm) (Figure S1), which was consistent with previous reports about the specificity of extra V-arm to tRNALeu, tRNASer, and bacterial-/eukaryotic organellar tRNATyr [56,57].

Two rRNA genes, the rns gene for small subunit ribosomal RNA and the rrnL gene for large subunit ribosomal RNA, accounted for approximately 22.50% of the entire length of mitogenome. The rns gene located between the atp6 gene and the cox3 gene was found as a freestanding ORF (length 1503 bp). Otherwise, the rrnL gene was predicted in a position ranging over two of PCGs, orf408 and rps3 (length 4713 bp). The rps3 gene for ribosomal protein S3 was found to be an internal ORF of the group I intron that was positioned within the rrnL gene [58], and was 1368 bp in size (141 bp larger than orf408) with the ATA start codon (Figure 1 and Table S1).

Leucine was the most abundant amino acid (13.57%), followed by isoleucine (11.22%), serine (8.30%), and phenylalanine (7.20%). In contrast, the least frequent amino acid was cysteine (0.46%) (Table S3). Lastly, three specific sites were found for interspersed repeats, and tandem repeats were also detected with consensus lengths ranging from 9 to 26 bp on the multi-copies (Table S4).

3.2. Phylogenetic Analyses of the T. harzianum CBS 226.95 Mitogenome

Excluding an unverified dataset, 47 complete mitogenomes (i.e., 46 from Hypocreales species (belonging to the class Sordariomycetes) and one of N. crassa OR74A (belonging to the order Sordariales that composed of the class Sordariomycetes with the order Hypocreales)) were retrieved from the GenBank and used in the analyses (Table S5).

In both of the molecular phylogenies (ML- and BI tree), all Hypocreales species were well observed with high confidence, under the significant clades supported by seven major families of the order Hypocreales; interestingly, Paecilomyces penicillatus SAAS_ppe1, belonging to the family Clavicipitaceae, was found with a closer relationship with the family Hypocreaceae than those of native family (BS = 100, BPP = 1.00). It may be resolved by improving the Hypocreales phylogeny resolution on accumulations of informative datasets of the mitochondrial genomes from closely related family species (Figure 2 and Figure S2).

Figure 2.

Figure 2

Bayesian phylogeny for a position of the T. harzianum CBS 226.95 mitogenome in Hypocreales. The tree was generated using concatenated sequences of 13 core genes (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5 and nad6), and the mitochondrial genome of Neurospora crassa OR74A (Sordariales) was used as an outgroup. All Sordariomycetes species that used for the phylogenetic tree are described in Table S5, and Bayesian posterior probabilities (BPPs) values were marked on the nodes. Asterisks for the Neurospora crassa OR74A mitogenome (*) and T. harzianum CBS 226.95 mitogenome (**).

The genus Trichoderma, including T. harzianum CBS 226.95, was placed in a sister clade with the genus Hypomyces. Among hypocrealean fungi, a sister relationship between T. harzianum ATCC 26799 and T. gamsii KUC1747 was observed identically in both tree topologies (BS = 100, BPP = 1.00) (Figure 2 and Figure S2).

3.3. Comparative Analysis of the Trichoderma Mitogenomes

The complete de novo mitogenome of T. harzianum CBS 226.95 was compared with five complete Trichoderma mitogenomes that publicly available from the genus Trichoderma, excluding an unverified dataset (as of January 2021) (Table 1).

Table 1.

Comparison of mitochondrial genomic features among Trichoderma species.

Strain T. harzianum CBS 226.95 a T. reesei
QM9414 a
T. atroviride
ATCC 26799 a
T. gamsii KUC1747 a T. asperellum
B05 a
T. hamatum a
GenBank
accession no.
MN564945 AF447590 MN125601 KU687109 NC_037075 MF287973
Mitochondrial genome size (bp) 27,632 42,130 32,758 29,303 29,999 32,763
AT content (%)/
GC content (%)
72.5/27.6 72.8/27.2 71.8/28.2 71.7/28.3 72.2/27.8 72.3/27.7
AT-skew/
GC-skew
(−)0.004/0.098 0.041/0.086 0.006/0.091 (−)0.062/0.036 (−)0.066/0.043 (−)0.002/0.031
Protein CDS/
tRNAs b/rRNAs
18/24 (1)/2 19/23 (2)/2 21/27 (1)/2 18/26 (1)/2 17/25 (1)/1 20/26 (1)/2
Intergenic region (%) 20.75 17.58 24.93 30.15 31.73 19.05
Genome identity (%)
(query coverage)
Used as query 95.17
(93.0%)
95.71
(88.0%)
95.52
(88.0%)
94.56
(88.0%)
94.69
(87.0%)
No. of intron locus c
(subtype)
2
(IA (2))
11
(IA (2), IB (6),
C2 (1), ID (2))
4
(IA (1), IB (3))
1
(IA (1))
3
(IA (1), IB (2))
5
(IA (1), IB (3), IC2 (1))
Note This study - - Non-existence of atp9 gene Non-existence of rrnL gene -
In Figure 3A, indicated as TS T1 T2 T3 T4 T5

a Based on the GenBank database, NCBI (datasets as of, January 2021). b A number of putative trn genes predicted within the protein-coding genes were indicated inside the parentheses (details in Table S7). c Details of the detected group I intron loci (subtype, position, intronic ORF) are shown in Table S8.

The entire lengths of mitogenome ranged from 27,632 bp (T. harzianum CBS 226.95) to 42,130 bp (T. reesei QM9414) with an average AT content (72.2%). The intergenic regions ranged from 17.58% (T. reesei QM9414) to 31.73% (T. asperellum B05), and the mitogenome of T. harzianum CBS 226.95 was shown to fall in the middle of this range. The mitogenomes of T. gamsii KUC1747, T. asperellum B05, and T. hamatum were presented with the negative value of AT-skew as those of T. harzianum CBS 226.95. In contrast, only two mitogenomes, from T. reesei QM9414 and T. atroviride ATCC 26799, contained all positive values for both AT-/GC-skew (Table 1).

The mitogenome of T. atroviride ATCC 26799 was shown to harbor more protein CDS and trn genes than other Trichoderma species (Table 1). Across Trichoderma species, gene lengths for apocytochrome b (cob) and cytochrome c oxidases (cox1, cox2, and cox3) were observed in various sizes due to the presence of introns (Table S6). The T. reesei QM9414 mitogenome, which contains both the most extensive length of entire sequences and restricted proportions of non-coding regions, was observed to carrying more introns than other Trichoderma species. A total of 11 intron loci were detected, mainly in the exon–intron structures of the PCGs for apocytochrome b (cob) and cytochrome c oxidases (cox1, cox2, and cox3), which increase the full size of the target gene itself than the total number of annotated genes in the mitogenome sequences, whereas few introns were predicted in the T. gamsii KUC1747 and T. harzianum CBS 226.95 (Table 1 and Table S8). Notably, an IA-type intron (i.e., an intron of the subtype IA) was commonly found with an intronic ORF in the region of the rrnL gene of all Trichoderma species except T. asperellum B05 (i.e., in this study, the likelihood of the rrnL gene being present in the T. asperellum B05 mitogenome was not considered because this gene was not reported in the GenBank dataset of the T. asperellum B05 mitogenome) (Figure 3A and Table S8). Finally, compared with other Trichoderma species, the mitogenome of T. harzianum CBS 226.95 showed the highest identity of 95.17% to T. reesei QM9414 (Table 1).

Figure 3.

Figure 3

Comparative analysis of Trichoderma mitogenomes—I. (A) Comparison of the gene components. TS, T. harzianum CBS 226.95; T1, T. reesei QM9414; T2, T. atroviride ATCC 26799; T3, T. gamsii KUC1747; T4, T. asperellum B05; T5, T. hamatum. (B) Bayesian chronogram for the evolutionary divergences of Trichoderma mitogenomes. Estimated chronological times were presented in the time scale of Mya (million years ago), with 95% confidence interval values in blue bars. Bayesian posterior probabilities (BPPs) values and calibration points were indicated at nodes using numbers (black) and asterisks (red), respectively. Asterisks for the Neurospora crassa OR74A mitogenome (*) and T. harzianum CBS 226.95 mitogenome (**).

3.4. Divergence Times of the Trichoderma Mitogenomes in Hypocreales

To explore the evolutionary divergence of Trichoderma mitogenomes in Hypocreales, a range for the taxonomic analysis was set to cover major families of the order of Hypocreales with species, such as (1) 10 selected species that represented each significant family in the order Hypocreales (one for Bionectriaceae (Clonostachys rosea 6792), two for Clavicipitaceae (Epichloe typhina E8 and Metarhizium anisopliae ME1), two for Cordycipitaceae (Cordyceps militaris EFCC-C2 and Beauveria bassiana), one for Hypocreales incertae sedis (Acremonium chrysogenum ATCC11550), two for Nectriaceae (Nectria cinnabarina 5175 and Fusarium oxysporum F11), and two for Ophiocordycipitaceae (Hirsutella thompsonii ARSEF9457 and Ophiocordyceps sinensis)), (2) one strain H. aurantius from the genus Hypomyces, to use as a sub-outgroup for the genus Trichoderma within the Hypocreaceae clade, and (3) one strain N. crassa OR74A, belonging to the order Sordariales that composed of the class Sordariomycetes with the order Hypocreales, to use as an outgroup for all Hypocreales species. Based on the complete mitogenomes of the selected species (Table S5), only exons sequences of the 13 core PCGs were used in the phylogenetic analysis to remove the potential of evolutionary effects by the presence/numbers of introns within the nucleotide sequences of the target gene.

On the time-scaled Bayesian phylogeny, all positions of species were well supported with high BPPs values (BPP = 1.00). Five calibration points were also presented with significant node ages, which were entirely consistent with those of the Bayesian chronogram previously reported using orthologous proteins of Hypocreales genomes (i.e., including Trichoderma genomes) [20].

Under the occurrence of the family Hypocreaceae at 114.83 Mya, a divergence time of the Trichoderma genus was estimated to be 52.46 Mya, and the speciation events that evolved within the genus Trichoderma appear to occur during the following periods: (1) 41.86 Mya between T. harzianum CBS 226.95 and T. reesei QM9414; (2) 8.06 Mya between T. asperellum B05 and T. hamatum; and (3) 3.82 Mya between T. atroviride ATCC 26799 and T. gamsii KUC1747. Interestingly, when considering the divergent times of Trichoderma species that dated on the Bayesian chronogram using orthologous of Hypocreales genomes (i.e., including Trichoderma genomes) [20], the speciation between T. harzianum CBS 226.95 and T. reesei QM9414 (41.86 Mya) was comparable to that of Trichoderma genomes (46 Mya; [20]) (Figure 3B). However, the speciation events of the Trichoderma clade consisting of four species (T. asperellum B05, T. atroviride ATCC 26799, T. gamsii KUC1747, and T. hamatum) evolved later (8.06–3.82 Mya) than those of Trichoderma genomes (25–11 Mya; [20]) (Figure 3B), which may be an indication of the mitogenome evolution that more recently and independently occurred apart from nuclear genomes.

4. Discussion

4.1. Putative tRNAs Overlapped with Protein-Coding Genes

Considering the tRNA genes that fully or partially overlapped with PCGs, some studies of metazoan/nematode mitogenomes have noted that it may allow (1) avoiding the loss of tRNA genes or (2) keeping gene functions by the co-evolution of two overlapping genes, on the mitogenome evolving to compacted lengths [56,59]. Recently, several putative trn genes, fully positioned within the PCGs of Trichoderma mitogenomes, have been described in our study of the T. atroviride ATCC 26799 mitogenome (e.g., tRNAVal overlapped with the 3′ end region of nad6 gene, tRNAMet integrated within the 5′ end region of nad2 gene) [11]. Similarly, a putative trnVVal gene, fully overlapped with the 3′ end region of the nad6 gene, was also found in the complete de novo mitogenome of T. harzianum CBS 226.95 (Tables S2 and S7).

The tRNA genes of mitochondria are expected to have evolved through various processes, such as (1) duplications or degenerations of tRNAs, (2) transpositions of tRNAs to the nucleus, causing the reduced tRNAs in the mitogenome itself, and (3) mutations of the tRNA anticodon, leading a remodeling of the tRNA secondary structure, however, the molecular-/functional aspects of these mitochondrial tRNAs have not yet been fully understood [60,61]. Interestingly, among the Trichoderma mitochondrial tRNAs, which are responsible for 20 standard amino acids in the translation system of mitochondrial coding proteins, a putative trnVVal gene was only found for the amino acid valine across all Trichoderma species (Table S7). This feature requires further study concerning (1) the expression of putative trnVVal as a form of functional tRNA, (2) whether it is co-expressed within the mature mRNA of PCGs, and (3) whether the mitochondria require a nucleus-encoded trnVVal from the cytosol for the mitochondrial translation system (i.e., because the putative trnVVal, which is positioned within the protein coding sequences, might already be recognized as a pseudo-tRNA by the tRNA degenerations). Given these questions, all putative trn genes that were observed in the PCGs (trnVVal and tRNAMet) actively support the likelihood of complex and dynamic evolutions of tRNAs in Trichoderma mitogenomes.

4.2. Evolution of Mitochondrial Introns in Trichoderma

In our recent study of the T. atroviride ATCC 26799 mitogenome [11], Trichoderma mitochondrial introns were characterized with (1) the similarities of intron sequences between Trichoderma species, though not clustered in most within the range of the Trichoderma genus, (2) potentials of the horizontal transfer of introns between inter-/intraspecific fungal diversities over the genus level of Trichoderma, and (3) several gain/loss events of introns on the reconstructed ancestral states of these introns [11]. In contrast to previous studies based on the most homologous introns of the core PCGs, in this study on the mitogenome of T. harzianum CBS 226.95, only two intron loci (all subtype IA) were observed regardless of the core PCGs: one was found in the rrnL gene with two internal ORFs (orf408 and rps3) as an rrnL-i (i.e., intron placed within the rrnL gene), another was also detected with multiple intronic ORFs that were composed by embedding the orf170 gene in an 864 bp-sized HE harboring a GIY-YIG motif (Figure 3A, Tables S1 and S8). Nevertheless, compared among six Trichoderma species, all features of the Trichoderma mitochondrial introns that were described in our previous study are still meaningful through the constant patterns of intron distributions in Trichoderma species (e.g., most of the intron loci were detected in the exon–intron structures of PCGs for apocytochrome b (cob) and cytochrome c oxidases (cox1, cox2, and cox3), but not in the two species of T. gamsii KUC1747 and T. harzianum CBS 226.95 that harbor all core PCGs on the intronless structures) (Figure 3A and Table S8).

Since mitochondrial endosymbiosis occurred between the earliest ancestor of mitochondria (pre-mitochondrial alphaproteobacterium) and an archaeal-derived host (Asgard archaea), the ancestral mitochondria genome had experienced massively reduced mitogenomes, similar to modern mitochondria [1]. However, despite its ultimately compact size that was driven by reductive evolutionary processes (e.g., mainly, the loss of redundant/unnecessary genes and/or endosymbiotic gene transfers to the nucleus) [1], the structural intricacy of the current mitogenome varies immensely across eukaryotes [4,62]. In fungal mitogenomes, the group I intron, which is capable of self-splicing, causes the architectural complex of the mitogenome [32,63,64]. Among six Trichoderma mitogenomes, the existence of mitochondrial introns was involved in the structural complexity of the mitogenome itself (Figure 3A), however, strictly not leading to the increase in the entire lengths of their mitochondrial sequences. Excluding T. reesei QM9414, a correlation between the numbers of introns present and the increase in mitogenome size could not be typically defined across Trichoderma species, suggesting the significance of the location and size of the intron locus rather than the intron’s existence itself within mitogenome sequences (Table 1 and Table S8).

Including our previous study of the T. atroviride ATCC 26799 mitogenome [11], several recent studies of fungal species have demonstrated the evolutionary dynamics of mitochondrial introns, such as multiple-gains/losses/degenerations of introns, by tracing the ancestral history of these intron loci within the relevant diversity of the target fungus [9,65]. In this study, ancestral states of intron events across all six Trichoderma species were not constructed due to the absence of homologous intron loci of the core PCGs within the T. harzianum CBS 226.95 mitogenome (Figure 3A and Table S8). However, when comparing phylogenies, six Trichoderma mitogenomes were identically arranged in two kinds of BI topologies, which were obtained either by using full genes (including introns) of the core PCGs (Figure 2) or by only using exon sequences (excluded introns) of the core PCGs (Figure 3B), such as (1) a clade by T. atroviride ATCC 26799 and T. gamsii KUC1747, (2) a clade by T. asperellum B05 and T. hamatum, and (3) a clade by T. harzianum CBS 226.95 and T. reesei QM9414. In particular, the clustering between T. reesei QM9414 (the largest sized one with the highest number of intron loci) and T. harzianum CBS 226.95 (the smallest lengths harboring few introns) deserved attention related to the divergence time of this clade that was estimated to be much older (~41.86 Mya) than the other two clades (8.06–3.82 Mya) (Figure 2 and Figure 3B). Given the theory of genomic streamlining [66], some, but not all, species of lichenized fungi are speculated to have experienced multiple intron loss events for mitogenome contractions [10]. However, in the evolutionary view of eukaryogenesis, the origin and evolution of introns have been considered using the comprehensive perspectives of two competing theories: the ‘introns-early’ theory (i.e., introns first existed at the earliest stages of life’s evolution, then evolved toward intron loss) and the ‘introns-late’ theory (i.e., introns emerged/increased in eukaryotes during eukaryote evolution)’ [67]. All Trichoderma clustering discovered in this study seems to be insufficient to infer whether the Trichoderma mitogenomes evolved to reduce mitogenome sizes due to intron loss and degeneration under genome streamlining, but clearly imply the continuation of varied-gain and loss of mitochondrial introns during Trichoderma evolution (Figure 2 and Figure 3B).

When considering the identical arrangements of Trichoderma clustering between two BI topologies that were obtained with/without intron sequences of the core PCGs (Figure 2 and Figure 3B), it actively supports a shared evolutionary history between the core PCGs and their introns in Trichoderma. Meanwhile, IA-typed rrnL-i was commonly detected across all Trichoderma species except T. asperellum B05 (i.e., in this study, the likelihood of the rrnL gene being present in the T. asperellum B05 mitogenome was not considered because this gene was not reported in the GenBank dataset of the T. asperellum B05 mitogenome (GenBank accession no. NC_037075)) (Figure 3A and Table S8). Although it should be comprehensively reflected in the unique features of other intron loci, such as (1) the intron locus harboring multiple intronic ORFs, or (2) the coexistence of different intron subtypes within a single coding gene (Table S8); nevertheless, the rrnL-i is expected to play an essential role in revealing the dynamic history of Trichoderma mitochondrial introns, as one of the primitive factors that is assumed to be inherited from the most recent common ancestor of the Trichoderma genus.

4.3. Time-Scaled Bayesian Phylogeny with the Strain T. harzianum HB324

In addition to the six complete mitogenomes of Trichoderma species (including T. harzianum CBS 226.95 of this study), there is additional mitogenomic information from the group of T. harzianum species; the 32,277 bp mitogenome of T. harzianum HB324 (GenBank accession no. MT263519). Nonetheless, defined annotations of this mitogenome are unavailable in GenBank because it was publicly deposited as an unverified dataset (as of January 2021). In a recent study of this mitogenome that described mitogenomic features for the first time [68], some coding genes were suggested with multiple terminations of their coding regions. For these reasons, in this study, the core PCGs of the T. harzianum HB324 mitogenome were re-annotated to remove the positions of stop codons that were multiply predicted in the middle of coding genes and re-defined their gene lengths (Table S9). Then, the evolutionary divergences of Trichoderma mitogenomes were re-traced with T. harzianum HB324 harboring newly modified annotations, under the same analytical conditions that applied for the BI phylogeny in Figure 3B. On the newly constructed time-scaled Bayesian tree, all fungal species were well-positioned with high BPPs values (BPP = 1.00), and five calibration points were also indicated to be compatible with those of the Bayesian chronogram using orthologous of Hypocreales genomes (i.e., including Trichoderma genomes) [20]. However, despite belonging to the same species of T. harzianum, two strains of CBS 226.95 and HB324 were placed separately (Figure 4A).

Figure 4.

Figure 4

Comparative analysis of Trichoderma mitogenomes—II. (A) Time-scaled Bayesian tree on Trichoderma mitogenomes that includes a new isolate of Trichoderma. Estimated chronological times were presented in time scale in the time scale of Mya (million years ago), with 95% confidence interval values in blue bars. Bayesian posterior probabilities (BPPs) values and calibration points were indicated at nodes using numbers (black) and asterisks (red), respectively. (B) Pairwise genetic distance among Hypocreales mitogenomes. All used fungal species (including N. crassa OR74A as an outgroup) and obtained p-distance values are described in Tables S5 and S10, respectively. Asterisks for the Neurospora crassa OR74A mitogenome (*) and T. harzianum CBS 226.95 mitogenome (**).

As shown in the two BI topologies (Figure 2 and Figure 3B), phylogenetic replacements of the complete six Trichoderma mitogenomes (i.e., T. asperellum B05, T. atroviride ATCC 26799, T. gamsii KUC1747, T. hamatum, T. harzianum CBS 226.95, and T. reesei QM9414) were congruent with those of the chronogram using 638 core orthologous from Hypocreales genome [20], such as (1) a clade by T. harzianum and T. reesei, and (2) a clade consisting of subclades by four Trichoderma species (T. asperellum, T. atroviride, T. gamsii, and T. hamatum). However, as shown in Figure 4A, among the seven Trichoderma mitogenomes (including strain HB324), T. harzianum HB324 was positioned as an outgroup to all Trichoderma taxa. At the same time, T. harzianum CBS 226.95 was clustered with a subclade comprised of the five remaining Trichoderma species. According to the joining of strain HB324, Trichoderma speciation was also predicted more divergently. The divergence of T. harzianum HB324 was estimated at 87.28 Mya and followed by T. harzianum CBS 226.95 at 55.69 Mya. Furthermore, T. reesei QM9414 appeared later than T. harzianum CBS 226.95 at 44.77 Mya (Figure 4A).

Compared to T. harzianum CBS 226.95, the mitogenome of T. harzianum HB324 was approximately 4645 bp larger and exhibited a sequence identity of 92.23% (query coverage 94%). The GC content was similar, but skewness patterns were different for all positive values in T. harzianum HB324. As described in the beginning of this discussion, putative trn genes entirely positioned within the coding genes (e.g., tRNAVal of the nad6 gene, tRNAMet of the nad2 gene) were found across all Trichoderma species (Table S7)—however, not in the T. harzianum HB324 (Table S9). In addition, when comparing genetic distances using exon sequences of the 13 core PCGs (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6; the atp9 gene was excluded because it has not been reported in the GenBank dataset of the T. gamsii KUC1747 mitogenome (GenBank accession no. KU687109)), T. harzianum HB324 was assumed to be more divergent from T. harzianum CBS 226.95 (p-distance value, 0.099) than other Trichoderma species (p-distance values, 0.055–0.057) (Figure 4B and Table S10). All these aspects showed a presence of mitogenomic differentiation between two T. harzianum strains, CBS 226.95 and HB324. Nevertheless, it seems that one additional point should be also contemplated together as below.

Since the first description of the name Trichoderma [69], phylogenetic concepts of the genus Trichoderma have been developed to systematically identify Trichoderma taxonomy [70]. Particularly, in earlier studies, some Trichoderma strains were misidentified, and even the name of T. harzianum has been misused for many different species due to the species complex of Trichoderma [71,72,73,74]. To overcome these difficulties, species of Trichoderma have recently been (re-)identified for the most accurate discrimination between Trichoderma using combined approaches, such as (1) morphological phenotypes and (2) simultaneous applications using multiple-DNA barcodes on regions of rDNA internal transcribed spacers (ITS), elongation factor 1-α (tef1), and RNA polymerase II subunit (RPB2) [74,75]; consequentially, a taxonomy of the T. harzianum species complex was revised to include 14 species [74]. For instance, the strain P1 (ATCC 74058), strain T22, and strain ATCC 26799, widely known as representatives of T. harzianum species in earlier studies, were re-determined to be T. atroviride P1 [71], T. afroharzianum T22 [74], and T. atroviride ATCC 26799 [75], respectively. Meanwhile, the fungus T. harzianum HB324, isolated from the leaves of rubber tree (Hevea brasiliensis), was reported to be identified using a DNA barcode only for the ITS region (used PCR primers for ITS4 and ITS5) [76].

In a recent study of the T. harzianum HB324 mitogenome [68], a time-scaled maximum likelihood (ML) tree was constructed using only three coding genes (atp8, atp9, and cox3). On this ML tree, the mitogenome of T. harzianum HB324 occurred at 27.40 Mya, placed apart from the clade covering all other Trichoderma species [68]. In contrast, though disagreements of analysis conditions must be considered (e.g., tree platform, applied gene sets), the clustering of Trichoderma species on the time-scaled BI tree obtained in this study was congruent with those of the chronogram using nucleus orthologous proteins of Hypocreales [20] (Figure 3B). Nevertheless, when considering the newly constructed BI chronogram by including strain HB324 (Figure 4A), it is evident that the mitogenome of T. harzianum HB324 has distinctive features, which influenced the Trichoderma divergences dynamically. Taxonomic revisions of Trichoderma species are still ongoing to correctly identify members of the T. harzianum complex [77]. Obtaining improved resolutions of Trichoderma topologies, including new isolates such as the strain HB324, will be more useful for the elucidation of Trichoderma mitogenome evolutionary relationships.

4.4. Evolutionary Pressures for the Trichoderma Mitogenomes

Related with the adaptive evolution of Trichoderma mitogenomes, selective pressures were investigated using the codon alignments of 13 core PCGs (atp6, atp8, cob, cox1, cox2, cox3, nad1, nad2, nad3, nad4, nad4L, nad5, and nad6; the atp9 gene was excluded because it has not been reported in the GenBank dataset of the T. gamsii KUC1747 mitogenome (GenBank accession no. KU687109)). Compared to the Ka/Ks ratios, most of the core PCGs were assessed with values lower than one (<1), which is supposed to be under purifying selection. However, interestingly, the cox3 gene was measured with a value greater than one (1>) in a pairwise comparison between T. harzianum CBS 226.95 and T. reesei QM9414, implying a positive selection potential. When considering the Trichoderma clustering of the two BI topologies (Figure 2 and Figure 3B), the cox3 gene might be a site for adaptive changes that influenced the evolutionary speciation events between two strains, CBS 226.95 and QM9414, which were closely placed in a sister relationship within the same clade.

Another remarkable thing is that the nad6 gene of T. harzianum HB324 was shown with a Ka/Ks ratio over one (1>) against those of all other Trichoderma species, even including H. aurantius belonging to the genus Hypomyces that consisted of the family Hypocreaceae. It seems to support the potential of evolutionarily diversifying for the fungus HB324, apart from the other species within the clade of all Trichoderma taxa as shown in Figure 4A (Figure 5 and Table S11). Meanwhile, using the MEME approach on the phylogenetic lineages of Hypocreales (Figure 4A), the cox1 gene was predicted with higher individuals of 20 codon sites that were assumed to have evolved under a positive diversifying selection, in contrast to the non-sites in the atp8 gene (Table S12).

Figure 5.

Figure 5

Analysis of Ka/Ks ratios of the core PCGs. Obtained values are described in Table S11. Ka, nonsynonymous substitution rate; Ks, synonymous substitution rate.

Under intense selective pressures, organisms may evolve adaptations for their survival in certain ecological conditions, which would be achieved via site-specific substitutions of nucleotides/amino acids affecting the protein structure, further leading to changes in protein functions [78]. In a previous study of the evolution of Trichoderma habitat preferences, T. reesei was found to favor decaying plant materials rather than soil [14]. The OXPHOS pathway is strictly dependent on oxygen to generate cellular energy [79]. When considering the various habitats where Trichoderma have been isolated (e.g., soils, wood, and living plants/fungi) [14], environmental conditions, particularly concerning oxygen, might have affected the evolution of Trichoderma mitogenomes during their adaptive diversification. Our results can be proposed as clear signals of the evolving effects on the essential genes of OXPHOS, further expanding the viewpoint of molecular- and metabolic evolutions of Trichoderma mitochondria under the inherited habits of their geographic origins.

5. Conclusions

Here, a complete de novo mitogenome of an ex-neotype fungus of the T. harzianum species complex, T. harzianum CBS 226.95, is presented for the first time. Trichoderma mitogenomes were comprehensively compared to update the genomic understanding of Trichoderma mitochondria, the evolutionary divergences of Trichoderma mitogenomes were also exclusively traced using the mitochondrial core PCGs and discussed with latent genomic features triggered mitogenomic evolution in Trichoderma. Our results demonstrate that the adaptive evolution of Trichoderma mitogenomes has been actively ongoing with regard to the complex effects of various genetic elements. All of this information will allow for a better understanding of the distinct behaviors of Trichoderma species in nature, thus further expanding our insight into Trichoderma to make them more effective fungal biocontrol agents.

Acknowledgments

I wish to thank Sungpil Ryu, Yiseul Kim, and Hyobin Seo (Kyungpook National University, Korea) for their great help in analytical instruments, and Tatiana Giraud for her valuable advice on this paper.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/microorganisms9081564/s1, Figure S1. Predicted secondary structure of transfer RNA genes in the T. harzianum CBS 226.95 mitogenome.; Figure S2. Phylogenetic tree of the Hypocreales species (Sordariomycetes) mitochondrial genomes based on ML (maximum likelihood) analysis; Table S1. Genomic organization of the T. harzianum CBS 226.95 mitogenome; Table S2. Transfer RNA genes identified in the mitogenome of T. harzianum CBS 226.95; Table S3. Codon usage in the mitogenome of T. harzianum CBS 226.95; Table S4. Identification of repetitive elements in the mitogenome of T. harzianum CBS 226.95; Table S5. All Sordariomycetes mitogenomes that are used for phylogenetic analyses in this study; Table S6. Gene components for the core-PCGs and rRNAs in the Trichoderma complete mitogenomes; Table S7. Putative tRNA gene (trn) within the protein-coding gene of Trichoderma mitogenomes; Table S8. Positions of detected group I intron loci on the mitogenomes of Trichoderma species; Table S9. Comparison of the mitochondrial genomic features between T. harzianum CBS 226.95 and T. harzianum HB324; Table S10. Estimated p-distance values among the Hypocreales mitogenomes; Table S11. Ka/Ks ratios for the 13-core protein-coding genes in the Hypocreaceae mitogenomes; Table S12. Prediction of codon sites evolving under positive diversifying selection on the MEME approach.

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2018R1D1A1B07043042, NRF-2021R1I1A1A01042148).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The complete de novo mitochondrial genome sequence of Trichoderma harzianum CBS 226.95 is openly available in GenBank (accession number MN564945).

Conflicts of Interest

The author declares no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Roger A.J., Muñoz-Gómez S.A., Kamikawa R. The Origin and Diversification of Mitochondria. Curr. Biol. 2017;27:R1177–R1192. doi: 10.1016/j.cub.2017.09.015. [DOI] [PubMed] [Google Scholar]
  • 2.Burger G., Gray M.W., Lang B.F. Mitochondrial genomes: Anything goes. Trends Genet. 2003;19:709–716. doi: 10.1016/j.tig.2003.10.012. [DOI] [PubMed] [Google Scholar]
  • 3.Gray M.W., Burger G., Lang B.F. Mitochondrial Evolution. Science. 1999;283:1476–1481. doi: 10.1126/science.283.5407.1476. [DOI] [PubMed] [Google Scholar]
  • 4.Aguileta G., de Vienne D., Ross O.N., Hood M.E., Giraud T., Petit E., Gabaldón T. High Variability of Mitochondrial Gene Order among Fungi. Genome Biol. Evol. 2014;6:451–465. doi: 10.1093/gbe/evu028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Saccone C., Gissi C., Lanave C., Larizza A., Pesole G., Reyes A. Evolution of the mitochondrial genetic system: An overview. Gene. 2000;261:153–159. doi: 10.1016/S0378-1119(00)00484-4. [DOI] [PubMed] [Google Scholar]
  • 6.Naranjo-Ortiz M.Á., Gabaldón T. Fungal evolution: Cellular, genomic and metabolic complexity. Biol. Rev. 2020;95:1198–1232. doi: 10.1111/brv.12605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zardoya R. Recent advances in understanding mitochondrial genome diversity. F1000Research. 2020;9:270. doi: 10.12688/f1000research.21490.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Deng Y., Hsiang T., Li S., Lin L., Wang Q., Chen Q., Xie B., Ming R. Comparison of the Mitochondrial Genome Sequences of Six Annulohypoxylon stygium Isolates Suggests Short Fragment Insertions as a Potential Factor Leading to Larger Genomic Size. Front. Microbiol. 2018;9:2079. doi: 10.3389/fmicb.2018.02079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Fan W.-W., Zhang S., Zhang Y.-J. The complete mitochondrial genome of the Chan-hua fungus Isaria cicadae: A tale of intron evolution in Cordycipitaceae. Environ. Microbiol. 2019;21:864–879. doi: 10.1111/1462-2920.14522. [DOI] [PubMed] [Google Scholar]
  • 10.Pogoda C.S., Keepers K.G., Nadiadi A.Y., Bailey D.W., Lendemer J., Tripp E.A., Kane N.C. Genome streamlining via complete loss of introns has occurred multiple times in lichenized fungal mitochondria. Ecol. Evol. 2019;9:4245–4263. doi: 10.1002/ece3.5056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kwak Y. Complete Mitochondrial Genome of the Fungal Biocontrol Agent Trichoderma atroviride: Genomic Features, Comparative Analysis and Insight into the Mitochondrial Evolution in Trichoderma. Front. Microbiol. 2020;11:785. doi: 10.3389/fmicb.2020.00785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Druzhinina I.S., Seidl-Seiboth V., Herrera-Estrella A., Horwitz B.A., Kenerley C.M., Monte E., Mukherjee P.K., Zeilinger S., Grigoriev I., Kubicek C.P. Trichoderma: The genomics of opportunistic success. Nat. Rev. Microbiol. 2011;9:749–759. doi: 10.1038/nrmicro2637. [DOI] [PubMed] [Google Scholar]
  • 13.Druzhinina I.S., Shelest E., Kubicek C.P. Novel traits of Trichoderma predicted through the analysis of its secretome. FEMS Microbiol. Lett. 2012;337:1–9. doi: 10.1111/j.1574-6968.2012.02665.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chaverri P., Samuels G.J. Evolution of Habitat Preference and Nutrition Mode in A Cosmopolitan Fungal Genus with Evidence of Interkingdom Host Jumps and Major Shifts in Ecology. Evolution. 2013;67:2823–2837. doi: 10.1111/evo.12169. [DOI] [PubMed] [Google Scholar]
  • 15.Martinez D., Berka R.M., Henrissat B., Saloheimo M., Arvas M., Baker S., Chapman J., Chertkov O., Coutinho P.M., Cullen D., et al. Genome sequencing and analysis of the biomass-degrading fungus Trichoderma reesei (syn. Hypocrea jecorina) Nat. Biotechnol. 2008;26:553–560. doi: 10.1038/nbt1403. [DOI] [PubMed] [Google Scholar]
  • 16.Kubicek C.P., Herrera-Estrella A., Seidl-Seiboth V., Martinez D.A., Druzhinina I.S., Thon M., Zeilinger S., Casas-Flores S., Horwitz B.A., Mukherjee P.K., et al. Comparative genome sequence analysis underscores mycoparasitism as the ancestral life style of Trichoderma. Genome Biol. 2011;12:R40. doi: 10.1186/gb-2011-12-4-r40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Mukherjee P.K., Horwitz B.A., Herrera-Estrella A., Schmoll M., Kenerley C.M. Trichoderma Research in the Genome Era. Annu. Rev. Phytopathol. 2013;51:105–129. doi: 10.1146/annurev-phyto-082712-102353. [DOI] [PubMed] [Google Scholar]
  • 18.Schmoll M., Dattenböck C., Carreras-Villaseñor N., Mendoza-Mendoza A., Tisch D., Alemán M.I., Baker S.E., Brown C., Cervantes-Badillo M.G., Cetz-Chel J., et al. The Genomes of Three Uneven Siblings: Footprints of the Lifestyles of Three Trichoderma Species. Microbiol. Mol. Biol. Rev. 2016;80:205–327. doi: 10.1128/MMBR.00040-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Druzhinina I.S., Chenthamara K., Zhang J., Atanasova L., Yang D., Miao Y., Rahimi M.J., Grujic M., Cai F., Pourmehdi S., et al. Massive lateral transfer of genes encoding plant cell wall-degrading enzymes to the mycoparasitic fungus Trichoderma from its plant-associated hosts. PLoS Genet. 2018;14:e1007322. doi: 10.1371/journal.pgen.1007322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Kubicek C.P., Steindorff A.S., Chenthamara K., Manganiello G., Henrissat B., Zhang J., Cai F., Kopchinskiy A.G., Kubicek E.M., Kuo A., et al. Evolution and comparative genomics of the most common Trichoderma species. BMC Genom. 2019;20:485. doi: 10.1186/s12864-019-5680-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Rifai M.A. A revision of the genus Trichoderma. Mycol. Pap. 1969;116:1–56. [Google Scholar]
  • 22.Gams W., Meyer W. What Exactly Is Trichoderma harzianum? Mycologia. 1998;90:904–915. doi: 10.1080/00275514.1998.12026984. [DOI] [Google Scholar]
  • 23.Sambrook J., Russell D.W. Purification of Nucleic Acids by Extraction with Phenol:Chloroform. Cold Spring Harb. Protoc. 2006;2006:4455. doi: 10.1101/pdb.prot4455. [DOI] [PubMed] [Google Scholar]
  • 24.Eid J., Fehr A., Gray J., Luong K., Lyle J., Otto G., Peluso P., Rank D., Baybayan P., Bettman B., et al. Real-Time DNA Sequencing from Single Polymerase Molecules. Science. 2009;323:133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
  • 25.Chin C.-S., Alexander D.H., Marks P., Klammer A.A., Drake J., Heiner C., Clum A., Copeland A., Huddleston J., Eichler E.E., et al. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods. 2013;10:563–569. doi: 10.1038/nmeth.2474. [DOI] [PubMed] [Google Scholar]
  • 26.Chaisson M.J., Tesler G. Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): Application and theory. BMC Bioinform. 2012;13:238. doi: 10.1186/1471-2105-13-238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Krumsiek J., Arnold R., Rattei T. Gepard: A rapid and sensitive tool for creating dotplots on genome scale. Bioinformatics. 2007;23:1026–1028. doi: 10.1093/bioinformatics/btm039. [DOI] [PubMed] [Google Scholar]
  • 28.Valach M., Burger G., Gray M., Lang B.F. Widespread occurrence of organelle genome-encoded 5S rRNAs including permuted molecules. Nucleic Acids Res. 2014;42:13764–13777. doi: 10.1093/nar/gku1266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bernt M., Donath A., Jühling F., Externbrink F., Florentz C., Fritzsch G., Pütz J., Middendorf M., Stadler P.F. MITOS: Improved de novo metazoan mitochondrial genome annotation. Mol. Phylogenet. Evol. 2013;69:313–319. doi: 10.1016/j.ympev.2012.08.023. [DOI] [PubMed] [Google Scholar]
  • 30.O’Leary N.A., Wright M.W., Brister J.R., Ciufo S., Haddad D., McVeigh R., Rajput B., Robbertse B., Smith-White B., Ako-Adjei D., et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44:D733–D745. doi: 10.1093/nar/gkv1189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Coordinators N.R. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2018;46:D8–D13. doi: 10.1093/nar/gkx1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lang B.F., Laforest M.-J., Burger G. Mitochondrial introns: A critical view. Trends Genet. 2007;23:119–125. doi: 10.1016/j.tig.2007.01.006. [DOI] [PubMed] [Google Scholar]
  • 33.Chan P.P., Lowe T.M. tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences. Methods Mol. Biol. 2019;1962:1–14. doi: 10.1007/978-1-4939-9173-0_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., Jones S., Marra M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kumar S., Stecher G., Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rice P., Longden I., Bleasby A. EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. doi: 10.1016/S0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
  • 37.Perna N., Kocher T. Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes. J. Mol. Evol. 1995;41:353–358. doi: 10.1007/BF01215182. [DOI] [PubMed] [Google Scholar]
  • 38.Boratyn G.M., Camacho C., Cooper P., Coulouris G., Fong A., Ma N., Madden T.L., Matten W.T., McGinnis S.D., Merezhuk Y., et al. BLAST: A more efficient report with usability improvements. Nucleic Acids Res. 2013;41:W29–W33. doi: 10.1093/nar/gkt282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Benson G. Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573–580. doi: 10.1093/nar/27.2.573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Springe; Berlin/Heidelberg, Germany: 2016. [Google Scholar]
  • 41.R Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2017. [(accessed on 12 January 2021)]. Available online: https://www.R-project.org/ [Google Scholar]
  • 42.Sayers E.W., Cavanaugh M., Clark K., Ostell J., Pruitt K., Mizrachi I.K. GenBank. Nucleic Acids Res. 2019;47:D94–D99. doi: 10.1093/nar/gky989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Katoh K., Standley D.M. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Vaidya G., Lohman D.J., Meier R. SequenceMatrix: Concatenation software for the fast assembly of multi-gene datasets with character set and codon information. Cladistics. 2011;27:171–180. doi: 10.1111/j.1096-0031.2010.00329.x. [DOI] [PubMed] [Google Scholar]
  • 45.Darriba D., Taboada G.L., Doallo R., Posada D. jModelTest 2: More models, new heuristics and parallel computing. Nat. Methods. 2012;9:772. doi: 10.1038/nmeth.2109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Cavalli-Sforza L.L., Edwards A.W.F. Phylogenetic analysis. Models and estimation procedures. Am. J. Hum. Genet. 1967;19:233–257. [PMC free article] [PubMed] [Google Scholar]
  • 47.Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Rannala B., Yang Z. Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference. J. Mol. Evol. 1996;43:304–311. doi: 10.1007/BF02338839. [DOI] [PubMed] [Google Scholar]
  • 49.Nascimento F.F., Dos Reis M., Yang Z. A biologist’s guide to Bayesian phylogenetic analysis. Nat. Ecol. Evol. 2017;1:1446–1454. doi: 10.1038/s41559-017-0280-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Ronquist F., Teslenko M., Van Der Mark P., Ayres D.L., Darling A., Hoehna S., Larget B., Liu L., Suchard M.A., Huelsenbeck J.P. MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space. Syst. Biol. 2012;61:539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bouckaert R., Vaughan T.G., Barido-Sottani J., Duchêne S., Fourment M., Gavryushkina A., Heled J., Jones G., Kühnert D., De Maio N., et al. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 2019;15:e1006650. doi: 10.1371/journal.pcbi.1006650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Rambaut A., Drummond A.J., Xie D., Baele G., Suchard M.A. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst. Biol. 2018;67:901–904. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J.C., Guirao-Rico S., Librado P., Ramos-Onsins S., Sánchez-Gracia A. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets. Mol. Biol. Evol. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  • 54.Murrell B., Wertheim J.O., Moola S., Weighill T., Scheffler K., Pond S.L.K. Detecting Individual Sites Subject to Episodic Diversifying Selection. PLoS Genet. 2012;8:e1002764. doi: 10.1371/journal.pgen.1002764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Pond S.L.K., Frost S., Muse S.V. HyPhy: Hypothesis testing using phylogenies. Bioinformatics. 2004;21:676–679. doi: 10.1093/bioinformatics/bti079. [DOI] [PubMed] [Google Scholar]
  • 56.Fujishima K., Kanai A. tRNA gene diversity in the three domains of life. Front. Genet. 2014;5:142. doi: 10.3389/fgene.2014.00142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hamashima K., Tomita M., Kanai A. Expansion of Noncanonical V-Arm-Containing tRNAs in Eukaryotes. Mol. Biol. Evol. 2015;33:530–540. doi: 10.1093/molbev/msv253. [DOI] [PubMed] [Google Scholar]
  • 58.Sethuraman J., Majer A., Iranpour M., Hausner G. Molecular Evolution of the mtDNA Encoded rps3 Gene Among Filamentous Ascomycetes Fungi with an Emphasis on the Ophiostomatoid Fungi. J. Mol. Evol. 2009;69:372–385. doi: 10.1007/s00239-009-9291-9. [DOI] [PubMed] [Google Scholar]
  • 59.Doublet V., Ubrig E., Alioua A., Bouchon D., Marcadé I., Maréchal-Drouard L. Large gene overlaps and tRNA processing in the compact mitochondrial genome of the crustacean Armadillidium vulgare. RNA Biol. 2015;12:1159–1168. doi: 10.1080/15476286.2015.1090078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Salinas-Giegé T., Giegé R., Giegé P. tRNA Biology in Mitochondria. Int. J. Mol. Sci. 2015;16:4518–4559. doi: 10.3390/ijms16034518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Romanova E.V., Bukin Y.S., Mikhailov K.V., Logacheva M.D., Aleoshin V., Sherbakov D.Y. Hidden cases of tRNA gene duplication and remolding in mitochondrial genomes of amphipods. Mol. Phylogenet. Evol. 2020;144:106710. doi: 10.1016/j.ympev.2019.106710. [DOI] [PubMed] [Google Scholar]
  • 62.Smith D.R., Keeling P. Mitochondrial and plastid genome architecture: Reoccurring themes, but significant differences at the extremes. Proc. Natl. Acad. Sci. USA. 2015;112:10177–10184. doi: 10.1073/pnas.1422049112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Hurst G.D.D., Werren J. The role of selfish genetic elements in eukaryotic evolution. Nat. Rev. Genet. 2001;2:597–606. doi: 10.1038/35084545. [DOI] [PubMed] [Google Scholar]
  • 64.Haugen P., Simon D.M., Bhattacharya D. The natural history of group I introns. Trends Genet. 2005;21:111–119. doi: 10.1016/j.tig.2004.12.007. [DOI] [PubMed] [Google Scholar]
  • 65.Wang L., Zhang S., Li J.-H., Zhang Y.-J. Mitochondrial genome, comparative analysis and evolutionary insights into the entomopathogenic fungus Hirsutella thompsonii. Environ. Microbiol. 2018;20:3393–3405. doi: 10.1111/1462-2920.14379. [DOI] [PubMed] [Google Scholar]
  • 66.Lynch M. Streamlining and Simplification of Microbial Genome Architecture. Annu. Rev. Microbiol. 2006;60:327–349. doi: 10.1146/annurev.micro.60.080805.142300. [DOI] [PubMed] [Google Scholar]
  • 67.Koonin E.V. The origin of introns and their role in eukaryogenesis: A compromise solution to the introns-early versus introns-late debate? Biol. Direct. 2006;1:22. doi: 10.1186/1745-6150-1-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Fonseca P.L.C., Badotti F., De Paula R., Araújo D., Bortolini D.E., Del-Bem L.-E., Azevedo V.A., Brenig B., Aguiar E., Góes-Neto A. Exploring the Relationship Among Divergence Time and Coding and Non-coding Elements in the Shaping of Fungal Mitochondrial Genomes. Front. Microbiol. 2020;11:765. doi: 10.3389/fmicb.2020.00765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Persoon C.H. Dispositio methodica fungorum. Neues Mag. Bot. 1794;1:81–128. [Google Scholar]
  • 70.Samuels G.J. Trichoderma: Systematics, the Sexual State, and Ecology. Phytopathology. 2006;96:195–206. doi: 10.1094/PHYTO-96-0195. [DOI] [PubMed] [Google Scholar]
  • 71.Kullnig C.M., Krupica T., Woo S.L., Mach R.L., Rey M., Benítez T., Lorito M., Kubicek C.P. Confusion Abounds Over Identities of Trichoderma Biocontrol Isolates. Mycol. Res. 2001;105:770–772. doi: 10.1017/S0953756201229967. [DOI] [Google Scholar]
  • 72.Druzhinina I.S., Kopchinskiy A.G., Kubicek C.P. The first 100 Trichoderma species characterized by molecular data. Mycoscience. 2006;47:55–64. doi: 10.1007/S10267-006-0279-7. [DOI] [Google Scholar]
  • 73.Schuster A., Schmoll M. Biology and biotechnology of Trichoderma. Appl. Microbiol. Biotechnol. 2010;87:787–799. doi: 10.1007/s00253-010-2632-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Chaverri P., Branco-Rocha F., Jaklitsch W.M., Gazis R., Degenkolb T., Samuels G.J. Systematics of the Trichoderma harzianum species complex and the re-identification of commercial biocontrol strains. Mycologia. 2015;107:558–590. doi: 10.3852/14-147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ban S., Yamaguchi K., Okane I., Nakagiri A., Tabuchi Y., Genra M. Re-identification of Hypocrea/Trichoderma strains preserved at the NBRC collection. Microbiol. Cult. Coll. 2010;26:119–124. [Google Scholar]
  • 76.Vaz A.B.M., Fonseca P., Badotti F., Skaltsas D., Tomé L.M.R., Silva A.C., Cunha M.C., Soares M.A., Santos V.L., Oliveira G., et al. A multiscale study of fungal endophyte communities of the foliar endosphere of native rubber trees in Eastern Amazon. Sci. Rep. 2018;8:16151. doi: 10.1038/s41598-018-34619-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Bissett J., Gams W., Jaklitsch W.M., Samuels G.J. Accepted Trichoderma names in the year 2015. IMA Fungus. 2015;6:263–295. doi: 10.5598/imafungus.2015.06.02.02. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Almeida D., Maldonado E., Vasconcelos V., Antunes A. Adaptation of the Mitochondrial Genome in Cephalopods: Enhancing Proton Translocation Channels and the Subunit Interactions. PLoS ONE. 2015;10:e0135405. doi: 10.1371/journal.pone.0135405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Lavín J.L., Oguiza J.A., Ramírez L., Pisabarro A.G. Comparative genomics of the oxidative phosphorylation system in fungi. Fungal Genet. Biol. 2008;45:1248–1256. doi: 10.1016/j.fgb.2008.06.005. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The complete de novo mitochondrial genome sequence of Trichoderma harzianum CBS 226.95 is openly available in GenBank (accession number MN564945).


Articles from Microorganisms are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES