Abstract
Fungi produce a remarkable diversity of secondary metabolites: small, bioactive molecules not required for growth but which are essential to their ecological interactions with other organisms. Genes that participate in the same secondary metabolic pathway typically reside next to each other in fungal genomes and form biosynthetic gene clusters (BGCs). By synthesizing state-of-the-art knowledge on the evolution of BGCs in fungi, we propose that fungal chemodiversity stems from three molecular evolutionary processes involving BGCs: functional divergence, horizontal transfer, and de novo assembly. We provide examples of how these processes have contributed to the generation of fungal chemodiversity, discuss their relative importance, and outline major, outstanding questions in the field.
Keywords: natural product, mycotoxin, antibiotic, secondary metabolic gene cluster, lateral transfer, fungal genomics, fungal chemical diversity, ecological interactions, fungal communication
1. Fungal biosynthetic gene clusters produce diverse secondary metabolites of broad ecological importance and human relevance
Fungi produce a remarkable diversity of secondary metabolites,1 also known as natural products, such as the immunosuppressant cyclosporin,2 the cholesterol reducing lovastatin,3 the antibiotic penicillin,4 the hallucinogenic prodrug psilocybin,5 and the mycotoxins trichothecene6 and aflatoxin7 (Fig. 1). Although these small molecules are not required for fungal survival and growth, their bioactive properties render them highly relevant to human affairs as drugs, toxins, and pigments. But arguably their raison d’être is to act as crucial intermediaries at the front line of fungal ecology. Numerous secondary metabolites are thought to play key roles in shaping the interactions that fungi have with other organisms across the tree of life, including with other fungi,8 bacteria,9, 10 plants,11, 12 or animals.13–15 These interactions are varied, and include virulence, defense, quorum sensing, protection, nutrient acquisition and the promotion of growth (Fig. 2).
Figure 1. Select examples of fungal BGCs, their secondary metabolites, and the organisms that produce them.
Genes are represented by arrows; Genes colored maroon denote secondary metabolite backbone biosynthesis genes (such as polyketide synthases, terpene synthases, and non-ribosomal peptide synthases), whereas genes colored grey denote BGC genes with diverse functions, such as metabolite modification, metabolite transport, regulation of BGC expression, and resistance to secondary metabolite activity. Note that psilocybin biosynthesis does not require any of the canonical backbone biosynthesis genes. Data from: Cyclosporin BGC2, lovastatin BGC3, trichothecene T-2 toxin BGC6, aflatoxin BGC7, penicillin BGC4, and psilocybin BGC5, 16.
Figure 2. Secondary metabolites are central to the ecology of many fungi and shape their diverse interactions with other organisms.
Penicillin is an antibiotic whose ecological role lies in fungal defense against bacteria,10 6-n-pentyl-6H-pyran-2-one (6-PP) promotes plant growth,12 butyrolactone I is a quorum sensing molecule,30 gliotoxin is a virulence factor,31 DHN-melanin protects again UV light damage,32 and enterobactin is an iron uptake molecule that contributes to the acquisition of nutrients.33
Most fungal secondary metabolites are encoded by biosynthetic gene clusters (BGCs; Fig. 1); each cluster typically contains the majority, if not all, of the genes participating in the production of a given secondary metabolite, with these genes located adjacent to each other (i.e., “clustered”) in the genome.1, 17–19 A typical fungal BGC contains one or more genes whose protein products catalyze the synthesis of the backbone of the metabolite (such as polyketide synthases, non-ribosomal peptide synthases, and terpene synthases), and one or more genes encoding for: i) enzymes (such as epimerases, methyltransferases, and hydroxylases) that modify this backbone, ii) proteins involved in metabolite transport, iii) transcription factors involved in regulation of BGC expression, and iv) proteins that confer resistance to the activity of the secondary metabolite.1, 17 Fungal BGCs are generally similar in their genomic organization to bacterial BGCs; the key difference is that bacterial BGCs are typically organized into operons (where multiple genes are transcribed into a single messenger RNA), whereas fungal BGCs are typically transcribed individually.20, 21
Notable secondary metabolites produced by diverse backbone biosynthesis genes and BGCs include: cyclosporin, a non-ribosomal peptide biosynthesized by a 14-gene BGC in the ascomycete fungus Tolypocladium inflatum;2 lovastatin, a polyketide biosynthesized by an 18-gene BGC in the mold Aspergillus terreus;3 the trichothecene T-2 toxin, a terpene biosynthesized by a 12-gene BGC and a 2-gene BGC found in several Fusarium species;6 aflatoxin, a polyketide biosynthesized by a 25-gene BGC in the mold Aspergillus flavus and its close relatives;7 penicillin, a non-ribosomal peptide biosynthesized by a 3-gene BGC in molds in the genera Penicillium and Aspergillus;4 and psilocybin, a tryptamine-derived secondary metabolite biosynthesized by a 9-gene BGC in several different basidiomycete genera whose biosynthesis does not require any of the canonical backbone biosynthesis genes (Fig. 1).5, 16 A comprehensive and up to date compilation of fungal BGCs whose secondary metabolite products have been functionally validated can be found at the MIBiG (Minimum Information about Biosynthetic Gene cluster) repository.22, 23
BGCs vary widely in their numbers across fungal genomes; whereas ascomycete filamentous fungi and basidiomycete fungi typically contain dozens (if not scores) of BGCs, unicellular yeasts in both lineages either lack BGCs altogether or contain very few.17, 24, 25 A given BGC is often known from only a single species or a few closely related ones, but broadly and discontinuously distributed BGCs, such as sterigmatocystin,26 also exist. Additionally, BGCs and their secondary metabolites also show extensive variation in their presence / absence patterns within fungal species.17, 27–29
A notable feature of BGCs, hinted at by their high variability and narrow taxonomic range, is that they are rapidly evolving.17, 34 Why is that so? From a molecular perspective, it has been argued that the lower specificity of secondary metabolic enzymes means that new gene duplicates are more likely to catalyze novel substrates and produce novel products that may be favored by natural selection, accelerating their evolution.35 Additionally, BGCs often reside in fast-evolving genomic regions, such as near the ends of chromosomes36 or in accessory chromosomes.37 From an ecological perspective, the involvement of secondary metabolites in mediating interspecific interactions suggests that they are key in “arms races” between fungi and their competitors, which are thought to accelerate evolutionary rates of the genes involved.38 But secondary metabolite biosynthesis is also energetically costly. Thus, loss of the ability to produce a secondary metabolite and reliance on other fungal relatives in the community for its production may be, at least up to a point, advantageous to individual organisms39, 40 and further increase the rate of BGC evolution.17
One important question raised by considering the ecological relevance of fungal secondary metabolites, the narrowness of their taxonomic distribution, and the fast pace of BGC evolution, concerns the molecular evolutionary processes that give rise to fungal chemodiversity. In this highlight, we suggest that there are three major molecular evolutionary processes that occur at the level of BGCs and which give rise to fungal chemodiversity: functional divergence, horizontal or lateral transfer, and de novo assembly (see glossary in Table 1 for definitions of these terms). While the focus of our highlight is on discussing how variation at the level of BGCs gives rise to variation in secondary metabolism or chemodiversity, we note that all genetic variation at the level of BGCs occurs via the standard battery of mutational types, such as point mutations, insertions, deletions, rearrangements, duplications, and horizontal gene transfer (see glossary in Table 1 for definitions). All of these types of mutations are well established and known to influence fungal genes, genomes, and BGCs.20, 27, 29, 34, 40
Table 1.
Glossary of evolutionary terms
| Term | Definition |
|---|---|
| BGC de novo assembly | Refers to the process by which an entire BGC is evolutionarily assembled through the recruitment and relocation of native genes, duplicates of native genes, and horizontally acquired genes |
| BGC duplication | Refers to the generation of an additional (duplicate) copy of an entire BGC in the genome |
| BGC functional divergence | Refers to the process by which homologous BGCs, through the accumulation of genetic changes, gradually diverge in their functions (i.e., in the secondary metabolites they produce) |
| BGC horizontal or lateral transfer | Refers to the process by which an entire BGC from the genome of one organism is transferred and stably integrated into the genome of another through non-reproduction related mechanisms |
| Deletion | Type of mutation, which stems from the deletion of genetic material in the genome |
| Duplication | Refers to the generation of an additional (duplicate) copy of genetic material in the genome |
| Homology / homologous | In the context of genes, two genes are homologous if their origins can be traced to the same common ancestor. Homologous genes can originate via processes such as gene duplication (in which case they are paralogs) and vertical descent / speciation (in which case they are orthologs) |
| Horizontal / lateral gene transfer | Refers to the transfer and integration of genetical material from the genome of one organism to the genome of another through non-reproduction related mechanisms |
| Insertion | Type of mutation, which stems from the insertion of genetic material in the genome |
| Orthology / orthologous | In the context of genes, two genes are orthologous if they originated via vertical descent / speciation, i.e., if they stem from the same ancestral gene that was present in the last common ancestor of the strains / species being compared |
| Paralogy / paralogous | In the context of genes, two genes are paralogous if they originated via gene duplication |
| Point mutation | Type of mutation, which stems from the replacement of one nucleotide base pair by another |
| Rearrangement | Type of mutation, which stems from the rearrangement of genetic material in the genome |
2. The evolutionary processes underlying fungal chemodiversity
2.1. BGC functional divergence
Functional divergence is the process by which the accumulation of molecular differences between evolutionarily related or homologous (see glossary in Table 1) genes and pathways leads to a change in their function or phenotype. In the context of BGCs, functional divergence refers to the accumulation of molecular differences between the gene sequences of homologous BGCs that then give rise to chemical differences in their secondary metabolite products and generate secondary metabolite structural diversity. Functional divergence has influenced both the evolution of orthologous (see glossary in Table 1) BGCs that have originated through speciation events as well as paralogous (see glossary in Table 1) BGCs that have originated through duplication events.
2.1.1. Functional divergence of orthologous BGCs
Orthologous BGCs can functionally diverge via the accumulation of amino acid differences in the enzymes encoded by BGCs. For example, the chemodiversity of fumonisin mycotoxins among Fusarium fungi stems from amino acid sequence variation in a protein encoded by a single gene from the fumonisin BGC (Fig. 3A).41 Some Fusarium species, such as Fusarium verticillioides, are known to produce primarily fumonisin B, whereas other species, such as Fusarium oxysporum, produce primarily fumonisin C. The only difference in the structures of fumonisin B and C is in the length of their backbones; the fumonisin B backbone is 20 carbon atoms long, whereas the backbone of fumonisin C is 19 carbon atoms long. Comparison of the fumonisin BGCs in F. verticillioides and F. oxysporum showed that the two species contain orthologous BGCs with the same 19 (orthologous) genes; gene swapping experiments further showed that sequence variation within the fum8 gene, which encodes for an α-oxoamine synthase, is responsible for the observed difference in the type of fumonisin (B or C) produced by the two species.41 The precise amino acid difference(s) between the F. verticillioides and F. oxysporum Fum8 protein orthologs responsible for observed divergence in fumonisin structure are not known and the two orthologs exhibit 91% similarity in their amino acid sequences.41 However, it appears that the F. verticillioides Fum8 enzyme preferentially binds the amino acid alanine (and catalyzes its condensation to an 18-carbon linear polyketide to produce the 20-carbon-long fumonisin B), whereas the F. oxysporum Fum8 preferentially binds glycine, resulting in the production of the 19-carbon-long fumonisin C.41 Sequence comparisons of Fusarium α-oxoamine synthase sequences show that the amino acid residue at position 580 of the protein is strongly associated with the type of fumonisin produced; presence of alanine at position 580 is associated with fumonisin B production, whereas presence of valine at the same position is associated with fumonisin C production.42 Consistent with this association, mutations of this residue in human α-oxoamine synthase have been shown to alter the enzyme’s binding affinity to its amino acid substrate.43
Figure 3. Fumonisins and sterigmatocystin / aflatoxins; two notable examples of fungal chemodiversity that stems from the functional divergence of orthologous BGCs.
Genes are represented by arrows. Lines between genes from different species refer to orthologous genes.
Alternatively, orthologous BGCs can functionally diverge through gains and losses of genes (Fig. 3B). For example, some Aspergillus species, such as Aspergillus flavus, produce the mycotoxin aflatoxin, whereas other species, including Aspergillus nidulans, produce the mycotoxin sterigmatocystin. The two mycotoxins, as well as their BGCs, are similar to each other. The difference in the mycotoxin produced is due to at least three genes (aflP, aflU, and aflQ; shown in bold in Fig. 3B) present in the aflatoxin BGC that are not found in the sterigmatocystin BGC. The AflP protein is an O-methyltransferase that converts sterigmatocystin to O-methylsterigmatocystin, whereas the cytochrome P450 monooxygenase AflU and the P-450 monooxygenase AflQ catalyze the conversion of O-methylsterigmatocystin to aflatoxin G and aflatoxin B, respectively.7, 44 Note that the differences in gene content between the aflatoxin and sterigmatocystin BGCs include additional genes (Fig. 3B); however, only aflP, aflU, and aflQ have been shown to be involved in the conversion of sterigmatocystin to the aflatoxins.
Finally, some orthologous BGCs have functionally diverged through both the accumulation of amino acid differences in the protein products of their genes as well as through gains and losses of genes. The combined effect of these two processes is thought to account for the observed structural diversity of yanuthone antimicrobial compounds in Penicillium molds45, as well as for the diversity of the echinocandin class of antifungal drugs,46 trichothecene mycotoxins,47 and ergot alkaloids48 produced by diverse fungi.
2.1.2. Functional divergence of paralogous BGCs
BGC functional divergence that gives rise to the evolution of new secondary metabolites can also occur via the duplication of genomic regions containing entire BGCs. Even though duplication of genes in BGCs has been widely documented and it is now well established that gene duplication is a major driver of both the diversity of individual backbone genes present in BGCs49, 50 as well as of genes in BGCs in general,34 much less is known about the duplication of entire BGCs.
An example of BGC duplication concerns the duplication of two polyketide-producing BGCs, Pks1-gc and Pks2-gc, in Metarhizium entomopathogenic fungi, one of which is known to produce an anthraquinone derivative.51 Genomic and functional analyses of the two paralogous BGCs show that they have functionally diverged through the reciprocal loss of genes in each BGC as well as through the accumulation of substitutions in both the promoter and protein-coding regions of their polyketide synthase genes (Fig. 4).51 Interestingly, the only shared paralogous gene pair between the Pks1-gc and the Pks2-gc is the Pks1 – Pks2 pair. In contrast, the Pks1-gc and the Pks2-gc BGCs share two and three homologous genes, respectively, with the A. fumigatus conidial pigment BGC (Fig. 4). Consistent with these differences in gene sequence and content, the two BGCs show distinct expression patterns (the genes of Pks1-gc are expressed during asexual spore formation, whereas the genes of Pks2-gc are expressed during the establishment of infection in insects) and produce distinct secondary metabolites. The anthraquinone derivative product of Pks1-gc is involved in the pigmentation of asexual spores and in abiotic stress tolerance, such as tolerance to UV light, whereas the uncharacterized product of Pks2-gc appears to contribute to pathogenicity and not pigmentation or abiotic stress.51
Figure 4. Duplication and subsequent functional divergence of Pks1-gc and Pks2-gc, two paralogous polyketide BGCs present in Metarhizium insect pathogens.
Two polyketide synthase-containing BGCs, Pks1-gc and Pks2-gc, in Metarhizium robertsii appear to be the result of an ancient duplication of an entire BGC that likely resembled the conidial pigment BGC found in the mold Aspergillus fumigatus. The Pks1-gc BGC produces an anthraquinone derivative, whereas the product of the Pks2-gc has yet to be characterized.51 Data from: M. robertsii Pks1-gc BGC51 and chemical structure; M. robertsii Pks2-gc BGC;51 and A. fumigatus conidial pigment BGC.52 Lines between genes from different species refer to orthologous genes.
Another example of functional divergence of paralogous BGCs are the patulin and yanuthone BGCs in Penicillium molds, whose secondary metabolite products share a 6-methylsalicylic acid (6-MSA) core.45 The 15-gene patulin BGC and the 10-gene yanuthone BGCs contain several pairs of paralogous genes thought to catalyze the same reactions leading to the formation of the 6-MSA core structure as well as several additional genes that lack sequence similarity to genes in the other BGC and are presumably responsible for the structural differences between patulin and yanuthones.45 Thus, a proto-BGC responsible for the production of 6-MSA likely originated and duplicated prior to the origin of Penicillium, followed by additional recruitment of non-homologous genes in both BGCs. Interestingly, phylogenetic analysis of the 6-MSA synthase protein suggests that the patulin 6-MSA synthase is more closely related to the 6-MSA synthases found in the aculinic acid BGC from Aspergillus aculeatus53 and in the terreic acid BGC in Aspergillus terreus,54 both of which produce 6-MSA-based secondary metabolites.45 Thus, the duplication and subsequent functional divergence of the patulin and yanuthone BGCs is part of a broader series of duplication and functional divergence events of 6-MSA-based BGCs.
2.3. BGC horizontal transfer
Fungal chemodiversity can also originate via the horizontal transfer of entire BGCs from other organisms.55 For example, horizontal transfer of the sterigmatocystin BGC from Aspergillus to Podospora resulted in the ability of the latter to produce sterigmatocystin (Fig. 5).56, 57 In the aftermath of horizontal transfer, the acquired BGCs can accumulate changes in their sequence and genomic organization without altering the structure of the metabolic product. For example, the average amino acid sequence similarity between the proteins encoded by the Aspergillus nidulans and Podospora anserina sterigmatocystin BGCs is 63% and the two BGCs also differ somewhat in their genomic organization, yet both produce the same metabolite. Thus, in contrast to BGC functional divergence (section 2.2) and BGC de novo assembly (section 2.4 below), both of which result in BGCs that produce new compounds, BGC horizontal transfer typically results in the production of an existing compound in a new, typically distantly related, organism.
Figure 5. Horizontal transfer of the sterigmatocystin BGC from Aspergillus to Podospora resulted in the presence of the sterigmatocystin BGC in the Podospora genome and its ability to produce sterigmatocystin.
Evolutionary analyses of the history of the genes in the sterigmatocystin BGC suggest that the Podospora BGC was horizontally acquired from an Aspergillus ancestor.56 Subsequent functional and chemical studies have validated that Podospora fungi produce the sterigmatocystin mycotoxin.10, 57, 65 Large orthologous blocks of genetic sequence are depicted using orange trapezoids.
In the last decade, several examples of BGC horizontal transfer have been reported; most transfers of entire BGCs are between fungi, such as the transfers of the BGC for the pigment bikaverin from the ascomycete genus Fusarium to that of Botrytis,58–60 of the BGC for the hallucinogen psilocybin among basidiomycete fungi,16 of the fumonisin BGC across Fusarium species,42 of the chaetoglobosin-like BGC from Penicillium to Mycosphaerella populorum,61 or the multiple transfers of the BGC for the histone deacetylase inhibitor depudecin among ascomycete fungi.62 In contrast, horizontal transfer of entire BGCs from bacteria, the lineage in which secondary metabolism first originated,63 appears to be less common and only one clear-cut example of transfer of the siderophore enterobactin from enterobacteria to budding yeasts is known to date.33
The examples discussed above all concern transfers of BGCs in the absence of functional divergence (i.e., the same secondary metabolite is produced in both the donor and the recipient organisms). The identification of examples of BGCs that functionally diverged after HGT is more challenging because, following functional divergence, donor and recipient BGCs can exhibit substantial divergence in gene content and arrangement.55 BGC horizontal transfer followed by functional divergence is thought to account for the diversification of epipolythiodioxopiperazine (ETP) mycotoxins, such as gliotoxin, sirodesmin and their relatives.64
2.4. De novo BGC assembly
The final, and least well-documented, evolutionary process involved in the generation of fungal chemodiversity is de novo BGC assembly, under which new secondary metabolites originate from scratch in fungal genomes. The genes that become part of the newly formed secondary metabolic pathway originate either through duplication and relocation of native genes or through horizontal acquisition. One important source of genes for BGCs are duplicates of genes encoding for enzymes already involved in primary and secondary metabolism, such as the isopropyl-malate synthase gene in the echinocandin BGC of Aspergillus rugulosus,66 and the citrate synthase gene in the zaragozic acid BGC of Curvularia lunata.67
De novo assembled BGCs are unlikely to be highly similar in their gene or sequence content to already existing BGCs, making their identification through comparisons of genome sequences (the major way all cases of BGC functional divergence and BGC horizontal transfer have been identified) much more challenging. Several lines of evidence support that this mechanism also gives rise to fungal BGCs. The same general evolutionary process of de novo pathway assembly is thought to be responsible for the origin of novel pathways that break down anthropogenic chemicals68 as well as of certain catabolic pathways.11, 69
De novo secondary metabolic pathways may originate in a similar manner via a two-step process; step one involves the assembly of the secondary metabolic pathway through the recruitment of native genes, duplicates of native genes, and horizontally acquired genes, and step two involves their clustering into a BGC. Consistent with this model, several fungal secondary metabolic pathways are comprised of two or more BGCs,17 suggesting that the clustering of fungal secondary metabolic pathways is not an absolute requirement for their function. For example, a 12-gene and a 2-gene BGC found in distinct genomic locations are involved in the biosynthesis of the trichothecene T-2 toxin in F. graminearum (Fig. 1).6 Additionally, several BGCs contain distinct smaller clusters of genes (often referred to as modules) responsible for the production of functional intermediates within the pathway, suggesting that the entire BGC evolved via the merging of distinct, pre-existing smaller BGCs. For example, BGCs associated with the production of echinocandins typically contain a 4-gene cluster for the production of L-homotyrosine, one of the intermediates required for echinocandin biosynthesis.46 Similarly, the genes of BGCs responsible for the production of distinct secondary metabolites can be intertwined in the genome, as in the case of the fumagillin and pseurotin BGCs in A. fumigatus,70 providing empirical evidence of the evolutionary merging of distinct BGCs.
The second line of evidence is that gene relocation has been implicated in the diversification of BGCs, such as the expansion of a trichothecene BGC in Fusarium species.71 Perhaps the best candidate of de novo assembly of a BGC involved in secondary metabolism is the fumonisin BGC found in certain Fusarium and Aspergillus species.72 While the presence of the BGC in Aspergillus is best explained by horizontal transfer from another fungus, one hypothesis for the origin of the Fusarium BGC, based on examination of phylogenies of genes in the BGC, is that it arose through the relocation and clustering of genes that were originally dispersed in the genome.72
3. Perspective and Major unanswered questions
Even though the remarkable breadth of fungal chemodiversity was well appreciated before the advent of the genomics revolution,73 the sequencing of diverse fungal genomes from 2003 onward quickly began revealing that fungal genomes contained even larger numbers of BGCs responsible for the biosynthesis of yet-unknown secondary metabolite products and provided unprecedented opportunities for studying the origins and evolution of fungal chemodiversity at the DNA sequence level.17 Currently, the molecular evolutionary processes by which fungal BGCs evolve are becoming established (Fig. 3) and the relationship between chemical diversity and BGC diversity for several secondary metabolites is being increasingly refined.47
Comparison of the genetic and evolutionary mechanisms underpinning the evolution of fungal chemodiversity with those inferred from the study of bacterial chemodiversity74, 75 suggests that similar mechanisms operate in both lineages. Arguably the biggest difference is the extent of the contribution of BGC horizontal transfer in driving chemodiversity in the two lineages. Although the role of BGC horizontal transfer is increasingly appreciated in fungi (see section 2.3), bacterial BGC horizontal transfer occurs at far higher rates and plays a bigger role in shaping bacterial chemodiversity.74
While the major contours of the molecular evolutionary basis of fungal chemodiversity are increasingly well understood, several major outstanding questions and opportunities remain. For example, we still lack an understanding of why fungal secondary metabolic pathways are typically arranged in the genome as BGCs (three genetic models, namely co-regulation, genetic linkage, and selfishness, and one phenotypic model, namely toxicity avoidance, have been put forward as explanations)17, 19, 76 and whether this clustering is associated with fungal chemodiversity. We similarly lack a complete knowledge of the distribution and genomic arrangement of secondary metabolic pathways in fungal genomes, especially from less-studied and less-sequenced lineages located outside a few select genera of filamentous fungi (e.g., Aspergillus, Fusarium, Penicillium) from the phylum Ascomycota.77
In the context of this highlight article, arguably the biggest challenges and opportunities lie in uncovering examples of de novo BGC assembly, understanding the relative contribution of the three different processes in sculpting BGC diversity, and elucidating how this diversity translates to chemodiversity. Recently developed computational algorithms now allow the construction of networks of fungal BGCs on the basis of their sequence similarity and gene order, enabling the grouping of BGCs into BGC families, of families into clans, and so on.78 Reconciling this network view of BGC evolution with the evolutionary processes that we discuss promises to illuminate their relative importance in sculpting BGC diversity and how that translated to chemodiversity. For example, a recent examination of 37 Aspergillus and Penicillium genomes identified more than 2,700 BGCs that could be grouped into 455 BGC families that presumably produce distinct groups of secondary metabolites; strikingly, nearly half of these families contained only a single BGC.79 How did these single-BGC families originate and how common are they when the entirety of fungal genomes is examined? And how do these 455 BGC families relate to the ~15,600 described fungal secondary metabolites?10 These are exciting questions but also non-trivial to address, not least because of the challenges associated with handling and analyzing the ever increasing volume of publicly available fungal genomes (there are 5,064 draft fungal genomes in GenBank as of October 30, 2019).
But the opportunity does not stop here; by considering the mechanisms that give rise to BGC diversity we begin to set the foundations of an evolutionary framework to bridge genotype (BGCs) with chemotype (their secondary metabolites). Establishing such a framework will not only advance our understanding of how genomic diversity translates to chemodiversity, but will also be useful in genetic engineering- and directed evolution-based efforts to discover and produce new leads in the pharmaceutical and agrochemical research areas.45 Connecting BGC diversity with chemodiversity, and elucidating the relationship between BGC sequence divergence and chemical structure divergence, is even more daunting due to the current lack of structures for most fungal BGCs17 and vice versa (i.e., the BGCs responsible for making most fungal secondary metabolites are unknown). With the sequences of tens of thousands of fungal BGCs, thousands of fungal secondary metabolite chemical structures, and a smorgasbord of novel synthetic biology, chemical, and bioinformatic tools that accelerate the discovery of new secondary metabolites80 at hand, exciting discoveries lay ahead.
5. Acknowledgements
We are grateful to Marnix Medema for the invitation to contribute this article, as well as to two anonymous reviewers and an editorial board member for their constructive feedback. We thank Sonja Knowles from UNCG for assistance with the structural drawings. This work was supported by a Vanderbilt University Discovery Grant, by the Howard Hughes Medical Institute through the James H. Gilliam Fellowships for Advanced Study program (J.L.S. and A.R.), and a Regular Faculty Grant (from the University of North Carolina at Greensboro). Research in A.R.’s lab is also supported by the National Science Foundation (DEB1442113), the Guggenheim Foundation, and the Burroughs Wellcome Fund.. Research on bioactive fungal metabolites in N.H.O.’s lab is also supported by the National Cancer Institute (P01 CA125066).
Footnotes
Conflicts of interest
There are no conflicts of interest to declare.
6. References
- 1.Keller NP, Nat. Rev. Microbiol, 2019, 17, 167–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bushley KE, Raja R, Jaiswal P, Cumbie JS, Nonogaki M, Boyd AE, Owensby CA, Knaus BJ, Elser J, Miller D, Di Y, McPhail KL and Spatafora JW, PLoS Genet, 2013, 9, e1003496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Mulder KC, Mulinari F, Franco OL, Soares MS, Magalhaes BS and Parachin NS, Biotechnol Adv, 2015, 33, 648–665. [DOI] [PubMed] [Google Scholar]
- 4.Gutierrez S, Fierro F, Casqueiro J and Martin JF, Antonie Van Leeuwenhoek, 1999, 75, 81–94. [DOI] [PubMed] [Google Scholar]
- 5.Fricke J, Blei F and Hoffmeister D, Angew Chem Int Ed Engl, 2017, 56, 12352–12355. [DOI] [PubMed] [Google Scholar]
- 6.Alexander NJ, Proctor RH and McCormick SP, Toxin Rev, 2009, 28, 198–215. [Google Scholar]
- 7.Yu J, Chang PK, Ehrlich KC, Cary JW, Bhatnagar D, Cleveland TE, Payne GA, Linz JE, Woloshuk CP and Bennett JW, Appl. Environ. Microbiol, 2004, 70, 1253–1262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Knowles SL, Raja HA, Wright AJ, Lee AML, Caesar LK, Cech NB, Mead ME, Steenwyk JL, Ries LNA, Goldman GH, Rokas A and Oberlies NH, Front Microbiol, 2019, 10, 285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Van Arnam EB, Currie CR and Clardy J, Chem Soc Rev, 2018, 47, 1638–1651. [DOI] [PubMed] [Google Scholar]
- 10.Bills GF and Gloer JB, Microbiol Spectr, 2016, 4. [DOI] [PubMed] [Google Scholar]
- 11.Gluck-Thaler E, Vijayakumar V and Slot JC, Mol. Ecol, 2018, 27, 5120–5136. [DOI] [PubMed] [Google Scholar]
- 12.Vinale F, Sivasithamparam K, Ghisalberti EL, Marra R, Barbetti MJ, Li H, Woo SL and Lorito M, Physiol Mol Plant P, 2008, 72, 80–86. [Google Scholar]
- 13.Drott MT, Debenport T, Higgins SA, Buckley DH and Milgroom MG, mBio, 2019, 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Caballero Ortiz S, Trienens M, Pfohl K, Karlovsky P, Holighaus G and Rohlfs M, Ecol Evol, 2018, 8, 4328–4339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rohlfs M, Front Microbiol, 2014, 5, 788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Reynolds HT, Vijayakumar V, Gluck-Thaler E, Korotkin HB, Matheny PB and Slot JC, Evolution Letters, 2018, DOI: 10.1002/evl3.42, in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Rokas A, Wisecaver JH and Lind AL, Nat. Rev. Microbiol, 2018, 16, 731–744. [DOI] [PubMed] [Google Scholar]
- 18.Slot JC, Adv Genet, 2017, 100, 141–178. [DOI] [PubMed] [Google Scholar]
- 19.Nutzmann HW, Scazzocchio C and Osbourn A, Ann. Rev. Genet, 2018, 52, 159–183. [DOI] [PubMed] [Google Scholar]
- 20.Osbourn A, Trends Genet, 2010, 26, 449–457. [DOI] [PubMed] [Google Scholar]
- 21.Yue Q, Chen L, Li Y, Bills GF, Zhang X, Xiang M, Li S, Che Y, Wang C, Niu X, An Z and Liu X, MBio, 2015, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Li YF, Tsai KJS, Harvey CJB, Li JJ, Ary BE, Berlew EE, Boehman BL, Findley DM, Friant AG, Gardner CA, Gould MP, Ha JH, Lilley BK, McKinstry EL, Nawal S, Parry RC, Rothchild KW, Silbert SD, Tentilucci MD, Thurston AM, Wai RB, Yoon Y, Aiyar RS, Medema MH, Hillenmeyer ME and Charkoudian LK, Fungal Genet. Biol, 2016, 89, 18–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Medema MH, Kottmann R, Yilmaz P, Cummings M, Biggins JB, Blin K, de Bruijn I, Chooi YH, Claesen J, Coates RC, Cruz-Morales P, Duddela S, Dusterhus S, Edwards DJ, Fewer DP, Garg N, Geiger C, Gomez-Escribano JP, Greule A, Hadjithomas M, Haines AS, Helfrich EJN, Hillwig ML, Ishida K, Jones AC, Jones CS, Jungmann K, Kegler C, Kim HU, Kotter P, Krug D, Masschelein J, Melnik AV, Mantovani SM, Monroe EA, Moore M, Moss N, Nutzmann HW, Pan GH, Pati A, Petras D, Reen FJ, Rosconi F, Rui Z, Tian ZH, Tobias NJ, Tsunematsu Y, Wiemann P, Wyckoff E, Yan XH, Yim G, Yu FG, Xie YC, Aigle B, Apel AK, Balibar CJ, Balskus EP, Barona-Gomez F, Bechthold A, Bode HB, Borriss R, Brady SF, Brakhage AA, Caffrey P, Cheng YQ, Clardy J, Cox RJ, De Mot R, Donadio S, Donia MS, van der Donk WA, Dorrestein PC, Doyle S, Driessen AJM, Ehling-Schulz M, Entian KD, Fischbach MA, Gerwick L, Gerwick WH, Gross H, Gust B, Hertweck C, Hofte M, Jensen SE, Ju JH, Katz L, Kaysser L, Klassen JL, Keller NP, Kormanec J, Kuipers OP, Kuzuyama T, Kyrpides NC, Kwon HJ, Lautru S, Lavigne R, Lee CY, Linquan B, Liu XY, Liu W, Luzhetskyy A, Mahmud T, Mast Y, Mendez C, Metsa-Ketela M, Micklefield J, Mitchell DA, Moore BS, Moreira LM, Muller R, Neilan BA, Nett M, Nielsen J, O’Gara F, Oikawa H, Osbourn A, Osburne MS, Ostash B, Payne SM, Pernodet JL, Petricek M, Piel J, Ploux O, Raaijmakers JM, Salas JA, Schmitt EK, Scott B, Seipke RF, Shen B, Sherman DH, Sivonen K, Smanski MJ, Sosio M, Stegmann E, Sussmuth RD, Tahlan K, Thomas CM, Tang Y, Truman AW, Viaud M, Walton JD, Walsh CT, Weber T, van Wezel GP, Wilkinson B, Willey JM, Wohlleben W, Wright GD, Ziemert N, Zhang CS, Zotchev SB, Breitling R, Takano E and Glockner FO, Nature Chemical Biology, 2015, 11, 625–631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Krause DJ, Kominek J, Opulente DA, Shen XX, Zhou X, Langdon QK, DeVirgilio J, Hulfachor AB, Kurtzman CP, Rokas A and Hittinger CT, Proc Natl Acad Sci U S A, 2018, 115, 11030–11035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Helaly SE, Thongbai B and Stadler M, Nat. Prod. Rep, 2018, 35, 992–1014. [DOI] [PubMed] [Google Scholar]
- 26.Rank C, Nielsen KF, Larsen TO, Varga J, Samson RA and Frisvad JC, Fungal Biol, 2011, 115, 406–420. [DOI] [PubMed] [Google Scholar]
- 27.Lind AL, Wisecaver JH, Lameiras C, Wiemann P, Palmer JM, Keller NP, Rodrigues F, Goldman GH and Rokas A, PLoS Biol, 2017, 15, e2003583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hartmann FE and Croll D, Mol. Biol. Evol, 2017, 34, 2808–2822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Olarte RA, Menke J, Zhang Y, Sullivan S, Slot JC, Huang Y, Badalamenti JP, Quandt AC, Spatafora JW and Bushley KE, BMC Genomics, 2019, 20, 120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Schimmel TG, Coffman AD and Parsons SJ, Appl. Environ. Microbiol, 1998, 64, 3707–3712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sugui JA, Pardo J, Chang YC, Zarember KA, Nardone G, Galvez EM, Mullbacher A, Gallin JI, Simon MM and Kwon-Chung KJ, Eukaryot. Cell, 2007, 6, 1562–1569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kawamura C, Tsujimoto T and Tsuge T, Mol Plant Microbe Interact, 1999, 12, 59–63. [DOI] [PubMed] [Google Scholar]
- 33.Kominek J, Doering DT, Opulente DA, Shen XX, Zhou X, DeVirgilio J, Hulfachor AB, Groenewald M, McGee MA, Karlen SD, Kurtzman CP, Rokas A and Hittinger CT, Cell, 2019, 176, 1356–1366 e1310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wisecaver JH, Slot JC and Rokas A, PLoS Genet, 2014, 10, e1004816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Weng JK, Philippe RN and Noel JP, Science, 2012, 336, 1667–1670. [DOI] [PubMed] [Google Scholar]
- 36.Palmer JM and Keller NP, Curr. Opin. Microbiol, 2010, 13, 431–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Hoogendoorn K, Barra L, Waalwijk C, Dickschat JS, van der Lee TAJ and Medema MH, Front Microbiol, 2018, 9, 1158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Paterson S, Vogwill T, Buckling A, Benmayor R, Spiers AJ, Thomson NR, Quail M, Smith F, Walker D, Libberton B, Fenton A, Hall N and Brockhurst MA, Nature, 2010, 464, 275–278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Morris JJ, Lenski RE and Zinser ER, MBio, 2012, 3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Richards TA and Talbot NJ, Nat. Rev. Microbiol, 2013, 11, 720–727. [DOI] [PubMed] [Google Scholar]
- 41.Proctor RH, Busman M, Seo JA, Lee YW and Plattner RD, Fungal Genet. Biol, 2008, 45, 1016–1026. [DOI] [PubMed] [Google Scholar]
- 42.Proctor RH, Van Hove F, Susca A, Stea G, Busman M, van der Lee T, Waalwijk C, Moretti A and Ward TJ, Mol. Microbiol, 2013, 90, 290–306. [DOI] [PubMed] [Google Scholar]
- 43.Shoolingin-Jordan PM, Al-Daihan S, Alexeev D, Baxter RL, Bottomley SS, Kahari ID, Roy I, Sarwar M, Sawyer L and Wang SF, Biochim Biophys Acta, 2003, 1647, 361–366. [DOI] [PubMed] [Google Scholar]
- 44.Yu J, Toxins (Basel), 2012, 4, 1024–1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Nielsen JC, Grijseels S, Prigent S, Ji B, Dainat J, Nielsen KF, Frisvad JC, Workman M and Nielsen J, Nat Microbiol, 2017, 2, 17044. [DOI] [PubMed] [Google Scholar]
- 46.Yue Q, Chen L, Zhang X, Li K, Sun J, Liu X, An Z and Bills GF, Eukaryot. Cell, 2015, 14, 698–718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Proctor RH, McCormick SP, Kim HS, Cardoza RE, Stanley AM, Lindo L, Kelly A, Brown DW, Lee T, Vaughan MM, Alexander NJ, Busman M and Gutierrez S, PLoS Pathog, 2018, 14, e1006946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Young CA, Schardl CL, Panaccione DG, Florea S, Takach JE, Charlton ND, Moore N, Webb JS and Jaromczyk J, Toxins (Basel), 2015, 7, 1273–1302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Condon BJ, Leng Y, Wu D, Bushley KE, Ohm RA, Otillar R, Martin J, Schackwitz W, Grimwood J, MohdZainudin N, Xue C, Wang R, Manning VA, Dhillon B, Tu ZJ, Steffenson BJ, Salamov A, Sun H, Lowry S, LaButti K, Han J, Copeland A, Lindquist E, Barry K, Schmutz J, Baker SE, Ciuffetti LM, Grigoriev IV, Zhong S and Turgeon BG, PLoS Genet, 2013, 9, e1003233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Koczyk G, Dawidziuk A and Popiel D, Genome Biol Evol, 2015, 7, 3132–3154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Zeng G, Zhang P, Zhang Q, Zhao H, Li Z, Zhang X, Wang C, Yin WB and Fang W, PLoS Genet, 2018, 14, e1007472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Tsai HF, Wheeler MH, Chang YC and Kwon-Chung KJ, Journal of Bacteriology, 1999, 181, 6469–6477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Petersen LM, Holm DK, Gotfredsen CH, Mortensen UH and Larsen TO, Chembiochem, 2015, 16, 2200–2204. [DOI] [PubMed] [Google Scholar]
- 54.Guo CJ, Sun WW, Bruno KS and Wang CC, Org. Lett, 2014, 16, 5250–5253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Wisecaver JH and Rokas A, Front Microbiol, 2015, 6, 161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Slot JC and Rokas A, Curr. Biol, 2011, 21, 134–139. [DOI] [PubMed] [Google Scholar]
- 57.Shen L, Poree FH, Gaslonde T, Lalucque H, Chapeland-Leclerc F and Ruprich-Robert G, Environmental microbiology, 2019, DOI: 10.1111/1462-2920.14698. [DOI] [PubMed] [Google Scholar]
- 58.Campbell MA, Staats M, van Kan JLA, Rokas A and Slot JC, Mycologia, 2013, 105, 1126–1134. [DOI] [PubMed] [Google Scholar]
- 59.Campbell MA, Rokas A and Slot JC, Genome Biol Evol, 2012, 4, 289–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Spraker JE, Wiemann P, Baccile JA, Venkatesh N, Schumacher J, Schroeder FC, Sanchez LM and Keller NP, MBio, 2018, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Dhillon B, Feau N, Aerts AL, Beauseigle S, Bernier L, Copeland A, Foster A, Gill N, Henrissat B, Herath P, LaButti KM, Levasseur A, Lindquist EA, Majoor E, Ohm RA, Pangilinan JL, Pribowo A, Saddler JN, Sakalidis ML, de Vries RP, Grigoriev IV, Goodwin SB, Tanguay P and Hamelin RC, Proc Natl Acad Sci U S A, 2015, 112, 3451–3456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Reynolds HT, Slot JC, Divon HH, Lysoe E, Proctor RH and Brown DW, Mol. Biol. Evol, 2017, 34, 2002–2015. [DOI] [PubMed] [Google Scholar]
- 63.Cavalier-Smith T, Ciba Found Symp, 1992, 171, 64–80; discussion 80–67. [DOI] [PubMed] [Google Scholar]
- 64.Patron NJ, Waller RF, Cozijnsen AJ, Straney DC, Gardiner DM, Nierman WC and Howlett BJ, BMC Evol. Biol, 2007, 7, 174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Matasyoh JC, Dittrich B, Schueffler A and Laatsch H, Parasitol. Res, 2011, 108, 561–566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Cacho RA, Jiang W, Chooi YH, Walsh CT and Tang Y, J Am Chem Soc, 2012, 134, 16781–16790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Liu N, Hung YS, Gao SS, Hang L, Zou Y, Chooi YH and Tang Y, Org. Lett, 2017, 19, 3560–3563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Copley SD, Nat. Chem. Biol, 2009, 5, 559–566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Wong S and Wolfe KH, Nat. Genet, 2005, 37, 777–782. [DOI] [PubMed] [Google Scholar]
- 70.Wiemann P, Guo CJ, Palmer JM, Sekonyela R, Wang CC and Keller NP, Proc Natl Acad Sci U S A, 2013, 110, 17065–17070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Proctor RH, McCormick SP, Alexander NJ and Desjardins AE, Mol. Microbiol, 2009, 74, 1128–1142. [DOI] [PubMed] [Google Scholar]
- 72.Khaldi N and Wolfe KH, Int. J. Evol. Biol., 2011, 2011, 423821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Pearce C, Adv. Appl. Microbiol, 1997, 44, 1–80. [DOI] [PubMed] [Google Scholar]
- 74.Ruzzini AC and Clardy J, Curr. Biol, 2016, 26, R859–R864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Fischbach MA, Walsh CT and Clardy J, Proc Natl Acad Sci U S A, 2008, 105, 4601–4608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.McGary KL, Slot JC and Rokas A, Proc. Natl. Acad. Sci. USA, 2013, 110, 11481–11486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Stadler M and Hoffmeister D, Front Microbiol, 2015, 6, 127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Navarro-Muñoz JC, Selem-Mojica N, Mullowney MW, Kautsar S, Tryon JH, Parkinson EI, De Los Santos ELC, Yeong M, Cruz-Morales P, Abubucker S, Roeters A, Lokhorst W, Fernandez-Guerra A, Dias Cappelini LT, Thomson RJ, Metcalf WW, Kelleher NL, Barona-Gomez F and Medema MH, bioRxiv, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Vesth TC, Nybo JL, Theobald S, Frisvad JC, Larsen TO, Nielsen KF, Hoof JB, Brandl J, Salamov A, Riley R, Gladden JM, Phatale P, Nielsen MT, Lyhne EK, Kogle ME, Strasser K, McDonnell E, Barry K, Clum A, Chen C, LaButti K, Haridas S, Nolan M, Sandor L, Kuo A, Lipzen A, Hainaut M, Drula E, Tsang A, Magnuson JK, Henrissat B, Wiebenga A, Simmons BA, Makela MR, de Vries RP, Grigoriev IV, Mortensen UH, Baker SE and Andersen MR, Nat. Genet, 2018, 50, 1688–1695. [DOI] [PubMed] [Google Scholar]
- 80.Greco C, Keller NP and Rokas A, Curr. Opin. Microbiol, 2019, 51, 22–29. [DOI] [PMC free article] [PubMed] [Google Scholar]





