ABSTRACT
Natural products of lichen-forming fungi are structurally diverse and have a variety of medicinal properties. Despite this, they have limited implementation in industry mostly because the corresponding genes are unknown for most of their natural products. Here, we implement a long-read sequencing and bioinformatic approach to identify the putative biosynthetic gene cluster of the bioactive natural product gyrophoric acid (GA). Using 15 high-quality genomes representing nine GA-producing species of the lichen-forming fungal genus Umbilicaria, we identify the most likely GA cluster and investigate the cluster gene organization and composition across the nine species. Our results show that GA clusters are promiscuous within Umbilicaria, and only three genes are conserved across species, including the polyketide synthase (PKS) gene. In addition, our results suggest that the same cluster codes for different, but structurally similar compounds, namely, GA, umbilicaric-, and hiascic acid, bringing new evidence that lichen metabolite diversity is also generated through regulatory mechanisms at the molecular level. Ours is the first study to identify the most likely GA cluster and, thus, provides essential information to open new avenues for biotechnological approaches to producing and modifying GA and similar lichen-derived compounds. GA PKS is the first tridepside PKS to be identified.
IMPORTANCE The implementation of natural products in the pharmaceutical industry relies on the possibility of modifying the natural product (NP) pathway to optimize yields and pharmacological effects. Characterization of genes and pathways underlying natural product biosynthesis is a major bottleneck for exploiting the medicinal properties of the natural products. Genome mining is a promising and relatively cost- and time-effective approach to utilize unexplored NP resources for drug discovery. In this study, we identify the most likely gene cluster for the lichen-forming fungal depside gyrophoric acid in nine Umbilicaria species. This compound shows cytotoxic and antiproliferative properties against several cancer cell lines and is also a broad-spectrum antimicrobial agent. This information paves the way for generating GA analogs with modified properties by selective activation/deactivation of genes.
KEYWORDS: biosynthetic genes, depsides, fungi, genome mining, long-read sequencing, microbial biotechnology, PKS phylogeny, secondary metabolites, Umbilicaria, genomics, lichen compounds, nonreducing PKSs, pharmaceutically relevant natural products
INTRODUCTION
Natural products (NPs) and their derivatives/analogs constitute about 70% of modern medicines (1, 2). NPs alone, however, i.e. unmodified molecules as produced by organisms in nature, constitute only a small portion of this. The vast majority, about 60 to 65%, are derivatives and analogs of naturally occurring substances, synthesized through biotechnological or synthetic approaches (2, 3). The exploitation of NPs in the pharmaceutical industry relies on the possibility of modifying NP pathways in order to optimize yields and to acheive the desired pharmacological effects. Culture-dependent approaches to identify/produce NPs are labor-intensive and time-consuming and are not successful for every organism (4, 5). As a result, the biosynthetic potential of many biosynthetically prolific organisms remains untapped. Information on the genetic background and mechanisms of NP synthesis may thus contribute to fast-tracking NP-based drug discovery (2, 6).
Lichens, symbiotic organisms composed of fungal and photosynthetic partners (green algae or cyanobacteria or both at the same time) (7–9), are a treasure chest of NPs (10–12). Lichen compounds have great pharmacological potential, encompassing antimicrobial, antiproliferative, cytotoxic, and antioxidant properties (11, 13–16). So far, about 1,000 NPs with great structural and functional diversity have been reported from lichen-forming fungi (LFF), and about 300 to 400 have been screened for bioactivity (11). However, the genetic background of more than 97% of lichen NPs is unknown (17–20). This is because, there are various major bottlenecks in using lichen NPs in the pharmaceutical industry, including low yield in nature, slow growth, and/or tedious isolation/culturing methods. Targeted genome mining approaches integrate the latest DNA sequencing technologies with computational advancements and large, publicly available databases of already characterized biosynthetic gene clusters (BGCs) to identify genes coding for NPs (1, 21, 22). This approach combines genome mining with the expected genetics of the NP to narrow down the candidate biosynthetic genes.
In silico approaches for linking natural products with their respective BGCs—genomic clusters of biosynthetic-related genes typically found in fungi (23–25)—are becoming more common in LFF due to the increased availability of genomic resources and databases (10, 17, 20), improvement of detection software and genome mining tools, stabilizing polyketide synthase (PKS) phylogenies, and information gained from recent successes in the heterologous expression of PKS from LFF (17, 18). For instance, the clade “group I, PKS16” from Kim et al. (17) is associated with the biosynthesis of orsellinic acid derivatives (orcinol depsides and depsidones), such as lecanoric acid (18), grayanic acid (19), physodic acid, and olivetoric acid (20), whereas the clade “group IX, PKS23” from Kim et al. (17) is associated with the biosynthesis of methylated orsellinic acid derivatives (β-orcinol depsides and depsidones) such as atranorin. The cluster linked to usnic acid biosynthesis is also fairly well studied (26, 27) and corresponds to “group VI, PKS8” from Kim et al. (17).
Here, we combine high-throughput long-read sequencing with a comparative genomics approach to identify the putative cluster(s) linked to the synthesis of gyrophoric acid (GA). GA is a natural product synthesized by several LFF species. It has a broad spectrum of bioactivity, such as anticancer and antimicrobial, and industrially-relevant properties, including usage as dyes (15, 28–30). However, the molecular mechanism and genetics of its synthesis remain unknown. Identification of the GA cluster would facilitate its production via biotechnology to optimize yield as well as to generate GA analogs with the desired pharmaceutical effect. For this study, we chose nine species of GA producers belonging to the lichen-forming fungal genus Umbilicaria (Table 1). GA is the most characteristic compound of this genus and is found at high concentrations in all of the chosen species (28, 31–33). It is a depside containing three orsellinic acid rings joined together by ester bonds (Fig. 1). Apart from GA, several other structurally related depsides, such as umbilicaric acid, lecanoric acid, and hiascic acid (Fig. 1), have also been reported from Umbilicaria spp. (31, 32, 34), but these usually constitute a minor fraction (<10%) of the total NPs detected via high-pressure liquid chromatography (HPLC) (Fig. 1).
TABLE 1.
Genome quality and annotation statistics
| Taxon | Sample ID | Ccs HiFi yield (%) | No. of Scaffolds | N 50 | Completeness (%) | Assembly size (Mb) | No. of genes | No. of proteins | Genome accession |
|---|---|---|---|---|---|---|---|---|---|
| Umbilicaria deusta 1 | TBG_2334 | 47.86 | 44 | 1.67 | 97.6 | 40.9 | 8,949 | 8,857 | JALILR000000000 |
| U. deusta 2 | TBG_2335 | 43.54 | 42 | 1.87 | 90.2 | 37.4 | 8,194 | 8,049 | NA |
| U. freyi 1 | TBG_2329 | 47.39 | 107 | 2.58 | 95.7 | 47.5 | 10,156 | 10,065 | JALILQ000000000 |
| U. freyi 2 | TBG_2330 | 46.41 | 54 | 2.04 | 85.9 | 50 | 8,848 | 8,773 | NA |
| U. grisea | TBG_2336 | 42.54 | 40 | 1.83 | 96.9 | 44.43 | NA | NA | JALILX000000000 |
| U. hispanica 1 | TBG_2322 | 38.71 | 130 | 3.13 | 96.8 | 43.4 | 9,111 | 9,021 | NA |
| U. hispanica 2 | TBG_2337 | 54.22 | 53 | 4.23 | 97.3 | 48.6 | 8,781 | 8,696 | JALILS000000000 |
| U. phaea 1 | TBG_1111 | NAa | 47 | 1.54 | 96.5 | 35.1 | 7,659 | 7,576 | NA |
| U. phaea 2 | TBG_1112 | NA | 38 | 1.22 | 96.5 | 35.55 | 7,681 | 7,628 | JALILT000000000 |
| U. pustulata 1 | TBG_2333 | 33 | 26 | 2.62 | 97.3 | 37.7 | 9,569 | 9,503 | NA |
| U. pustulata 2 | TBG_2345 | 32.26 | 31 | 2.36 | 96.8 | 35.7 | 8,790 | 8,740 | JALILU000000000 |
| U. spodochroa 1 | TBG_2434 | 34.20 | 130 | 9.93 | 97.0 | 44.3 | 8,791 | 8,705 | JALILV000000000 |
| U. spodochroa 2 | TBG_2435 | 40.93 | 97 | 1.25 | 97.1 | 40.1 | 8,612 | 8,507 | NA |
| U. subpolyphylla 1 | TBG_2323 | 41.14 | 190 | 1.55 | 99.6 | 58.2 | 16,993 | 16,852 | NA |
| U. subpolyphylla 2 | TBG_2324 | 33.68 | 39 | 1.52 | 97.6 | 31.8 | 8,556 | 8,410 | JALILW000000000 |
| Dermatocarpon miniatum 1 | TBG_2326 | 29.36 | 26 | 5.08 | 98.1 | 63.5 | 9,273 | 9,189 | JALILY000000000 |
| D. miniatum 2 | TBG_2331 | 26.28 | 22 | 4.25 | 98.4 | 49.8 | 7,938 | 7,871 | NA |
NA, not applicable.
FIG 1.
Chemical structures and nomenclature. Structure of a lichen depside, atranorin, GA, and other depsides produced by Umbilicaria spp.
In the present study, we assembled highly contiguous long-read-based genomes of the nine species of the genus Umbilicaria, identified their biosynthetic gene clusters, and singled out the candidate genes linked to GA biosynthesis.
RESULTS
Genome sequencing, assembly, and annotation.
The genome quality stats and assembly reports of all of the genomes used for this study are presented in Table 1.
Total BGCs and phylogenetic analysis.
A total of 406 BGCs and 236 PKSs were identified in 15 Umbilicaria genomes, representing nine species (Table 2). Out of 236, 122 were nonreducing PKSs (NR-PKSs), 86 were reducing PKSs, 16 were type-III PKSs, and 12 were partial or unclassified because the core PKS was fragmented and the characteristic domains were missing (Table 2).
TABLE 2.
Biosynthetic gene clusters and PKSs identified in Umbilicaria sppa
| Taxon | Sample ID | No. BGCs | Total no. PKSs | No. NR-PKSs | No. red-PKSs | No. unclassified/fragmented | No. T3PKS |
|---|---|---|---|---|---|---|---|
| Umbilicaria deusta 1 | TBG_2334 | 33 | 24 | 11 | 10 | 1 | 2 |
| U. deusta 2 | TBG_2335 | 30 | 21 | 11 | 9 | 1 | 0 |
| U. freyi 1 | TBG_2329 | 25 | 9 | 5 | 3 | 0 | 1 |
| U. freyi 2 | TBG_2330 | 23 | 10 | 4 | 5 | 0 | 1 |
| U. grisea | TBG_2336 | 20 | 13 | 7 | 4 | 1 | 1 |
| U. hispanica 1 | TBG_2322 | 24 | 13 | 9 | 3 | 0 | 1 |
| U. hispanica 2 | TBG_2337 | 25 | 14 | 9 | 4 | 0 | 1 |
| U. phaea 1 | TBG_1111 | 22 | 14 | 6 | 7 | 0 | 1 |
| U. phaea 2 | TBG_1112 | 19 | 11 | 6 | 4 | 0 | 1 |
| U. pustulata 1 | TBG_2333 | 31 | 18 | 8 | 9 | 0 | 1 |
| U. pustulata 2 | TBG_2345 | 27 | 17 | 9 | 6 | 1 | 1 |
| U. spodochroa 1 | TBG_2434 | 27 | 14 | 8 | 4 | 1 | 1 |
| U. spodochroa 2 | TBG_2435 | 27 | 16 | 9 | 5 | 1 | 1 |
| U. subpolyphylla 1 | TBG_2323 | 30 | 14 | 7 | 4 | 1 | 2 |
| U. subpolyphylla 2 | TBG_2324 | 20 | 10 | 5 | 4 | 0 | 1 |
| U. muhlenbergii | NA | 23 | 18 | 8 | 5 | 5 | 0 |
| Dermatocarpon miniatum 1 | TBG_2326 | 29 | 13 | 3 | 8 | 2 | 0 |
| D. miniatum 2 | TBG_2331 | 32 | 11 | 4 | 6 | 1 | 0 |
aRed-PKSs, reducing PKSs; NR-PKSs, nonreducing PKSs; T3PKSs, type III PKSs.
Four NR-PKSs were common to all species, namely, PKS15, PKS16, PKS20, and a novel PKS (forming a monophyletic, supported clade with PKS33) (Fig. 2). Only one NR-PKS per species formed a supported monophyletic clade with PKS16 (group I, i.e., orsellinic acid, depside, and depsidone NR-PKSs) (Fig. 2). No PKS from Dermatocarpon miniatum grouped within the PKS16 clade, which is expected, as D. miniatum does not produce orsellinic acid-based compounds.
FIG 2.
NR-PKS phylogeny of lichen-forming fungi. This is a maximum-likelihood tree based on amino acid sequences of NR-PKSs from nine Umbilicaria spp., six Cladonia spp., Dermatocarpon miniatum, Stereocaulon alpinum, and Pseudevernia furfuracea. Branches in bold indicate bootstrap support >70%. Green color clades represent the PKSs common to all nine Umbilicaria spp. used in this study. PKS groups are based on Kim et al. (17).
Gyrophoric acid cluster.
The cluster most likely associated with GA synthesis is the cluster containing PKS16 (Fig. 2), as (i) it is present in all Umbilicaria spp., (ii) it contains an NR-PKS, and (iii) it forms a monophyletic group with the clade “group I, PKS 16” from Kim et al. (17).
Out of 15 genes (in the GA cluster of Umbilicaria deusta) (Fig. 3A), antiSMASH identified two genes, PKS and cyt P450, whereas other genes were identified as coding for proteins of unknown function by antiSMASH as well as by InterProScan. NCBI conserved domain search (CDS) identified the additional 10 genes of the candidate GA cluster and estimated their putative function based on their domain motifs (Fig. 3B). Specifically, these genes code for proteins involved in transcription regulation, oxidation, hydrolysis, and protein-protein interaction/trafficking. Conserved domains were not detected in three genes (genes 5, 11, and 13; Fig. 3B), and two genes (genes 7 and 14; Fig. 3B) had conserved domains belonging to the DUF (domain of unknown function) superfamily.
FIG 3.
(A) Gyrophoric acid cluster from Umbilicaria deusta as predicted by antiSMASH. Colored boxes indicate genes. Genes in gray represent genes coding for unknown proteins as predicted by the antiSMASH. (B) Putative functions of the genes of the gyrophoric acid cluster based on InterProScan and NCBI CDS search. The numbers correspond to the gene numbers of U. deusta in panel A.
We inferred the synteny of the U. deusta GA cluster with the GA clusters of all other Umbilicaria spp. to estimate homology between them (Fig. 4). In addition, we also examined the synteny of the U. deusta GA cluster with the other clusters involved in the synthesis of orsellinic-acid-derivative compounds—olivetoric acid, grayanic acid, and orsellinic acid cluster (Fig. 4). The synteny plots show that GA clusters are highly homologous among Umbilicaria deusta, U. freyi, U. grisea, U. phaea, and U. subpolyphylla, whereas between GA and grayanic acid cluster and GA and olivetoric acid cluster only the PKS genes are homologous (Fig. 4|). The other genes of the clusters involved in the production of orsellinic acid derivatives are not conserved among the genera examined, i.e., Umbilicaria spp., Cladonia grayi, and Pseudevernia furfuracea. The orsellinic acid cluster from Aspergillus nidulans showed almost no homology to the GA cluster.
FIG 4.
Synteny plots based on tBLASTn showing the homology and synteny between the putative gyrophoric acid clusters U. deusta and other Umbilicaria spp. (A) and between the gyrophoric acid cluster from U. deusta and the grayanic acid cluster from Cladonia grayi, the olivetoric acid cluster from Pseudevernia furfuracea, and the orsellinic acid cluster from Aspergillus nidulans (B). All of the PKSs are highly homologous to the GA PKS and have the same domains as the GA PKS: SAT-KS-AT-PT-ACP-ACP-TE.
BGC clustering: BiG-SCAPE and CORASON.
BGC sequence similarity networks group gene clusters at multiple hierarchical levels. This analysis implements a “glocal” alignment mode that groups both complete and fragmented BGCs. The BGCs forming a supported monophyletic clade to PKS16 (group I) were then analyzed for conservation across species using CORASON. The CORASON analysis also showed that only the following three genes in the cluster were shared among the studied Umbilicaria species: the core PKS and the two genes present upstream and downstream of the core gene (Fig. 5).
FIG 5.
CORASON-based PKS phylogeny to elucidate evolutionary relationships and cluster organization of the GA cluster in Umbilicaria spp. The bar plot below depicts the percentage of Umbilicaria species in which a particular gene is present. The color of the genes is white when the homology to the genes of the other gyrophoric acid clusters is low.
We used CORASON plot in order to display the GA clusters of Umbilicaria spp. (Fig. 5). There are 9 to 15 genes present in the putative GA cluster of the species studied. The PKS and genes upstream and downstream of it are present in all Umbilicaria spp. As the number of genes present in the GA cluster varies among the Umbilicaria spp., we consider the U. deusta GA cluster as a representative of Umbilicaria GA clusters for this study (Fig. 3A). The gene numbers refer to the genes in the GA cluster of U. deusta (Fig. 3A).
DISCUSSION
Gyrophoric acid PKS.
We identified only one PKS as the most likely GA PKS (Fig. 2).
So far, the BGCs associated with the biosynthesis of the following lichen depsides and depsidones have been experimentally identified: atranorin (17), lecanoric (18), and grayanic acid (19). These studies demonstrate that the PKS alone is capable of synthesizing the backbone depside, whereas modifications, such as methylation and oxidation, are made by enzymes coded by other genes of the cluster after the release of the depside from the PKS. For instance, the synthesis of atranorin involves at least three genes, but the depside backbone is coded by the PKS (17). The other two genes, an O-methyltransferase (OMT) and a cyt P450, methylate the carboxyl group and oxidize the methyl group (into -CHO), respectively, to produce the final product atranorin (Fig. 1). As GA does not have side-chain modifications (Fig. 1), we propose that the PKS alone is capable of producing the final product, i.e. GA.
The depside PKSs identified so far code for didepsides, i.e. compounds that contain two phenolic rings joined with an ester bond, e.g., atranorin, lecanoric-, and olivetoric acid (17, 18, 20). Ours is the first study to identify the most likely PKS associated with a tridepside synthesis, i.e., three phenolic rings joined with two ester bonds. Our study suggests that the PKS genes coding for a didepside and a tridepside are highly homologous (Fig. 4).
GA cluster in Umbilicaria spp.
The most likely GA cluster contains about 9 to 15 genes in different Umbilicaria spp. (Fig. 4 and 5). Interestingly, only the following three genes are conserved across the species analyzed: the PKS, a gene having a conserved domain of unknown function present upstream, and a hydrolase present downstream of the PKS (Fig. 4 and 5). This suggests that these three genes form an integral part of the GA cluster, whereas the other genes are facultative among GA producing Umbilicaria spp. The facultative genes in the GA cluster code for enzymes involved in transcription regulation, oxidation, hydrolysis, and protein-protein interaction/trafficking, indicating that they play a role in species-specific transcription regulation, transport, or modification (Fig. 3). Differences in gene content among the clusters synthesizing the same compound have been reported before. For instance, the usnic acid cluster differs in its gene content among taxa, and out of 8 to 15 genes belonging to the cluster, only PKS and cyt P450 are conserved (26). These differences could be responsible for species-specific BGC regulation or modifications to the protein released by the PKS.
The most likely GA cluster contains a cyt P450 (Fig. 3A), which has been associated with oxidative function (17, 19, 20, 27). However, the location and orientation of the cyt P450 in the Umbilicaria GA clusters are different from those of a cluster which requires an active cyt P450 for the production of the final compound, i.e., grayanic acid, usnic acid, and atranorin cluster (Fig. 4) (17, 19, 26). In the GA cluster the cyt P450 is not located next to the PKS and has the same orientation as the PKS, whereas in the grayanic acid, usnic acid, and atranorin cluster, cyt P450 lies next to the PKS in the opposite orientation (Fig. 4). For instance, grayanic acid synthesis (in Cladonia grayi) involves the synthesis of the depside intermediate by PKS followed by oxidation of the released depside into depsidone by cyt P450 (19). Such organization is suggestive of genes being regulated and coexpressed by the same promoter (17, 19). PKS and cyt P450 form an integral part of depsidone synthesis (35), whereas the depside is coded by the PKS alone, with the exception of the side-chain modifications (17, 18). Therefore, despite being part of the GA cluster, cyt P450 does not seem to be involved in GA synthesis or in the synthesis of umbilicaric and/or lecanoric acid reported from Umbilicaria spp. analyzed in this study. The synthesis of hiascic acid, however, would require the hydroxylation of a methyl group by a cyt P450 enzyme after the depside is released from the PKS (Fig. 1, the OH group in bold in hiascic acid). The lower amounts of hiascic acid found in Umbilicaria thalli, compared to GA, could be due to the fact that the cyt P450 may not be coexpressed with the PKS.
Although the GA cluster shows varied degrees of homology among Umbilicaria spp., with certain genes present only in some species, the gyrophoric acid PKS is highly homologous among Umbilicaria (Fig. 4). This is expected, as the PKSs are involved in the synthesis of the same compound. Interestingly, the GA PKS is also homologous to the olivetoric acid PKS and grayanic acid PKS, suggesting that the depside/depsidone PKS are homologous even among distantly related taxa in lichen-forming fungi. The GA was not homologous to the orsellinic acid PKS from Aspergillus nidulans (Fig. 4), suggesting different evolutionary trajectories despite being involved in the synthesis of orcinol-derived compounds.
Our study provides novel insights into GA cluster composition and organization across different species (Fig. 5). This information is crucial in order to open the way for future genetic manipulation of the GA biosynthetic pathway that may be aimed at increasing structural diversity and/or yield of the products, as well as at generating analogs with novel properties.
One cluster, different compounds.
Variation in cluster composition reflects the potential to produce diverse NPs. Apart from GA, other depsides related in structure to GA, i.e., lecanoric acid, umbilicaric acid, and hiascic acid (Fig. 1), are often reported from Umbilicaria spp. as minor metabolites (31). Interestingly, we found only one orcinol depside PKS in Umbilicaria spp. (Fig. 2). This strongly indicates that all of the Umbilicaria depsides are coded by the same PKS cluster. One cluster coding for different, structurally-related compounds has also been reported previously (20, 36, 37). For instance, in the case of the antifungal drug caspofungin acetate, a semisynthetic derivative of the NP pneumocandins from the fungus Glarea lozoyensis, selective inactivation of different genes in this biosynthetic gene cluster generates 13 different analogues, some of them with elevated antifungal activity relative to the original compound and its semisynthetic derivative (38). Similarly, the aspyridone biosynthetic cluster from Aspergillus nidulans produces eight different compounds in a heterologous host (37). These studies show that a single PKS cluster is capable of producing different compounds depending upon which genes are coexpressed and on the available starters. In lichens, a single PKS has been associated with the synthesis of olivetoric and physodic acid (20), and the same PKS has been shown to be involved in the synthesis of lecanoric acid in a heterologous host (18). We propose that the same PKS cluster is most likely involved in the synthesis of GA, umbilicaric acid (an additional methyl group) (Fig. 1), hiascic acid (additional hydroxyl group) (Fig. 1), and lecanoric acid (didepside with no side chains) (Fig. 1) in Umbilicaria. It is possible, however, that in nature only GA is synthesized in members of the genus Umbilicaria, and the cooccurring minor compound lecanoric acid is a hydrolysis product of GA (39).
Interestingly, although umbilicaric acid is reported from some Umbilicaria species (U. grisea, U. freyi, U. muhlenbergii, and U. subpolyphylla [31, 40]), O-methyltransferase (OMT) was not identified in the depside-related BGC of any Umbilicaria species (Fig. 3A and B). OMT would be required for the methylation of oxygen to produce umbilicaric acid (Fig. 1). Its absence from depside-related BGCs suggests that an external OMT, e.g., from other BGCs, might be involved in the production of umbilicaric acid in Umbilicaria. This could explain the lower amounts of umbilicaric acid compared to GA found in these species (31). In contrast, when the O-methylated compound is the major secondary metabolite, as in the case of grayanic acid and atranorin, OMT is an integral part of the BGC and is coexpressed along with the other crucial genes for grayanic acid production, i.e., PKS and cyt P450 (17, 19).
Future perspectives.
Advances in long-read sequencing and in computational approaches to genome mining not only enable linking biosynthetic genes to NPs but also provide an overview of the entire gene cluster composition and organization. Ours is the first study to identify the most-likely GA cluster, which is essential for opening up avenues for biotechnological approaches to producing and modifying this compound and possibly other lichen compounds. In particular, this information can be applied to generate novel NP analogs with improved pharmacological properties via synthetic biology, biotechnology, and combinatorial biosynthesis approaches. This paves the way to an entirely new horizon in terms of utilizing these understudied taxa for pharmacological industry and drug discovery.
MATERIALS AND METHODS
Sampling and data set.
We collected samples of the following eight Umbilicaria species for genome sequencing: U. deusta, U. freyi, U. grisea, U. subpolyphylla, U. hispanica, U. phaea, U. pustulata, and U. spodochroa (see voucher information in Table S1 in the supplemental material). When possible, we sequenced two samples of the same species collected in different climatic zones. This was done to consider the possible intraspecific variation in BGC content as recently shown in Singh et al. (34). The genome of U. muhlenbergii was downloaded from the JGI database. In addition, we sampled Dermatocarpon miniatum as a control, as it does not produce depsides/depsidones.
DNA extraction, library preparation, and genome sequencing.
Lichen thalli were thoroughly washed with sterile water and checked under the stereomicroscope for the presence of possible contamination and other lichen thalli. DNA was extracted from all of the samples using a cetyltrimethylammonium bromide (CTAB)-based method (41) as presented in reference 42.
SMRTbell libraries were constructed according to the manufacturer's instructions of the SMRTbell Express prep kit v. 2.0 following the low DNA input protocol (Pacific Biosciences, Menlo Park, CA). Total input for samples was approximately 170 to 800 ng. Ligation with T-overhang SMRTbell adapters was performed at 20°C for 1 h or overnight. Following ligation, the libraries were purified with a 0.45× or 0.8× AMPure PB bead cleanup step. The subsequent size selection step to remove SMRTbell templates of <3 kb was performed with 2.2× of a 40% (vol/vol) AMPure PB bead working solution.
SMRT sequencing was performed on the Sequel system II with the Sequel II sequencing kit 2.0 using the continuous long read (CLR) mode or the circular consensus sequencing (CCS) mode, 30 h movie time with no preextension and Software SMRTLINK 8.0. Each metagenomic library was sequenced on one SMRT cell at the Medical Center Nijmegen (the Netherlands) or at MPI Dresden.
Genome assembly and annotation.
The continuous long reads (i.e., CLR reads) from the PacBio Sequel II CLR run were first processed into highly accurate consensus sequences (i.e., HiFi reads) using PacBio tool CCS v5.0.0 with default parameters (https://ccs.how). HiFi reads were then assembled into contigs using the assembler metaFlye v2.7 (43). The resulting contigs were scaffolded with LRScaf v1.1.12 (https://github.com/shingocat/lrscaf; 44). The scaffolds were then taxonomically binned to extract Ascomycota reads with blastx using DIAMOND (–more-sensitive –frameshift 15 –range-culling) on a custom database and following the MEGAN6 Community Edition pipeline (45). All scaffolds assigned to Ascomycota were extracted as to represent the Umbilicaria spp. Assembly statistics, such as number of contigs, total length, and N50 were accessed with Assemblathon v2 (46) (Table 1). The completeness of the mycobiont bins (i.e., the fungal genomes) was estimated using benchmarking universal single-copy orthologs (BUSCO) analysis in BUSCO v4 (47).
Identification of biosynthetic gene clusters.
Functional annotation of genomes, including genes, proteins, and BGC prediction (antiSMASH; antibiotics & SM Analysis Shell, v5.0) was performed with scripts implemented in the funannotate pipeline (48, 49). First, the genomes were masked for repetitive elements, and then the gene prediction was performed using BUSCO2 to train Augustus and self-training GeneMark-ES (47, 50). Functional annotation was done with InterProScan (51), egg-NOG-mapper (52, 53), and BUSCO (47) with ascomycota_db models. Secreted proteins were predicted using SignalP (54) as implemented in the funannotate “annotate” command.
To complement the functional annotation of genes by InterProScan, egg-NOG-mapper, and SignalP, we performed NCBI CDS searches on the genes of the putative gyrophoric acid cluster using the nucleotide sequences of the genes.
Phylogenetic analyses.
To search for PKSs involved in the synthesis of GA, we extracted the amino acid sequences of all of the NR-PKS from the BGCs predicted by the antiSMASH for the Umbilicaria spp. and Dermatocarpon miniatum (see Table S2 in the supplemental material). The sequences were aligned using MAFFT as implemented in Geneious v5.4 (55, 56). Gaps were treated as missing data. The maximum likelihood search was performed on the aligned sequences with RAxMLHPC BlackBox v8.1.11 (57) on the Cipres Scientific gateway (58). We then performed a phylogenetic analysis by incorporating these amino acid sequences into the most comprehensive PKS data set currently available (Table S2) (17, 20). This data set comprises NR-PKS sequences of the following species downloaded from previous publications and public databases: Cladonia borealis, C. grayi, Cladonia macilenta, Cladonia metacorallifera, Cladonia rangiferina, Cladonia uncialis, Pseudevernia furfuracea, Stereocaulon alpinum, and Umbilicaria muhlenbergii. The final data set contains amino acid sequences of 229 NR-PKSs from 18 species belonging to five LFF genera. Phylogenetic trees were visualized using iTOL (59).
Annotation of PKSs.
Umbilicaria PKSs were named according to the clustering with preannotated PKSs in the phylogeny. NR-PKSs have been categorized into nine groups based on phylogenetic clustering and broad category of the protein coded by them (17). For instance, group I comprises PKSs involved in the synthesis of zearalenone, orsellinic acid, and its derivative compounds, and group II consists of PKSs coding for melanins.
Each of the nine groups contains several PKSs based on the supported phylogenetic clades and protein sequence similarity. For this study, we included the following 25 NR-PKSs: PKS1, PKS2, PKS8, PKS9, PKS13 to PKS28 (total 16), and PKS30 to PKS34 (total 5). Each PKS represents a supported monophyletic clade within a group in the NR-PKS phylogeny (17). To summarize, Umbilicaria PKSs were annotated and named according to phylogenetic clustering with the preannotated NR-PKS sequences of Cladonia spp., Pseudevernia furfuracea, and Stereocaulon alpinum, downloaded from previous publications and public databases (17, 19, 20). PKS16 and PKS23 have been suggested to be involved in the synthesis of depsides, the chemical category of GA. The PKSs responsible for the synthesis of β-orcinol depsides such as atranorin are PKS23, whereas those involved in the synthesis of orcinol-depsides, such as grayanic acid and olivetoric acid are PKS16 (19, 20). GA is an orcinol depside; therefore, the corresponding PKS(s) would be PKS16.
Selecting candidate gene clusters linked to GA biosynthesis.
We used the following criteria to select the candidate gene cluster associated with GA synthesis in Umbilicaria: (i) it must contain an NR-PKS (as some of the structural features of an NP can be directly inferred from the domain architecture of the PKS: PKS genes without reducing domains [NR-PKS genes] are linked to nonreduced compounds such as gyrophoric acid, olivetoric acid [20], physodic acid [20], and grayanic acid [19]), (ii) it must be present in all of the Umbilicaria genomes, as all of the species have GA as the major secondary metabolite (33), and (iii) it must be closely related to the PKSs involved in the synthesis of orcinol depsides, i.e., PKS16 (19, 20), because orsellinic acid units constitute the building blocks of GA.
Homology between orcinol-depside clusters.
Homologous GA clusters from Umbilicaria spp. were visualized using synteny plots as implemented in Easyfig v2.2.3 (60). In addition, we also inferred the synteny between the GA cluster and the other clusters involved in the synthesis of orcinol derivatives—the olivetoric acid (20), grayanic acid (19), and the orsellinic/lecanoric acid cluster (61, 62). The .gbk input files for Easyfig were taken from antiSMASH (48). Easyfig was run with tblastx v2.6.0+ and a minimum identity value of 50 to draw the blast hits. Clusters were matched for orientation so that the PKS genes were oriented in the same direction (Fig. 4).
BGC clustering: BiG-SCAPE and CORASON.
We used BiG-SCAPE and CORASON (63) to identify the gene cluster networks and infer evolutionary relationships among clusters of interest among different Umbilicaria spp. BiG-SCAPE utilizes antiSMASH (48) and MIBiG databases (64) for inferring BGC sequence similarity networks, whereas CORASON employs a phylogenomic approach to infer evolutionary relationships between the clusters. BiG-SCAPE v1.0.1 was run in –auto mode to identify BGC families using antiSMASH output files (.gbk) as input. Networks were generated using similarity thresholds of 0.25. The most likely GA cluster from all of the Umbilicaria spp. was examined for conservation and variation among different Umbilicaria species using the CORASON pipeline. The antiSMASH .gbk files of the corresponding clusters, based on phylogenetic grouping, were used as input. The most-likely GA cluster from U. deusta was used as reference to fish out the most closely related clusters from the other Umbilicaria spp.
Data availability.
This whole-genome shotgun project has been deposited at GenBank under the accession number PRJNA820300. The versions described in this paper are the versions JALILQ000000000 to JALILY000000000. The lichen samples of eight Umbilicaria spp. and Dermatocarpon miniatum have been deposited under BioSample accession numbers SAMN27294873 to SAMN27294881. The mycobiont samples have been deposited under BioSample accession numbers SAMN26992773 to SAMN26992781. The antiSMASH files of Umbilicaria spp. and Dermatocarpon miniatum and the candidate gyrophoric acid cluster .gbk files are available at figshare (https://doi.org/10.6084/m9.figshare.19625997).
Supplementary Material
ACKNOWLEDGMENTS
This research was funded by LOEWE-Centre TBG, funded by the Hessen State Ministry of Higher Education, Research and the Arts (HMWK).
We thank Daniele Armaleo (Duke University) for his input on the PKS domain composition of tridepsides and didepsides and Marnix Medema and Satria Kautsar for their support with the BiG-SCAPE program.
We thank the reviewers for their constructive suggestions on the manuscript.
Footnotes
Supplemental material is available online only.
Contributor Information
Garima Singh, Email: garima.singh@unipd.it, gsingh458@gmail.com.
Lea Atanasova, University of Natural Resources and Life Sciences Vienna.
Ekaterina Shelest, University of Portsmouth.
Young-Mo Kim, Pacific Northwest National Laboratory.
REFERENCES
- 1.Newman DJ, Cragg GM. 2016. Natural products as sources of new drugs from 1981 to 2014. J Nat Prod 79:629–661. doi: 10.1021/acs.jnatprod.5b01055. [DOI] [PubMed] [Google Scholar]
- 2.Newman DJ, Cragg GM. 2020. Natural products as sources of new drugs over the nearly four decades from 01/1981 to 09/2019. J Nat Prod 83:770–803. doi: 10.1021/acs.jnatprod.9b01285. [DOI] [PubMed] [Google Scholar]
- 3.Newman DJ, Cragg GM. 2007. Natural products as sources of new drugs over the last 25 years. J Nat Prod 70:461–477. doi: 10.1021/np068054v. [DOI] [PubMed] [Google Scholar]
- 4.Xu W, Klumbys E, Ang EL, Zhao H. 2020. Emerging molecular biology tools and strategies for engineering natural product biosynthesis. Metab Eng Commun 10:e00108. doi: 10.1016/j.mec.2019.e00108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Palazzotto E, Tong Y, Lee SY, Weber T. 2019. Synthetic biology and metabolic engineering of actinomycetes for natural product discovery. Biotechnol Adv 37:107366. doi: 10.1016/j.biotechadv.2019.03.005. [DOI] [PubMed] [Google Scholar]
- 6.Atanasov AG, Zotchev SB, Dirsch VM, Orhan IE, Banach M, Rollinger JM, Barreca D, Weckwerth W, Bauer R, Bayer EA, Majeed M, Bishayee A, Bochkov V, Bonn GK, Braidy N, Bucar F, Cifuentes A, D’Onofrio G, Bodkin M, Diederich M, Dinkova-Kostova AT, Efferth T, El Bairi K, Arkells N, Fan TP, Fiebich BL, Freissmuth M, Georgiev MI, Gibbons S, Godfrey KM, Gruber CW, Heer J, Huber LA, Ibanez E, Kijjoa A, Kiss AK, Lu A, Macias FA, Miller MJS, Mocan A, Müller R, Nicoletti F, Perry G, Pittalà V, Rastrelli L, Ristow M, Russo GL, Silva AS, Schuster D, Sheridan H, the International Natural Product Sciences Taskforce , et al. 2021. Natural products in drug discovery: advances and opportunities. Nat Rev Drug Discov 20:200–216. doi: 10.1038/s41573-020-00114-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hawksworth DL, Honegger R. 1994. The lichen thallus: a symbiotic phenotype of nutritionally specialized fungi and its response to gall producers, p 77–98. In Williams MAJ (ed), Plant galls: organisms, interactions, populations. Clarendon Press, Oxford, UK. [Google Scholar]
- 8.Ahmadjian V. 1993. The lichen symbiosis. John Wiley and Sons, New York, NY. [Google Scholar]
- 9.Ahmadjian V. 1982. Algal/fungal symbioses, p 179–233. In Round FE, Chapman DJ (ed), Progress in phycological research. Elsevier Biomedical Press, Amsterdam, Netherlands. [Google Scholar]
- 10.Calchera A, Dal Grande F, Bode HB, Schmitt I. 2019. Biosynthetic gene content of the “perfume lichens” Evernia prunastri and Pseudevernia furfuracea. Molecules 24:203. doi: 10.3390/molecules24010203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Goga M, Elečko J, Marcinčinová M, Ručová D, Bačkorová M, Bačkor M. 2020. Lichen metabolites: an overview of some secondary metabolites and their biological potential, p 175–209. In Merillon JM, Ramawat K (eds), Co-evolution of secondary metabolites. Springer, Cham, Switzerland. [Google Scholar]
- 12.Boustie J, Grube M. 2005. Lichens—a promising source of bioactive secondary metabolites. Plant Genet Resour 3:273–287. doi: 10.1079/PGR200572. [DOI] [Google Scholar]
- 13.Barbero M, Artuso E, Prandi C. 2018. Fungal anticancer metabolites: synthesis towards drug discovery. Curr Med Chem 25:141–185. doi: 10.2174/0929867324666170511112815. [DOI] [PubMed] [Google Scholar]
- 14.Stanojković T. 2015. Investigations of lichen secondary metabolites with potential anticancer activity, p 127–146. In Lichen secondary metabolites: bioactive properties and pharmaceutical potential. Springer International Publishing, New York, NY. [Google Scholar]
- 15.Ingelfinger R, Henke M, Roser L, Ulshöfer T, Calchera A, Singh G, Parnham MJ, Geisslinger G, Fürst R, Schmitt I, Schiffmann S. 2020. Unraveling the pharmacological potential of lichen extracts in the context of cancer and inflammation with a broad screening approach. Front Pharmacol 11:1322. doi: 10.3389/fphar.2020.01322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Molnár K, Farkas E. 2010. Current results on biological activities of lichen secondary metabolites: a review. Z Naturforsch C J Biosci 65:157–173. doi: 10.1515/znc-2010-3-401. [DOI] [PubMed] [Google Scholar]
- 17.Kim W, Liu R, Woo S, Bin Kang K, Park H, Yu YH, Ha H-H, Oh S-Y, Yang JH, Kim H, Yun S-H, Hur J-S. 2021. Linking a gene cluster to atranorin, a major cortical substance of lichens, through genetic dereplication and heterologous expression. mBio 12:e0111121. doi: 10.1128/mBio.01111-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kealey JT, Craig JP, Barr PJ. 2021. Identification of a lichen depside polyketide synthase gene by heterologous expression in Saccharomyces cerevisiae. Metab Eng Commun 13:e00172. doi: 10.1016/j.mec.2021.e00172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Armaleo D, Sun X, Culberson C. 2011. Insights from the first putative biosynthetic gene cluster for a lichen depside and depsidone. Mycologia 103:741–754. doi: 10.3852/10-335. [DOI] [PubMed] [Google Scholar]
- 20.Singh G, Armaleo D, Dal Grande F, Schmitt I. 2021. Depside and depsidone synthesis in lichenized fungi comes into focus through a genome-wide comparison of the olivetoric acid and physodic acid chemotypes of Pseudevernia furfuracea. Biomolecules 11:1445. doi: 10.3390/biom11101445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Van Santen JA, Kautsar SA, Medema MH, Linington RG. 2021. Microbial natural product databases: moving forward in the multi-omics era. Nat Prod Rep 38:264–278. doi: 10.1039/d0np00053a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bachmann BO, Van Lanen SG, Baltz RH. 2014. Microbial genome mining for accelerated natural products discovery: is a renaissance in the making? J Ind Microbiol Biotechnol 41:175–184. doi: 10.1007/s10295-013-1389-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hoffmeister D, Keller NP. 2007. Natural products of filamentous fungi: enzymes, genes, and their regulation. Nat Prod Rep 24:393–416. doi: 10.1039/b603084j. [DOI] [PubMed] [Google Scholar]
- 24.Keller NP. 2019. Fungal secondary metabolism: regulation, function and drug discovery. Nat Rev Microbiol 17:167–180. doi: 10.1038/s41579-018-0121-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brakhage AA. 2013. Regulation of fungal secondary metabolism. Nat Rev Microbiol 11:21–32. doi: 10.1038/nrmicro2916. [DOI] [PubMed] [Google Scholar]
- 26.Pizarro D, Divakar PK, Grewe F, Crespo A, Dal Grande F, Lumbsch HT. 2020. Genome-wide analysis of biosynthetic gene cluster reveals correlated gene loss with absence of usnic acid in lichen-forming fungi. Genome Biol Evol 12:1858–1868. doi: 10.1093/gbe/evaa189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Abdel-Hameed M, Bertrand RL, Piercey-Normore MD, Sorensen JL. 2016. Putative identification of the usnic acid biosynthetic gene cluster by de novo whole-genome sequencing of a lichen-forming fungus. Fungal Biol 120:306–316. doi: 10.1016/j.funbio.2015.10.009. [DOI] [PubMed] [Google Scholar]
- 28.Nguyen K-H, Chollet-Krugler M, Gouault N, Tomasi S. 2013. UV-protectant metabolites from lichens and their symbiotic partners. Nat Prod Rep 30:1490–1508. doi: 10.1039/c3np70064j. [DOI] [PubMed] [Google Scholar]
- 29.Lohezic-Le Devehat F, Legouin B, Couteau C, Boustie J, Coiffard L. 2013. Lichenic extracts and metabolites as UV filters. J Photochem Photobiol B Biol 120:17–28. doi: 10.1016/j.jphotobiol.2013.01.009. [DOI] [PubMed] [Google Scholar]
- 30.Buçukoglu TZ, Albayrak S, Halici MG, Tay T. 2013. Antimicrobial and antioxidant activities of extracts and lichen acids obtained from some Umbilicaria species from central Anatolia, Turkey. J Food Process Preserv 37:1103–1110. doi: 10.1111/j.1745-4549.2012.00811.x. [DOI] [Google Scholar]
- 31.Posner B, Feige GB, Huneck S. 1992. Studies on the chemistry of the lichen genus Umbilicaria hoffm. Z Naturforsch C J Biosci 47:1–9. doi: 10.1515/znc-1992-1-202. [DOI] [Google Scholar]
- 32.Narui T, Culberson CF, Culberson WL, Johnson A, Shibata S. 1996. A contribution to the chemistry of the lichen family umbilicariaceae (Ascomycotina). Bryologist 99:199–211. doi: 10.2307/3244550. [DOI] [Google Scholar]
- 33.Davydov EA, Peršoh D, Rambold G. 2017. Umbilicariaceae (lichenized Ascomycota) – trait evolution and a new generic concept. Taxon 66:1282–1303. doi: 10.12705/666.2. [DOI] [Google Scholar]
- 34.Singh G, Calchera A, Schulz M, Drechsler M, Bode HB, Schmitt I, Dal Grande F. 2021. Climate‐specific biosynthetic gene clusters in populations of a lichen‐forming fungus. Environ Microbiol 23:4260–4275. doi: 10.1111/1462-2920.15605. [DOI] [PubMed] [Google Scholar]
- 35.Armaleo D. 1995. Factors affecting depside and depsidone biosynthesis in a cultured lichen fungus. Crypt Bot 5:14–21. [Google Scholar]
- 36.Martinet L, Naômé A, Deflandre B, Maciejewska M, Tellatin D, Tenconi E, Smargiasso N, De Pauw E, Van Wezel GP, Rigali S. 2019. A single biosynthetic gene cluster is responsible for the production of bagremycin antibiotics and ferroverdin iron chelators. mBio 10:e01230-19. doi: 10.1128/mBio.01230-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wasil Z, Pahirulzaman KAK, Butts C, Simpson TJ, Lazarus CM, Cox RJ. 2013. One pathway, many compounds: heterologous expression of a fungal biosynthetic pathway reveals its intrinsic potential for diversity. Chem Sci 4:3845–3856. doi: 10.1039/c3sc51785c. [DOI] [Google Scholar]
- 38.Chen L, Yue Q, Zhang X, Xiang M, Wang C, Li S, Che Y, Ortiz-López FJ, Bills GF, Liu X, An Z. 2013. Genomics-driven discovery of the pneumocandin biosynthetic gene cluster in the fungus Glarea lozoyensis. BMC Genomics 14:339. doi: 10.1186/1471-2164-14-339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Leuckert C. 1985. Probleme der Flechten-Chemotaxonomie – Stoffkombinationen und ihre taxonomische Wertung. Ber Dtsch Bot Ges 98:401–408. [Google Scholar]
- 40.Davydov EA, Blum OB, Kashevarov GP, Grakhov VP. 2019. Umbilicaria subpolyphylla Oxner: the correct name for U. iberica Sancho & Krzewicka and its bipolar distribution pattern. Lichenologist 51:205–220. doi: 10.1017/S002428291900015X. [DOI] [Google Scholar]
- 41.Rubio-Piña JA, Zapata-Pérez O. 2011. Isolation of total RNA from tissues rich in polyphenols and polysaccharides of mangrove plants. Electron J Biotechnol 14. [Google Scholar]
- 42.Merges D, Dal Grande F, Greve C, Otte J, Schmitt I. 2021. Virus diversity in metagenomes of a lichen symbiosis (Umbilicaria phaea): complete viral genomes, putative hosts and elevational distributions. Environ Microbiol 23:6637–6650. doi: 10.1111/1462-2920.15802. [DOI] [PubMed] [Google Scholar]
- 43.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
- 44.Qin M, Wu S, Li A, Zhao F, Feng H, Ding L, Ruan J. 2019. LRScaf: improving draft genomes using long noisy reads. BMC Genomics 20:955. doi: 10.1186/s12864-019-6337-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, Ruscheweyh H-J, Tappu R. 2016. MEGAN community edition - interactive exploration and analysis of large-scale microbiome sequencing data. PLoS Comput Biol 12:e1004957. doi: 10.1371/journal.pcbi.1004957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bradnam KR, Fass JN, Alexandrov A, Baranay P, Bechner M, Birol I, Boisvert S, Chapman JA, Chapuis G, Chikhi R, Chitsaz H, Chou W-C, Corbeil J, Del Fabbro C, Docking TR, Durbin R, Earl D, Emrich S, Fedotov P, Fonseca NA, Ganapathy G, Gibbs RA, Gnerre S, Godzaridis É, Goldstein S, Haimel M, Hall G, Haussler D, Hiatt JB, Ho IY, Howard J, Hunt M, Jackman SD, Jaffe DB, Jarvis ED, Jiang H, Kazakov S, Kersey PJ, Kitzman JO, Knight JR, Koren S, Lam T-W, Lavenier D, Laviolette F, Li Y, Li Z, Liu B, Liu Y, Luo R, MacCallum I, MacManes MD, et al. 2013. Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species. Gigascience 2:10. doi: 10.1186/2047-217X-2-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 48.Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T. 2019. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res 47:W81–W87. doi: 10.1093/nar/gkz310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Palmer J, Stajich J. 2019. Funannotate v1.7.4. Zenodo doi: 10.5281/ZENODO.2604804. [DOI] [Google Scholar]
- 50.Borodovsky M, Lomsadze A. 2011. Eukaryotic gene prediction using GeneMark.hmm-E and GeneMark-ES. Curr Protoc Bioinformatics CHAPTER 4:Unit-4.6.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. 2005. InterProScan: protein domains identifier. Nucleic Acids Res 33:W116–W120. doi: 10.1093/nar/gki442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, Mende DR, Letunic I, Rattei T, Jensen LJ, Von Mering C, Bork P. 2019. EggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Huerta-Cepas J, Forslund K, Coelho LP, Szklarczyk D, Jensen LJ, Von Mering C, Bork P. 2017. Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol 34:2115–2122. doi: 10.1093/molbev/msx148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, von Heijne G, Nielsen H. 2019. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat Biotechnol 37:420–423. doi: 10.1038/s41587-019-0036-z. [DOI] [PubMed] [Google Scholar]
- 55.Katoh K, Kuma K, Toh H, Miyata T. 2005. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res 33:511–518. doi: 10.1093/nar/gki198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Drummond AJ, Ashton B, Buxton S, Cheung M, Cooper A, Duran C, Field M, Heled J, Kearse M, Markowitz S, Moir R, Stones-Havas S, Sturrock S, Thierer T, Wilson A. 2011. Geneious. 5.4. Biomatters, Auckland, New Zealand. http://www.geneious.com. [Google Scholar]
- 57.Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
- 58.Miller MA, Pfeiffer W, Schwartz T. 2010. Creating the CIPRES science gateway for inference of large phylogenetic trees, p 1–8. In Proceedings of the Gateway Computing Environments Workshop (GCE). IEEE Press, Piscataway, NJ. [Google Scholar]
- 59.Letunic I, Bork P. 2021. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res 49:W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sullivan MJ, Petty NK, Beatson SA. 2011. Easyfig: a genome comparison visualizer. Bioinformatics 27:1009–1010. doi: 10.1093/bioinformatics/btr039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Wang Y, Kim JA, Cheong YH, Koh YJ, Hur JS. 2012. Isolation and characterization of a non-reducing polyketide synthase gene from the lichen-forming fungus Usnea longissima. Mycol Progress 11:75–83. doi: 10.1007/s11557-010-0730-1. [DOI] [PubMed] [Google Scholar]
- 62.Braesel J, Fricke J, Schwenk D, Hoffmeister D. 2017. Biochemical and genetic basis of orsellinic acid biosynthesis and prenylation in a stereaceous basidiomycete. Fungal Genet Biol 98:12–19. doi: 10.1016/j.fgb.2016.11.007. [DOI] [PubMed] [Google Scholar]
- 63.Navarro-Muñoz JC, Selem-Mojica N, Mullowney MW, Kautsar SA, Tryon JH, Parkinson EI, De Los Santos ELC, Yeong M, Cruz-Morales P, Abubucker S, Roeters A, Lokhorst W, Fernandez-Guerra A, Cappelini LTD, Goering AW, Thomson RJ, Metcalf WW, Kelleher NL, Barona-Gomez F, Medema MH. 2020. A computational framework to explore large-scale biosynthetic diversity. Nat Chem Biol 16:60–68. doi: 10.1038/s41589-019-0400-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Kautsar SA, Blin K, Shaw S, Navarro-Muñoz JC, Terlouw BR, Van Der Hooft JJJ, Van Santen JA, Tracanna V, Suarez Duran HG, Pascal Andreu V, Selem-Mojica N, Alanjary M, Robinson SL, Lund G, Epstein SC, Sisto AC, Charkoudian LK, Collemare J, Linington RG, Weber T, Medema MH. 2020. MIBiG 2.0: a repository for biosynthetic gene clusters of known function. Nucleic Acids Res 48:D454–D458. doi: 10.1093/nar/gkz882. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Table S1. Download spectrum.00109-22-s0001.xlsx, XLSX file, 0.01 MB (11.7KB, xlsx)
Table S2. Download spectrum.00109-22-s0002.xlsx, XLSX file, 0.2 MB (186.6KB, xlsx)
Data Availability Statement
This whole-genome shotgun project has been deposited at GenBank under the accession number PRJNA820300. The versions described in this paper are the versions JALILQ000000000 to JALILY000000000. The lichen samples of eight Umbilicaria spp. and Dermatocarpon miniatum have been deposited under BioSample accession numbers SAMN27294873 to SAMN27294881. The mycobiont samples have been deposited under BioSample accession numbers SAMN26992773 to SAMN26992781. The antiSMASH files of Umbilicaria spp. and Dermatocarpon miniatum and the candidate gyrophoric acid cluster .gbk files are available at figshare (https://doi.org/10.6084/m9.figshare.19625997).





