Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2024 Nov 7;22(11):e3002862. doi: 10.1371/journal.pbio.3002862

Microbial occurrence and symbiont detection in a global sample of lichen metagenomes

Gulnara Tagirdzhanova 1,¤, Paul Saary 2, Ellen S Cameron 2,3, Carmen C G Allen 1, Arkadiy I Garber 4, David Díaz Escandón 1, Andrew T Cook 1, Spencer Goyette 1,5, Veera Tuovinen Nogerius 6, Alfredo Passo 7, Helmut Mayrhofer 8, Håkon Holien 9, Tor Tønsberg 10, Lisa Y Stein 1, Robert D Finn 2,*, Toby Spribille 1,*
Editor: Nancy A Moran11
PMCID: PMC11542873  PMID: 39509454

Abstract

In lichen research, metagenomes are increasingly being used for evaluating symbiont composition and metabolic potential, but the overall content and limitations of these metagenomes have not been assessed. We reassembled over 400 publicly available metagenomes, generated metagenome-assembled genomes (MAGs), constructed phylogenomic trees, and mapped MAG occurrence and frequency across the data set. Ninety-seven percent of the 1,000 recovered MAGs were bacterial or the fungal symbiont that provides most cellular mass. Our mapping of recovered MAGs provides the most detailed survey to date of bacteria in lichens and shows that 4 family-level lineages from 2 phyla accounted for as many bacterial occurrences in lichens as all other 71 families from 16 phyla combined. Annotation of highly complete bacterial, fungal, and algal MAGs reveals functional profiles that suggest interdigitated vitamin prototrophies and auxotrophies, with most lichen fungi auxotrophic for biotin, most bacteria auxotrophic for thiamine and the few annotated algae with partial or complete pathways for both, suggesting a novel dimension of microbial cross-feeding in lichen symbioses. Contrary to longstanding hypotheses, we found no annotations consistent with nitrogen fixation in bacteria other than known cyanobacterial symbionts. Core lichen symbionts such as algae were recovered as MAGs in only a fraction of the lichen symbioses in which they are known to occur. However, the presence of these and other microbes could be detected at high frequency using small subunit rRNA analysis, including in many lichens in which they are not otherwise recognized to occur. The rate of MAG recovery correlates with sequencing depth, but is almost certainly influenced by biological attributes of organisms that affect the likelihood of DNA extraction, sequencing and successful assembly, including cellular abundance, ploidy and strain co-occurrence. Our results suggest that, though metagenomes are a powerful tool for surveying microbial occurrence, they are of limited use in assessing absence, and their interpretation should be guided by an awareness of the interacting effects of microbial community complexity and sequencing depth.


Metagenomes are increasingly used to study lichen symbionts. This study reassembles 400 available metagenomes, recovering 1000 metagenome-assembled genomes (MAGs), and providing the most detailed survey to date of bacteria in lichens. Functional annotation of bacterial, fungal and algal MAGs suggests interdigitated vitamin prototrophies and auxotrophies, implying a novel dimension to microbial cross-feeding in lichen symbioses.

Introduction

In some biological systems, unrelated organisms have evolved interactions so stable and integrated that they appear to function, to the casual observer, as one. This phenomenon was discovered over 150 years ago in lichens [1,2]. Long thought to constitute a single organism, lichens were revealed to be a tightly integrated relationship of a fungus and phototrophic alga and/or cyanobacterium (collectively often called “photobionts”), in which fungi are thought to receive sugar alcohols or glucose from the photobiont [3,4]. Subsequent research has revealed the pairings currently classified under the umbrella of “lichens” to have arisen multiple times in both fungal and photobiont evolution [5,6], to engage in different schemes of metabolic exchange [79], and to involve, to varying degrees, additional constitutively associated organisms [1014].

All symbioses currently classified as lichens conform to a few basic patterns. All are ectotrophic [15] and all symbionts—including algae [16]—are partially or wholly osmotrophs. Participating microbes exchange goods and services across shared cell wall contacts and mucilages [17], making lichens essentially highly structured biofilms [18]. The lichen fungal symbiont (LFS), which is not known to produce a free-living vegetative life stage in nature and never reproduces outside of symbiosis [4,9], has been shown to produce, together with its mucilage, about an order of magnitude more biomass than the photobiont [17]. LFS species likewise far outnumber those of the photobiont. Given their asymmetrical biomass and diversity, the LFS is typically accorded the role that in other symbiotic systems has been called the host. Most eukaryotic photobionts, by contrast, are regularly detected in the free-living state in surrounding environment, reproduce almost exclusively outside of the lichen symbiosis [9], and like cyanobacterial photobionts are widely shared among LFSs [19]. Although much about the biology of the disparate photobiont lineages remains unclear, especially with respect to what they receive from the fungus, their population structuring vis-à-vis their LFS partners is consistent with the definition of an open symbiosis [20].

The only universally recognized organismal associate of lichen fungi, and central to countless definitions of “lichen” [21,22], is the photobiont, the presence of which is determined today, no different than 150 years ago, by visual inspection under a microscope. No consensus exists however on how many other symbionts might be involved, and since most other associated microbes lack visible chlorophyll, their detection is not trivial. Researchers as early as Maria Cengia Sambo in the 1920s cultured bacteria from lichens, leading to the speculation that lichens could be “polysymbioses” of more than 2 partners [23]. Culture-based screening of lichens in the search for functional bacterial contributors to lichen symbioses continued into the 21st century [24,25], and the detection of associated strains and potentially shared metabolic products led Grube and Berg [10] to begin referring to these microbes as “bacteriobionts.” However, though hundreds of bacterial strains have been detected, overall few lichen symbioses have been censused for their bacterial composition, with most work focused on the model symbiosis involving the ascomycete fungus Lobaria pulmonaria (reviewed by Grimm and colleagues [13]). Similarly, early culture-based detections of basidiomycete yeasts from lichens have been followed by metatranscriptome studies and PCR screening suggesting high frequency occurrence, especially in lichens involving fungi from the family Parmeliaceae [12]. All of these approaches have been limited in their ability to capture the totality of lichen microbial composition by the inherent biases of microscopic detectability, culturability, the use of specific primers, or, in the case of metatranscriptomes used to study basidiomycete yeasts, selection of eukaryotic mRNA via poly-A tailing [12].

The shotgun-based DNA sequencing approach underlying metagenomics would appear to offer a solution to surveying total organismal composition in lichens owing to its theoretical ability to conduct an unbiased census of genomes, much as it has in other experimentally recalcitrant biological systems [26]. Hundreds of lichen metagenomes have been published in the last decade in the context of the study of lichen microbial composition [2532] and recovery and analysis of the LFS genome [3337], and these metagenomes have collectively begun to be mined for their protein diversity [38,39]. The recent use of lichen metagenomes to argue both for [14,27,29,30,31,32] and against [28,32] specific organismal compositions of lichen samples has raised questions about the overall organismal information content of currently available lichen metagenomes, their uses and their limitations. In particular, since no broad-scale survey of lichen metagenome content has been undertaken, underlying assumptions regarding their ability to capture basic lichen organismal composition, such as known symbionts, have not been tested.

In the present study, we reassembled and analyzed 456 lichen metagenomes to produce metagenome-assembled genomes (MAGs) and facilitate the mapping of the overall organismal composition, in a geographically wide sample of lichen symbioses drawn from 12 different studies. We complemented these with 24 additional, newly generated metagenomes that had been sequenced more deeply than the average publicly available data. We mapped organismal composition of broad taxonomic groups based on individual MAGs as well as detection of unambiguous rRNA in assemblies and raw reads. We gave special emphasis to testing for recovery of the canonically recognized photobionts, as well as putative basidiomycete symbionts and high-frequency bacterial lineages. We also annotated and compared the occurrence of core functional biosynthetic pathways in highly complete prokaryotic and eukaryotic MAGs and explored potential cross-feeding patterns in the small number of metagenomes in which multiple complete MAGs were recovered together.

Results

Metagenomic assembly and binning yield 1,000 MAGs

We started with a data set of 456 lichen metagenomes from 12 different published sources [14,27,28,3037,40] (Fig 1 and S1 Table), plus 24 metagenomes generated de novo (S2 Table). This data set included representatives of most major lichen groups (S1 Fig). Although the majority of the metagenomes were generated from samples collected in North America, 6 continents were represented in total (S2 Fig). We began by removing identical metagenomes (not biological replicates) that have been published twice in the sequence databases (n = 43; S3 Table). Next, we assembled and binned each metagenome individually, obtaining 17,390 bins (Fig 1A, see Methods). We obtained 1,142 MAGs above the quality threshold QS50 [39] (hereafter “QS50 MAGs”). Since different genotypes of the same organism can occur in multiple samples, we dereplicated the MAG set to obtain species-level representatives, defined at 95% average nucleotide identity (ANI; hereafter “species”).

Fig 1. Bioinformatic pipeline used in the study.

Fig 1

(A) Flowchart of the bioinformatic analysis of this study. (B) Sankey plot showing the source of 437 metagenomes used in the analysis and the progressive reduction of the dataset in the course of the study. LFS, lichen fungal symbiont; MAG, metagenome-assembled genome.

The final set included 1,000 QS50 MAGs: 674 bacterial, 294 fungal, and 32 algal (S4 Table). For each of these taxonomic groups, we constructed phylogenomic trees (Fig 2) based on sets of genes shared across all MAGs found in the same taxonomic same taxonomic group, thereby contextualizing the relationship of the MAGs to each other and to reference genomes (listed in S5 Table). Bacterial taxonomy was assigned according to the Genome Taxonomy Database (GTDB [41]). Fungal MAGs came from 7 taxonomic classes (Fig 2A), with the large majority from the class Lecanoromycetes (n = 260), followed by Eurotiomycetes (n = 15), Dothideomycetes (n = 5), Arthoniomycetes (n = 6), Lichinomycetes (n = 4), Tremellomycetes (n = 3), and Cystobasidiomycetes (n = 1) and were assigned in a curation process following [42]. In addition to LFSs, we recovered 22 non-LFS fungal MAGs, including 4 from the basidiomycete classes Cystobasidiomycetes and Tremellomycetes that have been postulated to be involved in some lichen symbioses (S4 Table). Three clades were recovered of Trebouxiophyceae algae, which include major lichen photobionts (Fig 2B). Bacteria came from 16 phyla, with 42% of species-level lineages (n = 282) coming from Proteobacteria and 23% (n = 153) coming from Acidobacteriota. Cyanobacteria (a photobiont group) were represented by 27 species. The rest came from: Actinobacteriota (n = 69), Bacteroidota (n = 47), Verrucomicrobiota (n = 29), Armatimonadota (n = 23), Chloroflexota (n = 21), Deinococcota (n = 6), Eremiobacterota (n = 4), Bdellovibrionota (n = 3), Myxococcota (n = 3), Patescibacteria (n = 3), Planctomycetota (n = 2), Chlamydiota (n = 1), and Firmicutes (n = 1) (Fig 2C).

Fig 2. Maximum likelihood phylogenetic trees of the MAGs.

Fig 2

The trees are calculated using IQ-TREE and are based on alignments of marker genes. Clade labels and color bars indicate broad groups the MAGs were assigned to. Reference genomes (interleaved) are not colored or labeled. (A) Tree of recovered fungal MAGs (colored sectors) interleaved with reference genomes (listed in S5 Table), based on 709 BUSCO genes. (B) Tree showing recovered algal MAGs interleaved with reference genomes based on 1296 BUSCO genes. Bars represent the number of occurrences of the MAG in Dataset 1. (C) Tree of the bacterial MAGs obtained from lichen metagenomes interleaved with reference genomes based on 120 marker genes from GTDB-Tk. The 4 most frequent bacterial families are denoted by colored sectors. Radiating bars represent the number of occurrences for each given MAG in Dataset 1. Reference genomes belong to bacteria previously isolated from lichens and are shown as numbers: 1: Lichenicola cladoniae PAMC 26569; 2: Lichenicoccus roseus KEBCLARHB70R; 3: Lichenihabitans minor RmlP026; 4: Lichenihabitans ramalinae RmlP001; 5: Lichenihabitans psoromatis PAMC 29128 and PAMC 29148; 6: Lichenifustis flavocetrariae BP6-180914; 7: Methylobacterium planeticum YIM 132548; 8: Aureimonas leprariae YIM 132180; 9: Rubellimicrobium rubrum YIM 131921; 10: Paracoccus lichenicola YIM 132242; 11: Polymorphobacter megasporae PAMC 29362; 12: Subtercola lobariae CGMCC 1.12976; 13: Luteimicrobium album NBRC 106348; 14: Streptomyces lichenis LCR6-01; 15: Nakamurella leprariae YIM 132084. Full versions of the phylogenomic trees are available in FigShare (10.6084/m9.figshare.27054937). GTDB, Genome Taxonomy Database; MAG, metagenome-assembled genome.

Mapping MAG occurrences reveals a few bacterial lineages to be disproportionately frequent in metagenomes

Given the known propensity of lichens with different LFSs to share, e.g., the same photobiont species, we expected that the same species could occur in multiple metagenomes. We accordingly mapped occurrences of the 95% ANI “species” by read mapping each metagenome against the dereplicated MAGs, where we counted a species as present if over 50% of the MAG was covered (hereafter referred to as “MAG recovery”). MAG mapping resulted in a stepwise reduction of the overall data set of metagenomes used for analysis (Fig 1B). In 62 of the metagenomes, no MAG could be recovered that passed the mapping thresholds outlined above, leaving 375 metagenomes with at least 1 recoverable MAG (Dataset 1). Of these, 348 contained at least 1 fungal MAG. The placement of these fungal MAGs in phylogenomic trees allowed us to identify 18 metagenomes where the taxonomy of the fungal MAG did not match the name of the lichen given in the NCBI metadata associated with the sample, indicating that the sample was likely misidentified (S6 Table). The MAGs from these 18 metagenomes were excluded from the final tree (Fig 2A) and occurrence heatmaps (see below), but included in the overall MAG counts. This left 330 metagenomes that contained a verified MAG of the LFS (Dataset 2). Echoing the taxonomic breakdown of fungal MAGs, the 330 metagenomes represented in Dataset 2 were made from lichen symbioses involving LFSs from 5 putative origins of lichen symbiosis within Ascomycota, including Arthoniomycetes (n = 6), Dothideomycetes (n = 3), Lichinomycetes (n = 3), Eurotiomycetes (n = 6), and Lecanoromycetes (n = 312).

The mapping of MAG occurrences also allowed us to reconstruct the frequency of individual MAGs across Dataset 1. As expected, most fungal MAGs occurred only once, except in a few cases where several metagenomes of the same lichen symbiosis, or several symbioses with closely related LFSs, were included. For algae and bacteria, we plotted occurrence frequency for each MAG as a column on its respective tip in the phylogenomic tree (Fig 2B and 2C). In algae, the most frequently occurring MAG, from the genus Trebouxia, was found 16 times (S4 Table). Bacterial MAG occurrences were highly skewed, with more than 50% of bacterial occurrences deriving from 2 phyla (Protobacteria and Acidobacteriota) and specifically 4 bacterial families, Acetobacteraceae, Beijerinckiaceae, Sphingomonadaceae, and Acidobacteriaceae. Major occurrence spikes can be seen in the genus Lichenihabitans in the Beijerinckiaceae (occurred in 99 metagenomes, with 1 species detected in 52 metagenomes); the clade provisionally assigned in GTDB to the genus-level lineage LMUY01 in the Acetobacteraceae (occurred in 50 metagenomes, with 1 species detected in 28 metagenomes); and CAHJXG01 (Acetobacteraceae), which occurred as various species in 58 metagenomes (S7 Table). High occurrence-count genera outside these 4 families included Nostoc in the Cyanobacteria and CAHJWO01 in Verrucomicrobiota (S7 Table). Each high frequency bacterial lineage derived from multiple of the 12 studies from which metagenomes were used, and at minimum was present in data from 3 independent studies produced by different investigators (S8 Table; see also S3 Fig).

To explore how lineages co-occur within lichen samples, we built co-occurrence network graphs (S4 Fig), using the occurrence matrix. We defined co-occurrence as an instance of 2 lineages occurring together in one metagenome. For this analysis, we focused on the groups that are known to stably occur in lichens (algae, Cyanobacteria, and Cystobasidiomycetes and Tremellomycetes fungi), and on the most frequent bacterial groups (most frequent genera of Acetobacteraceae, Beijerinckiaceae, and Acidobacteriaceae). Only metagenomes that yielded an LFS MAG (n = 330; Dataset 2) were included in this analysis.

Known lichen photobionts are underrepresented as MAGs, but detectable in metagenomic raw reads and metagenomic assemblies

In the next analysis, we compared MAG occurrences against a priori known photobiont occurrence patterns and major clades of “host” LFSs. Specifically, we plotted frequencies for the major photobiont lineages Trebouxiophyceae, Ulvophyceae, and Cyanobacteria as proxies for the 3 most commonly assigned photobiont categories in lichen research (“trebouxioid,” “trentepohlioid,” and “cyanobacteria”). In addition, we plotted data for 2 basidiomycete fungal groups recovered as MAGs in the initial binning exercise and widely discussed as putative lichen symbionts (Cystobasidiomycetes and Tremellomycetes), and the 4 highest frequency bacterial families based on MAG mapping in the previous section. The purpose of this analysis was 2-fold: first, it would allow us to assess the frequency at which MAGs were recovered from known symbionts—photobionts—visible by microscopy; and second, it could reveal if any microbial lineages exhibit co-occurrence patterns with certain photobiont or LFS clades. MAGs of known photobionts are recovered at a low rate compared to their known occurrences (Fig 3). MAGs of known photobionts occurred together with the LFSs with which they were expected to occur, but likewise at low frequency compared to expectations (Fig 3). MAGs of basidiomycete fungi were recovered in only a few metagenomes. The 4 bacterial families that account for most MAGs are recovered at varying frequencies as MAGs across most lichen symbioses, mapped both by LFS and by the major photobiont group (Fig 3A).

Fig 3. Organismal occurrence by major microbial group.

Fig 3

Heatmaps showing occurrence rates of mapped MAGs, and detection of canonical symbionts and major additional organismal groups using rRNA in assemblies and raw read sets. (A) For class-level taxa against major LFS (lichen fungal symbiont) groups and major photobiont groups; (B) for 9 of the 13 key genus-level bacterial groups against major LFS groups and major photobiont groups. We grouped metagenomes by the photobiont that is expected in a given lichen based on literature sources (see Supporting information for Spribille and colleagues [9]). Only metagenomes from the Dataset 2 (n = 330) are included in this analysis, and only groups represented by 4 or more metagenomes are shown. The number of metagenomes in each category is shown in the “n” column. The data underlying this figure can be found in S1 Data. LFS, lichen fungal symbiont; MAG, metagenome-assembled genome.

We suspected that the low frequency of MAG recovery of photobionts compared to their known occurrences was due to sequencing depth being insufficient to yield photobiont MAGs. To address this limitation, we screened raw read sets of each metagenome for small subunit (SSU) rRNA sequences for all of the same organismal groups for which we had plotted MAG frequency. We also examined SSU rRNA detection at the level of assembly, as we considered that this could represent a conservative estimate. SSU rRNA detection finds the 3 main known photobiont groups Trebouxiophyceae, Ulvophyceae, and Cyanobacteria in most assemblies and read sets (Fig 3A). We detected SSU rRNA genes from major photobiont lineages in all major LFS groups they were expected to occur in, as well as, unexpectedly, in many in which they are not recognized photobionts. As expected, assemblies tended to have lower detection rates than raw read sets, but even with assemblies, photobionts were detected in many lichens they are not considered to occur in. Based on read sets, the Trebouxiophyceae were found in almost all lichen metagenomes (>99%), including in lichens considered to have only a cyanobacterial or only ulvophycean photobiont. Similarly, Cyanobacteria were detected in about half of metagenomes of lichens considered to have only Trebouxiophyceae or only Ulvophyceae as photobiont (n = 154 out of 278 and 9 out of 20, respectively, based on Dataset 2), and Ulvophyceae were detected in 21% (n = 59 out of 278) and 43% (n = 10 out of 23) of metagenomes of lichens considered to have only Trebouxiophyceae or Cyanobacteria as the sole photobiont, respectively. Similarly, the fungal groups Cystobasidiomycetes and Tremellomycetes occurred at much higher frequencies in assemblies (8% and 15%) and read sets (31% and 64%), respectively, than as MAGs. Although some trends are visible, we did not perform statistics on relative occurrence frequencies owing to uneven sequencing depth across the data set (see Discussion), but some patterns of presence correspond to expectations, such as the high frequency of Cyanobacteria associated with the LFS order Peltigerales.

Bacteria also exhibit a higher detection rate as SSU rRNA than as MAGs. At the level of the detection of SSU rRNA, Acetobacteraceae were detected in assemblies and read sets of nearly all metagenomes, followed in frequency by Beijerinckiaceae, Acidobacteriaceae, and Sphingomonadaceae (Fig 3A). Few patterns relative to major LFS clade or photobiont group are visible, with the exception of high frequency occurrences of Sphingomonadaceae associated with Cyanobacteria and several LFS orders both with (Peltigerales) and without (e.g., Caliciales) recognized cyanobacterial photobionts.

To probe another level of taxonomic resolution, we also mapped 9 of the top 13 bacterial genera from which we selected representatives for annotation (see below; Fig 3B). Four GTDB-classified “genera” (CAHJXG01, Acetobacteraceae gen. sp., CAHJWO01 and CAHJWL01) could not be screened using SSU rRNA as they are currently only represented by MAGs that lack rRNA annotations. As expected, Nostoc is recovered at a high occurrence rate in LFS groups known to associate with it as a photobiont, as well as in cyanobacterial symbioses; the few cases of cyanobacterial symbioses where Nostoc was not detected involve cases where LFSs associate with other cyanobacterial genera, such as Scytonema or Stigonema. Of the mapped genera, Lichenihabitans is the most frequently recovered MAGs and detected in assemblies and read sets as SSU rRNA. More detailed taxonomic co-occurrence tables are provided in S7 Table.

The publicly available metagenomes used for this survey span a wide range of sequencing depth, measured as sequenced base pairs (S1 Table). Plotting MAG recovery against sequencing depth (Fig 4A and S9 Table) shows a positive relationship, suggesting that no inferences about true absence can be drawn from the numerous shallowly sequenced metagenomes.

Fig 4. Organism detection as a function of sequencing depth.

Fig 4

(A) Number of recovered MAGs as a function of sequencing depth (bp). Each dot represents a metagenome colored based on the recovery of the LFS (lichen fungal symbiont) and the photobiont. The curve indicates a generalized additive model (GAM) smoothing. The data underlying this figure can be found in S9 Table. (B) Detection of the key microbial groups in lichen metagenomes based on 2 methods of screening: presence of MAGs and presence of the SSU rRNA in the raw, unassembled reads as a function of sequencing depth. Each vertical bar represents a single metagenome, with the bar heights representing sequencing depth and the color representing the screening outcome. The metagenomes are sorted along the x-axis based on the outcome of the screening and the sequencing depth. The data underlying this figure can be found in S1 Data. GAM, generalized additive model; LFS, lichen fungal symbiont; MAG, metagenome-assembled genome; SSU, small subunit.

When detection of MAGs and SSU rRNA of the 7 mapped groups used in Fig 3 is plotted against sequencing depth, it becomes evident that some detections happen frequently at low sequencing depth, while in contrast, in numerous cases lack of detection persists despite high sequencing depth. The highest rate of MAG recovery was for the bacterial families Acetobacteraceae, Acidobacteriaceae, and Beijerinckiaceae (Fig 4B). For these 3 families, we recovered MAGs in many metagenomes even with low sequencing depth. Far fewer MAGs were recovered at low sequencing depth for the eukaryotes (Trebouxiophyceae or either of the basidiomycete groups). Overall, Trebouxiophyceae and Acetobacteraceae were the most frequently detected as SSU rRNA in raw reads, each having 99% (n = 374 and 372 respectively; based on the Dataset 1) occurrence frequency (Fig 4B), followed by Beijerinckiaceae and Acidobacteriaceae with 97% and 95% (S10 Table). The only cases where SSU rRNA was not detected for Trebouxiophyceae and Acetobacteraceae was in metagenomes with low sequencing depth, suggesting that the occurrence of these 2 taxon groups cannot be ruled out in those metagenomes. The 2 basidiomycete classes Tremellomycetes and Cystobasidiomycetes were detected in about two thirds and one third of read sets, respectively (n = 234 and 112, respectively; Fig 4B). For these, we failed to detect SSU rRNA in numerous deeply sequenced metagenomes (>10 Gbp; mapped as “not detected” in Fig 4B) suggesting that these lichens genuinely lack these taxa.

Recently published data validate high frequency of 4 bacterial families

In light of new lichen metagenomes being constantly deposited by researchers, we validated the occurrence patterns of high-frequency bacterial families in 243 additional metagenomes not included in the initial data set (S11 Table). These derived from 4 taxonomic groups of fungal “hosts”: Lichinomycetes (n = 17, as circumscribed by Díaz-Escandón and colleagues [43]), Dothideomycetes (n = 3), Arthoniomycetes (n = 2), and Lecanoromycetes (n = 221). Of the Lecanoromycetes, 157 were derived from just 3 LFS genera owing to inclusion of studies that included replicated sampling of those lichens (Cladonia, n = 51; Rhizoplaca, n = 73; and n = 33 from the species Lecanora polytropa; S11 Table) and 62 came from replicates of lichens involving just 2 species of LFS. Of the newly generated metagenomes, 95% (231 of 243) come from LFS families already included in the initial data set. Screening raw unassembled reads for SSU rRNA sequences for these data and tabulating occurrence frequencies of major taxa yielded results consistent with our data set (S12 Table): the same 4 bacterial families have the highest prevalence, and the prevalence of Acetobacteraceae reaches 97.1%. Lichenihabitans was detected in 67.5% of these metagenomes.

None of the non-cyanobacterial bacteria are predicted to fix nitrogen and most are carbon heterotrophs

The availability of numerous new MAGs offered the possibility to compare predicted metabolic capabilities of the highest frequency bacteria in and on lichens, irrespective of whether they are ultimately found to be constitutive components of the symbiosis. We rank-ordered bacterial genera by the number of occurrences and selected the 13 most frequent genera, which together accounted for 37% of bacterial lineages and 53% of bacterial occurrences, occurring in a total of 252 metagenomes. Next, we annotated all MAGs assigned to these genera that were near complete genomes (≥95% complete and ≤10% contaminated) (Fig 5A and S13 Table), for a total of 63 out of 250 MAGs (detected in 122 metagenomes). We focused on 5 selected metabolic capabilities that have been addressed in previous work on bacteria from lichens: (1) nitrogen fixation; (2) carbohydrate modification and transport, as well as modification of exogenous carbon sources; (3) anoxygenic photosynthesis; (4) iron scavenging and provisioning; and (5) cofactor synthesis.

Fig 5. Functional annotation of key bacterial MAGs.

Fig 5

(A) Selecting MAGs for annotation. The bar graph on the left shows the total number of occurrences per bacterial “genus” (after GTDB), with genera listed in decreasing order. Red bars represent the genera selected for functional annotation, which were the 13 most frequent genera together accounting for 53% of all bacterial occurrences. The waffle graph on the right shows the most frequent bacterial genera and MAGs assigned to them. Red circles represent MAGs selected for annotation: all MAGs from the selected genera that had >95% completeness. Filled gray circles represent MAGs with >95% completeness that belonged to less frequent genera and were not annotated. The open gray circles represent MAGs with ≤95% completeness, also not annotated. (B) Presence of selected pathways and protein complexes in the MAGs of most common lichen bacteria. Each column represents one of the 63 annotated MAGs, grouped by their taxonomy. We reconstructed pathways using KEGG; to detect carotenoid BGCs, we used antiSMASH. Here, we show the presence of pathways and protein complexes potentially relevant to the symbiosis. For 3 pathways (biosynthesis of bacteriochlorophyll, biotin, and cobalamin), we also show partial completeness (allowing one missing gene). (C) Number of genes assigned to each CAZy class per MAG. We annotated CAZymes in the MAGs selected for an in-depth annotation using dbcan. The data here are grouped on the family level. The CAZy classes are: Auxiliary Activities (AA), Carbohydrate-Binding Modules (CB), Carbohydrate Esterases (CE), Glycoside Hydrolases (GH), Glycosyl Transferases (GT), and Polysaccharide Lyases (PL). The data underlying this figure can be found in S1 Data. GTDB, Genome Taxonomy Database; MAG, metagenome-assembled genome.

The involvement of non-cyanobacterial bacteria in nitrogen fixation is one of the oldest hypotheses in lichen microbiology. Hyphomicrobiales (Rhizobiales) bacteria in lichens have been postulated to be involved in nitrogen fixation since the 1930s [44], and in recent decades several authors (Liba and colleagues [45], Grube and colleagues [11], Hodkinson and Lutzoni [46], Almendras and colleagues [47]) reported detecting nifH from lichen bacteria. However, we did not recover nifH, the key gene required for fixing nitrogen, in any non-cyanobacterial MAGs. To account for the possibility that nifH is present on a plasmid and therefore failed to be detected in MAGs passing threshold, we searched for nifH across all metagenomic assemblies using multiple queries (S14 Table). Furthermore, the inability to detect nifH was not an artifact, as we recovered it using tblastn with high frequency in Hyphomicrobiales genomes that we surveyed from Genbank, and that we used to construct a phylogenomic tree to contextualize the placement of lichen-derived Hyphomicrobiales (S5 Fig). We found only 1 non-cyanobacterial nifH, from Hyphomicrobiales, in one metagenome. By contrast, nearly all analyzed lineages possess the ammonium transporter amtB (Fig 5B). In addition, in 3 families (Acetobacteraceae, Beijerinckiaceae, and Nostocaceae), the majority of studied MAGs had genes of the urea transporter urtABCDE and urea metabolism ureABC (Fig 5B). Most lineages possess various amino acids transporters: uivGFHKM for branched-chain amino acids, aapJMPQ for L-amino acids, and GltIJKL for glutamate/aspartate (Fig 5B).

In addition, we examined iron scavenging, briefly suggested by [48] to be a role played by lichen bacteria, and a survival function of both host-associated and unassociated bacterial communities [49,50]. Gene clusters potentially involved in siderophore biosynthesis were rare in all bacterial groups except Cyanobacteria. Even though every analyzed MAG had genes related to siderophore transport (tonB-dependent receptors), these genes are not exclusively connected to, and cannot be viewed as evidence of, siderophore uptake [51]. Instead, the majority of annotated MAGs (78%; n = 49) encoded iron ion transporters (S15 Table), suggesting that dissolved iron is present and not a limiting nutrient in the system.

Lichen-associated bacteria have been reported to be involved in degradation of eukaryote-derived chitin and glucans [11] as well as hemicellulose and starches [52]. We found gene predictions consistent with these observations as well as suggesting that carbohydrate degradation capacities are unequally distributed among the major bacterial groups. The family Acidobacteriaceae had a markedly different set of predicted carbohydrate-active enzymes (CAZymes) compared to the other bacterial families included in the annotations. Acidobacteriaceae had twice as many glycoside hydrolases (GH) as the other bacterial families (average = 69 in Acidobacteriaceae, 22–34 in other families). Moreover, only in Acidobacteriaceae were GHs the dominant class of CAZymes (Fig 5C and S16 Table). Acidobacteriaceae possess several GH families targeting mannans, one of the dominant cell wall polysaccharides of LFSs: GH92 (mannosidase), GH125 (exo-alpha-1,6-mannosidase), GH38 (alpha-mannosidase), and GH76 (alpha-1,6-mannosidase/alpha-glucosidase). Also present at a higher frequency in Acidobacteriaceae were GH18 (chitinase), GH55 (beta-1,3-glucanase), and GH51 (endoglucanase, endoxylanase, cellobiohydrolase; S17 Table), all known to act on fungal polysaccharides.

To identify potential carbon sources for bacterial heterotrophs, we analyzed the presence of transporter genes. Acetobacteraceae and one genus of Beijerinckiaceae (Lichenihabitans) have a large arsenal of transporters for various monosaccharides (ribose/D-xylose transporter rbcABC, D-xylose xylFGH, multiple sugars chvE-gguAB, more rarely fructose frcABC and L-arabinose araFGH) and polyols (glycerol glpPQSTV, sorbitol/mannitol smoEFGK, xylitol xltABC, erythritol eryEFG, glucitol/sorbitol srlABE) (Fig 5B). Other studied bacterial families had fewer transporter systems, although Acidobacteriaceae had a glycerol transporter GLPF. Surprisingly, the second genus of Beijerinckiaceae, RH-AL1, did not possess any predicted sugar or polyol transporters, suggesting that these bacteria might not take up sugars. However, we cannot rule out sugar uptake, since RH-AL1 MAGs encoded several putative transporters with unknown function.

Eymann and colleagues [53] postulated, based on metaproteomic and taxonomic assignment data, that Hyphomicrobiales may be responsible for the catabolism of methanol or other C1 molecules possibly produced as a byproduct of secondary phenol synthesis by the LFS. Several lineages of Acetobacteraceae and Beijerinckiaceae exhibited annotations consistent with methylotrophy, the ability to use methanol as a carbon source. Methanol dehydrogenases (xoxF or mxaF) were detected in both Beijerinckiaceae genera with annotations (Lichenihabitans and RH-AL1), and 5 of 6 Acetobacteraceae genera (Fig 5B), which were present in at least 50 metagenomes. None of the studied bacteria appeared to use methane as none possess methane monooxygenase genes.

Among all annotated MAGs, 29 encoded biosynthetic gene clusters with similarity to known exopolysaccharide-producing clusters (S18 Table; based on annotations with Jaccard distance <0.7 estimated by SanntiS). Lineages with these clusters were detected in 61 metagenomes. In addition, MAGs of several lineages from Acetobacteraceae, Beijerinckiaceae, and Sphingomonadaceae encode a capsule polysaccharide transport system (Fig 5B).

Photosynthesis is widespread in some bacterial lineages

Pankratov and colleagues [54] recently reported bacteriochlorophyll a from the bacterial genus Lichenococcus (Acetobacteraceae) from a Cladonia lichen, suggesting a capacity for photosynthesis in non-cyanobacterial lichen bacteria. Fifteen of our annotated bacterial MAGs possessed both a complete set of anoxygenic photosystem II proteins (KEGG modules M00597 and M00165; pufABCML-puhA), and a bacteriochlorophyll biosynthesis pathway (acsF, chlBNL, bchCFGPXYZ). Many also contained carotenoid biosynthetic gene clusters (Fig 5B). All of these MAGs came from Acetobacteraceae and Beijerinckiaceae (including Lichenihabitans). This combination of pathways corresponds to a bacterial group known as aerobic anoxygenic phototrophs (AAPs [55]). The AAP profile is present in 6 of the 7 annotated Beijerinckiaceae MAGs, and in 9 of the 18 annotated Acetobacteraceae MAGs. The MAGs that possessed these predicted features were mapped as occurring in 56 of the 122 metagenomes for which any annotated bacterial MAG was available (15% of metagenomes from Dataset 1).

Pankatrov and colleagues also characterized lichen bacteria with both a partial [54] and complete [56] Calvin–Benson(-Bassham; CBB) pathway from lichen-derived Acetobacteraceae and Beijerinckiaceae, respectively, suggesting that at least some photosynthetic bacteria also fix carbon. Fourteen MAGs possessed a complete CBB cycle (Fig 5B and S1 File). These included all 11 cyanobacterial photobionts, which are known autotrophs, and 3 species in Acetobacteraceae. None of the remaining bacterial MAGs (n = 49) were predicted to encode a complete alternative carbon fixation pathway [57]: reductive citrate cycle (KEGG module M00173), 3-hydroxypropionate bi-cycle (M00376), hydroxypropionate-hydroxybutyrate cycle (M00375), dicarboxylate-hydroxybutyrate cycle (M00374), Wood–Ljungdahl pathway (M00377), or the phosphate acetyltransferase-acetate kinase pathway (M00579), and all are therefore likely to be carbon heterotrophs.

Most high frequency bacterial lineages are predicted prototrophs of some B vitamins

Bacteria have been postulated to synthesize essential cofactors required by algal symbionts [58], and bacterial-derived cofactors have been detected in a metaproteomic study of the lichen Lobaria pulmonaria [58,59]. A complete biotin synthesis pathway (complete module M00123 or the alternative M00950) were present in 43% of the analyzed bacterial MAGs (n = 27; Fig 5B and S1 File). None of the MAGs contained a complete thiamine biosynthesis pathway, but the thiamine salvage pathway was complete in several Acetobacteraceae and Beijerinckiaceae (KEGG Module M00899 or thiMDE [60]). Several Acetobacteraceae and 1 Sphingomonadaceae MAG encode a pathway for synthesis of cobalamin (vitamin B12; KEGG Module M00122). Taken together, the MAGs that possessed features of some form of B vitamin prototrophy were mapped as occurring in 99 of the 122 metagenomes for which any bacterial annotated MAG was available (26% of all metagenomes from Dataset 1).

B-vitamin auxotrophy is common in lichen fungi

The recovery of widespread biotin and cobalamin prototrophy in annotated bacterial MAGs, and the frequency with which corresponding bacterial lineages were mapped onto metagenomes, prompted us to annotate all high-quality eukaryotic MAGs derived from the present study to screen for potential auxotrophies. For this, we selected all nearly complete eukaryotic MAGs (≥95% complete and ≤10% contaminated), resulting in 9 algal MAGs, 71 LFS MAGs (including 3 LFS MAGs from misidentified samples) and 4 MAGs of non-LFS fungi (S2 File).

All 9 annotated lichen algal MAGs are predicted to possess at least partially complete pathways for thiamine synthesis and scavenging and biotin synthesis. Of them, 4 are predicted to have a complete pathway for thiamine synthesis and 7 to have a complete thiamine scavenging pathway (Fig 6). By contrast, 8 of the 9 algal MAGs lack most of the predicted components of cobalamin metabolism and one lacks all genes (Fig 6). The missing components correspond to 10 out of 11 enzymes in the module M00925 and 6 out 7 enzymes in the module M00122 (S2 File). To establish whether lichen algae actually need cobalamin, we screened their MAGs for the cobalamin-dependent methionine synthase gene metH, as well as its cobalamin-independent alternative metE. While five of the algal MAGs possessed only metE, the other 4 possessed both metH and metE (S19 Table). This result echoes that from Croft and colleagues [61], who suggested that some algae preferentially use more efficient metH in the presence of cobalamin, and switch to metE only when cobalamin is absent.

Fig 6. Completeness of modules involved in cofactor biosynthesis in the high-quality eukaryotic MAGs.

Fig 6

We selected all ≥95% complete and ≤10% contaminated eukaryotic MAGs, and used KEGG to determine completeness of pathways involved in synthesis of biotin, thiamine, and cobalamin. The trees on the left represent the phylogenomic trees constructed for the MAGs; only tips corresponding to the high-quality MAGs are shown. Color on the heatmap represents the percentage of present blocks in a given pathway. The data underlying this figure can be found in S1 Data. MAG, metagenome-assembled genome.

Fungal MAGs exhibit a mixed pattern of predicted thiamine synthesis and/or scavenging systems. Only 2 MAGs are predicted to have a complete thiamine biosynthesis pathway: an LFS from Arthoniomycetes and an early diverging lecanoromycete lineage in the Acarosporales (Fig 6). Thiamine scavenging pathway was at least partially complete in nearly all fungal MAGs, and a complete pathway was present in one third of MAGs (n = 25), which were scattered across the whole fungal tree (Fig 6). Notably, all 8 annotated members of the fungal order Peltigerales lack both thiamine synthesis and salvage pathways, with all components of the former missing except thiamine pyrophosphokinase (S2 File).

All 4 of the non-LFS fungal MAGs plus LFS from Eurotiomycetes, Arthoniomycetes, Dothideomycetes, and the early diverging lecanoromycete lineage Umbilicariales are the only fungal lineages to possess partial predicted biotin pathways. All other fungi, all of which are LFSs, lack any predicted component of biotin synthesis. No components of cobalamin synthesis were detected in the fungal MAGs and no cobalamin-dependent biochemistry is known from this group of fungi [62].

Fungal vitamin auxotrophs co-occur with algal and bacterial prototrophs

Thiamine and biotin are essential for life and since they do not appear to be synthesized by some (thiamine; Peltigerales) or most (biotin) LFSs, they will need to be offset from exogenous sources. Based on our annotations, candidates for providing these cofactors could include the alga, cyanobacterium or non-cyanobacterial bacteria, but this depends on the combination of MAG co-occurrences in the sampled lichens as represented by metagenomes. In our final analysis, we identified all lichen metagenomes from which at least 1 complete MAG each from the LFS, photobiont and an annotated non-cyanobacterial bacterium was present in ≥95% completeness and ≤10% contamination (from the set of 63, above), to assess in which combinations predicted auxotrophies and prototrophies complemented each other. Only 16 metagenomes fit these criteria (Figs 1B and 7). It should be noted that all of these metagenomes included additional MAGs that did not pass the quality threshold and therefore were not used. The 16 metagenomes include 8 LFSs from the order Lecanorales with Trebouxia photobionts, 3 from Peltigerales with a cyanobacterium only (“bipartite”) or an alga and cyanobacterium (“tripartite”) as photobionts, whereby in Lobaria pulmonaria only the alga was recovered as a complete MAG, and 5 additional LFSs from Rhizocarpales and Ostropomycetidae with Trebouxia or a member of Elliptochloris-clade as photobiont.

Fig 7. Co-occurrence of cofactor synthesis modules within lichen metagenomes.

Fig 7

Each panel represents a highly complete lichen metagenome (defined as a metagenome that contains the MAGs of the LFS, photobiont, and at least 1 high-frequency bacterium, all at ≥95% complete and ≤10% contaminated). Each column within a panel represents a MAG, labeled based on the inferred role it plays in the symbiosis and colored based on its taxonomic assignment. For each genome, we show the presence/absence and completeness of several KEGG modules for synthesis of biotin and thiamine. The data underlying this figure can be found in S1 Data. LFS, lichen fungal symbiont; MAG, metagenome-assembled genome.

The 4 metagenomes with fungal MAGs with thiamine and biotin auxotrophy co-occurred either with an alga with an at least partially complete thiamine and/or biotin pathway and non-cyanobacterial bacterium, or with a cyanobacterium with a partial thiamine synthesis/salvage pathway (Fig 7). An additional 7 metagenomes, in which the LFS MAG contained only a partial thiamine synthesis/salvage pathway, contained a thiamine prototrophic alga and/or bacterium. All 13 metagenomes with a biotin auxotrophic LFS co-occurred with both an algal and a bacterial biotin prototroph.

Discussion

The reanalysis of lichen metagenomes we present here is a case study in the paradoxes inherent in short read metagenomics data. On the one hand, the information content can be high, and, when MAGs can be assembled, they offer the possibility to extract data that allow the frequency, protein diversity and potential metabolic capacities to be characterized for organisms that have yet to be cultured and in many cases even seen. Our reanalysis of lichen metagenomes resulted in a large and to our knowledge previously unseen body of evidence on organismal presence in lichens: recovery of 1,000 MAGs, two-thirds of which are newly assembled bacterial MAGs; heretofore unavailable documentation of organismal presence in lichen thalli; and annotations and comparative analyses of metabolic potential in 63 bacterial MAGs for which completeness, at ≥95%, and contamination ≤10%, rival genomes derived from culture. On the other hand, current approaches to metagenomics with short reads run up against hard limits when MAGs for various reasons cannot be assembled. As our data also show, even the photobiont, which on account of its size and color can be seen by visual inspection alone, is not recovered as a usable MAG in most lichen metagenomes analyzed. The limitations that arise from assembly and detection of some MAGs place caveats on the ability to assess, with confidence, absence: absence of microbes and absence of predicted proteins. Our study is at its core an attempt to establish (a) what can and cannot be said about microbial occurrence in lichen metagenomes from current short read data, based on what we know is missing; and (b) identify microbes associated at high frequency with lichen symbioses that may represent promising candidates for further research on metabolic interactions.

Shortfalls in eukaryotic MAG assembly

Organismal occurrence analyses highlight how the combination of community complexity and inadequate sequencing depth can lead to underestimates of even known symbiont occurrence patterns, especially for eukaryotes. In 62 metagenomes—14% of the data set—we failed to recover a single MAG that passed minimum quality thresholds, including of the most abundant organism, the LFS. Even in cases of lichen symbionts that are visible, well-documented and in considerable cellular abundance, as is the case with photobionts, we were able to recover only a few QS50 MAGs, representing only a fraction of all known occurrences. We recovered more cyanobacterial MAGs than those from Trebouxiophyceae, and none from the trentepohlian photobionts of the Ulvophyceae.

The explanations for the assembly of so few QS50 MAGs and their subsequent low MAG recovery rate are at least 2-fold. First, the sequencing depth of the publicly available metagenomes spans 6 orders of magnitude, with a strong skew to shallowly sequenced metagenomes, which account for nearly all cases in which no LFS MAG was recovered (S6 Fig). Commonly applied sequencing depths and short reads are clearly insufficient to achieve QS50 MAGs for even commonly occurring, undisputed lichen symbionts, and this can be expected to impact assembly regardless of genome characteristics. Second, assembly quality and QC metrics are affected by both genome size and the propensity for closely related genomes to intermingle. The eukaryotic photobionts differ markedly from bacterial photobionts in this regard. The most common Trebouxiophycean genus, Trebouxia, with a haploid genome >50 MB in size [27], is well known to occur in individual thalli as a mixture of closely related strains [63,64], which can be expected to introduce graph ambiguities and result in assembly breaks. This may account for why the only MAG that was recovered frequently had a contamination level (i.e., duplicate single copy marker genes) well above the threshold for inclusion in downstream analyses. The most common Ulvophycean genus, Trentepohlia, from which no QS50 MAG was obtained, is thought to possess the largest genome of any canonical lichen symbiont, at >100 MB [65,66], is thought to be a polyploid [67,68], and has no published genome at all to date. The cyanobacterium Nostoc, by contrast, has smaller genomes than both of these eukaryotic genera, and is not known to occur in multiple strains in lichen thalli (C. Pardo de la Hoz, pers. comm., 2024). We recovered a single Nostoc MAG each in all lichens expected to have Nostoc as a photobiont in Dataset 2 (Fig 3) and only recovered 2 Nostoc MAGs in a single metagenome once, in a Phaeophyscia lichen not expected to have Nostoc as a photobiont; we assume these grew on the surface of the lichen. These 2 MAGs were phylogenetically divergent with ANI of 90.6%. Whether genome size and occurrence in single strains fully explain the higher MAG recovery rate is unclear. We note that Nostoc is also the only microbial component of lichen metagenomes that regularly occurs in higher coverage depth than the LFS (S7 Fig).

Screening assemblies and raw read sets of known SSU rRNA sequences dramatically increased the rate of positive hits from higher order taxonomic groups that correspond to those containing the photobionts, and using raw reads in particular returned hits for all lichen symbioses in which they are already known to occur. However, screening raw reads also returned hits in many symbioses in which they have never been documented using microscopy. For example, positive hits from Trebouxiophyceae were recovered in all but one of the 61 metagenomes from lichens assessed with classical microscopy as having only cyanobacteria or only Ulvophyceae. Cyanobacteria, for their part, were detectable in fully half of lichen symbioses in which only Trebouxiophyceae or only Ulvophyceae are the microscopically documented photobionts. These detections are unlikely to be false positives, as we used a method relying on first extracting all SSU rRNA sequences from the data and then classifying each sequence based on its similarity to the entries in the reference database that includes sequences from a wide range of organisms. To ensure reliability, the classification—and the taxonomic level at which the assignment is provided—were informed by the top 5 hits for each identified SSU rRNA sequence. Instead, we see 2 explanations as most likely to account for these occurrences. First, they could reflect the presence of algal cells in or on the exopolysaccharide mucilages that frequently coat or associate with lichen thalli [69]. The broad taxonomic categories we used (e.g., Trebouxiophyceae) include many free-living aeroterrestrial algae, and the cells in question are not necessarily photobionts in the strict sense. A second possibility is that the detected cells are indeed secondary photobionts of the sampled lichen. Separate methods, including visualizing cell location and cell-to-cell interactions, would be necessary to determine this with certainty, but we note that some secondary algae such as Coccomyxa viridis exhibit strong associations with lichens that otherwise contain, e.g., Trebouxia photobionts [70].

Similarly, for basidiomycete fungi reported to occur in lichens as yeasts (classes Cystobasidiomycetes and Tremellomycetes), we recovered only a few MAGs. Based on SSU rRNA screening, the detection rate increased to about one-third of metagenomes for Cystobasidiomycetes and two-thirds for Tremellomycetes. Using PCR screening, Cystobasidiomycetes were originally reported from 75% of macrolichens in which a fungus of the Parmeliaceae (“parmelioids”) constitutes the LFS [12]. Compared to this, the 41% detection rate from SSU rRNA screening of parmelioids in the present study is low. A possible explanation for this is different methods used—targeted PCR-based screening versus presence of rRNA in metagenomic libraries—and the inclusion of libraries with comparatively shallow sequencing depths in our data set. The detection rate is however much higher than the nine lichens in which they were detected in the 339 metagenomes of Lendemer and colleagues [28], which are part of this study. The low detection rate of basidiomycete yeasts in the latter study is almost certainly related to the shallow sequencing depths of many of the metagenomes used in that study, which included all 62 of the metagenomes we excluded due to recovery of no MAGs and 26 of 27 of the remaining metagenomes from which no LFS MAG could be recovered (see Fig 1B). In addition, low sequencing depth can be expected to affect the ability to capture even rRNA of organisms such as Cystobasidiomycetes occurring in low cellular abundance (in this case, typically ca. 1:100 depth coverage ratio to the LFS: S7 Fig).

What can and cannot be said about the bacterial fraction

By far the largest fraction of assembled MAGs is contributed by bacteria, with 674 non-redundant species MAGs. These MAGs, and their associated mapping data and annotations, potentially represent a data windfall for the study of lichen-associated bacteria. However, the bacterial composition of the extracted data can be expected to include not only any bacteria that have been postulated to play biologically meaningful roles in healthy lichens, but also any involved in degradation of senescing thalli, by-catch from the surrounding environment and contaminants introduced during sampling. Because the dataset is inherently heterogeneous, the limits of inference can be expected to differ depending on the questions being asked, the intrinsic attributes of the data sets themselves and basic assumptions about lichen systems. Our approach here was to recursively analyze progressively smaller datasets, defined by quality metrics (Fig 1B), and, as in our previous work [12,14], limit our analyses to the most frequently occurring lineages on the assumption that the likelihood that a lineage occurrence is attributable to contamination decreases as its detection frequency increases across independent samples.

At least 3 data set attributes broadly framed the limits of inference for our objectives. First, the collection of the original material was not guided by considerations about bacterial composition. This could be seen on the one hand as an advantage, as the sampling is blind, but on the other hand it also means that sampling sources (e.g., lobe tips or fruiting bodies), which could be expected to differ in bacterial composition, were not specified (except in [14,31,36,40] and the 24 new metagenomes). Second, the data set is taxonomically broad, meaning that specific bacterial lineages that may be important for certain lichen symbioses would occur in only low frequency. Third, some of the same constraints that affect the interpretation of the eukaryotic data, such as sequencing depth, also affect the bacterial set, with both MAG recovery frequency and quality likely to be lower in shallowly sequenced metagenomes. As a result of these 3 built-in constraints, whatever patterns that emerge would likely be a “lowest common denominator” across the broader swathe of sampled lichen symbioses.

We ultimately determined that the structure, sample handling and sequencing protocols underlying the data set would, with appropriate filtering, permit inferences regarding bacterial presence; sample-level taxonomic composition (as lichens were sampled blindly with respect to their bacterial composition); minimum overall frequency of occurrence (both as MAGs and 16S rRNA); predicted gene content in high-quality MAGs; and minimum occurrence of gene predictions in the sampled lichens. We also concluded that it would not permit inferences on absence and by extension any statistical inference on microbial occurrence or distribution of function across lichen symbioses. Even taking into consideration these limitations, our exploration of the bacterial data yielded a set of patterns that to us were unexpectedly structured and are to our knowledge novel:

  1. Four family-level lineages from 2 phyla accounted for as many bacterial occurrences in lichens as all other 71 families from 16 phyla combined.

  2. Among those 4 lineages, 2 network sharing motifs are visible: one in which extensive species-level sharing is evident (Acetobacteraceae and Beijerinckiaceae) and 2 in which few species are shared by more than 2 or 3 lichen metagenomes (Acidobacteriaceae and Sphingomonadaceae).

  3. The 4 lineages, when screened with 16S rRNA, are found in even more lichens, with the 2 families that exhibit extensive species-level sharing occurring in 99% (Acetobacteraceae) and 97% (Beijerinckiaceae) of metagenomes, respectively.

  4. The 4 lineages are recovered (as mappable MAGs; S8 Table) in the 4 data sets with more than 10 metagenomic libraries each produced for different studies in 3 different labs, as well as in more recently released data sets which we screened using 16S rRNA.

  5. The high frequency of these 4 families is consistent with previous results of amplicon sequencing of lichens involving, e.g., 9 LFS genera in Alaska [71], 4 species from Colorado [72], 7 species from Antarctica [73], and 7 genera sampled in the Colombian páramo [74].

  6. Some of the most frequently detected MAGs belong to bacterial genera that have either already been the focus of lichen microbiological studies or are close phylogenetic relatives of them. In our study, MAGs that cluster with LAR-1 [46,71], recently formally described as the genus Lichenihabitans [75,76] (Beijerinckiaceae, syn. Lichenibacterium [52], first validly published 8 months later [77]), were recovered in 99 metagenomes, with 1 individual MAG recovered in 52 metagenomes (Fig 2C) and accordingly shared among many lichen symbioses (S4 Fig). The second-most commonly detected bacterial “genus” that we recovered in the MAG set, tracked in GTDB as CAHJXG01, was previously known from a single MAG from a Peltigera lichen [30], but assembled in our study as multiple MAGs and recovered in 58 metagenomes (it could not be included in SSU rRNA screening as its SSU rRNA is absent from reference databases and not part of the MAG). Among other most frequently detected genera are Lichenicoccus and Lichenicola, both recently described genera cultured from lichens [54,78]. Another frequently occurring “genus,” LMUY01, is described from a soil sample in the Czech Republic, but it was previously classified as Acidisphaera and subsequently segregated as a distinct clade. In a study of Peltigera lichens, bacteria from this genus were shown to be enriched, while being present in lichen substrata only in small amounts [79]. Other bacterial lineages previously postulated to play a metabolic role in lichens, including Acidobacteriaceae and Sphingomonadaceae, were also nearly ubiquitous in rRNA screening. Sphingomonadaceae were also recovered as MAGs in nearly half of metagenomes from lichens with cyanobacterial photobionts, echoing previous results that showed a notably greater abundance of sphingomonads in cyanolichens than in those with algal photobionts [71,74].

The annotations of MAGs from the highest frequency bacterial genera largely corroborate predictions from the last 20 years based on metaproteomics from whole lichens and characterizations of isolated bacteria. An unexpectedly common feature, only recently characterized in a lichen bacterium by Pankratov and colleagues [54,80], is the capacity for anoxygenic photosynthesis. This is consistent with the profile of AAPs, a group of bacteria that has only recently begun to gain attention for its occurrence in diverse extreme environments [55]. The capacity for methylotrophy, previously identified in bacterial annotations from a Lobaria lichen [53], proved to be widespread. A notable exception to the corroboration of past claims is nitrogen fixation. Despite the history of claimed nitrogen fixation by lichen bacteria, it received no support as a general phenomenon from either annotations or a targeted pan-assembly search for the nifH gene. The cellular sources of nifH sequences obtained by other researchers [11,45,46,47] are impossible to reconstruct from our data, but we cannot rule out that nifH-containing bacteria might occur in low quantity fractions that would be detectable by PCR, or occur in lichens not included in our data (e.g., certain tropical lichens [46]).

Cofactor metabolism stands out in the lichen microbe data as one of the areas with obvious potential for bacterial-eukaryotic cross-feeding. Most lecanoromycete fungi, and by extension most LFSs, appear to be biotin and/or thiamine auxotrophs. Some of the most frequent bacteria, for their part, appear to be biotin prototrophs, and algae have partially complete predicted pathways for biotin and thiamine; in the few metagenomes where we have at least one complete MAG of each, both algae and bacteria appear to complement the fungus. Fungal vitamin B auxotrophy is not common knowledge in lichen biology, but it is not a new discovery. Several lichen researchers documented vitamin B requirements of about a dozen LFS species in the 1950s [81,82,83,84]. Bednar [83,84] documented biotin auxotrophy in detail in the LFS of the lichen Peltigera aphthosa. He found that its alga, Coccomyxa sp., liberated large amounts of biotin in statu symbiotico, and proposed that this was one of the underlying syntrophies of lichen symbiosis. It is possible that the only partial completeness of biotin synthesis we predicted for algae is an annotation artifact, or that more work needs to be done to understand biotin synthesis in algae, as well as in lichen-associated bacteria. In particular, we were not able to identify in our genomes the pathway for the synthesis of pimelate moiety, one of the biotin precursors. Enzymes responsible for this step are highly diverse [85], with alpha-proteobacteria in particular having a non-standard pimelate synthesis pathway [86]. Since this made large-scale screening challenging, in our analysis we focused instead on the modules M00123 and M00950, which contain enzymes that catalyze biotin ring assembly and are generally conserved [86].

Cobalamin synthesis is predicted for many of our high frequency bacteria and has been hypothesized to sustain cobalamin-auxotrophic algae in vivo [9]. However, none of the algal MAGs we queried unambiguously require cobalamin. Instead, they possess both metE and metH, suggesting that, if they function similarly to Chlamydomonas reinhardtii [61], lichen algae might be facultatively dependent on cobalamin for methionine synthesis.

What do our results mean for research on lichen-associated bacteria?

There is a long history of christening lichen bacteria as “symbionts,” from the 1920s and 1930s [23,44,87] up to the present [18,53,58,71]. The term “symbiosis” originally denoted the stable co-occurrence or “living together” of different species, regardless of whether the relationship is mutualistic, commensalistic or even parasitic [2], and one can surmise that this has been the sense in which the term has been applied in the lichen context. In practice, however, “symbiosis” has increasingly come to be applied to one segment of a wider spectrum of organismal co-occurrence phenomena in which the players perform some kind of required function. The remaining part of this spectrum is usually referred to as the microbiome, though it should be noted that many cross-feeding relationships (syntrophies) have been described from microbiome data [88] without necessarily referring to them as symbioses. Against this background, although bacteria are physiologically active and have been shown to produce multipurpose metabolites in the lichen milieu (reviewed by [13]), to our knowledge no studies have shown any metabolic function that locks the now-canonical fungal and photobiont symbionts into a dependency on bacteria, or vice versa.

Our analyses of the high-frequency bacterial fraction inject new data into this discussion. The high frequency occurrence at the family level of Acetobacteraceae and Beijerinckiaceae in lichen symbioses is arguably consistent with the broadest, classical definition of symbionts. Beyond this, however, our results are insufficient to advance a narrower, function-based case for bacterial symbiosis in lichens. Our predictions of biotin prototrophy in some high-frequency lichen bacteria and corresponding auxotrophy in lichen fungi suggests the potential of fungal dependency on these bacteria, but much will depend on mapping the constellations in which prototrophs and auxotrophs co-occur (or do not co-occur). Certainty regarding the fungal sourcing of biotin will require targeted studies that explore a range of possibilities. An alternative scenario that has already been proposed, albeit without considering a possible role for bacteria, is that algae cover the biotin needs of their fungal partners [83]. It is also possible that biotin-prototrophic bacteria occur in such small amounts that their contribution to eukaryotic metabolism is physiologically negligible. An additional and not mutually exclusive possibility is that biotin-prototrophic bacteria play a role in the poorly studied period of the fungal life cycle before spores and young mycelia pair with a photobiont. We note however that none of the metagenomes included in this survey were sampled from this life stage and we have no evidence that biotin-producing bacteria are associated with the LFS outside of lichens. In any case, if experiments confirm widespread LFS biotin auxotrophy, provisioning from any exogenous source would constitute an important and hitherto largely overlooked metabolic interaction in lichen symbioses that requires further study.

Several lines of evidence suggest that some of the high frequency bacterial lineages, for their part, also depend on metabolic subsidies or feed off the lichen. Most of the annotated bacteria are predicted thiamine auxotrophs or possess only a thiamine scavenging pathway, and could conceivably use algal or fungal thiamine or precursors; yet others are predicted biotin auxotrophs. Carbohydrate transporter and CAZyme annotations suggest that some bacteria tap into the abundant polyols of lichen symbioses and, especially for Acidobacteriaceae, may be involved in degradation of eukaryotic cell walls, as has been suggested before [89,90]. Taken together, these predictions could support the interpretation that some bacteria are commensals that make a living off the lichen environment.

Our approach to analyzing only the most frequently occurring bacteria with the highest quality MAGs yielded a narrow set of taxa. We consider the probability that these are contaminations to be minimal, as they closely mirror those found in previous studies, e.g., using metabarcoding, were recovered across multiple metagenome sequencing projects, and exhibit little to no overlap with the documented “kitome” taken up from laboratory sample handling [91]. Beyond the high frequency bacterial taxa we analyze here, we cannot rule out that lineages that occur at lower frequencies in our data sets, or were not fully recovered as high-quality MAGs, may ultimately deserve further study in the context of individual lichen “species.” An example is Actinobacteriota, which we recovered as MAGs at low frequency, but which have been highlighted, e.g., by Gonzales and colleagues [92], cultured from some lichens ([93], also see Fig 2C), and postulated to be rich in bioactive compounds [13]. Distinguishing a “lichen-relevant” bacterial fraction from contaminations becomes an increasingly non-trivial problem at lower occurrence frequencies, and at least 3 different solutions have been proposed to address this. One has been to wash lichen thalli with water, ethanol, and bleach prior to downstream use, a treatment borrowed from the study of endophytes [94,95]. A downside of this treatment is that the lichen exopolysaccharide layer is of variable composition [18] and thallus washes may dislodge more than just surficial microbes, based on our experience with warm water [31]. Another downside, in our view, is that it assumes, to our knowledge without testing, that whatever microbial fraction is lost is not metabolically relevant. A second approach is to compare microbial composition on lichens with those of surrounding environments. We think that much stands to be learned from environmental comparisons about microbial ecology, but here, too, we are not convinced that we know enough about lichen microbiology to know a priori that occurrence in a non-lichen environment means that a microbe can be ruled out as participating in the lichen system. In fact, it is well known that canonical eukaryotic lichen photobionts are commonly sampled in the environment (reviewed by [96]). This need not be disqualifying for lichen symbiosis if specific roles such as carbon fixation or vitamin provisioning are filled under an “It’s the Song, Not The Singer” model [97] in which the overall lifestyle of the microbe is thought to be less important than that it is compatible with—and performs a function in—the system, when present. A third approach, similar to our approach here of identifying high frequency bacterial species, has been to resample individual lichen symbioses from spatially or temporally independent sites by metagenomics or metagenomics coupled with custom PCR screening, as has been done for lichens involving the LFSs Alectoria sarmentosa, Bryoria fremontii, and Letharia vulpina (“secondary screening” sensu [9]). This approach however also has the downside that it emphasizes taxonomic identity over function and compatibility.

Conclusions

Lichens are notoriously recalcitrant to experimentation in the lab and their component symbionts are extraordinarily slow-growing, when they can be cultured at all. The ability to extract MAGs from metagenomes has been a game-changer for lichen research, but many questions remain. The survey of metagenomic data is an opportunity to take stock of their organismal content, evaluate their utility for microbial and symbiont detection, and extract a substantial body of new data. Novelties in microbial occurrence patterns and gene annotations aside, one of the more striking results from our survey is just how few metagenomes (16 of 437, after deduplication) yielded anything close to a set of MAGs that would allow even a cursory assessment of microbial interactions between at least one fungus, one photobiont, and one bacterium. Even in these metagenomes, the majority of MAGs were not of sufficient quality to use for comparison, and none included MAGs of basidiomycete fungi that widely occur as yeasts, meaning that even in the most deeply sequenced metagenomes, the metabolic predictions are potentially only a teaser of a larger story.

A leading factor limiting MAG recovery in metagenomes is sequencing depth. Published lichen metagenomes span 6 orders of magnitude of sequencing coverage, and skew low. Consequently, the data cannot be used at all for estimates of absence, and organismal occurrence frequencies are likely underestimates across the board. This is clearly visible for known photobionts and can be expected to disproportionately affect eukaryotes, with their larger and more complex genomes. Deeper sequencing will likely alleviate, though not fully resolve, this problem. Some of the problems of, e.g., MAG recovery and rRNA detection arise from the application of short read-based DNA sequencing on samples with specific biological attributes, such as the co-occurrence in a single sample of highly similar DNA sequences by virtue of multiple chromosome sets or multiple genotypes. As long read sequencing becomes more tractable, some of these problems may be resolved, such as removing issues associated with assembly and ambiguities introduced from binning approaches. Techniques such as dynamic adaptive sequencing [98] may further facilitate the sequencing of relatively simple communities, to obtain genomic sequences of even low abundance members of the community.

Methods

Ethics statement

Newly collected samples acquired at Cardinal Divide, Mountain Park and Kootenay Plains (Alberta, Canada) were collected under a collection permit issued on 09.07.2018 by Alberta Environment and Parks (permit number: N/A_2). No other areas required special permitting. The existing herbarium specimens from which all other newly sequenced metagenomes were derived (S2 Table) are deposited in the collections of Uppsala University, Sweden (UPS), the University of Bergen, Norway (BG), the University of Graz, Austria (GZU) and the Universidad Nacional de Comahue, Argentina (BCRU).

Data set construction

We analyzed a total of 480 lichen metagenomes. Data were obtained from ENA (S1 Table) and complemented with 24 metagenomes that were newly generated for this study. In the early stages of our analysis, we removed 43 metagenomes from the data set as they were identified as duplicates (S3 Table). To identify such metagenomes, we used sourmash v4.2.2 [99]; we used the “sourmash sketch” module to compute signatures and the “sourmash module” to compare individual metagenomes. This way, we identified 43 pairs of identical metagenomes; one metagenome from each pair was subsequently removed from the analysis. To generate new metagenomes, we collected lichen samples, froze them at –80°C, and pulverized them using a TissueLyser II (Qiagen). We extracted DNA from the samples with DNAEasy Plant Mini Kit (Qiagen) and prepared metagenomic libraries. The libraries were sequenced on different Illumina HiSeq platforms to paired-end reads. The details on the procedure, including voucher information, library prep, and sequencing are given in S2 Table.

Initial steps of metagenomic analysis

We started by assembling each metagenome individually and extracting MAGs from them. The metagenomic libraries were filtered using fastp [100] to remove adapters and low-quality bases and the READ_QC module of the metaWRAP pipeline v.1.2 [101] to remove human contamination. The filtered data were assembled with metaSPAdes [102]. Individual assemblies were binned using CONCOCT [103] and metaBAT2 [104]. To refine prokaryotic MAGs, we used the binrefine module of the metaWRAP pipeline and then evaluated all bins with CheckM v1.1.3 [105]. Next, we selected all bins that passed the QS50 threshold [106] and dereplicated them using dRep v3 [107] at 95% ANI and 30% AF (alignment fraction) thresholds in order to obtain species-level representatives. We did not undertake co-assembly as this can result in the merging of closely related species and we would have lost the relationship between sample and genome. Furthermore, given the number of metagenomes involved it would have been prohibitively expensive computationally to assemble all of the data and the diversity of the lichen genomic composition was not understood to facilitate selective sample co-assembly.

We obtained taxonomic assignments for prokaryotic MAGs using GTDB-Tk v1.5.0 [108], a tool based on the Genome Taxonomy DataBase (GTDB) [41]. In the text, we refer to the GTDB-defined lineages CAIMSN01 and VCDI0 as Lichenicola and Lichenicoccus, respectively, as they are referenced in the later GTDB release (R214). We generated a phylogenomic tree for all prokaryotic MAGs that passed the QS50 threshold. In the tree, we also included reference genomes from bacteria previously isolated from lichens (S5 Table). We used the marker gene alignment produced by GTDB-Tk (concatenated alignment of 120 loci). We generated the tree with IQ-TREE [109] using the model finder (selected model: LG+F+R10) and 1,000 bootstraps.

Eukaryotic MAGs were identified and refined with EukCC v2 [110]. Bins with a quality score of at least 50 were dereplicated with dRep at 2 levels: first, on the level of the individual binned metagenome (with the 99% ANI threshold), and second, on the level of the whole data set, where bins from all metagenomes were dereplicated at 95% ANI and 40% AF to create species-representative MAGs. For each MAG, we calculated the EukCC v2 and BUSCO v5 [111] quality scores. To analyze the relationship between sequencing depth and recovery of MAGs of the LFS and photobionts, we used the pre-dereplication MAG set.

To obtain preliminary taxonomy annotations for eukaryotic MAGs, we used BAT (CAT v5.2.3, database version: 20210107 [112]), which predicts taxonomy based on searching predicted genes against the NCBI database. These taxonomic assignments were refined using phylogenomics. We separated all eukaryotic MAGs into 2 groups: fungal and algal MAGs. To both groups we added reference genomes (S5 Table). To compute the fungal phylogenomic tree, we used the Phylociraptor pipeline v0.9.6 (https://github.com/reslp/phylociraptor), executing the following steps. In each genome, we identified BUSCO universal single copy orthologs shared by at least 10% genomes in the set. We aligned the sequences and trimmed the alignments. From these data, we produced 2 trees: a coalescence tree that was reconstructed by ASTRAL v5.7.1 [113] from the individual gene trees (produced by IQ-TREE v2.0.7 [109]), and a tree calculated from a concatenated alignment using IQ-TREE. We compared the coalescence and the concatenated phylogenies and reconciled detected discrepancies using signal base approximation in favor of the concatenated phylogeny. The algal phylogenomic tree was produced in the same way. The final trees were based on 1,296 genes in algae and 709 genes in fungi.

Occurrence analysis

To map MAG occurrence across the metagenomes, we aligned reads from all metagenomes against all MAGs using BWA-mem [114]. Next, we filtered the alignments using SAMtools [115] to remove secondary alignments. All MAGs that were at least 50% covered in a given metagenome were counted as present. Using these data, we constructed an occurrence matrix of MAGs in metagenomes. To estimate the depth of coverage of MAGs, we used the number of reads aligned to the MAG, multiplied by the read length and divided by the total length of the contigs assigned to the MAG.

In each metagenome, we identified the MAG of the LFS. To do that, we manually inspected all fungal MAGs present in a metagenome. If only one fungal MAG was present, it was labeled as putative LFS. If multiple fungal MAGs were present, we selected one as the LFS MAG based on its position on the tree and the one with the highest depth of coverage, since the MAG of the main, most abundant LFS is expected to have greater depth of coverage than a MAG from a different fungus. To confirm the LFS assignments, we checked that the MAG placement on the phylogenomic tree is consistent with the taxonomic assignment provided by the original data submitters in the NCBI metadata. If these did not match, we excluded these metagenomes from Dataset 2 as potentially derived from misidentified samples. In total, we identified 23 inconsistencies. In 3 metagenomes (SRR14722059, SRR14722135, and SRR14722098), the inconsistency was easily resolved by correcting the putative LFS assignment and giving it to a different fungal MAG present in the metagenome. In the additional 2 metagenomes (SRR14722289 and SRR14721950), NCBI metadata had inconsistencies within itself: the lichen name in the “organism” field did not match the name in the “library name.” Our taxonomic placement agreed with the latter, and therefore we suspected that the “organism” field was filled in incorrectly during data uploading. We retained these 2 samples, correcting their names to their “library names.” A total of 18 metagenomes presented inconsistencies that could not be resolved and were thus excluded (S6 Table).

All retained metagenomes were used with the name of the LFS as assigned in the original publication, with one exception, Lecidea scabridula Hedl. was described from Sweden by Hedlund [116] and is a distinct fungal taxon engaged in a biofilm-like lichen symbiosis that has appeared in taxonomic works for that region and was reported as new to North America by Spribille and Björk [117]. As the latter authors acknowledged, Lecidea scabridula Hedl. is an illegitimate name as it is a later homonym of Lecidea scabridula Müll. Arg; accordingly, the species represented by Lecidea scabridula Hedl. currently has no legitimate name. A nomenclatural and taxonomic revision has been in preparation for some time (T. Spribille and M. Svensson, in prep.) but remains uncompleted as of this writing. Based on information available at the time to TS, who provided the sample, Resl and colleagues [36] applied the name Bachmanniomyces sp. S44760 to a sample corresponding to L. scabridula from Alberta, Canada, as Bachmanniomyces was thought to represent the correct generic placement of the species. We however no longer consider this to be the case. To avoid any ambiguity, or the perception that the species is a lichenicolous fungus as other members of the genus Bachmanniomyces, we have reverted in the present paper to applying its de facto name Lecidea scabridula Hedl. to the metagenomic library sequenced from the sample T1894.

SSU rRNA gene-based screening

We searched metagenomic assemblies and raw, unassembled metagenomic data for the presence of the SSU rRNA gene of several lineages. This process consisted of 2 steps: the detection of bacterial SSU rRNA and eukaryotic SSU rRNA sequences, and their taxonomic assignment. These sequences were used for 2 reasons: first, they are the marker loci most frequently used for taxonomic profiling, and second, they tend to be present in multiple copies in a genome [118] and therefore have better chances of being recovered in a shallow metagenome. For the first step, we used Metaxa2 [119], a tool that uses an HMM-based searching algorithm followed by a taxonomic assignment via BLAST search against a SILVA database. For eukaryotic lineages, taxonomic placement was done through Metaxa2 as well. For bacteria, we used SSU rRNA sequences extracted by Metaxa2, to which we assigned taxonomic positions with IDTAXA [120], which allowed us to use taxonomy consistent with GTDB. Only metagenomes from the Dataset 1 were included in this analysis (n = 375; Fig 1B).

Identifying most frequent bacterial groups

We ranked bacterial groups based on their frequency, defined as the total number of occurrences across the data set. We summarized frequency on 4 taxonomic levels: species-level lineage, genus, family, and order. For the species-level lineages, we simply counted how many metagenomes they were detected in. For the higher taxonomic levels, we summed all occurrences of all lineages assigned to that group. If a MAG did not have a genus level assignment, we used its family-level annotations (e.g., Acetobacteraceae gen. sp.). In addition, we ranked higher-level taxonomic groups based on how many species-level lineages from this group were detected. When calculating the percentage of metagenomes, a given lineage was detected in, we only included metagenomes from the Dataset 1 (Fig 1B).

Co-occurrence analysis

To explore how lineages co-occur within lichen samples, we built co-occurrence network graphs (S4 Fig), using the occurrence matrix. We defined co-occurrence as an instance of 2 lineages occurring together in one metagenome. For this analysis, we focused on the groups that are known to stably occur in lichens (algae, Cyanobacteria, and Cystobasidiomycetes and Tremellomycetes fungi) and on the most frequent bacterial groups (most frequent genera of Acetobacteraceae, Beijerinckiaceae, and Acidobacteriaceae). Only metagenomes that yielded an LFS MAG (n = 330; Dataset 2) were included in this analysis.

Functional analysis: Bacteria

We annotated all bacterial MAGs using PROKKA v1.13 [121]. Predicted proteins in lichen metagenomes were clustered to the MGnify protein database [122] using the Linclust algorithm in mmseqs2 v13.45111 [123] at 90% coverage and 90% sequence identity. Next, we selected the MAGs of the most frequent lineages and annotated them in depth. To select the MAGs, we first ranked all bacterial genera based on the number of occurrences. For the MAGs that did not have a genus level assignment, we used family-level annotations. Next, we selected the MAGs assigned to the top 13 genera, and among them retained only MAGs with a completeness score above 95% and contamination score below 10%, as estimated by CheckM.

For the selected MAGs, we obtained functional annotations. We annotated predicted proteins against KEGG Orthology Database [124] using KofamScan [125]. The resulting KEGG ortholog (KO) assignments were used to estimate KEGG module completeness using ggkegg [126] with the KEGG module definitions outlined in the KEGG MODULE Database (www.genome.jp/kegg/module.html). We focused on several metabolic traits, which we expected to be relevant to the symbiosis:

  1. Carbon metabolism. We screened the MAGs for known carbon fixation pathways, including Calvin–Bensen path (KEGG module M00165), and 6 alternative pathways (see [55]): reductive citrate cycle (M00173), 3-hydroxypropionate bi-cycle (M00376), hydroxypropionate-hydroxybutylate cycle (M00375), dicarboxylate-hydroxybutyrate cycle (M00374), Wood–Ljungdahl pathway (M00377), or the phosphate acetyltransferase-acetate kinase pathway (M00579). We also searched for the genes related to C1 metabolism: methanol dehydrogenase (KEGG family K23995) and methane monooxygenase (K10946 and K16157). In Cyanobacteria, we were not able to find one of the enzymes of the Calvin–Bensen path, sedoheptulose 1,7-bisphosphatase. However, since its function can be performed in Cyanobacteria by fructose-1,6-bisphosphatase II [127], we still show the Calvin–Bensen path as complete.

  2. Nitrogen metabolism. We screened the MAGs for the presence of nitrogenase NifH (K02588; involved in nitrogen fixation) and urease (EC 3.5.1.5).

  3. Photosynthesis. We screened the MAGs for the proteins of Anoxygenic Photosystem II (M00597 and M00165; pufABCML-puhA), and for biosynthetic pathways for the photosynthetic pigments: bacteriochlorophyll (K04035, K04037, K04038, K04039, K11333, K11334, K11335, K11336, K11337, K04040, K10960; AcsF, ChlBNL, BchCFGPXYZ), and carotenoids (K02291, K10027, K09844, K09844, K09845, K09846).

  4. Transport systems. We annotated the following transport systems and transporters: sorbitol/mannitol transporter (K10227, K10228, K10229, K10111; SmoEFGK), urea transporter (K11959, K11960, K11961, K11962, K11963; urtABCDE), erythritol transporter (K17202, K17203, K17204; EryEFG), xylitol transporter (K17205, K17206, K17207; XltABC), inositol transporter (K17208, K17209, K17210; IatAP-IbpA), glycerol transporter (K17321 K17322, K17323, K17324, 17325; GlpPQSTV), fucose transporter (K02429; FucP), glycerol aquaporin transporter (K02440; GLPF), glucitol/sorbitol transporter (K02781, K02782, K02783; SrlABE), ammonium transporter (K03320; Amt), ribose transporter (K10439, K10440, K10441; RbcABC), xylose transporter (K10543, K10544, K10545; XylFGH), multiple sugar transporter (K10546, K10547, K10548; ChvE-GguAB), fructose transporter (K10552, K10553, K10554; FrcABC), arabinose transporter (K10537, K10538, K10539; AraFGH), branched-chain amino acid transporter (K01999, K01997, K01998, K01995, K01996; LivGFHKM), L-amino acid transporter (K09969, K09970, K09971, K09972; AapJMPQ), glutamate transporter (K10001, K10002, K10003, K10004; GltIJKL), capsular transporter (K10107, K09688, K09689; KpsMTE).

  5. Cofactors. We searched for the biosynthetic pathways of the following cofactors: biotin (M00123, M00577, and M00950), thiamine (M00899; thiamine salvage pathway), cobalamin (M00122), and riboflavin (M00125).

We used several tools to annotate groups of genes that are potentially informative to the symbiotic lifestyle. We used following tools: run_dbcan (standalone tool of dbcan2, v3.0.2, https://github.com/linnabrown/run_dbcan) for annotations of Carbohydrate-Active EnZymes (CAZymes), FeGenie [51] for the genes related to iron metabolism. We used Sanntis [128] and antiSMASH v6.1.0 [129] for biosynthetic gene clusters. Both tools predict BGCs from the genome sequences and annotate predicted BGCs by comparing them to the MiBIG database. We screened these annotations for 2 groups of interest: BGCs potentially producing carotenoids and BGCs potentially producing extracellular polysaccharides. While analyzing the antiSMASH results, we used only BGCs that had significant hits to BGCs from the MiBIG database according to the outputs of the KnownClusterBlast module. While analyzing the SanntiS results, we retained only annotations that had Jaccard distance score below 0.7. Our SanntiS results should be considered as preliminary, since they are based entirely on sequence similarity, and we did not further validate the annotations by taking into account the position of the domains or their copy number.

To account for the possibility that NifH is present on a plasmid and therefore failed to be included in the MAG during binning, we further searched NifH across all contigs of all metagenomic assemblies, using tBLASTn [130] and a NifH sequence from NCBI as a query (S14 Table). We extracted all hits with e-value below 1e-50 and checked their taxonomy using reciprocal blast search against the NCBI database using getLCA (https://github.com/frederikseersholm/getLCA). All hits were assigned to Cyanobacteria, with 1 exception: 1 metagenome (SRR14722280) contained a Hyphomicrobiales (Rhizobiales) hit on a low-coverage contig not assigned to any MAG isolated from this metagenome.

Functional analysis: Eukaryotes

We annotated high-quality eukaryotic MAGs (≥95% complete and ≤10% contaminated), using Funannotate v1.8.15 [131]. Briefly, each MAG was cleaned and sorted and used for gene prediction with Genemark-ES v4.62 [132], Augustus v3.3.2 [133], CodingQuarry v2.0 [134], GlimmerHMM v3.0.4 [135], and SNAP 2006-07-28 [136]. We annotated predicted proteins using InterProScan v5.42–78.0 [137] and KofamScan [125]. Resulting KEGG annotations were processed as described above.

To establish whether the algae in our data set are cobalamin auxotrophs, we followed Croft and colleagues [61] and screened the algal MAGs for a cobalamin-dependent methionine synthase MetH and another gene performing the same function—cobalamin-independent MetE. First, we selected algal MAGs with completeness >90% according to EukCC (n = 19). Next, we screened them using tBLASTn [130] and 2 protein sequences as a query (BAU71143.1 for MetH and BAU71146.1 for MetE). We confirmed the identity of the resulting hits by a reciprocal search against the NCBI database.

Loss of function in Hyphomicrobiales MAGs

Hyphomicrobiales (Rhizobiales) MAGs from our data set lacked several functions typical for bacteria from this order. To put these MAGs into the evolutionary context, we assembled a data set that included 518 previously published genomes across the whole order [138] (S20 Table) and a genome of Rhodobacter (GCF_009908265.2), which served as an outgroup. Using GTDB-Tk, we identified and aligned 120 marker genes. From this alignment, we generated a phylogenomic tree using IQ-TREE v2.1.2 [109]. We used tblastn to screen all Hyphomicrobiales MAGs for the same genes related to nitrogen fixation, methanotrophy, and methylotrophy (S14 Table). For the genomes from GenBank, we confirmed that the tblastn results were consistent with the protein annotations available at NCBI.

Screening of newly published metagenomic data

Our initial data set was built before October, 2021. To confirm that our results were consistent with more recently published metagenomes, we additionally reran our occurrence analysis on these new raw sequence reads. We queried the SRA using the rentrez package in R [139] with search terms chosen to capture all reads associated with lichens. Our terms included “Pezizomycotina” and “lichen metagenome” in the lichen taxonomy, or the terms “lichen” or “lichens” in any of the searchable fields. To reduce duplicating records previously found, we only searched for records submitted after October 1, 2021. Finally, to make these results comparable to our earlier assembled data set, we included only records that used a whole genome sequencing strategy and were sequenced on Illumina machines. This produced a list of 7,324 metagenomes potentially relevant for our analysis. We manually checked SRA metadata for each entrance to confirm that the metagenomes originated from lichen samples. Since cultured or excessively cleaned lichens may have lost their full diversity, we also confirmed via literature associated with the records that the lichen thallus was collected in the field and was not cleaned beyond removing debris. This reduced the candidate data set to 243 metagenomes generated since October 1, 2021. We confirmed no duplicate metagenomes were present using sourmash [99], comparing both the new and previously used metagenomes. No duplicate metagenomes were found in the final set of raw reads. To detect the presence of bacteria in the 243 new metagenomes, taxonomy was assigned to raw reads using the Metaxa2 [119] and IDTAXA [120] approach described in our SSU rRNA gene-based screening methods.

Data handling and visualization

Custom scripts used for data analysis and visualization were written in R v4.1.0 [140], using the following libraries: dplyr v1.0.8 [141], tidyr v1.2.0 [142], scales v1.1.1 [143], for data handling; ggplot2 v3.3.5 [144], ape v5.0 [145], phangorn v2.8.1 [146], phytools v1.0–3 [147], circlize v0.4.14 [148], qgraph v1.9.2 [149], treeio v1.16.2 [150], DECIPHER v2.14.0 [151], for data visualization. For visualizing phylogenetic trees, we also used iTOL [152].

Data and code availability

The sequencing data in this project are submitted to ENA: de novo generated raw data (study accession PRJEB59037), metagenomic assemblies (PRJEB72384, PRJEB72386-PRJEB72404, PRJEB72498- PRJEB72501), and MAGs (PRJEB77567). Phylogenomic trees in Newick format are available at FigShare (10.6084/m9.figshare.27054937). Custom scripts used for data analysis and visualization are available on GitHub (https://github.com/Spribille-lab/2024-Microbial-occurrence-in-lichen-metagenomes) and FigShare (10.6084/m9.figshare.27054937).

Supporting information

S1 Fig. Dot histogram of lichen metagenomes arranged by sequencing depth.

Each metagenome is shown with a dot; the dots are colored according to the taxonomy of the lichen fungal symbiont (LFS). The dots are arranged on the x-axis based on the sequencing depth (bp) and are “stacked” on top of each other. The data underlying this figure can be found in S1 Table.

(TIF)

pbio.3002862.s001.tif (1.6MB, tif)
S2 Fig. Sampling locations for the metagenomic data used in the analysis and the geographic distributions of most frequent bacterial genera.

Each dot represents a sample used for metagenomic sequencing. The basemap shapefile was taken from the rnaturalearthdata library (v0.1.0) and belongs to public domain (https://www.naturalearthdata.com/).

(TIF)

pbio.3002862.s002.tif (1.1MB, tif)
S3 Fig. Geographic distribution of selected groups of symbionts, based on 3 methods of screening: presence of MAGs, presence of SSU rRNA gene in the metagenomic assemblies, and presence of the SSU rRNA in the raw, unassembled reads.

Each dot represents a sample used for metagenomic sequencing. Here are shown data on the 4 most frequent bacterial families, on the genus Lichenihabitans, and on the 3 eukaryotic lineages known to be stably associated with lichens. The basemap shapefile was taken from the rnaturalearthdata library (v0.1.0) and belongs to public domain (https://www.naturalearthdata.com/).

(TIF)

pbio.3002862.s003.tif (5.6MB, tif)
S4 Fig. Co-occurrence networks of lichen symbionts based on presence-absence of MAGs in each metagenome.

Each node is a MAG, and edges represent the co-occurrence of MAGs within 1 metagenome; the thicker the edge, the more often 2 MAGs co-occur. Nodes are colored based on the taxonomy and function of the symbiont; in each network, yellow nodes represent MAGs of the LFS (lichen fungal symbiont). Only data on metagenomes that yielded an LFS MAG are shown. (A) Co-occurrence of LFSs, other known eukaryotic symbionts, and Cyanobacteria. (B) Co-occurrence of LFSs and Lichenihabitans. (C) Co-occurrence of LFSs and most frequent genera of Acetobacteraceae. (D) Co-occurrence of LFSs and most frequent genera of Sphingomonadaceae. (E) Co-occurrence of LFSs and most frequent genera of Acidobacteriaceae. The data underlying this figure can be found in S1 Data.

(TIFF)

pbio.3002862.s004.tiff (243.7KB, tiff)
S5 Fig. Maximum likelihood phylogenetic tree of Hyphomicrobiales (Rhizobiales).

The tree includes published genomes of Hyphomicrobiales and the Hyphomicrobiales MAGs derived from the lichen metagenomes (indicated in red). We generated the alignment of 120 marker genes using GTDB-Tk and calculated the tree using IQ-TREE. The color represents family-level taxonomic assignment. We used tblastn to search the genomes for key genes involved in nitrogen fixation and C1 metabolism. The presence of these genes is indicated with symbols. The full-size version of the tree in the Newick format is available at FigShare (10.6084/m9.figshare.27054937).

(TIF)

pbio.3002862.s005.tif (2.5MB, tif)
S6 Fig. Recovery of MAGs of the main 2 symbionts as a function of sequencing depth.

These graphs are based on the pre-dereplication set of MAGs, each dot represents a metagenome and is positioned based on its sequencing depth and on whether it contained MAGs assigned to one or both of the 2 main partners: (A) the LFS (lichen fungal symbiont); (B) the photobiont partner; (C) both the LFS and the photobiont partner. The data underlying this figure can be found in S1 Data.

(TIFF)

pbio.3002862.s006.tiff (425.3KB, tiff)
S7 Fig. Relative abundances of symbionts in lichen metagenomes.

The relative abundances were calculated by dividing the coverage depth of the symbiont MAG by the coverage of the LFS MAG. Here are shown data on the 13 most frequent bacterial genera and the eukaryotes known to be stably associated with lichens. The red line shows 1:1 ratio, where the symbiont is estimated to have the same cellular abundance as the main fungal symbiont. The boxplot elements are defined as: center line, median; box limits, 25th and 75th percentiles; whiskers, 1.5× interquartile range. The data underlying this figure can be found in S1 Data.

(TIFF)

pbio.3002862.s007.tiff (896KB, tiff)
S1 Data. Excel spreadsheet containing, in separate sheets for each figure, the underlying data used for Figs 3, 4B, 5A, 5B, 5C, 6, 7, S4, S6, and S7.

(XLSX)

pbio.3002862.s008.xlsx (347.9KB, xlsx)
S1 File. KEGG module completeness for prokaryotic MAGs selected for analysis.

(PDF)

pbio.3002862.s009.pdf (93.3KB, pdf)
S2 File. KEGG module completeness for eukaryotic MAGs selected for analysis.

(PDF)

pbio.3002862.s010.pdf (1.3MB, pdf)
S1 Table. Metagenomic data used in the analysis.

(XLSX)

pbio.3002862.s011.xlsx (45.1KB, xlsx)
S2 Table. Details on the metagenomes generated de novo for this study.

(XLSX)

pbio.3002862.s012.xlsx (15.9KB, xlsx)
S3 Table. Pairs of identical metagenomes (sourmash similarity = 1) identified during analysis.

Using sourmash, we identified the pairs of identical metagenomes, potentially deriving from one data set having been submitted to NCBI twice under different accession numbers. This table contains all such pairs and shows which of the pair was retained for the further analysis.

(XLSX)

pbio.3002862.s013.xlsx (12.8KB, xlsx)
S4 Table. Metagenome-assembled genomes generated from the lichen metagenomic data.

For prokaryotic MAGs, contamination and completeness scores are based on the CheckM results, and taxonomic assignments are based on GTDB. For eukaryotes, contamination and completeness scores are based on the EukCC results, and taxonomic assignments are based on the BAT assignments corrected via phylogenomic analysis. Number of occurrences represents the number of metagenomes this MAG was detected in.

(XLSX)

pbio.3002862.s014.xlsx (69.3KB, xlsx)
S5 Table. List of reference genomes used for construction of the phylogenomic trees.

(XLSX)

pbio.3002862.s015.xlsx (17.7KB, xlsx)
S6 Table. Metagenomes derived from potentially misidentified samples.

Metagenomes in this list had inconsistencies between their metadata and the taxonomy of the LFS as estimated from the phylogenomic tree. These metagenomes were excluded from the occurrence analysis.

(XLSX)

pbio.3002862.s016.xlsx (12.5KB, xlsx)
S7 Table. Most frequent bacterial lineages (on the family and genus level) in various groups of lichen metagenomes.

Bacterial lineages were ranked based on the total number of their occurrences. For each lichen group here we give top-ten bacterial taxa.

(XLSX)

pbio.3002862.s017.xlsx (29.3KB, xlsx)
S8 Table. Occurrence of key bacterial groups in metagenomes originating from different studies.

For 13 most common bacterial genus-level lineages and 4 bacterial families, we are showing the number of metagenomes in which they were detected in as MAGs, split by the source of the metagenomic data (the study it was reported in). The total number of metagenomes that come from each study and which were included in the Dataset 1 are shown at the bottom of the table.

(XLSX)

pbio.3002862.s018.xlsx (10.6KB, xlsx)
S9 Table. Number of MAGs from different categories in each metagenome.

The MAG categories were based on a combination of taxonomy and function. For the metagenomes that were identified as potentially derived from misidentified samples, we assigned their LFS to a separate category “LFS misassigned.”

(XLSX)

pbio.3002862.s019.xlsx (62.2KB, xlsx)
S10 Table. Bacterial families ranked on their prevalence in lichen metagenomes, as detected via screening of rRNA genes in metagenomic assemblies and unassembled reads.

Prevalence is calculated using only metagenomes from the Dataset 1 (n = 375).

(XLSX)

pbio.3002862.s020.xlsx (37.1KB, xlsx)
S11 Table. Details on additional metagenomes from NCBI’s SRA.

(XLSX)

pbio.3002862.s021.xlsx (20.5KB, xlsx)
S12 Table. Prevalence of key taxa in additionally screened metagenomes.

(XLSX)

pbio.3002862.s022.xlsx (14KB, xlsx)
S13 Table. MAGs selected for the in depth functional annotation.

To select these MAGs, we first calculated the number of occurrence per genus, and selected the 13 top genera. Next, we selected among them the MAGs with >95% completeness and <5% contamination, according to CheckM.

(XLSX)

pbio.3002862.s023.xlsx (17.2KB, xlsx)
S14 Table. Sequences used as a blast query for the screening genomes for the signs of nitrogen fixation, methanotrophy, and methylotrophy.

(XLSX)

pbio.3002862.s024.xlsx (11.8KB, xlsx)
S15 Table. Number of iron-related genes annotated with FeGenie.

(XLSX)

pbio.3002862.s025.xlsx (17.3KB, xlsx)
S16 Table. Median number of CAZymes assigned to different classes per MAG for the most frequent bacterial genera.

(XLSX)

pbio.3002862.s026.xlsx (12.6KB, xlsx)
S17 Table. Number of CAZymes assigned to different CAZy families in the 63 MAGs selected for in-depth annotation.

(XLSX)

pbio.3002862.s027.xlsx (48.1KB, xlsx)
S18 Table. Biosynthetic Gene Clusters (BGC) predicted for the selected 63 bacterial MAGs.

We used SanntiS to predict the BGCs and to annotate them by identifying the most similar BGC included in the MiBIG database. Only the hits with <0.7 Jaccard distance are listed in this table. BGCs that are similar to known clusters producing exopolysaccharides are indicated in bold.

(XLSX)

pbio.3002862.s028.xlsx (34.1KB, xlsx)
S19 Table. Presence/absence of cobalamin-dependent MetH and cobalamin-independent MetE in the high-quality algal genomes (completeness >90%, contamination <10%, according to EukCC).

(XLSX)

pbio.3002862.s029.xlsx (11.8KB, xlsx)
S20 Table. Reference genomes from NCBI used for the Hyphomicrobiales phylogenomic tree.

Taxon sampling followed Volpiano and colleagues [138]. Taxonomic classification was done using GTDB-Tk.

(XLSX)

pbio.3002862.s030.xlsx (32.5KB, xlsx)

Acknowledgments

Special thanks go to Piotr Łukasik and John McCutcheon for support to GT and SG during a visit to the University of Montana. Thanks also to Maria Chuvochina (University of Queensland) for clarifying the nomenclatural priority of Lichenihabitans.

Abbreviations

AAP

aerobic anoxygenic phototroph

ANI

average nucleotide identity

CBB

Calvin–Benson–Bassham

GH

glycoside hydrolases

GTDB

Genome Taxonomy Database

KO

KEGG ortholog

LFS

lichen fungal symbiont

MAG

metagenome-assembled genome

SSU

small subunit

Data Availability

The sequencing data in this project are submitted to ENA: de novo generated raw data (study accession PRJEB59037), metagenomic assemblies (PRJEB72384, PRJEB72386-PRJEB72404, PRJEB72498- PRJEB72501), and MAGs (PRJEB77567). Phylogenomic trees in Newick format are available at FigShare (10.6084/m9.figshare.27054937). Custom scripts used for data analysis and visualization are available on GitHub (https://github.com/Spribille-lab/2024-Microbial-occurrence-in-lichen-metagenomes) and FigShare (10.6084/m9.figshare.27054937).

Funding Statement

Support for this study was provided by funding to GT from an Alberta Graduate Excellence Scholarship and Alberta Innovates Graduate Student Scholarship. CCGA was supported by NSERC PGS-D grant NSERC PGS-D fellowship 545691-2020. GT, DDE, SG, ATC, VTN and TS were supported by Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grant RGPIN-2019-04892 to TS, and funding to TS through the Canada Research Chairs (CRC) Program. PS, ESC and RDF were supported by European Molecular Biology Laboratory (EMBL) funds. The funders were not involved in conceptualization, study design, data collection, analysis, manuscript preparation or decision to publish.

References

  • 1.Schwendener S. Die Algentypen der Flechtengonidien. Basel: Universitaetsbuchdruckerei; 1869. [Google Scholar]
  • 2.De Bary A. Die Erscheinung der Symbiose. Strassburg: Verlag Karl Trübner; 1879. [Google Scholar]
  • 3.Drew EA, Smith DC. Studies in the physiology of lichens: VIII. Movement of glucose from alga to fungus during photosynthesis in the thallus of Peltigera polydactyla. New Phytol. 1967. Jul;66(3):389–400. [Google Scholar]
  • 4.Ahmadjian V. The lichen symbiosis. New York: John Wiley & Sons; 1993. [Google Scholar]
  • 5.Gargas A, DePriest PT, Grube M, Tehler A. Multiple origins of lichen symbioses in fungi suggested by SSU rDNA phylogeny. Science. 1995. Jun 9;268(5216): 1492–1495. doi: 10.1126/science.7770775 [DOI] [PubMed] [Google Scholar]
  • 6.Nelsen MP, Lücking R, Boyce CK, Lumbsch HT, Ree RH. The macroevolutionary dynamics of symbiotic and phenotypic diversification in lichens. Proc Natl Acad Sci U S A. 2020. Sep 1;117(35):21495–21503. doi: 10.1073/pnas.2001913117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Matthes U, Feige GB. Ecophysiology of lichen symbioses. In: Lange OL, Nobel PS, Osmond CB, Ziegler H, editors. Physiological Plant Ecology III: Responses to the Chemical and Biological Environment. Berlin, Heidelberg: Springer Berlin Heidelberg; 1983. p. 423–467. [Google Scholar]
  • 8.Honegger R. Developmental biology of lichens. New Phytol. 1993. Dec;125(4):659–677. doi: 10.1111/j.1469-8137.1993.tb03916.x [DOI] [PubMed] [Google Scholar]
  • 9.Spribille T, Resl P, Stanton DE, Tagirdzhanova G. Evolutionary biology of lichen symbioses. New Phytol. 2022. Jun;234(5):1566–1582. doi: 10.1111/nph.18048 [DOI] [PubMed] [Google Scholar]
  • 10.Grube M, Berg G. Microbial consortia of bacteria and fungi with focus on the lichen symbiosis. Fungal Biol Rev. 2009. Aug 1;23(3):72–85. [Google Scholar]
  • 11.Grube M, Cardinale M, de Castro JV Jr, Müller H, Berg G. Species-specific structural and functional diversity of bacterial communities in lichen symbioses. ISME J. 2009. Sep;3(9):1105–1115. doi: 10.1038/ismej.2009.63 [DOI] [PubMed] [Google Scholar]
  • 12.Spribille T, Tuovinen V, Resl P, Vanderpool D, Wolinski H, Aime MC, et al. Basidiomycete yeasts in the cortex of ascomycete macrolichens. Science. 2016. Jul 29;353(6298):488–492. doi: 10.1126/science.aaf8287 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Grimm M, Grube M, Schiefelbein U, Zühlke D, Bernhardt J, Riedel K. The lichens’ microbiota, still a mystery?. Front Microbiol. 2021. Mar 30;12:714. doi: 10.3389/fmicb.2021.623839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tuovinen V, Ekman S, Thor G, Vanderpool D, Spribille T, Johannesson H. Two basidiomycete fungi in the cortex of wolf lichens. Curr Biol. 2019. Feb 4;29(3):476–83. doi: 10.1016/j.cub.2018.12.022 [DOI] [PubMed] [Google Scholar]
  • 15.Galun M, Bubrick P. Physiological interactions between the partners of the lichen symbiosis. In: Linskens HF, Heslop-Harrison J, editors. Cellular interactions. Berlin, Heidelberg: Springer Berlin Heidelberg; 1984. p. 362–401. [Google Scholar]
  • 16.Ahmadjian V. Trebouxia: reflections on a perplexing and controversial lichen photobiont. In: Seckbach J, editor. Symbiosis: Mechanisms and model systems. Dordrecht: Springer Netherlands; 2001. p. 373–383. [Google Scholar]
  • 17.Collins CR, Farrar JF. Structural resistances to mass transfer in the lichen Xanthoria parietina. New Phytol. 1978. Jan;81(1):71–83. [Google Scholar]
  • 18.Spribille T, Tagirdzhanova G, Goyette S, Tuovinen V, Case R, Zandberg WF. 3D biofilms: in search of the polysaccharides holding together lichen symbioses. FEMS Microbiol Lett. 2020. Mar;367(5):fnaa023. doi: 10.1093/femsle/fnaa023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Rikkinen J, Oksanen I, Lohtander K. Lichen guilds share related cyanobacterial symbionts. Science. 2002. Jul 19;297(5580):357–357. doi: 10.1126/science.1072961 [DOI] [PubMed] [Google Scholar]
  • 20.Perreau J, Moran NA. Genetic innovations in animal–microbe symbioses. Nat Rev Genet. 2022. Jan;23(1):23–39. doi: 10.1038/s41576-021-00395-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hawksworth DL. The variety of fungal-algal symbioses, their evolutionary significance, and the nature of lichens. Bot J Linn. 1988;96(1):3–20. [Google Scholar]
  • 22.Sanders WB. The disadvantages of current proposals to redefine lichens: A comment on Hawksworth & Grube (2020):‘Lichens redefined as complex ecosystems’. New Phytol. 2024. Feb;241(3):969–971. [DOI] [PubMed] [Google Scholar]
  • 23.Cengia Sambo M. Ancora della polysimbiosi nei licheni ad alghe cianoficee. 1. Batteri simbionti. Atti della Società Italiana di Scienze Naturali e del Museo Civico di Storia Naturale in Milano. 1926;64:191–195. [Google Scholar]
  • 24.Pankratov TA, Kachalkin AV, Korchikov ES. Dobrovol’skaya TG. Microbial communities of lichens Microbiology. 2017;86:293–309. [Google Scholar]
  • 25.He Z, Naganuma T. Chronicle of research into lichen-associated bacteria. Microorganisms. 2022;10:2111. doi: 10.3390/microorganisms10112111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Taş N, de Jong AE, Li Y, Trubl G, Xue Y, Dove NC. Metagenomic tools in microbial ecology research. Curr Opin Biotechnol. 2021. Feb 1;67:184–191. doi: 10.1016/j.copbio.2021.01.019 [DOI] [PubMed] [Google Scholar]
  • 27.Greshake Tzovaras B, Segers FH, Bicker A, Dal Grande F, Otte J, Anvar SY, et al. What is in Umbilicaria pustulata? A metagenomic approach to reconstruct the holo-genome of a lichen. Genome Biol Evol. 2020. Apr;12(4): 309–324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lendemer JC, Keepers KG, Tripp EA, Pogoda CS, McCain CM, Kane NC. A taxonomically broad metagenomic survey of 339 species spanning 57 families suggests cystobasidiomycete yeasts are not ubiquitous across all lichens. Am J Bot. 2019. Aug;106(8):1090–1095. doi: 10.1002/ajb2.1339 [DOI] [PubMed] [Google Scholar]
  • 29.Keepers KG, Pogoda CS, White KH, Anderson Stewart CR, Hoffman JR, Ruiz AM, et al. Whole genome shotgun sequencing detects greater lichen fungal diversity than amplicon-based methods in environmental samples. Front Ecol Evol. 2019. Dec 13;7:484. [Google Scholar]
  • 30.Wicaksono WA, Cernava T, Grube M, Berg G. Assembly of bacterial genomes from the metagenomes of three lichen species. Microbiology Resource Announcements. 2020. Sep 17;9(38):e00622–20. doi: 10.1128/MRA.00622-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tagirdzhanova G, Saary P, Tingley JP, Díaz-Escandón D, Abbott DW, Finn RD, et al. Predicted input of uncultured fungal symbionts to a lichen symbiosis from metagenome-assembled genomes. Genome Biol Evol. 2021. Apr 1;13(4): evab047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Smith HB, Dal Grande F, Muggia L, Keuler R, Divakar PK, Grewe F, et al. Metagenomic data reveal diverse fungal and algal communities associated with the lichen symbiosis. Symbiosis. 2020;82:133–147. [Google Scholar]
  • 33.McDonald TR, Mueller O, Dietrich FS, Lutzoni F. High-throughput genome sequencing of lichenizing fungi to assess gene loss in the ammonium transporter/ammonia permease gene family. BMC Genomics. 2013. Dec;14(1):225. doi: 10.1186/1471-2164-14-225 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Meiser A, Otte J, Schmitt I, Grande FD. Sequencing genomes from mixed DNA samples-evaluating the metagenome skimming approach in lichenized fungi. Sci Rep. 2017. Nov 2;7(1):14881. doi: 10.1038/s41598-017-14576-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Grewe F, Ametrano C, Widhelm TJ, Leavitt S, Distefano I, Polyiam W, et al. Using target enrichment sequencing to study the higher-level phylogeny of the largest lichen-forming fungi family: Parmeliaceae (Ascomycota). IMA Fungus. 2020. Dec;11:27. doi: 10.1186/s43008-020-00051-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Resl P, Bujold AR, Tagirdzhanova G, Meidl P, Freire Rallo S, Kono M, et al. Large differences in carbohydrate degradation and transport potential among lichen fungal symbionts. Nat Commun. 2022. May 12;13(1):2634. doi: 10.1038/s41467-022-30218-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cornet L, Magain N, Baurain D, Lutzoni F. Exploring syntenic conservation across genomes for phylogenetic studies of organisms subjected to horizontal gene transfers: A case study with Cyanobacteria and cyanolichens. Mol Phylogenet Evol. 2021. Sep 1;162:107100. doi: 10.1016/j.ympev.2021.107100 [DOI] [PubMed] [Google Scholar]
  • 38.Singh G, Dal Grande F, Schmitt I. Genome mining as a biotechnological tool for the discovery of novel biosynthetic genes in lichens. Front Fungal Biol. 2022. Oct 3;3:993171. doi: 10.3389/ffunb.2022.993171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Saary P, Kale V, Finn R. Large-scale analysis reveals the distribution of novel cellular microbes across multiple biomes and kingdoms. Research Square. doi: 10.21203/rs.3.rs-1441815/v1 [Preprint]. cited 2024 Feb 8. 2022. [DOI] [Google Scholar]
  • 40.Tagirdzhanova G, McCutcheon JP, Spribille T. Lichen fungi do not depend on the alga for ATP production: A comment on Pogoda et al. (2018). Mol Ecol. 2021. Sep;30(17):4155–4159. [DOI] [PubMed] [Google Scholar]
  • 41.Parks DH, Chuvochina M, Rinke C, Mussig AJ, Chaumeil PA, Hugenholtz P. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy. Nucleic Acids Res. 2022. Jan 7;50(D1):D785–D794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lücking R, Hodkinson BP, Leavitt SD. The 2016 classification of lichenized fungi in the Ascomycota and Basidiomycota–Approaching one thousand genera. Bryologist. 2017. Jan;119(4):361–416. [Google Scholar]
  • 43.Díaz-Escandón D, Tagirdzhanova G, Vanderpool D, Allen CC, Aptroot A, Češka O, et al. Genome-level analyses resolve an ancient lineage of symbiotic ascomycetes. Curr Biol. 2022. Dec 5;32(23):5209–5218. doi: 10.1016/j.cub.2022.11.014 [DOI] [PubMed] [Google Scholar]
  • 44.Henkel PA, Yuzhakova LA. Azotfiksiruyuschie bakterii v lishaynikah [nitrogen-fixing bacteria in lichens]. Izv Biol Inst Permsk Gos Univ. 1936;10:9–10. [Google Scholar]
  • 45.Liba CM, Ferrara FI, Manfio GP, Fantinatti-Garboggini F, Albuquerque RC, Pavan C, et al. Nitrogen-fixing chemo-organotrophic bacteria isolated from cyanobacteria-deprived lichens and their ability to solubilize phosphate and to release amino acids and phytohormones. J Appl Microbiol. 2006;101:1076–1086. doi: 10.1111/j.1365-2672.2006.03010.x [DOI] [PubMed] [Google Scholar]
  • 46.Hodkinson BP, Lutzoni F. A microbiotic survey of lichen-associated bacteria reveals a new lineage from the Rhizobiales. Symbiosis. 2009. Dec;49:163–180. [Google Scholar]
  • 47.Almendras K, García J, Carú M, Orlando J. Nitrogen-fixing bacteria associated with Peltigera cyanolichens and Cladonia chlorolichens. Molecules. 2018. Nov 25;23(12):3077. doi: 10.3390/molecules23123077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Auður Sigurbjörnsdóttir M, Andrésson ÓS, Vilhelmsson O. Nutrient scavenging activity and antagonistic factors of non-photobiont lichen-associated bacteria: a review. World J Microbiol Biotechnol. 2016. Apr;32:68. doi: 10.1007/s11274-016-2019-2 [DOI] [PubMed] [Google Scholar]
  • 49.Pattnaik S, Mohapatra B, Gupta A. Plant growth-promoting microbe mediated uptake of essential nutrients (Fe, P, K) for crop stress management: Microbe–soil–plant continuum. Front Agron. 2021. Aug 9;3:689972. [Google Scholar]
  • 50.Kramer J, Özkaya Ö, Kümmerli R. Bacterial siderophores in community and host interactions. Nat Rev Microbiol. 2020. Mar;18(3):152–63. doi: 10.1038/s41579-019-0284-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Garber AI, Nealson KH, Okamoto A, McAllister SM, Chan CS, Barco RA, et al. FeGenie: a comprehensive tool for the identification of iron genes and iron gene neighborhoods in genome and metagenome assemblies. Front Microbiol. 2020. Jan 31;11:37. doi: 10.3389/fmicb.2020.00037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Pankratov TA, Grouzdev DS, Patutina EO, Kolganova TV, Suzina NE, Berestovskaya JJ. Lichenibacterium ramalinae gen. nov, sp. nov., Lichenibacterium minor sp. nov., the first endophytic, beta-carotene producing bacterial representatives from lichen thalli and the proposal of the new family Lichenibacteriaceae within the order Rhizobiales. Antonie Van Leeuwenhoek. 2020. Apr;113:477–489. [DOI] [PubMed] [Google Scholar]
  • 53.Eymann C, Lassek C, Wegner U, Bernhardt J, Fritsch OA, Fuchs S, et al. Symbiotic interplay of fungi, algae, and bacteria within the lung lichen Lobaria pulmonaria L. Hoffm. as assessed by state-of-the-art metaproteomics. J Proteome Res. 2017. Jun 2;16(6):2160–2173. [DOI] [PubMed] [Google Scholar]
  • 54.Pankratov TA, Grouzdev DS, Patutina EO, Kolganova TV, Berestovskaya JJ, Ashikhmin AA. Lichenicoccus roseus gen. nov., sp. nov., the first bacteriochlorophyll a-containing, psychrophilic and acidophilic Acetobacteraceae bacteriobiont of lichen Cladonia species. Int J Syst Evol Microbiol. 2020. Aug;70(8):4591–4601. doi: 10.1099/ijsem.0.004318 [DOI] [PubMed] [Google Scholar]
  • 55.Yurkov V, Csotonyi JT. New light on aerobic anoxygenic phototrophs. In: Hunter CN, Daldal F, Thurnauer MC, Beatty JT, editors. The purple phototrophic bacteria. Dordrecht: Springer Netherlands; 2009. p. 31–55. [Google Scholar]
  • 56.Pankratov TA, Samylina OS, Tikhonova EN, Ianutsevich EA, Avtukh AN, Lee YM. A novel bacteriobiont of the Arctic lichen Flavocetraria nivalis, Lichenifustis flavocetrariae gen. nov, sp. nov. demonstrating hydrolytic properties and containing a full set of the Calvin–Benson–Bassham cycle genes. Arch Microbiol. 2023. Jun;205(6):232. [DOI] [PubMed] [Google Scholar]
  • 57.Assié A, Leisch N, Meier DV, Gruber-Vodicka H, Tegetmeyer HE, Meyerdierks A, et al. Horizontal acquisition of a patchwork Calvin cycle by symbiotic and free-living Campylobacterota (formerly Epsilonproteobacteria). ISME J. 2020. Jan;14(1):104–122. doi: 10.1038/s41396-019-0508-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Grube M, Cernava T, Soh J, Fuchs S, Aschenbrenner I, Lassek C, et al. Exploring functional contexts of symbiotic sustain within lichen-associated bacteria by comparative omics. ISME J. 2015. Feb;9(2):412–24. doi: 10.1038/ismej.2014.138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Erlacher A, Cernava T, Cardinale M, Soh J, Sensen CW, Grube M, et al. Rhizobiales as functional and endosymbiontic members in the lichen symbiosis of Lobaria pulmonaria L. Front Microbiol. 2015. Feb 10;6:53. doi: 10.3389/fmicb.2015.00053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Karunakaran R, Ebert K, Harvey S, Leonard ME, Ramachandran V, Poole PS. Thiamine is synthesized by a salvage pathway in Rhizobium leguminosarum bv. viciae strain 3841. J Bacteriol. 2006. Sep 15;188(18):6661–6668. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Croft MT, Lawrence AD, Raux-Deery E, Warren MJ, Smith AG. Algae acquire vitamin B12 through a symbiotic relationship with bacteria. Nature. 2005. Nov 3;438(7064):90–93. doi: 10.1038/nature04056 [DOI] [PubMed] [Google Scholar]
  • 62.Orłowska M, Steczkiewicz K, Muszewska A. Utilization of cobalamin is ubiquitous in early-branching fungal phyla. Genome Biol Evol. 2021. Apr 1;13(4):evab043. doi: 10.1093/gbe/evab043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Casano LM, del Campo EM, García-Breijo FJ, Reig-Armiñana J, Gasulla F, Del Hoyo A, et al. Two Trebouxia algae with different physiological performances are ever-present in lichen thalli of Ramalina farinacea. Coexistence versus competition?. Environ Microbiol. 2011. Mar;13(3):806–818. [DOI] [PubMed] [Google Scholar]
  • 64.Onuț-Brännström I, Benjamin M, Scofield DG, Heiðmarsson S, Andersson MG, Lindström ES, et al. Sharing of photobionts in sympatric populations of Thamnolia and Cetraria lichens: evidence from high-throughput sequencing. Sci Rep. 2018. Mar 13;8(1):4406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Čertnerová D. Nuclei isolation protocols for flow cytometry allowing nuclear DNA content estimation in problematic microalgal groups. J Appl Phycol. 2021. Aug;33(4):2057–2067. [Google Scholar]
  • 66.Kapraun DF. Nuclear DNA content estimates in green algal lineages: Chlorophyta and Streptophyta. Ann Bot. 2007. Apr 1;99(4):677–701. doi: 10.1093/aob/mcl294 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Lopez-Bautista JM. Molecular systematics of the green algal order Trentepohliales (Chlorophyta). Ph.D. Thesis, Louisiana State University and Agricultural & Mechanical College. 2000. Available from: https://repository.lsu.edu/cgi/viewcontent.cgi?article=8208&context=gradschool_disstheses.
  • 68.López-Bautista JM, Kapraun DF, Chapman RL. Nuclear DNA content estimates in the Trentepohliales (Chlorophyta): phylogenetic considerations. Eur J Phycol. 2006;33:259–273. [Google Scholar]
  • 69.Muggia L, Vancurova L, Škaloud P, Peksa O, Wedin M, Grube M. The symbiotic playground of lichen thalli–a highly flexible photobiont association in rock-inhabiting lichens. FEMS Microbiol Ecol. 2013. Aug 1;85(2):313–323. doi: 10.1111/1574-6941.12120 [DOI] [PubMed] [Google Scholar]
  • 70.Tagirdzhanova G, Scharnagl K, Yan X, Talbot NJ. Genomic analysis of Coccomyxa viridis, a common low-abundance alga associated with lichen symbioses. Sci Rep. 2023. Dec 2;13(1):21285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Hodkinson BP, Gottel NR, Schadt CW, Lutzoni F. Photoautotrophic symbiont and geography are major factors affecting highly structured and diverse bacterial communities in the lichen microbiome. Environ Microbiol. 2012. Jan;14(1):147–161. doi: 10.1111/j.1462-2920.2011.02560.x [DOI] [PubMed] [Google Scholar]
  • 72.Bates ST, Cropsey GW, Caporaso JG, Knight R, Fierer N. Bacterial communities associated with the lichen symbiosis. Appl Environ Microbiol. 2011. Feb 15;77(4):1309–1314. doi: 10.1128/AEM.02257-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Park CH, Kim KM, Kim O-S, Jeong G, Hong SG. Bacterial communities in Antarctic lichens. Antarct Sci. 2016;28:455–461. [Google Scholar]
  • 74.Sierra MA, Danko DC, Sandoval TA, Pishchany G, Moncada B, Kolter R, et al. The microbiomes of seven lichen genera reveal host specificity, a reduced core community and potential as source of antimicrobials. Front Microbiol. 2020;11:398. doi: 10.3389/fmicb.2020.00398 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Noh HJ, Baek K, Hwang CY, Shin SC, Hong SG, Lee YM. Lichenihabitans psoromatis gen. nov., sp. nov., a member of a novel lineage (Lichenihabitantaceae fam. nov.) within the order of Rhizobiales isolated from Antarctic lichen. Int J Syst Evol Microbiol. 2019. Dec;69(12):3837–3842. [DOI] [PubMed] [Google Scholar]
  • 76.Noh HJ, Park Y, Hong SG, Lee YM. Diversity and physiological characteristics of Antarctic lichens-associated bacteria. Microorganisms. 2021. Mar 15;9(3):607. doi: 10.3390/microorganisms9030607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Oren A, Garrity GM. List of new names and new combinations previously effectively, but not validly, published. Int J Syst Evol Microbiol. 2019. Jan;69(1):5–9. doi: 10.1099/ijsem.0.003174 [DOI] [PubMed] [Google Scholar]
  • 78.Noh HJ, Shin SC, Park Y, Choi A, Baek K, Hong SG, et al. Lichenicola cladoniae gen. nov., sp. nov., a member of the family Acetobacteraceae isolated from an Antarctic lichen. Int J Syst Evol Microbiol. 2020. Nov;70(11):5918–5925. [DOI] [PubMed] [Google Scholar]
  • 79.Leiva D, Fernández-Mendoza F, Acevedo J, Carú M, Grube M, Orlando J. The bacterial community of the foliose macro-lichen Peltigera frigida is more than a mere extension of the microbiota of the subjacent substrate. Microb Ecol. 2021. May;81:965–976. [DOI] [PubMed] [Google Scholar]
  • 80.Pankratov TA, Nikitin PA, Patutina EO. Genome analysis of two lichen bacteriobionts, Lichenibacterium ramalinae and Lichenibacterium minor: Toxin–antitoxin systems and secretion proteins. Microbiology. 2022. Apr;91(2):160–172. [Google Scholar]
  • 81.Hale ME Jr. Vitamin requirements of three lichen fungi. Bull Torrey Bot Club. 1958. May;85:182–187. [Google Scholar]
  • 82.Richardson DH, Smith DC. Lichen physiology X. The isolated algal and fungal symbionts of Xanthoria aureola. New Phytol. 1968. Jan;67(1):69–77. [Google Scholar]
  • 83.Bednar TW, Holm-Hansen OS. Biotin liberation by the lichen alga Coccomyxa sp. and by Chlorella pyrenoidosa. Plant Cell Physiol. 1964. Sep 1;5(3):297–303. [Google Scholar]
  • 84.Bednar TW. Physiological studies on the isolated components of the lichen Peltigera aphthosa. Ph. D. dissertation, University of Wisconsin. 1963. [Google Scholar]
  • 85.Shapiro MM, Chakravartty V, Cronan JE. Remarkable diversity in the enzymes catalyzing the last step in synthesis of the pimelate moiety of biotin. PLoS ONE. 2012. Nov 9;7(11):e49440. doi: 10.1371/journal.pone.0049440 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Hu Y, Cronan JE. α-proteobacteria synthesize biotin precursor pimeloyl-ACP using BioZ 3-ketoacyl-ACP synthase and lysine catabolism. Nat Commun. 2020. Nov 5;11(1):5598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Iskina RY. K voprosu ob azotfiksiruyushchikh bakteriyakh v lishaynikakh (On nitrogen-fixing bacteria in lichens) Izvestiya Permskogo biologicheskogo nauchno-issledivaltel’skogo instituta. 1938;11:134–138. [Google Scholar]
  • 88.Morris BE, Henneberger R, Huber H, Moissl-Eichinger C. Microbial syntrophy: interaction for the common good. FEMS Microbiol Rev. 2013. May 1;37(3):384–406. doi: 10.1111/1574-6976.12019 [DOI] [PubMed] [Google Scholar]
  • 89.Jacobs JB, Ahmadjian V. The ultrastructure of lichens. II. Cladonia cristatella: the lichen and its isolated symbionts. J Phycol. 1971. Mar;7(1):71–81. [Google Scholar]
  • 90.Zook D. A study of the role of bacteria in lichens. Master’s Thesis, Clark University. 1983.
  • 91.Salter SJ, Cox MJ, Turek EM, Calus ST, Cookson WO, Moffatt MF, et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014. Dec;12:1–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.González I, Ayuso-Sacido A, Anderson A, Genilloud O. Actinomycetes isolated from lichens: evaluation of their diversity and detection of biosynthetic gene sequences. FEMS Microbiol Ecol. 2005. Nov 1;54(3):401–15. doi: 10.1016/j.femsec.2005.05.004 [DOI] [PubMed] [Google Scholar]
  • 93.Vobis G, Solans M, Scervino JM, Schumann P, Spröer C, Messuti MI. Isolation and characterization of an endolichenic actinobacterium from the lichen thallus of Pseudocyphellaria berberina. Symbiosis. 2020. Jan;80(1):43–51. [Google Scholar]
  • 94.Schulz B, Wanke U, Draeger S, Aust HJ. Endophytes from herbaceous plants and shrubs: effectiveness of surface sterilization methods. Mycol Res. 1993. Dec 1;97(12):1447–1450. [Google Scholar]
  • 95.Arnold AE, Miadlikowska J, Higgins KL, Sarvate SD, Gugger P, Way A, et al. A phylogenetic estimation of trophic transition networks for ascomycetous fungi: are lichens cradles of symbiotrophic fungal diversification?. Syst Biol. 2009. Jun 1;58(3):283–297. doi: 10.1093/sysbio/syp001 [DOI] [PubMed] [Google Scholar]
  • 96.Veselá V, Malavasi V, Škaloud P. A synopsis of green-algal lichen symbionts with an emphasis on their free-living lifestyle. Phycologia. 2024. May 3;63(3):317–38. [Google Scholar]
  • 97.Doolittle WF, Inkpen SA. Processes and patterns of interaction as units of selection: An introduction to ITSNTS thinking. Proc Natl Acad Sci U S A. 2018. Apr 17;115(16):4006–14. doi: 10.1073/pnas.1722232115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Weilguny L, De Maio N, Munro R, Manser C, Birney E, Loose M, et al. Dynamic, adaptive sampling during nanopore sequencing using Bayesian experimental design. Nat Biotechnol. 2023. Jul;41(7):1018–25. doi: 10.1038/s41587-022-01580-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Pierce NT, Irber L, Reiter T, Brooks P, Brown CT. Large-scale sequence comparisons with sourmash. F1000Res. 2019;8:1006. doi: 10.12688/f1000research.19675.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018. Sep 1;34(17):i884–i890. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Uritskiy GV, DiRuggiero J, Taylor J. MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis. Microbiome. 2018. Dec;6(1):1–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017. May 1;27(5):824–834. doi: 10.1101/gr.213959.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Alneberg J, Bjarnason BS, De Bruijn I, Schirmer M, Quick J, Ijaz UZ, et al. Binning metagenomic contigs by coverage and composition. Nat Methods. 2014. Nov;11(11):1144–1146. doi: 10.1038/nmeth.3103 [DOI] [PubMed] [Google Scholar]
  • 104.Kang DD, Li F, Kirton E, Thomas A, Egan R, An H, et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ. 2019. Jul 26;7:e7359. doi: 10.7717/peerj.7359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015. Jul 1;25(7):1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Saheb Kashaf S, Almeida A, Segre JA, Finn RD. Recovering prokaryotic genomes from host-associated, short-read shotgun metagenomic sequencing data. Nat Protoc. 2021. May;16(5):2520–2541. doi: 10.1038/s41596-021-00508-2 [DOI] [PubMed] [Google Scholar]
  • 107.Olm MR, Brown CT, Brooks B, Banfield JF. dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication. ISME J. 2017. Dec;11(12):2864–2868. doi: 10.1038/ismej.2017.126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2020;36(6):1925–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Nguyen LT, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015. Jan 1;32(1):268–274. doi: 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Saary P, Mitchell AL, Finn RD. Estimating the quality of eukaryotic genomes recovered from metagenomic analysis with EukCC. Genome Biol. 2020. Dec;21(1):244. doi: 10.1186/s13059-020-02155-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness. In: Kollmar M, editor. Gene prediction: methods and protocols New York NY: Springer New York. 2019. p. 227–245. [DOI] [PubMed] [Google Scholar]
  • 112.Von Meijenfeldt FB, Arkhipova K, Cambuy DD, Coutinho FH, Dutilh BE. Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT. Genome Biol. 2019. Dec;20:217. doi: 10.1186/s13059-019-1817-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Zhang C, Rabiee M, Sayyari E, Mirarab S. ASTRAL-III: polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinformatics. 2018. May;19(6):15–30. doi: 10.1186/s12859-018-2129-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009. Jul 15;25(14):1754–1760. doi: 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics. 2009. Aug 15;25(16):2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Hedlund T. Kritische Bemerkungen über einige Arten der Flechtengattungen Lecanora, Lecidea und Micarea. Bih. K. Svenska Vet.-Akad. Handl. 18. 1892. III: 1–104. [Google Scholar]
  • 117.Spribille T, Bjork CR. New records and range extensions in the North American lignicolous lichen flora. Mycotaxon. 2008. Jul 1;105:455. [Google Scholar]
  • 118.Gruber-Vodicka HR, Seah BK, Pruesse E. phyloFlash: rapid small-subunit rRNA profiling and targeted assembly from metagenomes. Msystems. 2020. Oct 27;5(5). doi: 10.1128/mSystems.00920-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Bengtsson-Palme J, Hartmann M, Eriksson KM, Pal C, Thorell K, Larsson DG, et al. METAXA2: improved identification and taxonomic classification of small and large subunit rRNA in metagenomic data. Mol Ecol Resour. 2015;15(6):1403–1414. doi: 10.1111/1755-0998.12399 [DOI] [PubMed] [Google Scholar]
  • 120.Murali A, Bhargava A, Wright ES. IDTAXA: a novel approach for accurate taxonomic classification of microbiome sequences. Microbiome. 2018. Dec;6(1):140. doi: 10.1186/s40168-018-0521-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014. Jul 15;30(14):2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 122.Mitchell AL, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G, et al. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 2020. Jan 8;48(D1):D570–D578. doi: 10.1093/nar/gkz1035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Steinegger M, Söding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol. 2017. Nov;35(11):1026–1028. doi: 10.1038/nbt.3988 [DOI] [PubMed] [Google Scholar]
  • 124.Kanehisa M, Goto S, Kawashima S, Nakaya A. The KEGG databases at GenomeNet. Nucleic Acids Res. 2002. Jan 1;30(1):42–46. doi: 10.1093/nar/30.1.42 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Aramaki T, Blanc-Mathieu R, Endo H, Ohkubo K, Kanehisa M, Goto S, et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics. 2020. Apr 1;36(7):2251–2252. doi: 10.1093/bioinformatics/btz859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Sato N, Uematsu M, Fujimoto K, Uematsu S, Imoto S. ggkegg: analysis and visualization of KEGG data utilizing the grammar of graphics. Bioinformatics. 2023. Oct 1;39(10):btad622. doi: 10.1093/bioinformatics/btad622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Gerbling KP, Steup M, Latzko E. Fructose 1, 6-bisphosphatase form B from Synechococcus leopoliensis hydrolyzes both fructose and sedoheptulose bisphosphate. Plant Physiol. 1986. Mar 1;80(3):716–20. doi: 10.1104/pp.80.3.716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Sanchez S, Rogers JD, Rogers AB, Nassar M, McEntyre J, Welch M, et al. Expansion of novel biosynthetic gene clusters from diverse environments using SanntiS. BioRxiv [Preprint] bioRxiv. 2023. 540769 [posted 2023 May 24; revised 2023 October 12; cited 2024 February 8]. Available from: 10.1101/2023.05.23.540769. [DOI] [Google Scholar]
  • 129.Blin K, Shaw S, Kloosterman AM, Charlop-Powers Z, Van Wezel GP, Medema MH, et al. antiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 2021. Jul 2;49(W1):W29–W35. doi: 10.1093/nar/gkab335 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990. Oct 5;215(3):403–10. doi: 10.1016/S0022-2836(05)80360-2 [DOI] [PubMed] [Google Scholar]
  • 131.Palmer JM, Stajich J, Funannotate v1. 8.1: Eukaryotic genome annotation. Zenodo. 2020:4054262. [Google Scholar]
  • 132.Lomsadze A, Ter-Hovhannisyan V, Chernoff YO, Borodovsky M. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005. Jan 1;33(20):6494–506. doi: 10.1093/nar/gki937 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics-Oxford. 2003. Sep 27;19(2):215–25. doi: 10.1093/bioinformatics/btg1080 [DOI] [PubMed] [Google Scholar]
  • 134.Testa AC, Hane JK, Ellwood SR, Oliver RP. CodingQuarry: highly accurate hidden Markov model gene prediction in fungal genomes using RNA-seq transcripts. BMC Genomics. 2015. Dec;16:1–2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Majoros WH, Pertea M, Salzberg SL. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics. 2004. Nov 1;20(16):2878–9. doi: 10.1093/bioinformatics/bth315 [DOI] [PubMed] [Google Scholar]
  • 136.Korf I. Gene finding in novel genomes. BMC Bioinformatics. 2004. Dec;5:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar GA, et al. InterPro in 2022. Nucleic Acids Res. 2023. Jan 6;51(D1):D418–27. doi: 10.1093/nar/gkac993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Volpiano CG, Sant’Anna FH, Ambrosini A, de São José JF, Beneduzi A, Whitman WB, et al. Genomic metrics applied to Rhizobiales (Hyphomicrobiales): species reclassification, identification of unauthentic genomes and false type strains. Front Microbiol. 2021. Mar 25;12:614957. doi: 10.3389/fmicb.2021.614957 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Winter DJ. rentrez: an R package for the NCBI eUtils API. R J. 2017;9:520–526. [Google Scholar]
  • 140.R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2015. [Google Scholar]
  • 141.Wickham H, Francois R. dplyr: A Grammar of Data Manipulation; 2016. Available from: https://CRAN.R-project.org/package=dplyr. [Google Scholar]
  • 142.Wickham H, Girlich M. tidyr: Tidy Messy Data. R package version 1.2.0; 2016. Available from: https://CRAN.R-project.org/package=tidyr. [Google Scholar]
  • 143.Wickham H, Seidel D. scales: Scale Functions for Visualization. R package version 1.1.1; 2020. Available from: https://CRAN.R-project.org/package=scales. [Google Scholar]
  • 144.Wickham H. ggplot2: elegant graphics for data analysis. Springer; 2016. [Google Scholar]
  • 145.Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019. Feb 1;35(3):526–528. doi: 10.1093/bioinformatics/bty633 [DOI] [PubMed] [Google Scholar]
  • 146.Schliep KP. phangorn: phylogenetic analysis in R. Bioinformatics. 2011. Feb 15;27(4):592–593. doi: 10.1093/bioinformatics/btq706 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Revell LJ. phytools: an R package for phylogenetic comparative biology (and other things). Methods Ecol Evol. 2012. Apr(2):217–223. [Google Scholar]
  • 148.Gu Z, Gu L, Eils R, Schlesner M, Brors B. Circlize implements and enhances circular visualization in R. Bioinformatics. 2014;30:2811–2812. doi: 10.1093/bioinformatics/btu393 [DOI] [PubMed] [Google Scholar]
  • 149.Epskamp S, Cramer AO, Waldorp LJ, Schmittmann VD, Borsboom D. qgraph: Network visualizations of relationships in psychometric data. J Stat Softw. 2012. May 24;48:1–8. [Google Scholar]
  • 150.Wang LG, Lam TT, Xu S, Dai Z, Zhou L, Feng T, et al. Treeio: an R package for phylogenetic tree input and output with richly annotated and associated data. Mol Biol Evol. 2020. Feb 1;37(2):599–603. doi: 10.1093/molbev/msz240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 151.Wright ES. Using DECIPHER v2. 0 to analyze big biological sequence data in R. R J. 2016. Aug 1;8(1):352. [Google Scholar]
  • 152.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019. Jul 2;47(W1):W256–W259. doi: 10.1093/nar/gkz239 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Roland G Roberts

28 Feb 2024

Dear Dr Spribille,

Thank you for submitting your manuscript entitled "Microbial occurrence and symbiont detection in lichen metagenomes" for consideration as a Research Article by PLOS Biology.

Your manuscript has now been evaluated by the PLOS Biology editorial staff, as well as by an academic editor with relevant expertise, and I'm writing to let you know that we would like to consider your paper further.

IMPORTANT: We're still deciding to what extent it needs further peer review; sorry not to be able to clarify that yet. Please ignore anything in the following letter template that relates to peer review, as the jury is still out...

However, before we can consider your manuscript further, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. After your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by Mar 01 2024 11:59PM.

If your manuscript has been previously peer-reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Roli Roberts

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Decision Letter 1

Roland G Roberts

15 May 2024

Dear Toby,

Thank you for your patience while your manuscript "Microbial occurrence and symbiont detection in lichen metagenomes" was peer-reviewed by PLOS Biology. Your work was assessed and discussed by the PLOS Biology editorial team, an Academic Editor with relevant expertise, and by three independent reviewers. As previously discussed, as part of our "portable peer review" initiative, all three reviewers were privy to the prior reviews from Nature and Cell Genomics. Based on the reviews, which you will find at the end of this email, I regret that we will not be pursuing your manuscript for publication in the journal.

You'll see that reviewer #1 is positive, saying that the paper is now more nuanced and re-framed in a more cautionary manner. Their requests are textual and fairly minor. Reviewer #2 starts positive but thinks that you are too dismissive of the possibility that contamination could have occurred during DNA isolation, and worries about your data being uploaded to GenBank without caveats; s/he also says that you missed an opportunity by not sequencing the substrate from which the lichens were obtained. Reviewer #3 thinks that the you still imply symbiosis too readily and is concerned that you propose to write a perspective piece about the topic. S/he recommends further re-framing the manuscript around issues that affect MAGs, rather than relationships between organisms; the rest of this review is a series of points about how your current manuscript is a halfway house, and how to make it more about the pros/cons of MAG approaches. I should say that two of the reviewers explicitly recommended that we reject the paper.

We solicited cross-comments from the reviewers, and discussed these and the reviews with the Academic Editor. Unfortunately the AE was not supportive of considering your study further at PLOS Biology, even in a much-caveated and tempered form.

I'm sorry that we cannot be more positive on this occasion, especially after this lengthy process, and hope the reviewer reports will help you in preparing your manuscript for submission elsewhere. We would be happy to transfer the peer reviews, along with the names of the reviewers (where they agree), to another journal of your choice. To start this process, we need an explicit request by the corresponding author.

I hope you understand the reasons for this decision and will consider PLOS Biology for other submissions in the future. Thank you for your support of PLOS and of Open Access publishing.

Sincerely,

Roli

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

------------------------------------------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

Tagirdzhanova et al.

Microbial occurrence and symbiont detection in lichen metagenomes

This manuscript is a resubmission of a manuscript previously reviewed at Nature and Cell Genomics. I was a previous reviewer and have read the extensively revised manuscript and the complex set of reviews and author responses from the earlier submissions. The authors have made a major effort to address my previous comments and, I believe, those of the other reviewers. I recommend that the paper be accepted with minor revision.

Specific comments:

-Most generally, I want to highlight what I see as the most important element of the revised manuscript. It has a revised thrust and presents an important set of take-home messages and cautionary tales for lichen researchers, and metagenomicists in general. I won't rehash them here, but basically they are as follows: given what we know about the biology of lichens, there are things we can trust and things we cannot trust about MAG data. We now know enough to know that there are a variety of factors contribute to the extent to which key players in lichens are recovered in MAGs. Sometimes they are not recovered at all, despite knowing from other lines of inquiry that they are there (e.g., basidiomycetes). This is sobering and important for the community to take on board.

-Second, my opinion is that the authors have taken a suitably nuanced perspective of the concept of symbiosis and acknowledge the myriad uncertainties. The Discussion section on page 25-26 stands out in this regard, as does the "Limits to inference" section that immediately follows, which brings the question of 'surface sterilization' out into the open. The authors seem to be asking the question: do lichens [whatever that means] have a microbiome? I think this is an important question to ask and debate amongst researchers in the field. Side-stepping use of the term 'core' doesn't make the underlying issues go away.

-The introduction is truly excellent - it provides an authoritative overview of the history and current state of affairs in modern lichen research. It reads a bit like a thesis intro (maybe it is); certainly the extensive referencing (well over 100 citations) is a strength of the paper, to the extent that it aligns with the constraints of PLOS Biology.

-The 24 newly generated metagenomes are basically an afterthought in the current paper. Yes, they are embedded within a much larger study and analytical / interpretive framework, but readers will be interested in where they came from and how they were generated (provided on page 19, the very last section of the Results). Space permitting, draw more attention to it in the abstract (line 31) and consider moving this further up the results. I see why it is placed where it is - but maybe a nod to these data and their significance earlier would be helpful too.

-I am not sure why the authors chose to embed the figure legends but not the figures into the main text. This was not helpful.

-Line 127 - "We began by removing identical metagenomes that have been published twice in the sequence databases (n=43; S3 Table)". Meaning unclear based on main text alone. My brain instantly asked: do they mean literally the same dataset or independently sequenced metagenomes from the same location / environment by different labs? The former makes sense to remove, the latter would seem to be a useful control for experimental and bioinformatic procedures. This becomes clear from the supp info line 1381/82, i.e., they are literally the same data submitted twice to GenBank with different accession numbers, which of course is unfortunate. I suggest the authors make this clear in the main text as well.

-For much of the intro, the LFS acronym is used in the singular (e.g., "the lichen fungal symbiont (LFS),..."). This of course becomes clear that they are indeed exploring and discussing the presence of basidiomycetes as well. I suggest working in acknowledgement of the presence of multiple types of fungi earlier on. Why? Even non-specialists / lichen enthusiasts will be aware of the high-profile Science paper describing multiple fungal partners. The way the intro reads at first it almost sounds as though the authors are back-tracking on that observation, which they are not.

Reviewer #2:

This manuscript presents an extensive analysis of lichen metagenomes aimed at elucidating the microbial composition and detecting symbiotic relationships within lichen ecosystems. The authors have reassembled and analyzed over 400 publicly available metagenomes, generating metagenome-assembled genomes (MAGs), constructing phylogenomic trees, and mapping MAG occurrence. The study provides valuable insights into the prevalence and diversity of bacterial, algal, and fungal constituents across a wide array of lichen species.

Before the submission to PLoS Biology, the manuscript was submitted to the journals Nature and Cell. Reviews from these earlier submissions were attached to this submission.

The authors of the manuscript have made a significant change in the latest version by reducing the emphasis on describing a "core" and "global" set of microbial lichen symbionts. This change aligns with the recommendation of many reviewers. Although the authors' responses to the reviews suggest that they are not entirely convinced that their data does not allow such an evaluation, the changes are still appreciated. The new submission compares various methods to extract lichen metagenomes and presents the results more holistically by focusing on the total pool of genomes recoverable from the lichen data.

Reviewers have pointed out that there is contamination in both the public dataset and the data produced by the authors. I disagree with the dismissive response from the authors. The contamination could have occurred during DNA isolation, before the barcoding. Although the authors provide an unlikely biological explanation, I am surprised they decided to ignore this contamination. I assume they still plan to upload this data without commenting on it to Genbank, which could cause problems for colleagues who want to work with the data. Furthermore, I am concerned that these samples may be representative of other inconsistencies in their data.

Additionally, I believe there was a missed opportunity with the data they produced themselves. They could have addressed many of the criticisms raised by the reviewers by conducting further testing. For instance, they could have sequenced the substrate in addition to the lichen symbiosis to determine how many of the bacteria identified by sequencing were associated with the lichen and how many were sequenced by chance. Furthermore, they could have treated the same sample differently to test if they were correct in their speculation that irrespective of the treatment, the sequencing results would have remained the same.

The manuscript uses a metagenomic methodology to assess lichen symbioses, which is a valuable addition to the field. However, I have concerns regarding the inconsistencies in the presented data and the missed opportunities for further investigation. These factors cast doubt on the manuscript's suitability for publication in PLoS Biology.

Reviewer #3:

Dear Authors,

I have carefully reviewed the manuscript and the comments of other referees. Although I admire the effort behind putting together a manuscript, I have several major concerns, which prevent me from recommending its publication in its current form. Some of my major concerns align with those of the referees who reviewed the manuscript before.

Firstly, while the manuscript has shown improvement in certain areas, such as refraining from referring to the identified bacteria as core, the central narrative remains somewhat vague. Throughout the manuscript, you imply that bacteria are symbionts, yet refrain from stating this directly due to the lack of sufficient evidence to support the claim, as advised by most referees.

Specifically, while the referees have expressed concerns about the adequacy of evidence supporting this classification, you argue that the presence of bacteria, including on the unsterilised surface justifies them to be part of the biofilm and you also propose that bacterial gene pathways are beneficial and functionally relevant for fungi based on their mere presence in bacteria. However, the sole presence of microorganisms on an unsterilized surface does not necessarily imply a symbiotic relationship. Do we consider viruses and contaminants from the environment, such as dust and sand particles, as a part of biofilm and tag them as symbionts? No. Then why bacteria. Simply because bacteria have the pathways that could potentially be beneficial for fungus does not mean the identified bacteria are symbiont. Do closely related non-symbiont bacteria lack these pathways?

Until sufficient evidence is generated, I would suggest to refrain from making such claims.

Furthermore, your reliance on publishing a perspective to support your argument may indicate a lack of openness to alternative viewpoints, which undermines the purpose of peer review. You have faint to no support for their idea of bacteria being symbionts but you argue that you will publish a perspective on this. So you will increase the support for your claim by publishing opinions and perspectives. We should be careful about supporting such approach that an idea/perspective is suggested as a hypothesis but becomes a "suggestion" and eventually a fact. A perspective therefore would not give more support to bacteria being symbionts.

Moreover, I suggest refraining from name-calling, such as "Duke school," and instead showing respect for alternative hypotheses. I understand the frustration given the work put into a manuscript. However, given the insufficient evidence to support a strong symbiotic relationship, it's important to acknowledge and address the specific concerns raised by the referees.

As the central idea of the manuscript is not supported by the evidence, I would recommend major restructuring.

Best regards.

Major concerns:

1) The manuscript introduction appears to lack a clear direction, with an imbalance evident in all the sections. For instance, given the title of the manuscript "Microbial occurrence and symbiont detection in lichen metagenomes" and the main knowledge gap being "overall content and limitations of these metagenomes have not been assessed," I would expect the introduction to center around issues of MAGs and risks of bacterial read contamination. Instead, there is an undue emphasis on explaining the relationship between fungi and algae. This suggests that the manuscript may have undergone several rewrites, leading to a loss of focus and coherence in conveying the central story.

2) My second major concern aligns with one of the reviewers, regarding the use of MAGs to identify lichen-associated bacteria. This also explains the use of convoluted terminology between the abstract and introduction to state the main aim of the paper. While the manuscript progresses, the authors shift from "overall content and limitations of MAGs have not been assessed" to implying consistently that bacteria are symbionts but without ever stating it directly. They keep referring to the studies that investigated the possibility that bacteria are symbionts.

3) Between the two possible directions of the manuscript (bacteria in MAG or bacteria as lichen symbionts), based on the data, I would recommend opting for the first one as hinted in the abstract. For the second option, 1) the data is not apt, and 2) even if the presence is established, there is not enough or any evidence for stating that bacteria are symbionts (see detailed comments below).

4) My fourth but actually one of the biggest concerns is that despite various reviewers in two rounds of reviews, the authors are not keeping an open mind to the fact that several unrelated reviewers, for one reason or another, but all very strong, advise the authors that they do not have evidence to even suggest that bacteria are symbiotic to lichen associations or lichens are composed of fungus, algae, and bacteria. It seems that the authors got fixed on the idea of proving bacteria are symbiotic in lichens, and they are looking at everything through that lens, completely ignoring the concerns of all the reviewers.

5) Misleading idea. Starting with bacteria in MAGs to slowly transitioning to bacteria are symbionts is conveying that authors want to make a claim without evidence but being stopped by the reviewers' refrain to state it directly but keep implying it through the manuscript. I would strongly recommend against it. Please define an aim, state a knowledge gap, address it, and REFRAIN not only from stating but also implying the conclusions not supported by evidence.

Specific comments

1) The manuscript introduction appears to be lacking a clear direction, with an imbalance evident in all the sections.

For instance given the title of the manuscript "Microbial occurrence and symbiont detection in lichen metagenomes", and the main knowledge gap being "overall content and limitations of these metagenomes have not been assessed" I would expect the introduction centers around issues of MAGs and risks of bacterial read contamination.

Instead, there is an undue emphasis on explaining the relationship between fungi and algae. This suggests that the manuscript may have undergone several rewrites, leading to a loss of focus and coherence in conveying the central story.

As the introduction flows the primary focus of the paper, i.e., bacteria in MAG (if we read the title and abstract), shifts to bacteria as lichen symbionts. See below

Abstract: In lichen research, metagenomes are increasingly being used for evaluating symbiont composition and metabolic potential, but the overall content and limitations of these metagenomes have not been assessed. We reassembled over 400 publicly available metagenomes, generated metagenome-assembled genomes (MAGs), constructed phylogenomic trees and mapped MAG occurrence and frequency across the dataset.

Introduction: (lines 86-89) However, though hundreds of bacterial strains have been detected, overall few lichen symbioses have been censused for their bacterial composition, with most work focused on the model symbiosis involving the ascomycete fungus Lobaria pulmonaria (reviewed by Grimm et al. [13]).

Discussion:

lines 446-450

Our analyses of lichen metagenomes yielded two contrasting sets of results. On the one hand, our data provide evidence that bacteria that have until now been studied as potential lichen symbionts in only a couple dozen lichens in fact occur in or on hundreds of lichens, and by extracting over 600 MAGs we increase the amount of available genomic data by orders of magnitude.

Results:

If the paper is about overall content and limitations of these MAGs

What is the logic of analyzing the bacterial pathways and genes and their functional categories? In that case authors analyze the pathways present in bacteria from different labs when the samples were prepared

If the paper analyzes the possibility of bacteria being symbionts

1) The dataset is not suitable. In short most lichen genomes are generated to obtain the fungus and hence may not implement appropriate sample preparation protocol and environment.

2) As the data is generated by several different labs, the amount of bacteria in the samples is uncomparable.

3) Even if we ignore these points and consider the dataset is suitable for the study, the study only shows bacteria are present in the thalli but if the pathways that bacteria contain benefit the fungus is anything but pure supposition, especially because some of these bacteria may also be free living or associated with other organisms. In that case one must prove that these pathways exist entirely in bacteria symbiotic with lichens to say that these pathways are relevant for fungi and hence bacteria are symbionts.

I would suggest to define the aim of the manuscript clearly it can either be :

Bacterial contamination in lichenized fungal MAGs; As evident from the abstract lines 28-32

In lichen research, metagenomes are increasingly being used for evaluating symbiont composition and metabolic potential, but the overall content and limitations of these metagenomes have not been assessed. We reassembled over 400 publicly available metagenomes, generated metagenome-assembled genomes (MAGs), constructed phylogenomic trees and mapped MAG occurrence and frequency across the dataset."

Or

Screening lichens for the bacterial composition/symbionts; As implied in the introduction lines 86-89

However, though hundreds of bacterial strains have been detected, overall few lichen symbioses have been censused for their bacterial composition, with most work focused on the model symbiosis involving the ascomycete fungus Lobaria pulmonaria (reviewed by Grimm et al. [13]).

But not implying bacteria as symbionts in either of the above

Introduction contains details and knowledge gaps which have nothing to do with this title or abstract of the article

for instance:

73-76 Although much about the biology of the disparate photobiont lineages remains unclear, especially with respect to what they receive from the fungus, their population structuring vis-a-vis their LFS partners is consistent with the definition of an open symbiosis [20].

Another example: No consensus exists however on how many other symbionts might be involved, and since most other associated microbes lack visible chlorophyll, their detection is not trivial. Researchers as early as Maria Cengia Sambo in the 1920s cultured additional bacteria from lichens, leading to the speculation that lichens could be 'polysymbioses' of more than two partners.

This is an example of slowly shifting focus from bacteria in MAG (implying that bacteria are symbionts?)

The major section of results deals with the functional profile of the detected bacteria (line 311 Functional profiles of the most commonly detected bacteria, line 342-367 Carbon usage, 368-377 Aerobic an oxygenic phototrophy, 378-388 Nitrogen fixation, 389-410 Cofactor and iron scavenging, 411-426 Carbohydrate degradation). But unless there is a proof that bacteria are symbiotic to lichens we are reading about which pathways do bacteria detected on surface of lichens have.

Discussion

Major part of discussion (lines 446-525) is focussed on proving why bacteria and algae may go undetected in lichens to support that bacteria are common occurrence (=symbiotic according to authors). They refer to studies which have and have not found Cystobasidiomycetes, photobionts etc. in lichens before. Regarding this, first this is of methodological relevance and does not require a major section in discussion and second still it only shows that bacteria are present.

Decision Letter 2

Roland G Roberts

6 Jun 2024

Dear Toby,

Many thanks for your patience while we considered your Appeal of our previous decision on your manuscript "Microbial occurrence and symbiont detection in lichen metagenomes." As mentioned in my previous email, after discussion among the team and with the Academic Editor, we think that despite the concerns raised, there is significant value in your study, and have decided to invite you to revise your manuscript, taking measures to allay the reviewers' substantial and understandable concerns.

As previously discussed, reviewer #1 is positive, saying that the paper is now more nuanced and re-framed in a more cautionary manner. Their requests are textual and fairly minor. Reviewer #2 starts positive but thinks that you are too dismissive of the possibility that contamination could have occurred during DNA isolation, and worries about your data being uploaded to GenBank without caveats; s/he also says that you missed an opportunity by not sequencing the substrate from which the lichens were obtained. Reviewer #3 thinks that the you still imply symbiosis too readily and is concerned that you propose to write a perspective piece about the topic. S/he recommends further re-framing the manuscript around issues that affect MAGs, rather than relationships between organisms; the rest of this review is a series of points about how your current manuscript is a halfway house, and how to make it more about the pros/cons of MAG approaches.

Your Appeal email rebutted some of the points raised by reviewers #2 and #3, but also proposed constructive measure to address their remaining concerns. As I understand it, these include a) being completely explicit about having zero direct evidence for symbiosis, b) making a circumstantial but strong case for the likelihood of contamination being negligible, c) clarifying why uploading the MAGs is “industry standard” and should not be controversial or problematic, d) incorporating text about the “open” nature of well accepted lichen symbioses (and why this reduces the onus on analysing substrate metagenomes), and e) including the eukaryotic side of the KEGG/pathway analysis to complement the current bacterial ones presented in Fig 6. To my mind, a reader should not walk away from this paper thinking that you have unequivocally demonstrated a symbiotic role (sensu stricto) for the bacterial components, but that the very useful data have been presented even-handedly and the circumstantial nature of the evidence for symbiosis made clear.

I should also say that when discussing the Appeal decision, the Academic Editor added the following, which you might find helpful: "One clarification that would help prevent misunderstandings is a clear definition of how the term 'symbiont' is being used. The original definition was just organisms of different species in sustained, close physical contact. But a common use of the term is more specific -- an organism playing some role in the interaction. And an even more specific definition is an organism playing a positive role by enhancing fitness of other participants (i.e. mutualists, not parasites). There is clear evidence for the first meaning, not so clear for the latter meanings."

In light of the reviews, which you will find at the end of this email, we would like to invite you to revise the work to thoroughly address the reviewers' reports.

Given the extent of revision needed, we cannot make a decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is likely to be sent for further evaluation by all or a subset of the reviewers.

We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension.

At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may withdraw it.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point-by-point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Revised Article with Changes Highlighted" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roli

Roland Roberts, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

------------------------------------

REVIEWERS' COMMENTS:

Reviewer #1:

Tagirdzhanova et al.

Microbial occurrence and symbiont detection in lichen metagenomes

This manuscript is a resubmission of a manuscript previously reviewed at Nature and Cell Genomics. I was a previous reviewer and have read the extensively revised manuscript and the complex set of reviews and author responses from the earlier submissions. The authors have made a major effort to address my previous comments and, I believe, those of the other reviewers. I recommend that the paper be accepted with minor revision.

Specific comments:

-Most generally, I want to highlight what I see as the most important element of the revised manuscript. It has a revised thrust and presents an important set of take-home messages and cautionary tales for lichen researchers, and metagenomicists in general. I won't rehash them here, but basically they are as follows: given what we know about the biology of lichens, there are things we can trust and things we cannot trust about MAG data. We now know enough to know that there are a variety of factors contribute to the extent to which key players in lichens are recovered in MAGs. Sometimes they are not recovered at all, despite knowing from other lines of inquiry that they are there (e.g., basidiomycetes). This is sobering and important for the community to take on board.

-Second, my opinion is that the authors have taken a suitably nuanced perspective of the concept of symbiosis and acknowledge the myriad uncertainties. The Discussion section on page 25-26 stands out in this regard, as does the "Limits to inference" section that immediately follows, which brings the question of 'surface sterilization' out into the open. The authors seem to be asking the question: do lichens [whatever that means] have a microbiome? I think this is an important question to ask and debate amongst researchers in the field. Side-stepping use of the term 'core' doesn't make the underlying issues go away.

-The introduction is truly excellent - it provides an authoritative overview of the history and current state of affairs in modern lichen research. It reads a bit like a thesis intro (maybe it is); certainly the extensive referencing (well over 100 citations) is a strength of the paper, to the extent that it aligns with the constraints of PLOS Biology.

-The 24 newly generated metagenomes are basically an afterthought in the current paper. Yes, they are embedded within a much larger study and analytical / interpretive framework, but readers will be interested in where they came from and how they were generated (provided on page 19, the very last section of the Results). Space permitting, draw more attention to it in the abstract (line 31) and consider moving this further up the results. I see why it is placed where it is - but maybe a nod to these data and their significance earlier would be helpful too.

-I am not sure why the authors chose to embed the figure legends but not the figures into the main text. This was not helpful.

-Line 127 - "We began by removing identical metagenomes that have been published twice in the sequence databases (n=43; S3 Table)". Meaning unclear based on main text alone. My brain instantly asked: do they mean literally the same dataset or independently sequenced metagenomes from the same location / environment by different labs? The former makes sense to remove, the latter would seem to be a useful control for experimental and bioinformatic procedures. This becomes clear from the supp info line 1381/82, i.e., they are literally the same data submitted twice to GenBank with different accession numbers, which of course is unfortunate. I suggest the authors make this clear in the main text as well.

-For much of the intro, the LFS acronym is used in the singular (e.g., "the lichen fungal symbiont (LFS),..."). This of course becomes clear that they are indeed exploring and discussing the presence of basidiomycetes as well. I suggest working in acknowledgement of the presence of multiple types of fungi earlier on. Why? Even non-specialists / lichen enthusiasts will be aware of the high-profile Science paper describing multiple fungal partners. The way the intro reads at first it almost sounds as though the authors are back-tracking on that observation, which they are not.

Reviewer #2:

This manuscript presents an extensive analysis of lichen metagenomes aimed at elucidating the microbial composition and detecting symbiotic relationships within lichen ecosystems. The authors have reassembled and analyzed over 400 publicly available metagenomes, generating metagenome-assembled genomes (MAGs), constructing phylogenomic trees, and mapping MAG occurrence. The study provides valuable insights into the prevalence and diversity of bacterial, algal, and fungal constituents across a wide array of lichen species.

Before the submission to PLoS Biology, the manuscript was submitted to the journals Nature and Cell. Reviews from these earlier submissions were attached to this submission.

The authors of the manuscript have made a significant change in the latest version by reducing the emphasis on describing a "core" and "global" set of microbial lichen symbionts. This change aligns with the recommendation of many reviewers. Although the authors' responses to the reviews suggest that they are not entirely convinced that their data does not allow such an evaluation, the changes are still appreciated. The new submission compares various methods to extract lichen metagenomes and presents the results more holistically by focusing on the total pool of genomes recoverable from the lichen data.

Reviewers have pointed out that there is contamination in both the public dataset and the data produced by the authors. I disagree with the dismissive response from the authors. The contamination could have occurred during DNA isolation, before the barcoding. Although the authors provide an unlikely biological explanation, I am surprised they decided to ignore this contamination. I assume they still plan to upload this data without commenting on it to Genbank, which could cause problems for colleagues who want to work with the data. Furthermore, I am concerned that these samples may be representative of other inconsistencies in their data.

Additionally, I believe there was a missed opportunity with the data they produced themselves. They could have addressed many of the criticisms raised by the reviewers by conducting further testing. For instance, they could have sequenced the substrate in addition to the lichen symbiosis to determine how many of the bacteria identified by sequencing were associated with the lichen and how many were sequenced by chance. Furthermore, they could have treated the same sample differently to test if they were correct in their speculation that irrespective of the treatment, the sequencing results would have remained the same.

The manuscript uses a metagenomic methodology to assess lichen symbioses, which is a valuable addition to the field. However, I have concerns regarding the inconsistencies in the presented data and the missed opportunities for further investigation. These factors cast doubt on the manuscript's suitability for publication in PLoS Biology.

Reviewer #3:

Dear Authors,

I have carefully reviewed the manuscript and the comments of other referees. Although I admire the effort behind putting together a manuscript, I have several major concerns, which prevent me from recommending its publication in its current form. Some of my major concerns align with those of the referees who reviewed the manuscript before.

Firstly, while the manuscript has shown improvement in certain areas, such as refraining from referring to the identified bacteria as core, the central narrative remains somewhat vague. Throughout the manuscript, you imply that bacteria are symbionts, yet refrain from stating this directly due to the lack of sufficient evidence to support the claim, as advised by most referees.

Specifically, while the referees have expressed concerns about the adequacy of evidence supporting this classification, you argue that the presence of bacteria, including on the unsterilised surface justifies them to be part of the biofilm and you also propose that bacterial gene pathways are beneficial and functionally relevant for fungi based on their mere presence in bacteria. However, the sole presence of microorganisms on an unsterilized surface does not necessarily imply a symbiotic relationship. Do we consider viruses and contaminants from the environment, such as dust and sand particles, as a part of biofilm and tag them as symbionts? No. Then why bacteria. Simply because bacteria have the pathways that could potentially be beneficial for fungus does not mean the identified bacteria are symbiont. Do closely related non-symbiont bacteria lack these pathways?

Until sufficient evidence is generated, I would suggest to refrain from making such claims.

Furthermore, your reliance on publishing a perspective to support your argument may indicate a lack of openness to alternative viewpoints, which undermines the purpose of peer review. You have faint to no support for their idea of bacteria being symbionts but you argue that you will publish a perspective on this. So you will increase the support for your claim by publishing opinions and perspectives. We should be careful about supporting such approach that an idea/perspective is suggested as a hypothesis but becomes a "suggestion" and eventually a fact. A perspective therefore would not give more support to bacteria being symbionts.

Moreover, I suggest refraining from name-calling, such as "Duke school," and instead showing respect for alternative hypotheses. I understand the frustration given the work put into a manuscript. However, given the insufficient evidence to support a strong symbiotic relationship, it's important to acknowledge and address the specific concerns raised by the referees.

As the central idea of the manuscript is not supported by the evidence, I would recommend major restructuring.

Best regards.

Major concerns:

1) The manuscript introduction appears to lack a clear direction, with an imbalance evident in all the sections. For instance, given the title of the manuscript "Microbial occurrence and symbiont detection in lichen metagenomes" and the main knowledge gap being "overall content and limitations of these metagenomes have not been assessed," I would expect the introduction to center around issues of MAGs and risks of bacterial read contamination. Instead, there is an undue emphasis on explaining the relationship between fungi and algae. This suggests that the manuscript may have undergone several rewrites, leading to a loss of focus and coherence in conveying the central story.

2) My second major concern aligns with one of the reviewers, regarding the use of MAGs to identify lichen-associated bacteria. This also explains the use of convoluted terminology between the abstract and introduction to state the main aim of the paper. While the manuscript progresses, the authors shift from "overall content and limitations of MAGs have not been assessed" to implying consistently that bacteria are symbionts but without ever stating it directly. They keep referring to the studies that investigated the possibility that bacteria are symbionts.

3) Between the two possible directions of the manuscript (bacteria in MAG or bacteria as lichen symbionts), based on the data, I would recommend opting for the first one as hinted in the abstract. For the second option, 1) the data is not apt, and 2) even if the presence is established, there is not enough or any evidence for stating that bacteria are symbionts (see detailed comments below).

4) My fourth but actually one of the biggest concerns is that despite various reviewers in two rounds of reviews, the authors are not keeping an open mind to the fact that several unrelated reviewers, for one reason or another, but all very strong, advise the authors that they do not have evidence to even suggest that bacteria are symbiotic to lichen associations or lichens are composed of fungus, algae, and bacteria. It seems that the authors got fixed on the idea of proving bacteria are symbiotic in lichens, and they are looking at everything through that lens, completely ignoring the concerns of all the reviewers.

5) Misleading idea. Starting with bacteria in MAGs to slowly transitioning to bacteria are symbionts is conveying that authors want to make a claim without evidence but being stopped by the reviewers' refrain to state it directly but keep implying it through the manuscript. I would strongly recommend against it. Please define an aim, state a knowledge gap, address it, and REFRAIN not only from stating but also implying the conclusions not supported by evidence.

Specific comments

1) The manuscript introduction appears to be lacking a clear direction, with an imbalance evident in all the sections.

For instance given the title of the manuscript "Microbial occurrence and symbiont detection in lichen metagenomes", and the main knowledge gap being "overall content and limitations of these metagenomes have not been assessed" I would expect the introduction centers around issues of MAGs and risks of bacterial read contamination.

Instead, there is an undue emphasis on explaining the relationship between fungi and algae. This suggests that the manuscript may have undergone several rewrites, leading to a loss of focus and coherence in conveying the central story.

As the introduction flows the primary focus of the paper, i.e., bacteria in MAG (if we read the title and abstract), shifts to bacteria as lichen symbionts. See below

Abstract: In lichen research, metagenomes are increasingly being used for evaluating symbiont composition and metabolic potential, but the overall content and limitations of these metagenomes have not been assessed. We reassembled over 400 publicly available metagenomes, generated metagenome-assembled genomes (MAGs), constructed phylogenomic trees and mapped MAG occurrence and frequency across the dataset.

Introduction: (lines 86-89) However, though hundreds of bacterial strains have been detected, overall few lichen symbioses have been censused for their bacterial composition, with most work focused on the model symbiosis involving the ascomycete fungus Lobaria pulmonaria (reviewed by Grimm et al. [13]).

Discussion:

lines 446-450

Our analyses of lichen metagenomes yielded two contrasting sets of results. On the one hand, our data provide evidence that bacteria that have until now been studied as potential lichen symbionts in only a couple dozen lichens in fact occur in or on hundreds of lichens, and by extracting over 600 MAGs we increase the amount of available genomic data by orders of magnitude.

Results:

If the paper is about overall content and limitations of these MAGs

What is the logic of analyzing the bacterial pathways and genes and their functional categories? In that case authors analyze the pathways present in bacteria from different labs when the samples were prepared

If the paper analyzes the possibility of bacteria being symbionts

1) The dataset is not suitable. In short most lichen genomes are generated to obtain the fungus and hence may not implement appropriate sample preparation protocol and environment.

2) As the data is generated by several different labs, the amount of bacteria in the samples is uncomparable.

3) Even if we ignore these points and consider the dataset is suitable for the study, the study only shows bacteria are present in the thalli but if the pathways that bacteria contain benefit the fungus is anything but pure supposition, especially because some of these bacteria may also be free living or associated with other organisms. In that case one must prove that these pathways exist entirely in bacteria symbiotic with lichens to say that these pathways are relevant for fungi and hence bacteria are symbionts.

I would suggest to define the aim of the manuscript clearly it can either be :

Bacterial contamination in lichenized fungal MAGs; As evident from the abstract lines 28-32

In lichen research, metagenomes are increasingly being used for evaluating symbiont composition and metabolic potential, but the overall content and limitations of these metagenomes have not been assessed. We reassembled over 400 publicly available metagenomes, generated metagenome-assembled genomes (MAGs), constructed phylogenomic trees and mapped MAG occurrence and frequency across the dataset."

Or

Screening lichens for the bacterial composition/symbionts; As implied in the introduction lines 86-89

However, though hundreds of bacterial strains have been detected, overall few lichen symbioses have been censused for their bacterial composition, with most work focused on the model symbiosis involving the ascomycete fungus Lobaria pulmonaria (reviewed by Grimm et al. [13]).

But not implying bacteria as symbionts in either of the above

Introduction contains details and knowledge gaps which have nothing to do with this title or abstract of the article

for instance:

73-76 Although much about the biology of the disparate photobiont lineages remains unclear, especially with respect to what they receive from the fungus, their population structuring vis-a-vis their LFS partners is consistent with the definition of an open symbiosis [20].

Another example: No consensus exists however on how many other symbionts might be involved, and since most other associated microbes lack visible chlorophyll, their detection is not trivial. Researchers as early as Maria Cengia Sambo in the 1920s cultured additional bacteria from lichens, leading to the speculation that lichens could be 'polysymbioses' of more than two partners.

This is an example of slowly shifting focus from bacteria in MAG (implying that bacteria are symbionts?)

The major section of results deals with the functional profile of the detected bacteria (line 311 Functional profiles of the most commonly detected bacteria, line 342-367 Carbon usage, 368-377 Aerobic an oxygenic phototrophy, 378-388 Nitrogen fixation, 389-410 Cofactor and iron scavenging, 411-426 Carbohydrate degradation). But unless there is a proof that bacteria are symbiotic to lichens we are reading about which pathways do bacteria detected on surface of lichens have.

Discussion

Major part of discussion (lines 446-525) is focussed on proving why bacteria and algae may go undetected in lichens to support that bacteria are common occurrence (=symbiotic according to authors). They refer to studies which have and have not found Cystobasidiomycetes, photobionts etc. in lichens before. Regarding this, first this is of methodological relevance and does not require a major section in discussion and second still it only shows that bacteria are present.

Decision Letter 3

Roland G Roberts

11 Sep 2024

Dear Dr Spribille,

Thank you for your patience while we considered your revised manuscript "Microbial occurrence and symbiont detection in lichen metagenomes" for publication as a Research Article at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor.

Based oon our Academic Editor's assessment of your revision, we are likely to accept this manuscript for publication, provided you satisfactorily address the remaining editorial points. Please also make sure to address the following data and other policy-related requests.

a) We routinely suggest changes to titles to ensure maximum accessibility for a broad, non-specialist readership, and to ensure they reflect the contents of the paper. In this case, we would suggest a minor edit to the title, as follows. Please ensure you change both the manuscript file and the online submission system, as they need to match for final acceptance:

"Microbial occurrence and symbiont detection in global lichen metagenomes"

b) Thank you for providing the funding institutions. Please also provide the grant number for each grant.

c) We have no word limit. Please place the "extended" methods and results in the main text.

d) You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Please supply the numerical values either in the a supplementary file or as a permanent DOI’d deposition for the following figures:

Figure 4AB, 5ABC, 6, S1, S4, S6, S7

NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

e) Please cite the location of the data clearly in all relevant main and supplementary Figure legends, e.g. “The data underlying this Figure can be found in S1 Data” or “The data underlying this Figure can be found in https://doi.org/10.5281/zenodo.XXXXX

f) Please also provide the tree files for Figures 2ABC, S5

g) I notice the statement saying “All MAGs, assemblies, and de novo generated raw data are submitted to ENA under the study accession PRJEB59037 (data release pending). Phylogenomic trees in Newick format are available at FigShare (pending). Custom scripts used for data analysis and visualization are available on GitHub (https://github.com/Spribille-lab/Aglobal-survey-of-lichen-symbionts-from-metagenomes) and FigShare (pending).”. We will require access before accepting the manuscript.

h) Please ensure that your Data Statement in the submission system accurately describes where your data can be found and is in final format, as it will be published as written there.

i) Many thanks for providing the underlying code in GitHub. However, because Github depositions can be readily changed or deleted, please make a permanent DOI’d copy (e.g. in Zenodo) and provide this URL in the manuscript and Data Availability Statement.

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable, if not applicable please do not delete your existing 'Response to Reviewers' file.)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Melissa

Melissa Vázquez Hernández, PhD

Associate Editor, PLOS Biology

mvazquezhernandez@plos.org

on behalf of

Roland

Roland Roberts, PhD

Senior Editor

rroberts@plos.org

PLOS Biology

Decision Letter 4

Roland G Roberts

24 Sep 2024

Dear Toby,

Thank you for the submission of your revised Research Article "Microbial occurrence and symbiont detection in a global sample of lichen metagenomes" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Nancy Moran, I'm pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS: We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely, 

Roli

Roland G Roberts, PhD, PhD

Senior Editor

PLOS Biology

rroberts@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Dot histogram of lichen metagenomes arranged by sequencing depth.

    Each metagenome is shown with a dot; the dots are colored according to the taxonomy of the lichen fungal symbiont (LFS). The dots are arranged on the x-axis based on the sequencing depth (bp) and are “stacked” on top of each other. The data underlying this figure can be found in S1 Table.

    (TIF)

    pbio.3002862.s001.tif (1.6MB, tif)
    S2 Fig. Sampling locations for the metagenomic data used in the analysis and the geographic distributions of most frequent bacterial genera.

    Each dot represents a sample used for metagenomic sequencing. The basemap shapefile was taken from the rnaturalearthdata library (v0.1.0) and belongs to public domain (https://www.naturalearthdata.com/).

    (TIF)

    pbio.3002862.s002.tif (1.1MB, tif)
    S3 Fig. Geographic distribution of selected groups of symbionts, based on 3 methods of screening: presence of MAGs, presence of SSU rRNA gene in the metagenomic assemblies, and presence of the SSU rRNA in the raw, unassembled reads.

    Each dot represents a sample used for metagenomic sequencing. Here are shown data on the 4 most frequent bacterial families, on the genus Lichenihabitans, and on the 3 eukaryotic lineages known to be stably associated with lichens. The basemap shapefile was taken from the rnaturalearthdata library (v0.1.0) and belongs to public domain (https://www.naturalearthdata.com/).

    (TIF)

    pbio.3002862.s003.tif (5.6MB, tif)
    S4 Fig. Co-occurrence networks of lichen symbionts based on presence-absence of MAGs in each metagenome.

    Each node is a MAG, and edges represent the co-occurrence of MAGs within 1 metagenome; the thicker the edge, the more often 2 MAGs co-occur. Nodes are colored based on the taxonomy and function of the symbiont; in each network, yellow nodes represent MAGs of the LFS (lichen fungal symbiont). Only data on metagenomes that yielded an LFS MAG are shown. (A) Co-occurrence of LFSs, other known eukaryotic symbionts, and Cyanobacteria. (B) Co-occurrence of LFSs and Lichenihabitans. (C) Co-occurrence of LFSs and most frequent genera of Acetobacteraceae. (D) Co-occurrence of LFSs and most frequent genera of Sphingomonadaceae. (E) Co-occurrence of LFSs and most frequent genera of Acidobacteriaceae. The data underlying this figure can be found in S1 Data.

    (TIFF)

    pbio.3002862.s004.tiff (243.7KB, tiff)
    S5 Fig. Maximum likelihood phylogenetic tree of Hyphomicrobiales (Rhizobiales).

    The tree includes published genomes of Hyphomicrobiales and the Hyphomicrobiales MAGs derived from the lichen metagenomes (indicated in red). We generated the alignment of 120 marker genes using GTDB-Tk and calculated the tree using IQ-TREE. The color represents family-level taxonomic assignment. We used tblastn to search the genomes for key genes involved in nitrogen fixation and C1 metabolism. The presence of these genes is indicated with symbols. The full-size version of the tree in the Newick format is available at FigShare (10.6084/m9.figshare.27054937).

    (TIF)

    pbio.3002862.s005.tif (2.5MB, tif)
    S6 Fig. Recovery of MAGs of the main 2 symbionts as a function of sequencing depth.

    These graphs are based on the pre-dereplication set of MAGs, each dot represents a metagenome and is positioned based on its sequencing depth and on whether it contained MAGs assigned to one or both of the 2 main partners: (A) the LFS (lichen fungal symbiont); (B) the photobiont partner; (C) both the LFS and the photobiont partner. The data underlying this figure can be found in S1 Data.

    (TIFF)

    pbio.3002862.s006.tiff (425.3KB, tiff)
    S7 Fig. Relative abundances of symbionts in lichen metagenomes.

    The relative abundances were calculated by dividing the coverage depth of the symbiont MAG by the coverage of the LFS MAG. Here are shown data on the 13 most frequent bacterial genera and the eukaryotes known to be stably associated with lichens. The red line shows 1:1 ratio, where the symbiont is estimated to have the same cellular abundance as the main fungal symbiont. The boxplot elements are defined as: center line, median; box limits, 25th and 75th percentiles; whiskers, 1.5× interquartile range. The data underlying this figure can be found in S1 Data.

    (TIFF)

    pbio.3002862.s007.tiff (896KB, tiff)
    S1 Data. Excel spreadsheet containing, in separate sheets for each figure, the underlying data used for Figs 3, 4B, 5A, 5B, 5C, 6, 7, S4, S6, and S7.

    (XLSX)

    pbio.3002862.s008.xlsx (347.9KB, xlsx)
    S1 File. KEGG module completeness for prokaryotic MAGs selected for analysis.

    (PDF)

    pbio.3002862.s009.pdf (93.3KB, pdf)
    S2 File. KEGG module completeness for eukaryotic MAGs selected for analysis.

    (PDF)

    pbio.3002862.s010.pdf (1.3MB, pdf)
    S1 Table. Metagenomic data used in the analysis.

    (XLSX)

    pbio.3002862.s011.xlsx (45.1KB, xlsx)
    S2 Table. Details on the metagenomes generated de novo for this study.

    (XLSX)

    pbio.3002862.s012.xlsx (15.9KB, xlsx)
    S3 Table. Pairs of identical metagenomes (sourmash similarity = 1) identified during analysis.

    Using sourmash, we identified the pairs of identical metagenomes, potentially deriving from one data set having been submitted to NCBI twice under different accession numbers. This table contains all such pairs and shows which of the pair was retained for the further analysis.

    (XLSX)

    pbio.3002862.s013.xlsx (12.8KB, xlsx)
    S4 Table. Metagenome-assembled genomes generated from the lichen metagenomic data.

    For prokaryotic MAGs, contamination and completeness scores are based on the CheckM results, and taxonomic assignments are based on GTDB. For eukaryotes, contamination and completeness scores are based on the EukCC results, and taxonomic assignments are based on the BAT assignments corrected via phylogenomic analysis. Number of occurrences represents the number of metagenomes this MAG was detected in.

    (XLSX)

    pbio.3002862.s014.xlsx (69.3KB, xlsx)
    S5 Table. List of reference genomes used for construction of the phylogenomic trees.

    (XLSX)

    pbio.3002862.s015.xlsx (17.7KB, xlsx)
    S6 Table. Metagenomes derived from potentially misidentified samples.

    Metagenomes in this list had inconsistencies between their metadata and the taxonomy of the LFS as estimated from the phylogenomic tree. These metagenomes were excluded from the occurrence analysis.

    (XLSX)

    pbio.3002862.s016.xlsx (12.5KB, xlsx)
    S7 Table. Most frequent bacterial lineages (on the family and genus level) in various groups of lichen metagenomes.

    Bacterial lineages were ranked based on the total number of their occurrences. For each lichen group here we give top-ten bacterial taxa.

    (XLSX)

    pbio.3002862.s017.xlsx (29.3KB, xlsx)
    S8 Table. Occurrence of key bacterial groups in metagenomes originating from different studies.

    For 13 most common bacterial genus-level lineages and 4 bacterial families, we are showing the number of metagenomes in which they were detected in as MAGs, split by the source of the metagenomic data (the study it was reported in). The total number of metagenomes that come from each study and which were included in the Dataset 1 are shown at the bottom of the table.

    (XLSX)

    pbio.3002862.s018.xlsx (10.6KB, xlsx)
    S9 Table. Number of MAGs from different categories in each metagenome.

    The MAG categories were based on a combination of taxonomy and function. For the metagenomes that were identified as potentially derived from misidentified samples, we assigned their LFS to a separate category “LFS misassigned.”

    (XLSX)

    pbio.3002862.s019.xlsx (62.2KB, xlsx)
    S10 Table. Bacterial families ranked on their prevalence in lichen metagenomes, as detected via screening of rRNA genes in metagenomic assemblies and unassembled reads.

    Prevalence is calculated using only metagenomes from the Dataset 1 (n = 375).

    (XLSX)

    pbio.3002862.s020.xlsx (37.1KB, xlsx)
    S11 Table. Details on additional metagenomes from NCBI’s SRA.

    (XLSX)

    pbio.3002862.s021.xlsx (20.5KB, xlsx)
    S12 Table. Prevalence of key taxa in additionally screened metagenomes.

    (XLSX)

    pbio.3002862.s022.xlsx (14KB, xlsx)
    S13 Table. MAGs selected for the in depth functional annotation.

    To select these MAGs, we first calculated the number of occurrence per genus, and selected the 13 top genera. Next, we selected among them the MAGs with >95% completeness and <5% contamination, according to CheckM.

    (XLSX)

    pbio.3002862.s023.xlsx (17.2KB, xlsx)
    S14 Table. Sequences used as a blast query for the screening genomes for the signs of nitrogen fixation, methanotrophy, and methylotrophy.

    (XLSX)

    pbio.3002862.s024.xlsx (11.8KB, xlsx)
    S15 Table. Number of iron-related genes annotated with FeGenie.

    (XLSX)

    pbio.3002862.s025.xlsx (17.3KB, xlsx)
    S16 Table. Median number of CAZymes assigned to different classes per MAG for the most frequent bacterial genera.

    (XLSX)

    pbio.3002862.s026.xlsx (12.6KB, xlsx)
    S17 Table. Number of CAZymes assigned to different CAZy families in the 63 MAGs selected for in-depth annotation.

    (XLSX)

    pbio.3002862.s027.xlsx (48.1KB, xlsx)
    S18 Table. Biosynthetic Gene Clusters (BGC) predicted for the selected 63 bacterial MAGs.

    We used SanntiS to predict the BGCs and to annotate them by identifying the most similar BGC included in the MiBIG database. Only the hits with <0.7 Jaccard distance are listed in this table. BGCs that are similar to known clusters producing exopolysaccharides are indicated in bold.

    (XLSX)

    pbio.3002862.s028.xlsx (34.1KB, xlsx)
    S19 Table. Presence/absence of cobalamin-dependent MetH and cobalamin-independent MetE in the high-quality algal genomes (completeness >90%, contamination <10%, according to EukCC).

    (XLSX)

    pbio.3002862.s029.xlsx (11.8KB, xlsx)
    S20 Table. Reference genomes from NCBI used for the Hyphomicrobiales phylogenomic tree.

    Taxon sampling followed Volpiano and colleagues [138]. Taxonomic classification was done using GTDB-Tk.

    (XLSX)

    pbio.3002862.s030.xlsx (32.5KB, xlsx)
    Attachment

    Submitted filename: Responses_to_reviews_PLOS_July_2024.docx

    pbio.3002862.s031.docx (50.3KB, docx)
    Attachment

    Submitted filename: Responses_to_reviews_PLOS_July_2024.docx

    pbio.3002862.s032.docx (50.3KB, docx)

    Data Availability Statement

    The sequencing data in this project are submitted to ENA: de novo generated raw data (study accession PRJEB59037), metagenomic assemblies (PRJEB72384, PRJEB72386-PRJEB72404, PRJEB72498- PRJEB72501), and MAGs (PRJEB77567). Phylogenomic trees in Newick format are available at FigShare (10.6084/m9.figshare.27054937). Custom scripts used for data analysis and visualization are available on GitHub (https://github.com/Spribille-lab/2024-Microbial-occurrence-in-lichen-metagenomes) and FigShare (10.6084/m9.figshare.27054937).

    The sequencing data in this project are submitted to ENA: de novo generated raw data (study accession PRJEB59037), metagenomic assemblies (PRJEB72384, PRJEB72386-PRJEB72404, PRJEB72498- PRJEB72501), and MAGs (PRJEB77567). Phylogenomic trees in Newick format are available at FigShare (10.6084/m9.figshare.27054937). Custom scripts used for data analysis and visualization are available on GitHub (https://github.com/Spribille-lab/2024-Microbial-occurrence-in-lichen-metagenomes) and FigShare (10.6084/m9.figshare.27054937).


    Articles from PLOS Biology are provided here courtesy of PLOS

    RESOURCES