Abstract
Trans-Himalayan hot spring waters rich in boron, chlorine, sodium and sulfur (but poor in calcium and silicon) are known based on PCR-amplified 16S rRNA gene sequence data to harbor high diversities of infiltrating bacterial mesophiles. Yet, little is known about the community structure and functions, primary productivity, mutual interactions, and thermal adaptations of the microorganisms present in the steaming waters discharged by these geochemically peculiar spring systems. We revealed these aspects of a bacteria-dominated microbiome (microbial cell density ~8.5 × 104 mL-1; live:dead cell ratio 1.7) thriving in the boiling (85°C) fluid vented by a sulfur-borax spring called Lotus Pond, situated at 4436 m above the mean sea-level, in the Puga valley of eastern Ladakh, on the Changthang plateau. Assembly, annotation, and population-binning of >15-GB metagenomic sequence illuminated the numeral predominance of Aquificae. While members of this phylum accounted for 80% of all 16S rRNA-encoding reads within the metagenomic dataset, 14% of such reads were attributed to Proteobacteria. Post assembly, only 25% of all protein-coding genes identified were attributable to Aquificae, whereas 41% was ascribed to Proteobacteria. Annotation of metagenomic reads encoding 16S rRNAs, and/or PCR-amplified 16S rRNA genes, identified 163 bacterial genera, out of which 66 had been detected in past investigations of Lotus Pond’s vent-water via 16S amplicon sequencing. Among these 66, Fervidobacterium, Halomonas, Hydrogenobacter, Paracoccus, Sulfurihydrogenibium, Tepidimonas, Thermus and Thiofaba (or their close phylogenomic relatives) were presently detected as metagenome-assembled genomes (MAGs). Remarkably, the Hydrogenobacter related MAG alone accounted for ~56% of the entire metagenome, even though only 15 out of the 66 genera consistently present in Lotus Pond’s vent-water have strains growing in the laboratory at >45°C, reflecting the continued existence of the mesophiles in the ecosystem. Furthermore, the metagenome was replete with genes crucial for thermal adaptation in the context of Lotus Pond’s geochemistry and topography. In terms of sequence similarity, a majority of those genes were attributable to phylogenetic relatives of mesophilic bacteria, while functionally they rendered functions such as encoding heat shock proteins, molecular chaperones, and chaperonin complexes; proteins controlling/modulating/inhibiting DNA gyrase; universal stress proteins; methionine sulfoxide reductases; fatty acid desaturases; different toxin-antitoxin systems; enzymes protecting against oxidative damage; proteins conferring flagellar structure/function, chemotaxis, cell adhesion/aggregation, biofilm formation, and quorum sensing. The Lotus Pond Aquificae not only dominated the microbiome numerically but also acted potentially as the main primary producers of the ecosystem, with chemolithotrophic sulfur oxidation (Sox) being the fundamental bioenergetic mechanism, and reductive tricarboxylic acid (rTCA) cycle the predominant carbon fixation pathway. The Lotus Pond metagenome contained several genes directly or indirectly related to virulence functions, biosynthesis of secondary metabolites including antibiotics, antibiotic resistance, and multi-drug efflux pumping. A large proportion of these genes being attributable to Aquificae, and Proteobacteria (very few were ascribed to Archaea), it could be worth exploring in the future whether antibiosis helped the Aquificae overcome niche overlap with other thermophiles (especially those belonging to Archaea), besides exacerbating the bioenergetic costs of thermal endurance for the mesophilic intruders of the ecosystem.
Introduction
Hydrothermal environments, despite being the potential nurseries of early life on Earth [1–3], are characterized by the entropic disordering of biomacromolecular systems, which in turn restricts the habitability of these ecosystems [4,5]. Thermophilic and hyperthermophilic microorganisms [6], that are adept to counter chaos and restore order at the molecular level without incurring untenable energy costs for cell system maintenance, are the habitual natives of hydrothermal vents. For most other taxa abundant at the 30–40°C sites of these ecosystems (these are the phylogenetic relatives of mesophilic microorganisms), habitability is progressively constrained in the 50°C to 100°C trajectory [5,7–10]. Conversely, as the bioenergetic cost of cell system maintenance get relaxed down the temperature gradient, habitability, and therefore in situ diversity, increases for most microbial taxa except for the true thermophiles [11,12].
Idiosyncratic to the low-biodiversity community structure axiomatic for hydrothermal vent ecosystems, a geochemically unusual category of Trans-Himalayan hot springs, located on the either side of the tectonically active Indus Tsangpo Suture Zone (ITSZ, the collision intersection between the Asian and Indian continental crusts engaged in Himalayan orogeny) in eastern Ladakh, India [13–16], has recently been reported for their extra-ordinarily diversified vent-water microflora [5,10,17,18]. Compared with other well studied hydrothermal systems, the geochemical specialty of these Trans-Himalayan hot springs lies in their paucity of dissolved solids in general and calcium and silicon in particular, which again is accompanied by the exceptional abundance of boron, chlorine, sodium, and various sulfur species including both sulfide and sulfate. Concurrent to such reports of highly bio-diverse microbiomes from the hot springs of the Puga valley and Chumathang geothermal areas situated on the Changthang plateau, to the south and north of the ITSZ respectively, studies of microbial diversity within the vent-waters, and vent-adjacent areas, of a number of hot springs located across discrete Himalayan and Trans-Himalayan geothermal regions have corroborated that hydrothermal environments are not “Thermophiles only” territories [19–26].
As for the well-studied Puga hydrothermal system, PCR-amplified 16S rRNA gene sequence-based investigations of microbial communities within the boiling vent-waters of a number of springs have revealed the presence of several such genera whose members are incapable of laboratory-growth at >45°C [10,17,18]. Diurnal fluctuations in the biodiversity and chemistry of the vent-waters, in conjunction with circumstantial geological evidences, have indicated that the copious tectonic faults of this ITSZ-adjoining area serve as potential conduits for the infiltration of meteoric water, and therewith mesophilic microorganisms, to the shallow geothermal reservoir [10]. The phenomenon of mesophilic microorganisms getting introduced to hot spring systems has been reported from other parts of the world too; for instance, systemic infiltration of mesophiles from surface and sub-surface sources of cold water and meteoric fluids have been observed for hot springs of New Zealand and Yellowstone National Park, USA [27,28]. Furthermore, for the Puga hot spring system, shotgun metagenome sequencing-based investigation of microbial mat communities along the spring-water transits and dissipation channels of a microbialite has identified several genus-level populations to occupy such sites within the hydrothermal gradients which have temperatures tens of degree-Celsius above the in vitro growth-limits of those taxa [5,29]. While the overall geomicrobiology of the Puga hot springs have indicated the system’s peculiar chemistry (mineralogy) to be a key facilitator of the high habitability of the environment [5,18], physiological attributes of a vent-water isolate identified dissolved solutes such as lithium and boron, which are characteristic to the Puga hydrothermal system [15,16], as the key enablers of high temperature endurance by the mesophilic bacteria present in situ [30].
Despite a considerable progress in our basic awareness about the peculiar geobiology of Trans-Himalayan sulfur-borax hot springs, we still do not have any quantitative idea about the structural and functional dynamics of the vent-water microbiomes of these astrobiologically significant hydrothermal systems [16,31,32]. For instance, we have a fair measure of the microbial diversity in terms of the variety of taxa present, but we are completely uninformed about the relative abundance and temporal consistency of the individual taxa constituting the microbiome, the thermal adaptations of the native microorganisms (especially the phylogenetic relatives of mesophilic bacteria present), and the mutual interactions of the various components of the microbiome. We also know nothing about the pathways of primary productivity that sustain the ecosystem in these boiling spring waters. To address these shortcomings in our knowledge on the ecology of Trans-Himalayan sulfur–borax spring systems, this study comprehensively reveals the vent-water microbiome of the Lotus Pond hot spring, which is one of the most conspicuous hydrothermal manifestations within the Puga valley [10,16,17]. While the bacterial diversity of Lotus Pond’s vent-water has been investigated previously by means of 16S amplicon sequencing [10,17], the present exploration, for the first time, unraveled the qualitative as well as quantitative architecture of the microbiome via shotgun metagenome sequencing, and fluorescence microscopy. Besides elucidating the dominance and equitability of taxa within the microbiome, the >15 giga base metagenomic sequence data generated was analyzed to illuminate the aforesaid aspects of microbiome functioning. Furthermore, to know whether the Lotus Pond vent-water microbiome remained consistent over time, its bacterial and archaeal diversity was evaluated afresh by amplified 16S rRNA gene sequencing and comparing the current findings with the previous data obtained on the basis of similar analyses.
Materials and methods
Study site and sampling
Lotus Pond, located at 33°13’46.3”N and 78°21’19.7”E, is one of the most prominent hot springs or geysers within the Puga valley (Fig 1A), which in turn is one of the most vigorous geothermal areas of the Indian Trans-Himalayas. Notably, Puga Valley is situated at an altitude of 4436 m, where the boiling point of water remains at around 85°C [10,16,33]. The venting point (Fig 1B) is embedded in a hot water pool that has been formed by its own discharges and lies within the old crater (Fig 1A) which was formed from the same hydrothermal eruption that gave rise to the geyser itself. From one side of the hot water pool, the spring-water flows over thin unstable terraces of geothermal sinters, into the Rulang rivulet, which meanders through the valley of Puga. On the other flank of the hot water pool, a wall of travertine, sulfur, and boron mineral deposits [10,16] guard the crater and debars the spring-water from escaping the pool (Fig 1C).
Fig 1.
Topography of the study site: (A) the Lotus Pond hot spring in the context of the river Rulang and the valley of Puga, (B) the boiling vent-water explored for microbiome structure and function, (C) boratic sinters, and fresh condensates of sulfur and boron minerals covering the broken wall of the old crater within which Lotus Pond is embedded.
Water (having temperature 85°C and pH 7.5) was collected as described at length previously [5,10,17,18] from Lotus Pond’s vent at 13:00 h of 29 October 2021, when ebullition was strong and fumarolic activity copious. For microscopic studies, 500 mL vent-water was passed through a sterile 0.22 μm mixed cellulose ester membrane filter (Merck Life Science Private Limited, India) having a diameter of 47 mm, and inserted into a sterilized cryovial containing 5 mL of autoclaved 0.9% (w/v) NaCl solution supplemented with 15% (v/v) glycerol. Simultaneously, for the extraction of total environmental DNA (metagenome), overall 5 L vent-water was passed through five individual sterile 0.22 μm filters, each of which was inserted into a separate sterilized cryovial containing 5 mL of autoclaved 50 mM Tris:EDTA (pH 7.8). After filter insertion, all the cryovials were sealed with parafilm (Tarsons Products Limited, India), packed in polyethylene bags, put into dry ice, and air lifted to the laboratory at Bose Institute in Kolkata. The entire on-field procedure was carried out aseptically using sterilized syringes (Tarsons Products Limited, India), Swinnex filter holders (Merck KGaA, Germany), and forceps on every occasion of water collection from the vent, filtration through the membrane, and insertion of the filter to the cryovial.
Enumeration of microbial cells
Total microbial cell count per unit volume of vent-water was determined alongside the number of metabolically active and metabolically inactive cells present. The microbial cells that were filtered out from 500 mL vent-water were first detached from the adherence matrix by shredding the filter with sterilized scissors, and then vortexing for 30 min, within the NaCl-glycerol containing cryovial in which the filter was inserted on-field. The vial was spun at 1000 g for 5 seconds to allow the filter shreds to settle down. Finally, the supernatant was collected (without disturbing the bottom debris), its exact volume measured, and then analyzed as described previously [34] using a hemocytometer (Paul Marienfeld GmbH & Co. KG, Germany) and an upright fluorescence microscope (Olympus BX53 Digital, Olympus Corporation, Japan). Microbial cell density measurement was carried out based on the axiom that the final supernatant collected contained all the microbial cells that were present in the 500 mL vent-water filtered on-field. The cell suspension (i.e. the collected supernatant) was stained with 4′6-diamidino-2-phenylindole (DAPI), fluorescein diacetate (FDA) or propidium iodide (PI) solution, and cell numbers were counted as described previously [35]. Details of the procedure followed for microscopy and microbial cell enumeration have been given in the Supplementary Methods.
Metagenome extraction and sequencing
The 0.22 μm membrane filters suspended in sterile 50 mM Tris:EDTA (pH 7.8) were treated as described previously [5,10,17,18], and from the final cell suspension obtained metagenomic DNA was extracted using PureLink Genomic DNA Mini Kit (Thermo Fisher Scientific, USA) following the manufacturer’s protocol. DNA molecules having high levels of structural integrity, as evident from NanoDrop spectrophotometry and Qubit fluorometry (both the operations were carried out using platforms manufactured by Thermo Fisher Scientific), were used as the input material (approximately 1 ng total quantity) for preparing shotgun metagenome sequencing library with the help of Nextera XT DNA Library Prep Kit (Illumina Inc., USA). Quality of the final library was assesed using high sensitivity D1000 screen tape in 2200 TapeStation (Agilent Technologies Inc., USA), while its quantification was carried out using a Qubit fluorometer. Paired end (2 × 250 nucleotides) sequencing of the library was performed in a Novaseq 6000 (Illumina Inc.) next-generation DNA sequencer. The whole metagenomic sequence dataset of 31353866 read-pairs obtained in the process was submitted to the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI), USA, under the BioProject PRJNA296849.
In tandem with the aforesaid procedure, potential traces of contaminating microial DNA, often referred to as the “kitome” [36,37], was isolated in the same way as described above for the five membrane filters through which vent-water was passed on-field. The only difference in case of the kitome preparation was that no vent-water was passed through any of the five filters, which in this case also were inserted to five individual sterilized cryovials each containing 5 mL of autoclaved 50 mM Tris:EDTA (pH 7.8). The final suspension obtained for the blank filters was subjected to the same DNA extraction procedure using the same PureLink Genomic DNA Mini Kit as it was done for the actual sample suspension. No DNA could be detected using NanoDrop spectrophotometry or Qubit fluorometry, so the so-called kitome DNA preparation could not be subjected to shotgun sequencing on the Illumina platform.
Assembly and annotation of the metagenomic sequence
Clipping of adapters and quality-filtering of the trimmed sequences (for average Phred score ≥20) were carried out using Trim Galore v0.6.4 (https://github.com/FelixKrueger/TrimGalore) with default parameters. The 31342225 processed read-pairs obtained post quality filtering were subjected to de novo assembly by Megahit v1.2.9 using default parameters [38]. Open reading frames (ORFs) or genes were identified within the assembled contigs using Prodigal v2.6.3 with default parameters [39]. Annotation of the predicted ORFs into protein coding sequences (CDSs) was carried out by searching against the eggNOG database v5.0 [40] using eggNOG-mapper v2.1.9 in default mode using Diamond algorithm [41]. The software eggNOG-mapper aligns CDSs to precomputed orthology assignments in the eggNOG database in such a way that it can infer orthologs, and transfer the functional annotations and taxonomic information obtained to the queries, thereby providing detailed taxonomic annotations for the individual metagenomic CDSs analyzed [41]. Notably, CDS annotated in this way included potential pseudogenes, which could also be outcomes of sequencing error-induced frame shifts. The eggNOG-derived data was processed to assign orthology identities, and determine pathway affiliations, for the annotated CDSs with the help of the information available in the literature in tandem with those provided in the Kyoto Encyclopedia of Genes and Genomes (KEGG; https://www.genome.jp/kegg/). Furthermore, to identify the clusters of orthologous genes (COGs) to which the different ORFs predicted within the assembled metagenome belonged, the translated form of the Prodigal-derived gene catalog was annotated by searching against the COG Little Endian version of the Conserved Domain Database of NCBI located at https://ftp.ncbi.nih.gov/pub/mmdb/cdd/little_endian/Cog_LE.tar.gz, using COGclassifier v1.0.5 [42]. The bacterial version v7.0 of the antibiotics & Secondary Metabolite Analysis Shell (antiSMASH) pipeline was used in strict detection mode [43,44] to identify gene clusters for secondary metabolites biosynthesis within the assembled metagenome of Lotus Pond. To identify antibiotic resistance genes, the assembled metagenome was searched against the Comprehensive Antibiotic Resistance Database (CARD) v3.2.8 using the software Resistance Gene Identifier v6.0.3 with default parameters [45]. Here, gatifloxacin, levofloxacin, moxifloxacin, nalidixic acid, norfloxacin, sparfloxacin, and ciprofloxacin antibiotics were grouped as fluoroquinolones; cloxacillin and oxacillin were grouped under beta-Lactam antibiotics.
Direct annotation of metagenomic reads corresponding to rRNA gene sequences
The 62684450 quality-filtered reads available for the Lotus Pond metagenome were searched against the rrnDB database v5.8 [46] using Bowtie2 v2.2.5 [47] in “—end-to-end” mode to extract sequences matching prokaryotic 16S rRNA genes. The aligned FASTQ file was converted to FASTA format and used for sequence annotation with the help of RDP Classifier located at http://https://rrndb.umms.med.umich.edu/estimate/. While 82756 aligned reads were extracted from Lotus Pond’s metagenomic sequence dataset via mapping against rrnDB, their eventual annotation using RDP Classifier with confidence cut-off 0.8 led to the taxonomic classification of 81619 reads.
Construction and taxonomic characterization of metagenome-assembled genomes (MAGs)
The de novo assembled metagenomic contigs that were ≥2500 nucleotides in length were first binned into putative population genomes or MAGs, via three separate procedures using three different softwares with default parameters: Metabat2 v2.12.1 [48], MaxBin2 v2.2.4 [49], and CONCOCT v1.1.0 [50]. Best quality MAGs were then selected using DASTool v1.1.6 [51] with default parameters from the outputs of the three different binning procedures. The MAGs selected were checked for their completeness and contamination level with the help of CheckM v1.2.2 by searching against collocated sets of marker genes that are ubiquitous and single-copy within a given phylogenetic lineage [52].
Taxonomic identity of the selected MAGs was first predicted independently via searches for genom-genome relatedness conducted using Genome Taxonomy Database Toolkit (GTDB-Tk) v1.7.0 [53], Rapid Annotations using Subsystems Technology (RAST) [54], and Type Strain Genome Server (TYGS) in default mode [55]. As and when required, MAG versus genome/MAG pairs were tested for the extent of their in silico or digital DNA-DNA hybridization (dDDH) using the online software Genome-to-Genome Distance Calculator (GGDC) v3.0 [55]. In TYGS as well as GGDC analyses, total length of all high-scoring segment pairs (HSPs) divided by the total genome length (TGL) of the query MAG was considered as the dDDH value. Taxonomic classifications inferred using the aforesaid index, i.e. “(total HSP length) / (TGL)”, were further corroborated by testing for the index “(total identities found in HSPs) / (total HSP length)”. Notably, these two indices are also known as TYGS formula d0 / GGDC formula 1, and TYGS formula d4 / GGDC formula 2, respectively. Orthologous genes-based average nucleotide identity (ANI) between a pair of MAG and genome/MAG was calculated using ChunLab’s online ANI Calculator [56].
The MAG sequences were deposited to the GenBank and annotated for their gene contents using the Prokaryotic Genome Annotation Pipeline (PGAP; https://www.ncbi.nlm.nih.gov/genome/annotation_prok/) of the NCBI, USA. Phylogeny of bacterial MAGs, in relation to their closest genomic relatives, was reconstructed based on conserved marker gene sequences, using the Up-to-date Bacterial Core Gene Pipeline v3.0 [57] with default parameter. The dataset used for this purpose contained all the close genomic relatives of the individual MAGs that were indicated by GTDB-Tk and TYGS analyses. The best fit nucleotide substitution model was selected using ModelTest-NG v0.1.7 [58], following which a maximum likelihood tree was generated using RAxML v8.2.12 [59] with 10,000 bootstrap experiments. The software utility called Interactive Tree of Life v6.7.5 [60] was used to visualize the phylogram graphically. The KEGG automatic annotation server (KAAS) was used for ortholog assignment and pathway mapping for the CDSs identified within the MAGs [61]. Relative abundance of the individual MAGs within the metagenome was determined by mapping all quality-filtered metagenomic read-pairs available onto the individual MAG sequences via independent in silico experiments carried out using Bowtie2 v2.2.5 [47] in “—end-to-end” mode.
PCR-amplified 16S rRNA gene sequence analysis
PCR-amplified 16S rRNA gene sequence-based taxonomic diversity was revealed using the fusion primer protocol [10,17,22] on the Ion S5 DNA sequencing platform (Thermo Fisher Scientific), which in turn involved PCR amplification, sequencing, and analysis of the V3 regions of all Bacteria-specific 16S rRNA genes and V4-V5 regions of all Archaea-specific 16S rRNA genes [18] present in the Lotus Pond metagenome. The forward primer 5′ -CCTACGGGAGGCAGCAG- 3′ and the reverse primer 5’ -ATTACCGCGGCTGCTGG- 3′ were used to amplify the bacterial V3 regions, while the archaeal V4-V5 regions were amplified using the forward and reverse primers 5′ -GCYTAAAGSRICCGTAGC- 3′ and 5′ -TTTCAGYCTTGCGRCCGTAC- 3′ respectively [18]. Albeit no quantifiable DNA was present in the so-called kitome preparation, it was still subjected to individual PCRs using the two 16S rRNA gene sequence primer-pairs mentioned above. Notably, no DNA could be detected even after re-PCR with any of the two primer-pairs used, so the null PCR products were not subjected to sequencing on the Ion S5 platform.
Reads present in an amplicon sequence dataset (725025 and 589009 reads were there in the bacterial and archaeal sets respectively) were trimmed for their adapters, filtered for average Phred score ≥20 and length >100 nucleotides, and then clustered into operational taxonomic units (OTUs), i.e. species-level entities united by ≥97% 16S rRNA gene sequence similarities, by USEARCH v10.0.259 [62]. While singleton sequences were eliminated before clustering, OTUs generated for a given dataset were classified taxonomically based on their consensus sequences using the RDP Classifier with confidence cut-off 0.8. Rarefaction curves (S1 Fig in S1 File) were generated via USEARCH v10.0.259.
Results
Density of microbial cells in Lotus Pond
Gross density of microbial cells in the vent-water of Lotus Pond was found to be approximately 8.5 × 104 mL-1, via staining with DAPI, followed by hemocytometry using an upright fluorescence microscope (Fig 2). Using similar instrumentation, the density of metabolically active cells was estimated to be approximately 5.4 × 104 mL-1, via staining with FDA, while that of apparently dead cells was found to be 3.2 × 104 mL-1, via PI staining. Thus, the ratio between the densities of metabolically active and dead cells in Lotus Pond’s vent water was almost 1.7.
Fig 2.
Representative picture of the fluorescence microscopic fields based on which microbial cell density was calculated in the vent-water sample of Lotus Pond: (A) sample stained with DAPI, (B) sample stained with FDA, (C) sample stained with PI. Scale bars in all the three micrographs indicate 50 μm length.
Microbiome composition reconstructed from the assembled metagenome
Quality filtration of the native metagenomic sequence dataset having 31353866 read-pairs resulted in the retention of 31342225 read-pairs (S1 Table in S1 File). Assembly of the quality-filtered dataset yielded 286918 contigs having a minimum and an average length of 200 and 563 nucleotides respectively (96% of all quality-filtered reads participated in the assembly, at an average coverage of ~85X). While the largest contig obtained was 197270-nucleotide long, the consensus sequence of the assembled metagenome was ~162 million nucleotides, with a total of 379412, complete or partial, ORFs or genes identified therein. Out of the 379412 ORFs identified, 220687 could be annotated as CDSs, of which again 220635 could be ascribed to potential taxonomic sources (S2 Table in S1 File). Approximately 97.3%, 1%, 1.5% and 0.2% of the taxonomically classifiable CDSs, i.e. 214652, 2309, 3276 and 398 CDSs, belonged to Bacteria, Archaea, Eukarya and Viruses respectively (S2 and S3 Tables in S1 File).
The 214652 bacterial CDSs identified were found to be distributed over 23 phyla, with most CDSs being affiliated to Proteobacteria / Pseudomonadota (90793), followed by Aquificae / Aquificota (54151), Firmicutes / Bacillota (18670), Deinococcus-Thermus / Deinococcota (10591), Cyanobacteria (5827), Chloroflexi / Chloroflexota (4254), Bacteroidetes / Bacteroidota (2753), Thermotogae / Thermotogota (2671), Thermodesulfobacteria / Thermodesulfobacteriota (2547), Actinobacteria / Actinomycetota (2357). Unclassified Bacteria accounted for 18386 CDSs, while 13 other (minor) phyla, namely Planctomycetes / Planctomycetota (355), Synergistetes / Synergistota (200), Verrucomicrobia / Verrucomicrobiota (160), Acidobacteria / Acidobacteriota (134), Fusobacteria / Fusobacteriota (133), Nitrospirae / Nitrospirota (126), Chlorobi / Chlorobiota (93), Spirochaetes / Spirochaetota (77), Thermomicrobia / Thermomicrobiota (63), Deferribacteres / Deferribacterota (43), Tenericutes / Mycoplasmatota (25), Gemmatimonadetes / Gemmatimonadota (21) and Chlamydiae / Chlamydiota (20) were represented sparsely in the CDS catalog (S2 and S3 Tables in S1 File; Fig 3A).
Fig 3.
Taxonomic distribution of (A) the CDSs identified in the assembled metagenome of Lotus Pond, and (B) the metagenomic reads which corresponded to 16S rRNA genes (actual CDS count or metagenomic read count recorded for each taxon is given in parenthesis). In (A) “minor phyla” included the following: Acidobacteria, Chlamydiae, Chlorobi, Deferribacteres, Fusobacteria, Gemmatimonadetes, Nitrospirae, Planctomycetes, Spirochaetes, Synergistetes, Tenericutes, Thermomicrobia and Verrucomicrobia. In (B) “other phyla”, also marginal in representation, included the following: Cyanobacteria, Thermotogae, Bacteroidetes, Chloroflexi, Actinobacteria, Armatimonadetes and Synergistetes.
Among the different classes of Proteobacteria, Alphaproteobacteria, followed by Gammaproteobacteria, Betaproteobacteria, Deltaproteobacteria and Epsilonproteobacteria accounted for most CDSs in the assembled metagenome (41306, 31865, 14394, 1145 and 466 CDSs respectively). The other proteobacterial classes Hydrogenophilia, Acidithiobacillia and Oligoflexia were represented by 118, 106 and 101 CDSs respectively, while unclassified Proteobacteria accounted for 1046 CDSs (Fig 3A). Furthermore, within the proteobacterial CDS catalog, the Xanthomonadales-Lysobacterales complex, Vibrionales, Oceanospirillales (all three belonged to the Gammaproteobacteria), and Rhodocyclales (belonging to Betaproteobacteria) were the maximally represented orders, with 10039, 6025, 3,940 and 3582 CDSs affiliated to them respectively (S2 and S3 Tables in S1 File).
The small archaeal genetic diversity was contributed by Crenarchaeota / Thermoproteota, followed by Euryarchaeota (1479 and 647 CDSs respectively); unclassified Archaea accounted for 183 CDSs (S2 and S3 Tables in S1 File). While the Crenarchaeota related CDSs were not classifiable further, the Euryarchaeota-affiliated CDSs were distributed over the classes Halobacteria (177), Thermococci (169), Archaeoglobi (83), Methanococci (60), Methanomicrobia (30), Methanobacteria (19) and Thermoplasmata (9); unclassified Euryarchaeota accounted for 100 CDSs (S2 and S3 Tables in S1 File). Most of the putatively eukaryotic CDSs detected exhibited highest sequence similarities with homologs from Opisthokonta, affiliated to either Fungi or Metazoa (S2 Table in S1 File). The few viral CDSs detected were mostly contributed by Podoviridae, Myoviridae, Siphoviridae and Caudovirales (S2 Table in S1 File).
Genus-level constitution of the microbiome
Taxonomic classification of CDSs predicted within the assembled metagenome was not extended up to the genus level. Instead, generic components of the microbiome were delineated via taxonomic classification of (i) metagenomic reads corresponding to 16S rRNA genes, and (ii) OTUs clustered from PCR-amplified 16S rRNA gene sequences. Additionally, the taxonomic diversity data obtained from the current analysis of PCR-amplified 16S rRNA gene sequences were utilized to evaluate the consistency of the Lotus Pond vent-water microbiome by comparing with equivalent data available from previous explorations of the habitat.
According to the RDP-based classification of the metagenomic reads, out of the 62684450 high quality reads analyzed, 81619 represented parts of prokaryotic 16S rRNA genes: 81223 bacterial and 396 archaeal (S4 Table in S1 File).
Out of the total 81223 metagenomic reads corresponding to bacterial 16S rRNA genes 80348 were classifiable under 12 phyla (the remaining 875 reads belonged to unclassified Bacteria), of which Aquificae accounted for an overwhelming majority (65853) of reads (S4 Table in S1 File; Fig 3B). Substantive number of reads were also ascribable to Proteobacteria (11639), Deinococcus-Thermus (1517) and Firmicutes (567), while Thermodesulfobacteria (308), Cyanobacteria (131), Thermotogae (196), Bacteroidetes (56), Chloroflexi (56), Actinobacteria (21), Armatimonadetes / Armatimonadota (2), and Synergistetes (2) contributed much fewer reads. The proteobacterial reads, again, were mostly from Gammaproteobacteria (9553), followed by Betaproteobacteria (1345) and Alphaproteobacteria (595). Furthermore, 70729 out of the total 81223 bacterial 16S rRNA gene sequence related reads got ascribed to 92 genera (S4 Table in S1 File) led by Hydrogenobacter (44547), Sulfurihydrogenibium (16371), Halomonas (3075), Vibrio (2549), Thermus (1454), Tepidimonas (965) and Paracoccus (196). Whereas the two Aquificales genera Hydrogenobacter and Sulfurihydrogenibium, affiliated to the families Aquificaceae and Hydrogenothermaceae respectively, accounted for 75.8% of all 16S rRNA gene related reads present in the quality-filtered metagenomic sequence dataset; the other Aquificae members detected, namely Thermocrinis, Venenivibrio, Persephonella and Hydrogenothermus, accounted for only 180, 48, 6 and 2 such reads respectively (S4 Table in S1 File).
As for the 396 reads corresponding to archaeal 16S rRNA genes, all were attributable to the phylum Crenarchaeota and only 9 were classifiable up to the level of the genus, detecting Aeropyrum (6), Stetteria (2) and Pyrobaculum (1) in the process (S4 Table in S1 File).
A corroborative experiment involving PCR amplification, seuencing and analysis of the V3 regions of all Bacteria-specific 16S rRNA genes present in the metagenome extracted from the hot water discharged by Lotus Pond revealed 602 bacterial OTUs that in turn were classifiable under 16 different phyla: Proteobacteria (352), Firmicutes (63), Bacteroidetes (52), Actinobacteria (40), Cyanobacteria (20), Deinococcus-Thermus (16), Thermotogae (8), Aquificae (3), Acidobacteria (2), Armatimonadetes (2), Chloroflexi (2), Spirochaetes / Spirochaetota (2), Thermodesulfobacteria (2), Dictyoglomi / Dictyoglomota (1), Rhodothermota (1) and Synergistetes (1); unclassified Bacteria accounted for 35 OTUs (S5 Table in S1 File). Of the total 602 bacterial OTUs identified, 249 were classifiable upto the level of genus. Overall, 121 genera were identified in that way (S5 Table in S1 File), with maximum number of OTUs getting affiliated to Vibrio (21), followed by Halomonas (16) and Thermus (13).
Concurrent to the above experiment, PCR amplification, seuencing and analysis of the V4-V5 regions of all Archaea-specific 16S rRNA genes present in the Lotus Pond vent-water metagenome were carried out to reveal the in situ archaeal diversity (S5 Table in S1 File). Only 30 OTUs were detected in this way, of which, 10 were ascribable to Crenarchaeota, 6 each belonged to Euryarchaeota and Nitrososphaerota, and 8 were not classifiable below the domain level (S5 Table in S1 File). Furthermore, none of the Crenarchaeota OTUs were classifiable at the genus level, whereas out of the 6 OTUs affiliated to Euryarchaeota, 1 and 3 were classifiable as species of Methanomassiliicoccus and Methanospirillum respectively; all the 6 Nitrososphaerota OTUs were classified as members of Nitrososphaera (S5 Table in S1 File).
Predominant generic entities identified via population genome binning
When all the >2500 nucleotide contigs present in the assembled metagenome were binned using Metabat2, MaxBin2, and CONCOCT via separate in silico experiments, 19, 14 and 30 MAGs were obtained respectively. Refinement and optimization of these 63 population genome bins using DASTool yielded 14 bacterial (Table 1 and S6 Table in S1 File; Fig 4) and one archaeal (Table 1 and S6 Table in S1 File) MAGs for subsequent consideration and analysis. Each of these draft genomes had a contamination level of <5%, except the one designated as LotusPond_MAG_Paracoccus_sp._1. Likewise, most of the short-listed MAGs (9 out of 15) had >90% completeness; two, identified as Paracoccus_sp._2 and Unclassified_Thermodesulfobacteriaceae, had <50% completeness; and four, identified as Paracoccus_sp._1, Fervidobacterium_sp., Thermus_sp. and Unclassified_Bacteria, possessed approximately 51.2%, 70.2%, 70.3% and 81.9% completeness respectively. In terms of consensus sequence length, the population genomes classified as Vibrio metschnikovii and Halomonas sp. were the largest (approximately 3.5 and 3.4 mb respectively) whereas the one classified as a novel member of the family Thermodesulfobacteriaceae was the smallest (approximately 0.4 mb only). Furthermore, when the 31342225 high quality read-pairs used for metagenome assembly were searched against the 15 population genomes, a total of 68.08% read-pairs mapped concordantly on to the MAGs as a whole. Individually, the 15 MAGs accounted for 0.06% to 55.9% of the metagenomic read-pairs analyzed.
Table 1. Key features of the population genome bins (MAGs) constructed from the assembled metagenome of Lotus Pond’s vent-water.
Name of the population genome bin obtained (genome accession number) |
% of quality-filtered metagenomic read-pairs matching sequences of the MAG concordantly | No. of contigs | Genome size (nucleotides) | G+C content (%) | Completeness (%) |
Contamination (%) |
Number of ORFs (genes) / CDSs detected in the MAG |
---|---|---|---|---|---|---|---|
LotusPond_MAG_Unclassified_Aquificaceae (JARJNB000000000) | 55.92 | 131 | 1489711 | 43.32 | 95.12 | 0.61 | 1754 / 1707 |
LotusPond_MAG_Sulfurihydrogenibium_azorense (JARJNA000000000) | 3.48 | 47 | 1516769 | 32.77 | 98.98 | 0.00 | 1680 / 1639 |
LotusPond_MAG_Halomonas_sp. (JARJMQ000000000) | 2.65 | 97 | 3349510 | 60.21 | 99.46 | 0.94 | 3245 / 3176 |
LotusPond_MAG_Unclassified_Desulfurococcales (JARJMZ000000000) | 1.60 | 37 | 1553981 | 49.38 | 99.63 | 0.00 | 1621 / 1559 |
LotusPond_MAG_Vibrio_metschnikovii (JARJMP000000000) | 1.54 | 249 | 3487509 | 44.28 | 94.07 | 1.85 | 3222 / 3172 |
LotusPond_MAG_Thermus_sp. (JARJMW000000000) | 1.01 | 372 | 1887946 | 62.80 | 70.34 | 4.24 | 2206 / 2157 |
LotusPond_MAG_Tepidimonas_sp. (JARJMT000000000) | 0.46 | 200 | 2150705 | 66.30 | 94.16 | 1.29 | 2200 / 2155 |
LotusPond_MAG_Thermosynechococcus_sp. (JARJMS000000000) | 0.35 | 104 | 2328058 | 52.52 | 97.05 | 0.12 | 2367 / 2319 |
LotusPond_MAG_Tepidimonas_taiwanensis (JARJMR000000000) | 0.28 | 245 | 2568138 | 69.51 | 90.53 | 3.56 | 2567 / 2525 |
LotusPond_MAG_Fervidobacterium_sp. (JARJMY000000000) | 0.24 | 293 | 1525481 | 39.93 | 70.18 | 0.88 | 1639 / 1611 |
LotusPond_MAG_Paracoccus_sp._1 (JARJMU000000000) | 0.17 | 528 | 2138329 | 62.94 | 51.22 | 15.08 | 2383 / 2345 |
LotusPond_MAG_Unclassified_Bacteria (JARJMV000000000) | 0.11 | 335 | 2025372 | 61.84 | 81.86 | 0.31 | 2082 / 2039 |
LotusPond_MAG_Unclassified_Chromatiales (JARJMX000000000) | 0.10 | 214 | 1644354 | 62.57 | 91.04 | 2.01 | 1796 / 1758 |
LotusPond_MAG_Unclassified_Thermodesulfobacteriaceae (JARJND000000000) | 0.07 | 114 | 439694 | 36.23 | 30.15 | 0.31 | 477 / 467 |
LotusPond_MAG_Paracoccus_sp._2 (JARJNC000000000) | 0.06 | 284 | 1087115 | 64.14 | 34.31 | 1.10 | 1214 / 1201 |
Fig 4. Phylogenetic tree (number of bootstrap tests carried out = 10000) based on 92 universal bacterial core genes that shows the evolutionary relationships among the 14 bacterial MAGs reconstructed from the Lotus Pond vent-water metagenome and their closest relatives identified based on dDDH and/or rRNA gene sequence relationships.
Scale bar denotes a distance equivalent to 10% nucleotide substitution. In the phylogeny reconstructed, nucleotide substitutions were interpreted using the generalized time-reversible model, which considers the proportion of invariable sites and/or the rate of variation across the sites.
The genome bin LotusPond_MAG_Unclassified_Aquificaceae accounted for an overwhelming proportion of the metagenome (55.9% of all read-pairs assembled mapped back to this MAG), while another Aquificales bin identified as Sulfurihydrogenibium azorense, a member of Hydrogenothermaceae, accounted for 3.5% of all read-pairs. Four more bins individually represented ≥1% of the metagenome; these were identifed as (i) an unknown species of the genus Halomonas, (ii) an unclassified member of the order Desulfurococcales within the archaeal class Thermoprotei, (iii) the gammaproteobacteium Vibrio metschnikovii, and (iv) an unknown species of the genus Thermus. These MAGs accounted for 2.7%, 1.6%, 1.5% and 1% of all metagenomic read-pairs respectively. Seven other MAGs, ascribed to Tepidimonas taiwanensis, Tepidimonas sp., Fervidobacterium sp., Thermosynechococcus sp., Paracoccus sp., an unclassified member of the gammaproteobacterial order Chromatiales, and an unclassified taxon of Bacteria, individually accounted for >0.1% but <1% of the metagenome. The remaining two Lotus Pond MAGs individually represented <0.1% of the metagenome; these were identified as one unclassified member each from the family Thermodesulfobacteriaceae of the monotypic phylum Thermodesulfobacteria, and the genus Paracoccus. Details of the phylogenomic relationships, and the consequent basis of taxonomic identification, of the individual MAGs are given in Supplementary Results.
Environmentally advantageous functions encoded by the Lotus Pond MAGs
Apart from house-keeping genes, all the MAGs encompassed genes for heat stress management. Irrespective of their completeness level, MAGs affiliated to thermophilic taxa such as Aquificaceae, Sulfurihydrogenibium, Desulfurococcales, Thermodesulfobacteriaceae and Thermus had less than 10 such genes each. In contrast, almost all the MAGs affiliated to mesophilic taxa had ≥10 heat stress management genes each. Almost all the MAGs contained genes for managing oxidative and periplasmic stress; a half of them had genes for withstanding carbon starvation and nitrosative stress. Eight MAGs encoded genes conferring high frequency of lysogeny as a means of stress response. Every bacterial MAG, but not the archaeal one, encompassed genes for stringent response to environmental stimuli via (p)ppGpp metabolism. Most of the MAGs encoded versatile sporulation-associated proteins, while those affiliated to mesophilic or moderately thermophilic taxa encoded a universal stress protein. Four of the 15 MAGs possessed 12–34 genes for different toxin-antitoxin systems, eight had 1–9 such genes, whereas three (belonging to Aquificaceae, Desulfurococcales and Thermodesulfobacteriaceae) had none. All but one MAG contained genes for cation transport; most had genes for twin-arginine motif-based protein translocation and Ton and Tol transport systems, while some encoded ABC transporters, uni- sym- and antiporters, TRAP transporters, and various secretion systems. Half of the MAGs harbored genes for flagellar motility or chemotaxis; protection against metals, toxins, and antibiotics; and CRISPR systems.
Details of the adaptationally useful genes encompassed by the different MAGs are given in Supplementary Results.
Microbiome functions predicted from the assembled metagenome
Out of the 379412 genes predicted within the assembled Lotus Pond metagenome by Prodigal v2.6.3 212749 were ascribable to COGs that again were distributed across 25 functional categories (S7 Table in S1 File). Of these, Translation, ribosomal structure and biogenesis; and Energy production and conversion had the maximum number of CDSs ascribed to them (26153 and 22448 respectively). Next in attendance were the COG categories Amino acid transport and metabolism (19711 CDSs); Cell wall/membrane/envelope biogenesis (15685 CDSs); Coenzyme transport and metabolism (14510 CDSs); and Inorganic ion transport and metabolism (11949 CDSs). Notably, the lowest numbers of CDSs, i.e. 26, 23, 21 and 3, were allocated to Extracellular structures; Cytoskeleton; RNA processing and modification; and Chromatin structure and dynamics, respectively. The COG category termed as Nuclear structure had no gene assigned to it at all (S7 Table in S1 File).
For energy production and conversion, maximum homologs were detected for the genes encoding NADH-quinone oxidoreductase and cytochrome c oxidase. While oxidative phosphorylation was the most represented pathway under energy production and conversion, NADH:quinone oxidoreductase, the reductive tricarboxylic acid (rTCA) cycle or the reductive citrate cycle, the TCA or citrate cycle, and cytochrome c oxidase were the biochemical modules with most CDSs (S2 Table in S1 File).
For DNA metabolism, maximum number of homologs was detected for a transposase encoding gene. Mismatch repair, homologous recombination, replication, and nucleotide excision repair were the most represented pathways, while DNA polymerase III complex and fluoroquinolone-resistance gyrase-protecting protein Qnr, were the most represented biochemical modules (S2 Table in S1 File).
For translation and ribosomal structure and biogenesis, maximum homologs were detected for the elongation factor Tu. Aminoacyl-tRNA biosynthesis was the most represented pathway, while ribosome (bacterial followed by archaeal) was the maximally represented biochemical module (S2 Table in S1 File).
Regarding amino acid transport and metabolism, maximum homologs were there for glycine cleavage system protein H and cysteine desulfurase. Biosynthesis of secondary metabolites was the most represented pathway, while maximum CDSs were attributed to the biochemical module photorespiration.
As for cell wall/membrane/envelope biogenesis, maximum homologs were detected for the transaminase which isomerizes glutamine:fructose-6-phosphate, and the GTP-binding protein LepA, which acts as a potential fidelity factor of translation (for accurate and efficient protein synthesis) under certain stress conditions [63,64]. Under this category, amino sugar and nucleotide sugar metabolism, antibiotics biosynthesis, and peptidoglycan biosynthesis were the most represented pathways. At the module level, nucleotide sugar biosynthesis, dTDP-L-rhamnose biosynthesis, lipoprotein-releasing system, lipopolysaccharide biosynthesis (KDO2-lipid A), and multidrug resistance related efflux pumps AcrAB-TolC/SmeDEF, AcrAD-TolC, MexAB-OprM and AmeABC were maximally represented (S2 Table in S1 File).
For coenzyme transport and metabolism, maximum homologs were detected for uroporphyrin-III C-methyltransferase. Secondary metabolites biosynthesis, porphyrin metabolism, pantothenate and co-enzyme A (CoA) biosynthesis, and antibiotics biosynthesis were the most represented pathways, while heme biosynthesis was the most represented biochemical module (S2 Table in S1 File).
Regarding inorganic ion transport and metabolism, maximum homologs were detected for NADH-quinone oxidoreductase subunit L. Under this category, ABC transporters, oxidative phosphorylation, quorum sensing, and two-component system, were the most represented pathways, wheras at the module level, NADH:quinone oxidoreductase, peptides/nickel transport system, phosphate transport system, iron complex transport system, nickel transport system, NitT/TauT family transport system, nitrate/nitrite transport system, and iron(III) transport system were represented maximally (S2 Table in S1 File).
Chemolithotrophic sulfur oxidation (Sox) is the primary bioenergetic mechanism of the Lotus Pond vent-water ecosystem
Out of all the mechanisms of chemolithotrophy known, only sulfur oxidation (Sox) was found to have substantial numbers of coding sequences, partial or complete, in the assembled metagenome for most of its constituent biochemical reactions (enzymatic steps). Overall, 333 CDSs were there for the different components of the Sox multienzyme complex: 21 for the L-cysteine S-thiosulfotransferase protein SoxX (K17223), 53 for the SoxA subunit of L-cysteine S-thiosulfotransferase (K17222), 112 and 57 for the sulfur compounds chelating/binding protein subunits SoxY (K17226) and SoxZ (K17227) respectively, 79 for the S-sulfosulfanyl-L-cysteine sulfohydrolase SoxB (K17224), and 11 for the sulfane dehydrogenase protein’s molybdopterin-containing subunit SoxC (K17225) and diheme c-type cytochrome subunit SoxD (K22622), the CDSs of which occur universally as overlapping ORFs. In terms of taxonomic distribution, maximum number of Sox protein encoding sequences (~40% of the total 333) was ascribed to Aquificae, followed by Alphaproteobacteria (~25%), Betaproteobacteria (14%), and Deinococcus-Thermus (9%). Unclassified Bacteria, Gammaproteobacteria, and other Proteobacteria individually accounted for approximately 8, 4 and 1 percent of all the Sox CDSs annotated (S2 Table in S1 File).
As for the other forms of chemolithotrophy known in the microbial world, only a small number of CDSs could be detected for the key proteins of arsenite and hydrogen oxidation. Whereas five CDSs each were detected for the small and large subunits AoxA (K08355) and AoxB (K08356) of arsenite oxidase respectively, only a single CDS was identifiable for the small subunit of ferredoxin hydrogenase (K00534), even as no CDS could be detected for the hydrogenase large subunit (K00533). Notably, no CDS was also there for any of the other hydrogenase variants known in the literature.
No CDS was present in the Lotus Pond microbiome for the chemolithotrophic iron oxidation protein iron:rusticyanin reductase (K20150). Likewise, there was also no CDS for the enzymes concerned with the chemolithotrophic oxidation of ammonia to nitrite. However, despite the absence of genes encoding the different subunits of methane/ammonia monooxygenase (pmoABC—K10944, K10945 and K10946), or for that matter the nonexistence of any CDS for the other ammonia-oxidizing enzyme hydroxylamine dehydrogenase (K10535), the Lotus Pond metagenome did encompass several genes for the alpha (K00370) and beta (K00371) subunits of nitrite to nitrate oxidizing enzyme nitrate reductase / nitrite oxidoreductase (269 and 90 CDSs respectively).
So far as the population genome bins are concenrned, the one designated as LotusPond_MAG_Unclassified_Aquificaceae was found to have genes coding for the proteins SoxX, SoxA, SoxB, SoxY, and SoxZ, plus the accessory protein SoxW, which is a thioredoxin serving the purpose of reducing the different subunits of the Sox multienzyme complex to their functionally active states. The genome bin designated as LotusPond_MAG_Thermus_sp. encoded the proteins SoxX, SoxA, SoxB, SoxC, and SoxD, while the one entitled as LotusPond_MAG_Unclassified_Chromatiales contained genes for SoxX, SoxA, SoxB, and SoxC. LotusPond_MAG_Paracoccus_sp._1 contained gene encoding SoxB, SoxCD, and the additional sulfur oxidation proteins SoxF (a periplasmic flavoprotein sulfide dehydrogenase) and SoxG (a thiosulfate-induced periplasmic zinc metallohydrolase that can act as a potential thiol esterase), while the largely incomplete genome bin designated as LotusPond_MAG_Paracoccus_sp._2 contained genes for only SoxY, SoxZ, and the transcriptional regulator of the sox operon called SoxR.
The population genome bin LotusPond_MAG_Sulfurihydrogenibium_azorense encompassed genes for all the four subunits—alpha (K17993), delta (K17994), gamma (K17995) and beta (K17996)–of the signature protein of the genus called sulfhydrogenase (HydADGB), which is H2:polysulfide oxidoreductase catalyzing hydrogen oxidation via reduction of elemental sulfur or polysulfide to hydrogen sulfide.
Centrality of the rTCA cycle, and Aquificae, in Lotus Pond’s primary productivity
In the assembled metagenome of Lotus Pond there were substantial numbers of CDSs, whether complete or partial, for all the enzymatic steps necessary to execute (i) the rTCA (Arnon-Buchanan) as well as the modified rTCA cycle [65,66], (ii) the reductive pentose phosphate (Calvin-Benson-Bassham) cycle [67], (iii) the 3-hydroxypropionate bi-cycle [68], and (iv) the reductive acetyl Co-A (Wood-Ljungdahl) pathway [65,69]. For the other major pathways of carbon assimilation (autotrophic carbon fixation), namely (v) the dicarboxylate/4-hydroxybutyrate cycle [70] and (vi) the 3-hydroxypropionate/4-hydroxybutyrate cycle [65], genes concerned with a number of intermediate transformations were either missing from the metagenome or were represented by just one or two CDSs (Fig 5). For the other recently-reported and not so common pathways, the key biochemical steps are rendered by the specialized activities of certain typical enzyme variants for which genes are difficult to identify on the basis of homology analysis alone. These pathways are (vii) the reverse oxidative TCA (roTCA) cycle, where citrate is converted to acetyl-CoA and oxaloacetate by a reverse acting citrate synthase homolog [71], (vii) the transaldolase variant of the Calvin cycle mediated by Form III RubisCO [72], and (ix) the reductive glycine pathway, where CO2 is converted to formate by the action of a formate dehydrogenase, and 5,10-methylene-tetrahydrofolate is converted to glycine by a glycine cleavage/synthase system [73].
Fig 5. Simplified schematic view of the carbon assimilation pathways for which genes were identified in the assembled metagenome of Lotus Pond.
The relevant genes were identified based on the annotation of CDSs with reference to the published literature, and biochemical modules curated under the KEGG Pathway Maps database located at https://www.genome.jp/kegg/pathway.html: (A) reductive citrate cycle, (B) reductive pentose phosphate cycle, (C) dicarboxylate/4-hydroxybutyrate cycle, (D) 3-hydroxypropionate/4-hydroxybutyrate cycle, (E) 3-hydroxypropionate bi-cycle, and (F) reductive acetyl Co-A pathway. Names of the pathways for which one or more genes were undetectable in the metagenome have been written in pink fonts. The active inorganic carbon substrates assimilated by the different pathways are highlighted in yellow. The biochemical conversions (steps) of the different pathways for which CDSs could be identified in the assembled metagenome are indicated by the roman numerals I through L. While the roman numerals shaded green indicate the key enzymatic steps of the pathway in question, the numbers given in parentheses denote the numbers of CDSs identified for the individual enzymatic steps. The enzyme-coding genes actually detected in the metagenome for the biochemical steps labeled as I through L are enumerated below by their KEGG orthology identifiers. I—K00024; II—K01676, K01677, K01678, K01679; III—K18556, K18557, K18558, K18559, K18560; IV—K01902, K01903; V—K00174, K00175, K00176, K00177; VI—K00031; VII—K01681, K01682; VIIIA—K15232, K15233; VIIB—K15234; IX—K15230, K15231; X—K00169, K00170, K00171, K00172, K03737; XI—K01006, K01007; XII—K01595; XIII—K01958, K01959, K01960; XIV—K00855; XV—K01601, K01602; XVI—K00927; XVII—K00134, K00150; XVIII—K01623, K01624; XIX—K01086, K02446, K03841, K11532; XX—K00615; XXI—K01086, K11532; XXII—K01807, K01808; XXIII—K15038; XXIV—K14534; XXV—K15016; XXVI—K00626; XXVII—K00169, K00170, K00171, K00172; XXVIII—K01007; XXIX—K01677, K01678; XXX—K00239, K00240, K00241, K18860; XXXI—K01902, K01903; XXXII—K15037; XXXIII—K15020; XXXIV—K05606, K01848, K01849; XXXV—K02160, K01961, K01962, K01963; XXXVI—K14468; XXXVII—K14469; XXXVIII—K08691; XXXIX—K14449; XL—K14470; XLI—K09709; XLII—K15052; XLIII—K05606, K01847, K01848, K01849; XLIV—K14471, K14472; XLV—K00239, K00240, K00241, K01679; XLVI—K05299, K22015; XLVII—K00194, K00195, K00196, K00197, K00198, K14138; XLVIII—K01938; XLIX—K01491; L—K00297. In the panel A showing the different steps of the rTCA cycle, dotted lines indicate the reactions of the modified rTCA cycle which are mediated by the enzymes citryl-CoA synthetase (VIIIA) and citryl-CoA lyase (VIIIB).
In case of rTCA cycle, CDSs detected in Lotus Pond’s assembled metagenome included 5, 218, and 57 homologs for (i) ATP-citrate lyase AclAB (K15230 and K15231), (ii) NADH-dependent fumarate reductase FrdABCDE (K18556, K18557, K18558, K18559 and K18560), and (iii) ferredoxin-dependent 2-oxoglutarate synthase/oxidoreductase KorABCD (K00174, K00175, K00176 and K00177), which catalyze the key reactions (i) citrate to acetyl-CoA and oxaloacetate, (ii) fumarate to succinate, and (iii) succinyl CoA to 2-oxoglutarate, respectively. Most of these CDSs exhibited closest sequence similarities with homologs from members of Aquificae followed by Thermotogae. CDSs corresponding to K18556, K18557, K18558, K18559 and K18560 were also detectable within LotusPond_MAG_Unclassified_Aquificaceae, while CDSs corresponding to K15230 and K15231, and K18556 and K18557 were identified within LotusPond_MAG_Sulfurihydrogenibium_azorense. Furthermore, in the assembled metagenome of Lotus Pond, 411, 34, 57 and 274 CDSs were detected for the conversion of (i) acetyl-CoA to pyruvate (with CO2 addition), (ii) phosphoenolpyruvate to oxaloacetate (with HCO3¯ addition), (iii) succinyl-CoA to 2-oxoglutarate, and (iv) 2-oxoglutarate to isocitrate (the last two conversions involve CO2 addition) respectively (Fig 5A).
Apart from the key rTCA cycle genes mentioned above, 108 and 112 CDSs were detectable in Lotus Pond’s assembled metagenome for the two subunits of citryl-CoA synthetase (K15232 and K15233 respectively), while two CDSs were also there for citryl-CoA lyase (K15234). Notably, all the CDSs corresponding to these two enzymes catalyzing the two key reactions of the modified rTCA cycle, namely citrate to citryl-CoA, and citryl-CoA to oxaloacetate and acetyl-CoA, exhibited closest sequence similarities with homologs from Aquificae. Corroboratively, CDSs corresponding to K15232, K15233 and K15234 were identified within LotusPond_MAG_Unclassified_Aquificaceae, while a single CDS corresponding to K15234 was detectable in LotusPond_MAG_Sulfurihydrogenibium_azorense.
As regards the reductive pentose phosphate cycle, CDSs detected in Lotus Pond’s assembled metagenome included 32 and 40 homologs for (i) phosphoribulokinase (K00855) and (ii) ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO, K01601 and K01602), which catalyze the key reactions (i) ribulose-5-phosphate to ribulose-1,5-bisphosphate, and (ii) ribulose-1,5-bisphosphate to 3-phosphoglycerate (the sole CO2 fixing step of the pathway), respectively (Fig 5B). While most of these CDSs exhibited closest sequence similarities with homologs from members of Deinococcus-Thermus, CDSs corresponding to K00855, as well as K01601 and K01602, were there in LotusPond_MAG_Thermus_sp.; notably, homologs of K01601 and K01602 encoding the large (RbcL) and small (RbcS) subunits of RubisCO were also detected in LotusPond_MAG_Thermosynechococcus_sp. and LotusPond_MAG_Unclassified_Chromatiales.
So far as the dicarboxylate/4-hydroxybutyrate cycle is concerned, only one Lotus Pond CDS was found for 4-hydroxybutyryl-CoA dehydratase (K14534), which governs the key step of converting 4-hydroxybutyryl-CoA to crotonyl-CoA. Moreover, though 400 and 34 CDSs were there in the Lotus Pond metagenome for the conversion of (i) acetyl-CoA to pyruvate, and (ii) phosphoenolpyruvate to oxaloacetate, via addition of CO2 and HCO3¯ respectively, the numbers of CDSs detected for the reactions of the hydroxybutyrate half of the cycle were extremely low (Fig 5C). In this context it is noteworthy that the carbon fixation reactions of the dicarboxylate/4-hydroxybutyrate cycle are same as the first two carbon fixation steps of the rTCA cycle: phosphoenolpyruvate to oxaloacetate conversion involves the same enzyme phosphoenolpyruvate carboxylase (KEGG orthology: K01595) in either pathway, while there is only one additional enzyme [pyruvate-ferredoxin/flavodoxin oxidoreductase Por (K03737)] for acetyl-CoA to pyruvate conversion by the rTCA cycle, and that precisely accounted for the extra 11 CDSs ascribed to the corresponding step of the rTCA cycle (Fig 5). Evidently, the high CDS counts for the carbon fixation reactions of dicarboxylate/4-hydroxybutyrate cycle were only a reflection of the overall preponderance of the rTCA cycle, and dicarboxylate/4-hydroxybutyrate cycle was plausibly not prevalent in the microbiome.
The 3-hydroxypropionate/4-hydroxybutyrate cycle is characterized by its signature protein acetyl-CoA/propionyl-CoA carboxylase (biotin-dependent). This protein by virtue of its three subunits structured in an α4β4γ4 configuration and having the functional domains for biotin carboxylase (K01964), biotin carboxyl carrier protein (K15037), and carboxytransferase (K15036) respectively, catalyzes (i) the conversion of acetyl-CoA to malonyl-CoA, as well as (ii) the conversion of propanoyl-CoA to (S)-methylmalonyl-CoA, using HCO3¯ as the active inorganic carbon source in both the reactions. The Lotus Pond metagenome had only one Crenarchaeota-affiliated CDS matching the gene K15037 but no CDS for the genes K01964 and K15036 (Fig 5D).
The key steps of the 3-hydroxypropionate bi-cycle involve carbon fixation reactions similar to those encountered in the 3-hydroxypropionate/4-hydroxybutyrate cycle. However, the two HCO3¯ assimilating reactions in the bi-cycle are catalyzed by enzymes that are distinct from the acetyl-CoA/propionyl-CoA carboxylase of 3-hydroxypropionate/4-hydroxybutyrate cycle. For instance, conversion of acetyl-CoA to malonyl-CoA in the bi-cycle is rendered by a separate acetyl-CoA carboxylase complex AccABCD, which is made up of the biotin carboxylase subunit AccC (K01961), the biotin carboxyl carrier AccB (K02160), and the carboxyl transferase subunits alpha (AccA; K01962) and beta (AccD; K01963). Propanoyl-CoA, in the bi-cycle, is converted to (S)-methylmalonyl-CoA by an independent propionyl-CoA carboxylase (K15052). The Lotus Pond metagenome encompassed 479 CDSs for the AccABCD complex, whereas only eight CDSs could be identified for propionyl-CoA carboxylase (Fig 5E). Most of the Lotus Pond CDSs corresponding to AccABCD and propionyl-CoA carboxylase exhibited closest sequence similarities with homologs from Aquificae and Chloroflexi respectively. The Lotus Pond MAGs affiliated to unclassified Aquificaceae and Sulfurihydrogenibium azorense encompassed CDSs for all the subunits of AccABCD.
In the key step of the Wood–Ljungdahl pathway an O2-sensitive bifunctional multienzyme complex called carbon monoxide dehydrogenase/acetyl-CoA synthase (CODH/ACS) first reduces a CO2 molecule to a CO (carbonyl) moiety, and then combines the CO with a methyl (CH3) group (furnished by the other branch of the pathway) and CoA to generate acetyl-CoA. The CODH/ACS complex is made up of five subunits and four of them are homologous across Archaea and Bacteria. In Archaea they are referred to as the α, β, γ and δ subunits, or the CdhA (K00192), CdhC (K00193), CdhE (K00197) and CdhD (K00194) proteins, whereas in Bacteria the same four homologs are known as the β, α, γ and δ subunits, or the AcsA (K00198), AcsB (K14138), AcsC (K00197) and AcsD (K00194) proteins, respectively. In addition, an ε subunit called CdhB (K00195) is exclusively present in the Archaea, while the AcsE (K15023) protein is exclusive to Bacteria. Overall 13 CDSs for the different components of the Cdh and/or Acs complex were identified in Lotus Pond’s assembled metagenome (Fig 5F), and most of them showed closest sequence similarities with homologs from Thermodesulfobacteria. Concurrently, LotusPond_MAG_Unclassified_Thermodesulfobacteriaceae encompassed one CDS each for AcsA, AcsB and AcsD.
Copious invasion, virulence, and antibiosis related genes in the Lotus Pond metagenome
While there were several complete or partial CDSs (a total of 76) in the Lotus Pond vent-water’s assembled metagenome for proteins concerned with biofilm formation and cellular adhesion, quite a few CDSs attributed to hemagglutinin and hemolysin biosynthesis (86), and stationary phase metabolisms including the formation of persister cells (43) were also there (S2 Table in S1 File). Furthermore, the metagenome encompassed 1084 complete or partial CDSs encoding the different components of Type-I, Type-II, Type-III, Type-IV, Type-VI, Type-VIII and Type-IX secretion systems (SSs). Of these, most were parts of the Type-II, and Type-III and Type-IV SSs. In terms of taxonomic distribution (S2 Table in S1 File), most of the SS related CDSs identified (57% of the total 1084) were attributed to Aquificae, followed by Betaproteobacteria (~10%). Gammaproteobacteria and Firmicutes each accounted for 8%, while Alphaproteobacteria accounted for ~5% of all these CDSs. Approximately 1–3% of all SS related CDSs were ascribable to Deinococcus-Thermus, Unclassified Proteobacteria, Unclassified Bacteria, Thermodesulfobacteria, Cyanobacteria, Actinobacteria, and Archaea. Thermotogae accounted for ~0.6% of the SS related CDSs identified in the assembled metagenome. The Lotus Pond MAGs also encompassed quite a few CDSs for biofilm formation, cellular adhesion, invasion and virulence, with the one identified as Vibrio metschnikovii having the maximum number and diversity of CDSs for these functions.
Genomic loci (gene clusters) concerned with the biosynthesis of diverse bioactive natural compounds were identified using the antiSMASH pipeline. By searching for specific sequence signatures in the putative biosynthetic gene clusters (BGCs) detected the following classes of secondary metabolites were predicted to be synthesized within the microbiome: N-acyl amino acids, aryl polyene compounds, protease inhibitor-containing beta-lactones, ectoines, homoserine lactones, and terpenes. BGCs corresponding to non-ribosomal peptide synthetases (NRPSs) and NRPS-like fragments, Type I and other polyketide synthases, ribosomally synthesised and post-translationally modified peptides (RiPPs), RiPP recognition elements, and a host of unclassified secondary metabolite-related proteins were also detectable in Lotus Pond’s assembled metagenome. Overall, approximately 3000 complete or partial CDSs (S2 and S8 Tables in S1 File) could be identified as parts of BGCs and/or involved directly or indirectly in the biosynthesis of antibiotics such as the acarbose and validamycin, carbapenems, monobactams, neomycin/kanamycin/gentamicin, novobiocin, penicillins and cephalosporins, phenazine, prodigiosin, staurosporine, streptomycin, and vancomycin. So far as the key biosynthetic genes are concerned, several CDSs for non-ribosomal peptide synthetase and prephenate dehydrogenase, which are central to the vancomycin biosynthesis pathway, were found in the Lotus Pond metagenome (S2 Table in S1 File). CDSs for myo-inositol-1-phosphate synthase, dTDP-4-dehydrorhamnose 3,5-epimerase, and dTDP-4-dehydrorhamnose reductase, which are all crucial for the biosynthesis of streptomycin were also detected alongside CDSs for the phenazine biosynthesis protein PhzF (S2 Table in S1 File). For the biosynthesis of carbapenems, several homologs of the carA, carB and carC genes that encode the carbamoyl phosphate synthetase holoenzyme involved in the initial steps of carbapenem biosynthesis (formation of the carbapenem backbone) were detected alongside homologs of carD that is involved in the modification of the carbapenem backbone (S2 Table in S1 File). In terms of taxonomic distribution (S8 Table in S1 File), maximum number of these CDSs (~29% of the total) was attributed to Aquificae, followed by Alphaproteobacteria (~21%). The entire domain Archaea accounted for only ~2% of all antibiotic biosynthesis related genes identified. Apart from the BGCs, several genomic loci concerned with the resistance of diverse antibiotics, disinfecting agents, and antiseptics were identified by annotating the assembled metagenome with reference to the CARD database. The antimicrobial agents corresponding to which putative resistance related gene clusters were detected included beta-Lactam compounds, chloramphenicol, erythromycin, fluoroquinolones, metronidazole, teicoplanin, tetracycline, trimethoprim, and vancomycin. In the eggNOG-based annotation (S2 and S8 Tables in S1 File), overall ~5500 such complete or partial CDSs were identified in the assembled metagenome of Lotus Pond’s vent-water that were directly or indirectly involved in the resistance of aminoglycosides, bacitracin, beta-lactam compounds, cationic antimicrobial peptides, fluoroquinolones, imipenem, nisin, polymyxin antibiotics, tetracycline, and vancomycin. Furthermore, a wide variety of multi-drug resistance related efflux pumps were detected in the assembled metagenome via eggNOG annotation. In terms of taxonomic distribution (S8 Table in S1 File), considerable number of these antibiotic resistance related CDSs (30% of the total) was attributed to Aquificae, followed by Gammaproteobacteria (~28%). The entire domain Archaea accounted for only one antibiotic resistance related CDS, and the same was meant to protect against aminoglycosides.
Discussion
Comparing the microbial diversities revealed by shotgun metagenomics and 16S amplicon sequencing
The extents of Lotus Pond’s microbial diversity revealed by the different approaches of molecular genetics had their own differences, as well as concurrence. For instance, two archaeal phyla, Crenarchaeota and Euryarchaeota, were identified by taxonomic annotation of the CDSs predicted within the assembled metagenome (S3 Table in S1 File), but searching of metagenomic reads for 16S rRNA-encoding sequences detected only Crenarchaeota (S4 Table in S1 File), whereas PCR amplified 16S rRNA gene sequence based OTU analysis identified Crenarchaeota, Euryarchaeota and Nitrososphaerota (S5 Table in S1 File). Concurrently, the archaeal genera Aeropyrum, Pyrobaculum and Stetteria, identified by direct annotation of metagenomic reads, went undetected in 16S amplicon-based analysis, whereas Methanomassiliicoccus, Methanospirillum and Nitrososphaera, detected via 16S amplicon sequencing, were undetectable in the direct annotation of metagenomic reads (compare S4 and S5 Tables in S1 File).
For Bacteria, 23 phyla were identified via taxonomic classification of CDSs: of these, 10 (Actinobacteria, Aquificae, Bacteroidetes, Chloroflexi, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, Thermodesulfobacteria, and Thermotogae) were major phyla embracing >2000 CDSs each, and 13 (Acidobacteria, Chlamydiae, Chlorobi, Deferribacteres, Fusobacteria, Gemmatimonadetes, Nitrospirae, Planctomycetes, Spirochaetes, Synergistetes, Tenericutes, Thermomicrobia, and Verrucomicrobia) were minor phyla with <500 CDSs each (S3 Table in S1 File; Fig 3A). Annotation of metagenomic reads corresponding to 16S rRNA genes detected 12 phyla in all. These included all the 10 major phyla, and only Synergistetes out of the 13 minor phyla, detected via taxonomic classification of CDSs. Additionally, only one such new phylum, namely Armatimonadetes, was detected via direct metagenomic reads (S4 Table in S1 File; Fig 3B) classification that was not represented in the CDS catalog (S3 Table in S1 File; Fig 3A). 16S amplicon-based OTU analysis identified 16 bacterial phyla (S5 Table in S1 File), which included the 10 major ones identified from CDS annotation, three of the 13 minor phyla (Acidobacteria, Spirochaetes and Synergistetes) detected via CDS annotation, plus Armatimonadetes, and two new phyla (Dictyoglomi and Rhodothermota) that were not represented among the CDSs or the metagenomic reads encoding 16S rRNAs.
Annotation of metagenomic reads corresponding to bacterial 16S rRNA genes revealed 92 bacterial genera (S4 Table in S1 File), while OTU analysis of PCR amplified 16S rRNA gene sequences identified 121 (S5 Table in S1 File). Between these two sets of generic names obtained using two different approaches, 50 genera were common, of which seven (Fervidobacterium, Halomonas, Paracoccus, Sulfurihydrogenibium, Tepidimonas, Thermus and Vibrio) were represented in the MAGs obtained from the assembled metagenomic data, and two (Hydrogenobacter and Thiofaba) were represented by very closely related MAGs (namely, LotusPond_MAG_Unclassified_Aquificaceae and LotusPond_MAG_Unclassified_Chromatiales; detailed phylogenomic relationships are given in the Supplementary Results). Of all the genera represented among the MAGs (Tables 1 and S6 in S1 File; Fig 4), Thermosynechococcus alone was not detected in any 16S rRNA gene sequence-based analysis; however, LotusPond_MAG_Thermosynechococcus_sp. did encompass a near-complete 16S rRNA gene, 99.6% similar to the Thermosynechococcus elongatus BP-1 homolog. This indicated that the nonappearance of Thermosynechococcus in 16S rRNA gene sequence based analyses was a bioinformatic glitch.
Discrepancies are known to arise in the microbial diversities revealed by analyzing the sample sample via shotgun metagenome sequencing and PCR-amplified 16S rRNA gene sequencing. A number of studies have found 16S amplicon sequencing to underrepresent taxa at different hierarchical levels [74–77], whereas the opposite outcome has also been reported by quite a few studies [78,79]. The present data demonstrated that the annotation of metagenomic reads corresponding to 16S rRNA genes reliably delineated most part of the diversity under investigation, at both the phylum and genus levels. Although this approach did not fare well in revealing the scarcely represented taxa, the diversity revealed by this approach was fairly reproducible in the corresponding data obtained via taxonomic classification of CDSs annotated within the assembled metagenome, and the OTUs clustered from PCR-amplified 16S rRNA gene sequences. The latter two approaches, in turn, appeared to be more adept to unearthing the less-abundant taxa, at the phylum as well as genus level. With all the three approaches having their own strengths and limitations, a simultaneous use of all of them seems to be the best practice in obtaining a comprehensive picture of microbial diversity in a given environmental sample.
Lotus Pond’s vent-water microbiome remains stable over time
Over the last ten years, the Lotus Pond has been explored for its vent-water bacterial (but not archaeal) diversity, via 16S amplicon sequencing alone, on five previous time-points [10,17]. A comparison of the present 16S amplicon-based OTU data with those from the past showed that all the 16 bacterial phyla detected in the current investigation (S5 Table in S1 File), except Rhodothermota and Synergistetes, had been identified in previous exploration(s). The five previous explorations too had collectively discovered 16 bacterial phyla, of which only Ignavibacteriae and Nitrospirae went undetected in the present 16S amplicon-based survey. Furthermore, 11 bacterial phyla, namely Actinobacteria, Aquificae, Bacteroidetes, Chloroflexi, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, Synergistetes, Thermodesulfobacteria and Thermotogae were consistently detected in the present annotation of CDSs, 16S rRNA-encoding metagenomic reads, and OTUs clustered from 16S amplicon sequences; all of these, except Synergistetes, were detectable in one or more previous studies of 16S amplicon sequencing and analysis.
At the genus level, 121 entities were detected in the current PCR-based investigation (S5 Table in S1 File), out of which 59 were identified in one or more previous explorations, which in turn had collectively discovered 156 genera. Seven out of the 97 genera that were identified previously, but went undetected in the present PCR-based analysis (these were Acidovorax, Armatimonadetes_gp5, Buttiauxella, Cyanobacteria GpIV, Klebsiella, Ornithinimicrobium and Sphingomonas), got identified in the present annotation of metagenomic reads corresponding to 16S rRNA genes (S4 Table in S1 File). In this way, a total of 66 bacterial genera (Table 2) were found to be present consistently in Lotus Pond’s vent-water over a period of approximately ten years. Notably, six out of these 66 genera, namely Fervidobacterium, Halomonas, Paracoccus, Sulfurihydrogenibium, Tepidimonas and Thermus, were unequivocally represented in the population genome bins (MAGs) obtained from the current metagenomic data, while two more, namely Hydrogenobacter and Thiofaba, were represented by very closely related entities, if not direct member strains of the genera. In this way, there were only two such bacterial genera—namely Thermosynechococcus and Vibrio—that figured among the MAGs obtained in the current study but were not represented in any of the previous OTU data.
Table 2. Bacterial genera that were detected in the present exploration of the Lotus Pond vent-water as well as in one or more previous studies of the same habitat using 16S rRNA gene sequence analysis.
Phylum to which the genera belonged | Names of the genera1,2 | Maximum temperature for Laboratory-growth3 (°C) |
Sampling date of the previous study in which the genus was also identified | |
---|---|---|---|---|
23 July, 2013 [17] |
20 October, 2014 [10] |
|||
Actinobacteria | Brevibacterium | 42 | + | + |
Corynebacterium | 42 | + | + | |
Mycobacterium | 43 | - | + | |
Nocardioides | 37 | - | + | |
Ornithinimicrobium | 50 | + | - | |
Rothia | 38 | + | - | |
Armatimonadetes | Gp5 | - | - | + |
Gp7 | - | - | + | |
Aquificae | Hydrogenobacter | 85 | + | + |
Sulfurihydrogenibium | 80 | + | + | |
Bacteroidetes | Chryseobacterium | 37 | + | + |
Cloacibacterium | 40 | + | + | |
Pedobacter | 35 | - | + | |
Porphyromonas | 37 | + | - | |
Prevotella | 45 | + | - | |
Chloroflexi | Chloroflexus | 59 | + | + |
Cyanobacteria | GpXIII | - | - | + |
GpIV | - | - | + | |
Deinococcus-Thermus | Thermus | 80 | + | + |
Truepera | 50 | + | + | |
Dictyoglomi | Dictyoglomus | 80 | - | + |
Firmicutes | Anoxybacillus | 70 | + | + |
Bacillus | 70 | + | + | |
Chryseomicrobium | 45 | + | - | |
Enterococcus | 45 | + | - | |
Exiguobacterium | 49 | + | + | |
Gemella | 37 | - | + | |
Lactococcus | 40 | + | - | |
Lysinibacillus | 45 | - | + | |
Planococcus | 42 | + | + | |
Sporolactobacillaceae_incertae_sedis | - | - | + | |
Staphylococcus | 40 | + | + | |
Streptococcus | 45 | + | + | |
Alphaproteobacteria | Azospirillum | 37 | + | + |
Bradyrhizobium | 37 | + | + | |
Brevundimonas | 42 | + | + | |
Paracoccus | 45 | + | + | |
Sphingobium | 37 | - | + | |
Sphingomonas | 45 | + | + | |
Sulfitobacter | 40 | + | + | |
Betaproteobacteria | Achromobacter | 42 | + | + |
Acidovorax | 37 | + | + | |
Burkholderia | 37 | + | + | |
Comamonas | 44 | - | + | |
Delftia | 40 | - | + | |
Herbaspirillum | 45 | + | - | |
Pelomonas | 37 | + | + | |
Ralstonia | 41 | + | + | |
Tepidimonas | 60 | + | + | |
Gammaproteobacteria | Aeromonas | 41 | + | + |
Acinetobacter | 37 | + | + | |
Buttiauxella | 42 | - | + | |
Enhydrobacter | 41 | + | + | |
Escherichia/Shigella | 37 | - | + | |
Halomonas | 50 | + | + | |
Klebsiella | 37 | - | + | |
Marinobacter | 50 | + | + | |
Pseudomonas | 42 | - | + | |
Psychrobacter | 38 | + | + | |
Rheinheimera | 35 | - | + | |
Serratia | 35 | + | - | |
Shewanella | 30 | + | - | |
Stenotrophomonas | 42 | + | + | |
Thiofaba | 51 | + | + | |
Thiovirga | 34 | + | - | |
Thermotogae | Fervidobacterium | 90 | + | + |
1 Genera detected in the present PCR-amplified 16S rRNA gene sequence analysis are indicated by bold fonts.
2 Names of the genera that were detected in the present annotation of metagenomic reads corresponding to 16S rRNA genes are underscored.
3 Full references concerning the maximum temperatures for laboratory-growth of all the above genera have been cited in relation to S4 and S5 Tables in S1 File, within the Supplementary Information file.
Lotus Pond is not reserved for thermophiles, albeit Aquificales predominate numerically and nutritionally
Literature survey for the phenotypic characteristics of the cultured members of the 92 bacterial genera identified via taxonomic classification of 16S rRNA encoding metagenomic reads showed that the constituent strains of only 29 genera had reports of growth in the laboratory above 45°C (see S4 Table in S1 File and references therein). A similar exercise for the 121 genera detected via OTU-level clustering of PCR amplified 16S rRNA gene sequences showed that member strains of only 30 genera had reports of in vitro growth above 45°C (S5 Table in S1 File). Among the 66 bacterial genera regularly present in Lotus Pond vent-water, only 15 were found to have strains reported for laboratory growth above 45°C: these were Fervidobacterium, Hydrogenobacter, Sulfurihydrogenibium, Thermus, Dictyoglomus, Anoxybacillus, Bacillus, Tepidimonas, Chloroflexus, Thiofaba, Truepera, Halomonas, Marinobacter, Ornithinimicrobium and Exiguobacterium (Table 2). Corroborative to this high diversity of bacterial mesophiles, 41.2% of all taxonomically classifiable CDSs identified in the assembled metagenome were ascribable to Proteobacteria, whereas only 24.5% were affiliated to Aquificae (S3 Table in S1 File; Fig 3A). Interestingly, at least 70 CDSs for different cold shock proteins (most of them were DNA-binding) could be detected in the Lotus Pond metagenome, out of which, again, 22 were ascribable to Aquificae, two each to Deinococcus-Thermus and Thermotogae, and one to Thermodesulfobacteria; the rest were affiliated to Proteobacteria (38), Firmicutes (5), Actinobacteria (1), and Synergistetes (1). These genomic features corroborate that the Lotus Pond vent-water is neither a “Thermophiles only” habitat nor a closed, biophysically firewalled, ecosystem. Instead, it hosts physiologically diverse microorganisms, including mesophiles and psychrophiles, which come into the system from the adjoining territories of the cold and arid Changthang Plateau. Furthermore, it highlights the potential fluxes of the thermophilic bacteria out of the high temperature ecosystem, into the cold desert biome, over time and space. All these suppositions are in agreement with the biogeodynamic model of the Puga hydrothermal system [10], as well as the “hot spring–river” ecological continuum that embodies the topography of the Lotus Pond site (Fig 1).
Although phylogenetic relatives of mesophilic bacteria overwhelmed the taxonomic, as well as the overall genetic, diversity of the Lotus Pond vent-water microbiome, thermophilic Aquificales dominated the habitat in terms of relative abundance. Only two MAGs affiliated to Aquificaceae/Hydrogenobacter (maximum temperature for laboratory-growth of Aquificaceae is 95°C [80]), and Sulfurihydrogenibium (maximum temperature for laboratory-growth: 80°C [81]), collectively made up 59.4% of the whole metagenome (Table 1 and S6 Table in S1 File). Hydrogenobacter and Sulfurihydrogenibium also accounted for ~80% of all 16S rRNA-encoding reads identified within the quality-filtered metagenomic sequence dataset (S4 Table in S1 File).
Among the other taxa identified as MAGs, Fervidobacterium (maximum temperature for laboratory-growth: 90°C [82]), Thermus (maximum temperature for laboratory-growth: 80°C [83]), the family Thermodesulfobacteriaceae of the phylum Thermodesulfobacteria (maximum temperature for laboratory-growth: 100°C [84]), and the order Desulfurococcales within the class Thermoprotei of the phylum Crenarchaeota (maximum temperature for laboratory-growth: 110°C [85]) have strains growing at temperatures above or within the ten-year range (78–85°C) recorded for Lotus Pond’s vent-water [10,16,17]. These MAGs together accounted for ~3% of the metagenome.
Cyanobacteria have been found in 73–85°C springs [86,87], but Thermosynechococcus are restricted to ≤70°C waters [88,89] while its cultured strains are known to grow only at ≤60°C [90,91]. Halomonas [92], Paracoccus [28,93], Tepidimonas [94], Chromatiales [95] and Vibrio [96] have temperature maxima for laboratory-growth at 50°C, 45°C (or 50°C under special conditions), 60°C, 51°C, and 50°C respectively. Thus, eight out of the 15 Lotus Pond MAGs, collectively accounting for 5.6% of the metagenome, were affiliated to genera encompassing strains that grow only at temperatures below the range known for Lotus Pond’s vent-water. That said, Halomonas, Paracoccus, Tepidimonas, Thiofaba and Vibrio have been isolated from hydrothermal vents or vent-adjacent environments, and found to exhibit varying degrees of thermal endurance [30,94,95,97–99].
The Aquificae not only predominated Lotus Pond’s vent-water microbiome in terms of relative abundance, but also acted as the key primary producers of the ecosystem, accounting for most of the genes present in the metagenome for the most prevalent pathways of chemolithotrophy (sulfur oxidation) and autotrophy (carbon fixation), namely the Sox mechanism, and the rTCA cycle and the 3-hydroxypropionate bi-cycle, respectively. While the Sox multienzyme complex system is thought to have originated in ancient thermophilic bacteria ancestral to Aquificae and Epsilonproteobacteria [100], the putatively primordial rTCA cycle has supposedly originated in sulfide-rich hydrothermal vent environments of the early Earth [101].
Although the constituent protein subunits of Sox have coevolved parallel to one another as orthologous lineages in many extant sulfur chemolithotrophs, the system has also spread across the taxonomic spectrum of Bacteria through extensive paralogy of the components [100,102]. On the other hand, in the extant biota, the rTCA cycle is known to operate in members of Aquificales, and Chlorobi, Deltaproteobacteria, Epsilonproteobacteria, Nitrospirae, and Crenarchaeota [103–106]. Notably, the present exploration revealed the presence of rTCA cycle homologs not only in MAGs affiliated to Aquificaceae, but also those ascribed to unclassified Bacteria, Desulfurococcales, Fervidobacterium, Paracoccus, Tepidimonas, Thermus, and Vibrio. The occurrence of rTCA cycle genes in the unexpected residents of this hydrothermal habitat (i.e. the phylogenetic relatives of known mesophiles) shed new light on the ecological amplitude, selection, and evolution (including horizontal spread) of this ancient autotrophic pathway in the context of thermal stress.
The preponderance of Sox and rTCA in Lotus Pond conforms not only to the pathways’ ecology and evolution, but also the habitat’s physical chemistry, spatial abundance and isotopic ratios of redox sulfur species, and the copious presence of HCO3¯ [10,16]. In contrast, the 3-hydroxypropionate bi-cycle, discovered in the green non-sulfur bacterium called Chloroflexus aurantiacus [107,108], has long been considered as an exclusive feature of Chloroflexi. Recent studies, however, have shown that the pathway, or its analogous variants, could exist beyond this single phylum [109,110]. The present investigation of the Lotus Pond metagenome showed that genes for a few key enzymes of the 3-hydroxypropionate bi-cycle were present in the MAGs affiliated to Aquificae. However, since the pathway was not found in its complete form in these MAGs, it would be premature to vouch for the actual operation of 3-hydroxypropionate bi-cycle in the members of Aquificae native to Lotus Pond.
Furthermore, in this context, it is noteworthy that the presence of a substantial cache of genes for the reductive pentose phosphate cycle in Lotus Pond’s assembled metagenome as well as the MAGs belonging to Tepidimonas, Thermosynechococcus and unclassified Chromatiales is idiosyncratic to the biophysics of the ecosystem. This necessitates a renewed attention to the feasibility of this pathway at temperatures above the upper limit (70–75°C) known thus far for its functionality [65]. Additionally, several homologs of (i) the phosphoenolpyruvate carboxylase gene ppc (K01595), (ii) the malate dehydrogenase mdh (K00024), (iii) the oxaloacetate decarboxylating and NADP+ dependent malate dehydrogenase gene maeB (K00029), and (iv) the pyruvate, orthophosphate dikinase ppdK (K01006) were found to be parts of the Lotus Pond vent-water metagenome, as well as a few MAGs such as those belonging to Paracoccus and Tepidimonas. The protein products of these genes are known to govern the carbon fixation pathway called crassulacean acid metabolism (CAM) in plants adapted to aridity and high temperature stress [111]. To avoid gaseous exchange during the hot day time, CAM plants store CO2 in the form of malic acid at night, transport the malate to chloroplasts during the day, re-convert it CO2 and concentrate the same in the proximity of ribulose bisphosphate carboxylase molecules to carry out efficient carbon fixation [111]. A new line of pure culture based research of physiology and biochemistry needs to be initiated to explore the plausible roles of analogous mechanisms in bacteria adapted to thermal stress.
Lotus Pond’s vent-water metagenome contained several loci for the biosynthesis of secondary metabolites including antibiotics, a large proportion of which was ascribable to Aquificae. Members of this phylum native to Lotus Pond were also endowed with a rich repertoire of genes for antibiotic resistance and multi-drug efflux pumping. Archaea as a whole, Deinococcus-Thermus, Thermodesulfobacteria, and Thermotogae accounted for a small percentage and diversity of the antibiotic biosynthesis and resistance related genes detected; mesophilic taxa led by Alphaproteobacteria and Gammaproteobacteria, however, possessed these genes in large numbers and varieties (S8 Table in S1 File). While the potential antibiosis machinery of the Lotus Pond Aquificae might help them overcome plausible niche overlap with their ecological equivalents (especially the thermophilic and hyperthermophilic archaea), such contrivances can also exacerbate the bioenergetic costs of thermal endurance for the phylogenetic relatives of mesophilic bacteria that infiltrate the Lotus Pond hydrothermal system under the mediation of the local geodynamic forces [10].
Genomic resources that can help mesophiles survive in the Lotus Pond habitat
Most of the genetic diversity present in Lotus Pond’s vent-water microbiome was contributed by phylogenetic relatives of mesophilic bacteria, even though they were not numerally predominant (as subsequently evident from the direct annotation of metagenomic reads corresponding to rRNA gene sequences) and were also plausibly not alive or metabolically active in large numbers. That said, Lotus Pond’s assembled metagenome encompassed several such genes that could be potentially vital for thermal adaptation, especially in the context of the vent-water’s geochemistry and the spring’s topography (Supplementary Results). Also, a good majority of these CDSs were ascribable to phylogenetic relatives of mesophilic bacteria, and their most conspicuous examples included copious genes encoding heat shock proteins, molecular chaperones and chaperonin complexes that render the quality control of DNA and proteins amid biophysical stress. The metagenome contained several genes encoding proteins that control/modulate/inhibit DNA gyrase activity. In true thermophiles, reverse gyrases (RGs) are indispensible as they render positive supercoils in the DNA to counter-balance the negative supercoils promoted by gyrases, and in situ heat [112]. RGs also help avert incorrect aggregation of denatured DNA, and facilitate correct annealing [113]. Expectedly, Lotus Pond MAGs affiliated to Aquificaceae, Desulfurococcales, Fervidobacterium and Sulfurihydrogenibium encoded RGs, but those belonging to phylogenetic relatives of mesophilic or moderately thermophilic bacteria did not. Nevertheless, MAGs affiliated to Halomonas, Paracoccus, Tepidimonas, and Vibrio did encode gyrase inhibitors/modulators such as YacG/PmbA/TldD which are known to nullify gyrase activity [114,115]; notably, several homologs of these genes ascribable to other mesophilic taxa were also present in the assembled metagenome. Future studies of metatranscriptomics and metaproteomics are needed alongside multi-omic investigations of pure culture isolates to ascertain whether and how the above specialties help hot spring mesophiles endure high temperatures over time and space.
The metagenome encoded different toxin-antitoxin (TA) systems, including the Type II, Type IV, RelE/RelB, PrlF/YhaV, YdaS/YdaT, HicAB, and MraZ/MraW systems, which act as regulatory modules for growth and survival of bacterial cells under diverse stress conditions [116]. Other ecologically advantageous genes having large numbers of homologs in the assembled metagenome encoded the following: (a) universal stress proteins, homologs of which are also possessed by bacterial mesophiles occupying ecological niches adjacent to deep sea hydrothermal vents [98]; (b) methionine sulfoxide reductases (Msr), which repair oxidatively damaged/inactivated proteins by converting their methionine sulfoxides to methionines; (c) enzymes known to protect cells from the toxic effects of peroxides, which in turn are often abundant in hot spring waters [117]; (d) diverse proteins adapted structurally and/or functionally to high concentrations of heavy metals, which is a common feature of hydrothermal discharges [118–121] including those of Lotus Pond and other hot springs of Puga Valley [5,10,14–16]; (e) different Mnt proteins that render cellular transport and homeostasis of manganese, a key cofactor of different enzymes protecting against oxidative damage; (f) fatty acid desaturases that may help increase the rigidity of cell membranes (via unsaturation of fatty acids).
Numerous genes were also there in the Lotus Pond metagenome for flagellar structure/function, chemotaxis, cell adhesion/aggregation, biofilm formation, and quorum sensing. All these attributes have the potentials to assist Lotus Pond’s native microorganisms respond to in situ environmental conditions and position themselves dynamically at their biophysically/biochemically best-suited niches along the hydrothermal (Fig 1A) and biogeochemical [10] gradients occurring in the vent to river (Rulang) trajectory.
Why are virulence genes abundant in Lotus Pond?
The Lotus Pond metagenome encompassed several CDSs for the formation of biofilms, adhesion of cells, biosynthesis of hemagglutinin and hemolysin, and stationary phase metabolisms including persister cell formation (S2 Table in S1 File). While biofilm formation, cellular adhesion, and haemolysin synthesis may all have key roles in osmoregulation and extreme-temperature response [122,123], formation of persister cells can help bacterial mesophiles attain ultra-slow metabolic states, which have recently been reported to be central to the endurance of high in situ temperatures [30]. Furthermore, the metagenome encompassed more than a thousand genes for constructing Type-I through Type-IX SSs, excepting the Type-V and Type-VII variants. While all these molecular machineries secreting biomacromolecules to the cells’ exterior have clearly documented roles in bacterial survival within eukaryotic hosts (and consequent virulence), their potentially multifaceted ecological roles are also not unappreciated [98]. For instance, macromolecules moved across the cell membrane by Type-II SS often serve as local and long-range effectors/promoters of nutrient acquisition and biofilm formation under adverse environmental conditions [124,125]. Type-III SS, besides translocating effector bacterial proteins into host cells, plays important roles in stress response including those involving high salts and peroxides in the chemical milieu of the bacterium [126]. Systems level network analyses have also revealed interrelations between Type-III and Type-IV SSs on one hand and stress management on the other [127]. Furthermore, Type-IV SS homologs are also sometimes involved in inter-bacterial killing, which can not only eliminate competition but also turn out to be beneficial for the killers in nutrient deprived habitats [128,129]. Type-VI SS homologs, besides being central to survival within eukaryotic hosts, regulate quorum sensing with wide-ranging implications for environmental adaptation [130,131]. Type-VIII SS is implicated in adhesion, aggregation, and biofilm formation [132], while the Type-IX SS, reported thus far from Bacteroidetes alone, confers gliding motility to the environmental strains of the phylum [133]. Thus, in the context of the mesophiles present in the Lotus Pond vent-water, both these SSs might turn out to be instrumental in placing the native bacteria to biophysically favorable sites within the hydrothermal gradients (Fig 1A).
Summing up
High temperature habitats despite being the extant analogs of primordial ecosystems on Earth are situated at the functional extremities of the biosphere. They are characterized by such biophysical conditions which constrain habitability for most of the extant microbial taxa via entropic disordering of biomacromolecules that in turn imposes unsustainably high energetic costs on the maintenance of the metabolic machinery. In such a scenario, only those few specialized groups of thermophilic microorganisms, which can afford the high energy-cost of maintenance and avoid cell system failure due to high heat, manage to colonize hydrothermal ecosystems and prevail in situ. That said, hot springs are not totally out of bound territories for free-living environmental microorganisms that do not have any phylogenetic (taxonomic) relative known for in vitro growth at >45°C. For instance, Trans-Himalayan hot spring waters poor in calcium and silicon, but rich in boron, chlorine, sodium, and various sulfur species, are known based on PCR-amplified 16S rRNA gene sequence data to harbor high diversities of bacterial mesophiles that are stochastically introduced to the system by local geodynamic forces. The present study for the first time undertook deep shotgun metagenome sequencing and analysis for such a high-biodiversity Trans-Himalayan hot spring called Lotus Pond to comprehensively reveal the microbiome architecture of its boiling vent-water discharge.
The gross microbial cell density in Lotus Pond’s vent-water was found to be 8.5 × 104 mL-1. Approximately 63.5% of this microbial population (i.e. 5.4 × 104 cells mL-1 vent-water) was found to be metabolically active as the cells could take in FDA and hydrolyze it to fluorescein that eventually was detected via fluorescence microscopy. Assembly, annotation, and population-binning of >15-GB metagenomic sequence elucidated the relative abundance, metabolisms, and adaptations of all the microbial groups present in the 85°C fluid vented by the sulfur-borax spring called Lotus Pond, and in doing so illuminated the numeral predominance of Aquificae (particularly, Hydrogenobacter and Sulfurihydrogenibium), even as Proteobacteria contributed most of the genetic diversity of the microbiome; prevalence and diversity of Deinococcus-Thermus, Thermotogae, Thermodesulfobacteria, Crenarchaeota and Euryarchaeota were far less. Perpetual presence of bacterial mesophiles in Lotus Pond’s vent-water was evident from the detection of 51 such genera that had been discovered in previous 16S amplicon-based investigations of the habitat, but have no member with laboratory growth at >45°C. Within the Lotus Pond vent-water microbiome maximum genetic resource was devoted to Translation, ribosomal structure and biogenesis, followed by Energy production and conversion. Lotus Pond’s metagenome was replete with genes crucial for high temperature adaptation, especially in the context of Lotus Pond’s geochemistry and topography. By sequence homology, a majority of these genes were attributable to phylogenetic relatives of mesophilic bacteria. Copious genes within the metagenome were also directly or indirectly related to virulence functions and antibiotics synthesis/resistance. Since a good proportion of these genes were attributable to Aquificae, they could well help the group outcompete other thermophiles present in situ, and contain the proliferation of the mesophilic intruders of the ecosystem. That said, in the absence of culture-based microbiological data and gene expression analyses, the actual activity of the Lotus Pond microflora in this regard remains purely speculative and subject to future testimony. In the same vein, there is still no conclusive evidence to prove that the mesophiles present in the Trans-Himalayan sulfur-borax hot springs contribute to the live microflora in situ. In that context, however, it is noteworthy that the holistic elucidation of microbiome structure and functions for the boiling vent-waters of Trans-Himalayan sulfur-borax hot springs is an ongoing process. We had previously outlined the alpha diversity of the system via analysis of amplified 16S rRNA gene sequences [10,17,18], then we isolated a few strains that were phylogenetically related to mesophilic bacteria [17,30]; here we explored the microbiome via deep shotgun metagenomics, and in the near future we shall communicate the findings of metatranscriptome analysis.
Supporting information
(DOC)
(XLSX)
Acknowledgments
We thank Professor Shubhra Ghosh Dastidar and Sri Sanjib Kumar Gupta of Bose Institute, Kolkata, for helping with computational resources.
Data Availability
The shotgun metagenome sequence dataset was submitted to the Sequence Read Archive (SRA) of the National Center for Biotechnology Information (NCBI), USA, under the BioProject PRJNA296849, with the BioSample and Run accession numbers SAMN31749247 and SRR22364517 respectively. The bacterial and archaeal 16S amplicon sequence datasets were submitted to the SRA, under the BioProject PRJNA296849, with BioSample accession numbers SAMN32724325 and SAMN32724260, and Run accession numbers SRR23072972 and SRR23072970, respectively.
Funding Statement
This research was financed by Bose Institute (via intramural faculty grants) as well as the Science and Engineering Research Board (SERB), Government of India (GoI) (SERB grant number EMR/2016/002703). N.M. received fellowships from SERB and Bose Institute. S.D. and J.S. obtained their fellowships from Council of Scientific and Industrial Research, GoI. S.C. and M.M. received a fellowship from the Department of Biotechnology, GoI. The funders played no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Wächtershäuser G. From volcanic origins of chemoautotrophic life to Bacteria, Archaea and Eukarya. Philos Trans R Soc B. 2006;361: 1787–1808. doi: 10.1098/rstb.2006.1904 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Patel BH, Percivalle C, Ritson DJ, Duffy CD, Sutherland JD. Common origins of RNA, protein and lipid precursors in a cyanosulfidic protometabolism. Nat Chem. 2015;7: 301–307. doi: 10.1038/nchem.2202 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ranjan S, Todd ZR, Sutherland JD, Sasselov DD. Sulfidic anion concentrations on early earth for surficial origins-of-life chemistry. Astrobiology. 2018;18: 1023–1040. doi: 10.1089/ast.2017.1770 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Fields PA. Protein function at thermal extremes: balancing stability and flexibility. Comp Biochem Physiol A Mol Integr Physiol. 2001;129: 417–431. [DOI] [PubMed] [Google Scholar]
- 5.Roy C, Rameez MJ, Haldar PK, Peketi A, Mondal N, Bakshi U, et al. Microbiome and ecology of a hot spring-microbialite system on the Trans-Himalayan Plateau. Sci Rep. 2020;10: 5917. doi: 10.1038/s41598-020-62797-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kristjansson JK, Stetter KO. Thermophilic bacteria. In: Kristjansson JK, Stetter KO (eds). Thermophilic bacteria. 1st ed. CRC press, 2021. pp 1–18. [Google Scholar]
- 7.Hall JR, Mitchell KR, Jackson-Weaver O, Kooser AS, Cron BR, Crossey LJ, et al. Molecular characterization of the diversity and distribution of a thermal spring microbial community by using rRNA and metabolic genes. Appl Environ Microbiol. 2008;74: 4910–4922. doi: 10.1128/AEM.00233-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Everroad RC, Otaki H, Matsuura K, Haruta S. Diversification of bacterial community composition along a temperature gradient at a thermal spring. Microbes Environ. 2012;27: 374–381. doi: 10.1264/jsme2.me11350 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tang J, Liang Y, Jiang D, Li L, Luo Y, Shah MM, et al. Temperature-controlled thermophilic bacterial communities in hot springs of western Sichuan, China. BMC Microbiol. 2018;18: 134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Roy C, Mondal N, Peketi A, Fernandes S, Mapder T, Volvoikar SP, et al. Geomicrobial dynamics of Trans-Himalayan sulfur–borax spring system reveals mesophilic bacteria’s resilience to high heat. J Earth Syst Sci. 2020;129: 157. [Google Scholar]
- 11.Chiriac CM, Szekeres E, Rudi K, Baricz A, Hegedus A, Dragoş N, et al. Differences in temperature and water chemistry shape distinct diversity patterns in thermophilic microbial communities. Appl Environ Microbiol. 2017;83: e01363–17. doi: 10.1128/AEM.01363-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ruhl IA, Sheremet A, Smirnova AV, Sharp CE, Grasby SE, Strous M, et al. Microbial functional diversity correlates with species diversity along a temperature gradient. Msystems. 2022;7: e00991–21. doi: 10.1128/msystems.00991-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gansser A. Geology of the Himalayas. London: Interscience Publishers, 1964. [Google Scholar]
- 14.Garrett DE. Borates: Handbook of deposits, processing, properties, and use. Academic Press, 1998. [Google Scholar]
- 15.Rai AP. Compilation of data on chemical analysis of water and gas samples from North West Himalaya and adjoining areas. Bulletin Series-C, No. 5, Geological Survey of India, 2001.
- 16.Ghosh W, Mallick S, Haldar PK, Pal B, Maikap SC, Gupta SK. Molecular and cellular fossils of a mat-like microbial community in geothermal boratic sinters. Geomicrobiol J. 2012;29: 879–885. [Google Scholar]
- 17.Roy C, Alam M, Mandal S, Haldar PK, Bhattacharya S, Mukherjee T, et al. Global association between thermophilicity and vancomycin susceptibility in bacteria. Front Microbiol. 2016;7: 412. doi: 10.3389/fmicb.2016.00412 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Mondal N, Peketi A, Mapder T, Roy C, Mazumdar A, Chakraborty R, et al. Indus and Nubra Valley hot springs affirm the geomicrobiological specialties of Trans-Himalayan hydrothermal systems. J Earth Syst Sci. 2022;131: 12. [Google Scholar]
- 19.Wang S, Hou W, Dong H, Jiang H, Huang L, Wu G, et al. Control of temperature on microbial community structure in hot springs of the Tibetan Plateau. PLoS One. 2013;8: e62901. doi: 10.1371/journal.pone.0062901 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhang Y, Wu G, Jiang H, Yang J, She W, Khan I, et al. Abundant and rare microbial biospheres respond differently to environmental and spatial factors in Tibetan hot springs. Front Microbiol. 2018;9: 2096. doi: 10.3389/fmicb.2018.02096 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Das S, Sherpa MT, Najar IN, Thakur N. Prevalence of methanogens in the uncultured Sikkim hot spring solfataric mud archaeal microbiome. Environ Sustain. 2020;3: 453–469. [Google Scholar]
- 22.Ghosh W, Roy C, Roy R, Nilawe P, Mukherjee A, Haldar PK, et al. Resilience and receptivity worked in tandem to sustain a geothermal mat community amidst erratic environmental conditions. Sci Rep. 2015;5: 12179. doi: 10.1038/srep12179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Guo L, Wang G, Sheng Y, Sun X, Shi Z, Xu Q, et al. Temperature governs the distribution of hot spring microbial community in three hydrothermal fields, Eastern Tibetan Plateau Geothermal Belt, Western China. Sci Total Environ. 2020;720: 137574. doi: 10.1016/j.scitotenv.2020.137574 [DOI] [PubMed] [Google Scholar]
- 24.Kumar V, Kumar S, Singh D. Metagenomic insights into Himalayan glacial and kettle lake sediments revealed microbial community structure, function, and stress adaptation strategies. Extremophiles. 2022;26: 3. [DOI] [PubMed] [Google Scholar]
- 25.Nagar S, Talwar C, Motelica-Heino M, Richnow HH, Shakarad M, Lal R, et al. Microbial Ecology of Sulfur Biogeochemical Cycling at a Mesothermal Hot Spring Atop Northern Himalayas, India. Front Microbiol. 2022;13: 848010. doi: 10.3389/fmicb.2022.848010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Zhang P, Xiong J, Qiao N, Luo S, Yang Q, Li X, et al. High Variation in Protist Diversity and Community Composition in Surface Sediment of Hot Springs in Himalayan Geothermal Belt, China. Microorganisms. 2023;11: 674. doi: 10.3390/microorganisms11030674 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Power JF, Carere CR, Lee CK, Wakerley GL, Evans DW, Button M, et al. Microbial biogeography of 925 geothermal springs in New Zealand. Nat Commun. 2018;9: 2876. doi: 10.1038/s41467-018-05020-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Colman DR, Lindsay MR, Boyd ES. Mixing of meteoric and geothermal fluids supports hyperdiverse chemosynthetic hydrothermal communities. Nat Commun. 2019;10: 681. doi: 10.1038/s41467-019-08499-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Roy C, Bakshi U, Rameez MJ, Mandal S, Haldar PK, Pyne P, et al. Phylogenomics of an uncultivated, aerobic and thermophilic, photoheterotrophic member of Chlorobia sheds light into the evolution of the phylum Chlorobi. Comput Biol Chem. 2019;80: 206–216. doi: 10.1016/j.compbiolchem.2019.04.001 [DOI] [PubMed] [Google Scholar]
- 30.Mondal N, Roy C, Chatterjee S, Sarkar J, Dutta S, Bhattacharya S, et al. Thermal endurance by a hot-spring-dwelling phylogenetic relative of the mesophilic Paracoccus. Microbiol Spectr. 2022;10: e01606–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Steller LH, Nakamura E, Ota T, Sakaguchi C, Sharma M, Van Kranendonk MJ. Boron isotopes in the Puga geothermal system, India, and their implications for the habitat of early life. Astrobiology. 2019;19: 1459–1473. doi: 10.1089/ast.2018.1966 [DOI] [PubMed] [Google Scholar]
- 32.Sarkar S, Moitra H, Bhattacharya S, Dagar A, Ray D, Gupta S, et al. Spectroscopic studies on the Puga Hot Spring Deposits, Ladakh, an astrobiological Martian analog site in India. J Geophys Res Planets. 2022;127: e2022JE007299. [Google Scholar]
- 33.Harinarayana T, Azeez KA, Murthy DN, Veeraswamy K, Rao SE, Manoj C, et al. Exploration of geothermal structure in Puga geothermal field, Ladakh Himalayas, India by magnetotelluric studies. J Appl Geophys. 2006;58: 280–295. [Google Scholar]
- 34.Quan X, Nitta K, Matoba O, Xia P, Awatsuji Y. Phase and fluorescence imaging by combination of digital holographic microscopy and fluorescence microscopy. Opt Rev. 2015;22: 349–353. [Google Scholar]
- 35.Absher M. Hemocytometer counting. In Kruse PF, Patterson MK (eds). Tissue culture. Academic Press, 1973. pp 395–397. [Google Scholar]
- 36.Stinson LF, Keelan JA, Payne MS. Identification and removal of contaminating microbial DNA from PCR reagents: impact on low‐biomass microbiome analyses. Lett Appl Microbiol. 2019;68: 2–8. doi: 10.1111/lam.13091 [DOI] [PubMed] [Google Scholar]
- 37.Zubiria-Barrera C, Stock M, Neubert R, Vester A, Kulle A, Schneegans A, et al. A simple sequence-based filtering method for the removal of contaminants in low-biomass 16S rRNA amplicon sequencing approaches. J Microbiol Methods. 2020;178: 106060. doi: 10.1016/j.mimet.2020.106060 [DOI] [PubMed] [Google Scholar]
- 38.Li D, Liu CM, Luo R, Sadakane K, Lam TW. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31: 1674–1676. doi: 10.1093/bioinformatics/btv033 [DOI] [PubMed] [Google Scholar]
- 39.Hyatt D, Chen GL, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11: 119. doi: 10.1186/1471-2105-11-119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Huerta-Cepas J, Szklarczyk D, Heller D, Hernández-Plaza A, Forslund SK, Cook H, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47: D309–D314. doi: 10.1093/nar/gky1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Cantalapiedra CP, Hernández-Plaza A, Letunic I, Bork P, Huerta-Cepas J. eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale. Mol Biol Evol. 2021;38: 5825–5829. doi: 10.1093/molbev/msab293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Shimoyama Y. COGclassifier: a tool for classifying prokaryote protein sequences into COG functional category. 2022. https://github.com/moshi4/COGclassifier. [Google Scholar]
- 43.Medema MH, Blin K, Cimermancic P, De Jager V, Zakrzewski P, Fischbach MA, et al. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39: W339–W346. doi: 10.1093/nar/gkr466 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Blin K, Shaw S, Augustijn HE, Reitz ZL, Biermann F, Alanjary M, et al. antiSMASH 7.0: New and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids Res. 2023;51: W46–W50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Alcock BP, Huynh W, Chalil R, Smith KW, Raphenya AR, Wlodarski MA, et al. CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database. Nucleic Acids Res. 2023;51: D690–D699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Stoddard SF, Smith BJ, Hein R, Roller BR, Schmidt TM. rrn DB: improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development. Nucleic Acids Res. 2015;43: D593–D598. doi: 10.1093/nar/gku1201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. doi: 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3: e1165. doi: 10.7717/peerj.1165 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wu YW, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32: 605–607. doi: 10.1093/bioinformatics/btv638 [DOI] [PubMed] [Google Scholar]
- 50.Alneberg J, Bjarnason BS, De Bruijn I, Schirmer M, Quick J, Ijaz UZ, et al. Binning metagenomic contigs by coverage and composition. Nat Methods. 2014;11: 1144–1146. doi: 10.1038/nmeth.3103 [DOI] [PubMed] [Google Scholar]
- 51.Sieber CM, Probst AJ, Sharrar A, Thomas BC, Hess M, Tringe SG, et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat Microbiol. 2018;3: 836–843. doi: 10.1038/s41564-018-0171-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25: 1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2020;36: 1925–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: rapid annotations using subsystems technology. BMC Genom. 2008;9: 75. doi: 10.1186/1471-2164-9-75 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Meier-Kolthoff JP, Carbasse JS, Peinado-Olarte RL, Göker M. TYGS and LPSN: a database tandem for fast and reliable genome-based classification and nomenclature of prokaryotes. Nucleic Acids Res. 2022;50: D801–D807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yoon SH, Ha SM, Lim J, Kwon S, Chun J. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek. 2017;110: 1281–1286. doi: 10.1007/s10482-017-0844-4 [DOI] [PubMed] [Google Scholar]
- 57.Na SI, Kim YO, Yoon SH, Ha SM, Baek I, Chun J. UBCG: up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction. J Microbiol. 2018;56: 280–285. doi: 10.1007/s12275-018-8014-6 [DOI] [PubMed] [Google Scholar]
- 58.Darriba D, Posada D, Kozlov AM, Stamatakis A, Morel B, Flouri T. ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol Biol Evol. 2020;37: 291–294. doi: 10.1093/molbev/msz189 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30: 1312–1313. doi: 10.1093/bioinformatics/btu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47: W256–W259. doi: 10.1093/nar/gkz239 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35: W182–W185. doi: 10.1093/nar/gkm321 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Edgar RC. UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat Methods. 2013;10: 996–998. doi: 10.1038/nmeth.2604 [DOI] [PubMed] [Google Scholar]
- 63.Evans RN, Blaha G, Bailey S, Steitz TA. The structure of LepA, the ribosomal back translocase. Proc Natl Acad Sci USA. 2008;105: 4673–4678. doi: 10.1073/pnas.0801308105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Heller JL, Kamalampeta R, Wieden HJ. Taking a step back from back-translocation: an integrative view of LepA/EF4’s cellular function. Mol Cell Biol. 2017;37: e00653–16. doi: 10.1128/MCB.00653-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Berg IA. Ecological aspects of the distribution of different autotrophic CO2 fixation pathways. Appl Environ Microbiol. 2011;77: 1925–1936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Nunoura T, Chikaraishi Y, Izaki R, Suwa T, Sato T, Harada T, et al. A primordial and reversible TCA cycle in a facultatively chemolithoautotrophic thermophile. Science. 2018;359: 559–563. doi: 10.1126/science.aao3407 [DOI] [PubMed] [Google Scholar]
- 67.Wildman SG. Along the trail from Fraction I protein to Rubisco (ribulose bisphosphate carboxylase-oxygenase). Photosynth Res. 2002;73: 243–250. doi: 10.1023/A:1020467601966 [DOI] [PubMed] [Google Scholar]
- 68.Fuchs G. Alternative pathways of carbon dioxide fixation: insights into the early evolution of life? Annu Rev Microbiol. 2011;65: 631–658. doi: 10.1146/annurev-micro-090110-102801 [DOI] [PubMed] [Google Scholar]
- 69.Adam PS, Borrel G, Gribaldo S. Evolutionary history of carbon monoxide dehydrogenase/acetyl-CoA synthase, one of the oldest enzymatic complexes. Proc Natl Acad Sci USA. 2018;115: E1166–E1173. doi: 10.1073/pnas.1716667115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Huber H, Gallenberger M, Jahn U, Eylert E, Berg IA, Kockelkorn D, et al. A dicarboxylate/4-hydroxybutyrate autotrophic carbon assimilation cycle in the hyperthermophilic Archaeum Ignicoccus hospitalis. Proc Natl Acad Sci USA. 2008;105: 7851–7856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Mall A, Sobotta J, Huber C, Tschirner C, Kowarschik S, Bačnik K, et al. Reversibility of citrate synthase allows autotrophic growth of a thermophilic bacterium. Science. 2018;359: 563–567. doi: 10.1126/science.aao2410 [DOI] [PubMed] [Google Scholar]
- 72.Frolov EN, Kublanov IV, Toshchakov SV, Lunev EA, Pimenov NV, Bonch-Osmolovskaya EA, et al. Form III RubisCO-mediated transaldolase variant of the Calvin cycle in a chemolithoautotrophic bacterium. Proc Natl Acad Sci USA. 2019;116: 18638–18646. doi: 10.1073/pnas.1904225116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Sánchez-Andrea I, Guedes IA, Hornung B, Boeren S, Lawson CE, Sousa DZ, et al. The reductive glycine pathway allows autotrophic growth of Desulfovibrio desulfuricans. Nat Commun. 2020;11: 5090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Campanaro S, Treu L, Kougias PG, Zhu X, Angelidaki I. Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies. Sci Rep. 2018;8: 1926. doi: 10.1038/s41598-018-20414-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Laudadio I, Fulci V, Palone F, Stronati L, Cucchiara S, Carissimi C. Quantitative assessment of shotgun metagenomics and 16S rDNA amplicon sequencing in the study of human gut microbiome. OMICS J Integr Biol. 2018;22: 248–254. [DOI] [PubMed] [Google Scholar]
- 76.Brumfield KD, Huq A, Colwell RR, Olds JL, Leddy MB. Microbial resolution of whole genome shotgun and 16S amplicon metagenomic sequencing using publicly available NEON data. PLoS One. 2020;15: e0228899. doi: 10.1371/journal.pone.0228899 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Durazzi F, Sala C, Castellani G, Manfreda G, Remondini D, De Cesare A. Comparison between 16S rRNA and shotgun sequencing data for the taxonomic characterization of the gut microbiota. Sci Rep. 2021;11: 3030. doi: 10.1038/s41598-021-82726-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Tessler M, Neumann JS, Afshinnekoo E, Pineda M, Hersch R, Velho LF, et al. Large-scale differences in microbial biodiversity discovery between 16S amplicon and shotgun sequencing. Sci Rep. 2017;7: 6589. doi: 10.1038/s41598-017-06665-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Peterson D, Bonham KS, Rowland S, Pattanayak CW, Resonance Consortium, Klepac-Ceraj V. Comparative analysis of 16S rRNA gene and metagenome sequencing in pediatric gut microbiomes. Front Microbiol. 2021;12: 670336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Huber R, Wilharm T, Huber D, Trincone A, Burggraf S, König H, et al. Aquifex pyrophilus gen. nov. sp. nov., represents a novel group of marine hyperthermophilic hydrogen-oxidizing bacteria. Syst Appl Microbiol. 1992;15: 340–351. [Google Scholar]
- 81.O’Neill AH, Liu Y, Ferrera I, Beveridge TJ, Reysenbach AL. Sulfurihydrogenibium rodmanii sp. nov., a sulfur-oxidizing chemolithoautotroph from the Uzon Caldera, Kamchatka Peninsula, Russia, and emended description of the genus Sulfurihydrogenibium. Int J Syst Evol Microbiol. 2008;58: 1147–1152. [DOI] [PubMed] [Google Scholar]
- 82.Cai J, Wang Y, Liu D, Zeng Y, Xue Y, Ma Y, et al. Fervidobacterium changbaicum sp. nov., a novel thermophilic anaerobic bacterium isolated from a hot spring of the Changbai Mountains, China. Int J Syst Evol Microbiol. 2007;57: 2333–2336. [DOI] [PubMed] [Google Scholar]
- 83.Chung AP, Rainey FA, Valente M, Nobre MF, da Costa MS. Thermus igniterrae sp. nov. and Thermus antranikianii sp. nov., two new species from Iceland. Int J Syst Evol Microbiol. 2000;50: 209–217. [DOI] [PubMed] [Google Scholar]
- 84.Kashefi K, Holmes DE, Reysenbach AL, Lovley DR. Use of Fe (III) as an electron acceptor to recover previously uncultured hyperthermophiles: isolation and characterization of Geothermobacterium ferrireducens gen. nov., sp. nov. Appl Environ Microbiol. 2002;68: 1735–1742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Stetter KO, König H, Stackebrandt E. Pyrodictium gen. nov., a new genus of submarine disc-shaped sulphur reducing archaebacteria growing optimally at 105°C. Syst Appl Microbiol. 1983;4: 535–551. [DOI] [PubMed] [Google Scholar]
- 86.Copeland JJ. Yellowstone thermal myxophyceae. Ann N Y Acad Sci. 1936;36: 4–223. [Google Scholar]
- 87.Ward DM, Castenholz RW, Miller SR, Whitton B. Cyanobacteria in geothermal habitats. In: Whitton BA (ed). Ecology of cyanobacteria II: their diversity in space and time. Springer Science and Business media, 2012. pp 39–63. [Google Scholar]
- 88.Katoh H, Itoh S, Shen JR, Ikeuchi M. Functional analysis of psbV and a novel c-type cytochrome gene psbV2 of the thermophilic cyanobacterium Thermosynechococcus elongatus strain BP-1. Plant Cell Physiol. 2001;42: 599–607. [DOI] [PubMed] [Google Scholar]
- 89.Komárek J, Johansen JR, Šmarda J, Strunecký O. Phylogeny and taxonomy of Synechococcus-like cyanobacteria. Fottea. 2020;20: 171–191. [Google Scholar]
- 90.Yamaoka T, Satoh K, Katoh S. Photosynthetic activities of a thermophilic blue-green alga. Plant Cell Physiol. 1978;19: 943–954. [Google Scholar]
- 91.Liang Y, Tang J, Luo Y, Kaczmarek MB, Li X, Daroch M. Thermosynechococcus as a thermophilic photosynthetic microbial cell factory for CO2 utilisation. Bioresour Technol. 2019;278: 255–265. [DOI] [PubMed] [Google Scholar]
- 92.Guan TW, Xiao J, Zhao K, Luo XX, Zhang XP, Zhang LL. Halomonas xinjiangensis sp. nov., a halotolerant bacterium isolated from a salt lake. Int J Syst Evol Microbiol. 2010;60: 349–352. [DOI] [PubMed] [Google Scholar]
- 93.Sun X, Luo P, Li M. Paracoccus angustae sp. nov., isolated from soil. Int J Syst Evol Microbiol. 2015;65: 3469–3475. [DOI] [PubMed] [Google Scholar]
- 94.Chen WM, Huang HW, Chang JS, Han YL, Guo TR, Sheu SY. Tepidimonas fonticaldi sp. nov., a slightly thermophilic betaproteobacterium isolated from a hot spring. Int J Syst Evol Microbiol. 2013;63: 1810–1816. [DOI] [PubMed] [Google Scholar]
- 95.Mori K, Suzuki KI. Thiofaba tepidiphila gen. nov., sp. nov., a novel obligately chemolithoautotrophic, sulfur-oxidizing bacterium of the Gammaproteobacteria isolated from a hot spring. Int J Syst Evol Microbiol. 2008;58: 1885–1891. [DOI] [PubMed] [Google Scholar]
- 96.Meng YC, Liu HC, Zhou YG, Cai M, Kang Y. Vibrio gangliei sp. nov., a novel member of Vibrionaceae isolated from sawdust in a pigpen. Int J Syst Evol Microbiol. 2018;68: 1969–1974. [DOI] [PubMed] [Google Scholar]
- 97.Kaye JZ, Marquez MC, Ventosa A, Baross JA. Halomonas neptunia sp. nov., Halomonas sulfidaeris sp. nov., Halomonas axialensis sp. nov. and Halomonas hydrothermalis sp. nov.: halophilic bacteria isolated from deep-sea hydrothermal-vent environments. Int J Syst Evol Microbiol. 2004;54: 499–511. [DOI] [PubMed] [Google Scholar]
- 98.Hasan NA, Grim CJ, Lipp EK, Rivera IN, Chun J, Haley BJ, et al. Deep-sea hydrothermal vent bacteria related to human pathogenic Vibrio species. Proc Natl Acad Sci USA. 2015;112: E2813–E2819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Du R, Gao D, Wang Y, Liu L, Cheng J, Liu J, et al. Heterotrophic sulfur oxidation of Halomonas titanicae SOB56 and its habitat adaptation to the hydrothermal environment. Front Microbiol. 2022;13: 888833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Ghosh W, Mallick S, DasGupta SK. Origin of the Sox multienzyme complex system in ancient thermophilic bacteria and coevolution of its constituent proteins. Res Microbiol. 2009;160: 409–420. doi: 10.1016/j.resmic.2009.07.003 [DOI] [PubMed] [Google Scholar]
- 101.Kitadai N, Kameya M, Fujishima K. Origin of the reductive tricarboxylic acid (rTCA) cycle-type CO2 fixation: A perspective. Life. 2017;7: 39. [Google Scholar]
- 102.Ghosh W, Dam B. Biochemistry and molecular biology of lithotrophic sulfur oxidation by taxonomically and ecologically diverse bacteria and archaea. FEMS Microbiol Rev. 2009;33: 999–1043. doi: 10.1111/j.1574-6976.2009.00187.x [DOI] [PubMed] [Google Scholar]
- 103.Schäfer S, Barkowski C, Fuchs G. Carbon assimilation by the autotrophic thermophilic archaebacterium Thermoproteus neutrophilus. Arch Microbiol. 1986;146: 301–308. [Google Scholar]
- 104.Beh M, Strauss G, Huber R, Stetter KO, Fuchs G. Enzymes of the reductive citric acid cycle in the autotrophic eubacterium Aquifex pyrophilus and in the archaebacterium Thermoproteus neutrophilus. Arch Microbiol. 1993;160: 306–311. [Google Scholar]
- 105.Campbell BJ, Cary SC. Abundance of reverse tricarboxylic acid cycle genes in free-living microorganisms at deep-sea hydrothermal vents. Appl Environ Microbiol. 2004;70: 6282–6289. doi: 10.1128/AEM.70.10.6282-6289.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Hügler M, Huber H, Stetter KO, Fuchs G. Autotrophic CO2 fixation pathways in archaea (Crenarchaeota). Arch Microbiol. 2003;179: 160–173. [DOI] [PubMed] [Google Scholar]
- 107.Holo H, Sirevåg R. Autotrophic growth and CO2 fixation of Chloroflexus aurantiacus. Arch Microbiol. 1986;145: 173–180. [Google Scholar]
- 108.Holo H., 1989. Chloroflexus aurantiacus secretes 3-hydroxypropionate, a possible intermediate in the assimilation of CO2 and acetate. Arch Microbiol. 1989;151: 252–256. [Google Scholar]
- 109.Ruiz‐Fernández P, Ramírez‐Flandes S, Rodríguez‐León E, Ulloa, O. Autotrophic carbon fixation pathways along the redox gradient in oxygen‐depleted oceanic waters. Environ Microbiol Rep. 2020;12: 334–341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Garritano AN, Song W. Thomas T. Carbon fixation pathways across the bacterial and archaeal tree of life. PNAS nexus. 2022;1: pgac226. doi: 10.1093/pnasnexus/pgac226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Heyduk K. Evolution of Crassulacean acid metabolism in response to the environment: past, present, and future. Plant Physiol. 2022;190: 19–30. doi: 10.1093/plphys/kiac303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Lipscomb GL, Hahn EM, Crowley AT, Adams MW. Reverse gyrase is essential for microbial growth at 95°C. Extremophiles. 2017;21: 603–608. [DOI] [PubMed] [Google Scholar]
- 113.Kampmann M, Stock D. Reverse gyrase has heat-protective DNA chaperone activity independent of supercoiling. Nucleic Acids Res. 2004;32: 3537–3545. doi: 10.1093/nar/gkh683 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Murayama N, Shimizu H, Takiguchi S, Baba Y, Amino H, Horiuchi T, et al. Evidence for involvement of Escherichia coli genes pmbA, csrA and a previously unrecognized gene tldD, in the control of DNA gyrase by letD (ccdB) of sex factor F. J Mol Biol. 1996;256: 483–502. [DOI] [PubMed] [Google Scholar]
- 115.Sengupta S, Nagaraja V. YacG from Escherichia coli is a specific endogenous inhibitor of DNA gyrase. Nucleic Acids Res. 2008;36: 4310–4316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Page R, Peti W. Toxin-antitoxin systems in bacterial growth arrest and persistence. Nat Chem Biol. 2016;12: 208–214. doi: 10.1038/nchembio.2044 [DOI] [PubMed] [Google Scholar]
- 117.Mesle MM, Beam JP, Jay ZJ, Bodle B, Bogenschutz E, Inskeep WP. Hydrogen peroxide cycling in high-temperature acidic geothermal springs and potential implications for oxidative stress response. Front Mar Sci. 2017;4: 130. [Google Scholar]
- 118.Mulkidjanian AY, Bychkov AY, Dibrova DV, Galperin MY, Koonin EV. Origin of first cells at terrestrial, anoxic geothermal fields. Proc Natl Acad Sci USA. 2012;109: E821–E830. doi: 10.1073/pnas.1117774109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Sherpa MT, Das S, Thakur N. Physicochemical analysis of hot water springs of Sikkim-Polok tatopani, borong tatopani and Reshi tatopani. Recent Res Sci Technol. 2013;5: 63–67. [Google Scholar]
- 120.Van Kranendonk MJ, Baumgartner R, Djokic T, Ota T, Steller L, Garbe U, et al. Elements for the origin of life on land: a deep-time perspective from the Pilbara Craton of Western Australia. Astrobiology. 2021;21: 39–59. doi: 10.1089/ast.2019.2107 [DOI] [PubMed] [Google Scholar]
- 121.Jawadi HA, Malistani HA, Moheghy MA, Sagin J. Essential trace elements and arsenic in thermal springs, Afghanistan. Water. 2021;13: 134. [Google Scholar]
- 122.Smith B, Oliver JD. In situ and in vitro gene expression by Vibrio vulnificus during entry into, persistence within, and resuscitation from the viable but nonculturable state. Appl Environ Microbiol. 2006;72: 1445–1451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Kent AG, Garcia CA, Martiny AC. Increased biofilm formation due to high-temperature adaptation in marine Roseobacter. Nat Microbiol. 2018;3: 989–995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Alvarez-Martinez CE, Sgro GG, Araujo GG, Paiva MR, Matsuyama BY, Guzzo CR, et al. Secrete or perish: the role of secretion systems in Xanthomonas biology. Comput Struct Biotechnol J. 2021;19: 279–302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Naskar S, Hohl M, Tassinari M, Low HH. The structure and mechanism of the bacterial type II secretion system. Mol Microbiol. 2021;115: 412–424. doi: 10.1111/mmi.14664 [DOI] [PubMed] [Google Scholar]
- 126.Liu X, Luo Y, Li Z, Wei G. Functional analysis of a putative type III secretion system in stress adaption by Mesorhizobium alhagi CCNWXJ12-2T. Front Microbiol. 2018;9: 263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Banerjee G, Quan FS, Mondal AK, Sur S, Banerjee P, Chattopadhyay P. Interrelation between Stress Management and Secretion Systems of Ralstonia solanacearum: An In Silico Assessment. Pathogens. 2022;11: 730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Souza DP, Oka GU, Alvarez-Martinez CE, Bisson-Filho AW, Dunger G, Hobeika L, et al. Bacterial killing via a type IV secretion system. Nat Commun. 2015;6: 6453. doi: 10.1038/ncomms7453 [DOI] [PubMed] [Google Scholar]
- 129.Bayer-Santos E, Cenens W, Matsuyama BY, Oka GU, Di Sessa G, Mininel ID, et al. The opportunistic pathogen Stenotrophomonas maltophilia utilizes a type IV secretion system for interbacterial killing. PLoS Pathog. 2019;15: e1007651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Weber B, Hasic M, Chen C, Wai SN, Milton DL. Type VI secretion modulates quorum sensing and stress response in Vibrio anguillarum. Environ Microbiol. 2009;11: 3018–3028. [DOI] [PubMed] [Google Scholar]
- 131.Yu KW, Xue P, Fu Y, Yang L. T6SS mediated stress responses for bacterial environmental survival and host adaptation. Int J Mol Sci. 2021;22: 478. doi: 10.3390/ijms22020478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Hufnagel DA, Evans ML, Greene SE, Pinkner JS, Hultgren SJ, Chapman MR. The catabolite repressor protein-cyclic AMP complex regulates csgD and biofilm formation in uropathogenic Escherichia coli. J Bacteriol. 2016;198: 3329–3334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Lasica AM, Ksiazek M, Madej M, Potempa J. The type IX secretion system (T9SS): highlights and recent insights into its structure and function. Front Cell Infect Microbiol. 2017;7: 215. doi: 10.3389/fcimb.2017.00215 [DOI] [PMC free article] [PubMed] [Google Scholar]