Skip to main content
mLife logoLink to mLife
. 2026 Feb 23;5(1):108–121. doi: 10.1002/mlf2.70066

A hot origin of dissimilatory sulfite reduction catalyzed by DsrAB in the Paleoarchean Era

Lingyun Tang 1, Zhenhao Luo 2, Shaoming Gao 3, Zhiliang Lin 3, Mengqi Sun 1, Runsheng Li 4, Shu‐Hong Gao 5, Geng Wu 6, Yiliang Li 7, Linan Huang 3,, Lu Fan 1,
PMCID: PMC12948486  PMID: 41767950

Abstract

Dissimilatory sulfite reduction (DSR) has been essential to microbial energy metabolism in the biogeochemical sulfur cycle since the Paleoarchean Era. However, due to the lack of an integrated assessment of geological record and genomic data, the evolutionary origin of DSR remains elusive in terms of time, habitat, and genetic basis. In this study, we reconstructed the evolutionary pathways and the ancestral sequences of Dsr proteins by mining metagenomes ranging from mesothermal to hyperthermal environments. A phylogenetic analysis of the key catalytic enzyme, DsrAB, and other Dsr proteins indicates that the earliest and most basic functional cascade, DsrABCNM, emerged prior to the latest common ancestor of several basal branching DsrAB clusters encoded by bacteria and archaea. Using a molecular dating strategy that calibrates the protein tree with a species tree, we predicted that the DSR originated 3.508 billion years ago (Ga). This finding strongly confirms the earliest geological evidence of DSR ( ~ 3.47 Ga). Further predictions from ancestral sequence reconstruction indicate that the optimal catalytic temperature of DsrA at the time of DSR origin was approximately 73°C, which is consistent with the petrographic and geochemical evidence in early Archean hydrothermal deposits. After its hot origin, DsrA diversified into subclades that adapted to various temperature levels following the Great Oxidation Event. This is exemplified by the evolution of the reductive archaeal‐type DsrA. Our results synchronize the molecular ages with the geological record, which advances our understanding of the earliest DSR systems and highlights the enzymatic adaptations of microbial life in the Archean biosphere.

Keywords: archaeal evolution, dsrAB, metagenomics, sulfite reduction, sulfur cycling

Impact statement

Sulfite reduction was an essential biogeochemical process on early Earth, but its evolutionary origin remains unclear. By developing a new molecular dating strategy, we estimate that the ubiquitous dissimilatory sulfite reductase DsrAB originated more than 3.508 billion years ago (Ga). This finding provides strong support for the oldest geological record of dissimilatory sulfite reduction (~3.47 Ga), which was discovered over two decades ago. Furthermore, ancestral protein reconstruction suggests that the first sulfite‐reducing microorganisms were likely thermophiles or moderate thermophiles, a conclusion consistent with geological evidence from Archean hydrothermal deposits.

INTRODUCTION

Sulfur played a vital role in the evolution of early life. H2S and SO2 were released into the Earth's surface by volcanic activity, providing the primary source of sulfur‐containing compounds in microbial metabolism on early Earth 1 , 2 . Modeling studies have also suggested that sulfites may have been presented at concentrations equal to or greater than those of sulfate in oxidant‐poor water prior to the Great Oxidation Event (GOE) 3 , 4 . In such an anoxic environment, microorganisms could reduce these widely available compounds, such as (thio)sulfate and sulfite, for anaerobic respiration 5 , contributing to the earliest biogeochemical sulfur cycle on our planet 6 . However, the origin of dissimilatory sulfite reduction (DSR) in microorganisms is not fully understood, including its geological time, environmental conditions, and the catalytic proteins involved.

Sulfur isotopic data suggest that sulfite reduction should have appeared before 3.5 Ga 2 , 7 . The discovery of sulfide in the barite deposits of the “Dresser Formation” in Australia is the earliest evidence of DSR; however, the ambient temperature of this bioactivity is debated 8 . Using the methods of quadruple isotope systematics and interfacial angles, researchers concluded that these sulfate/sulfite reduction microorganisms were likely mesophiles (i.e., organisms that thrive in mesothermal environments—ambient temperature from 15°C to 45°C) or moderate thermophiles (i.e., organisms that thrive in moderate thermal environments—ambient temperature from 45°C to 60°C) 8 , 9 . However, a recent study of stromatolites, microbial palisade fabric, and gas bubbles revealed that the Dresser Formation layer was originally from terrestrial hot spring areas 10 . Therefore, it remains unclear whether the earliest DSR occurred only in the mesothermal (15°C to 45°C) to moderate thermal (45°C to 60°C) outskirt environments of the springs or also at the hot (60°C to 80°C) to hyperthermal (>80°C) spring vents.

On the other hand, genomic and phylogenetic analyses have yet to conclude on the ambient temperature during the earliest stage of DSR. The enzymes currently known to be involved in DSR include dissimilatory sulfite reductase (Dsr), anaerobic sulfite reductase, and cytochrome c sulfite reductase 11 , 12 , 13 , 14 , 15 , 16 , 17 . Whereas the latter two are sporadically encoded by bacteria 15 , Dsr enzymes are widely distributed among bacteria and archaea. They are thought to be among the earliest enzymes to play an essential role in microbial energy metabolism on early Earth 8 , 16 . Previous phylogenetic analyses have generally assigned homologs of the two essential proteins in the Dsr cascade, Dsr subunits A and B (DsrAB), to four major subclades: the basal‐type DsrAB (function unknown), the reductive archaeal‐type DsrAB (RA‐DsrAB, which catalyzes sulfite to sulfur reduction and is exclusively found in Archaea), the reductive bacterial‐type DsrAB (which catalyzes sulfite to sulfur reduction and is exclusively found in Bacteria, with very few exceptions), and the oxidative bacterial‐type DsrAB (which catalyzes sulfur to sulfite oxidation and is exclusively found in Bacteria) 11 , 12 , 15 , 16 , 18 . Since there is still a lack of empirical validation that basal‐type DsrAB homologs function as sulfite reductases, RA‐DsrAB has been identified as the earliest form of functional Dsr 16 , 17 , 19 . Whereas RA‐DsrAB have been previously mainly found in Thermoplasmatota (formerly known as Euryarchaeota) and Thermoproteota archaea from hydrothermal environments 17 , 18 , 20 , 21 , 22 , 23 , 24 , recent metagenomic approaches have detected archaeal lineages such as Nitrososphaeria (formerly known as Thaumarchaeota) and Thermoplasmatota, which carry dsrAB genes in mesothermal environments, including acid mine drainage (AMD) 25 , 26 . This leaves the ambient temperature at the origin of the DSR still an open question.

Furthermore, the modulation trajectory of proteins in the early evolutionary history of the Dsr cascade remains unclear. DsrAB is a heterotetrameric complex that produces a DsrC‐trisulfide from sulfite and DsrC 27 . DsrC‐trisulfide is an intermediate that acts as the final acceptor of electrons from the DsrAB complex 12 . DsrN is necessary for aiding the siroheme cofactor in DsrAB, and it is highly conserved 28 . DsrM and DsrK form an electron‐donating membrane complex DsrMK, which catalyzes the reduction of DsrC‐trisulfide and releases sulfide and DsrC 29 . The protein combination DsrABCNMK is considered the minimal functional set of the Dsr cascade 28 . In a recent genomic and phylogenetic study 28 , this minimal set was first assembled in sulfite‐reductive archaea, and then transferred to sulfite‐reducing and sulfur‐oxidizing bacteria. Subsequently, with the acquisition of additional Dsr proteins, including the DsrJOP complex that binds to DsrMK, a membrane complex formed 28 , 30 . DsrD regulates the DsrAB complex 31 , 32 , DsrT regulates the expression of DsrMKJOP 15 , 33 , and the DsrEFH complex acts as a sulfur donor for DsrC in sulfur‐oxidizing microorganisms 34 , 35 . However, the primitive forms of the functional Dsr cascade that predate the archaeal‐type DsrAB are still unknown.

To investigate the origin and early evolution of DSR, we conducted a large‐scale evolutionary analysis of DsrAB homologs, focusing on basal and archaeal DsrAB types close to the root of the DsrAB phylogenetic tree. We analyzed the evolutionary route of DsrAB and other proteins in the Dsr cascade and conducted molecular dating to estimate the time of DSR origin in the DsrA tree. Additionally, we reconstructed ancestral sequences of DsrA to predict the optimal catalytic temperature (T opt) of the enzyme throughout the evolutionary tree. We then contrasted the findings of the molecular evolutionary analysis with geological records to verify the consistency of the timing and environments of DSR origin.

RESULTS

The evolutionary diversity of DsrAB

A total of 415 unique DsrA‐DsrB concatenated sequences (hereafter referred to as DsrAB) were identified in 376 genomes, including 314 from the Genome Taxonomy Database (GTDB) v207_2 36 and 62 from 92 AMD metagenomes (Table S1) 37 , 38 . Phylogenetic inference using paralogous rooting revealed that the DsrAB homologs formed four major clusters. These clusters were named basal type I DsrAB, basal type II DsrAB, RA‐DsrAB, transitional archaeal‐type DsrAB, reductive bacterial‐type DsrAB, and oxidative bacterial‐type DsrAB (Figure 1). Basal type I contained DsrAB homologs of bacteria belonging to phyla Chloroflexota and Firmicutes. Basal type II DsrAB contained homologs of bacteria belonging to phyla Verrucomicrobiota, Planctomycetoa, Chloroflexota, and Methylomirabilota, and an archaeon of Hydrothermoarchaeota profundi. A basal branching DsrAB of Acidobateriota g_Gp6‐AA40 sp016210845 was found between these two DsrAB types. All homologs of RA‐DsrAB were from archaeal phyla Thermoplasmatota or Thermoproteota, and copies of the transitional archaeal type were from archaeal phyla EX4484‐52, Hydrothermarchaeota, and Thermoproteota. All DsrAB homologs of reductive bacterial type and oxidative bacterial type were from bacteria, except some homologs of the reductive bacterial type, which were from Archaeoglobaceae.

Figure 1.

Figure 1

Phylogenetic analysis of DsrAB proteins and the origin of Dsr protein assemblies. The maximum likelihood phylogenetic tree (model automatically selected as LG + I + R9) of DsrAB (415 sequences, 970 alignment positions) is rooted by setting the clade of three Chloroflexota and eight Firmicutes DsrAB sequences (basal type I DsrAB) as the outgroup, according to the result of paralogous rooting. Branches with ultrafast bootstrap support values ≥95% are indicated by black dots. The branches of different types of DsrAB and the five monophyletic subgroups of RA‐DsrAB are colored differently. The presence of Dsr proteins in the genomes that encode DsrAB, and the Dsr proteins are placed in the same types of clusters in their specific phylogenetic trees, corresponding to the DsrAB types in the DsrAB tree (Figures S2S11), shown as filled circles. Filled circles of folded branches of bacterial types indicate ≥50% cases of filled circles. The taxonomic classification is derived from the GTDB taxonomic ranking: A = Archaea, B = Bacteria, and “10 phyla” = Chloroflexota, Proteobacteria, Spirochaetota, Actinobacteriota, Desulfobacterota, Nitrospirota, SZUA‐79, SAR324, Myxococcota, and Acidobacteriota. The classification is derived from the GTDB taxonomic ranking, where “f_” and “g_” represent the classification of family and genus, respectively, and are used before the name of a candidate ranking. The asterisks indicate that the DSR functions of these DsrAB proteins have been experimentally verified 17 , 39 , 40 . The proposed combination of Dsr proteins for the latest common ancestor (LCA) of each DsrAB type is shown at the bottom of the figure.

Whereas this topology of the DsrAB tree is consistent with previous studies 15 , 17 , 18 , 28 , the updated GTDB database and the AMD data have provided us more details, particularly with regard to the basal and archaeal DsrAB types. For instance, three homologs of Chloroflex branching as sisters of Firmicutes and two bacterial DsrAB belonging to Planctomycetota were added as basal type II DsrAB. Moreover, the RA‐DsrAB diverged as a sister clade of basal type II DsrAB. Based on the tree topology and the results of pairwise sequence alignments, we divided the RA‐DsrAB sequences into five monophyletic clades by setting 60% as the protein identity threshold between clades (Figures 1 and S1). Sequences of clades 1 and 5 were all encoded by Thermoplasmatota, while all sequences in clades 2, 3, and 4 were from Thermoproteota. Clade 3 was restricted to the family Thermocladiaceae and clade 4 was restricted to the family Thermoproteaceae, suggesting taxonomy‐specific diversification of RA‐DsrAB (Table S1). Furthermore, the environmental abundances of these clades were correlated with various physicochemical parameters with statistical support (Supplementary Information).

The evolution of the Dsr cascade assembly and the functional origin of DSR

To elucidate the evolutionary trajectory of the Dsr cascade assembly alongside the evolution of DsrAB, we reconstructed phylogenetic trees of Dsr proteins (Figure S2S11 and Table S2S3) and compared their topologies with those of DsrAB. If the topology of a Dsr protein tree follows the same pattern of the DsrAB tree as shown in Figure 1 (i.e., basal‐type DsrAB is the most deeply branched, followed by RA‐DsrAB, transitional archaeal‐type DsrAB, reductive bacterial‐type DsrAB, and oxidative bacterial‐type DsrAB), then the protein is considered to have an evolutionary link with DsrAB since its early diversification of DsrAB. Otherwise, the protein may have been recruited to the Dsr cascade during the later divergence of DsrAB (i.e., in the reductive bacterial and oxidative bacterial DsrAB types).

In trees of DsrC, DsrN, and DsrM, the homologs in the identical genomes of basal type II DsrAB, RA‐DsrAB, and transitional archaeal‐type DsrAB were clustered in monophyletic basal branching clades, with clear separation between the basal type II DsrAB and RA‐DsrAB (Figures S2S4). This result suggests that in the latest common ancestor (LCA) of basal type II DsrAB, a minimum set of DsrABCNM might already exist (Figure 1). In the DsrK tree, homologs of RA‐DsrAB branched in a deep basal clade, suggesting that DsrK was likely recruited to the Dsr cascade in the LCA of RA‐DsrAB (Figures 1 and S5). After that, DsrD, DsrJ, DsrO, DsrP, and DsrT probably joined in the LCA of RA‐DsrAB, and DsrEFH was incorporated into the cascade at the LCA of the oxidative bacterial‐type DsrAB (Figures 1 and S6S11).

Based on the likely patterns of the cascades in the LCAs of basal and archaeal DsrAB types, we infer a Dsr cascade pattern in the LCA of basal type II DsrAB and RA‐DsrAB homologs, with a minimum set of DsrABCNM (Figure 1). Despite the unresolved status of DsrK presence in this ancestral set, the comprehensive inclusion of all other key components of a functional DSR pathway is noteworthy. Furthermore, this configuration is distinctly different from that of the LCA of basal type I DsrAB lineages. Given the current absence of experimental evidence supporting the sulfite‐reducing activity of basal type I DsrAB copies 19 , 41 , it is plausible that microbial DSR originated after the branching out of basal type I DsrAB but prior to the LCA of basal type II DsrAB and RA‐DsrAB (the node indicated in Figure 1). We refer to this LCA node as the LCA of DSR hereafter.

Regarding the placement of the DSR LCA in the DsrAB tree, the deepest‐branching clade predominantly comprises homologs that are currently encoded by bacteria (Figure 1). However, we cannot confirm whether the microbial host that encoded this DsrAB was a bacterium or an archaeon.

The transfer of RA‐DsrAB‐encoding genes in archaea

The reconstruction of the evolutionary history of RA‐DsrAB proteins in the tree of archaea shows that the genes of these proteins were frequently transferred within and between the lineages of two archaeal phyla: Thermoplasmatota and Thermoproteota (Figures 2 and 3). Specifically, RA‐DsrAB proteins of clade 1 and clade 5 were transferred within the UBA184, ARK‐15, GCA‐001856825, and Thermoplasmataceae family‐level clades of the phylum Thermoplasmatota (Figure 2). Homologs of clade 3 were transferred to the Thermocladiaceae, UBA164, and UBA183 family‐level clades of the phylum Thermoproteota (Figure 3). Homologs of clade 2 were transferred to Thermocladiaceae of Thermoproteota, and homologs of clade 4 were transferred to Thermoproteaceae of Thermoproteota (Figure 3).

Figure 2.

Figure 2

Evolutionary time scales and habitat feature of Thermoplasmatota encoding RA‐DsrAB. The maximum likelihood phylogenetic tree of archaea was reconstructed using 47 marker genes (model set up as LG + PMSF + F + G, with 12,166 alignment positions). The root was determined by setting the four bacterial genomes as the outgroup. Only the Thermoplasmatota clade of the tree is shown here, and the Thermoproteota clade is shown in Figure 3 for typeset need. Molecular dating was conducted using RelTime‐ML with the LG + G model, setting four node constraints, shown as numbers and blue dots on the respective nodes (Table S4). The geological time scale is shown above the tree. The period of the Rodinia Assembly (RA) and the Great Oxidation Event (GOE) are illustrated in the geological time scale 7 , 42 . The 95% confidence interval (CI) of the molecular dating is shown as green bars on selected nodes. The presence (solid circles) and absence (empty circles) of DsrA and DsrB in archaeal genomes are shown. DsrA and/or DsrB in specific archaeal lineages are also shown by coloring respective branches. The colors of the RA‐DsrAB subclades are consistent with Figure 1. The transfer events of DsrAB obtained by reconciling the species tree and the DsrAB tree are shown as brown dots on the respective branches. The numbers in brown represent the time scale of the branches used as constraints for tree calibration in Figure 4. The name of each genome contains the classification and the NCBI accession number. This classification is derived from the GTDB taxonomic ranking, where “c_”, “o_”, “f_”, and “g_” represent the classifications of class, order, family, and genus, respectively, and are used before the name of a candidate ranking. The habitats from which the genomes were obtained, as well as the temperature of the habitats, are shown in colored boxes.

Figure 3.

Figure 3

Evolutionary time scales and habitat feature of Thermoproteota encoding RA‐DsrAB. Only the Thermoproteota clade of the tree described in Figure 2 is shown here for typeset.

Dating the evolutionary origin and diversification of basal‐type and reductive archaeal‐type DsrA

First, molecular dating was conducted on a species tree comprising representative archaeal families and additional archaeal taxa encoding DsrAB. Four nodes were set as time constraints for calibration (Figures 2 and 3). The root of archaea was dated to 3.460 Ga. The detectable transfers of RA‐DsrAB to these taxa occurred as early as 1.993 Ga and as late as recently.

The molecular dating results of the archaeal species tree were used to calibrate the DsrA tree. Specifically, five branches of the archaeal species tree containing major DsrAB transfer events (Figures 2 and 3) were selected based on the results of ALE and projected onto five nodes in the DsrA tree (Figure 4A). The time scales of the branches in the archaeal species tree were set as the time scales of the respective nodes in the DsrA tree. These five nodes and the root with an upper age of 4.38 Ga in the DsrA tree were used as constraints to calibrate the tree time scale (Table S4). The five constraint nodes include (1) the LCA of homologs encoded by g_ARK‐15 archaea (1.078 Ga–465.8 Ma), (2) the LCA of homologs encoded by three g_JACWAH01 archaea (716.6–465.6 Ma), (3) the LCA of homologs encoded by four f_UBA184 archaea (1.697–1.078 Ga), (4) the LCA of homologs of Thermoproteales (1.993–1.229 Ga), and (5) the LCA of homologs of Vulcanisaeta (1.027 Ga–424.6 Ma) (Figures 23 and 4A).

Figure 4.

Figure 4

Molecular dating of the basal‐type DsrA and RA‐DsrA. (A) The maximum likelihood phylogenetic tree (model automatically selected as LG + I + G4) reconstructed with representative DsrA and DsrB homologs from 74 DsrAB sequences with 289 amino acid sites. The DsrB clade was set as the outgroup (not shown here) and only the DsrA part of the tree is shown. The constraints of nodes in brown are transferred from Figures 2 and 3 (Table S4). Dating values on selected nodes are displayed with 95% CI values, which are shown as green bars. The colors of the basal‐type DsrA, the transitional archaeal‐type DsrA, and the RA‐DsrA subclades are consistent with Figure 1. The period of the RA and the GOE are illustrated in the geological timeline 7 , 42 . Geological periods are marked as Ecoarchean, Paleoarchean, Mesoarchean, Neoarchean, Paleoproterozoic, Mesoproterozoic, and Neoproterozoic. The triangle represents the geological evidence of microbial sulfate reduction in Archean (Table S10) 8 , 43 , 44 , 45 , 46 , 47 , 48 . The T opt values of the ancestral and extant protein sequences are shown at the internal nodes and the external nodes, respectively. Color shades indicate four temperature ranges. (B, C) Correlation of evolutionary distance and T opt in internode ancestral (B) or extant (C) DsrA proteins. Linear regression equations are shown.

Molecular dating results show that the root of DsrA dates back to 3.906 Ga in the Eoarchean Era (Figure 4A). The LCA of the basal type II DsrA and the RA‐DsrA, which is considered the LCA of DSR, was dated to 3.507 Ga in the early Paleoarchean Era, a time just before the two oldest geological records of the DSR (3.470 and 3.400 Ga) 43 , 44 (Figure 4A). Moreover, the LCA of extant RA‐DsrA was dated to 2.796 Ga at the Meso–Neoarchean boundary, prior to the GOE (2.5–2.4 to 2.1–2.0 Ga) 7 , 49 , 50 , 51 . The diversification of RA‐DsrA subclades occurred after this point, and each clade began to diverge from 1.335 Ga (clade 2) to 945.8 Ma (clade 5). This period spans from the mid‐Mesoproterozoic to the early Neoproterozoic, and coincides with the Rodinia Assembly (RA, 1.3 Ga to 900 Ma) 42 , 52 (Figure 4A).

The environmental temperature at the origin of DSR

The amino acid composition and secondary structure of microbial proteins are often optimized through evolution to adapt to their habitat temperature 53 , 54 . To track the thermal adaptation of DsrA throughout its evolutionary history, we predicted the potential T opt values (values considering both the protein sequence and structure) for both extant and ancestral DsrA sequences (Figure 4A and Table S5) 55 .

The root of the DsrA tree had a T opt value of 76.5°C (Figure 4A). The T opt of the LCA protein of DSR was 73°C, while the LCA proteins of basal type II DsrA and transitional archaeal‐type DsrA had a T opt of 74.1°C. The LCA of RA‐DsrA had a T opt value of 73.9°C. These results suggest that early versions of DsrA, including the one at the origin of the functional Dsr cascade in DSR, were likely adapted to hot environments.

The environmental adaptation history of RA‐DsrAB‐encoding archaea

While the LCA of the extant RA‐DsrA was likely adapted to a hot environment (73.9°C), copies of the RA‐DsrA clades were then frequently transferred between lineages of Thermoproteota and Thermoplasmatota in environments of various levels of ambient temperatures (Figure 4A). By calculating T opt of the ancestral protein sequences, we found that the evolutionary diversification of RA‐DsrA followed an overall tendency from hyperthermal to mesothermal environments after its origin. However, this overall tendency varied among RA‐DsrA subclades. The T opt values of the ancestral proteins and the evolutionary distance of their nodes to the LCA of RA‐DsrA were negatively correlated in clades 1, 2, and 5 (R 2  > 0.46, p < 0.1, Figure 4B), but weakly correlated in clade 3 (R 2 = 0.14, p = 0.2222), indicating that these DsrA proteins originally in hyperthermal and hot environments adapted to moderate thermal and mesothermal environments during the evolutionary process. In contrast, a positive correlation was found between T opt values and evolutionary distance in the DsrA ancestral proteins of clade 4 (R 2 = 0.29, p = 0.0062), suggesting that these proteins adapted to environments of even higher temperatures than their thermophilic ancestors (i.e., proteins adapted to environments with ambient temperature from 60°C to 80°C) during later evolution.

The extant genes of RA‐DsrA are encoded by archaea from habitats with different temperature and pH levels (Figure 2 and Figure 3). A similar correlation tendency was observed between T opt values and evolutionary distance to the LCA in extant RA‐DsrA proteins, but in clade 2, a weak positive correlation was observed (Figure 4C).

DISCUSSION

The time, environment, and molecular basis of the origin of DSR remain long‐standing scientific questions. Isotopic evidence suggests that DSR activities date back to Archean, with the oldest record being 3.47 Ga 8 . The DSR pathway, which is catalyzed by Dsr proteins, is considered to be among the oldest and most prevalent in microorganisms 15 , 16 . As dsr genes are transferred extensively between bacterial and archaeal lineages, tracing the origin of the functional Dsr protein assemblies and their microbial hosts is challenging. Although many studies have conducted phylogenetic analyses of Dsr proteins, including the key catalytic enzymes DsrAB, only one recent study has explored the assembly of Dsr proteins in the early stages of evolution 28 . According to this study, archaea may have been the first to develop the complete set of the DsrABCMK(N) complex required for sulfite reduction. By recruiting genomes encoding Dsr proteins from the updated public database, specifically metagenomes sampled in mesothermal environments, we recovered further details of the basal and reductive archaeal types of DsrAB and their co‐assembled Dsr proteins. Using this updated information, we discovered that the functional Dsr assembly for DSR, including essential components DsrABCNM, likely happened before the LCA of basal type II DsrAB, RA‐DsrAB, and transitional archaeal‐type DsrAB, with an age of 3.51 Ga (Figure 1 and Figure 4A). This timing is highly coherent with the oldest known geological record of DSR (3.47 Ga) 8 . Therefore, our results provide the most congruent evidence to date on the enzymatic basis of DSR and suggest that the original DSR was catalyzed by a minimum enzyme cascade comprising DsrABCNM in the Paleoarchean Era. The time of origin of DsrK remains unresolved due to limited data. It may have been present at the onset of DSR (3.51 Ga). Otherwise, it could have emerged later in the common ancestor of the RA‐DsrAB lineages and subsequently been transferred to bacteria containing reductive bacterial‐type DsrAB (Figure 1). The DsrK subunit, which harbors a [4Fe–4S] cluster, plays a crucial role in DSR by catalyzing the reduction of the DsrC‐trisulfide 29 . In the absence of DsrK, DsrM is likely incapable of supporting complete DsrC reduction. Nevertheless, DsrM may still function as a membrane‐bound electron transfer module, potentially interacting with alternative redox partners 30 .

Our result suggests that the host microorganism encoding DsrAB can be either a bacterium or an archaeon at the origin of DSR. Before the node of DSR LCA, all the branches were of DsrAB encoded by bacteria (Figure 1). After this node, one branch contained the basal type II DsrAB including bacterial sequences exclusively, except for one archaeal copy. The other branch comprised two purely archaeal clusters: RA‐DsrAB and the transitional archaeal‐type DsrAB. Due to the current limitations in data availability, it is difficult to determine if DsrAB was transferred horizontally from a bacterial donor to a bacterial or archaeal receptor at the DSR LCA node.

Compared to the extant DsrA in hyperthermal to mesothermal environments, the T opt of the predicted ancestral DsrA protein at the origin of DSR (73°C) suggested that the microbial host might live in or very close to the hot spring vents (Figures 2, 3, 4). However, it is worth noting that enzymes with a high T opt may remain functional in microorganisms living in a slightly lower ambient temperature (Figures 2 and 3). This observation can readily explain the estimated ambient temperature during the formation of the earliest geological records of DSR ( < 60°C) 8 . Although this finding corroborates the conclusion of a previous genomic study to some extent 17 , the unresolved (bootstrap <90) branch from mesothermal environments in the DsrAB tree of that study made its conclusion arguable. Moreover, the presence of dsrAB branching as basal clades encoded by extant thermophiles and hyperthermophiles (i.e., organisms that thrive at ambient temperature >80°C), as shown in that study, does not necessarily imply that DSR originated in hot and hyperthermal environments. Dsr genes were frequently transferred between bacteria and archaea. The copies of extant thermophilic and hyperthermophilic archaea may have been transferred from a donor archaea or bacteria that once lived in mesothermal and moderate thermal environments. Therefore, the temperature at the origin of Dsr cannot be addressed by directly referring to the habitat temperature of extant dsrA/dsrB‐carrying microorganisms. In this study, we applied an ancestral sequence analysis of extant DsrAB proteins to determine the origin of DSR. This approach recruited phylogenetic models to infer the ancestral protein sequences independently of the host microorganisms that encode them. The T opt values inferred from these ancestral sequences thus reflect the intrinsic features of the enzymes themselves in temperature adaptation.

While DSR probably emerged before the LCA of RA‐DsrAB, RA‐DsrAB is one of the most well‐adapted and diverse DsrAB types and remains the most ancient type for which there is experimental evidence of DSR activity. By the time of LCA of RA‐DsrAB, which is dated to 2.795 Ga, a minimum set of the DsrABCNMK cascade had been established and was generally preserved across its five subclades (Figure 1). After their evolutionary origins in hot environments, some RA‐DsrAB‐carrying archaea gradually adapted to mesothermal and moderate thermal environments, while others stayed in hot habitats or adapted to hyperthermal environments. During the Mesoarchean Era and the Neoarchean Era (3.2–2.5 Ga), the Earth's atmosphere began to accumulate free oxygen, likely driven by oxygenic photosynthesis, until the atmosphere was permanently oxidized in the GOE 7 , 56 . Throughout this gradual oxidation process, weathered sulfides were imported into diverse environments on a larger scale. The widespread availability of sulfur oxides under varied physicochemical conditions created numerous ecological niches for sulfite‐reducing archaea 2 , 6 , 57 , 58 , supporting an adaptive expansion of both RA‐DsrAB (Figure 4) and their archaeal hosts (Figures 2 and 3) after the GOE. In particular, during the formation of the early Neoproterozoic supercontinent Rodinia (1.3–0.9 Ga) 42 , 52 , 59 , increased orogenic events might have led to the exposure of sulfide mineral deposits 60 , which were oxidized to produce sulfuric acid and contributed to localized acidic environments. These conditions may have facilitated the emergence of AMD‐like environments and consequently the thriving of mesophiles and acidophiles encoding RA‐DsrAB 61 , 62 , 63 . Indeed, RA‐DsrAB in clades 1, 3, and 5 were transferred to the phyla Thermoplasmatota and Nitrososphaeria in mesothermal acidic environments between 1.340 and 1.205 Ga (Figures 2, 3, 4).

In summary, we reconstructed the phylogenetic trees of DsrAB and other Dsr proteins, and inferred that the core components DsrABCNM of the DSR pathway were already assembled in the Paleoarchean Era. Molecular dating of both species and protein trees alongside ancestral sequence reconstruction indicates that the DSR pathway likely originated at around 3.508 Ga with an estimated ancestral DsrA T opt of 73°C, supporting the hot origin hypothesis of DSR. Additionally, our analysis of RA‐DsrAB subclades revealed how enzymes and species have adapted to diverse geological environments over time. This work expands our understanding of the early evolutionary assembly of the DSR, but several questions remain open. The scarcity of basal‐type genomes limits the analytical resolution of the earliest steps in the Dsr cascade assembly process, leaving the archaeal versus bacterial origin unresolved. Future recovery of deep‐branching genomes, coupled with functional assays and tighter geochemical constraints, will be essential to refine our understanding of microbial DSR origin and early diversification.

MATERIALS AND METHODS

DsrA and DsrB homolog search

3,031 AMD MAGs and 65,703 GTDB representative genomes (207_v2) 28 were collected to search for DsrA and DsrB homologs. The AMD MAGs were obtained from data deposits of our two previous studies (NCBI BioProjects PRJNA666025 and PRJNA666095) 37 , 38 . 90 AMD sediment samples were collected from 18 mine sites across six provinces in southern China between August and October 2017, 37 and two tailings cores were collected from the Fankou Pb/Zn sulfidic mine tailings site in Shaoguan, Guangdong, China, in October 2017 (Figure S12A) 38 . The detailed DNA extraction procedure and physicochemical parameter assay methods can be found in two previous studies. Coding proteins were predicted from the AMD MAGs using Prodigal v2.6.3 with the following parameters: “‐p meta ‐g 11 ‐f gff ‐q –m” 64 . Taxonomic classification of these MAGs was performed using GTDB‐tk 207_v2 65 . The coding proteins and taxonomic classification of the GTDB representative genomes were obtained from the GTDB website. The hidden Markov models of DsrA and DsrB from the KOfam database (downloaded on July 4, 2024) 66 and the TIGRfam database (downloaded on July 10, 2024) 67 databases were used to search for DsrA and DsrB homologs using hmmscan v3.2.1 with the “‐T 223” option set for DsrA and the “‐T 205” option set for DsrB (K11180 for DsrA and K11181 for DsrB, TIGR02064 for DsrA, and TIGR02066 for DsrB) 15 , 68 , 69 . Only dsrA and dsrB gene pairs that were adjacent to each other within the same genome were retained, and dsrA and dsrB gene sequences of the same pair were concatenated for downstream analysis (dsrAB is used to refer to the concatenated dsrA and dsrB genes). We used CD‐HIT v4.6.8 70 to remove redundant sequences with the parameter set “‐c 0.75 ‐n 5.” Further verification for DsrA sequences was conducted based on the presence of both the conserved siroheme‐binding Cx5CXnCX3C motif and the [Fe4S4] cluster‐binding CX2CX2C motif, resulting in the retrieval of 415 concatenated DsrAB sequences 17 . Multiple DsrAB copies within each genome were flagged (Table S1). Annotations of other Dsr proteins were conducted using Disco 71 and SCycDB 72 . The CbiA, CobB, and CfbB sequences were selected according to a previous study for analyzing the phylogeny of DsrN 28 .

Phylogenetic trees of Dsr proteins

The multiple sequence alignment of DsrAB, DsrC, DsrN, DsrM, DsrK, DsrJ, DsrO, DsrP, DsrD, DsrT, and DsrEFH proteins was constructed using MAFFT v7.505 with default parameters 73 . Gaps were removed using TrimAl v1.4.rev15 (“‐gt 0.95 ‐cons 50”) 74 . The maximum likelihood tree was reconstructed using IQTree v.2.2.0.3 (“‐m MFP ‐B 1000 –bnni”) 75 and visualized in iTOL v7 76 . Treemmer v0.3 (‐RTL 0.3) 77 was then used to reduce the number of branches in the trees of reductive bacterial‐type DsrAB and oxidative bacterial‐type DsrAB to 31. Paralogous rooting was conducted as suggested in a previous study to determine the root of this DsrAB concatenated tree 17 . This involved the pooling of paired DsrA and DsrB sequences that were not concatenated, followed by the construction of a phylogenetic tree using the same method as that for the concatenated DsrAB tree. Paralogous rooting was performed using an equivalent number of DsrA and DsrB subunits from the concatenated DsrAB tree. The monophyletic clade, comprising three Chloroflexota and eight Firmicutes sequences (sequences from the basal type I DsrAB), emerged as the most basal group for both the DsrA and the DsrB branches, consistent with previous studies (Figure S13) 11 , 41 . Therefore, we used these 11 taxa as the outgroup to root the DsrAB concatenation tree. The DsrC tree was rooted with the basal and archaeal clusters as counterparts of the DsrAB tree (Figure S2). CbiA, CobB, and CfbB sequences were used as outgroups to root the DsrN tree (Figure S3). DsrE, DsrF, and DsrH trees were also rooted using the paralogous rooting method (Figure S11). Other phylogenies were rooted at their midpoints.

Ancestral sequence generation and feature prediction

Amino acid sequences at internal nodes of the DsrA phylogenetic tree were computationally inferred from the sequence alignment and tree topology using the codelml program in the PAML v4.9 package 78 . The Bayesian statistical framework, incorporating a Gamma distribution, was used to infer the posterior amino acid probability per site. No molecular clock was set during this analysis. The universal code (icode = 0) and the fixed branch length option (Mgene = 0) were used. T opt of ancestral and extant DsrAB sequences were predicted using DeepET 55 .

Phylogenetic inference of the species tree

A phylogenetic species tree of representative archaeal and bacterial lineages encoding RA‐dsrAB was reconstructed. Specifically, the quality of 12 nonredundant RA‐dsrAB‐encoding AMD MAGs (CD‐HIT “‐c 0.75 ‐n 5”) was assessed using CheckM v1.1.3 79 . Those with completeness >75% and contamination <5% were included (Table S6). For GTDB genomes (r220), those with completeness >75% and contamination <5% were selected for downstream analysis. Two GTDB‐obtained genomes with the highest‐quality values, calculated as “completeness – 4 × contamination” as suggested in a previous study 80 , were selected as representatives of each archaeal phylum (except for the phyla Thermoplasmatota and Thermoproteota, in which two representative genomes were selected per class). Family‐level GTDB‐obtained genomes carrying dsrAB in Thermoplasmatota and Thermoproteota were further selected. Four bacterial genomes were randomly selected and downloaded from the GTDB website as outgroup species. Finally, 306 high‐quality genomes were selected for downstream analysis. Orthologs were identified using OMA v2.6.0 81 , and 47 candidate marker genes were retained based on their presence in 261 (85%) of the 306 high‐quality genomes. The marker genes were annotated using the NCBI BLAST website, and the best hit for each marker gene is shown in Table S7. The multiple sequence alignments of these candidate individual gene trees were generated using MAFFT with default parameters, followed by removal of the alignment gaps using TrimAl (“‐gappyout”). All the candidate marker genes were included after using the scripts tre_make_split.pl and tre_discordance_two.pl provided in a previous study to confirm the topological concordance between individual gene trees 82 . Multiple sequence alignments of individual marker genes were concatenated using catfasta2phyml.pl (https://github.com/nylander/catfasta2phyml). Compositional heterozygous sites of the concatenated marker genes (0%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, and 40%) were removed using the script alignment_pruner.pl, which was provided in a previous study 83 . Species trees were then inferred using IQTree with the parameters “‐m LG + PMSF + F + G ‐B 1000 –bnni.”

Molecular dating of the archaeal species tree and the DsrA tree

The node divergence times of the archaeal species tree were estimated in MEGA v11.0.13 using the RelTime‐ML and the LG + G models 84 . The root of archaea (4.38–3.46 Ga) 85 and three nodes related to the GOE (i.e., the root of Thermoproteales, Thermoproteaceae, and Thermocladiaceae (except g__JAADER01), <2.32 Ga) 86 , 87 were selected as calibration constraints (Table S4). To determine the latter three, we used hmmsearch v3.2.1 with the option “domE 1‐e5” to search for the cytochrome oxidases (K02274 and K02275) and cytochrome bd quinol oxidases (K00425 and K00426) in aerobic archaea and identify clades with anaerobic ancestors (Figure S14 and Table S8). These taxa would have evolved from anaerobic to aerobic lifestyles after the rise of atmospheric oxygen 2.32 Ga 87 .

For the molecular dating of the DsrA tree, DsrA and DsrB in 306 genomes were searched using hmmscan v3.2.1 with the Hidden Markov Models in the KOfam (K11180 for DsrA and K11181 for DsrB) 66 and TIGRfam (TIGR02064 for DsrA and TIGR02066 for DsrB) 67 databases, with the option “‐T 223” for DsrA and “‐T 205” for DsrB, respectively 15 , 68 , 69 . Neighboring DsrA and DsrB sequences within the same genome were retained, and an equivalent number of representative DsrA and DsrB sequences were obtained for multiple sequence alignment using MAFFT with default parameters 73 . Alignment gaps were removed using TrimAl (“‐gt 0.95 ‐cons 50”) 74 , and the maximum likelihood tree was reconstructed using IQTree (“‐m MFP ‐B 1000 –bnni”) 75 . The DsrB clade was used as the outgroup for rooting. The results of the species tree dating were used as constraints for the DsrA tree dating (Table S4). A DsrAB concatenated tree was reconstructed following the same steps as those for the DsrA tree, but with concatenated DsrA and DsrB sequences in the multiple sequence alignment.

The reconciliation of the DsrAB gene tree with the species tree was conducted using ALE v1.0 (https://github.com/ssolo/ALE), and the outputs with frequency of transfer events less than 0.3 were filtered out (Table S9). By screening the branch where DsrAB transfer events were detected, the time range of the branch on the species tree was set as the constraint for the LCA node of the corresponding clade on the DsrA tree. Only clades that contained three or more branches were used as constraints. The tree files were visualized using iTOL 76 and tvBOT v2.6.1 88 .

Gene abundance calculation of dsrAB clades in AMD metagenomes

Pairwise sequence alignments of the DsrA proteins in the DsrAB tree were performed using Diamond blastp 89 with the parameter “‐‐more‐sensitive ‐‐max‐target‐seqs. 60.” Heatmaps of pairwise sequence identities were then plotted to assess the identity thresholds between the five DsrA clades (Figure S1). An identity threshold of 60 was chosen to discriminate between DsrA proteins of different clades.

Quality control of the raw reads from the AMD sediments was conducted as previously described 37 , and high‐quality reads were assembled into contigs using SPAdes v3.14.1 90 . The abundance of the contigs in the AMD samples was calculated using CoverM 91 with minimap2‐sr. The coverage method was trimmed_mean, which calculates the average number of aligned reads overlapping each position after removing the deepest and shallowest covered positions 91 . The DsrA proteins in the DsrAB tree were searched against the coding proteins of the contigs using Diamond blastp with the parameter “‐k 5 ‐e 1e‐5 ‐‐id 60.” The abundance of the resulting DsrA proteins was defined as the abundance of the genomic contigs encoding them.

Statistical analyses

All statistical analyses were performed in the R environment (v.4.3.2) using R packages. Specifically, Spearman correlation analyses were performed using the “corrplot” function in the corrplot package (v 0.92) 92 to assess the relationships between abundances of dsrAB‐carrying archaeal populations and physicochemical factors in 90 AMD sediment samples. Redundancy analysis (RDA) was performed to show the community structure of dsrAB‐carrying archaea in response to physicochemical factors using the vegan package (v 2.6‐4) 93 . Prior to RDA, the microbial abundance matrix was standardized using the Hellinger method with the “decostand” function, and the environmental factor matrix was standardized by taking logarithms. The “envfit” function, which fits environmental factors to species distributions, was used to screen environmental factors in RDA. Environmental factors with p < 0.001 were retained. The Monte Carlo permutation test with the “permutest” function was used to evaluate the RDA results (p < 0.001). Nonparametric multivariate analysis of variance (Adonis) was used to test the abundances of dsrAB‐carrying archaea in different mineral types containing >10 samples (i.e., Cu, Pb/Zn, and Polymetallic) using vegan. The distance between the internal node or branch and the LCA of the RA‐DsrAB was calculated using the “dist. node” function of ggpmisc (v 0.6.0) 94 . For each of the five clades, linear regressions were performed between the T opt values predicted for the ancestral proteins of the internal nodes or the extant proteins of each branch, and the evolutionary distance of each internal node or branch to the last common ancestor. The lm function was used to calculate the equations of the linear regression curves, and to extract the R 2 and p values. Data visualization was performed using the ggplot2 package (v3.3.4) and the gglayer package (v0.0.4) 95 , 96 . We used analysis of variance (ANOVA) to analyze the differences in the abundance of the five RA‐DsrAB clades among different mineral types or sampling sites.

AUTHOR CONTRIBUTIONS

Lingyun Tang: Data curation; formal analysis; investigation; methodology; visualization; writing—original draft. Zhenhao Luo: Data curation; funding acquisition; methodology. Shaoming Gao: Writing—review and editing. Zhiliang Lin: Data curation; methodology. Mengqi Sun: Funding acquisition; writing—review and editing. Runsheng Li: Writing—review and editing. Shu‐Hong Gao: Data curation; methodology. Geng Wu: Writing—review and editing. Yiliang Li: Writing—review and editing. Linan Huang: Conceptualization; funding acquisition; writing—review and editing. Lu Fan: Conceptualization; funding acquisition; investigation; project administration; supervision; writing—review and editing.

ETHICS STATEMENT

The authors have nothing to report.

CONFLICT OF INTERESTS

The authors declare no conflict of interests.

Supporting information

Supplementary Information.

MLF2-5-108-s002.pdf (13.1MB, pdf)

Supplementary Table.

MLF2-5-108-s001.xlsx (306.1KB, xlsx)

ACKNOWLEDGMENTS

We thank Bu Xu, Wenxiu Wang, and Qi Li for suggestions in bioinformatics analysis. Computation in this study was supported by the Center for Computational Science and Engineering at the Southern University of Science and Technology. This study was supported by the National Key R&D Program of China (No. 2024YFA0919700 to L.F.), the National Natural Science Foundation of China (Nos. 32300001 to Z. Luo and 42206122 to M.S.), the Natural Science Foundation of Guangdong Province (No. 2025A1515010753 to L.H.), the Guangdong Basic and Applied Basic Research Foundation (No. 2021B1515120080 to L.F.), and the Shenzhen Key Laboratory of Marine Archaea Geo‐Omics, Southern University of Science and Technology (No. ZDSYS201802081843490 to L.F.).

Contributor Information

Linan Huang, Email: eseshln@mail.sysu.edu.cn.

Lu Fan, Email: fanl@sustech.edu.cn.

DATA AVAILABILITY

The tree files and the nexus files of tree dating can be found at Figshare (https://doi.org/10.6084/m9.figshare.30351535).

REFERENCES

  • 1. Blank CE. Evolutionary timing of the origins of mesophilic sulphate reduction and oxygenic photosynthesis: a phylogenomic dating approach. Geobiology. 2004;2:1–20. [Google Scholar]
  • 2. Havig JR, Hamilton TL, Bachan A, Kump LR. Sulfur and carbon isotopic evidence for metabolic pathway evolution and a four‐stepped earth system progression across the archean and paleoproterozoic. Earth‐Sci Rev. 2017;174:1–21. [Google Scholar]
  • 3. Ranjan S, Todd ZR, Sutherland JD, Sasselov DD. Sulfidic anion concentrations on early Earth for surficial origins‐of‐life chemistry. Astrobiology. 2018;18:1023–1040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Halevy I. Production, preservation, and biological processing of mass‐independent sulfur isotope fractionation in the Archean surface environment. Proc Natl Acad Sci USA. 2013;110:17644–17649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Zhou Z, Tran PQ, Cowley ES, Trembath‐Reichert E, Anantharaman K. Diversity and ecology of microbial sulfur metabolism. Nat Rev Microbiol. 2025;23:122–140. [DOI] [PubMed] [Google Scholar]
  • 6. Mateos K, Chappell G, Klos A, Le B, Boden J, Stüeken E, et al. The evolution and spread of sulfur cycling enzymes reflect the redox state of the early Earth. Sci Adv. 2023;9:eade4847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Lyons TW, Tino CJ, Fournier GP, Anderson RE, Leavitt WD, Konhauser KO, et al. Co‐evolution of early Earth environments and microbial life. Nat Rev Microbiol. 2024;22:572–586. [DOI] [PubMed] [Google Scholar]
  • 8. Shen Y, Buick R, Canfield DE. Isotopic evidence for microbial sulphate reduction in the early Archaean era. Nature. 2001;410:77–81. [DOI] [PubMed] [Google Scholar]
  • 9. Shen Y, Farquhar J, Masterson A, Kaufman AJ, Buick R. Evaluating the role of microbial sulfate reduction in the early Archean using quadruple isotope systematics. Earth Planet Sci Lett. 2009;279:383–391. [Google Scholar]
  • 10. Djokic T, Van Kranendonk MJ, Campbell KA, Walter MR, Ward CR. Earliest signs of life on land preserved in ca. 3.5 Ga hot spring deposits. Nat Commun. 2017;8:15263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Loy A, Duller S, Baranyi C, Mußmann M, Ott J, Sharon I, et al. Reverse dissimilatory sulfite reductase as phylogenetic marker for a subgroup of sulfur‐oxidizing prokaryotes. Environ Microbiol. 2009;11:289–299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Santos AA, Venceslau SS, Grein F, Leavitt WD, Dahl C, Johnston DT, et al. A protein trisulfide couples dissimilatory sulfate reduction to energy conservation. Science. 2015;350:1541–1545. [DOI] [PubMed] [Google Scholar]
  • 13. Wagner M, Roger AJ, Flax JL, Brusseau GA, Stahl DA. Phylogeny of dissimilatory sulfite reductases supports an early origin of sulfate respiration. J Bacteriol. 1998;180:2975–2982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Zverlov V, Klein M, Lücker S, Friedrich MW, Kellermann J, Stahl DA, et al. Lateral gene transfer of dissimilatory (bi)sulfite reductase revisited. J Bacteriol. 2005;187:2203–2208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Anantharaman K, Hausmann B, Jungbluth SP, Kantor RS, Lavy A, Warren LA, et al. Expanded diversity of microbial groups that shape the dissimilatory sulfur cycle. ISME J. 2018;12:1715–1728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Müller AL, Kjeldsen KU, Rattei T, Pester M, Loy A. Phylogenetic and environmental diversity of DsrAB‐type dissimilatory (bi)sulfite reductases. ISME J. 2015;9:1152–1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Colman DR, Lindsay MR, Amenabar MJ, Fernandes‐Martins MC, Roden ER, Boyd ES. Phylogenomic analysis of novel Diaforarchaea is consistent with sulfite but not sulfate reduction in volcanic environments on early Earth. ISME J. 2020;14:1316–1331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Diao M, Dyksma S, Koeksoy E, Ngugi DK, Anantharaman K, Loy A, et al. Global diversity and inferred ecophysiology of microorganisms with the potential for dissimilatory sulfate/sulfite reduction. FEMS Microbiol Rev. 2023;47:1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Rosenbaum FP, Poehlein A, Daniel R, Müller V. Energy‐conserving dimethyl sulfoxide reduction in the acetogenic bacterium Moorella thermoacetica . Environ Microbiol. 2022;24:2000–2012. [DOI] [PubMed] [Google Scholar]
  • 20. Ayala‐Muñoz D, Burgos WD, Sánchez‐España J, Couradeau E, Falagán C, Macalady JL. Metagenomic and metatranscriptomic study of microbial metal resistance in an acidic pit lake. Microorganisms. 2020;8:1350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Carr SA, Jungbluth SP, Eloe‐Fadrosh EA, Stepanauskas R, Woyke T, Rappé MS, et al. Carboxydotrophy potential of uncultivated Hydrothermarchaeota from the subseafloor crustal biosphere. ISME J. 2019;13:1457–1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Hua ZS, Qu YN, Zhu Q, Zhou EM, Qi YL, Yin YR, et al. Genomic inference of the metabolism and evolution of the archaeal phylum Aigarchaeota . Nat Commun. 2018;9:2832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Lin X, Handley KM, Gilbert JA, Kostka JE. Metabolic potential of fatty acid oxidation and anaerobic respiration by abundant members of Thaumarchaeota and Thermoplasmata in deep anoxic peat. ISME J. 2015;9:2740–2744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. McKay LJ, Dlakić M, Fields MW, Delmont TO, Eren AM, Jay ZJ, et al. Co‐occurring genomic capacity for anaerobic methane and dissimilatory sulfur metabolisms discovered in the Korarchaeota . Nat Microbiol. 2019;4:614–622. [DOI] [PubMed] [Google Scholar]
  • 25. Luo ZH, Li Q, Xie YG, Lv AP, Qi YL, Li MM, et al. Temperature, pH, and oxygen availability contributed to the functional differentiation of ancient Nitrososphaeria . ISME J. 2024;18:wrad031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Wu Z, Yang X, Huang L, Li S, Xia F, Qiu Y, et al. In situ enrichment of sulphate‐reducing microbial communities with different carbon sources stimulating the acid mine drainage sediments. Sci Total Environ. 2023;898:165584. [DOI] [PubMed] [Google Scholar]
  • 27. Colman DR, Labesse G, Swapna GVT, Stefanakis J, Montelione GT, Boyd ES, et al. Structural evolution of the ancient enzyme, dissimilatory sulfite reductase. Proteins. 2022;90:1331–1345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Neukirchen S, Pereira IAC, Sousa FL. Stepwise pathway for early evolutionary assembly of dissimilatory sulfite and sulfate reduction. ISME J. 2023;17:1680–1692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Venceslau SS, Stockdreher Y, Dahl C, Pereira IAC. The “bacterial heterodisulfide” DsrC is a key protein in dissimilatory sulfur metabolism. Biochim Biophys Acta. 2014;1837:1148–1164. [DOI] [PubMed] [Google Scholar]
  • 30. Grein F, Venceslau SS, Schneider L, Hildebrandt P, Todorovic S, Pereira IAC, et al. DsrJ, an essential part of the DsrMKJOP transmembrane complex in the purple sulfur bacterium Allochromatium vinosum, is an unusual triheme cytochrome c . Biochemistry. 2010;49:8290–8299. [DOI] [PubMed] [Google Scholar]
  • 31. Rabus R, Venceslau SS, Wöhlbrand L, Voordouw G, Wall JD, Pereira IAC. A post‐genomic view of the ecophysiology, catabolism and biotechnological relevance of sulphate‐reducing prokaryotes. In: Advances in Microbial Physiology. Elsevier; 2015. p. 55–321. [DOI] [PubMed] [Google Scholar]
  • 32. Ferreira D, Barbosa ACC, Oliveira GP, Catarino T, Venceslau SS, Pereira IAC. The DsrD functional marker protein is an allosteric activator of the DsrAB dissimilatory sulfite reductase. Proc Natl Acad Sci USA. 2022;119:e2118880119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Akbar S, Gaidenko TA, Kang CM, O'Reilly M, Devine KM, Price CW. New family of regulators in the environmental signaling pathway which activates the general stress transcription factor ς B of Bacillus subtilis . J Bacteriol. 2001;183:1329–1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Stockdreher Y, Venceslau SS, Josten M, Sahl H‐G, Pereira IAC, Dahl C. Cytoplasmic sulfurtransferases in the purple sulfur bacterium Allochromatium vinosum: evidence for sulfur transfer from DsrEFH to DsrC. PLoS One. 2012;7:e40785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Thorup C, Schramm A, Findlay AJ, Finster KW, Schreiber L. Disguised as a sulfate reducer: growth of the Deltaproteobacterium Desulfurivibrio alkaliphilus by sulfide oxidation with nitrate. mBio. 2017;8:e00671‐17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Parks DH, Chuvochina M, Rinke C, Mussig AJ, Chaumeil P‐A, Hugenholtz P. GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome‐based taxonomy. Nucleic Acids Res. 2022;50:D785–D794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Gao S, Paez‐Espino D, Li J, Ai H, Liang J, Luo Z, et al. Patterns and ecological drivers of viral communities in acid mine drainage sediments across Southern China. Nat Commun. 2022;13:2389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Luo ZH, Li Q, Chen N, Tang LY, Liao B, Yang TT, et al. Genome‐resolved metagenomics reveals depth‐related patterns of microbial community structure and functions in a highly stratified, AMD overlaying mine tailings. J Hazard Mater. 2023;447:130774. [DOI] [PubMed] [Google Scholar]
  • 39. Huber R, Kristjansson JK, Stetter KO. Pyrobaculum gen. nov., a new genus of neutrophilic, rod‐shaped archaebacteria from continental solfataras growing optimally at 100°C. Arch Microbiol. 1987;149:95–101. [Google Scholar]
  • 40. Larsen Ø, Lien T, Birkeland N. Dissimilatory sulfite reductase from Archaeoglobus profundus and Desulfotomaculum thermocisternum: phylogenetic and structural implications from gene sequences. Extremophiles. 1999;3:63–70. [DOI] [PubMed] [Google Scholar]
  • 41. Pierce E, Xie G, Barabote RD, Saunders E, Han CS, Detter JC, et al. The complete genome sequence of Moorella thermoacetica (f. Clostridium thermoaceticum) . Environ Microbiol. 2008;10:2550–2573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Cawood PA, Strachan RA, Pisarevsky SA, Gladkochub DP, Murphy JB. Linking collisional and accretionary orogens during Rodinia assembly and breakup: implications for models of supercontinent cycles. Earth Planet Sci Lett. 2016;449:118–126. [Google Scholar]
  • 43. Ueno Y, Ono S, Rumble D, Maruyama S. Quadruple sulfur isotope analysis of ca. 3.5 Ga dresser formation: new evidence for microbial sulfate reduction in the early Archean. Geochim Cosmochim Acta. 2008;72:5675–5691. [Google Scholar]
  • 44. Roerdink DL, Mason PRD, Farquhar J, Reimer T. Multiple sulfur isotopes in Paleoarchean barites identify an important role for microbial sulfate reduction in the early marine environment. Earth Planet Sci Lett. 2012;331–332:177–186. [Google Scholar]
  • 45. Wacey D, Kilburn MR, Saunders M, Cliff J, Brasier MD. Microfossils of sulphur‐metabolizing cells in 3.4‐billion‐year‐old rocks of Western Australia. Nat Geosci. 2011;4:698–702. [Google Scholar]
  • 46. Vysotskiy S, Velivetskaya T, Ignatiev A, Slabunov A, Aseeva A. Multiple sulfur isotope evidence for bacterial sulfate reduction and sulfate disproportionation operated in Mesoarchaean rocks of the Karelian Craton. Minerals. 2022;12:1143. [Google Scholar]
  • 47. Zhelezinskaia I, Kaufman AJ, Farquhar J, Cliff J. Large sulfur isotope fractionations associated with Neoarchean microbial sulfate reduction. Science. 2014;346:742–744. [DOI] [PubMed] [Google Scholar]
  • 48. Kakegawa T, Kawai H, Ohmoto H. Origins of pyrites in the ∼2.5 Ga Mt. McRae shale, the Hamersley District, Western Australia. Geochim Cosmochim Acta. 1998;62:3205–3220. [Google Scholar]
  • 49. Uveges BT, Izon G, Junium CK, Ono S, Summons RE. Aerobic nitrogen cycle 100 My before permanent atmospheric oxygenation. Proc Natl Acad Sci USA. 2025;122:e2423481122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Mayika KB, Moussavou M, Prave AR, Lepland A, Mbina M, Kirsimäe K. The Paleoproterozoic Francevillian succession of Gabon and the Lomagundi‐Jatuli event. Geology. 2020;48:1099–1104. [Google Scholar]
  • 51. Anbar AD, Buick R, Gordon GW, Johnson AC, Kendall B, Lyons TW, et al. Technical comment on “Reexamination of 2.5‐Ga ‘whiff’ of oxygen interval points to anoxic ocean before GOE. Sci Adv. 2023;9:eabq3736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Li ZX, Bogdanova SV, Collins AS, Davidson A, De Waele B, Ernst RE, et al. Assembly, configuration, and break‐up history of Rodinia: a synthesis. Precambrian Res. 2008;160:179–210. [Google Scholar]
  • 53. Hickey DA, Singer GA. Genomic and proteomic adaptations to growth at high temperature. Genome Biol. 2004;5:117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Zeldovich KB, Berezovsky IN, Shakhnovich EI. Protein and DNA sequence determinants of thermophilic adaptation. PLoS Comput Biol. 2007;3:e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Li G, Buric F, Zrimec J, Viknander S, Nielsen J, Zelezniak A, et al. Learning deep representations of enzyme thermal adaptation. Prot Sci. 2022;31:e4480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Planavsky NJ, Asael D, Hofmann A, Reinhard CT, Lalonde SV, Knudsen A, et al. Evidence for oxygenic photosynthesis half a billion years before the Great Oxidation Event. Nat Geosci. 2014;7:283–286. [Google Scholar]
  • 57. Donald E, Canfield R. Raiswell. The evolution of the sulfur cycle. Am J Sci. 1999;299:697. [Google Scholar]
  • 58. Fakhraee M, Hancisse O, Canfield DE, Crowe SA, Katsev S. Proterozoic seawater sulfate scarcity and the evolution of ocean–atmosphere chemistry. Nat Geosci. 2019;12:375–380. [Google Scholar]
  • 59. Dickin AP, McNutt RH, Martin C, Guo A. The extent of juvenile crust in the Grenville Province: Nd isotope evidence. Geol Soc Am Bull. 2010;122:870–883. [Google Scholar]
  • 60. Liu C, Runyon SE, Knoll AH, Hazen RM. The same and not the same: ore geology, mineralogy and geochemistry of Rodinia assembly versus other supercontinents. Earth‐Sci Rev. 2019;196:102860. [Google Scholar]
  • 61. Konhauser KO, Lalonde SV, Planavsky NJ, Pecoits E, Lyons TW, Mojzsis SJ, et al. Aerobic bacterial pyrite oxidation and acid rock drainage during the Great Oxidation Event. Nature. 2011;478:369–373. [DOI] [PubMed] [Google Scholar]
  • 62. Stüeken EE, Catling DC, Buick R. Contributions to late Archaean sulphur cycling by life on land. Nat Geosci. 2012;5:722–725. [Google Scholar]
  • 63. Havig JR, Grettenberger C, Hamilton TL. Geochemistry and microbial community composition across a range of acid mine drainage impact and implications for the Neoarchean‐Paleoproterozoic transition. J Geophys Res Biogeosci. 2017;122:1404–1422. [Google Scholar]
  • 64. Hyatt D, Chen G‐L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Chaumeil P‐A, Mussig AJ, Hugenholtz P, Parks DH. GTDB‐Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2020;36:1925–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Aramaki T, Blanc‐Mathieu R, Endo H, Ohkubo K, Kanehisa M, Goto S, et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics. 2020;36:2251–2252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Li W, O'Neill KR, Haft DH, DiCuccio M, Chetvernin V, Badretdin A, et al. RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation. Nucleic Acids Res. 2020;49:1020–1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Anantharaman K, Brown CT, Hug LA, Sharon I, Castelle CJ, Probst AJ, et al. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nat Commun. 2016;7:13219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Eddy SR. Accelerated profile HMM searches. PLoS Comput Biol. 2011;7:e1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Fu L, Niu B, Zhu Z, Wu S, Li W. CD‐HIT: accelerated for clustering the next‐generation sequencing data. Bioinformatics. 2012;28:3150–3152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Neukirchen S, Sousa FL. DiSCo: a sequence‐based type‐specific predictor of Dsr‐dependent dissimilatory sulphur metabolism in microbial data. Microb Genom. 2021;7:000603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Yu X, Zhou J, Song W, Xu M, He Q, Peng Y, et al. SCycDB: a curated functional gene database for metagenomic profiling of sulphur cycling pathways. Mol Ecol Resour. 2021;21:924–940. [Google Scholar]
  • 73. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Capella‐Gutiérrez S, Silla‐Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large‐scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ‐TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37:1530–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Letunic I, Bork P. Interactive tree of life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res. 2024;52:W78–W82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Menardo F, Loiseau C, Brites D, Coscolla M, Gygli SM, Rutaihwa LK, et al. Treemmer: a tool to reduce large phylogenetic datasets with minimal loss of diversity. BMC Bioinformatics. 2018;19:164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–1591. [DOI] [PubMed] [Google Scholar]
  • 79. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Woodcroft BJ, Singleton CM, Boyd JA, Evans PN, Emerson JB, Zayed AAF, et al. Genome‐centric view of carbon processing in thawing permafrost. Nature. 2018;560:49–54. [DOI] [PubMed] [Google Scholar]
  • 81. Altenhoff AM, Warwick Vesztrocy A, Bernard C, Train C‐M, Nicheperovich A, Prieto Baños S, et al. OMA orthology in 2024: improved prokaryote coverage, ancestral and extant GO enrichment, a revamped synteny viewer and more in the OMA ecosystem. Nucleic Acids Res. 2024;52:D513–D521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Schön ME, Martijn J, Vosseberg J, Köstlbacher S, Ettema TJG. The evolutionary origin of host association in the Rickettsiales . Nat Microbiol. 2022;7:1189–1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Martijn J, Vosseberg J, Guy L, Offre P, Ettema TJG. Deep mitochondrial origin outside the sampled Alphaproteobacteria . Nature. 2018;557:101–105. [DOI] [PubMed] [Google Scholar]
  • 84. Tamura K, Tao Q, Kumar S. Theoretical foundation of the RelTime method for estimating divergence times from variable evolutionary rates. Mol Biol Evol. 2018;35:1770–1782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Valley JW, Cavosie AJ, Ushikubo T, Reinhard DA, Lawrence DF, Larson DJ, et al. Hadean age for a post‐magma‐ocean zircon confirmed by atom‐probe tomography. Nat Geosci. 2014;7:219–223. [Google Scholar]
  • 86. Blank CE. An expansion of age constraints for microbial clades that lack a conventional fossil record using phylogenomic dating. J Mol Evol. 2011;73:188–208. [DOI] [PubMed] [Google Scholar]
  • 87. Blank CE. Phylogenomic dating—a method of constraining the age of microbial taxa that lack a conventional fossil record. Astrobiology. 2009;9:173–191. [DOI] [PubMed] [Google Scholar]
  • 88. Xie J, Chen Y, Cai G, Cai R, Hu Z, Wang H. Tree visualization by one table (tvBOT): a web application for visualizing, modifying and annotating phylogenetic trees. Nucleic Acids Res. 2023;51:W587–W592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60. [DOI] [PubMed] [Google Scholar]
  • 90. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, et al. SPAdes: a new genome assembly algorithm and its applications to single‐cell sequencing. J Comput Biol. 2012;19:455–477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Aroney STN, Newell RJP, Nissen J, Camargo AP, Tyson GW, Woodcroft BJ. CoverM: read coverage calculator for metagenomics. 10.5281/zenodo.10531254. (January 2024, date last accessed). [DOI]
  • 92. Wei T, Simko V. R package “corrplot”: visualization of a correlation matrix. https://github.com/taiyun/corrplot. (2017, date last accessed).
  • 93. Dixon P. VEGAN, a package of R functions for community ecology. J Vegetation Sci. 2003;14:927–930. [Google Scholar]
  • 94. Aphalo PJ. ggpmisc: miscellaneous extensions to ‘ggplot2’. https://docs.r4photobiology.info/ggpmisc/. (2024, date last accessed).
  • 95. Wickham H. ggplot2: elegant graphics for data analysis. 2nd ed. Springer International Publishing; 2016. [Google Scholar]
  • 96. Yu G. gglayer: provide extra layers for ggplot2. https://github.com/GuangchuangYu/gglayer. (2019, date last accessed).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information.

MLF2-5-108-s002.pdf (13.1MB, pdf)

Supplementary Table.

MLF2-5-108-s001.xlsx (306.1KB, xlsx)

Data Availability Statement

The tree files and the nexus files of tree dating can be found at Figshare (https://doi.org/10.6084/m9.figshare.30351535).


Articles from mLife are provided here courtesy of Wiley

RESOURCES