Skip to main content
Science Advances logoLink to Science Advances
. 2023 Jul 7;9(27):eade4847. doi: 10.1126/sciadv.ade4847

The evolution and spread of sulfur cycling enzymes reflect the redox state of the early Earth

Katherine Mateos 1,2,, Garrett Chappell 1,3,, Aya Klos 1, Bryan Le 1, Joanne Boden 4, Eva Stüeken 4, Rika Anderson 1,5,*
PMCID: PMC10328410  PMID: 37418533

Abstract

The biogeochemical sulfur cycle plays a central role in fueling microbial metabolisms, regulating the Earth’s redox state, and affecting climate. However, geochemical reconstructions of the ancient sulfur cycle are confounded by ambiguous isotopic signals. We use phylogenetic reconciliation to ascertain the timing of ancient sulfur cycling gene events across the tree of life. Our results suggest that metabolisms using sulfide oxidation emerged in the Archean, but those involving thiosulfate emerged only after the Great Oxidation Event. Our data reveal that observed geochemical signatures resulted not from the expansion of a single type of organism but were instead associated with genomic innovation across the biosphere. Moreover, our results provide the first indication of organic sulfur cycling from the Mid-Proterozoic onwards, with implications for climate regulation and atmospheric biosignatures. Overall, our results provide insights into how the biological sulfur cycle evolved in tandem with the redox state of the early Earth.


Phylogenomics analyses reveal the evolution of microbial sulfur metabolisms co-evolved with the redox state of the early Earth.

INTRODUCTION

The biogeochemical sulfur cycle has played a crucial role in the evolution of life and surface processes over geologic time. Dissimilatory metabolisms, including elemental sulfur reduction, sulfate reduction, sulfate disproportionation, and sulfide oxidation, fuel diverse microbes and play an important role in regulating the redox state of the surface of the Earth (Fig. 1) (1). For example, the burial of biogenic sulfide in marine sediments may have contributed to progressive oxygenation of surface environments (2). In addition, Earth’s carbon and sulfur cycles are linked through the metabolic reduction of sulfate, which is coupled with the oxidation of organic carbon. This process accounts for up to 50% of organic carbon mineralization (3). The sulfur cycle is also intricately entwined with cycling of other important elements, including nitrogen and various transition metals (4, 5). In particular, freely dissolved sulfide in seawater, especially during the Proterozoic eon (6), may have affected the solubility of essential micronutrients such as molybdenum (7). Thus, a deeper understanding of the evolution of the biological sulfur cycle can offer important insights into the oxidation state of our planet over time and the evolution of other biogeochemical cycles.

Fig. 1. Schematic of the biological sulfur cycle, highlighting the genes included in this analysis.

Fig. 1.

(Note that sat was excluded from the analysis, because it is also used in several other pathways). APS, adenylyl sulfate; DMS, dimethyl sulfide; DMSO, dimethyl sulfoxide.

Several geochemical studies suggest that sulfur metabolisms were probably among the earliest microbial metabolisms on the ancient Earth. Early analyses of sulfur isotopes of pyrite and barite in the 3.5-billion-year-old (Ga) Dresser Formation provided evidence for sulfate reduction in the Archaean era (8). While data from Philippot et al. (9) later argued that the isotopic fingerprint of the sulfides was more consistent with sulfur disproportionation, subsequent work supported the original claim (10). Since then, several studies have documented isotopic evidence of microbial sulfur cycling in a variety of Archean environments [e.g., (1113)].

One of the biggest questions regarding the evolution of the biological sulfur cycle is how it coevolved with the oxygenation of the Earth over time. It is now well established that the Earth’s surface underwent a major transformation at around 2.4 Ga, when atmospheric O2 levels increased above a threshold of 10−5 times present levels, known as the Great Oxidation Event (GOE) (14). As a consequence, volcanism was replaced by oxidative weathering as the major source of sulfur to the ocean, and this new sulfur source was predominantly in the form of sulfate, as opposed to volcanogenic SO2, dissolved sulfite, and the photochemical products S8 and sulfate (15). Furthermore, O2 became abundant in surface waters as a potent oxidant of reduced sulfur species (16). The Archean-Proterozoic transition also witnessed a decline in hydrothermal activity on the ocean floor (17). Last, the deep ocean became fully oxygenated in the Neoproterozoic or Early Paleozoic with the second rise of oxygen (6, 18), leading to enhanced sulfide oxidation within sediments (19). Geochemical data show isotopic expressions of these events in the sulfur cycle (1); however, it is so far unknown whether these isotopic signals reflect merely an enhancement of a preexisting process or true evolutionary innovations. This question has important implications for cause-effect relationships in Earth system evolution.

While most studies of the early sulfur cycle rely on geochemical analyses, this approach has limitations for three reasons: First, the isotopic signatures of differing sulfur metabolisms are not necessarily distinct enough to be recognizable in the sedimentary rock record. For example, analysis from the 3.2-Ga Moodies Group in South Africa confirmed the presence of reductive sulfur cycling and hinted at the presence of oxidative sulfur cycling (20); however, the latter could not be unambiguously inferred. Second, the concentration of sulfate in the Archean ocean was low, possibly as low as 2.5 μM (21), dampening the signal of microbial sulfur isotope fractionation in the rock record. This challenge was illustrated by the analysis of sulfur isotope ratios in a modern sulfate-poor analog of the Archaean ocean, where biological fractionations are muted despite the presence of active microbial sulfate reduction (21). Third, the Archean sulfur isotope record is famously affected by photochemical processes acting on volcanogenic SO2 gas in an anoxic atmosphere (22). These photochemical reactions are recognizable by so-called mass-independent isotopic fractionations; however, distinguishing these from biogenic isotope effects requires analyses of all four stable isotopes of sulfur and relatively large sample sets (10, 23, 24). Thus, while the geochemical record has produced valuable insight into sulfur cycling in the Archean, the results can be inconclusive and often cannot distinguish between specific metabolic pathways.

Given the limitations and uncertainties presented by the geochemical record, pairing geochemical analysis with top-down phylogenetics approaches can provide important insights into the emergence and spread of distinct sulfur cycling microbial metabolisms on the early Earth. Although no dedicated molecular clock work to date has been conducted on sulfur metabolisms, genomics-based studies suggest that dissimilatory sulfite and sulfate reduction and sulfide oxidation emerged as early forms of energy metabolism (2528). Phylogenetics analyses suggest that the dsr genes, which catalyze the reduction of sulfite to sulfide or in the reverse for sulfide oxidation, were among the first sulfur-based genes to arise (25, 27), with the reductive form arising first (29). Anaerobic anoxygenic photosynthesizers would likely have emerged soon thereafter, possibly using these genes to oxidize sulfur and sulfide (30, 31). Genomics analyses also indicate that sulfur disproportionation likely arose subsequently (27, 28). Because the oceans became increasingly oxygenated, thiosulfate would have become more available, paving the way for the evolution of thiosulfate oxidation/reduction via the sox pathway (26).

However, while phylogenetics-based approaches generally track the birth of specific lineages or genes, the birth of a gene does not necessarily coincide with the time at which the function of that gene became ecologically important. Because genes are horizontally transferred between divergent microbial lineages, the genes that serve a useful function are the ones most likely to be retained in a genome (3237). Thus, an increase in horizontal gene transfer (HGT) events for a particular gene at a specific point in time likely gives an indication that the gene in question was ecologically important during that time period. Similarly, speciation events indicate that the lineage encoding a specific functional gene, and therefore the metabolism it facilitates, has expanded into new ecological niches.

To examine the evolution of the biological sulfur cycle over time, we therefore used phylogenomics approaches to track the timing of speciation, duplication, loss, and HGT events for sulfur cycling genes across a time-calibrated tree of life. This analysis allows us to determine approximately when these genes first arose and then proliferated across the tree of life on the early Earth. A similar analysis of nitrogen cycling genes revealed that nitrogen fixation arose and spread early, while genes related to denitrification from nitrite arose and spread much later in Earth history (38). Here, we focus on constraining the timing of speciation, duplication, loss, and HGT events for genes related to dissimilatory sulfate reduction and sulfide oxidation via sulfide, transformations between sulfate and thiosulfate, as well as organic sulfur cycling.

RESULTS

Construction of species tree and time-calibrated chronogram

We constructed a species tree from an alignment of 16 universal single-copy ribosomal genes to conduct the phylogenetic reconciliation (fig. S1). The resulting tree placed the eukaryotes within the archaeal domain, consistent with a two-domain tree of life, as has been recovered previously using similar methods (3941). We constructed chronograms from this species tree using two autocorrelated clock models [log-normal (LN) and Cox-Ingersoll-Ross (CIR)] and one uncorrelated gamma multiplier (UGAM) clock model. We tested both liberal and conservative fossil calibration points to construct the molecular clock as a sensitivity test (Table 1; see Materials and Methods). For all clock models, the liberal calibration points returned unrealistic ages (i.e., between 5.3 and 6 Ga, before the formation of the Earth) for the last universal common ancestor (LUCA) (see chronograms with error bars shared Figshare at figshare.com/projects/The_evolution_and_spread_of_sulfur- cycling_enzymes_across_the_tree_of_life_through_deep_time/144267). Thus, we only used the conservative calibration points for the remainder of this analysis. Using the conservative calibration points, the LN clock returned a LUCA age of approximately 4.48 Ga, the CIR clock returned a LUCA age of approximately 4.05 Ga, and the UGAM clock returned a LUCA age of approximately 3.93 Ga. We report the results from the CIR clock here, because autocorrelated clock models have previously been shown to outperform uncorrelated models such as UGAM (42, 43); among the autocorrelated models, the CIR results were closer to most reports for the approximate age of LUCA (approximately 3.8 Ga) (44) than the results from the LN clock model. All results from the LN and UGAM clocks are reported in the Supplementary Materials. All Newick files, alignments, and chronograms with error bars have been deposited in Figshare at https://figshare.com/projects/The_evolution_and_spread_of_sulfur-cycling_enzymes_across_the_tree_of_life_through_deep_time/144267.

Table 1. Fossil calibration points used for calibrating molecular clocks.

Calibration points were set as the hard constraint in PhyloBayes, indicating the latest date by which a specific clade must have split. The “conservative” time points reflect the dates for which there is the most consensus; “liberal” time points reflect the earliest date reported in the literature. Note that the root prior [last universal common ancestor (LUCA)] was set using a gamma distribution with a mean of 3.8 (conservative) (44) or 4.1 billion years (Ga; liberal) (84) and an SD of 200 Ma.

Calibration events Conservative (Ga) Liberal (Ga)
LUCA (set as root prior) 3.8 ± 200 Ma (44) 4.1 ± 200 Ma (84)
Origin of methanogenesis >2.7 (85) >3.51 (86)
Origin of oxygenic photosynthesis >2.45 (87) >3.2 (14)
Origin of eukaryotes >1.7 (88) >3.2 (89)
Origin of plastids/rhodophytes diverge >1.05 (90) >1.2 (91)
Akinetes diverge from cyanobacteria lacking cell differentiation >1.0 (92) >1.5 (93)
Origin of animals >0.635 (94, 95) >0.635 (94, 95)

Phylogenetic distribution of sulfur cycling genes

We used AnnoTree (45) to determine the distribution of sulfur cycling genes across the tree of life. Genes related to dissimilatory sulfate reduction and sulfide oxidation via sulfite, including dsr and apr genes, tended to be fairly widespread across the tree of life: aprAB in particular is fairly widespread, occurring in approximately 47 bacterial and five to six archaeal phyla, and dsrAB is found in 32 bacterial and four to five archaeal phyla. In contrast, genes related to thiosulfate oxidation/reduction, particularly the sox group of genes, were more phylogenetically restricted, occurring in approximately 14 to 20 bacterial and one archaeal phylum, with the majority of gene hits restricted to the Proteobacteria superphylum. The exceptions to this rule were soxB and soxC, which were much more widespread across the tree of life, occurring in approximately 31 to 38 bacterial and four archaeal phyla. Last, the organic sulfur cycling genes dmdA, dmsA, and mddA each displayed a different phylogenetic distribution: dmdA was found in only 13 bacterial and two archaeal phyla, restricted mostly to the Proteobacteria and Actinobacteria; dmsA was much more widespread, identified in 43 bacterial and six archaeal phyla, but generally not observed in the Patescibacteria; and mddA was similarly widespread, found in 34 bacterial and four archaeal phyla, most noticeably absent from the Patescibacteria and the Firmicutes phyla.

Identification of duplication, loss, and HGT events for sulfur cycling genes

For the 13 genes of interest, we quantified gene speciation, duplication, loss, and HGT events using multiple reconciliation algorithms. Reconciliation was performed by comparing the topology of the maximum likelihood gene trees for each gene to fossil-calibrated chronograms using three different clock models (CIR, UGAM, and LN) using the reconciliation programs AnGST (46) and ecceTERA (47). The overall trends that we observed were the same between AnGST and ecceTERA. The results we report here are those from ecceTERA, which can take into account both sampled and unsampled (including extinct) lineages (47). The analyses presented below focus on results from ecceTERA based on the CIR clock model, with replicate analyses with similar results from ecceTERA using the UGAM and LN clock models presented in tables S1 and S2 and figs. S2 to S6.

In interpreting the dates for events identified through phylogenetic reconciliation, it is important to consider the limitations inherent in dating each of these gene events. The phylogenetic reconciliation identifies branches on which events occurred, and thus there is no way to determine when on a given branch a specific gene event occurred. This can be more clearly visualized in Fig. 2 and figs. S2 and S3, which show the full branch lengths along which events could have occurred. Early events tended to be more well constrained than late events, partly because most early events were speciation events, which occur at specific nodes on the tree; in addition, earlier loss/duplication/HGT events occurred on shorter internal nodes rather than on the leaves of the branches, which occurred later. This was likely due to the taxonomic sampling included in the tree, in which one representative from each class was represented; a tree with a large number of more closely related taxa (i.e., at the species or subspecies level) would have shorter branches at the tips but would be computationally intractable to create. Combined with the inherent error associated with dating events on chronograms dating back billions of years, estimates of when gene events occurred should not be taken as absolute dates. Instead, we emphasize the relative timing of these events. By examining the distributions of when specific gene events occurred, we are able to better understand the relative timing of when specific metabolisms became ecologically important. The histograms presented in figs. S4 to S6 present the same data as Fig. 2 and figs. S2 and S3 but instead depict the proportion of total gene events within each time bin according to the midpoint of the time range along which an event could have occurred. This depiction facilitates the visualization of trends. Figure 3 depicts a simplified summary of the results, representing each gene event as a single point defined by the midpoint dates of the time range along which an event could have occurred. The earliest events for each gene are reported in Table 2.

Fig. 2. Time ranges for each gene duplication, transfer, loss, and speciation event identified for all sulfur cycling genes.

Fig. 2.

The time ranges reflect the branch lengths for branches on the time-calibrated tree of life on which these events were identified; the event could have occurred anywhere along the branch. Reconciliations were conducted with the Cox-Ingersoll-Ross (CIR) clock model. Graphs are placed in chronological order according to the midpoint of the earliest event. Points are colored according to the type of gene event. Colored boxes outlining each graph represent the general metabolic pathway in which each gene belongs. dup, duplication; hgt, horizontal gene transfer; los, loss; spe, speciation event.

Fig. 3. Dot plot showing the timing of gene loss, duplication, speciation, and HGT events for sulfur cycling genes as identified by ecceTERA using the conservative CIR clock model.

Fig. 3.

Points are vertically jittered to facilitate visualization of individual events. Schematic of changes in atmospheric partial pressure of oxygen over time was replotted from Lyons et al. (14) for comparison purposes with the timing of the Great Oxidation Event (GOE). Ga, billion years. PAL, present atmospheric level.

Table 2. Identification of the earliest date for gene loss, duplication, speciation, and HGT events according to reconciliation with the chronogram generated using the CIR clock model and conservative calibration points, as identified by ecceTERA.

While gene births are not inferred using this reconciliation method, the gene must have emerged by this point in time. Speciation events can be located to a precise node and therefore have a specific date, whereas duplication, loss, and transfer events occur on branches and are reported with a date range. For those events, we report the “midpoint” date between these two node dates in the right-hand column. NA, not applicable, because these are speciation events at single nodes.

Gene Date or date range for earliest gene event (Ga) Midpoint date for earliest gene event (Ga)
dsrA 3.35 NA
dsrB 3.3 NA
soxC 2.65–2.88 2.77
soxA 2.6 NA
soxB 2.6 NA
soxX 2.6 NA
soxY 2.6 NA
soxZ 2.6 NA
aprA 2.47 NA
aprB 2.33 NA
dmsA 2.28 NA
mddA 1.77 NA
dmdA 0.00–2.38 1.19

The gene events are largely dominated by speciation events early in the evolution of each gene, often followed by a series of gene losses, and then later dominated by HGTs (Fig. 2 and figs. S2 to S6). According to the earliest events identified for each of the genes in Table 2 and as depicted in Figs. 2 and 3, the most ancient genes include dsrA and dsrB, which catalyze the oxidation and reduction of sulfite and sulfide. (See Fig. 1 for a schematic of the sulfur cycle and the steps catalyzed by each of these genes.) The gene apr, which catalyzes the oxidation and reduction of adenylyl sulfate (APS) and sulfite, had more varied results depending on the clock model used, but according to the CIR clock model emerged later than dsr and around the same time as the sox genes. The sox genes, which are involved in sulfate and thiosulfate (S2O32−) oxidation, generally show a marked rise in speciation events beginning around 2.4 Ga, followed by a second rise in HGT events around 1.2 Ga (although this is less well constrained in time) (Figs. 2 and 3). This trend is fairly consistent across all sox genes, with the exception of soxC, which had many more gene hits and gene events than the rest of the sox genes, and was found in a wider range of organisms overall; reconciliation revealed an earlier emergence and proliferation compared to the other sox genes. Note that reconciliation using the uncorrelated UGAM clock model showed that the sox genes had earlier gene events than dsrB and aprAB (figs. S3 and S4 and table S2). This was mostly driven by the speciation events; gene losses, transfers, and duplications for the sox genes generally occurred later than for the apr/dsr genes.

For all clock models, the organic sulfur cycling genes appeared to be much younger than the genes involved in sulfur oxidation and reduction for energy metabolism. The gene dmdA converts dimethyl sulfide [(CH3)2S, DMS] into methanethiol (CH3SH), dmsA converts DMS to dimethyl sulfoxide [(CH3)2SO, DMSO], and mddA is involved in the conversion of methanethiol (CH3SH) to DMS [(CH3)2S]. While the organic sulfur cycling genes had varying timelines for their initial gene events, some as far back as 2.6 Ga, the number of gene events did not meaningfully begin to rise for any of these genes until approximately 1.6 Ga (Figs. 2 and 3).

DISCUSSION

Our analysis of gene duplication, speciation, transfer, and loss events for sulfur cycling genes across Earth history provides insight into the relative timing for the proliferation of these genes across the tree of life and thus has implications for when specific sulfur metabolisms became ecologically important. As has been suggested previously, if a gene is acquired via HGT and retained in the genome, then this indicates that the horizontally transferred gene likely has been selected and retained, because it performs a useful ecological function (3237, 48). Thus, a rise in HGT events for a specific gene at a given time can indicate when these genes became favorable or ecologically useful given the conditions of the environment at that point in Earth’s history (38). Similarly, a speciation event occurs when a lineage containing that gene splits into two species, indicating that the metabolic pathway including that gene has begun to expand into new ecological niches and taxa.

Note that many of these events occurred on long branches terminating in leaves of the species chronogram, meaning that these specific events could have occurred anywhere on that branch, up to the present day. Because of limitations inherent in gene-tree-species-tree reconciliation, the date of each event can only be confined to the branch of the species tree where it occurred, meaning that it could have happened at any point in time between the two nodes of that branch (as illustrated in Fig. 2 and figs. S2 and S3). Caution is particularly warranted in interpreting the apparent peak in the number of events for many genes at ~750 Ma as shown in Fig. 3 and figs. S4 to S6. Many of these events occurred on leaves of the phylogenetic tree (as opposed to internal nodes), and the gene duplication, transfer, or loss events occurred at some point between the terminus of the leaf and the last node of that leaf, many of which occurred at approximately 1.5 Ga. Thus, although the midpoints of these events were calculated to have occurred around 750 Ma ago, in reality, these events occurred at some point during a prolonged time period that extends to the present day, as shown in Fig. 2. Because of these limitations in the data, caution is warranted in interpreting the timing of these gene events, as the calculated dates are rough estimates by necessity. Thus, our analysis emphasizes relative trends in the frequency of events over time, rather than focusing on precisely dated events. In addition, the approximate dates for gene birth events are inferred on the basis of the earliest event for that gene in the reconciliation, so the birth of that gene occurs before the earliest event by definition.

Dissimilatory sulfate and sulfite reduction and sulfide oxidation

Dissimilatory sulfide oxidation involves the genes dsrAB, with sulfite as an intermediate. The same gene is also involved in sulfite reduction back to sulfide. According to the two autocorrelated [and thus more reliable (42)] clock models, these genes show their first events around ~3.5 Ga (Figs. 2 and 3). Sulfide would have been readily supplied by hydrothermal vent systems in the Archean ocean (49), and biological oxidation of sulfide may have been coupled to the reduction of Fe3+ or trace O2 that occurred locally in the surface ocean. The findings that the dsrAB genes have an ancient origin are consistent with phylogenetic results from Wagner et al. (50), who used targeted gene sequencing and 16S ribosomal RNA sequencing techniques to show that dissimilatory sulfite reductase genes originated at around 3 Ga (50). Moreover, sulfite would have been supplied naturally by dissolution of volcanogenic SO2 in water (15). The reduction of sulfite could have been coupled to organic matter oxidation as well as to volcanogenic or biogenic H2 oxidation.

Dates for the rise and spread of aprAB, which is involved in dissimilatory sulfate reduction with APS as an intermediate, varied more widely across clock models and is therefore more difficult to constrain. The supply of sulfate to the Archean ocean, before the GOE, would have been limited to photochemical SO2 oxidation (12, 22). Nevertheless, geochemical data suggest that dissimilatory sulfate reduction is ancient, going back to 3.5 Ga (8). Previous studies have theorized that both the apr and dsr genes were involved in early oxidative pathways using sulfide in ancient microbial mats around 3 Ga ago (26).

While the oldest events for dsr date back to the Archean, our data also reveal an expansion for this gene around the GOE and again in the Late Proterozoic or Early Phanerozoic, concurrent with genes involved in dissimilatory sulfate reduction. The Paleoproterozoic events are likely linked to the increasing supplies of sulfate due to enhanced rates of oxidative weathering under an oxygenated atmosphere. The Late Proterozoic events may be linked to oxygenation of the deep ocean and associated growth of the marine sulfate reservoir (Fig. 3). Sulfate reduction and sulfide oxidation would have become more favorable metabolic pathways under these conditions (51). Our data thus suggest that these geochemical transformations of Earth’s surface directly impacted biological evolution.

Sulfate-thiosulfate transformations

The Sox enzyme system is involved in the reduction and oxidation of sulfate and thiosulfate, respectively (52). A version of this pathway, omitting the SoxCD complex, can also be used to oxidize hydrogen sulfide to elemental sulfur (1). Most sox genes arose and began to speciate around or after the time of the GOE, approximately 2.4 Ga. Subsequently, the majority of the HGT events associated with the sox genes did not occur until much later, at approximately the time of the Neoproterozoic Oxygenation Event (NOE) (14), in which the deep ocean is thought to have become more pervasively oxygenated approximately 850 to 540 Ma ago (Fig. 3). The rise in the number of speciation events for soxABXYZ approximately 2 Ga ago approximately coincides with increasing sulfate availability in the Earth’s oceans after the GOE (53). Thiosulfate has an intermediate redox state [S(+II)] between sulfide [S(−II)] and sulfate [S(+VI)] and forms most commonly during microbial sulfide oxidation (54). Hence, the expansion of sox genes across the tree of life in the Paleoproterozoic is most parsimoniously attributed to increasing availability of O2 and therefore enhanced sulfide oxidation. This finding is consistent with geochemical evidence for enhanced disproportionation of elemental sulfur in the Mid- to Late Proterozoic (16, 19), as elemental sulfur, like thiosulfate, is an intermediate in microbial sulfide oxidation. Moreover, the rise in HGT events around the time of the NOE, while not as well constrained in time, suggests that increasing oxygen levels enabled the expansion of the biological sulfur cycle. In other words, these results indicate that increasing oxygen on Earth paved the way for the expansion of niche space and innovations in microbial evolution.

We identified many more events for the gene soxC compared to the other sox genes, many of which occurred earlier than the other sox genes. While it is unclear why this is the case, it may be related to the fact that soxC is not involved in the alternate sox pathway that creates elemental sulfur from sulfide. The soxC gene is part of a sulfur dehydrogenase molybdenum enzyme complex called soxCD that catalyzes a six-electron transfer in the middle of the sox sequence and appears to be reliant on the other enzyme complexes in the sox sequence (55, 56). soxC exists in a wider range of organisms than the rest of the sox genes, which were primarily found in Proteobacteria. We speculate that this pattern may be the result of the gene’s relationship to another gene with a similar function, sorA, which has a 26.5% sequence identity to soxC (56) and is similarly widespread across the tree of life. Alternatively, it could indicate a separate function beyond the sox pathway for soxC that would require further investigation.

Organic sulfur cycling

The organic sulfur cycle involves the biological formation of volatile organic compounds such as DMS and methanethiol. The genes dmdA and dmsA, which are the key enzymes involved in DMS metabolisms, record their first events approximately 1.5 to 2 Ga, possibly linked to the rise of eukaryotic algae, whose production of organic sulfur gasses has been implicated in global cooling in the Late Proterozoic (57). Similarly, the gene mddA, which converts methanethiol into DMS, also seems to have emerged and proliferated approximately 1.5 to 2 Ga. Thus, our results suggest that bacteria were capable of generating and metabolizing DMS only after the GOE, possibly with important implications for climate regulation on the early Earth, because DMS particles are known to act as cloud condensation nuclei, which has been hypothesized to cool the Earth’s surface (58). We speculate that DMS-generating metabolisms arose in response to a larger sulfate reservoir in the ocean from the GOE onwards, which may have led to organic matter sulfurization in diagenetic settings [e.g., (59)].

Moreover, volatile organic sulfur compounds such as DMS are important as potential remotely detectable biosignatures, because they could conceivably be detected on other planets with an anoxic biosphere using analysis of the spectral signatures of the planet’s atmosphere (60). Our results thus suggest that Earth’s biosphere may potentially have been detectable through this technique only within the past 1.5 to 2 Ga.

Shifts in the biological sulfur cycle have a profound impact on the global carbon cycle and Earth’s climate and are closely tied to the redox state of the Earth. Our results suggest that microbial energy acquisition via sulfite reduction and possibly sulfide oxidation emerged early in Earth history, which is consistent with volcanic and hydrothermal sources of sulfite and sulfide, respectively. While our results cannot confirm geochemical evidence of microbial sulfate reduction (driven by apr) going back to 3.5 Ga, the dates obtained for this gene vary widely between clock models, and thus our results do not preclude the possibility that sulfate reduction arose earlier and will require further investigation. We also find that metabolisms involving intermediates such as thiosulfate proliferated across the tree of life only after the Paleoproterozoic GOE, as the Earth’s ocean and atmosphere became more oxidizing. However, our analysis goes beyond the geochemical records, because our data reveal that the expressions of these geochemical signatures were not merely the result of preservation or expansion of a single organism but instead caused by the radiation of genomic innovations across the tree of life. We further show that the growth of the marine sulfate reservoir after the GOE triggered an expansion of organic sulfur metabolisms, which would have added an important biosignature to Earth’s atmosphere from the Proterozoic onwards. Last, our analyses reveal an expansion in all sulfur metabolisms around the Neoproterozoic, highlighting that this time period not only witnessed the rise of eukaryotic life but was also an important driver of microbial evolution.

MATERIALS AND METHODS

Genome selection and construction of species tree

To construct the species tree, we included one representative genome from each bacterial and archaeal order, based on GTDB taxonomy (61, 62). Some eukaryotic genomes were also included to create a full tree of life to capture putative gene transfer events between the archaeal and bacterial domains and to include additional time calibration points in the eukaryote domain for the molecular clock. However, the focus of the study was on sulfur cycling genes within bacterial and archaeal genomes. GToTree (63) was applied to identify and align single-copy universal ribosomal genes from the genomes that we selected. The concatenated gene alignments were created from a set of 16 universal single-copy genes (64), and we excluded genomes with fewer than half of the single-copy genes. Briefly, the GToTree workflow used prodigal (65) to predict genes on input genomes, then identified genes with HMMER3 v3.2.2 (66), individually aligned genes with MUSCLE v5.1 (67), trimmed the alignment with trimal v1.4.rev15 (68), and concatenated aligned genes with FastTree2 v2.1.1 (69). The resulting alignment was used to construct a phylogeny using RAxML v. 8.2.9 (70) with 100 rapid bootstraps using the PROTGAMMALG model of evolution as per (64). The root of the tree was placed in between the archaeal and bacterial domains by designating the entire bacterial domain as the outgroup. The resulting tree contains 871 genomes, including 777 bacterial, 80 archaeal, and 14 eukaryotic genomes.

Construction of time-calibrated chronogram

The species tree was converted to a chronogram using PhyloBayes v4.1b (71). We tested two separate sets of calibration points, one conservative (which represents the earliest date for which there is the most consensus for a given event based on the current scientific literature) and one liberal (which represents the earliest date for which there is any evidence of a given event based on the current scientific literature) to test the sensitivity of methodology (Table 1). The root age was set via a normally distributed gamma root prior according to dates specified in Table 1 with an SD set to 200 Ma, consistent with previous studies (72).

To generate chronograms, we tested three different clock models: autocorrelated LN (73), UGAM (74), and the autocorrelated CIR process (42). For each model and set of calibration points, two chains were run concurrently and were compared as a test of convergence. We analyzed convergence using the tracecomp program in PhyloBayes, requiring an effective size of >100 and a maximum difference between chains of <0.3. Each chain was run for >60,000 cycles. Chronograms were generated using the readdiv function in PhyloBayes, discarding the first 2500 cycles as a burn in.

Identification of sulfur cycling genes and construction of gene trees

We identified sulfur cycling genes of interest using the sulfur metabolism pathway on the Kyoto Encyclopedia of Genes and Genomes (KEGG) (7577). In line with previous genomic studies of the sulfur cycle, we analyzed dissimilatory sulfate reduction/sulfide oxidation genes (aprAB and dsrAB) (78) and thiosulfate oxidizing/sulfate reducing genes (soxABCXYZ) (4). Although the sat gene catalyzes the first step of dissimilatory sulfur cycling, we excluded this gene from our study, because it is also used in other metabolic pathways that were not of specific interest here. We also examined genes that were involved in the production of volatile organic sulfur compounds, including methanethiol, DMSO, and DMS (mddA, dmdA, and dmsA). The list of sulfur cycling genes analyzed here was not meant to be exhaustive but rather focused on core genes involved in dissimilatory sulfur oxidation and reduction for energy acquisition as well as select genes involved in organic sulfur cycling.

For consistency in identifying genes across genomes, we used AnnoTree (45) to identify sulfur cycling genes in microbial genomes using KEGG orthology numbers as queries (Table 3). We limited our analysis to the core dissimilatory sulfur cycling and thiosulfate reduction and oxidation genes that were included in the AnnoTree database. Note that some metabolic pathways use the same genes for catalyzing oxidation and reduction reactions and thus cannot be distinguished using these methods. For example, sulfur disproportionation uses the same enzymatic pathways as sulfate reduction (28, 79). The default AnnoTree settings (minimum of 30% identity, maximum E value of 10−5, minimum of 70% subject, and query alignment) were applied for identifying genes in genomes. AnnoTree output was curated to only include genes from genomes within our species tree. Note that, although eukaryotic genomes were included in the species tree, AnnoTree does not include eukaryotic phyla in the gene distribution search, and so eukaryotic genes were excluded from this analysis. The number of hits for each gene can be found in Table 3. These genes were aligned using MUSCLE v3.8.31 (67) and then trimmed using TrimAl v.1.3 (68) with the -automated1 option as implemented in Phylemon2 (80). The model of evolution was determined by the Model Selection tool implemented in IQ-TREE 2.0.3 (81) with the flags -m MFP, -mrate E,I,G,I + G,R and -madd C10,C20,C30,C40,C50,C60,EX2,EX3,EHO,UL2,UL3,EX_EHO,LG4M,LG4X,CF4,LG+C10,LG+C20,LG+C30,LG+C40,LG+C50,LG+C60 to include all complex mixture models of protein evolution in model selection. All trees were constructed using IQ-TREE 2.0.3 (81) with a specification of 1000 ultrafast bootstraps (except aprB, which was set to 2000) and the default UFBoot convergence criterion. Within IQ-TREE, the “UFBoot stopping rule” automatically assesses the convergence of the split support values and stops collecting candidate trees once convergence is achieved (82). All gene trees reached convergence. The number of bootstraps used for each gene tree is reported in Table 3.

Table 3. Sulfur cycling genes analyzed in this study.

Number of gene hits indicates the number of genes identified among the genomes included in the species tree; best-fit model indicates the model of evolution used for generating the gene tree as determined by IQ-Tree ModelFinder; number of bootstraps indicates the number of bootstrap trees created to achieve convergence, which were subsequently used for reconciliation in ecceTERA; number of speciation, loss, duplication, and horizontal gene transfer (HGT) events are reported as determined by ecceTERA using the Cox-Ingersoll-Ross (CIR) clock model. KEGG, Kyoto Encyclopedia of Genes and Genomes.

Gene KEGG orthology number Number of gene hits Metabolic pathway Best-fit model Number of bootstraps Loss events Duplication events HGT events Speciation events Total number of events
aprA K00394 110 Dissimilatory sulfate oxidation/ reduction LG + C60 225 47 4 82 31 131
aprB K00395 103 Dissimilatory sulfate oxidation/ reduction LG4X 1000 44 2 78 34 129
dsrA K11180 83 Dissimilatory sulfate oxidation/ reduction C60 204 57 3 48 61 146
dsrB K11181 84 Dissimilatory sulfate oxidation/ reduction LG + C60 253 50 1 46 52 128
soxA K17222 40 Thiosulfate oxidation/ reduction C60 232 11 3 18 20 52
soxB K17224 45 Thiosulfate oxidation/ reduction LG + C50 103 13 3 23 22 61
soxC K17225 105 Thiosulfate oxidation/ reduction LG + C50 192 17 4 84 32 137
soxX K17223 38 Thiosulfate oxidation/ reduction LG + C50 127 15 2 17 25 59
soxY K17226 38 Thiosulfate oxidation/ reduction C50 200 13 2 17 23 55
soxZ K17227 58 Thiosulfate oxidation/ reduction C50 229 24 7 25 38 92
dmdA K17486 28 Volatile organic sulfur cycling C60 109 0 1 22 1 24
dmsA K07306 92 Volatile organic sulfur cycling LG + C60 201 35 11 72 12 100
mddA K21310 65 Volatile organic sulfur cycling C50 400 8 3 53 10 71

Reconciliation of gene trees with species chronogram with ecceTERA

Gene trees and species trees were reconciled using ecceTERA v1.2.5 (47) to identify gene loss, duplication, speciation, and transfer events. We used the default settings implemented in ecceTERA and amalgamated the gene trees (amalgamate = true). No transfers to the dead were allowed when reconciling gene and species trees. The output was configured to RecPhyloXML format (83) with the option “recPhyloXML.reconciliation = true.” Reconciliation analyses were performed on fully dated species trees and full sets of gene tree bootstraps (see Table 3 for bootstrap information). Using a combination of custom Python scripts developed for this project (provided on GitHub at https://github.com/carleton-spacehogs/sulfur as well as on Figshare at https://doi.org/10.6084/m9.figshare.23255627), we calculated the mean date for each event based on the midpoints of the two 95% confidence intervals that defined the nodes of the branch on which the event occurred. Distributions of the gene event data produced from ecceTERA were subsequently compared to distribution data of the gene event data produced from AnGST (46) to ensure that the results were not dependent on the reconciliation algorithm; the overall trends that we observed were the same.

Acknowledgments

We thank C. Scornavacca for assistance with ecceTERA and K. Anantharaman for advice regarding the biological sulfur cycle.

Funding: K.M. was supported by the Dean of the College Office at Carleton College. This work was performed by the Virtual Planetary Laboratory team, a member of the NASA Nexus for Exoplanet System Science, funded via NASA Astrobiology Program grant no. 80NSSC18K0829 to R.A. Financial support for this publication also results from a Scialog program sponsored jointly by Research Corporation for Science Advancement and the Heising-Simons Foundation and includes a grant (no. 28109) to Carleton College by RCSA. E.S. and J.B. acknowledge funding from a NERC Frontiers grant (NE/V010824/1).

Author contributions: R.A. and E.S. conceptualized the study. K.M., G.C., A.K., B.L., J.B., and R.A. performed bioinformatics and phylogenetics analyses. K.M., E.S., and R.A. wrote the manuscript with input from all authors. E.S. and R.A. supervised. E.S. and R.A. provided funding.

Competing interests: The authors declare that they have no competing interests.

Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Alignments, chronograms, Newick files, and scripts can be found at Figshare at: doi.org/10.6084/m9.figshare.20369301.v1, doi.org/10.6084/m9.figshare.20369313.v1, and doi.org/10.6084/m9.figshare.20369316.v1.

Supplementary Materials

This PDF file includes:

Tables S1 and S2

Figs. S1 to S6

View/request a protocol for this paper from Bio-protocol.

REFERENCES AND NOTES

  • 1.D. A. Fike, A. S. Bradley, C. V. Rose, Rethinking the ancient sulfur cycle. Ann. Rev. Planet. Sci. 43, 593–622 (2015). [Google Scholar]
  • 2.A. J. Krause, B. J. W. Mills, S. Zhang, N. J. Planavsky, T. M. Lenton, S. W. Poulton, Stepwise oxygenation of the Paleozoic atmosphere. Nat. Commun. 9, 4081 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.B. B. Jørgensen, Mineralization of organic matter in the sea bed—The role of sulphate reduction. Nature 296, 643–645 (1982). [Google Scholar]
  • 4.D. E. Canfield, F. J. Stewart, B. Thamdrup, L. De Brabandere, T. Dalsgaard, E. F. Delong, N. P. Revsbech, O. Ulloa, A cryptic sulfur cycle in oxygen-minimum-zone waters off the Chilean coast. Science 330, 1375–1378 (2010). [DOI] [PubMed] [Google Scholar]
  • 5.V. Boyko, K. Avetisyan, A. Findlay, Q. Guo, X. Yang, A. Pellerin, A. Kamyshny Jr., Biogeochemical cycling of sulfur, manganese and iron in ferruginous limnic analog of Archean ocean. Geochim. Cosmochim. Acta 296, 56–74 (2021). [Google Scholar]
  • 6.E. A. Sperling, C. J. Wolock, A. S. Morgan, B. C. Gill, M. Kunzmann, G. P. Halverson, F. A. Macdonald, A. H. Knoll, D. T. Johnston, Statistical analysis of iron geochemical data suggests limited Late Proterozoic oxygenation. Nature 523, 451–454 (2015). [DOI] [PubMed] [Google Scholar]
  • 7.C. T. Reinhard, N. J. Planavsky, L. J. Robbins, C. A. Partin, B. C. Gill, S. V. Lalonde, A. Bekker, K. O. Konhauser, T. W. Lyons, Proterozoic ocean redox and biogeochemical stasis. Proc. Natl. Acad. Sci. U.S.A. 110, 5357–5362 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Y. Shen, R. Buick, D. E. Canfield, Isotopic evidence for microbial sulphate reduction in the Early Archaean era. Nature 410, 77–81 (2001). [DOI] [PubMed] [Google Scholar]
  • 9.P. Philippot, M. Van Zuilen, K. Lepot, C. Thomazo, J. Farquhar, M. J. Van Kranendonk, Early archaean microorganisms preferred elemental sulfur, not sulfate. Science 317, 1534–1537 (2007). [DOI] [PubMed] [Google Scholar]
  • 10.Y. Ueno, S. Ono, D. Rumble, S. Maruyama, Quadruple sulfur isotope analysis of ca. 3.5 Ga dresser formation: New evidence for microbial sulfate reduction in the early Archean. Geochim. Cosmochim. Acta 72, 5675–5691 (2008). [Google Scholar]
  • 11.I. Zhelezinskaia, A. J. Kaufman, J. Farquhar, J. Cliff, Large sulfur isotope fractionations associated with Neoarchean microbial sulfate reduction. Science 346, 742–744 (2014). [DOI] [PubMed] [Google Scholar]
  • 12.D. L. Roerdink, P. R. D. Mason, J. Farquhar, T. Reimer, Multiple sulfur isotopes in Paleoarchean barites identify an important role for microbial sulfate reduction in the early marine environment. Earth Planet. Sci. Lett. 331–332, 177–186 (2012). [Google Scholar]
  • 13.N. McLoughlin, E. G. Grosch, M. R. Kilburn, D. Wacey, Sulfur isotope evidence for a Paleoarchean subseafloor biosphere, Barberton, South Africa. Geology 40, 1031–1034 (2012). [Google Scholar]
  • 14.T. W. Lyons, C. T. Reinhard, N. J. Planavsky, The rise of oxygen in Earth’s early ocean and atmosphere. Nature 506, 307–315 (2014). [DOI] [PubMed] [Google Scholar]
  • 15.M. W. Claire, J. F. Kasting, S. D. Domagal-Goldman, E. E. Stüeken, R. Buick, V. S. Meadows, Modeling the signature of sulfur mass-independent fractionation produced in the Archean atmosphere. Geochim. Cosmochim. Acta 141, 365–380 (2014). [Google Scholar]
  • 16.D. T. Johnston, B. A. Wing, J. Farquhar, A. J. Kaufman, H. Strauss, T. W. Lyons, L. C. Kah, D. E. Canfield, Active microbial sulfur disproportionation in the Mesoproterozoic. Science 310, 1477–1479 (2005). [DOI] [PubMed] [Google Scholar]
  • 17.S. Viehmann, M. Bau, J. Elis Hoffmann, C. Münker, Geochemistry of the Krivoy Rog Banded Iron Formation, Ukraine, and the impact of peak episodes of increased global magmatic activity on the trace element composition of Precambrian seawater. Precambrian Res. 270, 165–180 (2015). [Google Scholar]
  • 18.G. Shields-Zhou, L. Och, The case for a Neoproterozoic Oxygenation Event: Geochemical evidence and biological consequences. GSA Today 21, 4–11 (2011). [Google Scholar]
  • 19.M. Kunzmann, T. H. Bui, P. W. Crockford, G. P. Halverson, C. Scott, T. W. Lyons, B. A. Wing, Bacterial sulfur disproportionation constrains timing of Neoproterozoic oxygenation. Geology 45, 207–210 (2017). [Google Scholar]
  • 20.S. Nabhan, J. Marin-Carbonne, P. R. D. Mason, C. Heubeck, In situ S-isotope compositions of sulfate and sulfide from the 3.2 Ga Moodies Group, South Africa: A record of oxidative sulfur cycling. Geobiology 18, 426–444 (2020). [DOI] [PubMed] [Google Scholar]
  • 21.S. A. Crowe, G. Paris, S. Katsev, C. Jones, S.-T. Kim, A. L. Zerkle, S. Nomosatryo, D. A. Fowle, J. F. Adkins, A. L. Sessions, J. Farquhar, D. E. Canfield, Sulfate was a trace constituent of Archean seawater. Science 346, 735–739 (2014). [DOI] [PubMed] [Google Scholar]
  • 22.J. Farquhar, H. Bao, M. Thiemens, Atmospheric influence of Earth’s earliest sulfur cycle. Science 289, 756–758 (2000). [DOI] [PubMed] [Google Scholar]
  • 23.A. L. Zerkle, M. W. Claire, S. D. Domagal-Goldman, J. Farquhar, S. W. Poulton, A bistable organic-rich atmosphere on the Neoarchaean Earth. Nat. Geosci. 5, 359–363 (2012). [Google Scholar]
  • 24.J. Farquhar, J. Cliff, A. L. Zerkle, A. Kamyshny, S. W. Poulton, M. Claire, D. Adams, B. Harms, Pathways for Neoarchean pyrite formation constrained by mass-independent sulfur isotopes. Proc. Natl. Acad. Sci. U.S.A. 110, 17638–17643 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.C. E. Blank, Evolutionary timing of the origins of mesophilic sulphate reduction and oxygenic photosynthesis: A phylogenomic dating approach. Geobiology 2, 1–20 (2004). [Google Scholar]
  • 26.B. Meyer, J. Kuever, Phylogeny of the alpha and beta subunits of the dissimilatory adenosine-5′-phosphosulfate (APS) reductase from sulfate-reducing prokaryotes – origin and evolution of the dissimilatory sulfate-reduction pathway. Microbiology 153, 2026–2044 (2007). [DOI] [PubMed] [Google Scholar]
  • 27.J. R. Havig, T. L. Hamilton, A. Bachan, L. R. Kump, Sulfur and carbon isotopic evidence for metabolic pathway evolution and a four-stepped Earth system progression across the Archean and Paleoproterozoic. Earth Sci. Rev. 174, 1–21 (2017). [Google Scholar]
  • 28.L. M. Ward, E. Bertran, D. T. Johnston, Expanded genomic sampling refines current understanding of the distribution and evolution of sulfur metabolisms in the Desulfobulbales. Front. Microbiol. 12, 666052 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.A. L. Müller, K. U. Kjeldsen, T. Rattei, M. Pester, A. Loy, Phylogenetic and environmental diversity of DsrAB-type dissimilatory (bi)sulfite reductases. ISME J. 9, 1152–1165 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.P. F. Hoffman, Big time. Annu. Rev. Earth Planet. Sci. 47, 1–17 (2019). [Google Scholar]
  • 31.R. E. Blankenship, Molecular evidence for the evolution of photosynthesis. Trends Plant Sci. 6, 4–6 (2001). [DOI] [PubMed] [Google Scholar]
  • 32.M. L. Coleman, S. W. Chisholm, Ecosystem-specific selection pressures revealed through comparative population genomics. Proc. Natl. Acad. Sci. U.S.A. 107, 18634–18639 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.J. O. McInerney, A. McNally, M. J. O’Connell, Why prokaryotes have pangenomes. Nat. Microbiol. 2, 17040 (2017). [DOI] [PubMed] [Google Scholar]
  • 34.A. Moulana, R. E. Anderson, C. S. Fortunato, J. A. Huber, Selection is a significant driver of gene gain and loss in the pangenome of the bacterial genus Sulfurovum in geographically distinct deep-sea hydrothermal vents. mSystems 5, e00673-19 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.M. F. Polz, E. J. Alm, W. P. Hanage, Horizontal gene transfer and the evolution of bacterial and archaeal population structure. Trends Genet. 29, 170–175 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.O. Popa, E. Hazkani-Covo, G. Landan, W. Martin, T. Dagan, Directed networks reveal genomic barriers and DNA repair bypasses to lateral gene transfer among prokaryotes. Genome Res. 21, 599–609 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.V. Daubin, Bacterial genomes as new gene homes: The Genealogy of ORFans in E. coli. Genome Res. 14, 1036–1042 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.C. Parsons, E. E. Stüeken, C. J. Rosen, K. Mateos, R. E. Anderson, Radiation of nitrogen-metabolizing enzymes across the tree of life tracks environmental transitions in Earth history. Geobiology 19, 18–34 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.K. Raymann, C. Brochier-Armanet, S. Gribaldo, The two-domain tree of life is linked to a new root for the Archaea. Proc. Natl. Acad. Sci. U.S.A. 112, 6670–6675 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.T. A. Williams, C. J. Cox, P. G. Foster, G. J. Szöllősi, T. M. Embley, Phylogenomics provides robust support for a two-domains tree of life. Nat. Ecol. Evol. 4, 138–147 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.K. Zaremba-Niedzwiedzka, E. F. Caceres, J. H. Saw, D. Bäckström, L. Juzokaite, E. Vancaester, K. W. Seitz, K. Anantharaman, P. Starnawski, K. U. Kjeldsen, M. B. Stott, T. Nunoura, J. F. Banfield, A. Schramm, B. J. Baker, A. Spang, T. J. G. Ettema, Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017). [DOI] [PubMed] [Google Scholar]
  • 42.T. Lepage, D. Bryant, H. Philippe, N. Lartillot, A general comparison of relaxed molecular clock models. Mol. Biol. Evol. 24, 2669–2680 (2007). [DOI] [PubMed] [Google Scholar]
  • 43.G. P. Fournier, K. R. Moore, L. T. Rangel, J. G. Payette, L. Momper, T. Bosak, The Archean origin of oxygenic photosynthesis and extant cyanobacterial lineages. Proc. Biol. Sci. 288, 20210675 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.S. J. Mojzsis, G. Arrhenius, K. D. McKeegan, T. M. Harrison, A. P. Nutman, C. R. Friend, Evidence for life on Earth before 3,800 million years ago. Nature 384, 55–59 (1996). [DOI] [PubMed] [Google Scholar]
  • 45.K. Mendler, H. Chen, D. H. Parks, B. Lobb, L. A. Hug, A. C. Doxey, AnnoTree: Visualization and exploration of a functionally annotated microbial tree of life. Nucleic Acids Res. 47, 4442–4448 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.L. A. David, E. J. Alm, Rapid evolutionary innovation during an Archaean genetic expansion. Nature 469, 93–96 (2011). [DOI] [PubMed] [Google Scholar]
  • 47.E. Jacox, C. Chauve, G. J. Szöllősi, Y. Ponty, C. Scornavacca, ecceTERA: Comprehensive gene tree-species tree reconciliation using parsimony. Bioinformatics 32, 2056–2058 (2016). [DOI] [PubMed] [Google Scholar]
  • 48.C. Burke, P. Steinberg, D. Rusch, S. Kjelleberg, T. Thomas, Bacterial community assembly based on functional genes rather than species. Proc. Natl. Acad. Sci. U.S.A. 108, 14288–14293 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.S. Sievert, R. Kiene, H. Schulz-Vogt, The sulfur cycle. Oceanography 20, 117–123 (2007). [Google Scholar]
  • 50.M. Wagner, A. J. Roger, J. L. Flax, G. A. Brusseau, D. A. Stahl, Phylogeny of dissimilatory sulfite reductases supports an early origin of sulfate respiration. J. Bacteriol. 180, 2975–2982 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.D. E. Canfield, A. Teske, Late Proterozoic rise in atmospheric oxygen concentration inferred from phylogenetic and sulphur-isotope studies. Nature 382, 127–132 (1996). [DOI] [PubMed] [Google Scholar]
  • 52.B. Meyer, J. F. Imhoff, J. Kuever, Molecular analysis of the distribution and phylogeny of the soxB gene among sulfur-oxidizing bacteria–Evolution of the Sox sulfur oxidation enzyme system. Environ. Microbiol. 9, 2957–2977 (2007). [DOI] [PubMed] [Google Scholar]
  • 53.C. L. Blättler, M. W. Claire, A. R. Prave, K. Kirsimäe, J. A. Higgins, P. V. Medvedev, A. E. Romashkin, D. V. Rychanchik, A. L. Zerkle, K. Paiste, T. Kreitsmann, I. L. Millar, J. A. Hayles, H. Bao, A. V. Turchyn, M. R. Warke, A. Lepland, Two-billion-year-old evaporites capture Earth’s great oxidation. Science 360, 320–323 (2018). [DOI] [PubMed] [Google Scholar]
  • 54.B. B. Jørgensen, A. J. Findlay, A. Pellerin, The biogeochemical sulfur cycle of marine sediments. Front. Microbiol. 10, 849 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.C. G. Friedrich, F. Bardischewsky, D. Rother, A. Quentmeier, J. Fischer, Prokaryotic sulfur oxidation. Curr. Opin. Microbiol. 8, 253–259 (2005). [DOI] [PubMed] [Google Scholar]
  • 56.U. Zander, A. Faust, B. U. Klink, D. de Sanctis, S. Panjikar, A. Quentmeier, F. Bardischewsky, C. G. Friedrich, A. J. Scheidig, Structural basis for the oxidation of protein-bound sulfur by the sulfur cycle molybdohemo-enzyme sulfane dehydrogenase SoxCD. J. Biol. Chem. 286, 8349–8360 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.G. Feulner, C. Hallmann, H. Kienert, Snowball cooling after algal rise. Nat. Geosci. 8, 659–662 (2015). [Google Scholar]
  • 58.R. J. Charlson, J. E. Lovelock, M. O. Andreae, S. G. Warren, Oceanic phytoplankton, atmospheric sulphur, cloud albedo and climate. Nature 326, 655–661 (1987). [Google Scholar]
  • 59.J. S. S. Damste, J. W. De Leeuw, Analysis, structure and geochemical significance of organically-bound sulphur in the geosphere: State of the art and future research. Org. Geochem. 16, 1077–1101 (1990). [Google Scholar]
  • 60.S. D. Domagal-Goldman, V. S. Meadows, M. W. Claire, J. F. Kasting, Using biogenic sulfur gases as remotely detectable biosignatures on anoxic planets. Astrobiology 11, 419–441 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.D. H. Parks, M. Chuvochina, P.-A. Chaumeil, C. Rinke, A. J. Mussig, P. Hugenholtz, A complete domain-to-species taxonomy for Bacteria and Archaea. Nat. Biotechnol. 38, 1079–1086 (2020). [DOI] [PubMed] [Google Scholar]
  • 62.D. H. Parks, M. Chuvochina, D. W. Waite, C. Rinke, A. Skarshewski, P.-A. Chaumeil, P. Hugenholtz, A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018). [DOI] [PubMed] [Google Scholar]
  • 63.M. D. Lee, GToTree: A user-friendly workflow for phylogenomics. Bioinformatics 35, 4162–4164 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.L. A. Hug, B. J. Baker, K. Anantharaman, C. T. Brown, A. J. Probst, C. J. Castelle, C. N. Butterfield, A. W. Hernsdorf, Y. Amano, K. Ise, Y. Suzuki, N. Dudek, D. A. Relman, K. M. Finstad, R. Amundson, B. C. Thomas, J. F. Banfield, A new view of the tree of life. Nat. Microbiol. 1, 16048 (2016). [DOI] [PubMed] [Google Scholar]
  • 65.D. Hyatt, G.-L. Chen, P. F. Locascio, M. L. Land, F. W. Larimer, L. J. Hauser, Prodigal: Prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.S. R. Eddy, Accelerated profile HMM searches. PLOS Comput. Biol. 7, e1002195 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.R. C. Edgar, MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.S. Capella-Gutiérrez, J. M. Silla-Martínez, T. Gabaldón, trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.M. N. Price, P. S. Dehal, A. P. Arkin, FastTree 2–Approximately maximum-likelihood trees for large alignments. PLOS ONE 5, e9490 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.A. Stamatakis, RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.N. Lartillot, T. Lepage, S. Blanquart, PhyloBayes 3: A Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25, 2286–2288 (2009). [DOI] [PubMed] [Google Scholar]
  • 72.C. Magnabosco, K. R. Moore, J. M. Wolfe, G. P. Fournier, Dating phototrophic microbial lineages with reticulate gene histories. Geobiology 16, 179–189 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.J. L. Thorne, H. Kishino, I. S. Painter, Estimating the rate of evolution of the rate of molecular evolution. Mol. Biol. Evol. 15, 1647–1657 (1998). [DOI] [PubMed] [Google Scholar]
  • 74.A. J. Drummond, S. Y. W. Ho, M. J. Phillips, A. Rambaut, Relaxed phylogenetics and dating with confidence. PLOS Biol. 4, e88 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.M. Kanehisa, M. Furumichi, Y. Sato, M. Ishiguro-Watanabe, M. Tanabe, KEGG: Integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.M. Kanehisa, Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.M. Kanehisa, S. Goto, KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.K. Anantharaman, B. Hausmann, S. P. Jungbluth, R. S. Kantor, A. Lavy, L. A. Warren, M. S. Rappé, M. Pester, A. Loy, B. C. Thomas, J. F. Banfield, Expanded diversity of microbial groups that shape the dissimilatory sulfur cycle. ISME J. 12, 1715–1728 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.K. Finster, Microbiological disproportionation of inorganic sulfur compounds. J. Sulfur Chem. 29, 281–292 (2008). [Google Scholar]
  • 80.R. Sánchez, F. Serra, J. Tárraga, I. Medina, J. Carbonell, L. Pulido, A. de María, S. Capella-Gutíerrez, J. Huerta-Cepas, T. Gabaldón, J. Dopazo, H. Dopazo, Phylemon 2.0: A suite of web-tools for molecular evolution, phylogenetics, phylogenomics and hypotheses testing. Nucleic Acids Res. 39, W470–W474 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.B. Q. Minh, H. A. Schmidt, O. Chernomor, D. Schrempf, M. D. Woodhams, A. von Haeseler, R. Lanfear, IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.B. Q. Minh, M. A. T. Nguyen, A. von Haeseler, Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol. 30, 1188–1195 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.W. Duchemin, G. Gence, A.-M. Arigon Chifolleau, L. Arvestad, M. S. Bansal, V. Berry, B. Boussau, F. Chevenet, N. Comte, A. A. Davín, C. Dessimoz, D. Dylus, D. Hasic, D. Mallo, R. Planel, D. Posada, C. Scornavacca, G. Szöllosi, L. Zhang, É. Tannier, V. Daubin, RecPhyloXML: A format for reconciled gene trees. Bioinformatics 34, 3646–3652 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.E. A. Bell, P. Boehnke, T. Mark Harrison, W. L. Mao, Potentially biogenic carbon preserved in a 4.1 billion-year-old zircon. Proc. Natl. Acad. Sci. U.S.A. 112, 14518–14521 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.J. L. Eigenbrode, K. H. Freeman, Late Archean rise of aerobic microbial ecosystems. Proc. Natl. Acad. Sci. U.S.A. 103, 15759–15764 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.J. M. Wolfe, G. P. Fournier, Horizontal gene transfer constrains the timing of methanogen evolution. Nat. Ecol. Evol. 2, 897–903 (2018). [DOI] [PubMed] [Google Scholar]
  • 87.A. Bekker, H. D. Holland, P.-L. Wang, D. Rumble III, H. J. Stein, J. L. Hannah, L. L. Coetzee, N. J. Beukes, Dating the rise of atmospheric oxygen. Nature 427, 117–120 (2004). [DOI] [PubMed] [Google Scholar]
  • 88.K. Pang, Q. Tang, J. D. Schiffbauer, J. Yao, X. Yuan, B. Wan, L. Chen, Z. Ou, S. Xiao, The nature and origin of nucleus-like intracellular inclusions in Paleoproterozoic eukaryote microfossils. Geobiology 11, 499–510 (2013). [DOI] [PubMed] [Google Scholar]
  • 89.E. J. Javaux, C. P. Marshall, A. Bekker, Organic-walled microfossils in 3.2-billion-year-old shallow-marine siliciclastic deposits. Nature 463, 934–938 (2010). [DOI] [PubMed] [Google Scholar]
  • 90.T. M. Gibson, P. M. Shih, V. M. Cumming, W. W. Fischer, P. W. Crockford, M. S. W. Hodgskiss, S. Wörndle, R. A. Creaser, R. H. Rainbird, T. M. Skulski, G. P. Halverson, Precise age of Bangiomorpha pubescens dates the origin of eukaryotic photosynthesis. Geology 46, 135–138 (2018). [Google Scholar]
  • 91.N. J. Butterfield, Bangiomorpha pubescens n. gen., n. sp.: Implications for the evolution of sex, multicellularity, and the Mesoproterozoic/Neoproterozoic radiation of eukaryotes. Paleobiology 26, 386–404 (2000). [Google Scholar]
  • 92.K. Pang, Q. Tang, L. Chen, B. Wan, C. Niu, X. Yuan, S. Xiao, Nitrogen-fixing heterocystous cyanobacteria in the Tonian period. Curr. Biol. 28, 616–622.e1 (2018). [DOI] [PubMed] [Google Scholar]
  • 93.S. Golubic, V. N. Sergeev, A. H. Knoll, Mesoproterozoic Archaeoellipsoidès: Akinetes of heterocystous cyanobacteria. Lethaia 28, 285–298 (1995). [DOI] [PubMed] [Google Scholar]
  • 94.G. D. Love, E. Grosjean, C. Stalvies, D. A. Fike, J. P. Grotzinger, A. S. Bradley, A. E. Kelly, M. Bhatia, W. Meredith, C. E. Snape, S. A. Bowring, D. J. Condon, R. E. Summons, Fossil steroids record the appearance of Demospongiae during the Cryogenian period. Nature 457, 718–721 (2009). [DOI] [PubMed] [Google Scholar]
  • 95.J. A. Zumberge, G. D. Love, P. Cárdenas, E. A. Sperling, S. Gunasekera, M. Rohrssen, E. Grosjean, J. P. Grotzinger, R. E. Summons, Demosponge steroid biomarker 26-methylstigmastane provides evidence for Neoproterozoic animals. Nat. Ecol. Evol. 2, 1709–1714 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Tables S1 and S2

Figs. S1 to S6


Articles from Science Advances are provided here courtesy of American Association for the Advancement of Science

RESOURCES