Abstract
Members of Methylotenera are signature denitrifiers and methylotrophs commonly found together with methanotrophic bacteria in lakes and freshwater sediments. Here, we show that three distinct Methylotenera ecotypes were abundant in methane-rich groundwaters recharged during the Pleistocene. Just like in surface water biomes, groundwater Methylotenera often co-occurred with methane-oxidizing bacteria, even though they were generally unable to denitrify. One abundant Methylotenera ecotype expressed a pathway for aerobic methane production from methylphosphonate. This phosphate-acquisition strategy was recently found to contribute to methane production in the oligotrophic, oxic upper ocean. Gene organization, phylogeny, and 3D protein structure of the key enzyme, carbon–phosphorus lyase subunit PhnJ, were consistent with a role in phosphate uptake. We conclude that phosphate may be a limiting nutrient in productive, methane-rich aquifers, and that methylphosphonate degradation might contribute to groundwater methane production.
Keywords: methane paradox, Methylotenera, methylotroph, methanotroph, methylphosphonate, C-P lyase, groundwater
Introduction
Methylotenera (Pseudomonadota, Gammaproteobacteria, Burkholderiales, Methylophilaceae) are commonly found in freshwater lakes [1–3]. They are methylotrophs and facultative denitrifiers, specialized in using reduced one-carbon compounds such as methanol and methylamine as their carbon and energy sources [4]. Several isolates were obtained from Lake Washington sediments, including Methylotenera mobilis JLW8 [5], Methylotenera Strain G11, Methylotenera Strain L2L1 [1], Methylotenera versatilis 301, and M. versatilis 30S [6]. Methylotenera oryzisoli La3113 was isolated from rice field soil [7, 8].
Methane is a potent greenhouse gas that is prevalent in groundwater [9–11]. Although Methylotenera do not directly metabolize methane, they are known to engage in partnerships with methane-oxidizing Methylobacter. In these partnerships, which are beneficial to both, Methylobacter provide methanol and formaldehyde to denitrifying Methylotenera, whereas the presence of Methylotenera stimulates the growth and methane oxidation rate of Methylobacter [12–16].
Methylotenera have mainly been studied in surface water, but recent studies found they were one of the most abundant groundwater residents (up to 88% of all bacteria) [17, 18]. Fifty-five amplicon sequence variants (ASVs) affiliated with Methylotenera were found in groundwater from 112 out of 138 wells, with the relative abundance ranging from 0.023% to 89%, indicating that these bacteria are even more common and diverse in groundwater than in surface water [18]. Compared to surface water, opportunities for migration of microorganisms in groundwater are more limited. Groundwater can be over 10 000 years old, recharged during or before the last deglaciation and the start of the Holocene. Many aquifers are “confined”, covered by an impermeable layer that prevents mixing with younger water and preserves their isolation [19].
Groundwater aquifers are also structured habitats. The occurrence of spatial gradients might be needed to explain seemingly conflicting observations. For example, a single groundwater sample may contain both oxygen and abundant aerobic Methylotenera as well as methane and anaerobic, methane-producing Methanobacteriaceae [18]. This observation initially appears paradoxical because methanogens are typically considered strict anaerobes. However, a similar situation has been observed in the ocean, where aerobic methane production from organic phosphonates ultimately explained the presence of methane in an oxic ecosystem [20–22]. Here, we show that organic phosphonates may also contribute to methane production in pristine groundwater aquifers.
Briefly, we recovered thirty-six metagenome-assembled-genomes (MAGs) of Methylotenera from nineteen groundwater samples, with methane concentrations varying from below detection limit to 69 mg/L. We investigated the metabolic potential, activity, and biogeography of these Methylotenera populations based on their genomes and protein expression. Three groundwater ecotypes of Methylotenera were observed. One of the ecotypes appeared to produce methane aerobically from methylphosphonate.
Materials and methods
Groundwater sampling, amplicon, metagenome, and proteome sequencing
Sampling of groundwater wells in Alberta, Canada, and measurements of physicochemical parameters have been previously described [17]. Detailed information on well locations, aquifers, lithology, geology, and aqueous geochemistry of the groundwater samples is provided (Supplementary Table S1). DNA extraction, amplicon sequencing, shotgun metagenomic sequencing, protein extraction, and analysis were previously described [18]. This study included a 16S rRNA gene amplicon sequencing dataset with 265 groundwater samples collected from 138 wells from 2016 to 2021, 26 metagenomes collected from 25 wells in 2019 and 2022, and 5 proteomes collected from 5 wells in 2019.
Metagenome and proteome analysis
Quality trimming of raw reads, assembly, binning, and read mapping were previously described [18]. Quality and characteristics of the obtained MAGs were estimated with CheckM2 v0.1.3 [23]. Estimated genome size was calculated as actual MAG size divided by estimated MAG completeness. The taxonomic identity of MAGs was obtained with GTDBtk v2.3.0 [24]. In total, thirty-six MAGs affiliated with Methylotenera were obtained. There were twenty high-quality MAGs (>90% completeness and < 5% contamination), thirteen medium-quality MAGs (>50% completeness and < 10% contamination), and three MAGs of lower quality (completeness >85% and 10%–15% contamination). Average nucleotide identity (ANI) between the thirty-six MAGs was obtained with FastANI v1.33 [25]. The twenty high-quality MAGs were used to infer the metabolism based on gene content. Expression of proteins of nine Methylotenera MAGs was detected in proteomes obtained for five proteomes as previously described [18].
Relative abundance of populations
The relative abundance of Methylotenera and methanogens was calculated in two ways: (1) based on sequencing depths of MAGs estimated with the “checkm coverage” and “checkm profile” commands within CheckM v1.1.3b [26], and (2) based on sequencing depths of single copy marker genes estimated with SingleM v0.18.3 [27]. The relative abundance calculated with the first approach was generally lower than that calculated with the second approach (paired t-test, P < 0.05) but they showed the same trend (Pearson correlation, P < 0.05, Supplementary Table S1).
Annotation
MAGs and unbinned contigs were annotated with MetaErg v2.3.39 [28]. Key enzymes involved in methylotrophic processes were analyzed with Hidden Markov Models (HMMs) obtained from a previous study [29], which collected 1016 reference protein sequences from published genomes of methylotrophs. Twelve enzymes involved in cobalamin synthesis and transport were analyzed with HMMs downloaded from TIGRFAM and Pfam as previously described [30].
Estimation of the amount of time passed since the common ancestor of closely related Methylotenera colonized these groundwaters
Amongst Methylotenera MAGs, there were several that were very closely related, with average nucleotide identities of 99.906% to 99.999%. To estimate the amount of time that may separate these close relatives found in thousands-year-old groundwater, we made use of previous estimates of bacterial mutation rates. The bacterial mutation rate varies from 10−8 to 10−5 nucleotide substitutions per site per year [31, 32]. The mutation rate decreases as the sampling timescale increases, because a substantial fraction of mutations may appear over a short time scale but will ultimately be removed by natural selection. Therefore, the lower rate of 10−8 nucleotide substitutions per site per year was used.
Phylogeny of the Methylotenera genus
Eighty-eight Methylotenera reference genomes together with 313 genomes from other genera of the Methylophilaceae family were downloaded from the Genome Taxonomy Database (GTDB) version 214 [24]. Thirty-six Methylotenera MAGs recovered from this study together with these reference genomes were included in the phylogenetic analysis. Four genomes from other families in the Burkholderiales order were used as the outgroup. Seventy-one single-copy marker genes included in the bacterial marker gene set in Anvi’o v7.1 [33] were extracted from the genomes and aligned with the “anvi-get-sequences-for-hmm-hits” command. The phylogenetic tree was constructed using default parameters with the “anvi-gen-phylogenomic-tree” command (which uses FastTree [34]) in Anvi’o v7.1 and visualized using iTOL v6.9 [35].
Phylogeny, gene organization, and three-dimensional protein structure analysis of the carbon–phosphorus lyase core complex subunit J
A maximum-likelihood phylogenetic tree of the C-P lyase core complex subunit J (PhnJ) was constructed. First, PhnJ sequences from both binned and unbinned contigs in groundwater metagenomes were extracted based on annotation with MetaErg v2.3.39 [28] and reference sequences were obtained from the Metaerg database. This database contains RefSeq-annotated proteins from organisms in the GTDB database version 214 [24]. Reference PhnJ sequences were dereplicated and proteins shorter than 150 amino acids were not further considered. Next, a multiple sequence alignment of the recovered PhnJ sequences and reference sequences was constructed using MAFFT v7.508 with default parameters [36]. Poorly aligned regions were removed with trimAl v1.4 [37]. The phylogenetic tree was constructed with “-m MFP -B 1000” in IQ-TREE v2.2.0.3 [38] and visualized using iTOL v6.9 [35].
The gene clusters of the Phn complex found in MAGs were compared with the experimentally verified Escherichia coli K-12 gene cluster [39] with clinker v0.0.28 [40].
3D protein structure of the PhnJ proteins in Methylotenera and Methylomonadaceae was predicted with ColabFold v1.5.5 [41] by combining the fast homology search of MMseqs2 [42] with AlphaFold2 [43]. Predicted protein structures were searched against the Protein Data Bank database [44] with Foldseek [45]. The top hits of PhnJ in both Methylotenera and Methylomonadaceae were experimentally validated PhnJ proteins in E. coli K-12 (7Z19) [46] with e-values <10−40. 3D protein structure of the recovered PhnJ proteins was aligned with the validated PhnJ and visualized in ChimeraX v1.7.1. Positions of the four conserved cysteine residues and a glycine residue were highlighted.
Identification of the methylphosphonate synthase
Reference sequences of the methylphosphonate synthase (MPnS) were obtained as previously described [47]. All assembled contigs were searched against reference sequences with DIAMOND v2.0.9 using “-e 1e-5” [48]. The extracted putative MPnS sequences were aligned with reference sequences using MAFFT v7.508 with default parameters [36]. Poorly aligned regions were removed with trimAl v1.4 [37]. The phylogenetic tree was constructed with “-m MFP -B 1000” in IQ-TREE v2.2.0.3 [38] and visualized using iTOL v6.9 [35]. The presence of the 2-histidine-1-glutamine iron-coordinating triad and the two glutamine-adjacent residues (phenylalanine and isoleucine) required for methylphosphonate synthesis [47] were manually checked based on the multiple sequence alignment (Supplementary dataset 1).
Statistical analyses
Methane concentrations and the relative abundances of Methylotenera were tested for normality with the Kolmogorov–Smirnov test in PASW Statistics v18.0. As the data did not follow a normal distribution, the Spearman’s rank correlation coefficient (R) between methane concentration and relative abundance of Methylotenera was calculated in PASW Statistics v18.0. Genome size and coding density of Methylotenera in lake water and groundwater habitats were compared in t-tests in PASW Statistics v18.0. Statistical analyses were considered significant with P-values <0.05.
Results and discussion
Biogeography of groundwater Methylotenera
Consistent with previous work on surface water [14–16], the methane concentration explained 10% of the variability in Methylotenera abundance (Fig. 1a; Supplementary Table S2). As we will show below, methane concentrations explained 18% of the variability in the relative abundance of one Methylotenera ecotype, whereas it showed no relationship with the relative abundance of the other two ecotypes. What was striking about groundwater as a Methylotenera habitat, was that Methylotenera were so abundant, up to 88% relative sequence abundance in 16S rRNA gene amplicon libraries. To investigate the biogeography of Methylotenera in poorly connected groundwater habitats and discover what type of metabolisms contributed to their abundance, we analyzed shotgun metagenomes of twenty-six groundwater samples (Supplementary Table S1).
Figure 1.
Wide dispersal of Methylotenera in unconnected groundwater habitats with highly variable methane and oxygen concentrations. (a) Methane concentrations explain 10% of the variability in Methylotenera abundances based on 16S rRNA gene amplicon data [18]. The Spearman’s rank correlation coefficient was used as the data were not normally distributed (Supplementary Table S2). The relative abundance of Methylotenera in shotgun-sequenced samples calculated with SingleM [27] is in blue. (b) Occurrence of five distinct Methylotenera populations in groundwater recharged during the Pleistocene, across a range of oxygen and methane concentrations. These populations only occurred in 13 out of a total of 26 samples; the others are not shown here. Each population consists of a set of MAGs with ANI >99.9% and is indicated with a different shape: Diamonds (99.994% ANI), triangles (99.957%–99.985% ANI), circles (99.906%–99.999% ANI), squares (99.972%–99.996% ANI), and black dots (99.932% ANI; Fig. 2a; Fig. S1; Supplementary Table S4). Numbers in brackets following sample IDs indicate groundwater age measured with 14C in dissolved inorganic carbon (DIC). * shows detection of 3H, an indicator for mixing with young water. The arrow indicates the general direction of groundwater flow, from the Rocky Mountains in the southwest towards the northeast. The colours of the symbols indicate the aquifers sampled. Aquifers are named after the hosting geological formation (background colours). The map was created with ArcGIS pro v2.4.1. Thirteen out of twenty-six total samples (Supplementary Table S1) are shown on the map.
Thirty-six MAGs of Methylotenera were recovered (Supplementary Table S3). Twenty of these MAGs had estimated completeness >90% and contamination <5%. Based on sequencing reads mapping to these Methylotenera MAGs, the highest Methylotenera relative sequence abundance observed in the shotgun data was 35%, lower than observed with amplicon sequencing but still relatively high compared to surface water habitats.
Amongst the recovered Methylotenera MAGs, there were five sets of nearly identical MAGs (>99.9% ANI, Fig. 1b; Fig. S1; Supplementary Table S4). One might consider these five sets to represent five different Methylotenera populations. Based on estimates of bacterial mutation rates [31, 32], each of these populations emerged from a common ancestor between 1000 and 94 000 years ago, for populations sharing 99.999% and 99.906% ANI respectively. That is relatively recent in evolutionary terms. Sometimes MAGs from the same Methylotenera population were observed in the same aquifer. These nearly identical MAGs were also found in different aquifers that are currently not connected.
Based on radiocarbon and tritium dating (Supplementary Table S1), the groundwater in many of our samples recharged during the Pleistocene, during or before the last deglaciation (12 000–115 000 years ago) [49], with occasional contributions of much younger water (Fig. 1b). Groundwater recharge was much more active during ice ages, and the >2 km thick Laurentide ice sheet that covered much of Canada east of our study area may have altered or reversed groundwater flow directions [50]. During the Pleistocene, aquifers that are currently isolated from each other may have been connected. Supercharged groundwater flow could explain colonization of presently unconnected aquifers by the observed Methylotenera populations. Some of these populations must have lived isolated from each other and the surface for tens of thousands of years.
When the same aquifer was sampled at multiple locations, very different concentrations of oxygen and methane were measured, indicating that each aquifer contains different environments, likely impacted by local bedrock features such as deposits of shale and pyrite. Thus, each aquifer may offer multiple ecological niches that could favour different Methylotenera populations. Methane and oxygen were often present together, just like in the upper oceans. In the groundwater though, methane concentrations were always higher. Based on the isotopic signature, methane mainly originated from methanogenesis [17]. The presence of “dark oxygen” in ancient waters has been noted before [17]. It may indicate active oxygen production as it is hard to explain how oxygen could persist for thousands of years in the presence of methane; in addition to Methylotenera, these aquifers also hosted methane-oxidizing Methylomonadaceae (data not shown).
Phylogeny of groundwater Methylotenera
Methylotenera phylogeny was analyzed based on 71 bacterial marker genes and 88 reference genomes publicly available in GTDB version 214. The resulting phylogenetic tree featured five clades associated with specific habitats (Fig. 2a, Supplementary Table S5). Two of those (LW1, LW2) were made up exclusively of genomes found in the water column of lakes. Thirty-six of the groundwater MAGs formed three clades, G1, G2, and G3, which also contained some genomes previously obtained from groundwater, groundwater-associated habitats, or sediments.
Figure 2.
Groundwater and lake water Methylotenera have distinct phylogeny and genome size. (a) Phylogenetic tree of Methylotenera MAGs recovered in this study and reference genomes from GTDB including those of isolated strains. The scale bar inside the tree corresponds to 0.1 nucleotide substitution per site. Curves show groups of genomes that share >95% ANI (species). Black thick edges around the circles indicate the presence of C-P lyase genes involved in methylphosphonate breakdown (see Fig. 4). Marker shapes correspond to the five populations each consisting of a set of MAGs with >99.9% ANI (Fig. 1b). The ring around the tree indicates habitat. (b) Estimated genome size and (c) coding density of Methylotenera genomes recovered from different environments. Both MAG and sample (Fig. 1b) identifiers are indicated for each recovered genome.
G1 consisted of nineteen groundwater MAGs and three reference genomes, one from a groundwater well in Colorado, USA, and the other two recovered from a sand filter for a drinking water treatment plant in the Netherlands. G1 genomes shared >86% ANI and comprised seven different species-level clades, each sharing >97% ANI [51]. Four of these populations consisted of a set of MAGs with >99.9% ANI (Fig. 1b).
G2 consisted of four groundwater MAGs as well as six reference genomes, two from lake sediments, two from a groundwater-derived drinking water system, one from a hydrocarbon enrichment culture, and one from a thick biofilm clogging water pumps in an ice machine. G2 genomes shared >87% ANI and comprised three different species-level clades. Methylotenera oryzisoli (GCF_004661485), isolated from the rice rhizosphere [7], M. mobilis JLW8 (GCF_000023705) [5] and Methylotenera sp. L2L1 (GCF_000744605) [1], both isolated from Lake Washington sediments, formed a sister clade to G2. Methylotenera sp. G11 (GCF_ 000799735) [1], also from Lake Washington sediments, was at the root of G2. As we will show below, in groundwater G2 is less abundant than both G1 and G3, and mainly found in water with low methane concentrations.
G3 consisted of thirteen groundwater MAGs and two reference genomes, one from a lake sediment and one from the above-mentioned sand-filter. G3 genomes shared >78% ANI and comprised six different species-level clades, and one population consisting of two MAGs with >99.9% ANI (Fig. 1b). M. versatilis 301 (GCF_000093025) [6], isolated from Lake Washington sediments, was part of G3. As we will show below, G3 Methylotenera are associated with high methane concentrations and aerobic methane production from methylphosphonate.
Genomes recovered from the water columns of lakes were mainly within LW1 and LW2, with only three exceptions (Fig. 2a). They were mostly retrieved from a comprehensive dataset of metagenomes from stratified freshwater lakes in Sweden, Finland, Switzerland, and other countries [52]. LW1 and LW2 had significantly smaller genome size and higher coding density than groundwater Methylotenera clades (P < 0.05; Fig. 2b). This indicated that surface water Methylotenera were experiencing genome streamlining [53], which was previously observed within the same family (Methylophilaceae) for two pelagic lineages, Ca. Methylopumilus and OM43 [29]. LW1 and LW2 were not monophyletic but each shared a common ancestor with G3 Methylotenera (Fig. 2a). Therefore, ancestors of G3 Methylotenera must have colonized surface waters in at least two independent waves, with both resulting in a massive loss of genes. Apparently, surface water environments select for Methylotenera with limited, specialized metabolisms living on minimized energy and/or nutrient budgets. In contrast, groundwater Methylotenera are able to maintain a much larger accessory genome, leaving more room for metabolic flexibility and other cellular features. The intertwined evolutionary histories of groundwater and sediment Methylotenera points at overlaps in ecological niche space between these two structured habitats.
Metabolisms of groundwater and lake water Methylotenera
To explore if G1–3 might occupy distinct ecological niches and how those niches were different from LW1–2 lake Methylotenera, we compared the gene content of twenty-seven high-quality MAGs associated with G1–3, LW1–2. Expression of genes by groundwater Methylotenera was inferred from proteomics analysis of five groundwater samples (Fig. 3; Supplementary Tables S6 and S7).
Figure 3.
Metabolisms of Methylotenera in groundwater and lake water habitats. The heatmaps indicate the presence of associated genes in high-quality genomes of G1 (n = 11), G2 (n = 4), G3 (n = 5), LW1 (n = 2), and LW2 (n = 5). Pathways with at least one gene expressed in proteomes are indicated with stars. The metabolic pathways more enriched in Methylotenera genomes sampled from groundwater are shown on the left. Genes enriched in lake water are shown on the right. Abbreviations as follows: flavin mononucleotide (FMN), ribulose monophosphate cycle (RuMP cycle), hexulose-6-phosphate (H6P), fructose 6-phosphate (F6P), ribulose 5-phosphate (Ru5P), and glyceraldehyde-3-phosphate (G3P). The more detailed information is provided (Supplementary Table S6).
Key enzymes involved in the Calvin cycle, RubisCO and phosphoribulokinase, were exclusively observed in all LW2 genomes, indicating that only those lake water Methylotenera were facultative autotrophs. In contrast, lactate dehydrogenase was exclusively observed in all G1–2 genomes, indicating the potential for fermentative lactic acid production by groundwater Methylotenera. Urease was enriched in some G1–G2 genomes but absent in LW1–2, indicating that urea was a relevant nitrogen source for groundwater but not lake water Methylotenera. The decaheme-associated outer membrane protein MtrB/PioB for extracellular electron transport was exclusively present in some LW1–2 genomes.
LW1 was the only clade that lacked genes for flagella and type IV pili. Genes involved in chemotaxis were present and expressed in all G1–3 genomes but not in LW1–2. As both groundwater and sediment habitats are more structured than lake water, motility and chemotaxis could be essential to cope with spatial gradients of oxygen and nutrients.
Bacteriohemerythrin was observed in all G1 genomes and three G2 genomes. Expression of bacteriohemerythrin was also observed in proteomes (Supplementary Table S7). It was not present in LW1–2. Bacteriohemerythrin is used to bind oxygen, for storage, enzyme delivery, and sensing [54]. These functions could be important in a habitat with spatial oxygen gradients or temporal oxygen fluctuations. In lake water, oxygen levels might be more stable. Defence against reactive oxygen species (ROS) was still important in groundwater habitats, as catalase, peroxiredoxin, thioredoxin, superoxide dismutase, and peroxidase were present in all G1–3 genomes. ROS defence proteins were actively expressed in groundwater—the total relative abundance of ROS defence proteins was about 1% to 4% in Methylotenera proteomes.
As for cobalamin synthesis, most core genes were present in all G1–2 genomes. All G3, LW1–2 contained genes for corrin ring biosynthesis but lacked those involved in following steps, final synthesis and repair of cobalamin [30]. All genomes had the cobalamin transporter, BtuB, a TonB-dependent outer membrane cobalamin receptor and transporter, which was expressed as well, indicating these populations could also acquire it from their environment. All genomes contained 5,6-dimethylbenzimidazole (DMB) synthase, BluB, which is involved in synthesis and activation of DMB, the cobalamin lower ligand. The expression of BluB was only detected in G3, indicating G3 synthesized cobalamin from DMB.
With regard to methylotrophy, the lanthanide-dependent methanol dehydrogenase Xox was present in all genomes, whereas the calcium-dependent methanol dehydrogenase was only present amongst some G1–2 and all G3 genomes. All genomes lacked enzymes for direct conversion of formaldehyde to formate. Instead they used the tetrahydrofolate (THF)-dependent pathway [55]. In this pathway, formaldehyde first binds to tetrahydrofolate (H4F) spontaneously to form methylene-tetrahydrofolate (CH2-H4F), then the enzyme FolD converts CH2-H4F to methenyl-tetrahydrofolate (CH-H4F), which is subsequently converted to formyl-tetrahydrofolate (CHO-H4F). Formyltetrahydrofolate deformylase then converts this product to H4F and formate. Formyltetrahydrofolate deformylase was present in all genomes. Alternatively, formate-tetrahydrofolate ligase also catalyzes this step. It was only present in all G3 and some LW2 genomes. Formate-tetrahydrofolate ligase produces ATP, whereas formyltetrahydrofolate deformylase does not, so G3 and LW2 might conserve additional energy during formaldehyde oxidation. Formate dehydrogenase, the ribulose monophosphate pathway and the serine pathway for formaldehyde assimilation were present in all genomes. The methylcitrate cycle for oxidizing propionate to pyruvate via methylcitrate was present in most genomes [56]. Apart from methanol, formaldehyde, and formate, LW1 and LW2 were able to use methylamine, whereas three LW2 populations could also use dimethylamine.
The gene content of groundwater Methylotenera showed they were able to use methanol, formaldehyde, and formate. These were most likely provided by the accompanying methanotroph Methylomonadaceae. The co-existence of methanotrophic Methylobacter (Methylomonadaceae) and methylotrophic Methylotenera was previously observed in a methane-rich shallow aquifer [57], lake water [58], and sediments [12, 13]. The cross feeding was suggested to be regulated by the expression of the lanthanide-dependent Xox- versus the calcium-dependent Mxa-type methanol dehydrogenase in Methylobacter: the expression of Mxa with lower affinity for methanol leads to a loss of methanol to the external surroundings of cells which benefits Methylotenera. Excretion of a lanthanide-scavenging molecule by Methylotenera was shown to play a role in this interaction [59]. In the groundwater, we found both Mxa and Xox were expressed by Methylomonadaceae (Supplementary Table S6), indicating that they may have access to lanthanides. Alternatively, denitrifying Methylotenera could mop up C1 compounds that could not be oxidized further by oxygen-limited Methylomonadaceae [16]. Although previous studies showed the importance of nitrate as the electron acceptor in driving methylotrophic metabolism in Methylotenera [15, 16], we found denitrification was not so prevalent in Methylotenera, neither in lake water nor groundwater. Although the dissimilatory nitrate reductase Nar was present in most genomes, enzymes in subsequent steps of denitrification were less prevalent: the nitrite reductase NirK was only present in two LW2 genomes, co-existing with the nitric-oxide reductase Nor. One of the genomes also has the nitrous-oxide reductase NosZ. Nor and NosZ co-existed in six G1 genomes, and NosZ was also observed in a LW1 genome. Amongst all denitrifying enzymes, only Nar was expressed in proteomes. In addition to Nar, fermentation and arsenate reduction could also drive cross-feeding in groundwater Methylotenera.
Proteomics presents evidence for aerobic methane production from methylphosphonate
We detected a pathway for methylphosphonate assimilation in G3 (Fig. 3). In this pathway, the C-P lyase core complex Phn converts methylphosphonate into methane and phosphate as follows: first PhnIGHL converts methylphosphonate and ATP into alpha-D-ribose 1-methylphosphonate 5-triphosphate (RPnTP) and adenine. Next, PhnM cleaves off pyrophosphate resulting in alpha-D-ribose-1-phosphonate-5-phosphate (PRPn). Finally, PhnJ generates alpha-D-ribose-1,2-cyclic-phosphate-5-phosphate (PRcP) releasing methane. The pathway was present in nine genomes that formed a new species in G3. This species also featured one MAG (C18_maxbin.001_sub, Fig. 1a) that did not encode the C-P lyase core complex itself, but it did feature a short contig with an incomplete phosphonate metabolism transcriptional regulator PhnF next to PtsS, ArsBC, RF-2, and LysRS, the same gene arrangement found adjacent to the C-P lyase core complex genes in the other G3 Methylotenera genomes (Fig. 4a). Thus, the C-P lyase appears to be a defining feature of this G3 species, and proteomics showed that it was actively expressed (Figs 4 and 5; Supplementary Table S7).
Figure 4.
Conservation of essential genes, protein structure, and amino acids needed for methane production by G3 Methylotenera and Methylomonadaceae. (a) Gene arrangement of the Phn operon and flanking genes in experimentally verified Escherichia coli K-12, G3 Methylotenera, and Methylomonadaceae MAGs. The letters at the top indicate subunits of the Phn complex. * indicates gene expression was detected in proteomes. (b) Protein structure models of PhnJ in G3 Methylotenera (blue) and Methylomonadaceae (purple) aligned with E. coli PhnJ (protein data bank 7Z19, gold). Zoomed views of the conserved four cysteine residues and glycine-32 are shown in (c) and (d). Conserved residues are shown in corresponding dark colours. The zinc ion is shown in green. Gene abbreviations in (a) as follows: phosphonate ABC transporter ATP-binding protein (phnC), phosphonate ABC transporter substrate-binding protein (phnD), phosphonate ABC transporter permease protein (phnE), phosphonate metabolism transcriptional regulator (phnF), phosphonate C-P lyase system protein subunit GHIJKLM (phnGHIJKLM), phosphonate metabolism protein/1,5-bisphosphokinase (PRPP-forming; phnN), aminophosphonate N-acetyl-transferase (phnO), carbon-phosphorus lyase complex accessory protein (phnP), phosphate-selective porin (oprP), phosphate ABC transporter ATP-binding protein (pstB), phosphate ABC transporter permease protein (pstA), phosphate ABC transporter permease protein (pstC), phosphate ABC transporter substrate-binding protein (pstS), arsenite efflux pump (arsB), arsenate reductase (arsC), peptide chain release factor 2 (prfB), lysine-tRNA ligase (lysRS), phosphonate utilization associated putative membrane protein (eamA), alkaline phosphatase (phoX), ATP-dependent chaperone (clpB), large-conductance mechanosensitive channel protein (mscL), tRNA(Arg) A34 adenosine deaminase (tadA), phosphate regulon transcriptional regulator (phoB), and phosphate regulon sensor histidine kinase (phoR).
Figure 5.
Methane-rich groundwater is associated with methane-producing Methylotenera. (a) Relative abundance of G3 (R2 = 0.18, P = 0.046) but not G1 (R2 = 0.12, P = 0.105) and G2 Methylotenera (R2 = 0.04, P = 0.356) in metagenomes correlates with methane concentration (Supplementary Table S1, S3). G3 Methylotenera with PhnJ are indicated with thick edges around the circles. (b) Relative abundance of both methanogen MAGs and MAGs encoding the C-P lyase core complex increased with methane concentrations, whereas there was no correlation with phosphate concentrations (phosphate concentration in E1 is unknown). Arrows below sample identifiers indicate samples examined with proteomics. Purple arrows indicate the detection of MPnS. Dark blue stars indicate the expression of PhnJ. Red stars indicate the expression of methyl-coenzyme M reductase subunit alpha McrA. (c) Proposed metabolic handshakes between methanogens, methanotrophic Methylomonodaceae, and methylotrophic Methylotenera.
We explored if, amongst the bacteria in the investigated groundwater samples, methylphosphonate metabolism was unique to G3 Methylotenera. It appeared that Methylotenera’s methane-oxidizing partners, Methylomonadaceae, also encoded and expressed PhnJ, as well as chemolithoautotrophic Hydrogenophaga and many other bacteria often found in our groundwater samples, such as Phenylobacterium, Polaromonas, and Rhodoferax (Supplementary Table S8).
The Phn gene clusters found in G3 Methylotenera and Methylomonadaceae were compared to the experimentally validated E. coli gene cluster (Fig. 4a). All genes were observed in G3 Methylotenera and Methylomonadaceae, except for PhnO which is only required in the catabolism of 1-aminoalkylphosphonic acid [60]. PhnGHIJKLMNP in Methylotenera and Methylomonadaceae were in the same order as those in E. coli. For PhnCDEF in E. coli, the order of these genes was PhnFDCE in G3 Methylotenera. In Methylomonadaceaea, sometimes PhnF was not present, whereas PhnD was ahead of PhnC followed by two PhnE. The 3D structure of the key enzyme for methane production from methylphosphonate, PhnJ, was modelled and inspected (Fig. 4b). The positions of the four conserved cysteine residues (Cys241, 244, 266, and 272) in PhnJ of both G3 Methylotenera and Methylomonadaceae were confirmed (Fig. 4c). These participate in the binding of an iron–sulphur cluster and the coordination of the zinc atom. During the catalytic cycle of the enzyme, after activation by S-adenosyl-methionine, Cys272 and Gly32 (also conserved, Fig. 4d) donate and regenerate the H· radical needed for methane production [61]. We also compared the phylogeny of PhnJ to the E. coli protein (Fig. S2). The G3 Methylotenera and Methylomonadaceae proteins were closely related, sharing 80–82% amino acid identity. They were both in one of two major clades within this protein family, whereas the validated E. coli protein was in the other clade. In the ocean, PhnJ proteins from both clades were observed, whereas the clade with E. coli was more abundant and most PhnJ proteins were found in members of Alphaproteobacteria, such as SAR116 and Pelagibacter [62–64].
Breaking the stable carbon-phosphorus bond by means of the Phn complex requires four ATP [46], with an additional ATP consumed in binding the methylphosphonate itself. Oxidation of the produced methane would not yield sufficient energy to compensate for these energy expenses. However, the energy expenses are justified when growth is phosphate-limited [21, 62] and the aim is to assimilate phosphonate. The genes of the Phn complex were located next to genes encoding the phosphate ABC transporter in G3 Methylotenera and next to the Pho phosphate regulon in Methylomonadaceae (Fig. 4a). These genes are involved in the sensing and transport of phosphate [65]. An alkaline phosphatase gene, phoX, was also situated close to the Phn genes in G3 Methylotenera. PhoX is used to assimilate phosphate from organic compounds [66]. Together, these findings strongly suggested that PhnJ was expressed in response to phosphate limitation. We did not detect a relationship between the relative abundance of microorganisms with PhnJ and phosphate concentration (Fig. 5b; Supplementary Table S1). However, as mentioned before, subsurface habitats are structured and productive biofilms can be phosphate-limited even though the groundwater might still contain some phosphate [17].
Methylphosphonate is an intermediate in the breakdown of natural and man-made organic phosphonates. Studies of the marine phosphorus cycle have estimated that phosphonates make up about 10%–25% of the high molecular weight dissolved organic phosphorus compounds [67, 68]. These phosphonates can decorate lipids, polysaccharides, and proteins. Methylphosphonate is produced with the MPnS in microorganisms [47, 69]. MPnS was detected in only three samples, C4, C24, and E14 (Fig. 5b; Fig. S3). Methylphosphonate might also be derived from the breakdown of glyphosate, an important herbicide in agriculture and forestry [65]. However, as many of the investigated samples were from confined aquifers with water that infiltrated before the start of the Holocene, we would not expect herbicides to be relevant. Future research could address if groundwater harbours a geological source of methylphosphonate, and if so, what would be its isotopic composition.
Amongst groundwater Methylotenera, a high abundance of G3 with PhnJ was strongly associated with methane-rich groundwater (R2 = 0.18; P = 0.046; Fig. 5a; Fig. 1). G1 was enriched in water with high methane concentrations but was also present at lower methane concentrations. In contrast, G2 was mainly found in water with low methane concentrations. To explore if methylphosphonate-degrading Methylotenera were associated with high methane concentrations in other studies, we checked the presence of the Phn complex in all Methylotenera genomes shown in the phylogenetic tree (Fig. 2a). Phn was also found in two other Methylotenera MAGs: GCA_002842285 (in the sister group of G2) and GCA_013141495 (in G1). GCA_002842285 was recovered from a groundwater well with methane making up 99.4% of the headspace gas [70]. GCA_013141495 was from a primary rapid sand filter fed with groundwater rich in methane (5.2 mg/L) [71]. Similar to G3, the latter co-existed with several Methylomonadaceae populations [71].
To explore if methylphosphonate could potentially be a significant source of methane, we compared the relative abundances of methanogens and of PhnJ-containing microbes in methane-rich samples (Fig. 5b). Methanogens were somewhat abundant in some of the samples with high methane concentrations (e.g. E9, E21). In other samples, methanogens were barely detectable and the relative abundance of methylphosphonate degraders was much higher (e.g. C3, C17, E2, C9).
The five proteomes showed active expression of both McrA in methanogens, and PhnJ in two G3 Methylotenera and four Methylomonadaceae populations. Although G3 Methylotenera are unable to use methane, the produced methane might be oxidized by aerobic methanotroph Methylomonadaceae. Methylotenera could still benefit from methylphosphonate mineralization by cross-feeding off intermediates of methane oxidation, such as methanol, formaldehyde, and formate (Fig. 5c). As mentioned before, the large amount of energy needed to split the carbon-phosphorous bond could never be recovered by oxidizing the produced methane, so this strategy would not be able to drive net growth. Further, although methylphosphonate degradation could contribute to methane production, the isotopic signature of the methane in these samples was still consistent with methanogenesis as the main pathway of methane production [17]. Therefore, the correlation between methane concentration and Phn abundance might be explained by the fact that high methane concentrations in groundwater habitats are associated with higher productivity [17], and high productivity can lead to phosphate limitation.
Based on our results, we conclude the following: (i) Groundwater is a major habitat for Methylotenera, with relative abundances that are sometimes much higher than surface water or sediments. Indeed Methylotenera often dominated groundwater microbial communities, especially in environments high in methane, and was one of the most abundant taxa overall. Very recently, Methylotenera also appeared to be an abundant genus in glacier-fed streams [72]. These findings speak to the importance of Methylotenera as one of the most prominent and ubiquitous freshwater genera globally.
(ii) In methane-rich groundwaters, the growth of one Methylotenera ecotype appears to be phosphate-limited, as this population consistently possesses and expresses genes for methylphosphonate degradation. This emphasizes the previously proposed importance of methane-rich environments as hotspots of productivity in the terrestrial subsurface [17].
(iii) Assimilation of methylphosphate by Methylotenera means that methane can be produced in the presence of oxygen. Stable isotope composition of groundwater methane shows that the contribution of this process is likely small. However, the potential for aerobic methane production does further complicate the already muddled relationships between redox zones and redox metabolisms in groundwater ecosystems [17].
(iv) Despite the occurrence of phosphate limitation and the notion that groundwater ecosystems are generally oligotrophic, groundwater Methylotenera do not experience genomic streamlining, like their relatives in surface water [73].
(v) Many of the investigated groundwaters had average ages of >10 000 years. Laboratory experiments should now focus on how bacteria thrive in such static environments. Questions to address include: How old are their cells? How fast do they metabolize? What are the sources of oxygen and methane? These will be exciting questions guiding future discovery.
Supplementary Material
Acknowledgements
The authors thank the Groundwater Observation Well Network team members of Alberta Environment and Protected Areas (https://www.alberta.ca/groundwater-observation-well-network.aspx) for providing access to groundwater monitoring wells, for sampling and providing highest quality groundwater samples, and for sharing measurement results and expertise. Financial support from Alberta Innovates through its Water Innovation Program is gratefully acknowledged. We would like to thank the University of Calgary’s Center for Health Genomics and Informatics for sequencing and informatics services. We thank Gaute Lavik and Jan von Arx for helpful discussions.
Contributor Information
Shengjie Li, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada; Max Planck Institute for Marine Microbiology, Bremen 28359, Germany.
Xiaoli Dong, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada; Provincial Laboratory for Public Health, Calgary, AB T2N 4W4, Canada.
Pauline Humez, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada.
Joanna Borecki, Alberta Environment and Protected Areas, Calgary, AB T2L 2K8, Canada.
Jean Birks, Alberta Environment and Protected Areas, Calgary, AB T2L 2K8, Canada.
Cynthia McClain, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada; Alberta Biodiversity Monitoring Institute, University of Alberta, Edmonton, AB T6G 2E9, Canada.
Bernhard Mayer, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada.
Marc Strous, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada.
Muhe Diao, Department of Earth, Energy, and Environment, University of Calgary, Calgary, AB T2N 1N4, Canada; State Key Laboratory of Pollution Control and Resources Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai 200092, China.
Author contributions
Shengjie Li (Conceptualization, Investigation, Formal Analysis, Visualization, Writing—original draft, Writing—review & editing), Xiaoli Dong (Formal Analysis), Pauline Humez (Formal Analysis), Joanna Borecki (Formal Analysis), Jean Birks (Formal Analysis), Cynthia McClain (Formal Analysis), Bernhard Mayer (Formal Analysis, Funding acquisition), Marc Strous (Conceptualization, Formal Analysis, Funding acquisition, Resources, Supervision, Writing—review & editing), Muhe Diao (Conceptualization, Investigation, Formal Analysis, Funding acquisition, Writing—review & editing).
Conflicts of interest
The authors declare no competing interest.
Funding
This study was supported by the Fundamental Research Funds for the Central Universities to Muhe Diao, the Natural Sciences and Engineering Research Council (NSERC) through Discovery Grants to Marc Strous and Bernhard Mayer, Canada Research Chair (CRC-2020-00257, MS), the Canada Foundation for Innovation (CFI), the Digital Research Alliance of Canada, the Canada First Research Excellence Fund (CFREF), the Government of Alberta, and the University of Calgary.
Data availability
Amplicons in this study are under the Bioproject PRJNA861683 and PRJNA700657. Metagenomes and metagenome-assembled-genomes are under the Bioproject PRJNA700657 (NCBI). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository [74] with the dataset identifier PXD044305.
References
- 1. McTaggart TL, Benuska G, Shapiro N et al. Draft genome sequences of five new strains of Methylophilaceae isolated from Lake Washington sediment. Genome Announc 2015;3:e01511–4. 10.1128/genomeA.01511-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Beck DA, McTaggart TL, Setboonsarng U et al. The expanded diversity of Methylophilaceae from Lake Washington through cultivation and genomic sequencing of novel ecotypes. PLoS One 2014;9:e102458. 10.1371/journal.pone.0102458 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Kumaresan D, Stephenson J, Doxey AC et al. Aerobic proteobacterial methylotrophs in Movile cave: genomic and metagenomic analyses. Microbiome 2018;6:1. 10.1186/s40168-017-0383-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Chistoserdova L. Methylotrophs in natural habitats: current insights through metagenomics. Appl Microbiol Biotechnol 2015;99:5763–79. 10.1007/s00253-015-6713-z [DOI] [PubMed] [Google Scholar]
- 5. Kalyuzhnaya MG, Bowerman S, Lara JC et al. Methylotenera mobilis gen. nov., sp. nov., an obligately methylamine-utilizing bacterium within the family Methylophilaceae. Int J Syst Evol Microbiol 2006;56:2819–23. 10.1099/ijs.0.64191-0 [DOI] [PubMed] [Google Scholar]
- 6. Kalyuzhnaya MG, Beck DAC, Vorobev A et al. Novel methylotrophic isolates from lake sediment, description of Methylotenera versatilis sp. nov. and emended description of the genus Methylotenera. Int J Syst Evol Microbiol 2012;62:106–11. 10.1099/ijs.0.029165-0 [DOI] [PubMed] [Google Scholar]
- 7. Lv H, Sahin N, Tani A. Methylotenera oryzisoli sp. nov., a lanthanide-dependent methylotrophic bacteria isolated from rice field soil. Int J Syst Evol Microbiol 2020;70:2713–8. 10.1099/ijsem.0.004098 [DOI] [PubMed] [Google Scholar]
- 8. Hernandez ME, Beck DA, Lidstrom ME et al. Oxygen availability is a major factor in determining the composition of microbial communities involved in methane oxidation. PeerJ 2015;3:e801. 10.7717/peerj.801 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Humez P, Mayer B, Nightingale M et al. Redox controls on methane formation, migration and fate in shallow aquifers. Hydrol Earth Syst Sci 2016;20:2759–77. 10.5194/hess-20-2759-2016 [DOI] [Google Scholar]
- 10. Barth-Naftilan E, Sohng J, Saiers JE. Methane in groundwater before, during, and after hydraulic fracturing of the Marcellus shale. Proc Natl Acad Sci USA 2018;115:6970–5. 10.1073/pnas.1720898115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Osborn SG, Vengosh A, Warner NR et al. Methane contamination of drinking water accompanying gas-well drilling and hydraulic fracturing. Proc Natl Acad Sci USA 2011;108:8172–6. 10.1073/pnas.1100682108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Deng Y, Liang C, Zhu X et al. Methylomonadaceae was the active and dominant methanotroph in Tibet lake sediments. ISME Commun 2024;4:ycae032. 10.1093/ismeco/ycae032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. He R, Wang J, Pohlman JW et al. Metabolic flexibility of aerobic methanotrophs under anoxic conditions in Arctic lake sediments. ISME J 2022;16:78–90. 10.1038/s41396-021-01049-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Oshkin IY, Beck DA, Lamb AE et al. Methane-fed microbial microcosms show differential community dynamics and pinpoint taxa involved in communal response. ISME J 2015;9:1119–29. 10.1038/ismej.2014.203 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Kalyuhznaya MG, Martens-Habbena W, Wang T et al. Methylophilaceae link methanol oxidation to denitrification in freshwater lake sediment as suggested by stable isotope probing and pure culture analysis. Environ Microbiol Rep 2009;1:385–92. 10.1111/j.1758-2229.2009.00046.x [DOI] [PubMed] [Google Scholar]
- 16. van Grinsven S, Sinninghe Damsté JS, Harrison J et al. Nitrate promotes the transfer of methane-derived carbon from the methanotroph Methylobacter sp. to the methylotroph Methylotenera sp. in eutrophic lake water. Limnol Oceanogr 2020;66:878–91. [Google Scholar]
- 17. Ruff SE, Humez P, de Angelis IH et al. Hydrogen and dark oxygen drive microbial productivity in diverse groundwater ecosystems. Nat Commun 2023;14:3194. 10.1038/s41467-023-38523-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Li S, Mosier D, Kouris A et al. High diversity, abundance, and expression of hydrogenases in groundwater. ISME Commun 2024;4:ycae023. 10.1093/ismeco/ycae023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Khare D, Jat MK, Mishra PK. Groundwater hydrology: An overview. Sustainable Holistic Water Resources Management in a Changing Climate 4.1–4.26. Jain Brothers, India; 2017. [Google Scholar]
- 20. Lamontagne RA, Swinnerton JW, Linnenbom VJ et al. Methane concentrations in various marine environments. J Geophys Res 1973;78:5317–24. 10.1029/JC078i024p05317 [DOI] [Google Scholar]
- 21. Karl DM, Beversdorf L, Björkman KM et al. Aerobic production of methane in the sea. Nat Geosci 2008;1:473–8. 10.1038/ngeo234 [DOI] [Google Scholar]
- 22. Repeta DJ, Ferrón S, Sosa OA et al. Marine methane paradox explained by bacterial degradation of dissolved organic matter. Nat Geosci 2016;9:884–7. 10.1038/ngeo2837 [DOI] [Google Scholar]
- 23. Chklovski A, Parks DH, Woodcroft BJ et al. CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning. Nat Methods 2023;20:1203–12. 10.1038/s41592-023-01940-w [DOI] [PubMed] [Google Scholar]
- 24. Parks DH, Chuvochina M, Waite DW et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol 2018;36:996–1004. 10.1038/nbt.4229 [DOI] [PubMed] [Google Scholar]
- 25. Jain C, Rodriguez RL, Phillippy AM et al. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 2018;9:5114. 10.1038/s41467-018-07641-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Parks DH, Imelfort M, Skennerton CT et al. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 2015;25:1043–55. 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Woodcroft BJ, Aroney STN, Zhao R et al. SingleM and sandpiper: robust microbial taxonomic profiles from metagenomic data. bioRxiv 2024.01.30.578060. 10.1101/2024.01.30.578060 [DOI] [Google Scholar]
- 28. Dong X, Strous M. An integrated pipeline for annotation and visualization of metagenomic contigs. Front Genet 2019;10:999. 10.3389/fgene.2019.00999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Salcher MM, Schaefle D, Kaspar M et al. Evolution in action: habitat transition from sediment to the pelagial leads to genome streamlining in Methylophilaceae. ISME J 2019;13:2764–77. 10.1038/s41396-019-0471-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Lu X, Heal KR, Ingalls AE et al. Metagenomic and chemical characterization of soil cobalamin production. ISME J 2020;14:53–66. 10.1038/s41396-019-0502-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Duchene S, Holt KE, Weill FX et al. Genome-scale rates of evolutionary change in bacteria. Microb Genom 2016;2:e000094. 10.1099/mgen.0.000094 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Gibson B, Eyre-Walker A. Investigating evolutionary rate variation in bacteria. J Mol Evol 2019;87:317–26. 10.1007/s00239-019-09912-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Eren AM, Kiefl E, Shaiber A et al. Community-led, integrated, reproducible multi-omics with anvi’o. Nat Microbiol 2021;6:3–6. 10.1038/s41564-020-00834-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Price MN, Dehal PS, Arkin AP. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol 2009;26:1641–50. 10.1093/molbev/msp077 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Letunic I, Bork P. Interactive tree of life (iTOL) v6: recent updates to the phylogenetic tree display and annotation tool. Nucleic Acids Res 2024;52:W78–82. 10.1093/nar/gkae268 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 2013;30:772–80. 10.1093/molbev/mst010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 2009;25:1972–3. 10.1093/bioinformatics/btp348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Nguyen LT, Schmidt HA, von Haeseler A et al. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 2015;32:268–74. 10.1093/molbev/msu300 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Makino K, Kim SK, Shinagawa H et al. Molecular analysis of the cryptic and functional phn operons for phosphonate use in Escherichia coli K-12. J Bacteriol 1991;173:2665–72. 10.1128/jb.173.8.2665-2672.1991 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Gilchrist CLM, Chooi YH. Clinker & clustermap.js: automatic generation of gene cluster comparison figures. Bioinformatics 2021;37:2473–5. 10.1093/bioinformatics/btab007 [DOI] [PubMed] [Google Scholar]
- 41. Mirdita M, Schutze K, Moriwaki Y et al. ColabFold: making protein folding accessible to all. Nat Methods 2022;19:679–82. 10.1038/s41592-022-01488-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Steinegger M, Soding J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol 2017;35:1026–8. 10.1038/nbt.3988 [DOI] [PubMed] [Google Scholar]
- 43. Jumper J, Evans R, Pritzel A et al. Highly accurate protein structure prediction with AlphaFold. Nature 2021;596:583–9. 10.1038/s41586-021-03819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Berman HM, Westbrook J, Feng Z et al. The protein data Bank. Nucleic Acids Res 2000;28:235–42. 10.1093/nar/28.1.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. van Kempen M, Kim SS, Tumescheit C et al. Fast and accurate protein structure search with Foldseek. Nat Biotechnol 2024;42:243–6. 10.1038/s41587-023-01773-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Amstrup SK, Ong SC, Sofos N et al. Structural remodelling of the carbon-phosphorus lyase machinery by a dual ABC ATPase. Nat Commun 2023;14:1001. 10.1038/s41467-023-36604-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Born DA, Ulrich EC, Ju KS et al. Structural basis for methylphosphonate biosynthesis. Science 2017;358:1336–9. 10.1126/science.aao3435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods 2015;12:59–60. 10.1038/nmeth.3176 [DOI] [PubMed] [Google Scholar]
- 49. Kleman J, Jansson K, De Angelis H et al. North American ice sheet build-up during the last glacial cycle, 115–21 kyr. Quat Sci Rev 2010;29:2036–51. 10.1016/j.quascirev.2010.04.021 [DOI] [Google Scholar]
- 50. Person M, McIntosh J, Bense V et al. Pleistocene hydrology of North America: the role of ice sheets in reorganizing groundwater flow systems. Rev Geophys 2007;45:RG3007. 10.1029/2006RG000206 [DOI] [Google Scholar]
- 51. Van Rossum T, Ferretti P, Maistrenko OM et al. Diversity within species: interpreting strains in microbiomes. Nat Rev Microbiol 2020;18:491–506. 10.1038/s41579-020-0368-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Buck M, Garcia SL, Fernandez L et al. Comprehensive dataset of shotgun metagenomes from oxygen stratified freshwater lakes and ponds. Sci Data 2021;8:131. 10.1038/s41597-021-00910-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Giovannoni SJ, Cameron Thrash J, Temperton B. Implications of streamlining theory for microbial ecology. ISME J 2014;8:1553–65. 10.1038/ismej.2014.60 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Alvarez-Carreno C, Alva V, Becerra A et al. Structure, function and evolution of the hemerythrin-like domain superfamily. Protein Sci 2018;27:848–60. 10.1002/pro.3374 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Vorholt JA. Cofactor-dependent pathways of formaldehyde oxidation in methylotrophic bacteria. Arch Microbiol 2002;178:239–49. 10.1007/s00203-002-0450-2 [DOI] [PubMed] [Google Scholar]
- 56. Textor S, Wendisch VF, De Graaf AA et al. Propionate oxidation in Escherichia coli: evidence for operation of a methylcitrate cycle in bacteria. Arch Microbiol 1997;168:428–36. 10.1007/s002030050518 [DOI] [PubMed] [Google Scholar]
- 57. Cahill AG, Steelman CM, Forde O et al. Mobility and persistence of methane in groundwater in a controlled-release field experiment. Nat Geosci 2017;10:289–94. 10.1038/ngeo2919 [DOI] [Google Scholar]
- 58. Biderre-Petit C, Taib N, Gardon H et al. New insights into the pelagic microorganisms involved in the methane cycle in the meromictic Lake Pavin through metagenomics. FEMS Microbiol Ecol 2019;95:fiy183. 10.1093/femsec/fiy183 [DOI] [PubMed] [Google Scholar]
- 59. Krause SM, Johnson T, Samadhi Karunaratne Y et al. Lanthanide-dependent cross-feeding of methane-derived carbon is linked by microbial community interactions. Proc Natl Acad Sci USA 2017;114:358–63. 10.1073/pnas.1619871114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Hove-Jensen B, McSorley FR, Zechel DL. Catabolism and detoxification of 1-aminoalkylphosphonic acids: N-acetylation by the phnO gene product. PLoS One 2012;7:e46416. 10.1371/journal.pone.0046416 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Kamat SS, Williams HJ, Dangott LJ et al. The catalytic mechanism for aerobic formation of methane by bacteria. Nature 2013;497:132–6. 10.1038/nature12061 [DOI] [PubMed] [Google Scholar]
- 62. Sosa OA, Repeta DJ, DeLong EF et al. Phosphate-Limited Ocean regions select for bacterial populations enriched in the carbon-phosphorus lyase pathway for phosphonate degradation. Environ Microbiol 2019;21:2402–14. 10.1111/1462-2920.14628 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. von Arx JN, Kidane AT, Philippi M et al. Methylphosphonate-driven methane formation and its link to primary production in the oligotrophic North Atlantic. Nat Commun 2023;14:6529. 10.1038/s41467-023-42304-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Lockwood S, Greening C, Baltar F et al. Global and seasonal variation of marine phosphonate metabolism. ISME J 2022;16:2198–212. 10.1038/s41396-022-01266-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Hove-Jensen B, Zechel DL, Jochimsen B. Utilization of glyphosate as phosphate source: biochemistry and genetics of bacterial carbon-phosphorus lyase. Microbiol Mol Biol Rev 2014;78:176–97. 10.1128/MMBR.00040-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Yong SC, Roversi P, Lillington J et al. A complex iron-calcium cofactor catalyzing phosphotransfer chemistry. Science 2014;345:1170–3. 10.1126/science.1254237 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Clark LL. Marine organic phosphorus cycling; novel insights from nuclear magnetic resonance. Am J Sci 1999;299:724–37. 10.2475/ajs.299.7-9.724 [DOI] [Google Scholar]
- 68. Kolowith LC, Ingall ED, Benner R. Composition and cycling of marine organic phosphorus. Limnol Oceanogr 2001;46:309–20. 10.4319/lo.2001.46.2.0309 [DOI] [Google Scholar]
- 69. Metcalf WW, Griffin BM, Cicchillo RM et al. Synthesis of methylphosphonic acid by marine microbes: a source for methane in the aerobic ocean. Science 2012;337:1104–7. 10.1126/science.1219875 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Hernsdorf AW, Amano Y, Miyakawa K et al. Potential for microbial H2 and metal transformations associated with novel bacteria and archaea in deep terrestrial subsurface sediments. ISME J 2017;11:1915–29. 10.1038/ismej.2017.39 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Poghosyan L, Koch H, Frank J et al. Metagenomic profiling of ammonia- and methane-oxidizing microorganisms in two sequential rapid sand filters. Water Res 2020;185:116288. 10.1016/j.watres.2020.116288 [DOI] [PubMed] [Google Scholar]
- 72. Ezzat L, Peter H, Bourquin M et al. Diversity and biogeography of the bacterial microbiome in glacier-fed streams. Nature 2025;637:622–30. 10.1038/s41586-024-08313-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Chaudhari NM, Perez-Carrascal OM, Overholt WA et al. Genome streamlining in Parcubacteria transitioning from soil to groundwater. Environ Microbiome 2024;19:41. 10.1186/s40793-024-00581-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Perez-Riverol Y, Csordas A, Bai J et al. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res 2019;47:D442–50. 10.1093/nar/gky1106 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Amplicons in this study are under the Bioproject PRJNA861683 and PRJNA700657. Metagenomes and metagenome-assembled-genomes are under the Bioproject PRJNA700657 (NCBI). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository [74] with the dataset identifier PXD044305.





