Abstract
The planetary biosphere is powered by a suite of key metabolic innovations that emerged early in the history of life. However, it is unknown whether life has always followed the same set of strategies for performing these critical tasks. Today, microbes access atmospheric sources of bioessential nitrogen through the activities of just one family of enzymes, nitrogenases. Here, we show that the only dinitrogen reduction mechanism known to date is an ancient feature conserved from nitrogenase ancestors. We designed a paleomolecular engineering approach wherein ancestral nitrogenase genes were phylogenetically reconstructed and inserted into the genome of the diazotrophic bacterial model, Azotobacter vinelandii, enabling an integrated assessment of both in vivo functionality and purified nitrogenase biochemistry. Nitrogenase ancestors are active and robust to variable incorporation of one or more ancestral protein subunits. Further, we find that all ancestors exhibit the reversible enzymatic mechanism for dinitrogen reduction, specifically evidenced by hydrogen inhibition, which is also exhibited by extant A. vinelandii nitrogenase isozymes. Our results suggest that life may have been constrained in its sampling of protein sequence space to catalyze one of the most energetically challenging biochemical reactions in nature. The experimental framework established here is essential for probing how nitrogenase functionality has been shaped within a dynamic, cellular context to sustain a globally consequential metabolism.
Research organism: Azotobacter vinelandii
Introduction
The evolutionary history of life on Earth has generated tremendous ecosystem diversity, the sum of which is orders of magnitude larger than that which exists at present (Jablonski, 2004). Life’s historical diversity provides a measure of its ability to solve adaptive problems within an integrated planetary system. Accessing these solutions requires a deeper understanding of the selective forces that have shaped the evolution of molecular-scale, metabolic innovations. However, the early histories of many of life’s key metabolic pathways and the enzymes that catalyze them remain coarsely resolved.
Important efforts to advance understanding of early metabolic innovations have included phylogenetic inference (Gold et al., 2017; Sánchez-Baracaldo and Cardona, 2020), systems-level network reconstructions (Goldford et al., 2017), and the leveraging of extant or mutant biological models as proxies for their ancient counterparts (Zerkle et al., 2006; Soboh et al., 2010; Hurley et al., 2021). However, methods to directly study these metabolic evolutionary histories across past environmental and cellular transitions remain underexplored. A unified, experimental strategy that integrates historical changes to enzymes, which serve as the primary interface between metabolism and environment, and clarifies their impact within specific cellular and physiochemical contexts is necessary. To address this, phylogenetic reconstructions of enzymes can be directly integrated within laboratory microbial model systems (Garcia and Kaçar, 2019; Kacar et al., 2017b; Kędzior et al., 2022). In this paleomolecular framework, predicted ancestral enzymes can be ‘resurrected’ within a compatible host organism for functional characterization. These experimental systems can ultimately integrate multiple levels of historical analysis by interrogating critical features of ancient enzymes as well as dynamic interactions between enzymes, their broader metabolic networks, and the external environment.
The study of biological nitrogen fixation offers a promising testbed to thread these investigations of early metabolic evolution. Both phylogenetic and geological evidence (Garcia et al., 2020; Raymond et al., 2004; Boyd et al., 2011a; Stüeken et al., 2015; Parsons et al., 2021) indicate that the origin of biological nitrogen fixation was a singular and ancient evolutionary event on which the modern biosphere has since been built. The only known nitrogen fixation pathway (compared to, for example, at least seven carbon-fixation pathways [Garcia et al., 2021a]) is catalyzed by an early-evolved family of metalloenzymes called nitrogenases that reduce highly inert, atmospheric dinitrogen (N2) to bioavailable ammonia (NH3). The nitrogenase family comprises three isozymes that vary in their metal dependence (i.e. molybdenum, vanadium, and iron) and, in certain cases, all coexist within the same host organism (Mus et al., 2018). Many diazotrophs depend on genetic strategies for coordinating the biosynthesis and expression of multiple nitrogenase isozymes and their respective metalloclusters (Pérez-González et al., 2021; Burén et al., 2020), and, in oxic environments, protecting the oxygen-sensitive metalloclusters from degradation (Gallon, 1992). Thus, nitrogenase enzymes are a central component of a broader, co-evolving nitrogen fixation machinery. These features that create significant experimental challenges for nitrogen fixation engineering (Smanski et al., 2014; Bennett et al., 2023; Burén and Rubio, 2018) nevertheless also make this metabolism an ideal candidate for systems-level, paleomolecular study.
How biological nitrogen fixation emerged and evolved under past environmental conditions is still poorly constrained relative to its importance in Earth’s planetary and biological history. Because nitrogen has been a limiting nutrient over geological timescales (Falkowski, 1997; Allen et al., 2019), nitrogenase has long been a key constituent of the expanding Earth’s biosphere. The impact of nitrogen limitation is underscored by human reliance on the industrial Haber-Bosch process, an energetically and environmentally costly workaround for nitrogen fertilizer production (Vicente and Dean, 2017) designed to supplement a remarkable molecular innovation that biology has tinkered with for more than three billion years. How the structural domains and regulatory network of nitrogenase were recruited (Boyd et al., 2015; Mus et al., 2019) and under what selective pressures the metal dependence of nitrogenases was shaped (Garcia et al., 2020; Boyd et al., 2011b) remain open questions. Importantly, it is not known how the enzymatic mechanism for dinitrogen reduction has been tuned by both peptide and metallocluster to achieve one of the most difficult reactions in nature (Seefeldt et al., 2020; Stripp et al., 2022; Harris et al., 2019). At the enzyme level, previous insights into nitrogenase sequence-function relationships have primarily derived from single or dual substitution studies. These have often yielded diminished or abolished nitrogenase activity (Seefeldt et al., 2020; Stripp et al., 2022), though in certain cases improved reactivity toward alternate, industrially relevant substrates (Seefeldt et al., 2020). Despite illuminating key features of extant nitrogenase mechanisms in select model organisms, the combination of detailed functional studies within an explicit evolutionary scheme has not previously been accomplished for the nitrogen fixation system.
Here, we seek guidance from the Earth’s evolutionary past to reconstruct the history of the key metabolic enzyme, nitrogenase. We establish an evolutionary systems biology approach for the cellular- and molecular-level characterization of ancestral nitrogenases resurrected within the model diazotrophic bacterium, A. vinelandii. We find that variably replacing different protein subunits of the nitrogenase complex with inferred ancestral counterparts enables nitrogen fixation in A. vinelandii. Purified ancestral enzymes exhibit the specific N2 reduction mechanism retained by their studied, extant counterparts, and maintain the same catalytic selectivity between N2 and protons. Thus, the core strategy for biological nitrogen fixation is conserved across the investigated timeline. Our paleomolecular approach opens a new route to study the ancient functionality and evolution of nitrogenases both deeper in its ancestry and within the broader context of its supporting, cellular machinery.
Results
A resurrection strategy for ancestral nitrogenases
We designed an engineering pipeline for the resurrection and experimental characterization of ancestral nitrogenases (Figure 1A). In this scheme, phylogenetically inferred, ancestral nitrogenase genes are synthesized and engineered into the genome of a modern diazotrophic bacterium, enabling the assessment of in vivo nitrogenase activity and expression in parallel with biochemical analysis of the purified enzyme. The engineering and functional assessment of ancestral nitrogenases required a suitable diazotrophic microbial host, owing to challenges associated with nitrogenase heterologous expression (Vicente and Dean, 2017). We selected the obligately aerobic gammaproteobacterium, A. vinelandii (strain ‘DJ’), an ideal experimental model due to its genetic tractability and the availability of detailed studies on the genetics and biochemistry of its nitrogen fixation machinery (Noar and Bruno-Bárcena, 2018). We specifically targeted the extant A. vinelandii molybdenum-dependent (‘Mo-;) nitrogenase (hereafter referred to as wild-type, ‘WT’), which is the best-studied isozyme (Seefeldt et al., 2020) relative to the A. vinelandii vanadium (‘V-’) and iron (‘Fe-’) dependent nitrogenases. The WT Mo-nitrogenase complex comprises multiple subunits, NifH, NifD, and NifK (encoded by nifHDK genes), which are arranged into two catalytic components: a NifH homodimer and a NifDK heterotetramer (Figure 1B). During catalysis, both components transiently associate to transfer one electron from NifH to NifDK and subsequently dissociate. Transferred electrons accumulate at the active-site Mo-containing metallocluster (‘FeMoco’) housed within the NifD subunits for reduction of the N2 substrate to NH3.
Figure 1. Engineering strategy for ancestral nitrogenase resurrection.
(A) Experimental pipeline for nitrogenase resurrection in A. vinelandii and subsequent characterization, as described in the main text. (B) Structural overview of ancestral nitrogenases reconstructed in this study. Homology models (template PDB 1M34) of Anc1B NifH and NifDK proteins are shown with ancestral substitutions (relative to WT) highlighted in red. Select substitutions at relatively conserved sites in proximity to FeMoco (NifD, I355V) and the NifD:NifK interface (NifD, F429Y; NifK, R108K) are displayed in the insets. (C) Parallel genome engineering strategies were executed in this study, involving both ancestral replacement of only nifD (Anc1A and Anc2) and replacement of nifHDK (Anc1B). ‘PnifH’: nifH promoter, ‘KanR’: kanamycin resistance cassette. *Anc1A and Anc1B were each reconstructed from equivalent nodes of alternate phylogenies (see Materials and methods).
To infer Mo-nitrogenase ancestors, we built a maximum-likelihood nitrogenase phylogeny from a concatenated alignment of NifHDK amino acid sequences (Figure 2A; Figure 2—figure supplement 1). The phylogeny contains 385 sets of homologs representative of known nitrogenase molecular sequence diversity (including Mo-, V-, and Fe-nitrogenases), and is rooted by dark-operative protochlorophyllide oxidoreductase proteins classified within the nitrogenase superfamily (Ghebreamlak and Mansoorabadi, 2020). For this study, we selected ancestors that fall within the direct evolutionary lineage of A. vinelandii WT (Figure 2B), ‘Anc1’ and ‘Anc2’ (listed in order of increasing age), having ~90% and~85% amino acid sequence identity to WT across the full length of their concatenated NifHDK proteins, respectively (Figure 2C; Supplementary file 1a). A relatively conservative percentage identity threshold was chosen based on prior studies benchmarking the functional expression of ancestral elongation factor proteins in Escherichia coli (Kacar et al., 2017a) and Synechococcus elongatus (Kędzior et al., 2022). The high-dimensional, nitrogenase protein sequence space occupied by both extant and ancestral homologs is visualized in two dimensions in Figure 2D by machine-learning embeddings (see Materials and methods). This analysis highlights the swath of sequence space targeted here, as well as that made accessible by the resurrection of nitrogenase ancestors more broadly. WT, Anc1, and Anc2 lie within a Mo-nitrogenase clade (previously termed ‘Group I’ [Raymond et al., 2004]) that contains homologs from diverse aerobic and facultatively anaerobic taxa, including proteobacteria and cyanobacteria (Figure 2A). A maximum age constraint of ~2.5 Ga for Group I nitrogenases (and thus for both Anc1 and Anc2) can be reasoned based on the timing of the Great Oxidation Event (Lyons et al., 2014) and downstream emergence of aerobic taxa represented nearly exclusively within this clade.
Figure 2. Phylogenetic and genomic context of resurrected ancestral nitrogenases.
(A) Maximum-likelihood phylogenetic tree from which ancestral nitrogenases were inferred. Extant nodes are colored by microbial host taxonomic diversity. Red box highlights the clade targeted in this study and depicted in (B). Tree shown was used to infer Anc1A and Anc2 sequences (an alternate tree was used for Anc1B inference; see Materials and methods). (B) nif gene cluster complexity within the targeted nitrogenase clade. Presence and absence of each nif gene are indicated by gray and white colors, respectively. Because some homologs for phylogenetic analysis were obtained from organisms lacking fully assembled genomes, the absence of accessory nif genes may result from missing genomic information. (C) Amino acid sequence identity matrix of nitrogenase protein subunits harbored by WT and engineered A. vinelandii strains. (D) Extant and ancestral nitrogenase protein sequence space visualized by machine-learning embeddings, with the resulting dimensionality reduced to two-dimensional space. UMAP dimension axes are in arbitrary units. The field demarcated by dashed lines in the left plot is expanded on the right plot.
Figure 2—figure supplement 1. Maximum-likelihood phylogenies built from nitrogenase NifHDK homologs.
Figure 2—figure supplement 3. WT and ancestral NifD alignment with sitewise ConSurf conservation scores (see Materials and Methods).
Residue-level differences between ancestral and WT nitrogenases (‘ancestral substitutions’) are broadly distributed along the length of each ancestral sequence (Figure 1B; Figure 2—figure supplement 2). An ancestral substitution proximal to the active-site FeMoco metallocluster lies within a loop considered important for FeMoco insertion (Dos Santos et al., 2004) (NifD I355V; residue numbering from WT) and is observed across all targeted NifD ancestors (Figure 1B). Other ancestral substitutions are notable for their location at relatively conserved residue sites (assessed by ConSurf Ashkenazy et al., 2016; Figure 2—figure supplement 3; see Materials and methods) and/or within subunit interfaces, including two at the NifD:NifK interface that are proximal to one another, F429Y (NifD) and R108K (NifK). The Anc2 NifD protein contains five more ancestral substitutions at conserved sites than the younger Anc1 NifD protein. In all studied ancestors, the C275 and H442 FeMoco ligands, as well as other strictly conserved nitrogenase residues, are retained.
Phylogenetic analysis informs the compatibility of selected ancestors in extant microbial hosts. Extant nitrogenases within the Group I nitrogenase clade (which include Anc1 and Anc2 descendants) are associated with numerous accessory genes likely recruited to optimize the synthesis and regulation of the oxygen-sensitive nitrogenase for aerobic or facultative metabolisms (Boyd et al., 2015). For example, in addition to the structural nifHDK genes, A. vinelandii WT is assembled and regulated with the help of >15 additional nif genes. Likewise, the extant descendants of Anc1 and Anc2 are primarily aerobic or facultative proteobacteria and are thus associated with higher complexity nif gene clusters (Figure 2B). We hypothesized that the likely oxygen-tolerant, ancient proteobacterium harboring these ancestral nitrogenases were similar in nif cluster complexity to extant A. vinelandii. Thus, we predicted that the nif accessory genes present in A. vinelandii would support the functional expression of resurrected nitrogenase ancestors.
To gauge the degree of compatibility between ancestral and extant nitrogenase proteins and maximize the chance of recovering functional nitrogenase variants, we executed two parallel genome engineering strategies. First, we constructed A. vinelandii strains harboring only the ancestral nifD gene from both targeted ancestral nodes (‘Anc1A’ and ‘Anc2’), thereby expressing ‘hybrid ancestral-WT’ nitrogenase complexes (Figure 1C). This strategy is similar to in vitro ‘cross-reaction’ studies that have evaluated the compatibility of nitrogenase protein components from differing host taxa (Smith et al., 1976). Second, we constructed a strain harboring all Anc1 nifHDK genes, expressing a fully ancestral nitrogenase complex (‘Anc1B’; sequence reconstructed from a node equivalent to Anc1A from an alternate phylogeny, see Materials and methods). A. vinelandii strains were constructed by markerless genomic integration of ancestral nitrogenase genes, as described in Materials and methods.
Ancestral nitrogenases enable diazotrophic microbial growth
All A. vinelandii constructs harboring ancestral genes enabled diazotrophic growth in molybdenum-containing, nitrogen-free media. All strains had comparable doubling times to WT during the exponential phase (p>0.05; Figure 3A and B). The only significant difference among strains was a~14 hr increase in the lag phase of strain Anc2 relative to WT, harboring the oldest nitrogenase ancestor (p ≈ 2e-7). We did not detect growth under the same conditions for a control ΔnifD strain (DJ2278, see Supplementary file 1c). This result confirmed that the growth observed for ancestral strains did not stem from leaky expression of the alternative, V- or Fe-dependent nitrogen fixation genes in A. vinelandii, which were left intact.
Figure 3. Cellular-level characterization of ancestral nitrogenase activity and expression.
(A) Diazotrophic growth curves of A. vinelandii strains measured by the optical density at 600 nm (‘OD600’). A smoothed curve is shown alongside individual data points obtained from five biological replicates per strain. The non-diazotrophic DJ2278 (ΔnifD) strain was used as a negative control. (B) Mean doubling and midpoint times of A. vinelandii strains, calculated from data in (A). (C) In vivo acetylene (C2H2) reduction rates quantified by the production of ethylene (C2H4). Bars represent the mean of biological replicates (n=3) per strain. (D) Immunodetection and protein quantification of Strep-II-tagged WT (‘WT-Strep,’ strain DJ2102) and ancestral NifD. Top gel image shows Strep-II-tagged NifD proteins detected by anti-Strep antibody and bottom gel image shows total protein stain. Plot displays relative immunodetected NifD signal intensity normalized to total protein intensity and expressed relative to WT. Bars in the plot represent the mean of biological replicates (n=3) per strain. (B–D) Error bars indicate ±1 SD and asterisks indicate p<.01 (one-way ANOVA, post-hoc Tukey HSD) compared to WT or WT-Strep.
An acetylene reduction assay was performed to measure cellular nitrogenase activity in engineered strains. This assay quantifies the reduction rate of the non-physiological substrate acetylene (C2H2) to ethylene (C2H4) (Hardy et al., 1968), here normalized to total protein content. A. vinelandii strains harboring only ancestral nifD (Anc1A, Anc2) exhibited mean C2H2 reduction rates of ~5–6 μmol C2H4/mg total protein/hr,~40–45% that of WT (p ≈ 6e-4 and p ≈ 3e-4, respectively) (Figure 3C). Strain Anc1B, harboring ancestral nifHDK, exhibited a mean acetylene reduction rate of ~9 μmol C2H4/mg total protein/hr,~70% that of WT (p ≈ 3e-2).
The phenotypic variability we observed among engineered and WT A. vinelandii strains might result both from differences in nitrogenase expression and nitrogenase activity. To provide insights into these disparate effects, we quantified nitrogenase protein expression in engineered strains by immunodetection of ancestral and WT Strep-tagged NifD proteins (the latter from strain DJ2102, see Supplementary file 1c) and did not conclusively detect significant differences in protein quantity relative to WT (p>0.05; Figure 3D).
Purified ancestral nitrogenases conserve extant N2 reduction mechanisms and efficiency
Ancestral nitrogenase NifDK protein components were expressed and purified for biochemical characterization. All ancestral NifDK proteins were assayed with WT NifH protein (ancestral NifH proteins were not purified) for reduction of H+, N2, and C2H2. Ancestors were found to reduce all three substrates in vitro, supporting the cellular-level evidence of ancestral nitrogenase activity (Figure 4A).
Figure 4. In vitro analyses of ancestral nitrogenase activity profiles and mechanism.
All measurements were obtained from assays using purified NifDK assayed with WT NifH. (A) Specific activities were measured for H+, N2, and C2H2 substrates. (B) Partial schematic of the reductive-elimination N2-reduction mechanism of nitrogenase is shown above, centering on the N2-binding E4(4 H) state of FeMoco (see main text for discussion) (Harris et al., 2019). (C) Inhibition of N2 reduction by H2, evidencing the mechanism illustrated in (B). (D) Catalytic efficiencies of ancestral nitrogenases, described by the ratio of formed H2 to reduced N2 (H2/N2), mapped across the targeted phylogenetic clade. NifD homology models (PDB 1M34 template) are displayed with ancestral substitutions highlighted in red. (A,C) Bars represent the mean of independent experiments (n=2) with individual data points shown as black circles.

Figure 4—figure supplement 1. SDS-PAGE of purified WT and ancestral NifDK proteins.

We investigated whether the ancestral nitrogenases studied here would exhibit the general mechanism for N2 binding and reduction that has been observed for the studied, extant nitrogenase isozymes of A. vinelandii (Mo, V, and Fe) (Harris et al., 2019; Harris et al., 2022). This mechanism involves the accumulation of four electrons/protons on the active-site cofactor as metal-bound hydrides, generating the E4(4 H) state (Figure 4B). Once generated, N2 can bind to the E4(4 H) state through a reversible reductive elimination/oxidative addition (re/oa) mechanism, which results in the release (re) of a single molecule of hydrogen gas (H2). N2 binding is reversible in the presence of sufficient H2, which displaces bound N2 and results in the reformation of E4(4 H) with two hydrides (oa). Thus, a classic test of the (re/oa) mechanism is the ability of H2 to inhibit N2 reduction. We observed that the reduction of N2 to NH3 for all nitrogenase ancestors was inhibited in the presence of H2, indicating that the ancestors follow the same mechanism of N2 binding determined for extant enzymes (Figure 4C).
In the event the E4(4 H) state fails to capture N2, nitrogenases will simply produce H2 from the E4(4 H) state to generate the E2(2 H) state. The ratio of H2 formed to N2 reduced (H2/N2) can be used as a measure of the efficiency of nitrogenases in using ATP and reducing equivalents for N2 reduction. The stoichiometric minimum of the mechanism is H2/N2=1. Experimentally (under 1 atm N2), a ratio of ~2 is seen for Mo-nitrogenase and ~5 and~7 for V- and Fe-nitrogenase, respectively (Harris et al., 2019). H2/N2 values for all ancestors under 1 atm N2 was ~2, similar to extant Mo-nitrogenase (Figure 4D).
Discussion
In this study, we leverage a new approach to investigate ancient nitrogen fixation by the resurrection and functional assessment of ancestral nitrogenase enzymes. We demonstrate that engineered A. vinelandii cells can reduce N2 and C2H2 and exhibit diazotrophic growth rates comparable to WT, though we observe that the oldest ancestor, Anc2, has a significantly longer lag phase. Purified nitrogenase ancestors are active for the reduction of H+, N2, and C2H2, while maintaining the catalytic efficiency (described by the H2/N2 ratio) of WT enzymes. Our results also show that ancestral N2 reduction is inhibited by H2, indicating an early emergence of the reductive-elimination N2-reduction mechanism preserved by characterized, extant nitrogenases (Harris et al., 2019; Harris et al., 2022). These properties are maintained despite substantial residue-level changes to the peripheral nitrogenase structure (including relatively conserved sites), as well as a handful within the active-site or protein-interface regions within the enzyme complex.
It is important to consider that the nitrogenase ancestors resurrected here represent hypotheses regarding the true ancestral state. Uncertainty underlying ancestral reconstructions might derive from incomplete extant molecular sequence data, as well as incorrect assumptions associated with the implemented evolutionary models (Garcia and Kaçar, 2019). For example, a complicating feature of nitrogenase evolution is that it has been shaped significantly by horizontal gene transfer (Raymond et al., 2004; Parsons et al., 2021), which in certain cases has led to different evolutionary trajectories of individual nitrogenase structural genes. Specifically, certain H-subunit genes of the V-nitrogenase appear to have different evolutionary histories relative to their DK components (Raymond et al., 2004). However, since our study targets a Mo-nitrogenase lineage, we do not expect horizontal transfer to be a significant source of uncertainty in our reconstructions.
The N2-reduction activity of nitrogenase ancestors suggests that the required protein-protein interactions—both between subunits that comprise the nitrogenase complex as well as those required for nitrogenase assembly in A. vinelandii—and metallocluster interactions are sufficiently maintained for primary function. Still, our results reveal the degree to which the organism-level phenotype of host strains can be perturbed by varying both the number and age of ancestral subunits. Importantly, these changes appear to impact phenotypic properties in complex ways, representative of the type of cellular constraints on nitrogenase evolution that would be unobservable through an in vitro study alone. For example, we observed comparable growth characteristics of strains harboring single (Anc1A, ancestral NifD) versus multiple (Anc1B, ancestral NifHDK) ancestral subunits of equivalent age, whereas the lag phase of Anc2 hosting a single, older subunit (ancestral NifD) was increased. Here, growth is sensitive to older, ancestral substitutions in a single subunit while permissive of more recent ancestral substitutions across one or more subunits within the nitrogenase complex. However, a different pattern is observed across in vivo acetylene reduction rates. These are most negatively impacted relative to WT in strains with a single NifD ancestor (Anc1A, Anc2), whereas rates are more modestly decreased in a strain with a complete, ancestral NifHDK complex (Anc1A). These results suggest that ancestral subunits of equivalent age have greater compatibility and yield greater in vivo activity compared to subunits of disparate ages, perhaps owing to modified protein interactions within the nitrogenase complexes The discrepancy between in vivo activity and growth characteristics may also be attributable to impacted cellular processes external to the biochemical properties of the nitrogenase complex itself, and yet nevertheless vital in determining the overall fitness of the host organism. Finally, though we do not detect significant differences in ancestral protein expression here, it is possible that phenotypic outcomes of future reconstructions might be impacted by perturbed expression levels (e.g. Kędzior et al., 2022; Garcia et al., 2021b). To what degree these expression levels are representative of the ancestral state and impact the phenotypic property of interest should be considered in future work.
That nitrogenase ancestors perform the reductive-elimination N2-reduction mechanism—as the distantly related (Garcia et al., 2020), extant Mo-, V-, and Fe-nitrogenases of A. vinelandii do today (Harris et al., 2019)—likely indicates that this enzymatic characteristic was set early in nitrogenase evolutionary history and sustained through significant past environmental change (Lyons et al., 2014; Som et al., 2016; Catling and Zahnle, 2020) and ecological diversification (Boyd et al., 2015; Zehr et al., 2003). It is possible that life’s available strategies for achieving N2 reduction may be fundamentally limited, and that a defining constraint of nitrogenase evolution has been the preservation of the same N2 reduction mechanism across shifting selective pressures. For example, in the acquisition of V- and Fe-dependence from Mo-dependent ancestors (Garcia et al., 2020), nitrogenases may have required substantial sequence and structural changes (Sippel and Einsle, 2017; Eady, 1996) in order to facilitate reductive elimination given a different active-site metallocluster. It is also possible that alternate strategies for biological nitrogen fixation evolved early in the history of life and were subsequently outcompeted, leaving no trace of their existence in extant microbial genomes. Why these alternate possibilities were evidently not explored by nature to the same degree remains an open question, particularly given the several abiotic mechanisms for nitrogen fixation (Cherkasov et al., 2015; Dörr et al., 2003; Yung and McElroy, 1979) and the multiple biological pathways for another, globally significant metabolism, carbon fixation (Garcia et al., 2021a). Because our paleomolecular approach is ultimately informed by extant sequence data, it cannot directly evaluate extinct sequences that, for instance, due to contingency or entrenchment, did not persist and become preserved in extant microbial genomes. Nevertheless, evolutionarily informed studies of nitrogenase functionality that define the sequence-function space of this enzyme family will provide a foundation for laboratory efforts aimed toward exploring alternate scenarios. Future work that explores deeper into nitrogenase evolutionary history (and across extant and ancestral nitrogenase sequence space, as charted here (Figure 2D)) will clarify the degree of functional constraint exhibited by the nitrogenase family, both past and present.
Conclusion
Broadening the historical level of analysis beyond a single enzyme to the organism level is necessary to generate comprehensive insights into the evolutionary history and engineering potential of nitrogen fixation. Paleomolecular work that has expanded toward the systems-level investigation of early-evolved, crucial metabolic pathways remains in its infancy, despite the potential for provocative connections between molecular-scale innovations and planetary history (Garcia and Kaçar, 2019; Kędzior et al., 2022). Our results highlight the evolutionary conservation of a critical metabolic pathway that has shaped the biosphere over billions of years, as well as establish the tractability of leveraging phylogenetic models to carry out extensive, empirical manipulations of challenging enzymatic systems and their microbial hosts. Building on the empirical framework presented here will illuminate the evolutionary design principles behind ancient metabolic systems more broadly as well as leverage these histories to understand how key enzymes that allowed organisms to access nitrogen from the atmosphere evolved.
Materials and methods
Key resources table.
| Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information |
|---|---|---|---|---|
| strain, strain background (A. vinelandii) | DJ | DOI:10.1128/JB.00504–09 | n/a | Dennis Dean, Virginia Tech; Wild-type (WT); Nif+ |
| genetic reagent (A. vinelandii) | DJ2102 | DOI:10.1016/bs.mie.2018.10.007 | n/a | Dennis Dean, Virginia Tech; Strep-tagged WT NifD; Nif+ |
| genetic reagent (A. vinelandii) | DJ2278 | Other | n/a | Dennis Dean, Virginia Tech; ΔnifD::KanR; Nif- |
| genetic reagent (A. vinelandii) | DJ884 | Other | n/a | Dennis Dean, Virginia Tech; nifDR187I mutant; Nif+(slow); overexpresses NifH |
| genetic reagent (A. vinelandii) | AK022 | This paper | n/a | ΔnifHDK::KanR; Nif- |
| genetic reagent (A. vinelandii) | AK013 | This paper | n/a | ‘Anc1A’; ΔnifD::nifDAnc1A; Nif+ |
| genetic reagent (A. vinelandii) | AK023 | This paper | n/a | ‘Anc1B’; ΔnifHDK::nifHDKAnc1B; Nif+ |
| genetic reagent (A. vinelandii) | AK014 | This paper | n/a | ‘Anc2’; ΔnifD::nifDAnc2; Nif+ |
| antibody | StrepMAB-Classic (Mouse monoclonal) | IBA Lifesciences | Cat# 2-1507-001, RRID: AB_513133 | WB (1:5000) |
| recombinant DNA reagent | pAG25 | This paper | n/a | KanR cassette (APH(3’)-I gene)+400 bp nifHDK flanking homology regions, synthesized into XbaI/KpnI sites in pUC19; used to construct strain AK022 from DJ |
| recombinant DNA reagent | pAG13 | This paper | n/a | nifDAnc1A + 400-bp nifD flanking homology regions, synthesized into XbaI/KpnI sites in pUC19; used to construct strain Anc1A from AK022 |
| recombinant DNA reagent | pAG19 | This paper | n/a | nifHDKAnc1B + 400-bp nifHDK flanking homology regions, synthesized into XbaI/KpnI sites in pUC19; used to construct strain Anc1B from AK022 |
| recombinant DNA reagent | pAG14 | This paper | n/a | nifDAnc2 +400 bp nifD flanking homology regions, synthesized into XbaI/KpnI sites in pUC19; used to construct strain Anc2 from AK022 |
| sequence-based reagent | 306_nifH_F | This paper | PCR primers | GCCGAACGTTCAAGTGGAAA |
| sequence-based reagent | 307_nifH_R | This paper | PCR primers | AGAGCCAATCTGCCCTGTC |
| sequence-based reagent | 308_nifD_F | This paper | PCR primers | CACCCGTTACCCGCATATGA |
| sequence-based reagent | 309_nifD_R | This paper | PCR primers | ACTCATCTGTGAACGGCGTT |
| sequence-based reagent | 310_nifK_F | This paper | PCR primers | GCTAACGCCGTTCACAGATG |
| sequence-based reagent | 311_nifK_R | This paper | PCR primers | TCAGTTGGCCTTCGTCGTTG |
| software, algorithm | MAFFT | MAFFT | RRID:SCR_011811 | |
| software, algorithm | trimAl | trimAl | RRID:SCR_017334 | |
| software, algorithm | IQ-TREE | IQ-TREE | RRID:SCR_017254 | |
| software, algorithm | RAxML | RAxML | RRID:SCR_006086 | |
| software, algorithm | PAML | PAML | RRID:SCR_014932 | |
| software, algorithm | MODELLER | MODELLER | RRID:SCR_008395 | |
| software, algorithm | ChimeraX | ChimeraX | RRID:SCR_015872 | |
| software, algorithm | Growthcurver | Growthcurver | n/a | R package |
Nitrogenase ancestral sequence reconstruction and selection
The nitrogenase protein sequence dataset was assembled by BLASTp (Camacho et al., 2009) search of the NCBI non-redundant protein database (accessed August 2020) with A. vinelandii NifH (WP_012698831.1), NifD (WP_012698832.1), and NifK (WP_012698833.1) queries and a 1e-5 Expect value threshold (Supplementary file 1b). BLASTp hits were manually curated to remove partially sequenced, misannotated, and taxonomically overrepresented homologs. BLASTp hits included protein sequences from homologous Mo-, V-, and Fe-nitrogenase isozymes (Garcia et al., 2020). H-, D, and K-subunit sequences from these isozymes were individually aligned by MAFFT v7.450 (Katoh and Standley, 2013) and concatenated along with outgroup dark-operative protochlorophyllide oxidoreductase sequences (Bch/ChlLNB). The final dataset included 385 nitrogenase sequences and 385 outgroup sequences. For sequences used to construct Anc1A and Anc2 (internal nodes #960 and #929, respectively), tree reconstruction (using a trimmed alignment generated by trimAl v1.2 [Capella-Gutiérrez et al., 2009]), and ancestral sequence inference (using the initial untrimmed alignment) were both performed by RAxML v8.2.10 (Stamatakis, 2014) with the LG +G + F evolutionary model (model testing performed by the ModelFinder Kalyaanamoorthy et al., 2017 in the IQ-TREE v.1.6.12 package [Nguyen et al., 2015]).
Due to concerns that RAxML v.8.2 does not implement full, marginal ancestral sequence reconstruction as described by Yang et al., 1995, we performed a second phylogenetic analysis as follows. The extant sequence dataset described above was realigned by MAFFT (untrimmed) and tree reconstruction was again performed by RAxML. Ancestral sequence reconstruction was instead performed by PAML v4.9j (Yang, 2007) using the LG +G + F model. From this second reconstruction, Anc1B (internal node #1312), equivalent to Anc1A, was selected for experimental analysis. Anc1B and Anc1A have identical sets of descendent homologs, and their NifD proteins are 95% identical (Figure 2B).
Only the ancestral sequences inferred with the most probable residue at each protein site were considered for this study (mean posterior probabilities of targeted nitrogenase subunits range from 0.95 to 0.99; see Supplementary file 1a). All ancestral sequences were reconstructed from well-supported clades (SH-like aLRT = 99–100 [Anisimova and Gascuel, 2006]).
Ancestral nitrogenase structural modeling and sequence analysis
Structural homology models of ancestral sequences were generated by MODELLER v10.2 (Webb and Sali, 2016) using PDB 1M34 as a template for all nitrogenase protein subunits and visualized by ChimeraX v1.3 (Pettersen et al., 2021).
Extant and ancestral protein sequence space was visualized by machine-learning embeddings, where each protein embedding represents protein features in a fixed-size, multidimensional vector space. The analysis was conducted on concatenated (HDK) nitrogenase protein sequences in our phylogenetic dataset. The embeddings were obtained using the pre-trained language model ESM2 (Lin et al., 2022; Rives et al., 2021), a transformer architecture trained to reproduce correlations at the sequence level in a dataset containing hundreds of millions of protein sequences. Layer 33 of this transformer was used, as recommended by the authors. The resulting 1024 dimensions were reduced by UMAP (McInnes et al., 2020) for visualization in a two-dimensional space.
Protein site-wise conservation analysis was performed using the Consurf server (Ashkenazy et al., 2016). An input alignment containing only extant, Group I Mo-nitrogenases was submitted for analysis under default parameters. Conserved sites were defined by a Consurf conservation score >7.
A. vinelandii strain engineering
Nucleotide sequences of targeted ancestral nitrogenase proteins were codon-optimized for A. vinelandii by a semi-randomized strategy that maximized ancestral nucleotide sequence identity to WT genes. Ancestral and WT protein sequences were compared using the alignment output of ancestral sequence reconstruction (RAxML or PAML). For sites where the ancestral and WT residues were identical, the WT codon was assigned. At sites where the residues were different, the codon was assigned randomly, weighted by A. vinelandii codon frequencies (Codon Usage Database, https://www.kazusa.or.jp/codon/). Nucleotide sequences were synthesized into XbaI/KpnI sites of pUC19 vectors (unable to replicate in A. vinelandii) (Twist Bioscience; GenScript). Inserts were designed with 400-base-pair flanking regions for homology-directed recombination at the relevant A. vinelandii nif locus. An ‘ASWSHPQFEK’ Strep-II-tag was included at the N-terminus of each synthetic nifD gene for downstream NifD immunodetection and NifDK affinity purification. See Supplementary file 1c for a list of strains and plasmids used in this study.
Engineering of A. vinelandii strains used established methods, following Dos Santos, 2019. A. vinelandii WT (‘DJ’), DJ2278 (ΔnifD::KanR), DJ2102 (Strep-II-tagged WT NifD), and DJ884 (NifH-overexpression mutant) strains were generously provided by Dennis Dean (Virginia Tech) (Supplementary file 1c). Strains Anc1A and Anc2 were constructed from the DJ2278 parent strain via transformation with plasmids pAG13 and pAG19, respectively (Supplementary file 1c). For the construction of strain Anc2, we first generated a ΔnifHDK strain, AK022, by transforming the DJ strain with pAG25. Genetic competency was induced by subculturing relevant parent strains in Mo- and Fe-free Burk’s medium (see below). Competent cells were transformed with at least 1 μg of donor plasmid. Transformants were screened on solid Burk’s medium for the rescue of the diazotrophic phenotype (‘Nif+’) and loss of kanamycin resistance, followed by Sanger sequencing of the PCR-amplified nifHDK cluster (see Supplementary file 1d for a list of primers). Transformants were passaged at least three times to ensure phenotypic stability prior to storage at –80 °C in phosphate buffer containing 7% DMSO.
A. vinelandii culturing and growth analysis
A. vinelandii strains were grown diazotrophically in nitrogen-free Burk’s medium (containing 1 μM Na2MoO4) at 30 °C and agitated at 300 rpm. To induce genetic competency for transformation experiments, Mo and Fe salts were excluded. For transformant screening, kanamycin antibiotic was added to solid Burk’s medium at a final concentration of 0.6 μg/mL. 50 mL seed cultures for growth rate and acetylene reduction rate quantification were grown non-diazotrophically in flasks with Burk’s medium containing 13 mM ammonium acetate.
For growth rate quantification, seed cultures were inoculated into 100 mL nitrogen-free Burk’s medium to an optical density of ~0.01 at 600 nm (OD600), after Carruthers et al., 2021, and monitored for 72 hr. Growth parameters were modeled using the R package Growthcurver (Sprouffske and Wagner, 2016).
Microbial acetylene reduction assays
A. vinelandii seed cultures representing independent biological replicates were prepared as described above and used to inoculate 100 mL of nitrogen-free Burk’s medium to an OD600 ≈ 0.01. Cells were grown diazotrophically to an OD600 ≈ 0.5, at which point a rubber septum cap was affixed to the mouth of each flask. 25 mL of headspace was removed and replaced by injecting an equivalent volume of acetylene gas. The cultures were subsequently shaken at 30 °C and agitated at 300 rpm. Headspace samples were taken after 15, 30, 45, and 60 min of incubation for ethylene quantification by a Nexis GC-2030 gas chromatograph (Shimadzu). After the 60 min incubation period, cells were pelleted at 4700 rpm for 10 min, washed once with 4 mL of phosphate buffer, and pelleted once more under the same conditions prior to storage at –80 °C. Total protein was quantified using the Quick Start Bradford Protein Assay kit (Bio-Rad) according to manufacturer instructions and a CLARIOstar Plus plate reader (BMG Labtech). Acetylene reduction rates for each replicate were normalized to total protein.
Nitrogenase expression analysis
Strep-II-tagged NifD protein quantification was performed on all ancestral strains (Anc1A, Anc1B, Anc2) and DJ2102 (harboring Strep-II-tagged WT NifD). Diazotrophic A. vinelandii cultures (100 mL) representing three independent biological replicates were prepared as described above and harvested at an OD600 ≈ 1. Cell pellets were resuspended in TE lysis buffer (10 mM Tris, 1 mM EDTA, 1 mg/mL lysozyme) and heated at 95 °C for 10 min. Cell lysates were centrifuged at 5000 rpm for 15 min. Total protein in the resulting supernatant was quantified using the Pierce BCA Protein Assay kit (ThermoFisher) following manufacturer instructions. Normalized protein samples were diluted in 2×Laemmli buffer at a 1:1 (v/v) ratio prior to SDS-PAGE analysis. Proteins were transferred to nitrocellulose membranes (ThermoFisher), stained with Revert 700 Total Protein Stain (LI-COR), and imaged on an Odyssey Fc Imager (LI-COR). Membranes were then destained with Revert Destaining Solution (LI-COR) and blocked with 5% non-fat milk in PBS solution (137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 1.8 mM KH2PO4.) for 1 hr at room temperature. Membranes were rinsed once with PBS-T (PBS with 0.01% Tween-20) and incubated with primary Strep-tag II antibody (Strep-MAB-Classic, IBA Lifesciences, Cat# 2-1507-001, RRID: AB_513133; 1:5000 in 0.2% BSA) for 2 hr at room temperature. Membranes were then incubated in LI-COR blocking buffer containing 1:15,000 IRDye 680RD Goat anti-Mouse (LI-COR) for 2 hr at room temperature and subsequently imaged with an Odyssey Fc Imager (LI-COR). Densitometry analysis was performed with ImageJ (Schneider et al., 2012), with Strep-II-tagged NifD signal intensity normalized to that of the total protein stain.
Nitrogenase expression, purification, and biochemical characterization
Ancestral nitrogenase NifDK proteins were expressed from relevant A. vinelandii strains (Anc1A, Anc1B, Anc2) and purified according to previously published methods (Jiménez-Vicente et al., 2018) with the following modifications: cells were grown diazotrophically in nitrogen-free Burk’s medium and no derepression step to a sufficient OD600 (~1.8) before harvesting. WT NifH was expressed in A. vinelandii strain DJ884 and purified by previously published methods (Christiansen et al., 1998). Protein purity was assessed at ≥95% by SDS-PAGE gel with Coomassie blue staining (Figure 4—figure supplement 1).
Assays were performed in 9.4 mL vials with a MgATP regeneration buffer (6.7 mM MgCl2, 30 mM phosphocreatine, 5 mM ATP, 0.2 mg/mL creatine phosphokinase, 1.2 mg/mL BSA) and 10 mM sodium dithionite in 100 mM MOPS buffer at pH 7.0. Reaction vials were made anaerobic and relevant gases (N2, C2H2, H2) were added to appropriate concentrations with the headspace balanced by argon. NifDK proteins (~240 kDa) were added to 0.42 µM, the vial vented to atmospheric pressure, and the reaction initiated by the addition of NifH (~60 kDa) protein to 8.4 µM. Reactions were run, shaking, at 30 °C for 8 min and stopped by the addition of 500 µL of 400 mM EDTA pH 8.0. NH3 was quantified using a fluorescence protocol (Corbin, 1984) with the following modifications: an aliquot of the sample was added to a solution containing 200 mM potassium phosphate pH 7.3, 20 mM o-phthalaldehyde, and 3.5 mM 2-mercaptoethanol, and incubated for 30 min in the dark. Fluorescence was measured at λexcitation of 410 nm and λemission of 472 nm and NH3 was quantified using a standard generated with NH4Cl. H2 and C2H4 were quantified by gas chromatography with a thermal conductivity detector (GC-TCD) and gas chromatography with a flame ionization detector (GC-FID) respectively, according to published methods (Khadka et al., 2016; Yang et al., 2011).
Statistical analyses
Experimental data were statistically analyzed by one-way ANOVA with the post-hoc Tukey HSD test.
Acknowledgements
We thank Dennis Dean and Valerie Cash for providing A. vinelandii strains DJ, DJ2278, DJ2102, and DJ884 and for guidance in genomic manipulations; Jean-Michel Ané, April MacIntyre, and Junko Maeda for guidance and instrumentation support in performing the in vivo acetylene reduction assays; Bruno Cuevas for assistance with visualizing nitrogenase sequence space; and the members of the Metal Selection and Utilization Across Eons (MUSE) Consortium for helpful suggestions and discussions. This research was supported by the National Aeronautics and Space Administration (NASA) Interdisciplinary Consortium for Astrobiology Research: Metal Utilization and Selection Across Eons, MUSE (19- ICAR19_2–0007), the University of Wisconsin-Madison College of Agricultural and Life Sciences, the NASA Postdoctoral Program (AKG), NASA Arizona Space Grant (BMC), the John Templeton Foundation (BK; 61926), the National Science Foundation (BK; 2228495), the NASA Early Career Faculty Award (BK), and the Hypothesis Fund Award (BK).
Funding Statement
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Contributor Information
Betül Kaçar, Email: bkacar@wisc.edu.
Christian R Landry, Université Laval, Canada.
Christian R Landry, Université Laval, Canada.
Funding Information
This paper was supported by the following grants:
National Aeronautics and Space Administration 19- ICAR19_2-0007 to Amanda K Garcia, Derek F Harris, Alex J Rivier, Brooke M Carruthers, Lance Seefeldt, Betül Kaçar, Azul Pinochet-Barros.
John Templeton Foundation 61926 to Betül Kaçar.
National Science Foundation 2228495 to Betül Kaçar, Alex J Rivier, Brooke M Carruthers, Amanda K Garcia.
University of Wisconsin-Madison to Betül Kaçar.
Arizona Space Grant Consortium to Brooke M Carruthers.
National Aeronautics and Space Administration 80NSSC19K1617 to Betül Kaçar.
Hypothesis Fund to Betül Kaçar.
Additional information
Competing interests
No competing interests declared.
Author contributions
Conceptualization, Data curation, Formal analysis, Validation, Investigation, Visualization, Writing – original draft, Writing – review and editing.
Data curation, Formal analysis, Investigation, Visualization, Writing – review and editing.
Data curation, Formal analysis, Investigation.
Data curation, Formal analysis, Investigation.
Data curation.
Resources, Supervision, Writing – review and editing.
Conceptualization, Resources, Data curation, Formal analysis, Supervision, Funding acquisition, Investigation, Methodology, Writing – original draft, Project administration, Writing – review and editing.
Additional files
(a) Sequence characteristics of ancestral nitrogenase subunits. (b) Host taxa of nitrogenase and outgroup dark-operative protochlorophyllide oxidoreductase homologs included for phylogenetic analysis. (c) Strains and plasmids used in this study. (d) Primers used in this study.
Data availability
Materials including bacterial strains and plasmids are available to the scientific community upon request. Phylogenetic data, including sequence alignments and phylogenetic trees, and the script for ancestral gene codon-optimization are publicly available at https://github.com/kacarlab/garcia_nif2023, (copy archived at swh:1:rev:c9b3cf5021e50b4a0995b3972ad81d5cedea4ed5). All other data are included as source data and supplementary files.
References
- Allen JF, Thake B, Martin WF. Nitrogenase inhibition limited oxygenation of earth’s proterozoic atmosphere. Trends in Plant Science. 2019;24:1022–1031. doi: 10.1016/j.tplants.2019.07.007. [DOI] [PubMed] [Google Scholar]
- Anisimova M, Gascuel O. Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. Systematic Biology. 2006;55:539–552. doi: 10.1080/10635150600755453. [DOI] [PubMed] [Google Scholar]
- Ashkenazy H, Abadi S, Martz E, Chay O, Mayrose I, Pupko T, Ben-Tal N. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Research. 2016;44:W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett EM, Murray JW, Isalan M. Engineering nitrogenases for synthetic nitrogen fixation: From pathway engineering to directed evolution. BioDesign Research. 2023;5:bdr.0005. doi: 10.34133/bdr.0005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boyd ES, Anbar AD, Miller S, Hamilton TL, Lavin M, Peters JW. A late methanogen origin for molybdenum-dependent nitrogenase. Geobiology. 2011a;9:221–232. doi: 10.1111/j.1472-4669.2011.00278.x. [DOI] [PubMed] [Google Scholar]
- Boyd ES, Hamilton TL, Peters JW. An alternative path for the evolution of biological nitrogen fixation. Frontiers in Microbiology. 2011b;2:205. doi: 10.3389/fmicb.2011.00205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boyd ES, Costas AMG, Hamilton TL, Mus F, Peters JW. Evolution of molybdenum nitrogenase during the transition from anaerobic to aerobic metabolism. Journal of Bacteriology. 2015;197:1690–1699. doi: 10.1128/JB.02611-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burén S, Rubio LM. State of the art in eukaryotic nitrogenase engineering. FEMS Microbiology Letters. 2018;365:fnx274. doi: 10.1093/femsle/fnx274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burén S, Jiménez-Vicente E, Echavarri-Erasun C, Rubio LM. Biosynthesis of nitrogenase cofactors. Chemical Reviews. 2020;120:4921–4968. doi: 10.1021/acs.chemrev.9b00489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. TrimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carruthers BM, Garcia AK, Rivier A, Kacar B. Automated laboratory growth assessment and maintenance of azotobacter vinelandii. Current Protocols. 2021;1:e57. doi: 10.1002/cpz1.57. [DOI] [PubMed] [Google Scholar]
- Catling DC, Zahnle KJ. The archean atmosphere. Science Advances. 2020;6:eaax1420. doi: 10.1126/sciadv.aax1420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cherkasov N, Ibhadon AO, Fitzpatrick P. A review of the existing and alternative methods for greener nitrogen fixation. Chemical Engineering and Processing. 2015;90:24–33. doi: 10.1016/j.cep.2015.02.004. [DOI] [Google Scholar]
- Christiansen J, Goodwin PJ, Lanzilotta WN, Seefeldt LC, Dean DR. Catalytic and biophysical properties of a nitrogenase apo-mofe protein produced by a nifb-deletion mutant of azotobacter vinelandii. Biochemistry. 1998;37:12611–12623. doi: 10.1021/bi981165b. [DOI] [PubMed] [Google Scholar]
- Corbin JL. Liquid chromatographic-fluorescence determination of ammonia from nitrogenase reactions: A 2-min assay. Applied and Environmental Microbiology. 1984;47:1027–1030. doi: 10.1128/aem.47.5.1027-1030.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dörr M, Kässbohrer J, Grunert R, Kreisel G, Brand WA, Werner RA, Geilmann H, Apfel C, Robl C, Weigand W. A possible prebiotic formation of ammonia from dinitrogen on iron sulfide surfaces. Angewandte Chemie. 2003;42:1540–1543. doi: 10.1002/anie.200250371. [DOI] [PubMed] [Google Scholar]
- Dos Santos PC, Dean DR, Hu Y, Ribbe MW. Formation and insertion of the nitrogenase iron-molybdenum cofactor. Chemical Reviews. 2004;104:1159–1173. doi: 10.1021/cr020608l. [DOI] [PubMed] [Google Scholar]
- Dos Santos PC. Genomic manipulations of the diazotroph azotobacter vinelandii. Methods in Molecular Biology. 2019;1876:91–109. doi: 10.1007/978-1-4939-8864-8_6. [DOI] [PubMed] [Google Scholar]
- Eady RR. Structureminus signfunction relationships of alternative nitrogenases. Chemical Reviews. 1996;96:3013–3030. doi: 10.1021/cr950057h. [DOI] [PubMed] [Google Scholar]
- Falkowski PG. Evolution of the nitrogen cycle and its influence on the biological sequestration of CO2 in the ocean. Nature. 1997;387:272–275. doi: 10.1038/387272a0. [DOI] [Google Scholar]
- Gallon JR. Reconciling the incompatible - N2 fixation and O2. The New Phytologist. 1992;122:571–609. [Google Scholar]
- Garcia AK, Kaçar B. How to resurrect ancestral proteins as proxies for ancient biogeochemistry. Free Radical Biology & Medicine. 2019;140:260–269. doi: 10.1016/j.freeradbiomed.2019.03.033. [DOI] [PubMed] [Google Scholar]
- Garcia AK, McShea H, Kolaczkowski B, Kaçar B. Reconstructing the evolutionary history of nitrogenases: evidence for ancestral molybdenum-cofactor utilization. Geobiology. 2020;18:394–411. doi: 10.1111/gbi.12381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia AK, Cavanaugh CM, Kacar B. The curious consistency of carbon biosignatures over billions of years of earth-life coevolution. The ISME Journal. 2021a;15:2183–2194. doi: 10.1038/s41396-021-00971-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia AK, Kedzior M, Taton A, Li M, Young JN, Kaçar B. System-Level Effects of CO2 and RuBisCO Concentration on Carbon Isotope Fractionation. bioRxiv. 2021b doi: 10.1101/2021.04.20.440233. [DOI] [PubMed]
- Ghebreamlak SM, Mansoorabadi SO. Divergent members of the nitrogenase superfamily: tetrapyrrole biosynthesis and beyond. Chembiochem. 2020;21:1723–1728. doi: 10.1002/cbic.201900782. [DOI] [PubMed] [Google Scholar]
- Gold DA, Caron A, Fournier GP, Summons RE. Paleoproterozoic sterol biosynthesis and the rise of oxygen. Nature. 2017;543:420–423. doi: 10.1038/nature21412. [DOI] [PubMed] [Google Scholar]
- Goldford JE, Hartman H, Smith TF, Segrè D. Remnants of an ancient metabolism without phosphate. Cell. 2017;168:1126–1134. doi: 10.1016/j.cell.2017.02.001. [DOI] [PubMed] [Google Scholar]
- Hardy RW, Holsten RD, Jackson EK, Burns RC. The acetylene-ethylene assay for n(2) fixation: Laboratory and field evaluation. Plant Physiology. 1968;43:1185–1207. doi: 10.1104/pp.43.8.1185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harris DF, Lukoyanov DA, Kallas H, Trncik C, Yang Z-Y, Compton P, Kelleher N, Einsle O, Dean DR, Hoffman BM, Seefeldt LC. Mo-, V-, and fe-nitrogenases use a universal eight-electron reductive-elimination mechanism to achieve N2 reduction. Biochemistry. 2019;58:3293–3301. doi: 10.1021/acs.biochem.9b00468. [DOI] [PubMed] [Google Scholar]
- Harris DF, Badalyan A, Seefeldt LC. Mechanistic insights into nitrogenase Femo-cofactor catalysis through a steady-state kinetic model. Biochemistry. 2022;61:2131–2137. doi: 10.1021/acs.biochem.2c00415. [DOI] [PubMed] [Google Scholar]
- Hurley SJ, Wing BA, Jasper CE, Hill NC, Cameron JC. Carbon isotope evidence for the global physiology of proterozoic cyanobacteria. Science Advances. 2021;7:eabc8998. doi: 10.1126/sciadv.abc8998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jablonski D. Extinction: past and present. Nature. 2004;427:589. doi: 10.1038/427589a. [DOI] [PubMed] [Google Scholar]
- Jiménez-Vicente E, Martin Del Campo JS, Yang Z-Y, Cash VL, Dean DR, Seefeldt LC. Application of affinity purification methods for analysis of the nitrogenase system from azotobacter vinelandii. Methods in Enzymology. 2018;613:231–255. doi: 10.1016/bs.mie.2018.10.007. [DOI] [PubMed] [Google Scholar]
- Kacar B, Garmendia E, Tuncbag N, Andersson DI, Hughes D. Functional constraints on replacing an essential gene with its ancient and modern homologs. MBio. 2017a;8:e01276-17. doi: 10.1128/mBio.01276-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kacar B, Guy L, Smith E, Baross J. Resurrecting ancestral genes in bacteria to interpret ancient biosignatures. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences. 2017b;375:20160352. doi: 10.1098/rsta.2016.0352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. Model finder: Fast model selection for accurate phylogenetic estimates. Nature Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Molecular Biology and Evolution. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kędzior M, Garcia AK, Li M, Taton A, Adam ZR, Young JN, Kaçar B. Resurrected rubisco suggests uniform carbon isotope signatures over geologic time. Cell Reports. 2022;39:110726. doi: 10.1016/j.celrep.2022.110726. [DOI] [PubMed] [Google Scholar]
- Khadka N, Dean DR, Smith D, Hoffman BM, Raugei S, Seefeldt LC. CO2 reduction catalyzed by nitrogenase: pathways to formate, carbon monoxide, and methane. Inorganic Chemistry. 2016;55:8321–8330. doi: 10.1021/acs.inorgchem.6b00388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W. Language Models of Protein Sequences at the Scale of Evolution Enable Accurate Structure Prediction. bioRxiv. 2022 doi: 10.1101/2022.07.20.500902. [DOI]
- Lyons TW, Reinhard CT, Planavsky NJ. The rise of oxygen in earth’s early ocean and atmosphere. Nature. 2014;506:307–315. doi: 10.1038/nature13068. [DOI] [PubMed] [Google Scholar]
- McInnes L, Healy J, Melville J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv. 2020 https://arxiv.org/abs/1802.03426
- Mus F, Alleman AB, Pence N, Seefeldt LC, Peters JW. Exploring the alternatives of biological nitrogen fixation. Metallomics. 2018;10:523–538. doi: 10.1039/c8mt00038g. [DOI] [PubMed] [Google Scholar]
- Mus F, Colman DR, Peters JW, Boyd ES. Geobiological feedbacks, oxygen, and the evolution of nitrogenase. Free Radical Biology & Medicine. 2019;140:250–259. doi: 10.1016/j.freeradbiomed.2019.01.050. [DOI] [PubMed] [Google Scholar]
- Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Noar JD, Bruno-Bárcena JM. Azotobacter vinelandii: the source of 100 years of discoveries and many more to come. Microbiology. 2018;164:421–436. doi: 10.1099/mic.0.000643. [DOI] [PubMed] [Google Scholar]
- Parsons C, Stüeken EE, Rosen CJ, Mateos K, Anderson RE. Radiation of nitrogen-metabolizing enzymes across the tree of life tracks environmental transitions in earth history. Geobiology. 2021;19:18–34. doi: 10.1111/gbi.12419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pérez-González A, Jimenez-Vicente E, Gies-Elterlein J, Salinero-Lanzarote A, Yang Z-Y, Einsle O, Seefeldt LC, Dean DR. Specificity of nifen and vnfen for the assembly of nitrogenase active site cofactors in azotobacter vinelandii. MBio. 2021;12:e0156821. doi: 10.1128/mBio.01568-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, Morris JH, Ferrin TE. UCSF chimerax: structure visualization for researchers, educators, and developers. Protein Science. 2021;30:70–82. doi: 10.1002/pro.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raymond J, Siefert JL, Staples CR, Blankenship RE. The natural history of nitrogen fixation. Molecular Biology and Evolution. 2004;21:541–554. doi: 10.1093/molbev/msh047. [DOI] [PubMed] [Google Scholar]
- Rives A, Meier J, Sercu T, Goyal S, Lin Z, Liu J, Guo D, Ott M, Zitnick CL, Ma J, Fergus R. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. PNAS. 2021;118:e2016239118. doi: 10.1073/pnas.2016239118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sánchez-Baracaldo P, Cardona T. On the origin of oxygenic photosynthesis and cyanobacteria. The New Phytologist. 2020;225:1440–1446. doi: 10.1111/nph.16249. [DOI] [PubMed] [Google Scholar]
- Schneider CA, Rasband WS, Eliceiri KW. NIH image to imagej: 25 years of image analysis. Nature Methods. 2012;9:671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seefeldt LC, Yang Z-Y, Lukoyanov DA, Harris DF, Dean DR, Raugei S, Hoffman BM. Reduction of substrates by nitrogenases. Chemical Reviews. 2020;120:5082–5106. doi: 10.1021/acs.chemrev.9b00556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sippel D, Einsle O. The structure of vanadium nitrogenase reveals an unusual bridging ligand. Nature Chemical Biology. 2017;13:956–960. doi: 10.1038/nchembio.2428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smanski MJ, Bhatia S, Zhao D, Park Y, B A Woodruff L, Giannoukos G, Ciulla D, Busby M, Calderon J, Nicol R, Gordon DB, Densmore D, Voigt CA. Functional optimization of gene clusters by combinatorial design and assembly. Nature Biotechnology. 2014;32:1241–1249. doi: 10.1038/nbt.3063. [DOI] [PubMed] [Google Scholar]
- Smith BE, Thorneley RN, Eady RR, Mortenson LE. Nitrogenases from klebsiella pneumoniae and clostridium pasteurianum. kinetic investigations of cross-reactions as a probe of the enzyme mechanism. The Biochemical Journal. 1976;157:439–447. doi: 10.1042/bj1570439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soboh B, Boyd ES, Zhao D, Peters JW, Rubio LM. Substrate specificity and evolutionary implications of a nifdk enzyme carrying nifb-co at its active site. FEBS Letters. 2010;584:1487–1492. doi: 10.1016/j.febslet.2010.02.064. [DOI] [PubMed] [Google Scholar]
- Som SM, Buick R, Hagadorn JW, Blake TS, Perreault JM, Harnmeijer JP, Catling DC. Earth’s air pressure 2.7 billion years ago constrained to less than half of modern levels. Nature Geoscience. 2016;9:448–451. doi: 10.1038/ngeo2713. [DOI] [Google Scholar]
- Sprouffske K, Wagner A. Growthcurver: An R package for obtaining interpretable metrics889 from microbial growth curves. BMC Bioinformatics. 2016;17:172. doi: 10.1186/s12859-016-1016-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stripp ST, Duffus BR, Fourmond V, Léger C, Leimkühler S, Hirota S, Hu Y, Jasniewski A, Ogata H, Ribbe MW. Second and outer coordination sphere effects in nitrogenase, hydrogenase, formate dehydrogenase, and CO dehydrogenase. Chemical Reviews. 2022;122:11900–11973. doi: 10.1021/acs.chemrev.1c00914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stüeken EE, Buick R, Guy BM, Koehler MC. Isotopic evidence for biological nitrogen fixation by molybdenum-nitrogenase from 3.2 gyr. Nature. 2015;520:666–669. doi: 10.1038/nature14180. [DOI] [PubMed] [Google Scholar]
- Vicente EJ, Dean DR. Keeping the nitrogen-fixation dream alive. PNAS. 2017;114:3009–3011. doi: 10.1073/pnas.1701560114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webb B, Sali A. Comparative protein structure modeling using MODELLER. Current Protocols in Bioinformatics. 2016;54:5. doi: 10.1002/cpbi.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Z, Kumar S, Nei M. A new method of inference of ancestral nucleotide and amino acid sequences. Genetics. 1995;141:1641–1650. doi: 10.1093/genetics/141.4.1641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Z. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Molecular Biology and Evolution. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
- Yang ZY, Dean DR, Seefeldt LC. Molybdenum nitrogenase catalyzes the reduction and coupling of CO to form hydrocarbons. The Journal of Biological Chemistry. 2011;286:19417–19421. doi: 10.1074/jbc.M111.229344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yung YL, McElroy MB. Fixation of nitrogen in the prebiotic atmosphere. Science. 1979;203:1002–1004. doi: 10.1126/science.203.4384.1002. [DOI] [PubMed] [Google Scholar]
- Zehr JP, Jenkins BD, Short SM, Steward GF. Nitrogenase gene diversity and microbial community structure: A cross-system comparison. Environmental Microbiology. 2003;5:539–554. doi: 10.1046/j.1462-2920.2003.00451.x. [DOI] [PubMed] [Google Scholar]
- Zerkle AL, House CH, Cox RP, Canfield DE. Metal limitation of cyanobacterial N 2 fixation and implications for the precambrian nitrogen cycle. Geobiology. 2006;4:285–297. doi: 10.1111/j.1472-4669.2006.00082.x. [DOI] [Google Scholar]






