Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2013 Jan 30;3:419. doi: 10.3389/fmicb.2012.00419

Classifying the metal dependence of uncharacterized nitrogenases

Shawn E McGlynn 1,*, Eric S Boyd 2, John W Peters 2, Victoria J Orphan 1,*
PMCID: PMC3578447  PMID: 23440025

Abstract

Nitrogenase enzymes have evolved complex iron–sulfur (Fe–S) containing cofactors that most commonly contain molybdenum (MoFe, Nif) as a heterometal but also exist as vanadium (VFe, Vnf) and heterometal-independent (Fe-only, Anf) forms. All three varieties are capable of the reduction of dinitrogen (N2) to ammonia (NH3) but exhibit differences in catalytic rates and substrate specificity unique to metal type. Recently, N2 reduction activity was observed in archaeal methanotrophs and methanogens that encode for nitrogenase homologs which do not cluster phylogenetically with previously characterized nitrogenases. To gain insight into the metal cofactors of these uncharacterized nitrogenase homologs, predicted three-dimensional structures of the nitrogenase active site metal-cofactor binding subunits NifD, VnfD, and AnfD were generated and compared. Dendrograms based on structural similarity indicate nitrogenase homologs cluster based on heterometal content and that uncharacterized nitrogenase D homologs cluster with NifD, providing evidence that the structure of the enzyme has evolved in response to metal utilization. Characterization of the structural environment of the nitrogenase active site revealed amino acid variations that are unique to each class of nitrogenase as defined by heterometal cofactor content; uncharacterized nitrogenases contain amino acids near the active site most similar to NifD. Together, these results suggest that uncharacterized nitrogenase homologs present in numerous anaerobic methanogens, archaeal methanotrophs, and firmicutes bind FeMo-co in their active site, and add to growing evidence that diversification of metal utilization likely occurred in an anoxic habitat.

Keywords: nitrogenase, metalloenzyme, molecular similarity, sequence conservation, vanadium, iron, molybdenum

INTRODUCTION

The majority of N in the biosphere is in the kinetically stable form of dinitrogen (N2). The only known biological means of accessing this vast reservoir of N2 is via the activity of nitrogenase, a complex metalloenzyme that catalyzes the reduction of N2 to ammonia (NH3). Nitrogenase comprises a two component enzyme whereby in an ATP-dependent process, electrons are shuttled by the nitrogenase iron protein (NifH) to the dinitrogenase reductase (NifDK) which harbors the active site (Bulen and LeComte, 1966).

The nitrogenase active site consists of a biologically unique iron (Fe)–sulfur (S) cluster [X–7Fe–C-9S] where “X” is either molybdenum (Mo), vanadium (V), or iron (Fe) (Figure 1) (Einsle et al., 2002; Lancaster et al., 2011; Spatzal et al., 2011). These clusters are referred to by their metal type as FeMo-co, FeV-co, and FeFe-co, respectively. The cluster is ligated by non-protein homocitrate as well as cysteine and histidine side chains (Shah and Brill, 1977; Einsle et al., 2002). Biochemical studies have been conducted on representatives of Mo-dependent nitrogenase (Nif), V-dependent nitrogenase (Vnf), and Fe-dependent nitrogenase (Anf) (Robson et al., 1986; Eady et al., 1987; Chisnell et al., 1988; Eady, 1996). These different classes of nitrogenase have in common the ability to catalyze the reduction of N2 to produce NH3 with concomitant reduction of protons to molecular hydrogen (H2). However, they differ in their catalytic rates and efficiencies including the stoichiometry of NH3 and H2 produced per mol N2 reduced (Eady, 1996).

FIGURE 1.

FIGURE 1

The active site of nitrogenase from A.vinelandii (Einsle et al., 2002; Lancaster et al., 2011; Spatzal et al., 2011). The metal cofactor–homocitrate complex is shown as balls and sticks and the residues found to be within 5 Å of the cofactor are shown as lines (thin). Colors: light blue, carbon; blue, nitrogen; red, oxygen; yellow, sulfur; rust, iron; cyan, molybdenum.

Phylogenetic analyses of nitrogenase amino acid sequences derived from available genome databases indicate that nitrogenase homologs cluster into five major lineages (Raymond et al., 2004; Glazer and Kechris, 2009; Boyd et al., 2011a,b; Figure 2). These lineages comprise two Nif nitrogenase groups, as well as the Vnf and Anf groups. Between the Nif and Anf lineages is a separate lineage comprised of biochemically uncharacterized nitrogenases present in the genomes of representatives of the hydrogenotrophic methanogens, methanotrophic archaea, and firmicutes. Members of this lineage appear to be functionally competent as organisms which encode homologs are capable of N2 fixation (Mehta and Baross, 2006; Dekas et al., 2009). Previously, these homologs have been referred to as “uncharacterized nitrogenase” and were shown to form a lineage that branches later than Nif, Vnf, and possibly Anf, indicating that they evolved after metal differentiation in the active site cluster occurred (Boyd et al., 2011b). This observation has prompted questions regarding the metal composition and reactivity of the active site cluster present in these nitrogenase homologs (Boyd et al., 2011b).

FIGURE 2.

FIGURE 2

Maximum likelihood unrooted phylogram of nitrogenase sequences. FeMo-co NifD-A and NifD-B sequences in black and orange, respectively, uncharacterized nitrogenases in purple, AnfD in red, and VnfD in green.

In addition to the influence that active site metal composition has on reactivity, studies have shown that the peptide environment surrounding the active site cofactor also influences the enzyme activity of nitrogenase. For example, substitutions of amino acid residues in close proximity to the FeMo-cofactor (FeMo-co) N2 binding site of nitrogenase resulted in a Nif- phenotype in A. vinelandii and altered the electronic and substrate reduction properties of the protein (Scott et al., 1990). Other residues which surround the active site have also been shown to dramatically alter substrate range of the enzyme (Mayer et al., 2002; Benton et al., 2003; Seefeldt et al., 2009). These experimental results have been instrumental in implicating individual roles for various amino acid residues surrounding the nitrogenase active site in proton transfer, local movement of nearby side chains, substrate binding, reaction intermediate stabilization, and/or modulating the electronic state of the cluster. Thus, both the metal composition of the active site cofactor and the composition of residues lining the active site cavity influence the reactivity of nitrogenase.

In this study, phylogenetic reconstruction and structural modeling were used to compare nitrogenase homologs, specifically to gain insight into the composition of the cluster at the active site of the biochemically uncharacterized nitrogenases. Active site volumes, inferred from predicted structures, and the amino acid environment that surrounds the metallocofactor were compared among nitrogenase homologs. In addition, models and known crystal structures of nitrogenase were compared by structural relatedness. These data provide new insights into the metal cofactor binding proclivity of nitrogenase enzymes and extend our understanding of the primary sequence and structural variation of nitrogenase. Additionally, this work adds further support to the concept that the first nitrogenase was likely to contain a Mo-based active site cofactor (Boyd et al., 2011a,b).

MATERIALS AND METHODS

SEQUENCE AND PHYLOGENETIC ANALYSIS

NifD homologs were obtained from the NCBI non-redundant database through BLASTp searches using NifD, VnfD, and AnfD proteins from A. vinelandii as queries (YP_002797379, YP_002797497, and YP_002801974, respectively). In the case of NifD searches, sequences were compiled using Pattern Hit Initiated BLAST (PHI-BLAST) specifying the NifD conserved “CXRS” amino acid pattern, where X is variable. In addition, a length criterion of 300–700 amino acids was imposed by the [slen] command.

Sequences were aligned using the EMBL ClustalW2 server with default parameters (Gonnet weight matrix) and manipulated with Jalview (Clamp et al., 2004; Waterhouse et al., 2009) and ClustalX (Larkin et al., 2007). Sequences were examined for known catalytic residues as reported in Boyd et al. (2011a). Phylogenies were generated using the PhyML webserver (Guindon et al., 2005) and calculated using the maximum likelihood method (Guindon and Gascuel, 2003; Guindon et al., 2010) using Shimodaira–Hasegawa-like aLRT supports and the LG substitution matrix (Anisimova and Gascuel, 2006). The resulting phylogram was projected using Fig Tree version 1.3.11, and naming accomplished using the REFGEN/TREENAMER online web server (Leonard et al., 2009). Sequence alignments for each of the protein classes used in this study are available upon request from the authors. Analysis of amino acid conservation was judged by the presence of a * (conserved) or a colon above the alignment in ClustalW corresponding to amino groups of strongly similar properties (scoring >0.5 in the Gonnet PAM 250 matrix).

STRUCTURE PREDICTION

Models of NifD and NifD homologs were generated through sequence submission to the iterative threading assembly refinement (I-TASSER) server (Zhang, 2008, 2009; Roy et al., 2010). The top model based on C-score (Wu et al., 2007) was selected for further analysis. The C-scores are derived from calculating convergence of intermediate structures that are produced during the I-TASSER run and range from -5 to 2 with high scores signifying models with high confidence (Zhang, 2008). Structures were visualized using PyMol2. Models were selected in an effort to obtain a representative diversity from each nitrogenase lineage. In the case of uncharacterized nitrogenases, these included nitrogenase associated with the archaeal methanogen Methanocaldococcus infernus ME (YP_003615674.1), an uncultured archaeal methanotroph believed to belong to the ANME-2c clade (ADF27322.1; Pernthaler et al., 2008), and the firmicutes Candidatus Desulforudis audaxviator MP104C (YP_001716346.1) and Syntrophothermus lipocalidus DSM 12680 (YP_003703435.1). A NifD homolog from a ANME-1 phylotype (Meyerdierks et al., 2010) was not included in this analysis as the sequence lacks ligand binding residues and is likely a “Nif-like” protein which are conserved in methanogenic archaea (Staples et al., 2007).

STRUCTURAL COMPARISONS

Inferred structures were compared using an “all against all” comparison performed by the ProCKSI server. The relatedness of structures was calculated with standardized distance matrix derived from the Vorolign V-score (Birzele et al., 2007) and hierarchical clustering accomplished using the Complete Link (furthest neighbor) method (Defays, 1977) with the Clustering Calculator server3.

Active site volumetric calculations were performed on the CASTp server with a probe radius of 1.4 Å (Dundas et al., 2006). Cavities were manually inspected in each case to ensure correspondence with the nitrogenase metal cofactor-binding pocket. In order to identify putative conserved active site second shell residues, representative nitrogenase structures from the Protein Data Bank (PDB; Berman et al., 2003) and those created through homology modeling were structurally aligned with the Pymol program. The amino acids within 5 Å of the metal cofactor in the A. vinelandii 1M1N structure (Einsle et al., 2002), including homocitrate, were selected for analysis.

RESULTS

NITROGENASE HOMOLOG PHYLOGENY

Phylogenetic reconstruction of representative AnfD, VnfD, NifD, and uncharacterized D homologs revealed patterns of clustering that correspond to metal utilization (Figure 2), a finding that is consistent with phylograms reported previously (Raymond et al., 2004; Boyd et al., 2011a,b; Dos Santos et al., 2012). Two lineages comprising Nif were identified, denoted here as NifD-A and NifD-B. The uncharacterized homologs, in relation to Nif, do not resolve into a monophyletic lineage indicating that their phylogenetic placement based solely on D protein divergence should be interpreted with caution. However, the overall branching order of Nif and uncharacterized nitrogenase, whereby VnfD branch basal to AnfD, is consistent with the branching order observed for concatenations of HDK homologs reported previously (Boyd et al., 2011b).

GLOBAL STRUCTURAL COMPARISON

Inferred nitrogenase homolog structures displayed modeling scores of high confidence (C-score, Table 1). These scores range from [-5,2] and models with C-score >-1.5 generally have a correct fold (Roy et al., 2010). In addition, root mean square deviation (RMSD) values obtained through comparison using the program DALI-LITE (Holm and Park, 2000) indicated that the models match very closely to the alpha chain of the high resolution NifD structure 1M1N (Einsle et al., 2002). Similarity can be visually depicted by an overlay of all A. vinelandii inferred structures (Figure 3).

Table 1.

Protein structure modeling statistics.

Organism Accession Metal center Type C-score§ TM-score§ RMSD
Geobacter metallireducens GS-15 YP_383630.1 FeMo-co NifD-A 1.53 0.93 ± 0.06 0.7
Desulfitobacterium hafniense Y51 YP_520503.1 FeMo-co NifD-A 1.00 0.85 ± 0.08 0.7
Uncultured archaeon (ANME-2) ADF27322.1 FeMo-co# Unc. 1.91 0.99 ± 0.04 0.7
Ca. Desulforudis audaxviator MP104C YP_001716346.1 FeMo-co# Unc. 1.20 0.88 ± 0.07 0.9
Syntrophothermus lipocalidus DSM 12680 YP_003703435.1 FeMo-co# Unc. 1.26 0.89 ± 0.07 0.6
Methanocaldococcus infernus ME YP_003615674.1 FeMo-co# Unc. 0.72 0.81 ± 0.09 0.9
Clostridium kluyveri DSM 555 YP_001396457.1 FeMo-co NifD-B 1.59 0.94 ± 0.05 1.6
Methanosarcina acetivorans C2A NP_618769.1 FeMo-co NifD-B 1.54 0.93 ± 0.06 1.4
Methanosarcina mazei Go1 NP_632746.1 FeMo-co NifD-B 1.69 0.95 ± 0.05 1.4
Azotobacter vinelandii DJ P16266.3 FeFe-co AnfD -0.37 0.67 ± 0.13 0.8
Clostridium kluyveri DSM 555 YP_001393772.1 FeFe-co AnfD -0.04 0.71 ± 0.12 0.9
Methanosarcina acetivorans C2A NP_616149.1 FeFe-co AnfD -0.40 0.66 ± 0.13 0.9
Rhodobacter capsulatus Q07933.1 FeFe-co AnfD -0.59 0.64 ± 0.13 0.9
Azotobacter vinelandii DJ YP_002797497.1 VFe-co VnfD 1.66 0.95 ± 0.05 0.8
Clostridium kluyveri DSM 555 YP_001395137.1 VFe-co VnfD 1.76 0.96 ± 0.05 0.8
Methanosarcina acetivorans C2A NP_616155.1 VFe-co VnfD 0.67 0.80 ± 0.09 0.9
Rhodopseudomonas palustris CGA009 NP_946728.1 VFe-co VnfD 1.64 0.94 ± 0.05 0.8

C score and TM-scores are internal I-TASSER generated scores for estimating the quality of predicted models. C-scores exist in a range of [-5,2], where a C-score of higher value signifies a model with a high confidence (Wu et al., 2007; Roy et al., 2010). TM-score indicates topological similarity of protein structure pairs and estimated accuracy with a range of [0, 1]. A score of 1 indicates perfect structural similarity (Sali and Blundell, 1993). RMSD values in the table were calculated using DaliLite (Holm and Park, 2000) against the NifD subunit of the high resolution Nif 1M1N structure (Einsle et al., 2002). Uncharacterized nitrogenases are abbreviated as “Unc.”

§

Internal I-TASSER generated score.

DaliLite calculated RMSD of model against PDB structure 1M1N.

#

Cofactor content inferred in this work.

FIGURE 3.

FIGURE 3

Cartoon diagram of the A.vinelandii nitrogenase D subunits. The NifD structure, drawn from 1M1N is in black (A), and predicted structures for the A. vinelandii vanadium (VnfD) and Fe-only (AnfD) homologs are in green (B) and red (C), respectively. A structural superposition of the three as performed in PyMol is shown in (D).

Among the obtained models, the AnfD structures display the lowest I-TASSER (least confident) modeling scores (Table 1). The AnfD models were examined by superimposing structures and manually inspecting them for structural differences. This led to the identification of a short C-terminal stretch of amino acids universally present in AnfD homologs, which is absent from other nitrogenase homologs. AnfD sequences in which this segment had been removed in silico prior to submission for homology model construction displayed markedly improved scores (C-score of 1.46 and 1.40 for A. vinelandii and C. kluyveri sequences, respectively), indicating that this section of the protein is responsible for lower model fit statistics (Table 1).

Together with NifD structures obtained from the PDB [1M1N (Einsle et al., 2002), 1MIO (Kim et al., 1993), and 1QH1 (Mayer et al., 1999)], the inferred structures of AnfD, VnfD, NifD, and uncharacterized nitrogenase homologs were compared to examine whether D homologs can be resolved from other homologs at the structural level. The Vorolign V-score, which bases similarity on the conservation and evolutionary relationship of amino acid contact sets (i.e., Voronoi contacts) between proteins, resolved the various nitrogenase types into clusters by metal type without exception (Figure 4). AnfD and VnfD were resolved from NifD and uncharacterized D proteins. Uncharacterized D proteins cluster between two clades of NifD, a finding that is consistent with previous phylogenetic analyses of these proteins (Boyd et al., 2011b). This observation supports the hypothesis that the uncharacterized D protein homologs are structurally most similar to that of NifD and may harbor a similar cofactor to that present in the NifD active site (i.e., FeMo-co). In addition, these observations indicate that NifD, VnfD, and AnfD can be separated on the basis of inferred structure alone in a way that is largely consistent with the branching order based on phylogenetic reconstruction. The placement of the uncharacterized D homologs between the two NifD clades is different, however, between the sequence and structural clustering methods (purple groups, Figures 2 and 4). The discrepancy between structural and phylogenetic clustering is possibly the result of the unique structural features of the NifD-B group, which harbor an internal amino acid extension that all other types lack making them structurally distinct (Kim et al., 1993).

FIGURE 4.

FIGURE 4

Structural relationships among nitrogenase D subunits as calculated by Voronoi contacts and the complete linkage clustering method. NifD-A are denoted in black, NifD-B in orange, uncharacterized nitrogenases in purple, AnfD in red, and VnfD in green.

ACTIVE SITE STRUCTURAL RELATIONSHIPS

A comparison of amino acids within 5 Å of the active site FeMo-co in inferred structures indicated that they comprise nine separate sequence areas or motifs (Figure 1; Table 2). For simplicity, the numbering scheme from that of A. vinelandii NifD was adopted when discussing these motifs. Two of these areas correspond to the two active site cofactor ligands Cys275 and His442 and have been discussed elsewhere (Glazer and Kechris, 2009). The remaining sequence areas analyzed here do not share bonds to the active site and can be considered to be a part of the second shell of amino acids which surround the cofactor. The amino acid distribution at these positions between the nitrogenase classes are listed in Table 2. A number of amino acids in these regions were conserved across all nitrogenase proteins (Table 2, bold-faced entries). Unique to each class however, the different nitrogenase types exhibit distinct patterns of conservation. For example, the conserved His383 residue is flanked by either a Glu or Gln residue at position 380 in NifD, whereas this position is occupied by the positively charged Lys380 in the VnfD and AnfD. Uncharacterized nitrogenase homologs display Thr, Leu, or Met, but never a positively charged residue in this position. Another notable substitution occurs near conserved Gly424, where Lys or Arg is present at position 428 in NifD and uncharacterized nitrogenases but is present as Gly in the V- and Fe-dependent nitrogenase.

Table 2.

Sequence conservation near the nitrogenase active site.

Type Sequence feature
G66 R96 Q191 N230 G356 H383 G424
NifD-A CxyAGskGVVwG^ SxxxRRN* GvxQSLGHH1a dyNIGGd& vGGLRxRH§ GYEFxH sGiKEKf
Unc. CtyAGxxGVvxG# R GxsQSxGHh iGdYNixxD ixxGxPrxWHgraphic file with name fmicb-03-00419-g005.jpg FxH ixGxKEkfl
NifD-B CxyAGxkGVv sWxxRRn GVtQSxGHHiA lGEYNIGGD tGGSRxHxY GyeFxHR tGikxky
AnfD CAyCGAKHVIG$ TWqTKRYI GPSQSGGHHKI vGeYNIQGD lWxGGSKLWHW vYtKFGHQ ifTGxRPGE
VnfD CafCGaKLVIGG TWHTKRYP GVSQSKGHHxl IGDfNIQGD IWTGGPRLWHW MSSKFGHq ifTGPRVGxL
62 92 188 226 353 377 421

Sequence features are identified by a residue with a raised numeral indicating the A. vinelandii sequence number of the residue directly below it. Underlined residues are those within 5Å of the A. vinelandii active site cofactor. Residues in bold are conserved among all nitrogenases. Residue positions of lower case exhibit conservation of strongly similar amino acids. Areas with no conservation are indicated by an "x" or a blank space, and raised symbols note positions where an exception exists in the presence of an otherwise conserved residue. Sequence areas within three residues of the underlined positions are shown where conservation exists. Uncharacterized nitrogenases are abbreviated as “Unc.”

^

C is G in Cylindrospermum stagnale AAN63669.1.

*

S is R and second R replaced with C in Paenibacillus fujiensis BAH24166.1.

&

Sequence conservation limited to “NIG” when including Delftia tsuruhatensis AAS55952.1.

§

First G position is replaced by A in Frankia sp. AAB36877.1.

Lyngbya majuscula CCAP 1446 AAY78884.1 contains the insertion “FA” after x.

#

C is undetermined in Candidatus Desulforudis audaxviator YP_001716346.1.

graphic file with name fmicb-03-00419-g005.jpg G replaced with S in unnamed Archaea BAF96801.1.

H is R in BAF96801.1.

$

First G is V in Clostridium hungatei AAB02935.1.

GxxxxxG if including Clostridium hungatei AAB02935.1.

Other differences in sequence conservation in D homologs include Gly69 and Val70. In biochemical assays, Gly69 substitution has been shown to result in resistance to acetylene inhibition, while Val70 substitutions confer the A. vinelandii Mo-dependent nitrogenase the ability to reduce short chain alkynes thereby increasing the substrate size range of the enzyme (Christiansen et al., 2000; Mayer et al., 2002; Benton et al., 2003; Seefeldt et al., 2009). Both Gly69 and Val70 are conserved in the Mo-dependent nitrogenase. However, His or Leu is observed at position 69 in AnfD and VnfD, respectively. Importantly, Gly69 is conserved in the uncharacterized nitrogenase homologs, adding further support to the hypothesis that uncharacterized nitrogenases bind FeMo-co.

To further compare the relatedness of nitrogenase homologs and investigate the potential for uncharacterized nitrogenases to bind FeMo-co, active site volume calculations were performed (Dundas et al., 2006). These calculations were motivated by the untested hypothesis that some of the observed catalytic differences between the nitrogenase types could be related to the nature of the FeMo-co binding cavity. Calculations were performed on three members of each respective lineage, in the absence of the active site cluster. Representatives included homology models generated during this study as well as structures obtained from the PDB (1M1N and 1MIO). The results indicate that calculated active site volumes varied markedly depending on active site cluster composition, with NifD having active site cavities of approximately double the size of those associated with VnfD and AnfD (Table 3). Likewise, the uncharacterized nitrogenase D homologs were shown to have an active site volume that was similar to NifD. These trends were observed both when the inferred structures were compared to each other and when inferred structures were compared with those obtained through x-ray crystallography (e.g., NifD; Kim et al., 1995; Mayer et al., 1999; Einsle et al., 2002). The similarity in uncharacterized nitrogenase active site volume to that of NifD, and the large differences in active site volume between uncharacterized nitrogenase D homologs and AnfD/VnfD support the hypothesis that uncharacterized nitrogenases most likely harbor an active site cofactor that is most similar to that present in Mo-dependent nitrogenase (i.e., FeMo-co).

Table 3.

Average active site cofactor volume (n = 3 in each case) for nitrogenases computed by the CASTp server in the absence of the active site cofactor.

Nitrogenase type Active site volume (Å3) Average deviation
NifD-A 1701 178
Uncharacterized 1842 316
NifD-B 1931 113
AnfD 1198 351
VnfD 1015 168

DISCUSSION

The diversity of reactions catalyzed by metalloenzymes is a consequence of cofactor composition and the protein environment in which these cofactors are bound. This study applies computational tools to examine the relationship between active site cofactor composition, primary sequence variation, and three-dimensional structure in nitrogenase homologs. The results of the analyses of nitrogenase homologs collectively suggest that the uncharacterized nitrogenases are most similar to Mo-nitrogenase and bind an active site metal center similar to FeMo-co. This conclusion is supported by (i) global structure similarity, (ii) comparison of calculated active site volume, and (iii) active site neighboring amino acid composition.

Further support for this conclusion can be derived from genomic comparison. In the genomes of completely sequenced organisms harboring uncharacterized NifD analyzed here (n = 10), AnfD and VnfD homologs are not found, consistent with a previous finding that alternative nitrogenases (Anf/Vnf) are only found in genomes that have a full complement of Nif genes (i.e., Anf and Vnf encoding genomes appear to always encode Nif; Boyd et al., 2011a). In contrast uncharacterized nitrogenase genes occur as the only nitrogenase homolog in their respective genomes. Furthermore these genomes lack genes which encode a third structural subunit (anfG, vnfG) found to be associated with known alternative nitrogenases (Chatterjee et al., 1997; Lee et al., 2009), making it unlikely that they bind either FeV-co or FeFe-co.

Mutagenesis studies have shown that the substrate reduction properties of nitrogenase are modified by amino acid substitutions near the active site (for examples, see Scott et al., 1990; Fay et al., 2007; Seefeldt et al., 2009; Peters et al., 2011). In addition, there are significant differences in the specific activities and substrate reduction properties observed for different metal types of nitrogenase. For example, V- and Fe-only nitrogenases have lower specific activities and divert a higher proportion of reducing equivalent to hydrogen production than Mo-nitrogenase (Bulen and LeComte, 1966; Robson et al., 1986; Eady, 1996; Schneider et al., 1997). These observations indicate that nitrogenase activity is significantly impacted by both the local peptide environment and metal composition.

Many of the amino acids surrounding the cofactor listed in Table 2 are conserved between nitrogenase types, suggesting that these have a fundamental role in nitrogenase and were fixed early in the evolutionary history of nitrogenase. The majority of these residues are involved in direct coordination of nitrogenase metal clusters (P-cluster or FeMo-co in Mo-nitrogenase) or have been implicated in having some functional role through amino acid substitution studies (Scott et al., 1990; Benton et al., 2003; Barney et al., 2004; Igarashi et al., 2004; Lee et al., 2004; Seefeldt et al., 2009; Sarma et al., 2010; Yang et al., 2011). Other residues surrounding the cofactor that are conserved but only specifically within each form of nitrogenase are particularly diagnostic in terms of categorizing the uncharacterized nitrogenases. For example, the position of Arg96 (amino acid sequence of A. vinelandii) is conserved in NifD, whereas both the AnfD and VnfD have Lys in this position. The presence of Arg at this position in unclassified nitrogenase D homologs is suggestive of FeMo-co binding. The presence of a negatively charged glutamate at position 380 in NifD and a positively charged lysine in VnfD/AnfD is intriguing, as this position is located near the variable metal binding site of the nitrogenase cofactor. While uncharacterized nitrogenase D homologs do not exhibit conservation at this position, the amino acid observed at this position is not positively charged as it is in VnfD/AnfD. Other residues that may possibly delineate metal cofactor binding among nitrogenase homologs include position 65 in NifD. Here, both Mo-type and uncharacterized nitrogenase have a conserved Ala whereas both VnfD and AnfD have Cys. Near position 65, Gly69 of Nif and the uncharacterized lineage differs from the conserved His or Leu in Anf and Vnf, respectively. These observations are in line with the observation that in the environment of Cys275, which is a ligand of the nitrogenase cofactor, Mo-type nitrogenase exhibit the amino acid pattern Cys-Tyr-Arg-Ser, Cys-Gln-Arg-Ser, or Cys-His-Arg-Ser, whereas the V and Fe-only forms contain the sequence Cys-Ala-Arg-Ser (Glazer and Kechris, 2009). Together, these patterns suggest that the uncharacterized Nif bind FeMo-co.

In summary, the results presented here suggest that uncharacterized nitrogenase homologs present in the genomes of anaerobic methanogens, anaerobic methanotrophic archaea, and anaerobic firmicutes are likely to harbor FeMo-co. Previous phylogenetic studies indicate that the ancestral nitrogenase contained FeMo-co, and that a FeMo-co containing ancestor then diversified to give rise to alternative nitrogenase and uncharacterized nitrogenase homologs. That the uncharacterized nitrogenase homologs, which have yet to be identified in the genomes of aerobes or facultative anaerobes, harbor FeMo-co supports the notion that diversification of nitrogenase toward the use of alternative metals (e.g., V and Fe) likely occurred in an anoxic environment (Boyd et al., 2011b). Collectively, these observations contrast with hypotheses put forth based on the bioavailability of metals in marine environments and the evolution of nitrogenase, whereby Anf/Vnf were suggested to predate Nif (Anbar and Knoll, 2002; Raymond et al., 2004). Reconciling these observations, we suggest an important role for microenvironments and transient fluctuations in metal availability in driving the diversification of nitrogenase early during its evolutionary history.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

Shawn E. McGlynn is an Agouron Geobiology Option postdoctoral fellow in the Division of Geological and Planetary Sciences at Caltech and is grateful for support provided by the Agouron Institute. Additional support for this work was provided by the Department of Energy Grant DE-SC0004949 (to Victoria J Orphan) and the NASA Astrobiology Institute (NAI) NNA08C-N85A (to John W. Peters and Eric S. Boyd). Eric S. Boyd also wishes to acknowledge support from the National Science Foundation (EAR-1123689 and PIRE-0968421). The authors are grateful for comments from Aaron D. Goldman, Joshua A. Steele, Wolfgang Nitschke, and members of the laboratory of Victoria Orphan.

Footnotes

REFERENCES

  1. Anbar A. D., Knoll A. H. (2002). Proterozoic ocean chemistry and evolution: a bioinorganic bridge? Science 297 1137–1142 [DOI] [PubMed] [Google Scholar]
  2. Anisimova M., Gascuel O. (2006). Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative. Syst. Biol. 55 539–552 [DOI] [PubMed] [Google Scholar]
  3. Barney B. M., Igarashi R. Y., Dos Santos P. C., Dean D. R., Seefeldt L. C. (2004). Substrate interaction at an iron–sulfur face of the FeMo-cofactor during nitrogenase catalysis. J. Biol. Chem. 279 53621–53624 [DOI] [PubMed] [Google Scholar]
  4. Benton P. M. C., Laryukhin M., Mayer S. M., Hoffman B. M., Dean D. R., Seefeldt L. C. (2003). Localization of a substrate binding site on the FeMo-cofactor in nitrogenase: trapping propargyl alcohol with an alpha-70-substituted MoFe protein. Biochemistry 42 9102–9109 [DOI] [PubMed] [Google Scholar]
  5. Berman H., Henrick K., Nakamura H. (2003). Announcing the worldwide Protein Data Bank. Nat. Struct. Biol. 10 980 [DOI] [PubMed] [Google Scholar]
  6. Birzele F., Gewehr J. E., Csaba G., Zimmer R. (2007). Vorolign – fast structural alignment using Voronoi contacts. Bioinformatics 23 e205–e211 [DOI] [PubMed] [Google Scholar]
  7. Boyd E. S., Anbar A. D., Miller S., Hamilton T. L., Lavin M., Peters J. W. (2011a). A late methanogen origin for molybdenum-depen-dent nitrogenase. Geobiology 9 221–232 [DOI] [PubMed] [Google Scholar]
  8. Boyd E. S., Hamilton T. L., Peters J. W. (2011b). An alternative path for the evolution of biological nitrogen fixation. Front. Microbiol. 2:205 10.3389/fmicb.2011.00205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bulen W. A., LeComte J. R. (1966). The nitrogenase system from Azotobacter: two-enzyme requirement for N2 reduction, ATP-dependent H2 evolution, and ATP hydrolysis. Proc. Natl. Acad. Sci. U.S.A. 56 979–986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chatterjee R., Ludden P. W., Shah V. K. (1997). Characterization of VNFG, the delta subunit of the vnf-encoded apodinitrogenase from Azotobacter vinelandii. Implications for its role in the formation of functional dinitrogenase 2. J. Biol. Chem. 272 3758–3765 [DOI] [PubMed] [Google Scholar]
  11. Chisnell J. R., Premakumar R., Bishop P. E. (1988). Purification of a second alternative nitrogenase from a nifHDK deletion strain of Azotobacter vinelandii. J. Bacteriol. 170 27–33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Christiansen J., Cash V. L., Seefeldt L. C., Dean D. R. (2000). Isolation and characterization of an acetylene-resistant nitrogenase. J. Biol. Chem. 275 11459–11464 [DOI] [PubMed] [Google Scholar]
  13. Clamp M., Cuff J., Searle S. M., Barton G. J. (2004). The Jalview Java alignment editor. Bioinformatics 20 426–427 [DOI] [PubMed] [Google Scholar]
  14. Defays D. (1977). An efficient algorithm for a complete link method. Comput. J. 20 364–366 [Google Scholar]
  15. Dekas A. E., Poretsky R. S., Orphan V. J. (2009). Deep-sea archaea fix and share nitrogen in methane-consuming microbial consortia. Science 326 422–426 [DOI] [PubMed] [Google Scholar]
  16. Dos Santos P. C., Fang Z., Mason S. W., Setubal J. C., Dixon R. (2012). Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes. BMC Genomics 13:162. 10.1186/1471-2164-13-162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dundas J., Ouyang Z., Tseng J., Binkowski A., Turpaz Y., Liang J. (2006). CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues. Nucleic Acids Res. 34 W116–W118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Eady R. R. (1996). Structure–function relationships of alternative nitrogenases. Chem. Rev. 96 3013–3030 [DOI] [PubMed] [Google Scholar]
  19. Eady R. R., Robson R. L., Richardson T. H., Miller R. W., Hawkins M. (1987). The vanadium nitrogenase of Azotobacter chroococcum. Purification and properties of the VFe protein. Biochem. J. 244 197–207 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Einsle O., Tezcan F. A., Andrade S. L. A., Schmid B., Yoshida M., Howard J. B., et al. (2002). Nitrogenase MoFe-protein at 1.16 A resolution: a central ligand in the FeMo-cofactor. Science 297 1696–1700 [DOI] [PubMed] [Google Scholar]
  21. Fay A. W., Hu Y., Schmid B., Ribbe M. W. (2007). Molecular insights into nitrogenase FeMoco insertion – the role of His 274 and His 451 of MoFe protein alpha subunit. J. Inorg. Biochem. 101 1630–1641 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Glazer A. N., Kechris K. J. (2009). Conserved amino acid sequence features in the alpha subunits of MoFe, VFe, and FeFe nitrogenases. PLoS ONE 4:e6136 10.1371/journal.pone.0006136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Guindon S., Dufayard J.-F., Lefort V., Anisimova M., Hordijk W., Gascuel O. (2010). New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59 307–321 [DOI] [PubMed] [Google Scholar]
  24. Guindon S., Gascuel O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52 696–704 [DOI] [PubMed] [Google Scholar]
  25. Guindon S., Lethiec F., Duroux P., Gascuel O. (2005). PHYML Online – a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Res. 33 W557–W559 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Holm L., Park J. (2000). DaliLite workbench for protein structure comparison. Bioinformatics 16 566–567 [DOI] [PubMed] [Google Scholar]
  27. Igarashi R. Y., Dos Santos P. C., Niehaus W. G., Dance I. G., Dean D. R., Seefeldt L. C. (2004). Localization of a catalytic intermediate bound to the FeMo-cofactor of nitrogenase. J. Biol. Chem. 279 34770–34775 [DOI] [PubMed] [Google Scholar]
  28. Kim C. H., Newton W. E., Dean D. R. (1995). Role of the MoFe protein alpha-subunit histidine-195 residue in FeMo-cofactor binding and nitrogenase catalysis. Biochemistry 34 2798–2808 [DOI] [PubMed] [Google Scholar]
  29. Kim J., Woo D., Rees D. C. (1993). X-ray crystal structure of the nitrogenase molybdenum–iron protein from Clostridium pasteurianum at 3.0-A resolution. Biochemistry 32 7104–7115 [DOI] [PubMed] [Google Scholar]
  30. Lancaster K. M., Roemelt M., Ettenhuber P., Hu Y., Ribbe M. W., Neese F., et al. (2011). X-ray emission spectroscopy evidences a central carbon in the nitrogenase iron-molybdenum cofactor. Science 334 974–977 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Larkin M. A., Blackshields G., Brown N. P., Chenna R., McGettigan P. A., McWilliam H., et al. (2007). Clustal W and Clustal X version 2.0. Bioinformatics 23 2947–2948 [DOI] [PubMed] [Google Scholar]
  32. Lee C. C., Hu Y., Ribbe M. W. (2009). Unique features of the nitrogenase VFe protein from Azotobacter vinelandii. Proc. Natl. Acad. Sci. U.S.A. 106 9209–9214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Lee H.-I., Igarashi R. Y., Laryukhin M., Doan P. E., Dos Santos P. C., Dean D. R., et al. (2004). An organometallic intermediate during alkyne reduction by nitrogenase. J. Am. Chem. Soc. 126 9563–9569 [DOI] [PubMed] [Google Scholar]
  34. Leonard G., Stevens J. R., Richards T. A. (2009). REFGEN and TREENAMER: automated sequence data handling for phylogenetic analysis in the genomic era. Evol. Bioinform. Online 5 1–4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Mayer S. M., Lawson D. M., Gormal C. A., Roe S. M., Smith B. E. (1999). New insights into structure–function relationships in nitrogenase: a 1.6 A resolution X-ray crystallographic study of Klebsiella pneumoniae MoFe-protein. J. Mol. Biol. 292 871–891 [DOI] [PubMed] [Google Scholar]
  36. Mayer S. M., Niehaus W. G., Dean D. R. (2002). Reduction of short chain alkynes by a nitrogenase α -70Ala-substituted MoFe protein. J. Chem. Soc. Dalton Trans. 4 802–807 [Google Scholar]
  37. Mehta M. P., Baross J. A. (2006). Nitrogen fixation at 92 degrees C by a hydrothermal vent archaeon. Science 314 1783–1786 [DOI] [PubMed] [Google Scholar]
  38. Meyerdierks A., Kube M., Kostadinov I., Teeling H., Glösckner F. O., Reinhardt R., et al. (2010). Metagenome and mRNA expression analyses of anaerobic methanotrophic archaea of the ANME-1 group. Environ. Microbiol. 12 422–439 [DOI] [PubMed] [Google Scholar]
  39. Pernthaler A., Dekas A. E., Brown C. T., Goffredi S. K., Embaye T., Orphan V. J. (2008). Diverse syntrophic partnerships from deep-sea methane vents revealed by direct cell capture and metagenomics. Proc. Natl. Acad. Sci. U.S.A 105 7052–7057 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Peters J. W., Boyd E. S., Hamilton T., Rubio L. M. (2011). “Biochemistry of Mo-nitrogenase” in Nitrogen Cycling in Bacteria: Molecular Analysis, ed.Moir J. W. B. (Norfolk: Caister Academic Press; ) 59–99 [Google Scholar]
  41. Raymond J., Siefert J. L., Staples C. R., Blankenship R. E. (2004). The natural history of nitrogen fixation. Mol. Biol. Evol. 21 541–554 [DOI] [PubMed] [Google Scholar]
  42. Robson R. L., Eady R. R., Richardson T. H., Miller R. W., Hawkins M., Postgate J. R. (1986). The alternative nitrogenase of Azotobacter chroococcum is a vanadium enzyme. Nature 322 388–390 [Google Scholar]
  43. Roy A., Kucukural A., Zhang Y. (2010). I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 5 725–738 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sali A., Blundell T. L. (1993). Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234 779–815 [DOI] [PubMed] [Google Scholar]
  45. Sarma R., Barney B. M., Keable S., Dean D. R., Seefeldt L. C., Peters J. W. (2010). Insights into substrate binding at FeMo-cofactor in nitrogenase from the structure of an alpha-70(Ile) MoFe protein variant. J. Inorg. Biochem. 104 385–389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Schneider K., Gollan U., Dröttboom M., Selsemeier-Voigt S, Müller A. (1997). Comparative biochemical characterization of the iron-only nitrogenase and the molybdenum nitrogenase from Rhodobacter capsulatus. Eur. J. Biochem. 244 789–800 [DOI] [PubMed] [Google Scholar]
  47. Scott D. J., May H. D., Newton W. E., Brigle K. E., Dean D. R. (1990). Role for the nitrogenase MoFe protein alpha-subunit in FeMo-cofactor binding and catalysis. Nature 343 188–190 [DOI] [PubMed] [Google Scholar]
  48. Seefeldt L. C., Hoffman B. M., Dean D. R. (2009). Mechanism of Mo-dependent nitrogenase. Annu. Rev. Biochem. 78 701–722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Shah V. K., Brill W. J. (1977). Isolation of an iron–molybdenum cofactor from nitrogenase. Proc. Natl. Acad. Sci. U.S.A. 74 3249–3253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Spatzal T., Aksoyoglu M., Zhang L., Andrade S. L. A., Schleicher E., Weber S., et al. (2011). Evidence for interstitial carbon in nitrogenase FeMo cofactor. Science 334 940 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Staples C. R., Lahiri S., Raymond J., Von Herbulis L., Mukhophadhyay B., Blankenship R. E. (2007). Expression and association of group IV nitrogenase NifD and NifH homologs in the non-nitrogen-fixing Archaeon Methanocaldococcus jannaschii. J. Bacteriol. 189 7392–7398 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Waterhouse A. M., Procter J. B., Martin D. M. A., Clamp M. E., Barton G. J. (2009). Jalview Version 2 – a multiple sequence alignment editor and analysis workbench. Bioinformatics 25 1189–1191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Wu S., Skolnick J., Zhang Y. (2007). Ab initio modeling of small proteins by iterative TASSER simulations. BMC Biol. 5:17 10.1186/1741-7007-5-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Yang Z.-Y., Dean D. R., Seefeldt L. C. (2011). Molybdenum nitrogenase catalyzes the reduction and coupling of CO to form hydrocarbons. J. Biol. Chem. 286 19417–19421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Zhang Y. (2008). I-TASSER server for protein 3D structure prediction. BMC Bioinformatics 9:40 10.1186/1471-2105-9-40 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Zhang Y. (2009). I-TASSER: fully automated protein structure prediction in CASP8. Proteins 77 (Suppl. 9) 100–113 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES