Abstract
Nitrification, the oxidation of ammonia via nitrite to nitrate, has always been considered as a two-step process catalyzed by chemolithoautotrophic microorganisms oxidizing either ammonia or nitrite. No known nitrifier carries out both steps, although complete nitrification should be energetically advantageous. This functional separation has puzzled microbiologists for a century. Here we report on the discovery and cultivation of a completely nitrifying bacterium from the genus Nitrospira, a globally distributed group of nitrite oxidizers. The genome of this chemolithoautotrophic organism encodes both the pathways for ammonia and nitrite oxidation, which are concomitantly expressed during growth by ammonia oxidation to nitrate. Genes affiliated with the phylogenetically distinct ammonia monooxygenase and hydroxylamine dehydrogenase genes of Nitrospira are present in many environments and were retrieved on Nitrospira-contigs in new metagenomes from engineered systems. These findings fundamentally change our picture of nitrification and point to completely nitrifying Nitrospira as key components of nitrogen-cycling microbial communities.
Introduction
Nitrification is catalyzed by ammonia-oxidizing bacteria (AOB)1 or archaea (AOA)2 and nitrite-oxidizing bacteria (NOB)1. Since the pioneering studies by Sergei Winogradsky more than a century ago3, nitrifying microorganisms are generally perceived as specialized chemolithoautotrophs that obtain energy for growth by oxidizing either ammonia or nitrite. The known ammonia-oxidizing microbes (AOM) and NOB are phylogenetically not closely related, and none of these organisms can oxidize both substrates. This separation of the two nitrification steps in different organisms leads to a tight cross-feeding interaction and the frequently observed co-aggregation of AOM with NOB in nitrifying consortia4. However, the functional separation is a puzzling phenomenon since complete nitrification would yield more energy (ΔG°'=-349 kJ mol-1 NH3) than either single step (ΔG°'=-275 kJ mol-1 NH3 for ammonia oxidation to nitrite and ΔG°'=-74 kJ mol-1 NO2- for nitrite oxidation to nitrate). Thus, an organism catalyzing complete nitrification should have growth advantages over the “incomplete” AOM and NOB. Based on kinetic theory of optimal pathway length5,6, Costa et al.7 argued that a hypothetical complete nitrifier would likely be outcompeted by incomplete, cross-feeding AOM and NOB in many environments. However, the same authors7 also pointed out that a complete nitrifier might be competitive under conditions that favour the maximization of growth yield rather than growth rate and coined the term “comammox” (complete ammonia oxidizer) to describe such a hypothetical microbe. Conditions selecting for comammox may be characterized by slow, substrate influx-limited growth with a spatial clustering of biomass in microbial aggregates and biofilms7. A prerequisite for the existence of comammox would also be that any biochemical incompatibilities of ammonia and nitrite oxidation can be overcome by adaptations of enzymes or cellular compartmentalization7. Aside from these theoretical considerations, the old question of whether comammox exists in nature has not been resolved.
The globally distributed genus Nitrospira represents the most diverse known group of NOB. Nitrospira members have been found in terrestrial8 and limnic habitats9,10, marine waters11, deep sea sediments, sponge tissue12, geothermal springs13, drinking water distribution systems14, corroded iron pipes15, and wastewater treatment plants (WWTPs)10,16. At least six phylogenetic sublineages of Nitrospira exist, of which lineage II appears to be most widely distributed in both natural and engineered ecosystems10. The ecological success of Nitrospira has been linked to an economical pathway for nitrite oxidation17 and a substantial metabolic versatility, which includes the utilization of various organic compounds in addition to nitrite and CO210,11,17–19, cyanate or urea degradation and nitrification by reciprocal feeding with AOM19,20, and chemolithoautotrophic aerobic hydrogen oxidation21.
Enrichment of conspicuous Nitrospira
A microbial biofilm developing on the walls of a pipe under the flow of hot water (56 °C, pH 7.5) raised from a 1,200 m deep oil exploration well (Aushiger, North Caucus, Russia) was sampled and incubated at 46 °C in ammonium-containing mineral medium to enrich moderately thermophilic AOM. After a series of subcultivation steps, we obtained enrichment culture “ENR4” that oxidized ammonia to nitrate and contained a dense population of cells morphologically resembling described Nitrospira species13,15 (Extended Data Figure 1a,b). A second abundant population consisted of rod-shaped cells, but no organism in ENR4 displayed the typical morphologies of known AOM. Inspection by fluorescence in situ hybridization (FISH) with nitrifier-specific rRNA-targeted probes10,22 confirmed that ENR4 contained Nitrospira (Extended Data Figure 1c) but no other detectable nitrifiers. Moreover, known bacterial or archaeal genes of ammonia monooxygenase (AMO) subunit alpha (amoA) and 16S rRNA genes of AOA were not detected by PCR in ENR4. Considering the intriguing possibility that the Nitrospira population might be responsible for both ammonia and nitrite oxidation, we sequenced the metagenome of the enrichment (Supplementary Tables 1-7) to identify the ammonia oxidizer. Sequence assembly and differential coverage binning23 showed that the ENR4 metagenome was dominated by two organisms (one Nitrospira strain and a betaproteobacterium affiliated with the family Hydrogenophilaceae) and revealed two additional rare populations (an alphaproteobacterium related to Tepidamorphus gemmatus, family Rhodobiaceae, and an actinobacterium affiliated with Thermoleophilum, family Thermoleophilaceae). Archaea were not detected (Extended Data Figure 2a). Based on the relative genome sequence coverage in three sequenced samples of the culture, Nitrospira was the most abundant population in ENR4 (68 to 80% of the community) followed by the betaproteobacterium (18 to 29%) and the other two organisms (≤2%). Subsequent FISH identified the relatively abundant rod-shaped cells as the betaproteobacterium (Extended Data Figure 1c), whereas the two rare populations were encountered only sporadically by microscopy. Further subcultivation led to enrichment “ENR6” that also oxidized ammonia to nitrate and, according to metagenome analysis, contained only Nitrospira (60% according to relative sequence coverage) and the betaproteobacterium (40%) (Extended Data Figure 2b). The time of enrichment, from sampling of the source biofilm to ENR6, was four years. The high sequence coverage (Extended Data Figure 2) allowed us to reconstruct complete and closed Nitrospira genomes and almost complete genomes of the other bacteria from the metagenomes of cultures ENR4 and ENR6, respectively. The Nitrospira genomes retrieved from the two enrichments were identical. We provisionally classify this highly enriched Nitrospira strain as “Candidatus Nitrospira inopinata” (in.o.pi.na'ta. L. fem. adj. inopinata unexpected, surprising).
Discovery of comammox
The obtained bacterial genomes were screened for the key functional genes of autotrophic nitrification. As expected, Ca. N. inopinata possesses the key enzyme for nitrite oxidation, nitrite oxidoreductase (NXR). Its genome contains the nxrA and nxrB genes coding for the subunits alpha and beta, respectively, of the periplasmic Nitrospira NXR17 and genes of four candidate Nitrospira NxrC gamma subunits17 (Extended Data Figure 3). Unlike other cultured Nitrospira, which possess two to five paralogous copies of the nxrAB genes8,17, Ca. N. inopinata has only one copy of these genes. Much more surprisingly, Ca. N. inopinata also possesses homologs to the hallmark enzymes of ammonia oxidation, AMO and hydroxylamine dehydrogenase (also referred to as hydroxylamine oxidoreductase, HAO)24 (Extended Data Figure 1d). Its amoA gene is dissimilar to those of canonical AOM and was thus not picked up in the initial amoA PCR screening of ENR4. The three AMO subunits alpha (AmoA), beta (AmoB), and gamma (AmoC) are encoded by a single amoCAB gene cluster and by two additional amoC genes at other genomic loci (the AmoC copies share amino acid sequence identities of 99.63 to 100%) (Figure 1, Extended Data Figure 3). The amoCAB gene order is conserved in all AOB24. Ca. N. inopinata also has homologs of the putative membrane-associated proteins AmoD and AmoE of AOB, which may interact with the ammonia-oxidizing machinery or the electron transport chain25, and a homolog of the putative membrane protein OrfM found in all AOB24 (Figure 1, Extended Data Figure 3). Similar to betaproteobacterial AOB24, genes of the copper resistance proteins CopC/D are located close to the amo locus (Figure 1). The single hao gene of Ca. N. inopinata encodes a predicted octaheme cytochrome (cyt.) c protein resembling the HAO of AOB26. Like in AOB, hao shares a genomic locus with gene haoB of a putative membrane protein found in all AOB27 and with two genes of tetraheme c-type cytochromes, which resemble cyt. c554 (CycA) and cyt. cm552 (CycB) of AOB24 (Figure 1). HAO, CycA, and CycB form the hydroxylamine ubiquinone reduction module (HURM) in AOB, which transfers electrons from hydroxylamine to the quinone pool28. The full genetic complement for both ammonia and nitrite oxidation strongly suggested that Ca. N. inopinata is a comammox organism (Extended Data Figure 1d). No canonical nitrification genes were found in the genomes of the other three bacteria detected in ENR4, suggesting that these co-enriched organisms were heterotrophs that utilized organic substrates produced by the autotrophic Nitrospira29. The betaproteobacterial genome, which was identical in enrichments ENR4 and ENR6, encodes a membrane-associated nitrate reductase that is highly similar to the known nitrate reductases of E. coli and other Proteobacteria.
Phylogenetic inference based on 16S rRNA gene sequences showed that Ca. N. inopinata belongs to the widely distributed lineage II of the genus Nitrospira10 (Extended Data Figure 4). The other cultured Nitrospira strains in lineage II are N. moscoviensis15, N. lenta30, and N. japonica31, which are NOB and do not possess the enzymatic repertoire to utilize ammonia as energy source. Consistently, the affiliation of Ca. N. inopinata with Nitrospira lineage II was supported by phylogenies based on nxrB gene and NxrA protein sequences (Extended Data Figure 5). The nxrB gene is a suitable phylogenetic marker to differentiate the Nitrospira lineages8. NxrA phylogenies reliably distinguish Nitrospira NxrA from related enzymes17, but their resolution within the genus Nitrospira has not been evaluated and assignments of NxrA sequences to specific Nitrospira lineages must be treated with caution. Ca. N. inopinata represents a different species than the two comammox Nitrospira strains described by van Kessel et al.32 based on the low pairwise average nucleotide identities (70.3 to 71.6%) between the genomes of Ca. N. inopinata and these organisms.
Full nitrification by Ca. N. inopinata
Complete nitrification by Ca. N. inopinata was demonstrated by incubation experiments in mineral media containing ammonium as the sole source of energy and reductant, and bicarbonate/CO2 as the sole carbon source. Consistent with the anticipated activity of comammox, Ca. N. inopinata nearly stoichiometrically oxidized 1 mM or 0.1 mM ammonium to nitrate (Figure 2a,b). A transient accumulation of nitrite (up to 30% of the added ammonium) was observed in parallel to nitrate production, but nitrite was completely oxidized after all ammonium had been consumed. Much lower nitrite accumulation was detected in an experiment with 10 μM ammonium (Figure 2c), suggesting that experimental parameters strongly influence this phenomenon and that nitrite accumulation might actually not occur under environmentally relevant conditions. In no experiment apparent nitrogen loss caused by formation of gaseous compounds was observed (Figure 2), suggesting that NO formation from nitrite by NirK (Extended Data Figure 1d) was not quantitatively important for Ca. N. inopinata. Growth of Ca. N. inopinata during oxidation of ammonium to nitrate was demonstrated by quantitative PCR targeting its single-copy amoA gene and continued after consumption of ammonium in the presence of nitrite until the end of the experiment (Figure 2d). A pure culture of the betaproteobacterium, the only non-Nitrospira microbe in ENR6, was isolated in acetate-containing medium and showed no nitrifying activity after inoculation into ammonium- or nitrite-media at cell densities higher than the density of Ca. N. inopinata in the growth experiment (Figure 2d, Extended Data Figure 6). The function of Ca. N. inopinata as comammox was further confirmed by metaproteome analysis of ENR4, which showed that all key proteins of Ca. N. inopinata for ammonia and nitrite oxidation were expressed during incubation with ammonium and that NXR, HAO, and AmoB were among the 50 most abundant proteins of this organism (Extended Data Figure 7).
When a culture grown on ammonium was transferred into mineral medium containing only 0.5 mM nitrite as energy source and electron donor, Ca. N. inopinata stoichiometrically oxidized nitrite to nitrate (Figure 2e). However, subsequent additions of nitrite did not result in further nitrite oxidation. We hypothesize that nitrite was first oxidized by residual NXR activity, but metabolic activity and biosynthetic processes finally stalled in the absence of ammonium. This would be consistent with an incapability of Ca. N. inopinata to use nitrite as nitrogen source for assimilation due to the absence of genes for a nitrite transporter and assimilatory nitrite reductase. It is interesting to note that Ca. N. inopinata could theoretically utilize nitrite as nitrogen source by respiratory ammonification catalyzed by a periplasmic cytochrome c nitrite reductase (NrfA) using electrons from quinol (Extended Data Figure 1d). Genes for respiratory ammonification have not been detected in the genomes of the two comammox strains reported by van Kessel et al.32 and of other Nitrospira17,19. However, the Ca. N. inopinata genome lacks a second copy of respiratory complex III, which may be needed for the reverse transport of high-potential electrons from nitrite to quinone in other Nitrospira17. If nitrite is the only available electron donor, this gap in the reverse electron transport chain likely prevents nitrite reduction by NrfA and CO2 fixation by the reductive tricarboxylic acid cycle, which is the autotrophic pathway in Ca. N. inopinata (Extended Data Figure 1d) and other Nitrospira17. Thus, Ca. N. inopinata may grow on nitrite only in the presence of a low-potential electron donor for quinone reduction such as H221 or intracellular storage compounds (Extended Data Figure 1d). In contrast, electrons derived from ammonia should be transferred to quinone as in AOB1 and thus enable autotrophic growth (Extended Data Figure 1d). An ammonium transporter also enables Ca. N. inopinata to use ammonium as nitrogen source. The incapability to grow on nitrite distinguishes Ca. N. inopinata from the strictly nitrite-oxidizing Nitrospira that grow on nitrite and CO215,17.
Distinct AMO and HAO of Ca. N. inopinata
The amoA gene is a functional and phylogenetic marker for AOM33,34, which has been utilized in numerous studies as a cultivation-independent tool to detect and identify AOM in microbial communities. Intriguingly, phylogenetic analyses revealed that Ca. N. inopinata possesses a new type of AmoA that differs from the AmoA forms of known AOB and AOA. It belongs to a distinct clade (“comammox AmoA clade A”) that contains numerous environmental sequences and shares a common ancestor with the AmoA lineage of the betaproteobacterial AOB (Figure 3, Extended Data Figure 8). Similar to the AmoA phylogeny, the amoB and amoC as well as the hao genes of Ca. N. inopinata fell into distinct lineages that are related to the respective homologs of AOB (Extended Data Figure 9).
A fascinating question is whether the unusual enzymes for ammonia oxidation are ancestral features of the genus Nitrospira or were acquired by Ca. N. inopinata by lateral gene transfer (LGT). The first scenario would imply that these genes have been lost by the strictly nitrite-oxidizing Nitrospira members. Indications for a possible LGT event in Ca. N. inopinata are putative transposase genes directly upstream of the amoCAB genes (Figure 1) and a tetranucleotide pattern of the amoCAB-containing region that significantly deviates from the genome-wide signature (Extended Data Figure 10). The tetranucleotide pattern of the amoCAB region also clearly differs from the genome-wide signature of the betaproteobacterium found in ENR4 and ENR6, strongly suggesting that these genes did not originate from this strain. Putative transposases are also located downstream of the nxrAB operon, whose tetranucleotide pattern also deviates from the genome-wide signature (Figure 1, Extended Data Figure 10). Ca. N. inopinata belongs to Nitrospira lineage II (Extended Data Figure 4), and its NXR also is affiliated with lineage II (Extended Data Figure 5). Thus, if LGT occurred, the nxr genes must have been received from another Nitrospira lineage II member. No indications were found for LGT of the other two amoC copies or of hao and the other HURM genes of Ca. N. inopinata (Figure 1, Extended Data Figure 10).
Distribution of comammox Nitrospira
A screening of public databases retrieved sequences within comammox AmoA clade A, which originated from paddy and other agricultural soils, forest soils, paddy field floodwater, freshwater environments such as wetlands, river beds, aquifers, and lake sediments, and from engineered systems (activated sludge and drinking water treatment plants) (Extended Data Figure 8). For most of these sequences no quantitative information regarding their abundance is available. However, for three metagenomic datasets from Rifle soils35, relative abundances could be estimated from raw sequence data. In these soils, archaeal amoA sequences were found to be 3.8 to 10.5 fold more abundant than comammox amoA sequences. Interestingly, only very low numbers of betaproteobacterial amoA sequences were found and those were retrieved exclusively from the unassembled Rifle datasets. Additional database searches retrieved sequences from soil, freshwater, and engineered environments that clustered in phylogenetic trees with the amoB, amoC, and hao genes of Ca. N. inopinata (Extended Data Figure 9). These results are consistent with a wide environmental distribution of comammox organisms with the possible exception of oceanic environments, as no comammox marker genes were identified in marine metagenomes.
To further elucidate the distribution of putative comammox Nitrospira in engineered systems, we sequenced total community metagenomes from a pilot-scale membrane bioreactor (MBR) at the municipal WWTP Aalborg West (Denmark) and of nitrifying activated sludge from the full-scale WWTP of the University of Veterinary Medicine, Vienna, Austria (“WWTP VetMed”). A great diversity of lineage I and II Nitrospira had previously been detected in WWTP VetMed18. Quantitative FISH in the VetMed sample used for metagenomics revealed dominance of Nitrospira (7.5±3% of all detectable bacteria, 1 s.d.) over AOB (2.5±1.2%). AOA did not occur in this WWTP at an abundance relevant for nitrification as no sequences affiliated with Thaumarchaeota were detected in the metagenomic dataset. Additionally, we sequenced metagenomes of pasty and suspended iron sludge from a groundwater well (GWW) of a waterworks (Wolfenbüttel, Germany) (Supplementary Tables 1 and 2). Nitrospira population genome bins were retrieved by differential coverage binning23 from all metagenomes (Supplementary Table 8). According to 16S rRNA and NXR phylogenies, these Nitrospira belonged to lineages I and II (Extended Data Figures 4 and 5). Intriguingly, amo and hao genes similar to those of Ca. N. inopinata were found in one or more Nitrospira bins from every sample, suggesting that comammox Nitrospira frequently occur in engineered systems (Extended Data Figures 8 and 9, Figure 1). However, not all Nitrospira bins contained genes for ammonia oxidation (Supplementary Table 8). Since nxr genes were found in all bins except GWW Nitrospira bin 7, we assume that comammox coexisted in these communities with Nitrospira that were strict nitrite oxidizers or utilized alternate metabolisms19,21. In several Nitrospira bins with sufficiently long contigs, the amo, hao and nxr loci and flanking genes were syntenic with Ca. N. inopinata (Figure 1). In particular, a gene cluster for cyt. c biogenesis is located directly upstream of the hao gene in Ca. N. inopinata and metagenomes (Figure 1). This gene arrangement is not found in genome-sequenced AOB36–40, suggesting that it may be diagnostic for comammox Nitrospira. Transposases were found close to the amo and nxr genes, respectively, in two Nitrospira bins (Figure 1). Moreover, a second type of AmoA was identified in some of the GWW Nitrospira bins. These sequences fell into a phylogenetic sister lineage of comammox AmoA clade A, which also contains other environmental sequences from soil and freshwater ecosystems (Extended Data Figure 8, Figure 3), and showed considerably lower identities to the AmoA of Ca. N. inopinata (Figure 1). Consequently, we refer to this lineage as “comammox AmoA clade B”. The amoB and amoC sequences from those Nitrospira bins, which contained clade B-amoA, also fell separately in the respective phylogenetic trees (Extended Data Figure 9a,b). Thus, two different and related new types of AMO occur in bacteria from the genus Nitrospira, and both share a common ancestor with the AMO of the betaproteobacterial AOB (Figure 3).
We have noticed that sequences in comammox AmoA clades A and B were previously assigned as particulate methane monooxygenase (PmoA) to uncultured gammaproteobacterial (clade A, Crenothrix polyspora) and alphaproteobacterial (clade B) methanotrophs41,42. While these assignments were based on indirect evidence, our study provides direct physiological proof that an organism expressing an enzyme in clade A oxidizes ammonia and metagenomic evidence for a Nitrospira origin of genes in both clades. However, it remains possible that genes in these clades were exchanged by LGT between nitrifiers and methanotrophs.
Discussion
The first cultured comammox organism Ca. N. inopinata is a moderately thermophilic Nitrospira member, and uncultured mesophilic comammox Nitrospira were identified by metagenomics in this study, too. The genus Nitrospira is one of the most diverse8,18 known nitrifier groups and colonizes virtually all oxic ecosystems10 including high-temperature environments13,15. It is tempting to speculate that the environmental distribution of comammox is largely congruent with that of Nitrospira, which are mostly uncultured and poorly characterized. Previous research was based on the dogma that all Nitrospira use nitrite, but not ammonia, as energy source. Due to this firm expectation, comammox Nitrospira were overlooked for decades and some repeatedly observed phenomena could not be well explained. For example, conspicuously high in situ abundances of uncultured Nitrospira, which exceeded the abundances of known AOM in the same samples, were detected in nitrifying biofilms, activated sludge, freshwater sediments, and drinking water distribution systems14,18,43–45. These puzzling observations are inconsistent with the classical concept of nitrification, which suggests a AOM:NOB ratio greater than one46. Aside from other energy-conserving metabolic activities of NOB in addition to nitrite oxidation10,21,46, the presence of comammox organisms in those Nitrospira communities would be a plausible explanation for the increased ratio of Nitrospira over known AOM. Indeed, we detected amo and hao genes in the Nitrospira metagenome from WWTP VetMed (Figure 1, Extended Data Figures 8 and 9), a system in which Nitrospira outnumber AOB according to FISH and comammox represents 43 to 71% of the Nitrospira population as estimated from gene abundances in the metagenomic datasets. A high relative abundance of comammox (58 to 74% of all Nitrospira) was also estimated for the GWW based on metagenome analysis. More precise analyses of comammox abundance as well as its spatial interactions with other community members will require the development of assays to rapidly differentiate in situ between strictly nitrite-oxidizing and comammox Nitrospira.
Studies with strictly nitrite-oxidizing representatives of this genus characterized Nitrospira as slow-growing microbial K-strategists adapted to low substrate concentrations18,43,47,48. Many Nitrospira, including Ca. N. inopinata, also form microcolonies, flocs, and biofilms10,43. These properties, if generally shared by comammox Nitrospira, would be in agreement with the theoretically predicted7 ecological niche of comammox. The engineered systems surveyed in this study are characterized by biofilm or floc formation. Diffusion barriers and ammonium or nitrite concentration gradients47 in biofilms could create niches with limited substrate influx, where comammox might outcompete incomplete nitrifiers. Complex biofilm or floc architectures with numerous microenvironments may support diverse nitrifier communities like in WWTP VetMed, which consist of comammox as well as canonical AOB and NOB. Future comammox isolates from the Ca. N. inopinata culture and from other enrichments may offer chances to experimentally define the conditions that select for these organisms and to study the competition of comammox with other nitrifiers, including strictly nitrite-oxidizing Nitrospira and AOA adapted to low substrate concentrations48,49.
The discovery of comammox has revealed that the division of metabolic labour in nitrification is not obligate and will thus have far-reaching implications for future studies on the microbiology of nitrogen cycling. It opens a new field in nitrification research and some of the most pressing open questions range from the biochemistry, regulation, inhibition, and kinetics of complete nitrification to the diversity, population dynamics, metabolic versatility, and biological interactions of comammox organisms. In particular, the integration of comammox in studies on the niche specialization and niche partitioning of AOB and AOA50 or NOB43 will be crucial to obtain a picture of nitrification as it actually occurs in nature. Such insights may lead to refined strategies to manage nitrification in sewage treatment, drinking water supply, and agriculture. The presence of new AMO and HAO types, which share common ancestry with these enzymes of betaproteobacterial AOB, in the phylogenetically deep-branching genus Nitrospira15 impressively exemplifies the modular evolution of the nitrogen cycle28 and adds further complexity to the intricate evolutionary history of nitrification17,28.
Methods
Sampling sites
The inoculum for the Ca. N. inopinata enrichment culture was sampled from a microbial biofilm that grew on the metal surface of a pipe and was covered by hot water, which was raised from a 1,200 m deep oil exploration well. The water temperature was 56 °C and the pH 7.5. The well was located in Aushiger, North Caucus, Russia (43°22'45.0''N, 43°43'26.1''E). The biofilm samples were taken in April 2011. Activated sludge, membrane biofilm, and foam (from a foaming event) samples were taken in August and October 2014 from a pilot-scale membrane bioreactor (MBR) performing nitrogen removal and enhanced biological phosphorus removal (EBPR) at the conventional full-scale WWTP Aalborg West, Aalborg, Denmark (57°02'59.9"N, 9°51'55.4"E). The influent wastewater for this MBR came from the primary settling tank of the full-scale plant, entering an anoxic/denitrification (2 m3) tank and going to an oxic/nitrification (2 m3) tank. An anaerobic tank (1.8 m3) used for return sludge sidestream hydrolysis provided easily degradable substrate for EBPR and denitrification. Activated sludge was also sampled from an aerated activated sludge basin (tank no. 2) of the full-scale WWTP of the University of Veterinary Medicine, Vienna, Austria (48°15'17.8"N, 16°25'45.6"E) in January 2015 (WWTP VetMed). The two continuously operated activated sludge tanks of this WWTP have a volume of 254 m3 each. The wastewater composition and nitrogen load vary with the amounts of animal feces and other sewage. This WWTP was known to host a large diversity of Nitrospira18. Iron sludge samples were taken from groundwater well (GWW) no. 1 of the well field of the Wolfenbüttel waterworks (Wolfenbüttel, Germany) (52°08'55.9", 10°32'33.9"E). The well has a depth of 50 m below ground level (bgl) and a diameter of 600 mm. Groundwater is extracted through two well intake screens in 28 to 38 m bgl and 46 to 48 m bgl. The normal well capacity is 160 m³/h. Before sampling, the well had been out of operation for about three weeks. The well water is a mixture of aerobic and anaerobic groundwater from two different ground water storeys and is characterized by the following parameters (values from years 2012 to 2014): pH about 7.2, about 10 °C, 5 to 10 mg/l dissolved oxygen, 0.13 to 0.17 mg/l ammonium, <0.01 mg/l nitrite, 12 to 16 mg/l nitrate, 0.16 to 0.42 mg/l total iron, 0.03 to 0.08 mg/l manganese, 0.64 to 0.99 mg/l total organic carbon, 0.44 to 0.78 mg/l dissolved organic carbon, 71 to 81 mg/l dissolved inorganic carbon, 121 to 138 mg/l calcium. The drop pipe, through which the extracted water is pumped to ground level, was drawn out of the well on April 27, 2015 and had deposits of pasty iron sludge on the inner surface. A sample was taken from these deposits at several points corresponding to depths between 20 and 10 m bgl. A second sample consisted of suspended iron sludge deposits that had been flushed away from the upper well intake screen and retained on a fleece filter during pumping out of the turbid water on April 28, 2015.
Enrichment and cultivation of Ca. N. inopinata
The biofilm used as inoculum was suspended and incubated at 46 °C with 0.5 mM NH4Cl in a modified AOM medium51 containing (per liter): 50 mg KH2PO4; 75 mg KCl; 50 mg MgSO4 × 7H2O; 584 mg NaCl; 4 g CaCO3 (mostly undissolved, acting as a solid buffering system and growth surface); 1 ml of specific trace element solution (TES); and 1 ml of selenium-wolfram solution (SWS)52. The composition of TES and SWS is described below. Both solutions were added to the autoclaved medium by sterile filtration using 0.2 µm pore-size cellulose acetate filters (Thermo Scientific). The pH of the medium was around 8.2 after autoclaving and was kept around 7.8 by the CaCO3 buffering system during growth of the enrichment. TES contained (per litre): 34.4 mg MnSO4 × 1H2O; 50 mg H3BO3; 70 mg ZnCl2; 72.6 mg Na2MoO4 × 2H2O; 20 mg CuCl2 × 2H2O; 24 mg NiCl2 × 6H2O; 80 mg CoCl2 × 6H2O; 1 g FeSO4 × 7H2O. All salts except FeSO4 × 7H2O were dissolved in 997.5 ml Milli-Q water and 2.5 ml of 37% HCl was added before dissolving the FeSO4 × 7H2O salt. SWS contained (per litre): 0.5 g NaOH; 3 mg Na2SeO3 × 5H2O; 4 mg Na2WO4 × 2H2O. The primary ammonium-consuming enrichment was subsequently treated with antibiotics (one treatment with 50 mg/l vancomycin, two treatments with 50 mg/l bacitracin). The ammonium concentration was increased to 1 mM NH4Cl for these and all further cultivation steps. After these treatments and repeated serial dilutions in AOM medium without antibiotics, enrichment culture ENR4 was obtained that was characterized in this study. An aliquot of ENR4 was incubated at 50 °C for four weeks and then subjected to serial dilution at 46 °C. Propagation of the most diluted (10-8) ammonia-oxidizing culture was followed by serial dilution in AOM medium containing 1 mM urea instead of ammonium. The most diluted (10-7) urea-consuming (i.e., nitrifying) culture was again cultivated in AOM medium with 1 mM NH4Cl and subjected to repeated serial dilutions, which resulted in culture ENR6 that was also characterized in this study. Enrichments ENR4 and ENR6 were further cultivated in 100 ml or 250 ml Schott bottles in AOM medium containing 1 mM NH4Cl. To obtain enough biomass for DNA extraction, enrichment ENR4 was up-scaled in 1 l and 2 l Schott bottles. The composition of enrichment cultures was analyzed by phase contrast microscopy, electron microscopy, FISH with rRNA-targeted probes, amoA- and 16S rRNA-specific PCR, and metagenomics (see below for methodological details).
Physiological experiments with Ca. N. inopinata
To study nitrification by Ca. N. inopinata, an actively nitrifying ENR4 stock culture was harvested by centrifugation (9,300×g, 30 min, 10 °C) and the biomass was suspended in AOM medium (see above) without ammonium. Aliquots (25 ml) of this suspension were distributed to 100 ml Schott bottles (all glassware was rinsed twice in 6 M HCl and three times in Milli-Q water, autoclaved, and dried at 60 °C prior to use). After addition of NH4Cl to final concentrations of 1 mM, 0.1 mM, or 10 µM, respectively, or of NaNO2 to a final concentration of 0.5 mM, the biomass was incubated at 46 °C for 9 h (10 µM NH4Cl) or 48 h (other experiments) without agitation in the dark. Samples (500 µl) for chemical analyses (see below) were taken directly after ammonium or nitrite addition and during the incubations. The samples were centrifuged (22,000×g, 10 min, 4 °C) to remove cells and undissolved CaCO3 and 450 µl of the supernatant was transferred to plastic tubes and stored at -20 °C until analysis. Each incubation condition except 10 µM NH4Cl was performed in parallel with four biological replicates (biological triplicates for 10 µM NH4Cl), two dead biomass controls (cells were killed by autoclaving), and two abiotic controls that contained only medium and substrate, but no biomass. After the experiments, the remaining biomass was harvested by centrifugation (9,300×g, 30 min, 10 °C), frozen immediately at -80 °C, and shipped on dry ice for proteome analysis. To quantify growth of Ca. N. inopinata by complete nitrification, culture ENR4 was incubated in mineral NOB medium, which has been used to cultivate nitrite-oxidizing Nitrospira21. In this experiment, the NOB medium was amended with ammonium instead of nitrite. The NOB medium was chosen because it contains less CaCO3, which can affect quantitative PCR (qPCR) efficiency and accuracy. Nitrifying activity of ENR4 in NOB medium was confirmed in preceding tests. Biomass from the supernatant (without undissolved CaCO3) from an ammonia-oxidizing culture was washed once in NOB medium, harvested by centrifugation (9,300×g, 30 min, 10 °C), and prepared for incubation as described above. Following the addition of NH4Cl to a final concentration of 0.6 mM, samples (100 µl) for quantitative PCR were taken immediately and after 4, 5, 7, and 8 days of incubation. Samples for chemical measurements (see below) were taken immediately and after 1, 4, 5, 7, and 8 days of incubation. All samples were stored at -20 °C until analysis. These incubation experiments were performed in biological triplicates. Copy numbers of the Ca. N. inopinata amoA gene were determined by qPCR using the newly designed Ca. Nitrospira inopinata amoA gene-specific primers Nino_amoA_19F (5’-ATAATCAAAGCCGCCAAGTTGC-3’) and Nino_amoA_252R (5’-AACGGCTGACGATAATTGACC-3’). The qPCR reactions were run with three technical replicates in a Bio-Rad C1000 CFX96 Real-Time PCR system, using the Bio-Rad iQ SYBR Green Supermix kit (Bio-Rad). Each qPCR reaction was performed in 20 μl reaction mix containing 10 μl SYBR Green Supermix, 2 μl of the sampled ENR4 cell suspension, 0.1 µl of each primer (50 μM), and 7.9 μl of autoclaved double-distilled ultrapure water. Cells were lysed and DNA was released for 10 min at 95 °C, followed by 43 PCR cycles of 40 s at 94 °C, 40 s at 52 °C, and 45 s at 72 °C. Plasmids carrying the Ca. N. inopinata amoA gene were obtained by PCR-amplifying the gene from the ENR4 culture and cloning the product into the pCR4-TOPO TA vector (Invitrogen). The M13-PCR product from these plasmids containing the amoA gene was used as standard for qPCR (the amoA copy number in the standard was calculated from DNA concentration). Tenfold serial dilutions of the standard were subjected to qPCR in triplicates to generate an external standard curve. The amplification efficiency was 92.6%, and the correlation coefficient (r2) of the standard curve was 0.999.
Isolation of the betaproteobacterium from ENR4
A 1 ml aliquot of the ENR4 culture was transferred to 25 ml modified AOM medium (see above) containing 6 mM sodium acetate. After three weeks of incubation at 46 °C, a 1 ml aliquot of the betaproteobacterial primary enrichment was transferred into 25 ml of fresh modified AOM medium containing 6 mM sodium acetate. After three more weeks, a 5 ml aliquot of this culture was centrifuged (9,300×g, 10 min, 10 °C) and the cells were resuspended in 25 ml NOB medium (see above) containing 1 ml of SWS and 4 mM sodium acetate. Thereafter, 1 ml of the betaproteobacterial enrichment was transferred into fresh NOB medium containing 4 mM sodium acetate every 2 weeks. The fourth transfer was checked for purity by FISH with the betaproteobacterium-specific probe Nmir1009, which showed 100% overlap with the EUB338 probe mix and DAPI signals. No Nitrospira cells were detected by FISH in the culture.
Physiological experiments with the betaproteobacterium
To test whether the betaproteobacterium had the capability to nitrify, 20 ml of a dense pure culture of this organism was centrifuged (9,300×g, 10 min, 10 °C), washed once in modified AOM medium without solid CaCO3, and resuspended in modified AOM medium without ammonium and solid CaCO3. Aliquots of this suspension were distributed into 100 ml Schott bottles, which had been rinsed twice in 6 M HCl, washed 3 times in Milli-Q water, closed with aluminum caps, autoclaved, and dried at 60 °C prior to use. Subsequently, the following substrates were added: 1 mM NH4Cl; or 0.5 mM NaNO2 and 0.1 mM NH4Cl; or 4 mM sodium acetate and 0.1 mM NH4Cl (the 0.1 mM NH4Cl was added to the nitrite and acetate incubations to provide the organism with a nitrogen source for assimilation). The biomass was incubated at 46 °C in the dark without agitation. All experiments were performed in parallel with biological triplicates. Samples (700 µl) for qPCR and chemical analyses (see below) were taken immediately after experimental set-up and after 19, 24, 30, 42, and 48 h of incubation. The samples were stored at -20 °C until analysis. Cell densities of the betaproteobacterium were quantified by qPCR targeting the soxB gene, which encodes the SoxB component of the periplasmic thiosulfate-oxidizing Sox enzyme complex. SoxB is a single-copy gene in the genome of the betaproteobacterium. The primers used to quantify the soxB gene were soxB_F1 (5’-GGACCAGACCGCCATCACTTACCC-3’) and soxB_R1 (5’-GCACCATGTCCCCGCCTTGCT-3’). The qPCR protocol and conditions were the same as described above.
Chemical analyses
Ammonium levels were measured photometrically as described previously53,54 with adjusted volumes of sample and reagents. Standards were prepared in AOM or NOB medium and ranged from 7.25 to 1,000 µM NH4Cl. Nitrite concentrations were determined photometrically by the acidic Griess reaction55. Nitrate was reduced to nitrite by vanadium chloride and measured as NOx by the Griess assay. Nitrate concentrations were calculated from the NOx measurements as described elsewhere56. Standards were prepared in AOA or NOB medium and ranged from 7.8 to 1,000 µM for NOx and from 3.9 to 500 µM for nitrite.
Replication of physiological experiments
The number of replications are detailed in the subsections for each specific experiment, and were mostly determined by the amount of biomass available for the different cultures. In all experiments, a minimum of three biological replications were used. No statistical methods were used to predetermine sample size.
FISH and microscopy
FISH with rRNA-targeted oligonucleotide probes was performed as described elsewhere57 using the EUB338 probe mix58,59 for the detection of Bacteria, probes Ntspa662 and Ntspa712 specific for Nitrospira10, and probes Nso1225, Nso190, and NEU specific for betaproteobacterial AOB22. The betaproteobacterium in ENR4 and ENR6 was detected by FISH with the specific probe Nmir1009 (5’-CACTCCCCCGTCTCCGGG-3’) with 35% of formamide in the hybridization buffer. If required, unlabelled competitor oligonucleotides were added in equimolar amounts as probes. Cells were counterstained by incubation for 5 min in a 0.1 µg/ml DAPI (4',6-diamidino-2-phenylindole) solution. Fluorescence micrographs were recorded by using a Leica SP7 confocal laser scanning microscope equipped with a white light laser. To determine the relative abundances of Nitrospira and AOB in WWTP VetMed by quantitative FISH, 20 confocal images of FISH probe signals were taken at random positions in the sample and analyzed as described elsewhere60 by using the digital image analysis software DAIME61. For whole-cell electron microscopy, cells were positively stained with 1% (w/v) uranyl acetate. Electron microscopy of thin sections was carried out as described elsewhere62.
PCR assays for marker genes of AOB and AOA
To check whether the Ca. N. inopinata enrichments contained known AOB or AOA, PCR tests were performed using primer sets amoA-1F/amoA-2R targeting betaproteobacterial amoA63, CamoA-19f/CamoA-616r targeting thaumarchaeal amoA33,64, and 771F/957R for thaumarchaeal 16S rRNA genes65 and the respective published reaction conditions. DNA was extracted for these PCR assays by using the PowerSoil DNA Isolation Kit (MoBio) according to the manufacturer's instructions.
Metaproteomic analysis
Protein extraction from concentrated ENR4 biomass, proteolytic digestion, analysis of peptide lysates by mass spectrometry (MS), processing of MS raw files, and analysis of MS spectra were carried out as described elsewhere20. MS spectra were searched against a database of predicted gene products on the ENR4 metagenome scaffolds containing 12,234 sequence entries and a common Repository of Adventitious Proteins (cRAP) database using the Sequest HT algorithm. The PROPHANE pipeline (http://www.prophane.de/index.php) was used to classify the lowest common phylogenetic ancestor of each protein group and to calculate the normalized spectral abundance factor (NSAF).
DNA extraction for metagenomics
Biomass of enrichment ENR4 was collected from three culture bottles (samples ENR4_A, ENR4_E, ENR4_F) by centrifugation and frozen over night at -80 °C before total nucleic acids were extracted by bead beating in the presence of phosphate buffer, 10% (w/v) SDS and phenol as described elsewhere66 (see http://dx.doi.org/10.1038/protex.2012.046 for full protocol). Bead beating was repeated twice to break remaining intact cells, the supernatants from each step were pooled, and nucleic acids purified by phenol/chloroform/isoamyl alcohol and chloroform/isoamyl alcohol extraction. Nucleic acids were precipitated using 20% (w/v) polyethylene glycol, washed in ice-cold 75% (v/v) ethanol, and resuspended in sterile 10 mM TRIS buffer. RNA was digested with RNase I (Promega) and the purity of DNA assessed by spectrophotometry. The same protocol was used to extract DNA from concentrated biomass of enrichment ENR6 (sample ENR6_N3), with the modification that bead beating was not repeated, and from an activated sludge sample of WWTP VetMed collecting only the supernatants of the second and third bead beating steps (DNA extract Vetmed_23). DNA was extracted from a second aliquot of the WWTP VetMed sample (DNA extract Vetmed_Pskit), and from pasty (sample GWW_HP_F1) or suspended (sample GWW_HP_D) iron sludge from the GWW, by using the PowerSoil DNA Isolation Kit (MoBio). DNA was extracted from all MBR samples by using the FastDNA SPIN Kit for Soil (MP Biomedicals) following the manufacturer's instructions.
Metagenome sequencing
Sequencing libraries were prepared using the Nextera or TruSeq PCR free kits (Illumina Inc.) following the manufacturer's recommendations. For the TruSeq PCR free kits, the 550 bp protocol was used with 1 µg of input DNA. The prepared libraries were sequenced using either an Illumina MiSeq with MiSeq Reagent Kit v3 (2x301 bp; Illumina Inc.) or an Illumina HiSeq2000 using the TruSeq PE Cluster Kit v3-cBot-HS and TruSeq SBS kit v.3-HS sequencing kit (Illumina Inc.). Nanopore sequencing was performed in addition to facilitate completion of the Ca. N. inopinata genome sequence. Library preparation was done using the Nanopore Sequencing kit (SQK-MAP005, Oxford Nanopore) following the manufacturer's recommendations (v. MN005_1124_revC_02Mar2015) with shearing in an Eppendorf MiniSpin plus centrifuge at 8,000 rpm and including the optional PreCR treatment step, as well as Ampure XP Bead purification after dA-tailing. The libraries were sequenced using nanopore flow cells (FLO-MAP003, Oxford Nanopore) using the MinION device (Oxford Nanopore) with the MinKNOW software (v. 0.50.1.15). Flow cells were primed twice with a mixture of 3 µl Fuel Mix, 75 µl 2× Running Buffer, and 72 µl nuclease-free water for 10 min. Libraries were prepared for loading onto the flow cell by mixing 75 µl 2× Running Buffer, 66 µl nuclease-free water, 3 µl Fuel Mix, and 6 µl Library (Pre-sequencing Mix). A sequencing run was started (MAP_48Hr_Sequencing_Run.py) after loading the library. Additional DNA library top-ups and restart of the run script was carried out to maximize yield by allowing a new selection of active pores. Base calling was carried out using Metrichor and the 2D Basecalling workflow (Rev 1.16). Details for each metagenome can be found in Supplementary Table 1.
Metagenome scaffold assembly and binning
Paired-end Illumina reads were imported into CLC Genomics Workbench v. 8.0 (CLCBio, Qiagen) and trimmed using a minimum phred score of 20 and a minimum length of 50 bp, with allowing no ambiguous nucleotides and trimming off Illumina sequencing adaptors if found. FASTQ files for the Oxford Nanopore 2D reads were obtained using the R package poRe v. 0.667 and error corrected using Illumina reads through Proovread v. 2.1368. For each environment, all trimmed Illumina reads were co-assembled using CLCs de novo assembly algorithm, using a kmer of 63 and a minimum scaffold length of 1 kbp. Trimmed reads were mapped to the assembled scaffolds using CLCs map reads to reference algorithm, with a minimum similarity of 95% over 70% of the read length. Open reading frames (ORFs) were predicted in the assembled scaffolds using Prodigal69. A set of 107 hidden Markov models (HMMs) of essential single-copy genes (Dupont et al., 2012) were searched against the ORFs using HMMER3 (http://hmmer.janelia.org/) with default settings, except option (-cut_tc) was used. Identified proteins were taxonomically classified using BLASTP against the RefSeq (v. 52) protein database with a maximum e-value cutoff of 10-5. MEGAN70 was used to extract class-level taxonomic assignments from the BLAST output. The script network.pl (http://madsalbertsen.github.io/mmgenome/) was used to extract paired-end read connections between scaffolds. PhyloPythiaS+71 was used to taxonomically classify all scaffolds of selected samples. In addition, selected metagenome assemblies were binned based on ESOM maps72. After training the ESOM using scaffolds > 5 kbp and large scaffolds chopped into 5 kbp pieces, all scaffolds were projected back to the ESOM map to retrieve a single coordinate for all scaffolds. Individual genome bins were extracted using the multi-metagenome principles23 implemented in the mmgenome R package (http://madsalbertsen.github.io/mmgenome/). All genome bins are fully reproducible from the raw metagenome assemblies using Rmarkdown files available on http://madsalbertsen.github.io/mmgenome/. The script extract.fastq.reassembly.pl (http://madsalbertsen.github.io/mmgenome/) was used to extract paired-end reads from the binned scaffolds, which were used for re-assembly using SPAdes73. For selected samples, error-corrected Oxford Nanopore 2D reads were used for scaffolding using SSPACE-LongRead74. For all genomes, quality was assessed using coverage plots through the mmgenome R package and through the use of QUAST75 and CheckM76. Details for each metagenome assembly can be found in Supplementary Table 2, and further details for the reconstructed bacterial genomes (including CheckM results) in Supplementary Tables 3-7. Relative genome sequence coverage was calculated as the fraction of sequence coverage of a reconstructed genome compared to the summed coverage of all genomes in these low-complexity metagenomes. The reconstructed bacterial genomes were uploaded to the MicroScope platform77 for automatic annotation and for manual annotation refinement17 of key pathways of Ca. N. inopinata.
To test for the presence of additional organisms capable of nitrification, the raw reads for each enrichment ENR4 and ENR6 were mapped to the amoA, amoB, amoC, hao and nxrB sequences used to generate the trees in Extended Data Figures 5b,d, 8, and 9. Reads were required to align to any one member of a target dataset over at least 70% of read length with BLASTN (word size=7). Reads that mapped with >97% nucleotide identity were automatically classified. Reads with lower identity were placed with the Evolutionary Placement Algorithm (EPA) using RAxML78. Using this procedure, no indication was found for the presence of any nitrifier other than Ca. N. inopinata in these enrichments.
Sequence collection and phylogenetics
For phylogenetic analyses of AMO and HAO, full amino acid datasets were downloaded from the Pfam79 site for bacterial (pfam02461) and archaeal (pfam12942) amoA. Additional amino acid sequences were identified from the NCBI Genbank80 and the Integrated Microbial Genomes databases (IMG-ER and -MER)81 that were returned using the search words “ammonia, methane, amo, pmo or monooxygenase” (Genbank) or had been annotated with one of the target pfams (IMG). A BLASTP82 search was performed using the Ca. Nitrospira inopinata amoA sequence as a query, word size=2, BLOSUM 45, E=10 and the top 1,000 returned sequences were downloaded. Comparable procedures were performed to generate a comprehensive set of amoB (pfam04744) and amoC (pfam04896) sequences. For construction of the hao (pfam13447) dataset, query words were changed to “Hydroxylamine” and “Hao”. For each gene set, amino acid sequences were filtered using hmmsearch (http://hmmer.janelia.org/) with the respective pfam HMMs, requiring an Expect value < 0.001. Amino acid sequences were clustered at 75% identity using USEARCH83 and aligned using Mafft84. Phylogenetic trees were calculated using PhyloBayes85, running 5 independent chains for 21,000 cycles each, using 11,000 cycles for burn-in and sampling every 20 cycles. Sequences that mapped to centroids that clustered within the comammox clade were used for additional phylogenetic calculations along with an outgroup of 27 betaproteobacterial amoA and 29 diverse pmoA sequences. Corresponding nucleotide sequences for this set were aligned according to their amino acid translations using MUSCLE86 and manually corrected for frameshifts. Nucleotide alignments were then used for constructing consensus trees in Phylobayes, running 5 independent chains for 21,000 cycles each, using 11,000 cycles for burn-in and sampling every 20 cycles.
To estimate relative abundances of amoA genes, comammox-type amoA sequences were identified from three publicly available Rifle soil metagenomic datasets (3300002121, 3300002122 and 3300002124) available within IMG. Functional profiles were generated within IMG using pfam12942 (archaeal amoA) and pfam02461 (bacterial amoA/pmoA) against the assembly and unassembled reads. All identified amoA/pmoA nucleotide sequences were downloaded as nucleic acid sequences and added to the existing amoA alignment used to generate Extended Data Figure 8 with the -add option in Mafft. EPA in RAxML was used to assign downloaded sequences into the reference tree that is the basis for Extended Data Figure 8. AmoA abundance for each amoA type (comammox, archaeal, betaproteobacterial AOB) was estimated by taking the sum of the estimated copy numbers of each assembled amoA gene of a given type as well as the number of unassembled reads assigned to a given amoA type.
Comammox, betaproteobacterial, and archaeal amoA sequences from the metagenomes of WWTP VetMed and the GWW were identified using the same procedure as above. Comammox amoA read abundances were then used to calculate an estimate of the fraction of Nitrospira that are comammox. AmoA was assumed to be a single copy gene in all comammox (as it is in Ca. N. inopinata). Total Nitrospira were enumerated by mapping raw reads from metagenomic samples using the first 700 nucleotides of the predicted ATP-citrate lyase subunit beta (aclB) gene from Ca. N. inopinata. Reads were required to align to Ca. N. inopinata aclB over at least 70% of read length and with >60% alignment identity with BLASTN (word size=7). AclB was chosen on the basis that this gene has a restricted taxonomic distribution, encodes a key enzyme of the reductive tricarboxylic acid cycle employed by all known Nitrospira for CO2 fixation, and is present in single copy within known Nitrospira genomes. To test its utility, all 150 nt segments (pos 1:150, 2:151…1051:1200) of the Ca. N. inopinata aclB gene was used as a query against the nr database (BLAST, word size=7, 70% read length and 60% alignment identity). Over the first 700 nucleotides of the aclB gene, test fragments mapped only to reference Nitrospira organisms. Downstream of this region, the aclB mapping was less specific, mapping to Nitrospira and Chlorobi with high (>90%) identity. Coverage of each gene was calculated by dividing the number of mapped reads by gene length of the query (843 nt for comammox amoA and 700 nt for Nitrospira aclB). Adjusted coverage was calculated by dividing gene coverage by total number of reads in the metagenomic dataset. Ratios discussed in the main text are then the adjusted coverage of comammox (as calculated from comammox amoA) divided by the adjusted coverage for all Nitrospira (as calculated from aclB).
For phylogenetic analyses of NXR, the NxrA and nxrB sequences of Ca. N. inopinata were imported into existing NxrA17 and nxrB8 sequence databases using the software ARB87. NxrA sequences were aligned using Mafft, nxrB sequences were manually aligned according to their amino acid translations. Maximum likelihood trees were calculated using RAxML with the GAMMA model of rate heterogeneity using empirical base frequencies and the LG substitution model (NxrA) or with the GAMMA model of rate heterogeneity and the GTR substitution model (nxrB). Bayesian inference trees were calculated using PhyloBayes, running 3 independent chains for 32,200 cycles each, using 6,440 cycles for burn-in (NxrA) or 3 independent chains for 35,500 cycles each, using 7,000 cycles for burn-in (nxrB). Nitrospira 16S rRNA genes from this study were added to an existing Nitrospira 16S rRNA sequence database and aligned in ARB. Phylogenetic trees were calculated using RAxML with the GAMMA model of rate heterogeneity and the GTR substitution model, and using MrBayes88 v.3.2.1, running 4 independent chains for 5 million generations each, with 1.25 million cycles for burn-in and sampling every 100 generations.
Average nucleotide identity and tetranucleotide signature analyses
Pairwise average nucleotide identity (ANI) values were calculated for comammox Nitrospira genomes using BLAST (ANIb) in JSpecies89. Genome-wide tetranucleotide signatures were calculated for the forward and reverse strand for each genome with the oligonucleotideFrequency(width=4) command using the Biostrings package in R90. Tetranucleotide patterns were also calculated across the length of the genome with a sliding window of 5kb (step=1 kb). The tetranucleotide pattern for each window was compared to the global tetranucleotide signature by calculating the Pearson correlation (r) of log(1+counts) of each window against the log(1+counts) of the global signature. P-values, indicating a significantly low correlation for tetranucleotide signature of a window, were calculated by modeling 1-r across all windows as a log-normal distribution. Multiple testing was accounted for by using the Benjamini-Hochberg procedure with a false discovery rate of 5%.
Extended Data
Supplementary Material
Supplementary information is linked to the online version of the paper at www.nature.com/nature.
Acknowledgments
We thank Tae Kwon Lee and Michaela Steinberger for help with PCR analyses, Nadezda V. Grigor’eva and Mario Pogoda for assistance with culture maintenance, Nadezhda A. Kostrikina for assistance with electron microscopy, Katharina Kitzinger for support with FISH analyses, Maria Mooshammer for help with chemical analyses, Roland Hatzenpichler for designing probe Nmir1009, Kathleen Eismann for help with proteome sample preparation, Benjamin Scheer for help with mass spectrometer maintenance, Purena GmbH (Wolfenbüttel, Germany) for cooperation, Nikolay Chernyh and Joachim Rosenthal for taking samples, and Hanna Koch and Eberhard Bock for helpful discussion. The authors are grateful for using the analytical facilities of the Centre for Chemical Microscopy (ProVIS) (Helmholtz Centre for Environmental Research), which is headed by Hans Richnow (Department of Isotope Biochemistry) and supported by European Regional Development Funds (EFRE–Europe funds Saxony) and the Helmholtz Association. P.P. and H.D. were supported by the Austrian Science Fund (FWF) projects P27319-B21 and P25231-B21 (to H.D.). M.P., J.V., P.H., and M.W. were supported by the European Research Council Advanced Grant project NITRICARE 294343 (to M.W). M.A., R.H.K., and P.H.N. were supported by the Danish Council for Independent Research, DFF – 4005-00369 and Innovation Fund Denmark (EcoDesign).
Footnotes
Author Contributions
H.D. did (meta)genomic analysis of Ca. N. inopinata and comammox Nitrospira, contributed to phylogenetic and proteomics data analyses, designed the study and wrote the paper; E.V.L. enriched Ca. N. inopinata; E.V.L., P.P., P.H., A.B. and M.P. performed physiological experiments, analyzed data, and characterized enrichments; M.A., R.H.K. and P.H.N. carried out metagenome sequencing, assembly and binning; C.H. performed bioinformatic and phylogenetic analyses; N.J. and M.vB. performed proteomics measurements and data analysis; T.R. performed bioinformatic analyses; P.H., M.P. and J.V. maintained enrichment cultures and performed experiments; J.V. carried out database analyses; B.B. organized sampling and characterized environmental samples; M.W. designed the study, analyzed data, and wrote the paper. All authors discussed the results and commented the manuscript.
Author Information
All raw sequence data is available in the European Nucleotide Archive (ENA) under the project PRJEB10139. The genome sequence of Ca. N. inopinata has been deposited at ENA under the project PRJEB10818, sequence accession LN885086. The draft genome of the betaproteobacterium from ENR4 and ENR6 is available in the JGI Integrated Microbial Genomes Database (https://img.jgi.doe.gov/cgi-bin/m/main.cgi) under the IMG Genome ID 2636415980. Reprints and permissions information is available at www.nature.com/reprints. The authors declare no competing financial interests. Correspondence and requests for materials should be addressed to M.W. (wagner@microbial-ecology.net).
References
- 1.Bock E, Wagner M. In: The Prokaryotes: A Handbook on the Biology of Bacteria. Dworkin M, et al., editors. Springer Science+Business Media; 2001. pp. 457–495. [Google Scholar]
- 2.Könneke M, et al. Isolation of an autotrophic ammonia-oxidizing marine archaeon. Nature. 2005;437:543–546. doi: 10.1038/nature03911. [DOI] [PubMed] [Google Scholar]
- 3.Winogradsky S. Contributions a la morphologie des organismes de la nitrification. Arch Sci Biol (St Petersb) 1892;1:88–137. [Google Scholar]
- 4.Arp D, Bottomley PJ. Nitrifiers: More than 100 years from isolation to genome sequences. Microbe. 2006;1:229–234. [Google Scholar]
- 5.Pfeiffer T, Bonhoeffer S. Evolution of cross-feeding in microbial populations. Am Nat. 2004;163:E126–E135. doi: 10.1086/383593. [DOI] [PubMed] [Google Scholar]
- 6.Heinrich R, Schuster S. The regulation of cellular systems. Chapman & Hall; 1996. [Google Scholar]
- 7.Costa E, Perez J, Kreft JU. Why is metabolic labour divided in nitrification? Trends Microbiol. 2006;14:213–219. doi: 10.1016/j.tim.2006.03.006. [DOI] [PubMed] [Google Scholar]
- 8.Pester M, et al. NxrB encoding the beta subunit of nitrite oxidoreductase as functional and phylogenetic marker for nitrite-oxidizing Nitrospira. Environ Microbiol. 2014;16:3055–3071. doi: 10.1111/1462-2920.12300. [DOI] [PubMed] [Google Scholar]
- 9.Hovanec TA, Taylor LT, Blakis A, DeLong EF. Nitrospira-like bacteria associated with nitrite oxidation in freshwater aquaria. Appl Environ Microbiol. 1998;64:258–264. doi: 10.1128/aem.64.1.258-264.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Daims H, Nielsen JL, Nielsen PH, Schleifer KH, Wagner M. In situ characterization of Nitrospira-like nitrite-oxidizing bacteria active in wastewater treatment plants. Appl Environ Microbiol. 2001;67:5273–5284. doi: 10.1128/AEM.67.11.5273-5284.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Watson SW, Bock E, Valois FW, Waterbury JB, Schlosser U. Nitrospira marina gen. nov. sp. nov.: a chemolithotrophic nitrite-oxidizing bacterium. Arch Microbiol. 1986;144:1–7. [Google Scholar]
- 12.Taylor MW, Radax R, Steger D, Wagner M. Sponge-associated microorganisms: evolution, ecology, and biotechnological potential. Microbiol Mol Biol Rev. 2007;71:295–347. doi: 10.1128/MMBR.00040-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lebedeva EV, et al. Isolation and characterization of a moderately thermophilic nitrite-oxidizing bacterium from a geothermal spring. FEMS Microbiol Ecol. 2011;75:195–204. doi: 10.1111/j.1574-6941.2010.01006.x. [DOI] [PubMed] [Google Scholar]
- 14.Martiny AC, Albrechtsen HJ, Arvin E, Molin S. Identification of bacteria in biofilm and bulk water samples from a nonchlorinated model drinking water distribution system: detection of a large nitrite-oxidizing population associated with Nitrospira spp. Appl Environ Microbiol. 2005;71:8611–8617. doi: 10.1128/AEM.71.12.8611-8617.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ehrich S, Behrens D, Lebedeva E, Ludwig W, Bock E. A new obligately chemolithoautotrophic, nitrite-oxidizing bacterium, Nitrospira moscoviensis sp. nov. and its phylogenetic relationship. Arch Microbiol. 1995;164:16–23. doi: 10.1007/BF02568729. [DOI] [PubMed] [Google Scholar]
- 16.Schramm A, de Beer D, Wagner M, Amann R. Identification and activities in situ of Nitrosospira and Nitrospira spp. as dominant populations in a nitrifying fluidized bed reactor. Appl Environ Microbiol. 1998;64:3480–3485. doi: 10.1128/aem.64.9.3480-3485.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lücker S, et al. A Nitrospira metagenome illuminates the physiology and evolution of globally important nitrite-oxidizing bacteria. Proc Natl Acad Sci U S A. 2010;107:13479–13484. doi: 10.1073/pnas.1003860107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gruber-Dorninger C, et al. Functionally relevant diversity of closely related Nitrospira in activated sludge. ISME J. 2015;9:643–655. doi: 10.1038/ismej.2014.156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Koch H, et al. Expanded metabolic versatility of ubiquitous nitrite-oxidizing bacteria from the genus Nitrospira. Proc Natl Acad Sci U S A. 2015;112:11371–11376. doi: 10.1073/pnas.1506533112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Palatinszky M, et al. Cyanate as an energy source for nitrifiers. Nature. 2015;524:105–108. doi: 10.1038/nature14856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Koch H, et al. Growth of nitrite-oxidizing bacteria by aerobic hydrogen oxidation. Science. 2014;345:1052–1054. doi: 10.1126/science.1256985. [DOI] [PubMed] [Google Scholar]
- 22.Mobarry BK, Wagner M, Urbain V, Rittmann BE, Stahl DA. Phylogenetic probes for analyzing abundance and spatial organization of nitrifying bacteria. Appl Environ Microbiol. 1996;62:2156–2162. doi: 10.1128/aem.62.6.2156-2162.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Albertsen M, et al. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nat Biotechnol. 2013;31:533–538. doi: 10.1038/nbt.2579. [DOI] [PubMed] [Google Scholar]
- 24.Arp DJ, Chain PSG, Klotz MG. The impact of genome analyses on our understanding of ammonia-oxidizing bacteria. Annu Rev Microbiol. 2007;61:503–528. doi: 10.1146/annurev.micro.61.080706.093449. [DOI] [PubMed] [Google Scholar]
- 25.Berube PM, Stahl DA. The divergent AmoC3 subunit of ammonia monooxygenase functions as part of a stress response system in Nitrosomonas europaea . J Bacteriol. 2012;194:3448–3456. doi: 10.1128/JB.00133-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Klotz MG, et al. Evolution of an octahaem cytochrome c protein family that is key to aerobic and anaerobic ammonia oxidation by bacteria. Environ Microbiol. 2008;10:3150–3163. doi: 10.1111/j.1462-2920.2008.01733.x. [DOI] [PubMed] [Google Scholar]
- 27.Bergmann DJ, Hooper AB, Klotz MG. Structure and sequence conservation of hao cluster genes of autotrophic ammonia-oxidizing bacteria: Evidence for their evolutionary history. Appl Environ Microbiol. 2005;71:5371–5382. doi: 10.1128/AEM.71.9.5371-5382.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Klotz MG, Stein LY. Nitrifier genomics and evolution of the nitrogen cycle. FEMS Microbiol Lett. 2008;278:146–156. doi: 10.1111/j.1574-6968.2007.00970.x. [DOI] [PubMed] [Google Scholar]
- 29.Rittmann BE, Regan JM, Stahl DA. Nitrification as a source of soluble organic substrate in biological treatment. Water Sci Technol. 1994;30:1–8. [Google Scholar]
- 30.Nowka B, Off S, Daims H, Spieck E. Improved isolation strategies allowed the phenotypic differentiation of two Nitrospira strains from widespread phylogenetic lineages. FEMS Microbiol Ecol. 2015;91 doi: 10.1093/femsec/fiu031. [DOI] [PubMed] [Google Scholar]
- 31.Ushiki N, Fujitani H, Aoi Y, Tsuneda S. Isolation of Nitrospira belonging to sublineage II from a wastewater treatment plant. Microbes Environ. 2013;28:346–353. doi: 10.1264/jsme2.ME13042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.van Kessel MAHJ, et al. Complete nitrification by a single microorganism. Nature. doi: 10.1038/nature16459. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pester M, et al. amoA-based consensus phylogeny of ammonia-oxidizing archaea and deep sequencing of amoA genes from soils of four different geographic regions. Environ Microbiol. 2012;14:525–539. doi: 10.1111/j.1462-2920.2011.02666.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Purkhold U, et al. Phylogeny of all recognized species of ammonia oxidizers based on comparative 16S rRNA and amoA sequence analysis: implications for molecular diversity surveys. Appl Environ Microbiol. 2000;66:5368–5382. doi: 10.1128/aem.66.12.5368-5382.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wrighton KC, et al. Fermentation, hydrogen, and sulfur metabolism in multiple uncultivated bacterial phyla. Science. 2012;337:1661–1665. doi: 10.1126/science.1224041. [DOI] [PubMed] [Google Scholar]
- 36.Stein LY, et al. Whole-genome analysis of the ammonia-oxidizing bacterium, Nitrosomonas eutropha C91: implications for niche adaptation. Environ Microbiol. 2007;9:2993–3007. doi: 10.1111/j.1462-2920.2007.01409.x. [DOI] [PubMed] [Google Scholar]
- 37.Chain P, et al. Complete genome sequence of the ammonia-oxidizing bacterium and obligate chemolithoautotroph Nitrosomonas europaea. J Bacteriol. 2003;185:2759–2773. doi: 10.1128/JB.185.9.2759-2773.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Suwa Y, et al. Genome sequence of Nitrosomonas sp Strain AL212, an ammonia-oxidizing bacterium sensitive to high levels of ammonia. J Bacteriol. 2011;193:5047–5048. doi: 10.1128/JB.05521-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Norton JM, et al. Complete genome sequence of Nitrosospira multiformis, an ammonia-oxidizing bacterium from the soil environment. Appl Environ Microbiol. 2008;74:3559–3572. doi: 10.1128/AEM.02722-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Klotz MG, et al. Complete genome sequence of the marine, chemolithoautotrophic, ammonia-oxidizing bacterium Nitrosococcus oceani ATCC 19707. Appl Environ Microbiol. 2006;72:6299–6315. doi: 10.1128/AEM.00463-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Radajewski S, et al. Identification of active methylotroph populations in an acidic forest soil by stable-isotope probing. Microbiology. 2002;148:2331–2342. doi: 10.1099/00221287-148-8-2331. [DOI] [PubMed] [Google Scholar]
- 42.Stoecker K, et al. Cohn's Crenothrix is a filamentous methane oxidizer with an unusual methane monooxygenase. Proc Natl Acad Sci U S A. 2006;103:2363–2367. doi: 10.1073/pnas.0506361103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Schramm A, de Beer D, van den Heuvel JC, Ottengraf S, Amann R. Microscale distribution of populations and activities of Nitrosospira and Nitrospira spp. along a macroscale gradient in a nitrifying bioreactor: quantification by in situ hybridization and the use of microsensors. Appl Environ Microbiol. 1999;65:3690–3696. doi: 10.1128/aem.65.8.3690-3696.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Altmann D, Stief P, Amann R, De Beer D, Schramm A. In situ distribution and activity of nitrifying bacteria in freshwater sediment. Environ Microbiol. 2003;5:798–803. doi: 10.1046/j.1469-2920.2003.00469.x. [DOI] [PubMed] [Google Scholar]
- 45.Foesel BU, et al. Nitrosomonas Nm143-like ammonia oxidizers and Nitrospira marina-like nitrite oxidizers dominate the nitrifier community in a marine aquaculture biofilm. FEMS Microbiol Ecol. 2008;63:192–204. doi: 10.1111/j.1574-6941.2007.00418.x. [DOI] [PubMed] [Google Scholar]
- 46.Winkler MKH, Bassin JP, Kleerebezem R, Sorokin DY, van Loosdrecht MCM. Unravelling the reasons for disproportion in the ratio of AOB and NOB in aerobic granular sludge. Appl Microbiol Biotechnol. 2012;94:1657–1666. doi: 10.1007/s00253-012-4126-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Maixner F, et al. Nitrite concentration influences the population structure of Nitrospira-like bacteria. Environ Microbiol. 2006;8:1487–1495. doi: 10.1111/j.1462-2920.2006.01033.x. [DOI] [PubMed] [Google Scholar]
- 48.Nowka B, Daims H, Spieck E. Comparison of oxidation kinetics of nitrite-oxidizing bacteria: nitrite availability as a key factor in niche differentiation. Appl Environ Microbiol. 2015;81:745–753. doi: 10.1128/AEM.02734-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Martens-Habbena W, Berube PM, Urakawa H, de la Torre JR, Stahl DA. Ammonia oxidation kinetics determine niche separation of nitrifying Archaea and Bacteria . Nature. 2009;461:976–979. doi: 10.1038/nature08465. [DOI] [PubMed] [Google Scholar]
- 50.Prosser JI, Nicol GW. Archaeal and bacterial ammonia-oxidisers in soil: the quest for niche specialisation and differentiation. Trends Microbiol. 2012;20:523–531. doi: 10.1016/j.tim.2012.08.001. [DOI] [PubMed] [Google Scholar]
- 51.Lebedeva EV, et al. Enrichment and genome sequence of the group I. 1a ammonia-oxidizing archaeon “Ca. Nitrosotenuis uzonensis” representing a clade globally distributed in thermal habitats. PLoS ONE. 2013;8:e80835. doi: 10.1371/journal.pone.0080835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Widdel F. Anaerober Abbau von Fettsäuren und Benzoesäure durch neu isolierte Arten Sulfat-reduzierender Bakterien. Universität Göttingen; 1980. [Google Scholar]
- 53.Kandeler E, Gerber H. Short-term assay of soil urease activity using colorimetric determination of ammonium. Biol Fertil Soils. 1988;6:68–72. [Google Scholar]
- 54.Hood-Nowotny R, Hinko-Najera Umana N, Inselbacher E, Oswald-Lachouani P, Wanek W. Alternative methods for measuring inorganic, organic, and total dissolved nitrogen in soil. Soil Sci Soc Am J. 2010;74:1018–1027. [Google Scholar]
- 55.Griess-Romijn van Eck E. NEN 1056 IV-2. Nederlands Normalisatie Instituut; Rijswijk, The Netherlands: 1966. Physiological and chemical tests for drinking water. [Google Scholar]
- 56.Miranda KM, Espey MG, Wink DA. A rapid, simple spectrophotometric method for simultaneous detection of nitrate and nitrite. Nitric Oxide-Biol Chem. 2001;5:62–71. doi: 10.1006/niox.2000.0319. [DOI] [PubMed] [Google Scholar]
- 57.Daims H, Stoecker K, Wagner M. In: Molecular Microbial Ecology. Osborn AM, Smith CJ, editors. Bios-Garland: 2005. pp. 213–239. [Google Scholar]
- 58.Daims H, Brühl A, Amann R, Schleifer K-H, Wagner M. The domain-specific probe EUB338 is insufficient for the detection of all Bacteria: Development and evaluation of a more comprehensive probe set. System Appl Microbiol. 1999;22:434–444. doi: 10.1016/S0723-2020(99)80053-8. [DOI] [PubMed] [Google Scholar]
- 59.Amann RI, et al. Combination of 16S rRNA-targeted oligonucleotide probes with flow cytometry for analyzing mixed microbial populations. Appl Environ Microbiol. 1990;56:1919–1925. doi: 10.1128/aem.56.6.1919-1925.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Daims H. Use of fluorescence in situ hybridization and the daime image analysis program for the cultivation-independent quantification of microorganisms in environmental and medical samples. Cold Spring Harb Protoc. 2009 doi: 10.1101/pdb.prot5253. [DOI] [PubMed] [Google Scholar]
- 61.Daims H, Lücker S, Wagner M. daime, a novel image analysis program for microbial ecology and biofilm research. Environ Microbiol. 2006;8:200–213. doi: 10.1111/j.1462-2920.2005.00880.x. [DOI] [PubMed] [Google Scholar]
- 62.Sorokin D, et al. Nitrification expanded: Discovery, physiology, and genomics of a nitrite-oxidizing bacterium from the phylum Chloroflexi. ISME J. 2012;6:2245–2256. doi: 10.1038/ismej.2012.70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Rotthauwe J-H, Witzel K-P, Liesack W. The ammonia monooxygenase structural gene amoA as a functional marker: molecular fine-scale analysis of natural ammonia-oxidizing populations. Appl Environ Microbiol. 1997;63:4704–4712. doi: 10.1128/aem.63.12.4704-4712.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Tourna M, Freitag TE, Nicol GW, Prosser JI. Growth, activity and temperature responses of ammonia-oxidizing archaea and bacteria in soil microcosms. Environ Microbiol. 2008;10:1357–1364. doi: 10.1111/j.1462-2920.2007.01563.x. [DOI] [PubMed] [Google Scholar]
- 65.Ochsenreiter T, Selezi D, Quaiser A, Bonch-Osmolovskaya L, Schleper C. Diversity and abundance of Crenarchaeota in terrestrial habitats studied by 16S RNA surveys and real time PCR. Environ Microbiol. 2003;5:787–797. doi: 10.1046/j.1462-2920.2003.00476.x. [DOI] [PubMed] [Google Scholar]
- 66.Angel R, Claus P, Conrad R. Methanogenic archaea are globally ubiquitous in aerated soils and become active under wet anoxic conditions. ISME J. 2012;6:847–862. doi: 10.1038/ismej.2011.141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Watson M, et al. poRe: an R package for the visualization and analysis of nanopore sequencing data. Bioinformatics. 2015;31:114–115. doi: 10.1093/bioinformatics/btu590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Hackl T, Hedrich R, Schultz J, Förster F. proovread: large-scale high-accuracy PacBio correction through iterative short read consensus. Bioinformatics. 2014;30:3004–3011. doi: 10.1093/bioinformatics/btu392. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Hyatt D, et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Huson DH, Mitra S, Ruscheweyh HJ, Weber N, Schuster SC. Integrative analysis of environmental sequences using MEGAN4. Genome Res. 2011;21:1552–1560. doi: 10.1101/gr.120618.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Gregor I, Dröge J, Schirmer M, Quince C, McHardy AC. PhyloPythiaS+: A self-training method for the rapid reconstruction of low-ranking taxonomic bins from metagenomes. ArXiv e-prints. 2014;1406:7123. doi: 10.7717/peerj.1603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Dick GJ, et al. Community-wide analysis of microbial genome sequence signatures. Genome Biol. 2009;10:R85. doi: 10.1186/gb-2009-10-8-r85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Bankevich A, et al. SPAdes: A new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Boetzer M, Pirovano W. SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information. BMC Bioinformatics. 2014;15:211. doi: 10.1186/1471-2105-15-211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Vallenet D, et al. MicroScope-an integrated microbial resource for the curation and comparative analysis of genomic and metabolic data. Nucleic Acids Res. 2013;41:E636–E647. doi: 10.1093/nar/gks1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Finn RD, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42:D222–D230. doi: 10.1093/nar/gkt1223. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Benson DA, et al. GenBank. Nucleic Acids Res. 2015;43:D30–D35. doi: 10.1093/nar/gku1216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Markowitz VM, et al. IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. 2014;42:D560–D567. doi: 10.1093/nar/gkt963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 83.Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- 84.Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–3066. doi: 10.1093/nar/gkf436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Lartillot N, Lepage T, Blanquart S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics. 2009;25:2286–2288. doi: 10.1093/bioinformatics/btp368. [DOI] [PubMed] [Google Scholar]
- 86.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Ludwig W, et al. ARB: a software environment for sequence data. Nucleic Acids Res. 2004;32:1363–1371. doi: 10.1093/nar/gkh293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Ronquist F, et al. MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Richter M, Rossello-Mora R. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci USA. 2009;106:19126–19131. doi: 10.1073/pnas.0906412106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.R: A language and environment for statistical computing. R Foundation for Statistical Computing; Vienna, Austria: 2013. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.