Microorganisms of the superphylum Asgard Archaea are considered to be the closest living prokaryotic relatives of eukaryotes (including plants and animals) and thus promise to give insights into the early evolution of more complex life forms. However, very little is known about their biology as none of the organisms has yet been cultivated in the laboratory. Here we report on the ecological distribution of Asgard Archaea and on four newly sequenced genomes of the Lokiarchaeota and Thorarchaeota lineages that give insight into possible metabolic features that might eventually help to identify these enigmatic groups of archaea in the environment and to culture them.
KEYWORDS: Archaea, Lokiarchaeota, Thorarchaeota, ether lipids, eukaryotic evolution, reductive dehalogenase
ABSTRACT
The genomes of Asgard Archaea, a novel archaeal proposed superphylum, share an enriched repertoire of eukaryotic signature genes and thus promise to provide insights into early eukaryote evolution. However, the distribution, metabolisms, cellular structures, and ecology of the members within this superphylum are not well understood. Here we provide a meta-analysis of the environmental distribution of the Asgard archaea, based on available 16S rRNA gene sequences. Metagenome sequencing of samples from a salt-crusted lagoon on the Baja California Peninsula of Mexico allowed the assembly of a new Thorarchaeota and three Lokiarchaeota genomes. Comparative analyses of all known Lokiarchaeota and Thorarchaeota genomes revealed overlapping genome content, including central carbon metabolism. Members of both groups contained putative reductive dehalogenase genes, suggesting that these organisms might be able to metabolize halogenated organic compounds. Unlike the first report on Lokiarchaeota, we identified genes encoding glycerol-1-phosphate dehydrogenase in all Loki- and Thorarchaeota genomes, suggesting that these organisms are able to synthesize bona fide archaeal lipids with their characteristic glycerol stereochemistry.
INTRODUCTION
Recent discoveries of novel microbial lineages through metagenomic analyses have led to a fundamentally refined perception of the diversity of both Archaea and Bacteria (1). Among these, Lokiarchaeota of the domain Archaea and related lineages, now collectively embraced in the superphylum Asgard Archaea, have garnered particular attention due to their close affiliation to eukaryotes (2, 3). Together with the finding that multiple Asgard genomes encode eukaryotic signature proteins (ESPs) (2, 3), these data have led to the assumption that relatives of Asgard archaea participated in the formation of the eukaryotic lineage. However, there are currently no cultured members of this archaeal phylum, making predictions about their biology and the functions of these ESPs in Asgard archaeal cell biology challenging. Currently, five major groups or phyla within the Asgard archaea have been defined based on phylogenetic reconstructions from metagenome assembled genomes (MAGs) obtained from a range of different environments, including Lokiarchaeota (3), Thorarchaeota, Heimdallarchaeota, Odinarchaeota (2), and, more recently, Helarchaeota (4).
While the positioning of the Asgard archaea in the tree of life is intriguing and the exact position of the emergence of the eukaryal line of descent is continuously being refined and sometimes debated (4–6), it is equally crucial to obtain more information on the ecological distribution and metabolisms of Asgard archaea. The characterization of the phylogenetic diversity within the superphylum and the environmental distribution of these organisms will inform future practices in detection. Also, understanding the particular environments Asgard archaea inhabit will advance knowledge of their roles in biogeochemical cycling and aid efforts to bring these intriguing organisms to culture. Diversity studies are important for proper analyses of Asgard archaeal abundance and distribution (for instance, in amplicon sequencing studies), for the design of specific primers aiding detection and enumeration with molecular techniques, and congruency between 16S rRNA gene and ribosomal protein trees to accurately define newly discovered members of the superphylum.
The first two publicly available Lokiarchaeota genomes were derived from the metagenomes of marine sediments at the Arctic Mid-Ocean Ridge and a freshwater aquifer (2, 3, 7). An early analysis of the specific geochemical parameters of the marine sediments in which Lokiarchaeota, (then known as the “Deep Sea Archaeal Group” [DSAG]) were found suggested a correlation with organic carbon and oxides of iron or manganese as potential electron acceptors (8). While a first study of the composite Lokiarchaeota GC14 genome suggested that these organisms were hydrogenotrophic autotrophs, fixing CO2 via the archaeal-type (tetrahydromethanopterin [THMPT]) Wood-Ljungdahl (WL) pathway (9), their genomes as well as that of all Asgard genomes indicate a potential for growth on organic compounds as well (10, 11).
Thorarchaeota genomes obtained from marine and estuary sediments (2, 12) as well as mangroves (13) appeared to be metabolically diverse with potential for degradation and uptake of peptides and carbohydrates as well as potential to fix dinitrogen and make selenoproteins (12, 13). Like Lokiarchaeota, the Thorarchaeota genomes contain genes for the THMPT-WL pathway, but in addition, the latter encode also a (bacterial-type) tetrahydrofolate (THF) version (13). Together with further observations indicating the potential for acetate or ethanol production, they were suggested to have a mixotrophic lifestyle. In addition to a complete Wood-Ljungdahl pathway a methyl-CoM reductase-like enzyme was recently found in Helarchaeota, similar to those detected in Bathy- and Synthrophoarchaea (4), which together with further observations indicated a potential for anaerobic oxidation of short-chain hydrocarbons in this group (4). So far, only members of the Heimdallarchaeota seem to have the potential of facultative aerobic growth, as they were recently found to encode a complete electron transport chain with terminal oxidase as well as the aerobic kynurenine pathway (14).
The present study updates the existing phylogeny within the Asgard archaea and includes a comprehensive analysis of their environmental distribution, with metadata on abiotic parameters such as pH and temperature ranges. This study also presents three Lokiarchaeota genomes and one Thorarchaeota genome from a hypersaline biomat from a salt lagoon close to Puertecitos, Baja California, Mexico, that was shown to harbor many newly identified archaea lacking cultured representatives (15). These genomes, together with those previously reported, help to compare genome contents between the Lokiarchaeota and the Thorarchaeota and to identify new metabolic capacities. The reported inventory includes putative reductive dehalogenase (RDase) genes in the genomes of Loki- and Thorarchaeota that might be instructive for cultivation attempts, and it sheds new light on lipid biosynthesis, as we identify genes for bona fide archaeal lipid biosynthesis in Lokiarchaeota that were previously missing in the assembled published genomes.
RESULTS AND DISCUSSION
Diversity and distribution of Asgard archaea.
In order to study the environmental distribution and diversity of known Asgard archaea, all affiliated 16S rRNA gene sequences were retrieved from the SILVA database (v.132). The initial phylogenetic analysis of 4,458 putative Asgard 16S rRNA gene sequences proved challenging at first, since no monophyletic groups were retrieved at the phylum level (see Fig. S1 in the supplemental material). However, conclusive results were obtained after excluding 388 potentially chimeric or low-pintail-quality sequences (see Table S4 posted at figshare [https://doi.org/10.6084/m9.figshare.9258947]). The remaining 4,070 sequences were filtered for a length greater than 1,400 bp, resulting in 2,857 unique Asgard sequences. After further clustering at a 99% identity threshold, 246 operational taxonomic units (OTUs) were obtained. In addition to these, 32 additional Asgard 16S rRNA gene sequences from available genomes (including one new full-length 16S rRNA gene obtained in this study) and other studies (16) that were not part of the SILVA database were included in phylogenetic reconstructions. The obtained backbone tree of the Asgard archaea (Fig. 1A; see Fig. S2 in the supplemental material) exhibited generally a similar clade organization for the Lokiarchaeota lineages as reported in earlier studies on smaller data sets (7, 8, 17). After mapping the remaining 1,213 shorter sequences to the reference sequences in this backbone tree, 229 sequences were found to have a different taxonomic affiliation than was reflected in our phylogenetic analyses (see Table S4 posted at figshare). For example, there were 36 sequences that had a different taxonomic affiliation in the SILVA database (v.132) than here with the refined phylogeny, and moreover, 193 of the good-quality shorter sequences did not match any of the reference sequences in the refined phylogeny with the given threshold (see Table S4 posted at figshare). One of two distinct groups of sequences earlier assigned as the DAS (domain archaeal sequences) group (16) was found to be a sister group of the Thorarchaeota clade, and the second branched in between two Heimdallarchaeota clades. Further inferences regarding the phylogeny and naming of these groups should await full genome sequence analyses.
A thorough literature survey of metadata identified Lokiarchaeota as the group with the widest occurrence in different ecosystems compared to the other Asgard groups (Fig. 1B). However, out of the 4,070 16S rRNA gene sequences used in this analysis, environmental information was only available for 531 (13%). Of these, 22% (117) matched to one particular Lokiarchaeota OTU (group 3) found in hypersaline environments. A previous study reported that 60% of all Lokiarchaeota (then termed DSAG) sequences available at the time in the SILVA database (v.104) originated from hypersaline ponds in Guerrero Negro (Baja California, Mexico) (8). Lokiarchaeota are also found in the widest pH range (5 to 9) of any Asgard archaea, whereas Odinarchaeota were only found in neutral to moderately alkaline (pH 7.5 to 8.5) environments (Fig. 1C). However, the picture of the environmental distribution of Asgard lineages might change when more sequence data become available that are specifically dedicated to the detection of this superphylum in various environmental microbiomes.
Novel genomes of Loki- and Thorarchaeota.
We were able to reaffirm the presence of Lokiarchaeota (DSAG) in hypersaline environments by a diversity study of the prokaryotic community in another hypersaline environment in the Baja California region, a closed salt lagoon near Puertecitos, Mexico (15). Here, we used differential coverage binning from a metagenomic analysis of one of the sediment samples from that location (1-cm depth) and were able to acquire four new metagenome assembled genomes (MAGs) of Asgard archaea. One of these was of high quality and affiliated with the Thorarchaeota, while three additional MAGs were of medium quality and affiliated with Lokiarchaeota (according to MAG standards) (18) (Table 1). The Thorachaeota genome from this study (assigned as “Baja_Thor”) was the most complete of all Asgard genomes available in the database and had the least contamination according to CheckM analysis. The taxonomic affiliation of all four genomic bins that was based on their single-copy protein matches to the NR database was confirmed with the phylogenetic analysis of concatenated ribosomal proteins (Fig. 2). Published Lokiarchaeota genomes are generally larger than the available Thorarchaeota genomes, a trend that was also observed in this study, with the Thorarchaeota (Baja_Thor) genome size of 3.1 Mb notably smaller than the Lokiarchaeota genomes, which ranged from 3.9 to 4.1 Mb in size. The Orthologous Average Nucleotide Identity (OrthoANI) (19) values for the Baja_Thor genome compared with other Thorarchaeota genomes ranged from 65.6% to 66.2%. Individually, the Thorarchaeota genome from this study (Baja_Thor) was most closely related to the Thor_MP9T genome obtained from mangroves (13). The three Lokiarchaeota genomes obtained in this study were more closely related to each other than the Lokiarchaeota genomes previously published (2), with their OrthoANI values ranging from 66.8% to 68.2%.
TABLE 1.
Parameter | Result for Bin_id: |
|||
---|---|---|---|---|
Baja_Thor | Baja_Loki1 | Baja_Loki2 | Baja_Loki3 | |
Completeness, % | 92.99 | 75.19 | 83.64 | 88.32 |
Contamination, % | 3.74 | 1.4 | 9.35 | 3.74 |
Strain heterogeneity | 0 | 0 | 0 | 0 |
Size, Mb | 3.1 | 3.9 | 3.8 | 4.1 |
Scaffolds, no. | 19 | 274 | 114 | 137 |
GC, % | 41.2 | 33.2 | 32.6 | 33.6 |
N50, bp | 548,263 | 20,187 | 44,367 | 42,570 |
Length, bp | ||||
Max | 668,889 | 69,370 | 119,378 | 143,576 |
Mean | 166,420.1 | 14,312.4 | 33,109.3 | 30,262.1 |
Coverage, X | 167.8 | 15.77 | 9.21 | 7.13 |
Proteins, no. | 2,909 | 3,425 | 3,434 | 3,918 |
Coding density, % | 90.10 | 82.80 | 85.20 | 86.50 |
Reads mapping to genome, % | 2.07 | 0.27 | 0.15 | 0.14 |
Pangenomic analysis of Loki- and Thorarchaeota.
Since a number of genomic similarities between Lokiarchaeota and Thorachaeota had been pointed out earlier and since they often share the same environment and can even be found in the same sample (as in our study), we were interested in performing a general pangenomic comparison. A pairwise all-versus-all BLAST and clustering analysis was therefore performed in Orthofinder (20) with default parameters. Among the 47,666 proteins present in the 13 MAGs used for comparative analysis, 86% (40,712) belonged to one of the 5,234 protein clusters containing sequences from at least two genomes (Fig. 3). The remaining 6,954 sequences were classified as singletons. Using a strict criterion of presence in all genomes, the core Loki-Thor genome (i.e., protein families with at least one protein in all of the 13 genomes) consists of 253 clusters (∼5% of total protein families), in which the most-represented functional categories belong to information storage and processing related to translation (49 clusters) followed by energy metabolism (37 clusters). The Lokiarchaeota-specific clusters represented ∼29% (1,501) of the sequences, in which 214 protein clusters were present in all Lokiarchaeota and thus can be considered to be encoded by the Lokiarchaeota core genome. The Thorarchaeota-specific clusters were represented by ∼34% of the sequences (1,764), with 139 clusters present in all Thorarchaeota genomes. The remaining 1,969 clusters had members from both taxonomic groups.
Based on arCOG annotations, significant differences between the Loki- and the Thorarchaeota genomes became evident (Fig. 4). However, most of these differences corresponded to clusters lacking functional annotations or representing hypothetical proteins. Clear differences between the two groups were mostly observed in protein clusters related to mobile elements and defense mechanisms. Whereas Lokiarchaeota-specific protein clusters were mostly affiliated with transposable elements and recombinases (35/41 protein clusters), Thorarchaeota-specific protein clusters contained phage terminases and multidrug transport systems (11/15 protein clusters) (see Table S5 posted at figshare [https://doi.org/10.6084/m9.figshare.9258968]). The protein clusters for carbon metabolism (of both Thor- and Lokiarchaeota) contained at least one gene from all enzymatic complexes responsible for carrying out the steps from the archaeal version of the Wood-Ljungdahl (WL) pathway, as well as many of the enzymes for the bacterial (folate-dependent) WL pathway. However, the formate dehydrogenase enzyme responsible for the initial step of the reduction and fixation of CO2 through the bacterial WL-THF pathway was not found in any of the Loki- and Thorarchaeota genomes. Interestingly, even if several of the CO dehydrogenase (cdh) subunits were found encoded in genomes from the two groups, in the Baja_Thor genome, which is the most complete among all genomes, no cdh subunits were found, although genes encoding other enzymes of the THMPT-WL pathway were present. Recent analysis of the WL pathway indicated its great modularity and flexibility in archaea, in terms of both evolution as well as functional diversity (21). Consistent with previous findings (13), an incomplete tricarboxylic acid (TCA) cycle was found (see Table S6 posted at figshare [https://doi.org/10.6084/m9.figshare.9259094]) in all Loki- and Thorarchaeota analyzed, as well as potential pathways for degradation and assimilation of proteins. The latter included enzymes coding for extracellular proteases and peptidases: e.g., aminopeptidases like pepDPFN and serine proteases like aprE (see Table S6 posted at figshare). In addition, genes encoding an arsenic efflux pathway and selenocysteine biosynthesis (13) (see Table S6 posted at figshare) were present in all genomes from both groups. Similarly, several membrane transporter genes, including those encoding the enzymes for multisugar compounds (see Table S6 posted at figshare), like MFS family permease genes, were also found in all genomes analyzed.
Putative reductive dehalogenases.
Halogenated organic compounds have been introduced into the environment through anthropogenic activities but also occur naturally, particularly in marine sediments (22). Interestingly, from the orthogroup analysis, we identified genes encoding putative reductive dehalogenases (RDases) in nearly all Thor- and Lokiarchaeota genomes (see Fig. S3 in the supplemental material). The three genomes lacking RDase genes were least complete (Loki_CR4, Baja_Loki1, and Thor_SMTZ_45), providing an incomplete picture of gene content (see Table S3 posted at figshare [https://doi.org/10.6084/m9.figshare.9258926]). The Lokiarchaeota GC14 genome has two copies of a putative RDase gene (86.8% amino acid identity); however, this genome may be a composite of two genomes for closely related Lokiarchaeota, and it is not clear if a single genome harbors two putative RDase genes (3). To date, Ferroglobus placidus is the only other archaeon whose genome encodes a putative RDase (23).
The known and well-studied RDases contain a conserved RDase domain, pfam13486. The domain organizations of RDases are variable and fall into at least two categories. The RDases involved in organohalide respiration contain a C-terminal 4Fe-4S dicluster binding domain (pfam13484) or alternatively a 4Fe-4S dicluster domain (pfam12838) and an N-terminal twin arginine translocation (TAT) signal sequence (Fig. 5) (24). There are alternative domain organizations associated with catabolic versus respiratory RDases, which may include an additional flavin adenine dinucleotide (FAD)- or NAD-binding and 2Fe-2S-binding domains at the C termini (25). All of the putative RDases identified among the Asgard archaea share core RDase domains, including reductive dehalogenase (pfam13486), and the 4Fe-4S dicluster binding domains (pfam13484) (Fig. 5; see Fig. S4 in the supplemental material). The thorarchaeotal sequences, on the other hand, contain an additional 4Fe-4S domain (pfam00037) near the N terminus, a domain organization that has neither been observed neither among proven-function RDases (Fig. 5) nor by data mining and characterization of RDases in public databases (26). A TAT signal sequence responsible for the translocation of the enzyme across the membrane is present in all experimentally verified respiratory RDases (27) but is absent in catabolic RDases studied to date (25). TAT signals are not found in any of the Thor- or Lokiarchaeota genome sequences (Fig. 5; Fig. S4), suggesting that these putative RDases are not involved in a respiratory process; however, since archaeal signal peptides are less characterized, their absence does not prove that translocation across the cytoplasmic membrane does not occur.
A phylogenetic analysis comparing the Asgard archaeal putative RDases with the most closely related ortholog groups in the EggNOG database (28) revealed that the enzymes encoded on the Thor-, Lokiarchaeota, and Ferroglobus placidus genomes cluster among bacterial RDases rather than the most closely related groups of archaeal proteins (Fig. S3). Notably, the putative Asgard RDases cluster together, but separately from the RDase found in F. placidus. The most closely related functionally characterized enzyme (∼30 to 33% amino acid identity) is PceA from Sulfurospirillum multivorans. The PceA RDase is involved in organohalide respiration with tetrachloroethene as electron acceptor and reductive dechlorination of other organohalogens, including halogenated phenolic compounds, has been demonstrated (29, 30). Three highly conserved amino acid residues (Tyr246, Arg305, and Asn272) in the active site of PceA (30) are conserved in the putative RDases from Loki- and Thorarchaeota (Fig. S4). Taken together, these data suggest that the putative Asgard RDases probably represent enzymes with RDase function that may have been acquired by horizontal gene transfer from bacteria.
The gene neighborhoods of the putative Asgard RDase genes are notably conserved within the Thorarchaeota and within the Lokiarchaeota, but not between the two groups (see Fig. S5 in the supplemental material). The observed conservation of synteny suggests that the metabolic roles of these RDases may be informed by the functions encoded by consistently present surrounding genes (31). This is in contrast to the case of previously described respiratory and catabolic RDases, which show inconsistent gene neighborhoods. While a handful of genes speculated to be involved in functions such as molecular chaperoning, electron transfer, and corrinoid scavenging are conserved in some RDase-containing lineages, the only gene consistently associated with respiratory RDase genes encodes an accessory B protein, which tethers the RDase to the membrane, thereby enabling its role in a respiratory process. This pattern is nearly ubiquitous in respiratory RDases (reviewed in reference 24), although a subset of putative RDase genes appears to have fused with the respective gene encoding the accessory B protein (3, 31a). Genes encoding B proteins were not detected in Thorarchaeota and Lokiarchaeota genomes, nor were any RDase gene fusions detected. Many of the syntenic genes neighboring the archaeal RDases can be functionally annotated. In the Lokiarchaeota, the putative RDase is found near a trio of conserved genes annotated as arginyl-tRNA synthase (K01480), agmatinase (K01480), and deoxyhypusine synthase (K00809), suggesting a possible role in metabolism of small organoamines. These three syntenic genes encode proteins with ∼75% amino acid identity between the Baja_Loki3 and Loki_GC14 genomes (Fig. S5A). In the Thorarchaeota, the genes surrounding the putative RDase genes are remarkably syntenic (Fig. S5B), although their biological function is difficult to infer and a considerable synteny is not specific to this region but found along the whole Thorarchaeota genomes.
The analyzed genomes did not seem to have complete pathways for corrinoid biosynthesis. However, a cobalamin transporter (cbiM), which is necessary for the function of this enzyme (25, 32), is also present in both Loki- and Thorarchaeota genomes (see Table S6 posted at figshare [https://doi.org/10.6084/m9.figshare.9259094]). Orthogroup and BLAST analyses revealed that the Baja Loki- and Thorarchaeota genomes also contain potentially complete versions of pathways for aromatic amino acid degradation, similar to what has been found in Ferroglobus placidus (33) (see Table S7 posted at figshare [https://doi.org/10.6084/m9.figshare.9259097]). In particular, genes encoding the key enzyme phenylacetyl-coenzyme A (CoA) ligase present in the Baja_Loki3 genome share sequence similarities resulting in up to 52% amino acid identity with the enzyme of F. placidus (33, 34). The degradation of tyrosine includes 4-hydroxyphenylacetate as an intermediate, and given the presence of genes encoding this pathway, both Loki- and Thorarchaeota might dechlorinate chlorinated aromatic compounds, including aromatic amino acid derivatives, using an RDase and then channel dechlorination products into this pathway, leading directly to C and N scavenging from aromatic amino acids. The enzyme indolepyruvate:ferredoxin oxidoreductase (Ior), which is characterized to be involved in peptide fermentation through oxidative decarboxylation of pyruvate in archaea (35, 36), was also identified to be encoded within Loki- and Thorarchaeota genomes (see Table S7 posted at figshare). In several methanogens, however, the phenylacetyl-CoA ligase and Ior are also present, being part of one of the three aromatic amino acid biosynthetic pathways (or of pABA) and indicating assimilation of indoleacetate and phenylacetate by reductive carboxylation (37, 38). Thus, the genomic identification of this pathway within Loki- and Thorarchaeota genomes per se is not a proof of its physiological function. In any case, the utilization of chlorinated amino acid derivatives, which have been observed as natural products (39), would give Loki- and Thorarchaeota a competitive advantage in oligotrophic environments.
Lipid biosynthesis pathways.
The so-called “lipid divide” (i.e., the presence of fundamentally different lipids in the archaea as opposed to lipids in bacteria and eukaryotes) has long been a topic of interest when discussing transitions in early evolution and in particular the origin of the eukaryotic cell (40). Considering the close affiliation of Asgard archaea with eukaryotes, it is particularly interesting to study the archaeal lipids (2, 3). Intriguingly, an archaeal glycerol-1-phosphate dehydrogenase (G1PDH) was found in neither the first analysis of the Lokiarchaeota genome GC14 nor the Loki_CR4 genome (3, 40). This enzyme determines the stereo-configuration of archaeal lipids and is a hallmark enzyme of archaea. However, it was also found to be missing in the genomes of marine group II and III archaea, and a homolog has recently been found in a bacterium, Bacillus subtilis (40). Interestingly, the protein from Bacillus subtilis has been biochemically characterized and was shown to produce G1P for archaeal-type phosphoglycerolipid synthesis in bacteria (41, 42). Our study of the extensive orthologous group analysis revealed that all Thorarchaeota genomes and all three Lokiarchaeota genomes from Baja California possess a G1PDH (Fig. 6; see Fig. S6 and Fig. S7 in the supplemental material), and we identified a truncated G1PDH in Lokiarchaeota GC14 located at the end of a scaffold, which may have complicated previous annotation attempts. In addition, we identified a G1PDH homolog in the most complete of the recently newly released Lokiarchaeota genomes (14). The sequence alignment (Fig. S6) confirmed similarity to known G1PDH and showed conservation in residues, including Asp168, His248, and His264, for binding of Zn2+ ions in the G1PDH enzymes of Loki- and Thorarchaeota, identical to different G1PDH representatives (41). Residues for NAD(P)H binding in the G1PDH were also conserved in Thorarchaeota (6/6 conserved) and mostly in Lokiarchaeota (4/6). However, a difference was seen in the residues of the G1PDH for binding of the dihydroxyacetone phosphate (DHAP) moiety, as these were well conserved in the Thorarchaeota but not in the Lokiarchaeota (Fig. S6). Phylogenetic analysis revealed that the enzymes were part of two basal branches, with Thorarchaeota forming a monophyletic lineage with archaea and Lokiarchaeota grouping with enzymes from Bacillus spp. and Geobacillus spp. (Fig. S7) (41, 43). Although phylogenetic analyses and conserved amino acid residues suggest these enzymes are in fact G1PDHs, biochemical characterizations will be necessary to provide more support.
No Loki- or Thorarchaeota genome had a glycerol-3-phosphate dehydrogenase (G3PDH/gps) involved in forming G3P from DHAP, as needed for the formation of bacterial lipids. Although all genomes seemed to contain several homologs of glycerol kinase (glpK), a detailed phylogenetic analysis revealed that most were homologs of bacterial carbohydrate kinases. Only four genes from Lokiarchaeota genomes represented homologs of the bona fide glycerol kinase (see Fig. S8 in the supplemental material). Consequently, G3P could in principle represent the backbone for bacterial-/eukaryal-like lipids in these Lokiarchaeota (as speculated earlier [40]); however, the presence of a G3PDH/glpAD allows them to further metabolize G3P into DHAP like other heterotrophic archaea (Fig. 6; see Table S8 posted at figshare [https://doi.org/10.6084/m9.figshare.9259103]) (40). Alternatively, G3P could also be synthesized from glycerol phosphodiester via the diesterase UgpQ, which was found in all Loki- and Thorarchaeota genomes. While all Loki- and most Thorarchaeota genomes contained genes for connecting the isoprenoid side chains to the glycerol backbone via ether linkage (GGGP and DGGP synthases), they also contained genes for enzymes forming ester bonds to a fatty acid (FA) mimicking the bacterial synthesis. Such enzymes include homologs of acyl transferases, i.e., PlsY involved in ester-linking a fatty acid to a G3P in the sn-1 position (44), which we found in Loki- and Thorarchaeota genomes and could not be detected in any other archaea, as well as PlsC (responsible for ester-linking FA to G3P in the sn-2 position) found in four Lokiarchaeota genomes and also in few other archaea.
Taken together, our findings show that Loki- and Thorarchaeota have all the genes necessary for the synthesis of bona fide archaeal lipids. In addition some Lokiarchaea, but rather not Thorarchaeota, might potentially be able to produce bacterial-type lipids, but the distribution of the respective genes (glpK, plsY, and plsC) is scattered. Since genes for the archaeal and bacterial lipid biosynthesis pathways are present in Loki- and Thorarchaeota, and since genes for the synthesis of fatty acids have also been proposed (45), we can also not rule out the possibility that chimeric lipids could be produced—for example, consisting of an isoprenoid chain ester bonded to a G3P—as has been suggested before (45). This observation strengthens claims of previous studies, including the first Lokiarchaeum composite genome (Lokiarchaeum_GC14), which proposed that Lokiarchaeota could use a hypothetical DGGGP synthase to form ether-linked isoprenoids with G3P stereochemistry, use PlsC to form ester-linked fatty acids with G3P chemistry, or use PlsC in a hypothetical function to produce chimeric membranes with ether-linked isoprenoids in the sn-1 position and ester-linked fatty acids in the sn-2 position of a G3P backbone (40). The additions of the Baja_Loki and Baja_Thor genomes might support the idea that these Asgard archaea could also produce both types of membranes or a heterochiral membrane. A recent study on lipid engineering showed that biochemically the GGGP synthase enzyme of the archaeon Methanococcus maripaludis, responsible for linking an isoprenoid chain GGPP to G1P, had only a slightly higher preference for G1P over G3P and that, in practice, a mixed heterochiral membrane made an Escherichia coli strain more robust to cellular stressors (40, 46).
Conclusions.
A range of metabolic capacities points to potentially versatile metabolisms in Loki- and Thorarchaeota. The finding of putative reductive dehalogenase genes in the genomes of both groups (see also reference 10) together with potential genes encoding enzymes for aromatic amino acid utilization points to an adaptation of the organisms to specific niches. In the salt lagoon from which the genomes reported here were obtained, there might exist a pool of chlorinated compounds arising from the breakdown of organic material followed by subsequent chlorination, through biotic (47) or abiotic (47, 48) reactions, making these substrates inaccessible to nondehalogenating organisms and providing a competitive advantage for organisms with the ability to dehalogenate. Our study expands previous work (2, 7, 8, 10, 14) and shows that Loki- and Thorarchaeota are commonly found in anoxic environments, where they are involved in nutrient cycling potentially including halogenated compounds. This finding might be instructive for selective enrichments of the organisms from marine sediments or other locations. Further, our data provide evidence that Lokiarchaeota, like other Asgard archaea, can produce glycerol-1-phosphate for bona fide archaeal phosphoglycerolipids, which might as well assist in tracing them in environmental samples, albeit confirmations can only be obtained through cultivation of the organisms.
MATERIALS AND METHODS
Asgard 16S rRNA gene diversity.
The reference sequences of the 16S rRNA genes that belong the Asgard Archaea superphylum were downloaded from the latest SILVA rRNA database (v.132; 20 December 2017). The following filtering steps were implemented to these downloaded sequences to make a better reference phylogeny and taxonomic annotation of Asgard archaeal 16S rRNA gene sequences. (i) The sequences were filtered based on their “pintail” values (<95) from the Silva database and then were checked for “chimeric” regions based on the “Gold” reference sequences obtained and implemented through the UCHIIME pipeline (49). (ii) Then the longer sequences (>1,400 bp) were clustered at 99% sequence identity using UCLUST (50). (iii) These representative sequences (246 in total) along with 16S rRNA gene sequences from all available Asgard genomes, including this study (2, 3, 12, 13) together with longer Asgard 16S rRNA gene sequences from a primer-free method (16) (34 in total), were aligned using MAFFT (L-INS-i) (51) against a reference alignment from many representatives throughout TACK and Bathyarchaeota phylogeny (65 in total) (see Table S1 posted at figshare [https://doi.org/10.6084/m9.figshare.9257885]). (iv) The alignment was trimmed using Trim-Al operated under the “-gappyout” configuration (52), from which the maximum likelihood (ML) phylogenetic tree was calculated using IQ-TREE (53) with a minimum of 1,000 iterations (-bb 1000 -alrt 1000) and GTR-I-G4 as the best model. (v) Then all other high-quality shorter sequences (pintail value of >95, length of <1,400 bp) were mapped to the reference sequences of the backbone tree using USEARCH (-usearch_local -id 0.95 -evalue 1e−06) (50). (vi) Finally, all the taxonomic affiliations of sequences that differed in the SILVA database from the taxonomic affiliation based on this refined phylogeny (Fig. 1A) were corrected. An in-house script was then used to obtain the respective GenBank file for each of these sequences from the NCBI database (54). From here, the isolation source (environment), pH, and other linked information were obtained mainly based on the original publications that reported these sequences.
Sample collection.
Two 20-cm sediment cores (cores 1 and 2) were collected at a dry lagoon near Puertecitos on the Baja California Peninsula (latitude, 30°12′37.27″N; longitude, 114°39′48.36″W) in May 2016. Immediately after sampling, the cores were divided into four subsamples of ca. 2 g material collected at 5-cm intervals (1, 5, 10, and 15 cm) and were stored in RNAlater solution (Sigma-Aldrich, Vienna, Austria). Upon arrival in the laboratory (University of Vienna), the samples were washed with phosphate-buffered saline (PBS) buffer to remove the RNAlater solution. DNA for metagenome sequencing was extracted from the top 1-cm horizon of the core with 0.5 g starting material. The protocol included bead beating and phenol-chloroform extraction with 5% cetyltrimethylammonium bromide (CTAB)–phosphate extraction buffer. A second preparation included an additional pretreatment with lysozyme (1.5 mg/ml at 37°C for 30 min; Sigma-Aldrich, Vienna, Austria). Sequences were generated from both DNA preparations to allow for differential coverage binning. Whole-metagenome shotgun sequencing was performed on an Illumina HiSeq2500 (paired-end 150-bp reads) at the Vienna Biocenter Core Facilities (VBCF). Subsamples were taken from both cores for in-depth analysis of the prokaryotic community throughout the sediment horizons, including 16S rRNA gene amplicon sequencing and lipid biomarker analysis, and can be found elsewhere (13, 15).
Binning.
The raw sequences were trimmed (5-bp window with an average quality value of <20), and the sequencing adaptors were removed using TRIMMOMATIC (55) before the sequences with average low quality (<25) and low complexity were filtered using PRINSEQ (56). Trimmed reads from both lysozyme-treated and untreated samples were coassembled using both metaSPAdes (57) and MEGAHIT (58). The trimmed reads from each of the samples were then mapped to the coassembly using BBMap (59), and the average coverage for each scaffold in each sample was calculated further. The taxonomy of each scaffold was then predicted based on their universally conserved marker proteins (60) from each scaffold matching to the NCBI protein database (NR) as explained for multimetagenome binning strategies (61, 62). The differential coverage binning targeting the assembled scaffolds/contigs of the Asgard archaea was established using the “mmgenome” toolbox (62) from the assembled metagenomes resulting from the two different assembly programs. The reads mapping to the Asgard archaea from these assemblies were then extracted and reassembled together using metaSPAdes. The Asgard archaeal bins (metagenome assembled genomes [MAGs]) were then checked for completeness and contamination using checkM (63).
Genome annotation and orthologous groups.
The genes from the MAGs were predicted using Prodigal (64). The MAGs in this study were also analyzed with the support of MaGe (65) for various genomic features, including synteny. The COG and arCOG annotations of the proteins in the MAGs were predicted using COGsoft (45, 66). The proteins were also annotated for functions using KEGG (BlastKOALA) (67), BLASTP against nonredundant proteins, and the RefSeq protein database from NCBI (E value of <1e−10). The subcellular localization and the transmembrane domains of the proteins were predicted using multiple tools, including PSORTb, Phobius, and PRED-SIGNAL (68–70). The 16S and 23S rRNA regions in the MAGs were also predicted using RNAmmer (71). The predicted 62 ribosomal proteins (see Table S2 posted at figshare [https://doi.org/10.6084/m9.figshare.9258521]) from the Loki- and Thorarchaeota were aligned together with the representative organisms from TACK (Bathyarchaeota SMTZ-80, Nitrososphaera viennensis and Sulfolobus islandicus LS215), Euryarchaeota (Thermoplasma volcanicum GSS1, Halobaculum gomorrense, and Methanothermococcus okinawensis), and DPANN (Pacearchaeota RBG-13-36-9 and Woesearchaeota UBA94) with MAFFT (L-INS-i). The alignments were then refined with BMGE (72) and concatenated. Further, the phylogeny of the MAGs from this concatenated alignment was inferred using IQ-TREE (ML) with a minimum of 1,000 iterations (-bb 1000 -alrt 1000) and LG+F+I+G4 as the best model. The orthologous group analyses of the MAGs in this study were carried out along with all available genomes from Loki- and Thorarchaeota (April 2018) in the NCBI database together with the genomes from this study using OrthoFinder (20) (see Table S3 posted at figshare [https://doi.org/10.6084/m9.figshare.9258926]).
Analysis of putative reductive dehalogenase genes.
The amino acid sequences of the putative Asgard RDases were queried against the EggNOG database (28) with a likelihood threshold of 1e−40. Representative sequences of the resulting orthologous groups were collected from the UniProt database (The UniProt Consortium, 2019) via their accession numbers. Epoxyqueosine reductase (QueG), which shares sequence similarity with RDases (25), was selected as an outgroup for phylogenetic reconstruction. All QueG protein sequences were collected from the KEGG database orthology group for QueG (K18979), and a representative set was generated by application of CD-hit (73) at 50% similarity clustering. The putative Asgard RDases, EggNOG orthogroup representatives, and QueG representatives were aligned with MAFFT –auto (51). The alignment was trimmed using TrimAl –gappyout (52), and subjected to maximum likelihood phylogenetic reconstruction using RaxML rapid bootstrap analysis under the PROTGAMMALG substitution model with 500 iterations. The resulting phylogenetic tree was visualized using the Interactive Tree of Life (74).
Conserved protein domains of the putative Asgard RDases were compared to those of RDases with demonstrated reductive dechlorination activity as described previously (75) and the putative RDase of Ferroglobus placidus. pfam domains and signal peptides were characterized using the NCBI Web CD-Search Tool (76) and SignalP 5.0 (77), respectively. A maximum likelihood phylogenetic tree was constructed using the archaeal and proven-function RDases using MAFFT, TrimAl, RaxML, and the Interactive Tree of Life as described above.
BLASTp (78) with an E value threshold of 1e−10 was used to perform pairwise comparisons between protein sets encoded in putative RDase gene neighborhoods. Thorarchaeotal synteny was characterized by comparing all coding regions found on the RDase-containing scaffolds of Thor_SMTZ1-83, Thor_AB, and Thor_SMTZ1-45 to the 15 downstream and 26 upstream coding regions relative to the RDase detected in Baja_Thor. For Lokiarchaeota, sequential pairwise comparisons between all coding regions on the RDase-containing scaffolds were performed as shown in Fig. S5. BLASTp alignment results were visualized using the genoPlotR package (79). KEGG orthology annotations were assigned using GhostKoala (80), and COG and TIGRFAM annotations were assigned using the WebMGA server with E value thresholds of 1e−3. The protein sequences in MAGs predicted to be involved in the pathways of anaerobic degradation of aromatic amino acids were inferred through the pathways in Ferroglobus placidus (33) (BLASTp [>35% amino acid identity and >85% query coverage]).
Annotation of lipid biosynthesis pathways.
The different enzymes involved in the lipid biosynthesis pathway(s) identified in all MAGs from Loki- and Thorarchaeota were predicted based on the different functional annotation tools, including HMMER3 and orthogroup analysis. These enzymes were inferred based on the known enzymes and pathways (40). The phylogeny of the predicted enzymes such as glyceraldehyde-3-phosphate dehydrogenase (glpAD) and glycerol kinase (glpK) were obtained together with those of the other enzymes involved in the process (40). Similarly, the phylogeny of glyceraldehyde-1-phosphate dehydrogenases (G1PDH) in Lokiarchaeota was predicted together with those of different members of the glycerol dehydrogenase superfamily (glp and gps) and also with manually curated G1PDHs from UniProt. The Lokiarchaeal G1PDHs were aligned specifically with the bacterial G1PDHs (Bacillus subtilis) using MAFFT (L-INS-i) to understand their relationship (41).
Data availability.
The genome bins from this study can be found at NCBI BioProject ID PRJNA521734. The MAGs from this study have been deposited under the following accession numbers: Baja_Thor (“Candidatus Thorarchaeota archaeon” BC), SHMX00000000; Baja_Loki1 (“Candidatus Lokiarchaeota archaeon” BC1), SHMU00000000; Baja_Loki2 (“Candidatus Lokiarchaeota archaeon” BC2), SHMV00000000; and Baja_Loki3 (“Candidatus Lokiarchaeota archaeon” BC3), SHMW00000000.
ACKNOWLEDGMENT
This work was financed through the European Research Council Advanced Grant ERC-ADG 695192. Funding by the WWTF (VRG15-007 to Sousa FL) is also acknowledged.
Footnotes
Citation Manoharan L, Kozlowski JA, Murdoch RW, Löffler FE, Sousa FL, Schleper C. 2019. Metagenomes from coastal marine sediments give insights into the ecological role and cellular features of Loki- and Thorarchaeota. mBio 10:e02039-19. https://doi.org/10.1128/mBio.02039-19.
Contributor Information
Mary Ann Moran, University of Georgia.
Laura Hug, University of Waterloo.
Simonetta Gribaldo, Pasteur Institute, Paris France.
REFERENCES
- 1.Hug LA, Baker BJ, Anantharaman K, Brown CT, Probst AJ, Castelle CJ, Butterfield CN, Hernsdorf AW, Amano Y, Ise K, Suzuki Y, Dudek N, Relman DA, Finstad KM, Amundson R, Thomas BC, Banfield JF. 2016. A new view of the tree of life. Nat Microbiol 1:16048. doi: 10.1038/nmicrobiol.2016.48. [DOI] [PubMed] [Google Scholar]
- 2.Zaremba-Niedzwiedzka K, Caceres EF, Saw JH, Bäckström D, Juzokaite L, Vancaester E, Seitz KW, Anantharaman K, Starnawski P, Kjeldsen KU, Stott MB, Nunoura T, Banfield JF, Schramm A, Baker BJ, Spang A, Ettema T. 2017. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541:353–358. doi: 10.1038/nature21031. [DOI] [PubMed] [Google Scholar]
- 3.Spang A, Saw JH, Jørgensen SL, Zaremba-Niedzwiedzka K, Martijn J, Lind AE, van Eijk R, Schleper C, Guy L, Ettema T. 2015. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521:173–179. doi: 10.1038/nature14447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Seitz KW, Dombrowski N, Eme L, Spang A, Lombard J, Sieber JR, Teske AP, Ettema TJG, Baker BJ. 2019. Asgard archaea capable of anaerobic hydrocarbon cycling. Nat Commun 10:1822. doi: 10.1038/s41467-019-09364-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cunha VD, Da Cunha V, Gaia M, Gadelle D, Nasir A, Forterre P. 2017. Lokiarchaea are close relatives of Euryarchaeota, not bridging the gap between prokaryotes and eukaryotes. PLoS Genet doi: 10.1371/journal.pgen.1006810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Spang A, Eme L, Saw JH, Caceres EF, Zaremba-Niedzwiedzka K, Lombard J, Guy L, Ettema T. 2018. Asgard archaea are the closest prokaryotic relatives of eukaryotes. PLoS Genet 14:e1007080. doi: 10.1371/journal.pgen.1007080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jorgensen SL, Hannisdal B, Lanzén A, Baumberger T, Flesland K, Fonseca R, Ovreås L, Steen IH, Thorseth IH, Pedersen RB, Schleper C. 2012. Correlating microbial community profiles with geochemical data in highly stratified sediments from the Arctic Mid-Ocean Ridge. Proc Natl Acad Sci U S A 109:E2846–E2855. doi: 10.1073/pnas.1207574109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jørgensen SL, Thorseth IH, Pedersen RB, Baumberger T, Schleper C. 2013. Quantitative and phylogenetic study of the Deep Sea Archaeal Group in sediments of the Arctic mid-ocean spreading ridge. Front Microbiol 4:299. doi: 10.3389/fmicb.2013.00299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sousa FL, Neukirchen S, Allen JF, Lane N, Martin WF. 2016. Lokiarchaeon is hydrogen dependent. Nat Microbiol 1:16034. doi: 10.1038/nmicrobiol.2016.34. [DOI] [PubMed] [Google Scholar]
- 10.Spang A, Stairs CW, Dombrowski N, Eme L, Lombard J, Caceres EF, Greening C, Baker BJ, Ettema T. 2019. Proposal of the reverse flow model for the origin of the eukaryotic cell based on comparative analyses of Asgard archaeal metabolism. Nat Microbiol 4:1138. doi: 10.1038/s41564-019-0406-9. [DOI] [PubMed] [Google Scholar]
- 11.MacLeod F, Kindler GS, Wong HL, Chen R, Burns BP. 2019. Asgard archaea: diversity, function, and evolutionary implications in a range of microbiomes. AIMS Microbiol 5:48–61. doi: 10.3934/microbiol.2019.1.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Seitz KW, Lazar CS, Hinrichs K-U, Teske AP, Baker BJ. 2016. Genomic reconstruction of a novel, deeply branched sediment archaeal phylum with pathways for acetogenesis and sulfur reduction. ISME J 10:1696–1705. doi: 10.1038/ismej.2015.233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu Y, Zhou Z, Pan J, Baker BJ, Gu J-D, Li M. 2018. Comparative genomic inference suggests mixotrophic lifestyle for Thorarchaeota. ISME J 12:1021–1031. doi: 10.1038/s41396-018-0060-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bulzu P-A, Andrei A-Ş, Salcher MM, Mehrshad M, Inoue K, Kandori H, Beja O, Ghai R, Banciu HL. 2019. Casting light on Asgardarchaeota metabolism in a sunlit microoxic niche. Nat Microbiol 4:1129. doi: 10.1038/s41564-019-0404-y. [DOI] [PubMed] [Google Scholar]
- 15.Kozlowski JA, Johnson ME, Ledesma-Vázquez J, Birgel D, Peckmann J, Schleper C. 2018. Microbial diversity of a closed salt lagoon in the Puertecitos area, Upper Gulf of California. Cienc Mar 44:71–90. doi: 10.7773/cm.v44i2.2825. [DOI] [Google Scholar]
- 16.Karst SM, Dueholm MS, McIlroy SJ, Kirkegaard RH, Nielsen PH, Albertsen M. 2018. Retrieval of a million high-quality, full-length microbial 16S and 18S rRNA gene sequences without primer bias. Nat Biotechnol 36:190–195. doi: 10.1038/nbt.4045. [DOI] [PubMed] [Google Scholar]
- 17.Inagaki F, Suzuki M, Takai K, Oida H, Sakamoto T, Aoki K, Nealson KH, Horikoshi K. 2003. Microbial communities associated with geological horizons in coastal subseafloor sediments from the Sea of Okhotsk. Appl Environ Microbiol 69:7224–7235. doi: 10.1128/aem.69.12.7224-7235.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bowers RM, Kyrpides NC, Stepanauskas R, Harmon-Smith M, Doud D, Reddy TBK, Schulz F, Jarett J, Rivers AR, Eloe-Fadrosh EA, Tringe SG, Ivanova NN, Copeland A, Clum A, Becraft ED, Malmstrom RR, Birren B, Podar M, Bork P, Weinstock GM, Garrity GM, Dodsworth JA, Yooseph S, Sutton G, Glöckner FO, Gilbert JA, Nelson WC, Hallam SJ, Jungbluth SP, Ettema TJG, Tighe S, Konstantinidis KT, Liu W-T, Baker BJ, Rattei T, Eisen JA, Hedlund B, McMahon KD, Fierer N, Knight R, Finn R, Cochrane G, Karsch-Mizrachi I, Tyson GW, Rinke C, Genome Standards Consortium, Lapidus A, Meyer F, Yilmaz P, Parks DH, Eren AM, Schriml L, Banfield JF, Hugenholtz P, Woyke T. 2017. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol 35:725–731. doi: 10.1038/nbt.3893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lee I, Kim YO, Park S-C, Chun J. 2016. OrthoANI: an improved algorithm and software for calculating average nucleotide identity. Int J Syst Evol Microbiol 66:1100–1103. doi: 10.1099/ijsem.0.000760. [DOI] [PubMed] [Google Scholar]
- 20.Emms DM, Kelly S. 2015. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol 16:157. doi: 10.1186/s13059-015-0721-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Borrel G, Adam PS, Gribaldo S. 2016. Methanogenesis and the Wood-Ljungdahl pathway: an ancient, versatile, and fragile association. Genome Biol Evol 8:1706–1711. doi: 10.1093/gbe/evw114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Field JA. 2016. Natural production of organohalide compounds in the environment, p 7–29. In Adrian L, Löffler F (ed), Organohalide-respiring bacteria, Springer, Berlin, Germany. [Google Scholar]
- 23.Anderson I, Risso C, Holmes D, Lucas S, Copeland A, Lapidus A, Cheng J-F, Bruce D, Goodwin L, Pitluck S, Saunders E, Brettin T, Detter JC, Han C, Tapia R, Larimer F, Land M, Hauser L, Woyke T, Lovley D, Kyrpides N, Ivanova N. 2011. Complete genome sequence of Ferroglobus placidus AEDII12DO. Stand Genomic Sci 5:50–60. doi: 10.4056/sigs.2225018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kruse T, Smidt H, Lechner U. 2016. Comparative genomics and transcriptomics of organohalide-respiring bacteria and regulation of rdh gene transcription, p 345–376. In Adrian L, Löffler F (ed), Organohalide-respiring bacteria, Springer, Berlin, Germany. [Google Scholar]
- 25.Payne KA, Quezada CP, Fisher K, Dunstan MS, Collins FA, Sjuts H, Levy C, Hay S, Rigby SE, Leys D. 2015. Reductive dehalogenase structure suggests a mechanism for B12-dependent dehalogenation. Nature 517:513–516. doi: 10.1038/nature13901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hug LA. 2016. Diversity, evolution, and environmental distribution of reductive dehalogenase genes, p 377–393. In Adrian L, Löffler F (ed), Organohalide-respiring bacteria, Springer, Berlin, Germany. [Google Scholar]
- 27.Palmer T, Berks BC. 2003. Moving folded proteins across the bacterial cell membrane. Microbiology 149:547–556. doi: 10.1099/mic.0.25900-0. [DOI] [PubMed] [Google Scholar]
- 28.Huerta-Cepas J, Szklarczyk D, Forslund K, Cook H, Heller D, Walter MC, Rattei T, Mende DR, Sunagawa S, Kuhn M, Jensen LJ, von Mering C, Bork P. 2016. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res 44:D286–D293. doi: 10.1093/nar/gkv1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kunze C, Bommer M, Hagen WR, Uksa M, Dobbek H, Schubert T, Diekert G. 2017. Cobamide-mediated enzymatic reductive dehalogenation via long-range electron transfer. Nat Commun 8:15858. doi: 10.1038/ncomms15858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bommer M, Kunze C, Fesseler J, Schubert T, Diekert G, Dobbek H. 2014. Structural basis for organohalide respiration. Science 346:455–458. doi: 10.1126/science.1258118. [DOI] [PubMed] [Google Scholar]
- 31.Dandekar T. 1998. Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem Sci 23:324. doi: 10.1016/S0968-0004(98)01274-2. [DOI] [PubMed] [Google Scholar]
- 31a.Atashgahi S. 2019. Discovered by genomics: putative reductive dehalogenases with N-terminus transmembrane helixes. FEMS Microbiol Ecol 95. doi: 10.1093/femsec/fiz048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Smidt H, de Vos WM. 2004. Anaerobic microbial dehalogenation. Annu Rev Microbiol 58:43–73. doi: 10.1146/annurev.micro.58.030603.123600. [DOI] [PubMed] [Google Scholar]
- 33.Aklujkar M, Risso C, Smith J, Beaulieu D, Dubay R, Giloteaux L, DiBurro K, Holmes D. 2014. Anaerobic degradation of aromatic amino acids by the hyperthermophilic archaeon Ferroglobus placidus. Microbiology 160:2694–2709. doi: 10.1099/mic.0.083261-0. [DOI] [PubMed] [Google Scholar]
- 34.Martínez-Blanco H, Reglero A, Rodriguez-Aparicio LB, Luengo JM. 1990. Purification and biochemical characterization of phenylacetyl-CoA ligase from Pseudomonas putida. A specific enzyme for the catabolism of phenylacetic acid. J Biol Chem 265:7084–7090. [PubMed] [Google Scholar]
- 35.Heider J, Mai X, Adams MW. 1996. Characterization of 2-ketoisovalerate ferredoxin oxidoreductase, a new and reversible coenzyme A-dependent enzyme involved in peptide fermentation by hyperthermophilic archaea. J Bacteriol 178:780–787. doi: 10.1128/jb.178.3.780-787.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Ozawa Y, Siddiqui MA, Takahashi Y, Urushiyama A, Ohmori D, Yamakura F, Arisaka F, Imai T. 2012. Indolepyruvate ferredoxin oxidoreductase: an oxygen-sensitive iron-sulfur enzyme from the hyperthermophilic archaeon Thermococcus profundus. J Biosci Bioeng 114:23–27. doi: 10.1016/j.jbiosc.2012.02.014. [DOI] [PubMed] [Google Scholar]
- 37.Tersteegen A, Linder D, Thauer RK, Hedderich R. 1997. Structures and functions of four anabolic 2-oxoacid oxidoreductases in Methanobacterium thermoautotrophicum. Eur J Biochem 244:862–868. doi: 10.1111/j.1432-1033.1997.00862.x. [DOI] [PubMed] [Google Scholar]
- 38.Porat I, Sieprawska-Lupa M, Teng Q, Bohanon FJ, White RH, Whitman WB. 2006. Biochemical and genetic characterization of an early step in a novel pathway for the biosynthesis of aromatic amino acids and p-aminobenzoic acid in the archaeon Methanococcus maripaludis. Mol Microbiol 62:1117–1131. doi: 10.1111/j.1365-2958.2006.05426.x. [DOI] [PubMed] [Google Scholar]
- 39.Gribble GW. 1994. The natural production of chlorinated compounds. Environ Sci Technol 28:310A–319A. doi: 10.1021/es00056a001. [DOI] [PubMed] [Google Scholar]
- 40.Villanueva L, Schouten S, Damsté J. 2017. Phylogenomic analysis of lipid biosynthetic genes of Archaea shed light on the “lipid divide.” Environ Microbiol 19:54–69. doi: 10.1111/1462-2920.13361. [DOI] [PubMed] [Google Scholar]
- 41.Guldan H, Sterner R, Babinger P. 2008. Identification and characterization of a bacterial glycerol-1-phosphate dehydrogenase: Ni2+-dependent AraM from Bacillus subtilis. Biochemistry 47:7376–7384. doi: 10.1021/bi8005779. [DOI] [PubMed] [Google Scholar]
- 42.Guldan H, Matysik F-M, Bocola M, Sterner R, Babinger P. 2011. Functional assignment of an enzyme that catalyzes the synthesis of an archaea-type ether lipid in bacteria. Angew Chem Int Ed Engl 50:8188–8191. doi: 10.1002/anie.201101832. [DOI] [PubMed] [Google Scholar]
- 43.Daiyasu H, Hiroike T, Koga Y, Toh H. 2002. Analysis of membrane stereochemistry with homology modeling of sn-glycerol-1-phosphate dehydrogenase. Protein Eng 15:987–995. doi: 10.1093/protein/15.12.987. [DOI] [PubMed] [Google Scholar]
- 44.Parsons JB, Rock CO. 2013. Bacterial lipids: metabolism and membrane homeostasis. Prog Lipid Res 52:249–276. doi: 10.1016/j.plipres.2013.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kristensen DM, Kannan L, Coleman MK, Wolf YI, Sorokin A, Koonin EV, Mushegian A. 2010. A low-polynomial algorithm for assembling clusters of orthologous groups from intergenomic symmetric best matches. Bioinformatics 26:1481–1487. doi: 10.1093/bioinformatics/btq229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Caforio A, Siliakus MF, Exterkate M, Jain S, Jumde VR, Andringa RLH, Kengen SWM, Minnaard AJ, Driessen AJM, van der Oost J. 2018. Converting into an archaebacterium with a hybrid heterochiral membrane. Proc Natl Acad Sci U S A 115:3704–3709. doi: 10.1073/pnas.1721604115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ruecker A, Weigold P, Behrens S, Jochmann M, Laaks J, Kappler A. 2014. Predominance of biotic over abiotic formation of halogenated hydrocarbons in hypersaline sediments in Western Australia. Environ Sci Technol 48:9170–9178. doi: 10.1021/es501810g. [DOI] [PubMed] [Google Scholar]
- 48.Méndez-Díaz JD, Shimabuku KK, Ma J, Enumah ZO, Pignatello JJ, Mitch WA, Dodd MC. 2014. Sunlight-driven photochemical halogenation of dissolved organic matter in seawater: a natural abiotic source of organobromine and organoiodine. Environ Sci Technol 48:7418–7427. doi: 10.1021/es5016668. [DOI] [PubMed] [Google Scholar]
- 49.Edgar RC. 2016. UCHIME2: improved chimera prediction for amplicon sequencing. bioRxiv doi: 10.1101/074252. [DOI]
- 50.Edgar RC. 2010. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- 51.Katoh K, Standley DM. 2013. MAFFT Multiple Sequence Alignment Software version 7: improvements in performance and usability. Mol Biol Evol 30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol 32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Alves RJE, Minh BQ, Urich T, von Haeseler A, Schleper C. 2018. Unifying the global phylogeny and environmental distribution of ammonia-oxidising archaea based on amoA genes. Nat Commun 9:1517. doi: 10.1038/s41467-018-03861-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Schmieder R, Edwards R. 2011. Quality control and preprocessing of metagenomic datasets. Bioinformatics 27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. 2017. metaSPAdes: a new versatile metagenomic assembler. Genome Res 27:824–834. doi: 10.1101/gr.213959.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Li D, Luo R, Liu C-M, Leung C-M, Ting H-F, Sadakane K, Yamashita H, Lam T-W. 2016. MEGAHIT v1.0: a fast and scalable metagenome assembler driven by advanced methodologies and community practices. Methods 102:3–11. doi: 10.1016/j.ymeth.2016.02.020. [DOI] [PubMed] [Google Scholar]
- 59.Bushnell B. BBTools. BBMap. http://sourceforge.net/projects/bbmap/.
- 60.Rinke C, Schwientek P, Sczyrba A, Ivanova NN, Anderson IJ, Cheng J-F, Darling A, Malfatti S, Swan BK, Gies EA, Dodsworth JA, Hedlund BP, Tsiamis G, Sievert SM, Liu W-T, Eisen JA, Hallam SJ, Kyrpides NC, Stepanauskas R, Rubin EM, Hugenholtz P, Woyke T. 2013. Insights into the phylogeny and coding potential of microbial dark matter. Nature 499:431–437. doi: 10.1038/nature12352. [DOI] [PubMed] [Google Scholar]
- 61.Albertsen M, Hugenholtz P, Skarshewski A, Nielsen KL, Tyson GW, Nielsen PH. 2013. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nat Biotechnol 31:533–538. doi: 10.1038/nbt.2579. [DOI] [PubMed] [Google Scholar]
- 62.Karst SM, Kirkegaard RH, Albertsen M. 2016. mmgenome: a toolbox for reproducible genome extraction from metagenomes. bioRxiv doi: 10.1101/059121. [DOI]
- 63.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Vallenet D, Labarre L, Rouy Z, Barbe V, Bocs S, Cruveiller S, Lajus A, Pascal G, Scarpelli C, Médigue C. 2006. MaGe: a microbial genome annotation system supported by synteny results. Nucleic Acids Res 34:53–65. doi: 10.1093/nar/gkj406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Makarova KS, Wolf YI, Koonin EV. 2015. Archaeal Clusters of Orthologous Genes (arCOGs): an update and application for analysis of shared features between Thermococcales, Methanococcales, and Methanobacteriales. Life (Basel) 5:818–840. doi: 10.3390/life5010818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kanehisa M, Sato Y, Morishima K. 2016. BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol 428:726–731. doi: 10.1016/j.jmb.2015.11.006. [DOI] [PubMed] [Google Scholar]
- 68.Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Cenk Sahinalp S, Ester M, Foster LJ, Brinkman F. 2010. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics 26:1608–1615. doi: 10.1093/bioinformatics/btq249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kall L, Krogh A, Sonnhammer E. 2007. Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server. Nucleic Acids Res 35:W429–W432. doi: 10.1093/nar/gkm256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bagos PG, Tsirigos KD, Plessas SK, Liakopoulos TD, Hamodrakas SJ. 2009. Prediction of signal peptides in archaea. Protein Eng Des Sel 22:27–35. doi: 10.1093/protein/gzn064. [DOI] [PubMed] [Google Scholar]
- 71.Lagesen K, Hallin P, Rødland EA, Staerfeldt H-H, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Criscuolo A, Gribaldo S. 2010. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol Biol 10:210. doi: 10.1186/1471-2148-10-210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Li W, Godzik A. 2006. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22:1658–1659. doi: 10.1093/bioinformatics/btl158. [DOI] [PubMed] [Google Scholar]
- 74.Letunic I, Bork P. 2016. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res 44:W242–W245. doi: 10.1093/nar/gkw290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Jugder B-E, Ertan H, Bohl S, Lee M, Marquis CP, Manefield M. 2016. Organohalide respiring bacteria and reductive dehalogenases: key tools in organohalide bioremediation. Front Microbiol 7:249. doi: 10.3389/fmicb.2016.00249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Lu F, Marchler GH, Song JS, Thanki N, Wang Z, Yamashita RA, Zhang D, Zheng C, Geer LY, Bryant SH. 2017. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res 45:D200–D203. doi: 10.1093/nar/gkw1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, von Heijne G, Nielsen H. 2019. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat Biotechnol 37:420–423. doi: 10.1038/s41587-019-0036-z. [DOI] [PubMed] [Google Scholar]
- 78.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 79.Guy L, Kultima JR, Andersson S. 2010. genoPlotR: comparative gene and genome visualization in R. Bioinformatics 26:2334–2335. doi: 10.1093/bioinformatics/btq413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. 2017. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 45:D353–D361. doi: 10.1093/nar/gkw1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The genome bins from this study can be found at NCBI BioProject ID PRJNA521734. The MAGs from this study have been deposited under the following accession numbers: Baja_Thor (“Candidatus Thorarchaeota archaeon” BC), SHMX00000000; Baja_Loki1 (“Candidatus Lokiarchaeota archaeon” BC1), SHMU00000000; Baja_Loki2 (“Candidatus Lokiarchaeota archaeon” BC2), SHMV00000000; and Baja_Loki3 (“Candidatus Lokiarchaeota archaeon” BC3), SHMW00000000.