Abstract
Despite the environmental challenges and nutrient scarcity, the geographically isolated Challenger Deep in Mariana trench, is considered a dynamic hotspot of microbial activity. Hadal viruses are the least explored microorganisms in Challenger Deep, while their taxonomic and functional diversity and ecological impact on deep-sea biogeochemistry are poorly described. Here, we collect 13 sediment cores from slope and bottom-axis sites across the Challenger Deep (down to ~11 kilometers depth), and identify 1,628 previously undescribed viral operational taxonomic units at species level. Community-wide analyses reveals 1,299 viral genera and distinct viral diversity across the trench, which is significantly higher at the bottom-axis vs. slope sites of the trench. 77% of these viral genera have not been previously identified in soils, deep-sea sediments and other oceanic settings. Key prokaryotes involved in hadal carbon and nitrogen cycling are predicted to be potential hosts infected by these viruses. The detected putative auxiliary metabolic genes suggest that viruses at Challenger Deep could modulate the carbohydrate and sulfur metabolisms of their potential hosts, and stabilize host’s cell membranes under extreme hydrostatic pressures. Our results shed light on hadal viral metabolic capabilities, contribute to understanding deep sea ecology and on functional adaptions of hadal viruses for future research.
Subject terms: Microbial ecology, Microbial ecology
Analysis of 13 sediment cores from the Challenger Deep of Marian Trench (down to 11 kilometers depth) identified distinct operational taxonomic units and relevant auxiliary metabolic genes, providing further insight into deep-sea viral metabolic capabilities and ecology.
Introduction
The global ocean is the largest virosphere on Earth and a reservoir of high viral diversity1. The role of viruses in the open ocean has been extensively described by the large-scale expeditions of Tara Oceans and Malaspina that revealed the high endemicity, structure, and lifestyle of epipelagic viral communities, as well as, a suite of adaptations that support their success2–7. Likewise, studies of viral metabolic reprograming of marine prokaryotes8–10 suggested the potential for marine viruses to contribute to carbon and nutrient cycling in the ocean’s water column by affecting the central metabolic pathways of their hosts.
Aside from the water column, viruses have also been identified in marine sediments, where they demonstrate extraordinary viral genetic diversity11–16. Nonetheless, viral communities in marine sediments are less studied than in water columns, due to the challenges of recovering viral particles efficiently from sediments17–20. Viruses show high abundances in marine sediments (107–1010 particles g−1 of dry sediment)21. Yet, the viral particles bind firmly to sediments due to electrostatic, van der Waals, and hydrophobic interactions, which complicate their separation and enumeration from the surrounding sediment matrix21. The challenges of efficiently separating viral particles from the sediments are due to the features of the virus (e.g., size, isoelectric point) and the sediment physiochemical properties (e.g., size, mineralogy, pH) that control the type and strength of interactions between viral and sediment particles21–23.
Deep-sea sediments harbor ~160 Pg prokaryotic biomass24, and in some cases, viral abundances in these settings are reported to exceed those of their putative prokaryotic hosts13,14,25. The viral shunt in abyssal and hadal realms is estimated to contribute 35% of labile carbon in those habitats and is believed to sustain the sediment microbiota in hadal sediments by providing easily degradable carbon11,26. Among prokaryotes, Thaumarchaeota and other archaeal lineages in deep-sea sediments, are reported to be more susceptible to viral infections compared to bacterial taxa27.
The data on viruses that have been recovered so far from deep-sea sediments show extraordinary novelty, and can encode putative auxiliary metabolic genes (AMGs) involved in carbon and sulfur metabolisms15,16,28–30. These AMGs are suggested to enhance viral fitness and to impact the biogeochemistry of those habitats15,31. Recent studies of the New Britain trench identified novel viral clusters in sediments that have the potential to influence microbial hydrocarbon biodegradation at depths >8 km32. Still, studies of hadal viruses are limited26,28,32,33 and have targeted only a few sampling sites, which further constrains our understanding of the biogeographic distribution, diversity, and genetic potential of viruses in these isolated hadal settings.
Here, we analyzed 37 sediment metagenomes and 3 metatranscriptomes for sediments collected from slope (>5 km) and bottom-axis sites (>10 km depth) along the Challenger Deep (CD) for the presence of viral elements. CD is the deepest hadal oceanic realm (~11 km depth) located at the southern end of the Mariana Trench, and is characterized by extreme hydrostatic pressures (>1000 atm), low temperatures (~2.5 °C), and a deficiency in labile nutrients34,35. The V-shaped topography of the trench creates a funneling effect that enhances the accumulation of organic carbon; CD bottom-axis sites present a twofold higher organic carbon content and sevenfold higher prokaryotic cell counts, compared to adjacent slope sites35. CD still remains one of the most oligotrophic hadal settings36,37, which makes it challenging to explain these relatively high prokaryotic cell densities observed in bottom-axis sediments, and at the deeper sediment layers (>10 cm below sea floor; cmbsf)35. High viral production and turnover rates were reported in CD bottom-axis sediments26, with viral density ranging between 2.4 × 106–5.3 × 107 viruses cm−328. These numbers of viral particles could possibly provide labile organic carbon to sustain benthic prokaryotes in CD as has been described for other hadal trenches26. Whether the viral shunt and resulting prokaryotic turnover are linked to the high prokaryotic abundance in CD requires investigation of the lifestyle, metabolic potential, and virus-host interactions at different sites and sediment layers. The metagenomic analyses of viral communities collected from (hado)pelagic sediments in the northwest Pacific, including CD, showed that those viral communities were distinct from other marine habitats, with evidence for high endemicity28,30. Our previous study of microbial diversity in CD sediments revealed distinct prokaryotic communities between slope and bottom-axis sites38,39, which leaves an open question of whether the spatial distribution of hosts influences the distribution of viruses in CD. Using in silico phage identification pipelines, we identified 1628 virus operational taxonomic units (vOTUs) within the 37 metagenomes and examined their taxonomy, viral community structure, and linkages to prokaryotic hosts. We also analyzed all viral contigs to identify putative AMGs that might provide additional insights into the roles of hadal sediment viruses. We present viral information from hadal metagenomes collected at different sites across CD that demonstrate distinct prokaryotic communities and geochemical gradients. Our study includes also viral data from the deepest region (>10,900 m) of this trench and describes the potential ecological implications of viruses in this extreme ecosystem.
Results and discussion
We sequenced 37 microbial metagenomes from different depth horizons (2~3 cm intervals) of 13 sediment cores covering both slope and bottom-axis sites of the Challenger Deep (Fig. 1 and Supplementary Table 1). Clean reads of metagenomes from each site were co-assembled to yield 13 metagenomes, from which viral genomic data were extracted for analyses (Supplementary Data 1). We also generated three metatranscriptome libraries from one of the bottom-axis sediment cores (T3L11: 10,908 m; 6–9, 12–15, 18–21 cmbsf) to gain insights into potential viral activities.
Identification and description of the CD viruses
The assembled contigs from the CD metagenomes were analyzed initially using the What the Phage workflow, which utilizes output from 12 tools for phage annotation and identification40. We utilized 11/12 tools of this pipeline (Supplementary Data 2) and identified 9889 putative viral contigs with size >10 kb (10.6084/m9.figshare.14815068). Due to the highly variable prediction quality of the tools in the utilized pipeline (Supplementary Data 2), we also performed manual and other curation approaches, to remove putative viral contigs that could be false positives (see also Methods). After strict and laborious curation, we retained 1628 contigs (1628/9889), which we dereplicated into vOTUs that represent species-level taxon ranks, using consensus metrics of >95% identity and >85% coverage41–43. Overall, 1622/1628 vOTUs were >10 kb while six had sizes <10 kb (Fig. 2a and Supplementary Data 3). The degree of completeness and contamination of the CD vOTUs was estimated by comparing the sequences using CheckV44 against a large database of environmentally diverse and complete viral genomes. This resulted in assigning ~89% of identified CD vOTUs to four different quality tiers: complete genomes (73/1628 vOTUs; 100% completeness with direct terminal repeats), high- (79/1628; >90% completeness), medium- (193/1628; 50–90% completeness), and low-quality (1100/1628; <50% completeness) genomes (Fig. 2b). The completeness of 189 vOTUs, (~11%; 189/1628) could not be estimated (Fig. 2b). We also identified that 19% of vOTUs (316/1628) had at least 20% of genes mapped by >1 metatranscriptomic reads in our bottom-axis metatranscriptomic libraries (Supplementary Data 4 and 5).
To compare the CD vOTUs with those publicly available from other habitats, we used the gene sharing network analytic vConTACT245. vConTACT2 clustered CD vOTUs at genus level with viruses deriving from pelagic seawater, sediment, and soil viruses (Fig. 1; Supplementary Fig. 1). We identified 1299 CD viral genera among the CD vOTUs. The majority of these genera (~77%; 1005/1299) were mainly distinct from the viral clusters deriving from pelagic seawater (Global Ocean Virome 2.07), hadal and non-hadal deep-sea sediments (seven cold seeps15 and three hadal trenches30), wetland sediments46, and thawed permafrost47 (Fig. 2c). The remaining ~23% CD genera overlapped with viruses from the hadal and non-hadal deep-sea sediments and seawater from the Global Ocean Virome 2.07 data sets, that were used for comparison (Fig. 2c and Supplementary Fig. 1). The distinct number of CD viral genera, and the limited overlap with other hadal and non-hadal deep-sea sediment habitats, indicate that these hadal CD viruses are presumably endemic to Challenger Deep.
Our CD vOTUs were also distinct when compared with viruses identified at the hadal slope sediments of the Mariana Trench48. Specifically, 98% of our CD viral contigs have not been previously identified in the Challenger Deep (<95% identity in 85% of sequence length). To be best of our knowledge, 76% of our identified CD viral genera were new (estimated by vConTACT2), when compared with the identified viruses from the upper slope (5.4–6.7 km depth) of the trench48. Comparisons of distinct viral populations between different settings in Challenger Deep (e.g., slope sites at various depths as well as slopes vs. bottom axis) will be beneficial for understanding hadal viral ecology and links between viral diversity and hadal physicochemical characteristics. However, unless additional locations are sampled at Challenger Deep in the future, the paucity of available comparisons limits the interpretation of hadal viral diversity in different settings.
We were able to assign taxonomy to 39% of the detected vOTUs using the majority-rules approach7 (see Methods) (Fig. 2b). The CD vOTUs were mainly classified into three viral families that included Siphoviridae, Myoviridae, and Podoviridae. These viral families are well-classified in deep-sea sediments16,30 and hadal water columns (Mariana, Yap, and Kermadec Trenches)30 but also in pelagic settings (Tara Ocean)7. We note that since the time of data freeze for preparation of this manuscript, the taxonomy of phages has undergone a revision described in Walker et al. 202149 and is now implemented by the International Committee on Taxonomy of Viruses (ICTV). As a result, the taxon naming will need to be updated by interested users of our data with the new taxon names that were approved after our analyses were completed. The estimated abundances of vOTUs were summed at the family level, and the taxonomically classified viruses accounted for 8% to ~54% of all viral communities (Fig. 3 and Supplementary Data 6).
Recruiting deep-sea metagenomic reads to our CD vOTUs showed that, 98% of our data were not detected in other deep-sea metagenomes used for comparisons in this study (Supplementary Fig. 2). This can suggest that the identified vOTUs in Challenger Deep are possibly endemic viral species of the CD trench. Among vOTUs, vOTU T1L10_NODE_10823 was the most abundant and accounted for ~4% (on average) of the viral communities across CD (Supplementary Data 5). vOTU T1L10_NODE_10823 shared homologous regions with other viruses in 2/15 deep-sea reference metagenomes (Supplementary Fig. 2). Highly endemic viruses have also been reported in the upper ocean, where local environmental conditions (e.g., oxygen, temperature) affect the host community structure4,7. Despite the increasing number of viral populations identified from various environmental settings30,46,47, it appears that the deep-sea taxonomic diversity of viral communities still remains under sampled.
To evaluate the lifestyle of CD viral elements, we used VIBRANT50 to predict prophage and integrase-encoding contigs as potential temperate viruses based on protein signatures (bacteria-like, and integrase-like genes) from KEGG, Pfam, and VOG databases50. Our results indicated that 1,541 viral contigs (95%) in CD viral communities were not assigned to either a lytic or lysogenic lifestyle (Fig. 2b, undetermined). It is possible that many/most of these undetermined viral contigs belong to viruses that have a lytic lifestyle in hadal depths. This would be consistent with studies of viral communities from surficial sediments collected in different deep-sea oceanic settings (Arctic, Atlantic, Pacific Oceans, and Mediterranean Sea; >1000 m water depth) that report high viral lysis rates27. With regard to lysogeny, it was predicted only in 5% of the CD viruses. This differs from deep-sea sediments that showed lysogeny as a more common potential viral lifestyle (e.g., Baltic Sea; ~19% on average)25 but is more in line with the prediction results that we obtained for deep-sea cold seep sediments (7%)15 and ocean seawater viruses (3%)7 using VIBRANT50. Nonetheless, our arguments need to be interpreted with caution considering that 95% of viral contigs were not assigned as lytic or lysogenic.
To investigate the spatial distribution of the CD viruses, we estimated the relative abundance of CD vOTUs in the sediment samples collected from slope and bottom-axis sampling sites across the trench. The relative abundance of vOTUs in each 2~3 cm sediment layer was calculated as the normalized coverage of each vOTU divided by the total normalized coverage of vOTUs at the investigated sediment layer (Supplementary Data 5). Principal coordinate analysis (PCoA) utilizing a Bray–Curtis dissimilarity distance matrix showed a significant difference (p = 0.001) between the vOTUs isolated from the slope vs. bottom-axis samples (Fig. 4a and Supplementary Fig. 2). The distribution of the dominant viral populations, at species level, was also different between the slope and bottom-axis sites (Supplementary Fig. 3). This can be attributed to differences in the geographical isolation and nutrient availability between slope and bottom-axis sites that have been suggested to affect the distribution of prokaryotic communities across the V-shaped CD trench38,39. Differences in viral communities at discrete depths observed in this study and between this study and upper CD slope48 may possibly reflect in situ variations in available nutrients, and/or variations in DNA recovery or methods used for metagenome assembly and extraction of viral data. In our study, there were also ubiquitous vOTUs such as T1B5_NODE_690, T1B8_NODE_8617, and T1B5_NODE_8075 that were present in all of the 37 CD samples, but could not be classified. The alpha diversity of CD viral communities was significantly higher (p < 0.05) in the topographically isolated bottom-axis, when compared to slope sites. The different diversity scores between the bottom-axis and slope were supported by all three indices (Chao1, ACE, and Shannon), as well as by the identified vOTUs that were overall discrete between bottom-axis and slope sites (Fig. 4b–f and Supplementary Fig. 4). Our results indicate higher viral community diversity, and distinct viral components in the bottom-axis sediments which are at deeper and more remote water depths, compared to the slope sites of the trench. Also, the bottom-axis sediments accumulate higher amounts of detrital organic matter due to the V-shaped topography of the trench, which could increase the role of organic matter in shaping microbial host communities and subsequently, viral diversity.
Host and virus linkages
The ecological role of CD viruses and their potential to affect nutrient cycling5,15 across the trench was examined by screening 586 CD microbial metagenome-assembled genomes (MAGs) to identify putative hosts (NCBI BioProject accession: PRJNA635214). These prokaryotic MAGs were recovered from the same metagenomes as the viral contigs. For host prediction, we used VirMatcher51, which is the only current host prediction tool to assign confidence scores (see Methods). We predicted potential prokaryotic hosts for 14 CD vOTUs (Supplementary Data 7), which accounted only for a small fraction (14/1628) of the CD viral community. The in silico host prediction indicated that CD viruses may infect 42 of our CD MAGs, assigned at 27 taxa at the species level (spanning seven phyla). These taxa include heterotrophs (e.g., Proteobacteria) and chemoautotrophs (e.g., Thaumarchaeota, Planctomycetota) involved in nitrogen and carbon cycling whose taxonomic signatures were abundant in CD sediments39, but with different relative abundances (7% to 43%) across the discrete sampling sites (bottom-axis vs. slope) (Supplementary Data 7). Indeed, Proteobacteria, Thaumarchaeota, and Planctomycetota were the most frequently predicted hosts in CD sediments (Fig. 5). Thaumarchaeota has been identified as potential hosts for archaeal viruses in deep-sea sediments collected from various oceanic realms27. In this study, Thaumarchaeota were identified as potential hosts of the most abundant vOTU (T1L10_NODE_10823), which was detected in 32/37 CD metagenomes and accounted for 7%~15% of viral community composition at aerobic top sediment layers (0–3 cmbsf) of bottom-axis sites (Supplementary Data 5).
Interestingly, we also identified four potentially new Thaumarchaeota viruses that were present in the CD viruses (Supplementary Data 7). These were mainly detected in top sediment layers of cores from four bottom-axis sites, while it comprised up to 25% of the viral community in sediment layers from one slope site (Fig. 3). We used vConTACT2 to cluster these previously undescribed Thaumarchaeota viruses with 119 other known marine archaeal viruses10,52,53. Our four identified Thaumarchaeota viruses were distinct, and did not cluster with other known archaeal viruses. This indicates that these viruses are possibly hadal Thaumarchaeota-related viruses endemic to Challenger Deep. However, this requires further investigation.
VIBRANT predicted that chemoautotrophic taxa involved in nitrogen cyclings such as Scalindua (Planctomycetota), Nitrospinaceae (Nitrospinota), and Nitrososphaerales (Thaumarchaeota) would be infected by lytic viruses in our CD sediment samples (Supplementary Data 7). Nonetheless, recent studies from coastal waters have reported viral isolates (e.g., Nitrosopumilus; spindle-shaped viruses) from marine Thaumarchaeota that cause chronic infections accompanied by growth inhibition of the host and severe reduction in rates of ammonia/nitrite oxidation/reduction10. Based on our analyses, lysogeny is a less likely lifestyle (5% assigned) in our identified CD viral contigs. Yet, the inability to assign lifestyle to the majority of the viral contigs (95%) might underestimate the importance of lysogeny, while at the same time preventing us from predicting the lytic viruses in CD. We suggest that lytic infections (if occurring) might be important and affect available nutrient pools across the V-shaped Challenger Deep (bottom-axis vs. slopes sites). This potential virus-induced effect on nutrient availability could act as a selective force in shaping microbial composition across the CD36,37.
Many of the chemoautotrophs reported to be susceptible hosts of lytic viruses at CD can also affect carbon pools due to their activities as carbon fixers. Nitrospinota as well as Nitrososphaerales are important carbon fixers in the dark ocean54, and are among the predicted CD hosts. Similarly, we suggest that CD viruses (if indeed lytic) affect pools of available labile organic carbon along the CD by affecting host populations that transform nitrogen pools and fix carbon. This can shape the distribution patterns of prokaryotes and associated viruses in CD sediments as suggested for other deep-sea sediments15,27 (Fig. 4a). Our arguments require further experimental and culture-based investigations; however, viruses are known to regulate energy gain processes that occur in the deep subsurface biosphere, and recycle and/or divert the flow of carbon in the global ocean when they destroy or manipulate their hosts55,56.
The predicted potential prokaryotic hosts for the 14 vOTUs may suggest that CD viruses target specific prokaryotic hosts in these CD sediments; however, this requires cautious interpretation considering that our host predictions were successful for ~1% of the viral population that we identified. We detected only a small fraction of predicted hosts (16%) that could be possibly infected by more than one vOTU, while only six vOTUs had multiple potential hosts at the species level. One vOTU (T1B5_NODE_7184) had the potential to infect different Rhodospirillales taxa (Supplementary Data 7), which can be abundant in surficial Mariana Trench sediments57.
Putative metabolic genes in CD viruses
To further explore the potential ecological role of identified viral elements in CD sediments, we examined the VIBRANT and DRAM-v annotations of CD vOTUs (Supplementary Data 8) for putative viral metabolic genes. Putative viral metabolic genes, like AMGs, can affect the efficiency of host-microbial metabolic pathways15,47,58 that often encode central metabolic enzymes59. We searched for candidate AMGs using the DRAM-v pipeline, and performed manual curation to verify their viral origin and position on the viral contigs (see Methods). We were able to identify 249 putative AMGs (Supplementary Data 8), most of which are affiliated with amino acid and carbohydrate metabolisms, and the production of cofactors and vitamins (Supplementary Fig. 5 and Supplementary Data 9). We compared the putative AMGs from the bottom-axis and slope sites for potential differences in abundances and metabolic functions. Overall, no discrete separation between the putative AMGs from the different sampling sites was observed despite the apparent topographical separation of the viral communities (bottom-axis vs. slope sites; Fig. 4a and Supplementary Fig. 6). Nonetheless, the identified putative AMGs were carried by different viral species, which suggests that these AMGs encode essential metabolic functions that could be beneficial to prokaryotic hosts at both sites, and thus enhance viral fitness at both slope and bottom-axis CD locations.
Putative AMGs involved in assimilatory sulfate reduction were common in CD viruses with higher relative abundances in the bottom-axis samples, compared to the slope samples (Supplementary Fig. 5b and Supplementary Data 8). Among these AMGs, we identified nine putative cysC and cysH genes that participate in the reduction of sulfate to sulfite (Supplementary Data 10). AMGs coding for cysH were also recently reported in deep-sea viruses from the Southwestern Indian Ocean sediments16.
Gene maps of representative CD viral contigs with co-occurrences of cysC and cysH and phage terminase genes within viral genomes are shown in Fig. 6a. To understand the origin of the putative CD AMGs related to sulfur assimilation, we recruited the top five (a) CysC proteins from the eggNOG database (v5.0) with close homology to our CD viral CysC proteins, and (b) CysC-encoding AMGs predicted from different viral data sets7,15,46,48,58, respectively. The similarity between our CD CysC-encoding AMGs and those CysC proteins deposited in the eggNOG database (v5.0) ranged from 27% to 47% (Supplementary Data 10). These similarity percentages were lower when we compared our putative CysC-encoding AMGs with those identified in global-scale ocean viral data sets, including those from deep-sea sediments and permanently anoxic basins7,15,46,48,58 (34% to 61%; Supplementary Fig. 7a). The phylogenetic analysis for three of our CD CysC proteins showed that they are distinct from their prokaryotic CysC homologs but cluster with CysC proteins from the different viral data sets referred to above (Fig. 6b). Similar phylogenetic results were obtained for CysH proteins (Supplementary Fig. 7b and Supplementary Fig. 8).
The distinct phylogenetic results and the moderate similarity of the CD Cys proteins to those that are publicly available, prompted us to perform protein structure prediction for the CysC protein from the viral contig T1B8_NODE_1222 (Fig. 6c). We used the web-based Phyre2 tool that predicts protein structure and function using homology with known proteins available in protein data banks60 (see Methods). The Phyre2 results predicted that CD CysC sequences belong to P-loop containing nucleoside triphosphate hydrolases and specifically to those hydrolases with a structural domain for adenosine-5’phosphosulfate kinase (APS kinase). The top three Phyre2 hits showed that 92–99% of the CysC protein sequences (109–135 residues) have been modeled with 99% confidence and exhibit structural homology with prokaryotic APS kinases. This suggests that CD CysC proteins could catalyze the phosphorylation of APS to 3’-phospho-APS, an intermediate step in sulfate assimilation. The putative AMGs related to assimilatory sulfate reduction could probably increase viral fitness in these CD sediments by enhancing the metabolic flexibility of the prokaryotic hosts as described elsewhere58. In addition, these AMGs might benefit hosts by ensuring an adequate supply of soluble thiolome pools (S-containing compounds such as amino acids and their intermediates) generated via sulfate assimilation, which can be utilized by hosts for heavy metal and metalloid detoxification (e.g., arsenic and mercury)61,62. Bio-accumulation of toxic metals has been detected in the water column of Mariana Trench63,64, while our data indicate accumulation of heavy metals like mercury and arsenic in CD sediments39, especially at the bottom-axis sites (Supplementary Fig. 9). AMGs related to sulfur metabolism are reported to be carried by viruses in various oceanic settings, including hadal realms31,65, which could indicate that these AMGs can increase viral fitness under the different redox and nutrient conditions detected in CD. We also argue that the identified AMGs related to sulfur metabolism in Challenger Deep could enhance the heavy metal detoxification mechanisms of the prokaryotic hosts, and thus, increase viral fitness.
We identified 13 potential viral genes involved in the biosynthesis and accumulation of lipid A (e.g., lpxA/D and kdsB)66,67, and maturation of lipopolysaccharides (LPS) (e.g., WaaE/F/L)68–70 (Supplementary Fig. 10 and Supplementary Data 8). T4-like phages were reported to encode LPS biosynthesis genes, which might alter the surface composition of the infected host to prevent multiple phage infections, or may simply act as ‘stuffer DNA’ for headful packaging in phages with a large genome like cyanophages71,72. The potential viral LPS genes identified in CD viruses are not cyanophage-related genes, and thus their role is unclear. We suggest that they could be involved in cell membrane stability73, and potentially benefit viral fitness by enhancing the structural and mechanical role of the host’s outer membrane that is exposed to extreme hydrostatic pressures (>1000 atm) at these hadal depths. Putative viral metabolic genes related to membrane biogenesis (e.g., cytidylyltransferase) were detected in viruses from hydrothermal sediments and it was suggested that they might enhance viral fitness by regulating the host’s membrane fluidity and phospholipid homeostasis74.
We also identified 5 putative viral metabolic genes involved in rhamnose biosynthesis in the CD viruses (Supplementary Data 9). Large DNA viruses like chloroviruses and prasinoviruses are known to synthesize rhamnose75,76. However, chloroviruses and prasinoviruses are primarily known to infect algae77,78, and were absent from our CD viruses. The putative rmlB/C AMG recovered from the CD sediments was found in complete circular viral contigs and was flanked by viral-specific genes (Supplementary Fig. 10). The protein similarities of the viral rmlB/C sequences with those from closely related proteins available in public databases ranged from 27% to 51% (Supplementary Data 10). Rhamnose-containing cell wall polysaccharides are considered phage receptors79,80, while rhamnose operons are reported to affect bacterial motility and biofilm formation81.
Whether the putative genes related to LPS and rhamnose biosynthesis could enhance host cell membrane flexibility or regulate the host’s ability for biofilm formation in hadal surficial sediments requires further investigation. Overall, we suggest that the possible benefit of the putative AMGs to CD viruses might depend on whether the viruses are lytic or lysogenic82, and the lytic state of lysogenic viruses is influenced by environmental conditions83. Finally, viral elements can also appear as prophages that upon infection can replicate and produce viral particles without destroying the host cell84.
Conclusions
Metagenomes generated from hadal sediments (>6 km depth) collected along the V-shaped Challenger Deep in the Mariana Trench, describe previously unidentified hadal viral communities that display a discrete separation along the trench. This discrete separation appears to be influenced by available nutrient sources that shape the prokaryotic (host) community structure between slope and bottom-axis sites. The presence of potentially lytic viral communities in CD may enable the high microbial density detected in this otherwise nutrient-poor hadal realm via the viral shunt that affects the in situ availability of labile carbon in those sediments. Future work will benefit from high-throughput culturing experiments of hadal viruses, as well as host-virus interaction experiments that can reveal the metabolic potential, viral shunt efficiency, and virus-host interaction networks in hadal nutrient cycling. The factors that control the biogeochemistry of the Challenger Deep sediments, including anthropogenic impacts, and that shape the metabolic and functional adaptions of hadal viruses and microbes will be topics of future research.
Methods
Sampling
Sediment cores from 13 CD sites were collected at water depths between 5400 m to 10911 m with three hadal cruises (Dayang37, Tansuo01, and Tansuo03) at the Challenger Deep in 2016–2017. The sediment cores were immediately sectioned into 2 or 3 cm layers, and stored at −80 °C for nucleic acids (DNA and RNA) extraction, and viral metagenomic analysis (Supplementary Table 1).
Nucleic acids extraction, metatranscriptome, and metagenome library preparations
We selected 37 sediment layers from different sediment cores covering slope and bottom-axis samples (Supplementary Table 1), and extracted DNA from 10 g~40 g using the PowerMax soil DNA isolation kit (MoBio, Carlsbad, CA, USA) following the manufacturer’s instructions. DNA concentrations were measured using a Qubit™ 2.0 Fluorometer (Invitrogen, Carlsbad, CA, USA). Samples with <2 ng μl−1 of DNA were concentrated using AMPure XP beads (Beckman Coulter, CA, USA) before the preparation of the libraries. The extracted DNA was sheared randomly using ultrasonication (Covaris M220, 200 cycles per burst for 65 s or 45 s) and was used to prepare DNA libraries with insertion sizes of ≥350 bp and up to 550 bp using the TruSeq Nano DNA Sample Prep Kit (Illumina, San Diego, CA, USA). For negative controls, we included two blanks that were treated and processed the same as the sediment samples. We concentrated each control DNA with AMPure XP beads (Beckman Coulter, CA, USA). The DNA concentrations of our control samples yielded <2 ng in a total volume of 10 ul, which was far less than the DNA input (100 ng) recommended by TruSeq Nano DNA Sample Prep Kit (Illumina, San Diego, CA, USA). Due to this constraint we prepared the libraries of our two controls using the TD503 kit (Vazyme, Nanjing, China) with an insertion size of 350 bp.
Total RNA was extracted from 10 g of three sediment layers (6–9, 12–15, and 18–21 cmbsf) collected from the T3L11 site (10,908 m depth) using a PowerSoil Total RNA Isolation Kit (MoBio, Carlsbad, CA, USA) following the manufacturer’s instructions. The RNA extracts were treated with TURBO DNase (Invitrogen, Waltham, MA, USA) to remove genomic DNA. The absence of carryover DNA was confirmed with PCR reactions using prokaryotic primers for the V3–V4 region 341 F (5’-CCTAYGGGRBGCASCAG-3’) and 802 R (5’-TACNVGGGTATCTAATCC-3’). Each 50 μl reaction contained 1.25 U PrimeSTAR HS DNA Polymerase (Takara, Japan), 5× PrimeSTAR Buffer (Takara, Japan), 200 mM dNTPs (Takara, Japan, dNTP Mixture), and 0.3 μΜ of each primer (final concentrations). The PCR reactions were performed at 94 °C for 10 s, followed by 35 cycles of 98 °C (10 s), 55 °C (10 s), and 72 °C (30 s). We used the Ovation® RNA-Seq System V2 Kit (NuGEN, San Carlos, CA, USA) to make double-stranded cDNA (ds-cDNA) from 1 ng total RNA with random primers. The ds-cDNA was used to prepare the metatranscriptome libraries as described in the metagenome library preparation.
Metagenome sequencing and assembly
Libraries were sequenced using 300 bp paired-end reads at Miseq platform or 150 bp paired-end reads at Novaseq 6000 or Hiseq 2000 platform. Fastp (v.0.20.0)85 was used to remove adapter and low-quality reads (assigned by >20% of the read length have quality score <20 or read length <50) with parameters (-w 16 -q 20 -u 20 -g -c -W 5 −3 -l 50). For metatranscriptomes, reads that mapped onto the rRNA sequences by SortMeRNA (v.2.1)86 and sequences in an in-house contaminant database (including sequences of mouse, human and common laboratory contaminant bacteria genomes87 downloaded from NCBI) by Bowtie2(v.2.4.1)88 with setting -N 1 were discarded. The high-quality metagenome reads for each site (MC02, D1T1, D1T2,T1B3, T1B5, T1B8, T1L6, T1B10, T1B11, T1L10, T3L8, T3L11, and T3L14) were merged for assembly using SPAdes (v3.13)89 with a k-mer set of 21, 33, 55, 77, 99 and 127 under the ‘--careful’ mode to achieve the best assembly results for low-abundance microbial groups90. Contigs ≤10,000 bp were removed prior to viral identification.
Identification, decontamination, and classification of CD viruses
We identified viral sequences from metagenomic assemblies in four steps using the published standards91, and the following enhancements: 1. Metagenomic contigs (>10 kb) from co-assembled metagenomes of each site were processed with the viral identification tools wrapped in What the Phage (Version 1.0.1, setting: --filter 10000 –identify40; tools: MARVEL92, VirFinder93, PPR-Meta94, VirSorter95, MetaPhinder96, DeepVirFinder97, VIBRANT50, VirNet98, Phigaro99, Virsorter2100, and Seeker101) to obtain putative viral contigs. 2. We annotated the predicted putative virus contigs using the eggNOG database (v5.0.0)102. Previous studies retained those contigs as putative viral if they contained hallmark viral genes including those contigs with viral sequences that had high percentages (e.g., ≥80%) of genes of unknown and hypothetical function103,104. Similarly, we retained those contigs as putative viral if they contained ≥2 virus-specific genes (annotation contains words from the list: “capsid”, “phage”, “terminase”, “base plate”, “baseplate”, “prohead”, “virion”, “virus”, “viral”, “tape measure”, “tapemeasure neck”, “tail”, “head”, “bacteriophage”, “prophage”, “portal”, “DNA packaging”, “T4”, “p22”, and “holin”)105, or contained viral sequences and had ≥70% of proteins assigned as hypothetical protein, unknown function or Viruses. 3. The retained contigs that contained prokaryote-specific genes (e.g., ribosomal genes) were further removed. 4. CheckV (v.0.8.1)44 was used to assess the quality of all putative viral contigs and to detect the viral-host boundaries for subsequent removal of host region from provirus. Contigs without determined completeness and viral-specific genes (as predicted by CheckV44) must contain viral signatures using benchmarked viral prediction tools (DeepVirFinder, VirSorter, VirSorter2, MARVEL, and VIBRANT) with conservative cutoff, published in standard operating procedures91. The putative viral contigs that remained after applying the four steps explained above were considered high-confidence viral contigs in this study.
Bowtie288 was used to map reads from the two blank controls. Viral contigs mapped with ≥1 read(s) from the controls were considered potential contaminants. This resulted in the removal of ten viral contigs from further analysis.
All positive and host-contamination-free viral contigs (simplified as bona fide viral contigs) were clustered into vOTUs with cd-hit-est (v4.8.1, setting: -c 0.95 -aS 0.85 -n 10 -d 0)106 at species level, based on >85% alignment of the smallest contigs at 95% average nucleotide identity41. We adopted a previously described majority-rules approach to assign viral family7. In brief, we used blastp (version 2.9.0+) to query all proteins from CD vOTUs against NCBI viral RefSeq database release 208 (https://ftp.ncbi.nlm.nih.gov/refseq/release/viral/, downloaded on 4 January 2022). A vOTU was assigned as a family if ≥50% of its proteins hit to family level with a bitscore ≥50. The remaining unassigned vOTUs were classified using Demovir pipeline (https://github.com/feargalr/Demovir)107 with default settings. Demovir pipeline searched proteins from CD vOTUs against a redundant viral subset of the TrEMBL database (cluster at 95% identity; https://www.uniprot.org/downloads). We assigned families to unassigned viral contigs only if the similarity of the proteins to a homolog for one taxon reached ≥50%.
Comparison to viruses from other data sets
We compared the 1628 identified CD vOTUs at species and genus level using vConTACT245 with viral contigs from published databases, including: (i) hadal and non-hadal deep-sea sediments15,30 (n = 7305); (ii) wetland sediments46 (n = 1212); (iii) thawed permafrost47 (n = 1896); (iv) seawater (Global Ocean Virome 2.0, n = 195,728)7 and (v) reference sequences (Prokaryotic Viral RefSeq85-ICTV, n = 1825). For species level, cd-hit-est (v4.8.1, setting: -c 0.95 -aS 0.85 -n 10 -d 0) was used to identify CD vOTUs as novel (nucleic acid similarity <95%). Prodigal v2.6.3108 was used to predict open reading frames (ORFs) for all viral contigs used for comparisons, and the predicted protein sequences were imported to vConTACT245 for identification of previously undescribed viral clusters (genus level).
CD gene annotation and metabolic genes analysis
CD viral genes in viral contigs were predicted by Prodigal108. All predicted proteins were annotated against eggNOG v5.0.0 database by eggnog-mapper with the default setting. High-confidence viral contigs were imported to DRAM-v (v.1.2.4)109 and VIBRANT (v.1.2.1)50 for annotation and identification of AMGs based on KEGG, Pfam, UniRef90, dbCAN, RefSeq viral, VOGDB (including pVOGs) and MEROPS databases, using default parameters. We removed AMGs identified from VIBRANT50 with T, B flag, auxiliary scores >3. We also removed those AMGs that were not included in the manually curated list of potential AMGs that we compiled after the DRAM-v109 results (see below). All remaining putative AMGs were further checked for their position on contigs to ensure their viral origin. We retained only those manually curated AMGs that contained clearly-identified viral genes (Supplementary Data 9).
Further analyses were also conducted on the key AMGs identified by VIBRANT50. Viral AMGs assigned as K00860 (cysC, APS kinase gene) were identified in the viral genomes, and they were compared to protein sequences from eggNOG v5.0.0 database and publicly available viruses (blastp, 10−5 for E value) to recruit relevant reference sequences. We recruited the top 5 reference CysC sequences in eggNOG v5.0.0 database and publicly available viruses, respectively, for each identified viral CysC protein. We aligned all CysC protein sequences (CD viral CysC proteins, CysC reference proteins from eggNOG database, and two CysC sequences with experimental evidence at protein level from the Uniprot database) and two CysH protein sequences as outgroup with Mafft (v7.453, setting: --maxiterate 1000 -localpair)110, and filtered them with TrimAL (v1.4.rev15111; default parameters) to remove poorly aligned columns. Maximum-likelihood phylogenetic trees were reconstructed using iqtree (v2.0.3, setting: -m MFP -bb 10000), which automatically searched the best-fit partition model before tree reconstruction112. The resulting newick file with bootstrap value was uploaded to iTOL v4113 for visualization. The structure prediction for CysC protein was performed with the web-based Phyre2 tool60. Structural homologies were analyzed using models generated by Phyre2 using a confidence threshold of >98%, and identity threshold of >29%. The accuracy of the models constructed using Phyre2 is described as extremely high when the sequence identity is above 30–40%. However, lower sequence identities can be equally accurate and useful as long as the confidence threshold is high, which was the case in our examined CysC proteins. The functional domain for CysC was identified and annotated by SMART114. This workflow was also applied to analyze other key viral metabolic genes.
Prediction of virus and host linkages
We collected 586 MAGs39 recovered from the metagenomes used for prediction of viruses in this study. Virus-host interaction was predicted by four different in silico methods47,51 that include: 1. search for sequence homology by calculating the nucleotide identity of vOTUs and prokaryotic MAG sequences using BLASTn (v2.9.0+, setting: -evalue 0.001 -perc_identity 70). The conditions for retaining positive matches were (a) alignment length ≥2500 bp, (b) minimum nucleotide identity ≥70%, and (c) alignment coverage <90% of host sequences5. 2. search for matches between viral contigs and host CRISPR. Spacers and repeats of host were predicted in clean reads of metagenomes using crass (v1.0.1)115 using default settings, and were extracted using crisprtools (https://github.com/ctSkennerton/crisprtools). Spacers and repeats in assemblies of metagenomes were predicted by MinCED (v.0.4.2)116. The combined spacers from reads and assemblies were compared to sequences of viral contigs using BLASTn (v2.9.0 + ). We retained matches that contained ≤1 mismatch and had an E value of ≤10−5. For each spacer with a match in any genome of CD vOTU, the repeat sequence from the same assembled CRISPR region was compared to all prokaryotic population genomes via BLASTn (v2.9.0+, 100% nucleotide identity and E value of >10−10). We performed this step to link the assembled CRISPR region (and, therefore, any viruses matching spacers in that CRISPR region), to a host. 3. search for viral and host tRNA genes using tRNA-scan (v. 2.0.7)117. Only exact matches were considered. 4. search for similarity of k-mer frequencies using WIsH118 with default parameters to predict the potential host of a query virus. Sequence similarity-based matches, were also manually curated to avoid false positive results caused by viral contigs binned into prokaryotic genomes.
All the prediction results generated using the four steps above were scored by VirMatcher. VirMatcher (https://bitbucket.org/MAVERICLab/virmatcher/src/master/) is the only available tool that provides a confidence score for host predictions. High-confidence predictions (scores ≥3) were used to assign hosts in this study. The coverage of hosts (species level) in each metagenome was calculated by CoverM v0.6.1 (parameters: -m trimmed_mean --min-read-percent-identity 0.95 --min-read-aligned-length 50 --proper-pairs-only --min-covered-fraction 0.1). The relative abundance of a host was calculated by the coverage of the host divided by the total coverage of all MAGs (species level).
Viral abundance and diversity
To calculate the coverage (sequencing depth) of each viral contig, clean and qualified reads from each sample were mapped against all CD viral contigs using BWA (v 0.7.17)119 and sorted with samtools (v1.9)120 to generate a bam file. CoverM v0.6.1 (https://github.com/wwood/CoverM) was used to filter low-quality mappings in bam file and generate the abundance profiles for samples (parameters: contig mode for each viral contig, -m mean --min-read-percent-identity 0.95 --min-read-aligned-length 50 --min-covered-fraction 10). Contigs with sequencing coverage rate <10% were reported as having zero sequencing depth. Normalized coverage values (sequencing depths per giga base sequencing data) were used to represent relative abundances of viruses for comparison across metagenomes. The normalized coverage was multiplied by 10 and ceiled to represent the times of one species present in each sample for the calculation of the ACE and Chao1 indices. The metatranscriptome reads were also mapped to viral contigs/genes using BWA (v 0.7.17)119 and were then counted with aligned length ≥50 bp and identity ≥95% by CoverM v0.6.1 (parameters: --min-read-percent-identity 0.95 --min-read-aligned-length 50) to generate abundance profile.
Statistics and reproducibility
All statistical analyses were performed using R (v.4.0.4). We used vegan (v.2.5–7) in R to calculate the alpha and beta diversities of viral communities and Bray–Curtis distances for vOTUs abundance profiles. Pheatmap package (v.1.0.12) was used to cluster metagenomes in heatmap. For comparing the vOTUs between slope and bottom-axis sites, we used Shannon, Simpson, ACE, and Chao1 indices. The detected vOTUs were tested using the Wilcoxon test with the ggpubr package (v.0.4.0).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We give special thanks to the members of the R/V DY37, TS01, and TS03 for their invaluable efforts in the sampling cruises. We thank J. Li, S.X. Wang, Y.Z. Xin, J. Chen, and D.S. Cai for their skillful handling of the lander and sediment sampler. We also thank the supercomputer center of Sanya University. This research was supported by the Hainan Provincial Natural Science Foundation of China (No. 322CXTD531).
Author contributions
Y.Z. and Y.W. conceived and designed the study. Y.Z. conducted bioinformatic analyses and results visualization. Y.Z. conceived and generated Fig. 1. Y.Z. and P.M. analyzed data and summarized the results. Y.Z. and P.M. drafted the manuscript. V.P.E., D.V., M.B.S., and Y.W. critically revised the manuscript.
Peer review
Peer review information
Communications Biology thanks Jason McDermott and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: George Inglis. Peer reviewer reports are available.
Data availability
The raw reads of CD metagenomes and metatranscriptomes and MAGs were deposited in GenBank under BioProject number PRJNA635214. The DNA sequences of the 1628 viral contigs were deposited in NCBI GenBank under accession number JAODGZ000000000. The marine viruses-related data sets utilized in this study are cited in Supplementary Table 2. Source data for figures also provided in Supplementary Data 11.
Code availability
All software and R packages used are open source and described in the Methods section. No custom code was used to analyze data in this study, and further details are available on request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Ying-Li Zhou, Paraskevi Mara.
Supplementary information
The online version contains supplementary material available at 10.1038/s42003-022-04027-y.
References
- 1.Suttle CA. Marine viruses — major players in the global ecosystem. Nat. Rev. Microbiol. 2007;5:801–812. doi: 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]
- 2.Angly FE, et al. The marine viromes of four oceanic regions. PLOS Biol. 2006;4:2121–2131. doi: 10.1371/journal.pbio.0040368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Labonté JM, Suttle CA. Previously unknown and highly divergent ssDNA viruses populate the oceans. ISME J. 2013;7:2169–2177. doi: 10.1038/ismej.2013.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Brum JR, et al. Patterns and ecological drivers of ocean viral communities. Science. 2015;348:1261498. doi: 10.1126/science.1261498. [DOI] [PubMed] [Google Scholar]
- 5.Roux S, et al. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature. 2016;537:689–693. doi: 10.1038/nature19366. [DOI] [PubMed] [Google Scholar]
- 6.Allen LZ, et al. The Baltic sea virome: diversity and transcriptional activity of DNA and RNA viruses. mSystems. 2017;2:e00125–00116. doi: 10.1128/mSystems.00125-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gregory AC, et al. Marine DNA viral macro- and microdiversity from pole to pole. Cell. 2019;177:1109–1123. doi: 10.1016/j.cell.2019.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hurwitz BL, Hallam SJ, Sullivan MB. Metabolic reprogramming by viruses in the sunlit and dark ocean. Genome Biol. 2013;14:R123. doi: 10.1186/gb-2013-14-11-r123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Howard-Varona C, et al. Phage-specific metabolic reprogramming of virocells. ISME J. 2020;14:881–895. doi: 10.1038/s41396-019-0580-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kim J-G, et al. Spindle-shaped viruses infect marine ammonia-oxidizing thaumarchaea. Proc. Natl Acad. Sci. USA. 2019;116:15645–15650. doi: 10.1073/pnas.1905682116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Danovaro R, et al. Major viral impact on the functioning of benthic deep-sea ecosystems. Nature. 2008;454:1084–1087. doi: 10.1038/nature07268. [DOI] [PubMed] [Google Scholar]
- 12.Danovaro R, et al. Viriobenthos in freshwater and marine sediments: a review. Freshw. Biol. 2008;53:1186–1213. [Google Scholar]
- 13.Engelhardt T, Kallmeyer J, Cypionka H, Engelen B. High virus-to-cell ratios indicate ongoing production of viruses in deep subsurface sediments. ISME J. 2014;8:1503–1509. doi: 10.1038/ismej.2013.245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Middelboe M, Glud RN, Filippini M. Viral abundance and activity in the deep sub-seafloor biosphere. Aquat. Micro. Ecol. 2011;63:1–8. [Google Scholar]
- 15.Li Z, et al. Deep sea sediments associated with cold seeps are a subsurface reservoir of viral diversity. ISME J. 2021;15:2366–2378. doi: 10.1038/s41396-021-00932-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zheng X, et al. Extraordinary diversity of viruses in deep-sea sediments as revealed by metagenomics without prior virion separation. Environ. Microbiol. 2021;23:728–743. doi: 10.1111/1462-2920.15154. [DOI] [PubMed] [Google Scholar]
- 17.Helton RR, Liu L, Wommack KE. Assessment of factors influencing direct enumeration of viruses within estuarine sediments. Appl Environ. Microbiol. 2006;72:4767–4774. doi: 10.1128/AEM.00297-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pan D, Morono Y, Inagaki F, Takai K. An improved method for extracting viruses from sediment: detection of far more viruses in the subseafloor than previously reported. Front Microbiol. 2019;10:878. doi: 10.3389/fmicb.2019.00878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Armanious A, et al. Viruses at solid–water interfaces: a systematic assessment of interactions driving adsorption. Environ. Sci. Technol. 2016;50:732–743. doi: 10.1021/acs.est.5b04644. [DOI] [PubMed] [Google Scholar]
- 20.Trubl G, et al. Optimization of viral resuspension methods for carbon-rich soils along a permafrost thaw gradient. PeerJ. 2016;4:e1999. doi: 10.7717/peerj.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Maat DS, Prins MA, Brussaard CPD. Sediments from arctic tide-water glaciers remove coastal marine viruses and delay host infection. Viruses. 2019;11:123. doi: 10.3390/v11020123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Loveland JP, Ryan JN, Amy GL, Harvey RW. The reversibility of virus attachment to mineral surfaces. Colloids Surf. A Physicochem. Eng. Asp. 1996;107:205–221. [Google Scholar]
- 23.Fuhs GW, Chen M, Sturman LS, Moore RS. Virus adsorption to mineral surfaces is reduced by microbial overgrowth and organic coatings. Microb. Ecol. 1985;11:25–39. doi: 10.1007/BF02015106. [DOI] [PubMed] [Google Scholar]
- 24.Whitman WB, Coleman DC, Wiebe WJ. Prokaryotes: the unseen majority. Proc. Natl Acad. Sci. USA. 1998;95:6578–6583. doi: 10.1073/pnas.95.12.6578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cai L, et al. Active and diverse viruses persist in the deep sub-seafloor sediments over thousands of years. ISME J. 2019;13:1857–1864. doi: 10.1038/s41396-019-0397-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Manea E, et al. Viral infections boost prokaryotic biomass production and organic C cycling in hadal trench sediments. Front Microbiol. 2019;10:1952. doi: 10.3389/fmicb.2019.01952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Danovaro R, et al. Virus-mediated archaeal hecatomb in the deep seafloor. Sci. Adv. 2016;2:e1600492. doi: 10.1126/sciadv.1600492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yoshida M, Takaki Y, Eitoku M, Nunoura T, Takai K. Metagenomic analysis of viral communities in (hado)pelagic sediments. PLoS One. 2013;8:e57271. doi: 10.1371/journal.pone.0057271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bäckström D, et al. Virus genomes from deep sea sediments expand the ocean megavirome and support independent origins of viral gigantism. mBio. 2019;10:e02497–02418. doi: 10.1128/mBio.02497-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jian H, et al. Diversity and distribution of viruses inhabiting the deepest ocean on Earth. ISME J. 2021;15:3094–3110. doi: 10.1038/s41396-021-00994-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Anantharaman K, et al. Sulfur oxidation genes in diverse deep-sea viruses. Science. 2014;344:757–760. doi: 10.1126/science.1252229. [DOI] [PubMed] [Google Scholar]
- 32.Zhou H, et al. Revealing the viral community in the hadal sediment of the New Britain Trench. Genes. 2021;12:990. doi: 10.3390/genes12070990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dell’Anno A, Corinaldesi C, Danovaro R. Virus decomposition provides an important contribution to benthic deep-sea ecosystem functioning. Proc. Natl Acad. Sci. USA. 2015;112:E2014–E2019. doi: 10.1073/pnas.1422234112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Jamieson AJ, Fujii T, Mayor DJ, Solan M, Priede IG. Hadal trenches: the ecology of the deepest places on Earth. Trends Ecol. Evol. 2010;25:190–197. doi: 10.1016/j.tree.2009.09.009. [DOI] [PubMed] [Google Scholar]
- 35.Glud RN, et al. High rates of microbial carbon turnover in sediments in the deepest oceanic trench on Earth. Nat. Geosci. 2013;6:284–288. [Google Scholar]
- 36.Luo M, et al. Benthic carbon mineralization in hadal trenches: insights from in situ determination of benthic oxygen consumption. Geophys. Res. Lett. 2018;45:2752–2760. [Google Scholar]
- 37.Hiraoka S, et al. Microbial community and geochemical analyses of trans-trench sediments for understanding the roles of hadal environments. ISME J. 2020;14:740–756. doi: 10.1038/s41396-019-0564-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cui G, Li J, Gao Z, Wang Y. Spatial variations of microbial communities in abyssal and hadal sediments across the Challenger Deep. PeerJ. 2019;7:e6961. doi: 10.7717/peerj.6961. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Zhou Y-L, Mara P, Cui G-J, Edgcomb VP, Wang Y. Microbiomes in the Challenger Deep slope and bottom-axis sediments. Nat. Commun. 2022;13:1515. doi: 10.1038/s41467-022-29144-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Marquet M., et al. What the Phage: a scalable workflow for the identification and analysis of phage sequences. bioRxivhttps://www.biorxiv.org/content/10.1101/2020.07.24.219899v1 (2020). [DOI] [PMC free article] [PubMed]
- 41.Roux S, et al. Minimum Information about an Uncultivated Virus Genome (MIUViG) Nat. Biotechnol. 2019;37:29–37. doi: 10.1038/nbt.4306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Gregory AC, et al. Genomic differentiation among wild cyanophages despite widespread horizontal gene transfer. BMC Genomics. 2016;17:930. doi: 10.1186/s12864-016-3286-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bobay L-M, Ochman H. Biological species are universal across life’s domains. Genome Biol. Evol. 2017;9:491–501. doi: 10.1093/gbe/evx026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Nayfach S., et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol.39, 578–585 (2020). [DOI] [PMC free article] [PubMed]
- 45.Bin Jang H, et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 2019;37:632–639. doi: 10.1038/s41587-019-0100-8. [DOI] [PubMed] [Google Scholar]
- 46.Dalcin Martins P, et al. Viral and metabolic controls on high rates of microbial sulfur and carbon cycling in wetland ecosystems. Microbiome. 2018;6:138. doi: 10.1186/s40168-018-0522-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Emerson JB, et al. Host-linked soil viral ecology along a permafrost thaw gradient. Nat. Microbiol. 2018;3:870–880. doi: 10.1038/s41564-018-0190-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Zhao J, et al. Novel viral communities potentially assisting in carbon, nitrogen, and sulfur metabolism in the upper slope sediments of Mariana Trench. mSystems. 2022;7:e01358–01321. doi: 10.1128/msystems.01358-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Walker PJ, et al. Changes to virus taxonomy and to the International Code of Virus Classification and Nomenclature ratified by the International Committee on Taxonomy of Viruses (2021) Arch. Virol. 2021;166:2633–2648. doi: 10.1007/s00705-021-05156-1. [DOI] [PubMed] [Google Scholar]
- 50.Kieft K, Zhou Z, Anantharaman K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome. 2020;8:90. doi: 10.1186/s40168-020-00867-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gregory AC, et al. The gut virome database reveals age-dependent patterns of virome diversity in the human gut. Cell Host Microbe. 2020;28:724–740 e728. doi: 10.1016/j.chom.2020.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Nishimura Y, et al. Environmental viral genomes shed new light on virus-host interactions in the ocean. mSphere. 2017;2:e00359–00316. doi: 10.1128/mSphere.00359-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Vik DR, et al. Putative archaeal viruses from the mesopelagic ocean. PeerJ. 2017;5:e3428. doi: 10.7717/peerj.3428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pachiadaki MG, et al. Major role of nitrite-oxidizing bacteria in dark ocean carbon fixation. Science. 2017;358:1046–1051. doi: 10.1126/science.aan8260. [DOI] [PubMed] [Google Scholar]
- 55.Daly RA, et al. Viruses control dominant bacteria colonizing the terrestrial deep biosphere after hydraulic fracturing. Nat. Microbiol. 2019;4:352–361. doi: 10.1038/s41564-018-0312-6. [DOI] [PubMed] [Google Scholar]
- 56.Wilhelm SW, Suttle CA. Viruses and nutrient cycles in the sea: viruses play critical roles in the structure and function of aquatic food webs. BioScience. 1999;49:781–788. [Google Scholar]
- 57.Peoples LM, et al. Microbial community diversity within sediments from two geographically separated hadal trenches. Front. Microbiol. 2019;10:347. doi: 10.3389/fmicb.2019.00347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Mara P, et al. Viral elements and their potential influence on microbial processes along the permanently stratified Cariaco Basin redoxcline. ISME J. 2020;14:3079–3092. doi: 10.1038/s41396-020-00739-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Breitbart M, Thompson LR, Suttle CA, Sullivan MB. Exploring the vast diversity of marine viruses. Oceanography. 2007;20:135–139. [Google Scholar]
- 60.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ. The Phyre2 web portal for protein modeling, prediction and analysis. Nat. Protoc. 2015;10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Rubino FM. Toxicity of glutathione-binding metals: a review of targets and mechanisms. Toxics. 2015;3:20–62. doi: 10.3390/toxics3010020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Jozefczak M, Remans T, Vangronsveld J, Cuypers A. Glutathione is a key player in metal-induced oxidative stress defenses. Int J. Mol. Sci. 2012;13:3145–3175. doi: 10.3390/ijms13033145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Liu M, et al. Methylmercury bioaccumulation in deepest ocean fauna: implications for ocean mercury biotransport through food webs. Environ. Sci. Technol. Lett. 2020;7:469–476. [Google Scholar]
- 64.Welty CJ, Sousa ML, Dunnivant FM, Yancey PH. High-density element concentrations in fish from subtidal to hadal zones of the Pacific Ocean. Heliyon. 2018;4:e00840. doi: 10.1016/j.heliyon.2018.e00840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Chen P, et al. Revealing the full biosphere structure and versatile metabolic functions in the deepest ocean sediment of the Challenger Deep. Genome Biol. 2021;22:207. doi: 10.1186/s13059-021-02408-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Williams AH, Raetz CRH. Structural basis for the acyl chain selectivity and mechanism of UDP-N-acetylglucosamine acyltransferase. Proc. Natl Acad. Sci. USA. 2007;104:13543–13550. doi: 10.1073/pnas.0705833104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Ahmad S, Raza S, Abro A, Liedl KR, Azam SS. Toward novel inhibitors against KdsB: a highly specific and selective broad-spectrum bacterial enzyme. J. Biomol. Struct. Dyn. 2019;37:1326–1345. doi: 10.1080/07391102.2018.1459318. [DOI] [PubMed] [Google Scholar]
- 68.Gronow S, Brabetz W, Brade H. Comparative functional characterization in vitro of heptosyltransferase I (WaaC) and II (WaaF) from Escherichia coli. Eur. J. Biochem. 2000;267:6602–6611. doi: 10.1046/j.1432-1327.2000.01754.x. [DOI] [PubMed] [Google Scholar]
- 69.Abeyrathne PD, Daniels C, Poon KKH, Matewish MJ, Lam JS. Functional characterization of WaaL, a ligase associated with linking O-antigen polysaccharide to the core of Pseudomonas aeruginosa Lipopolysaccharide. J. Bacteriol. 2005;187:3002–3012. doi: 10.1128/JB.187.9.3002-3012.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Kneidinger B, et al. Biosynthesis pathway of ADP-l-glycero-β-manno-heptose in Escherichia coli. J. Bacteriol. 2002;184:363–369. doi: 10.1128/JB.184.2.363-369.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Sullivan MB, Coleman ML, Weigele P, Rohwer F, Chisholm SW. Three Prochlorococcus cyanophage genomes: Signature features and ecological interpretations. Plos Biol. 2005;3:790–806. doi: 10.1371/journal.pbio.0030144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Sullivan MB, et al. Genomic analysis of oceanic cyanobacterial myoviruses compared with T4-like myoviruses from diverse hosts and environments. Environ. Microbiol. 2010;12:3035–3056. doi: 10.1111/j.1462-2920.2010.02280.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Auer GK, Weibel DB. Bacterial cell mechanics. Biochemistry. 2017;56:3710–3724. doi: 10.1021/acs.biochem.7b00346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Castelán-Sánchez HG, et al. Extremophile deep-sea viral communities from hydrothermal vents: structural and functional analysis. Mar. Genomics. 2019;46:16–28. doi: 10.1016/j.margen.2019.03.001. [DOI] [PubMed] [Google Scholar]
- 75.Chothi MP, et al. Identification of an L-rhamnose synthetic pathway in two nucleocytoplasmic large DNA viruses. J. Virol. 2010;84:8829–8838. doi: 10.1128/JVI.00770-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Bachy C., et al. Viruses infecting a warm water picoeukaryote shed light on spatial co-occurrence dynamics of marine viruses and their hosts. ISME J.15, 3129–3147 (2021). [DOI] [PMC free article] [PubMed]
- 77.Zhang W, et al. Four novel algal virus genomes discovered from Yellowstone Lake metagenomes. Sci. Rep. 2015;5:15131. doi: 10.1038/srep15131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Clerissi C, et al. Prasinovirus distribution in the Northwest Mediterranean Sea is affected by the environment and particularly by phosphate availability. Virology. 2014;466–467:146–157. doi: 10.1016/j.virol.2014.07.016. [DOI] [PubMed] [Google Scholar]
- 79.Mistou MY, Sutcliffe IC, van Sorge NM. Bacterial glycobiology: rhamnose-containing cell wall polysaccharides in Gram-positive bacteria. FEMS Microbiol. Rev. 2016;40:464–479. doi: 10.1093/femsre/fuw006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wendlinger G, Loessner MJ, Scherer S. Bacteriophage receptors on Listeria monocytogenes cells are the N-acetylglucosamine and rhamnose substituents of teichoic acids or the peptidoglycan itself. Microbiology. 1996;142:985–992. doi: 10.1099/00221287-142-4-985. [DOI] [PubMed] [Google Scholar]
- 81.Michael V, et al. Biofilm plasmids with a rhamnose operon are widely distributed determinants of the ‘swim-or-stick’ lifestyle in roseobacters. ISME J. 2016;10:2498–2513. doi: 10.1038/ismej.2016.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Chen X, Weinbauer MG, Jiao N, Zhang R. Revisiting marine lytic and lysogenic virus-host interactions: Kill-the-Winner and Piggyback-the-Winner. Sci. Bull. 2021;66:871–874. doi: 10.1016/j.scib.2020.12.014. [DOI] [PubMed] [Google Scholar]
- 83.Breitbart M, Bonnain C, Malki K, Sawaya NA. Phage puppet masters of the marine microbial realm. Nat. Microbiol. 2018;3:754–766. doi: 10.1038/s41564-018-0166-y. [DOI] [PubMed] [Google Scholar]
- 84.Howard-Varona C, Hargreaves KR, Abedon ST, Sullivan MB. Lysogeny in nature: mechanisms, impact and ecology of temperate phages. ISME J. 2017;11:1511–1520. doi: 10.1038/ismej.2017.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Kopylova E, Noé L, Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–3217. doi: 10.1093/bioinformatics/bts611. [DOI] [PubMed] [Google Scholar]
- 87.Salter SJ, et al. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014;12:87. doi: 10.1186/s12915-014-0087-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Bankevich A, et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Gao Z-M, et al. In situ meta-omic insights into the community compositions and ecological roles of hadal microbes in the Mariana Trench. Environ. Microbiol. 2019;21:4092–4108. doi: 10.1111/1462-2920.14759. [DOI] [PubMed] [Google Scholar]
- 91.Pratama A. A., et al. Expanding standards in viromics: in silico evaluation of dsDNA viral genome identification, classification, and auxiliary metabolic gene. PeerJ9, e11447 (2021). [DOI] [PMC free article] [PubMed]
- 92.Amgarten D, Braga LPP, da Silva AM, Setubal JC. MARVEL, a tool for prediction of bacteriophage sequences in metagenomic bins. Front Genet. 2018;9:304. doi: 10.3389/fgene.2018.00304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Ren J, Ahlgren NA, Lu YY, Fuhrman JA, Sun F. VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data. Microbiome. 2017;5:69. doi: 10.1186/s40168-017-0283-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Fang Z., et al. PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning. GigaScience8, giz066 (2019). [DOI] [PMC free article] [PubMed]
- 95.Roux S, Enault F, Hurwitz BL, Sullivan MB. VirSorter: mining viral signal from microbial genomic data. PeerJ. 2015;3:e985. doi: 10.7717/peerj.985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Jurtz VI, Villarroel J, Lund O, Voldby Larsen M, Nielsen M. MetaPhinder—identifying bacteriophage sequences in metagenomic data sets. PLoS One. 2016;11:e0163111. doi: 10.1371/journal.pone.0163111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Ren J, et al. Identifying viruses from metagenomic data using deep learning. Quant. Biol. 2020;8:64–77. doi: 10.1007/s40484-019-0187-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Abdelkareem A. O., Khalil M. I., Elaraby M., Abbas H. & Elbehery A. H. A. VirNet: deep attention model for viral reads identification. In: 2018 13th International Conference on Computer Engineering and Systems (ICCES) (2018).
- 99.Starikova EV, et al. Phigaro: high-throughput prophage sequence annotation. Bioinformatics. 2020;36:3882–3884. doi: 10.1093/bioinformatics/btaa250. [DOI] [PubMed] [Google Scholar]
- 100.Guo J, et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome. 2021;9:37. doi: 10.1186/s40168-020-00990-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Auslander N, Gussow AB, Benler S, Wolf YI, Koonin EV. Seeker: alignment-free identification of bacteriophage genomes by deep learning. Nucleic Acids Res. 2020;48:e121–e121. doi: 10.1093/nar/gkaa856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Huerta-Cepas J, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2018;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Paez-Espino D, Pavlopoulos GA, Ivanova NN, Kyrpides NC. Nontargeted virus sequence discovery pipeline and virus clustering for metagenomic data. Nat. Protoc. 2017;12:1673–1682. doi: 10.1038/nprot.2017.063. [DOI] [PubMed] [Google Scholar]
- 104.Gao S-M, et al. Depth-related variability in viral communities in highly stratified sulfidic mine tailings. Microbiome. 2020;8:89. doi: 10.1186/s40168-020-00848-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Dudek NK, Sun C, Burstein D, Kantor R, Relman D. Novel microbial diversity and functional potential in the marine mammal oral microbiome. Curr. Biol. 2017;27:3752–3762.e3756. doi: 10.1016/j.cub.2017.10.040. [DOI] [PubMed] [Google Scholar]
- 106.Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Shkoporov AN, et al. The human gut virome is highly diverse, stable, and individual specific. Cell Host Microbe. 2019;26:527–541.e525. doi: 10.1016/j.chom.2019.09.009. [DOI] [PubMed] [Google Scholar]
- 108.Hyatt D, et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinform. 2010;11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Shaffer M, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48:8883–8900. doi: 10.1093/nar/gkaa621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47:W256–W259. doi: 10.1093/nar/gkz239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Letunic I, Khedkar S, Bork P. SMART: recent updates, new developments and status in 2020. Nucleic Acids Res. 2021;49:D458–D460. doi: 10.1093/nar/gkaa937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Skennerton CT, Imelfort M, Tyson GW. Crass: identification and reconstruction of CRISPR from unassembled metagenomic data. Nucleic Acids Res. 2013;41:e105. doi: 10.1093/nar/gkt183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Bland C, et al. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinform. 2007;8:209. doi: 10.1186/1471-2105-8-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Lowe TM, Eddy SR. tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Galiez C, Siebert M, Enault F, Vincent J, Söding J. WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs. Bioinformatics. 2017;33:3113–3114. doi: 10.1093/bioinformatics/btx383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Preprint at https://ui.adsabs.harvard.edu/abs/2013arXiv1303.3997L (2013).
- 120.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Wessel P, et al. The Generic Mapping Tools version 6. Geochem. Geophys. Geosystems. 2019;20:5556–5564. [Google Scholar]
- 122.Tozer B, et al. Global bathymetry and topography at 15 Arc Sec: SRTM15+ Earth Space Sci. 2019;6:1847–1864. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw reads of CD metagenomes and metatranscriptomes and MAGs were deposited in GenBank under BioProject number PRJNA635214. The DNA sequences of the 1628 viral contigs were deposited in NCBI GenBank under accession number JAODGZ000000000. The marine viruses-related data sets utilized in this study are cited in Supplementary Table 2. Source data for figures also provided in Supplementary Data 11.
All software and R packages used are open source and described in the Methods section. No custom code was used to analyze data in this study, and further details are available on request.