Abstract
Viruses play an important role in the marine ecosystem. However, our comprehension of viruses inhabiting the dark ocean, and in particular, under the Antarctic Ice Shelves, remains limited. Here, we mine single-cell genomic, transcriptomic, and metagenomic data to uncover the viral diversity, biogeography, activity, and their role as metabolic facilitators of microbes beneath the Ross Ice Shelf. This is the largest Antarctic ice shelf with a major impact on global carbon cycle. The viral community found in the cavity under the ice shelf mainly comprises endemic viruses adapted to polar and mesopelagic environments. The low abundance of genes related to lysogenic lifestyle (<3%) does not support a predominance of the Piggyback-the-Winner hypothesis, consistent with a low-productivity habitat. Our results indicate a viral community actively infecting key ammonium and sulfur-oxidizing chemolithoautotrophs (e.g. Nitrosopumilus spp, Thioglobus spp.), supporting a “kill-the-winner” dynamic. Based on genome analysis, these viruses carry specific auxiliary metabolic genes potentially involved in nitrogen, sulfur, and phosphorus acquisition. Altogether, the viruses under Antarctic ice shelves are putatively involved in programming the metabolism of ecologically relevant microbes that maintain primary production in these chemosynthetically-driven ecosystems, which have a major role in global nutrient cycles.
Subject terms: Microbiology, Environmental sciences, Virology, Systems virology, Microbial ecology
The diversity and impact of viruses in remote marine environments are less well understood. In this study, the authors analyse an -omics dataset to gain insights into the genomics and potential role of endemic viruses infecting ecologically important microbes inhabiting the ocean cavity beneath the Ross Ice Shelf.
Introduction
The Antarctic Ice Shelf is an expansive mass of floating ice that extends from the Antarctic continent into the surrounding Southern Ocean and exerts a profound influence on global carbon cycles1,2. Ice shelves comprise a top freshwater layer of meteoric snow and a bottom layer of marine ice (i.e. frozen seawater) that can be as thick as several hundred meters1,3,4. In the past, ice shelves were considered inactive elements of the C cycle and mostly ignored in global models2. However, research conducted in the last decade has revolutionized this perspective, revealing the presence of uniquely adapted microbial communities that drive key biogeochemical cycles2,5, high rates of biogeochemical/physical weathering in ice sheets, and storage and cycling of organic carbon (>104 Pg C) and nutrients2. Thus, spanning vast areas, the ice shelf plays a pivotal role in nutrient cycling and the sequestration and release of carbon, contributing to regulating atmospheric CO2 levels and the overall climate system1,2.
The Ross Ice Shelf (RIS), in particular, represents a dominant feature within the polar region, both in terms of its vast size, ecological importance, and singular physico-chemical properties, such as the coldest seawater in the oceans limiting the life6,7. The RIS encompasses an area of ~487,000 km2, such as the size of France, and is the largest ice shelf in Antarctica floating atop a 54,000 km3 ocean cavity that act as a critical component of the Antarctic ice system6,7. Its unique characteristics, make it a significant contributor to the global climate system, influencing the marine ecosystem’s dynamics and acting as a key regulator of local and regional biogeochemical cycles. These polar ecosystems are vast reservoirs of life with a remarkable diversity of (micro)-organisms across various habitats that serve as sentinels in climate change8–10. However, the Antarctic Ice Shelves´s stability is increasingly threatened due to climate change1,3,11. Rising temperatures and changing ocean currents have resulted in accelerated ice melt, which has important implications for carbon cycling.
A recent study characterized the microbial community beneath the RIS, shedding light on their functional diversity and ecological role in one of the least-studied ecosystems in the world´s ocean5. This study uncovered the global biogeochemical processes taking place under the ice, in a vast area of Antarctic continent (> a million km2) that has major impact on global ecosystem processes. In that survey, seawater samples collected from three different depths of the ocean cavity were used to generate a microbial dataset to unveil the metabolic capabilities of microbes inhabiting these dark environments. Near the base of the RIS, high concentrations of ammonium drove high abundances and activities of ammonium oxidizing archaeon Nitrosopumilus spp. These archaea together with sulfur-oxidizing bacteria, such as Thioglobus spp., were the main primary producers and source of new organic matter in the ecosystem. In addition, a mixture of metabolically versatile and diverse heterotrophic bacteria were also abundant relying on complex organic matter compounds5.
Nowadays, the impact of viruses on ecosystem functioning is unquestionable12–16. The study of marine viruses has gained significant attention due to their ecological and biogeochemical impact on marine ecosystems14. Laboratory and regional scale observations have revealed that viruses play a crucial role in the biological carbon pump, specifically through viral “shunt” and “shuttle” mechanisms. For instance, some viruses predicted to infect ecologically important hosts, explain 67% of the variation in the organic carbon export in coastal and open ocean16. Of all marine viruses, those residing in the meso- and bathypelagic marine environments have garnered particular interest, as these constitute the largest ecosystems on Earth and are crucial components of the global carbon cycle13,17–19. Despite the crucial ecological significance of viruses inhabiting the dark ocean environments17,20–22, our comprehension of their diversity, abundance, and ecological roles remains significantly limited, more even for those thriving in the permanently dark ocean under the polar Antarctic Ice Shelves, due to titanic sampling efforts that involve drilling through hundreds of meters of ice in a remote location. Thus, the identification of major viruses both in terms of abundance, potential ecological role and influence on this globally relevant ecosystem remains enigmatic. Given the extent and ecological significance of the Antarctic Ice Shelves, advancing our understanding of the virioplankton and its interaction with their hosts in these singular, ecologically important dark environments is essential for developing a comprehensive understanding of the ecology and biogeochemistry of these ecosystems. This is particularly relevant in light of climate change, which dramatically affects these ecosystems. To fill this gap, here we studied the viral diversity, biogeography, activity, and their role as metabolic facilitators of microbes inhabiting beneath the RIS. In this work, we show that the viral community found in the cavity under the ice shelf mainly comprises novel endemic viruses, actively infecting key abundant ammonium and sulfur-oxidizing chemolithoautotrophs that sustain primary production in these chemosynthetically-driven habitats.
Results and discussion
Virioplankton community structure beneath the RIS
In this study, we accessed and mined the genetic information of viruses collected beneath the RIS from previously reported cellular metagenomes, metagenome assembled genomes (MAGs), metatranscriptomes, and single-amplified genomes (SAGs) datasets5. The obtained data were used to unveil the ecogenomics of viruses that were likely been transcribed23, infecting single cells24–26, and/or present in the cell fraction as prophage or lytic virus27,28 (Fig. 1a). Ice drilling was performed in 2017 at site HWD-2, which is located 300 km from the RIS front. During the RIS Program, seawater samples were collected at three depths (30, 180, and 330 m) below the bottom of the shelf. It is important to clarify that specific sampling for assessing the free living fraction of the viral community was not originally conceived in the RIS Program and thus, such data were not available. Using a rather conservative method (combination of Virsorter 2.029, CheckV30, and PPR-meta programs31; see “Methods” section for details), we found a total of 607 bona fide viral genomes. Since taxonomic classification of viruses is complex when addressing uncultured viruses, we used two different approaches: classification by Virsorter 2.0 program that identifies viral hallmark genes for different type of viruses (e.g. RNA or ssDNA viruses), and also using Genomad with the most updated classification and database from the International Committee on Taxonomy of Viruses (ICTV) (Fig. 1b). According to ICTV classification, most of the recovered viral genome fragments (≈90% of assembled viral contigs) belonged to Caudoviricetes (Duplodnaviria; dsDNA viruses, Fig. 1b). Nearly all detected Caudoviricites displayed an uncertain classification indicating that they could correspond to novel families (Source data are provided in a Source data file). Other less abundant viral contigs recovered from our transcriptomic and metagenomic datasets belonged to ssDNA viruses (Monodnaviria, 3% of total detected viruses), RNA viruses (Riboviria), and Varidnaviria (including for instance nucleocytoplasmic large DNA viruses (NCLDV) and virophages) (Fig. 1c). Common hallmark genes of these viral groups were clearly detected such as single-stranded binding proteins for ssDNA32 viruses or RNA-directed RNA polymerase for RNA viruses33,34 (Fig. 1d and Source data file), such as in the case for RNA virus k121_168914, which, as discussed below, was one of the most transcribed viruses. As expected, the recovered size of assembled genome fragments (mean ≈4 kb) from ssDNA and RNA viruses were significantly lower than dsDNA viruses (mean contig size of 19,4 kb; Fig. 1c). Gene annotation of predicted ORFs (n = 11,017) corroborated that the retained contigs were indeed viruses containing common viral hallmark genes, such as capsid and other virion structural proteins (Fig. 1d). Standard viral metagenomic techniques used in our study are well optimized for recovering dsDNA viruses35, and therefore we cannot rule out that some technical limitations and biases during sampling and processing have affected the recovery of RNA viruses36–38 that commonly are less stable. However, our employed experimental and bioinformatic methodologies to recover RNA viral genomes from transcriptomics have been successfully proven in environmental virology and are very useful to uncover abundant and active RNA viruses in soil and aquatic environments29,39–41.
Endemicity, global biogeography and lifestyle of the RIS viruses
Beneath the RIS, prokaryoplankton were of comparable diversity and abundance, though distinct composition, relative to those in the open, meso-, and bathypelagic ocean5. To put the viroplankton RIS community into global context, we compared all the predicted viral genes against the GOV 2.0 Tara virome datasets including polar stations and Malaspina expedition data from the deep ocean20,42–44, isolated viruses deposited in the Genbank database, and virome datasets obtained from the Southern Ocean45. Half of the predicted viral genes beneath the RIS show no homology with viral databases, while the other half showed mostly homology with ORFs from viruses mainly obtained from polar stations (Fig. 2). Overall, within GOV 2.0 database, most of the hits were against viruses obtained in mesopelagic (≈44%) followed by deep-chlorophyl maximum zone (27%) and surface (26%). Our results are thus consistent with the prokaryoplankton results5, indicating that a large fraction of the virioplankton under the RIS is locally adapted to polar environments and more similar to open ocean meso- and bathypelagic than to surface communities.
We also performed a further comparison using viral protein sharing network with more than 5000 representative viruses including reference isolates and uncultured viral representatives from some of the most abundant clusters from Tara expedition42 and other surveys, such as virus vSAG 37-F6; supposed to be one of the most abundant virus in the surface ocean46. We found that 56% of the recovered viruses under the RIS represented singletons or outliers in the network without any connection with other known viruses and viral clusters (Fig. 2), consistent with the gene search similarity analysis discussed above. This suggests that a significant fraction of the virioplankton that resides under RIS is unique and might be represented by novel families never described before20,46. A deeper genomic analysis comparison of all viruses showed that our dataset was comprised of ≈600 different genera (according to thresholds demarcated by ICTV criteria), with only a few viral members belonging to the same species or genus, which agreed with viral network analysis data, since these ≈600 different genera mostly belonged to singleton viral clusters (Source data are provided in a Source data file). This suggests a high genomic diversity under the RIS. ICTV has implemented genome-based criteria47 and recently updated viral taxonomy (see ICTV webpage); although, demarcation of viral genera or families remain controversial and complicated for uncultured viruses46,47. Our network analyses on the RIS viroplankton also revealed a high density connection with viruses from the Southern Ocean (Fig. 2). Physical isolation and high productivity characterize the Southern Ocean (SO), which is responsible for up to one-fifth of the total carbon drawdown worldwide48. In the Southern Ocean, a positive selection of several viral protein clusters related to cold-shock-event responses and quorum-sensing mechanisms involved in the lysogenic-lytic cycle shift decision suggested marked temperature-driven genetic selection in the SO45. Piggyback-the-Winner (PtW) predicts that phages integrate into their hosts’ genomes as prophages when microbial abundances and growth rates are high49, such as in the gut50 or other productive environments51. Other strategies have been proposed, such as the Piggy-back-the-Persistent, in which viral community is dominated by temperate rather lytic lifestyles in ultra-oligotrophic polluted environments52. However, under the RIS, only 3% viruses contained integrases or were detected as prophages (Supplementary Material). Microbial cell abundance (9 × 104–1.2 × 105 cells mL−1) and prokaryotic heterotrophic production under the RIS, are rather low with turnover time of the microbial community between 339-461 days5. Thus, the lack of integrases in our data is consistent with the lifestyle of microbial community where peaks of high growth and production are not expected.
Local abundance and activity of virioplankton and host prediction
Despite the relatively low proportion of viruses assigned to their hosts (≈10%), albeit within the value obtained in other metagenomic surveys20, we were able to unveil host-virus pairs (n≈60) representing the most ecologically important prokaryotic members under the RIS (Fig. 3) involved in primary production, such as Thioglobus spp. and Nitrosopumilus spp. (sulfur and ammonia oxidation, respectively), and in carbon remineralization (Fig. 3, Source data file). A combination of multiple in silico approaches were implemented to assign virus to hosts, such as detection of CRISPR spacer-protospacer or tRNA match (see supplementary for more details) amongst others.
Metagenomic fragment recruitment has been commonly used to estimate the in silico abundances of viruses20,53. Here, as viromes are unavailable, we used metagenomes and metatranscriptomes, which provided information about viral abundances in the cell fraction (i.e. likely viruses infecting cells) and transcription activity for RNA and DNA viruses. Nearly all detected viruses (Fig. 3), except a few cases (<5%) exhibited no significant variation in fragment recruitment values among the different samples collected, indicating very similar abundances along the depth profile (Source data are provided in a Source data file). The top 3 most abundant viruses in the cell metagenomes (Fig. 3) belonged to Duplodnaviria, and one of them likely infected a marine Archaea. Remarkably, despite the abundance of RNA viruses recovered under the RIS was low, one of the most transcribed viruses were precisely one Riboviria (RNA virus “contig K121_168914” belonging to Chrymotiviricetes) infecting a putative unknown eukaryotic host. Other highly transcribed viruses were a dsDNA Caudoviricetes virus infecting a Nitrosphirales archaeon and a virus putatively infecting Pelagibacter spp (i.e. Pelagiphage, viral contig k121_349967). Under the RIS, nitrifying marine archaea are key abundant and active community members and contribute significantly to primary production5. Thus, to find archaeal viruses within the top 3 most abundant and active viruses was consistent with the “kill-the-winner” model54–56.
A diverse Pelagibacter spp. population with a moderate-low transcriptional activity was also discovered beneath the RIS5. Remarkably, our viral network analysis and genetic comparison indicated that the active transcribed Pelagiphage shared several orthologues genes, including the hallmark capsid viral gene (ORF 9), with virus vSAG 37-F6 (confirmed by 3D-capsid protein structure prediction (Fig. S1). Virus 37-F6 infects Pelagibacter spp. and was discovered by single-virus genomics in temperate and tropical and subtropical surface oceanic regions as well as in some deep ocean samples25,46,57–60. The capsid protein of vSAG 37-F6 (ORF9) was the most abundant protein in viral proteomes generated from Tara expedition. We also found that several viruses (n = 19) from the RIS belonged to 37-F6 viral group sharing the orthologue capsid gene ORF9 (average identity 61.3% and query coverage 93.3%; Source data file). Despite the high abundance of Pelagiphages in marine viromes (i.e. metagenomes from free viral particles in the sea)20,46,57,61,62, this group overall showed significantly lower transcription rates than other marine viruses (e.g. viruses infecting SAR116)63. In our study, the uncultured pelagiphages k121_349967 belonging to vSAG 37-F6 viral group was one of the most active members, especially in the sample collected at 180 m depth below the ice. It appears contradictory that some of the most actively transcribed viruses were viruses infecting a common slow-growth microbe, such as Pelagibacter spp. Nevertheless, considering the extreme conditions of the RIS, with the lowest possible temperature in the sea, it is plausible that slow-growing microbes and their viruses thrive and compete under these environmental conditions, where killing the winner dynamics co-exist alongside other viral strategies for infection. Indeed, high viral to prokaryotic ratios in low productive areas have previously been reported in bathypelagic layers of the Pacific and Atlantic Oceans64,65 suggesting that at low-temperature, viruses remain active for an extended period of time and infect slow-growing bacteria66, such as under the RIS.
Viruses as microbial helpers under the ice: niche-specific auxiliary metabolic genes
Our data showed that most of the detected viruses infect microbial hosts that play a crucial role in the biogeochemical cycles (Fig. 3). A large fraction of environmental viruses possess AMGs that participate in host metabolism facilitating host adaptation67–69. To investigate the niche specialization and ecological functions of these viruses, and how they affect marine ecosystem function beneath the Antarctic Ice Shelf, we further identified, based on genome analysis, the key potential AMGs they carry. In total, 52 different viruses from our virus genomic dataset potentially had niche-specific AMG (Fig. 4 and Source data file). Molecular chaperone and cold shock proteins were fairly abundant in different viruses infecting numerous microbial phyla in our data (Fig. 4). These findings suggest that under RIS extreme conditions, there is a strong selective pressure favoring adaptation to low temperatures5. Similar adaptations have been observed in other environments, such as the Southern Ocean, where viral proteins exhibit a lower hydrophobicity index pattern20. This adaptation favors protein flexibility and improves performance at lower temperatures20, which has been also observed here in the globally widespread open ocean uncultured vSAG 37-F6 virus, where the hydrophobicity index of the capsid gene was −0.29 or lower (Fig. S1 and Source data file).
The most striking result was that several viruses infecting key chemolithoautotrohps, such as Thioglobus spp. (that relies on available reduced sulfur compounds), putatively carry iron-sulfur cluster proteins. It has been described that these proteins can bind sulfur and/or iron (i.e. Fe/S cluster protein) before being transferred to apoproteins. 3D-Prediction structure of selected viral AMGs (see for instance gene product ORF2 of virus “metag_c_000000144917” classified as high-confidence viral genome) showed nearly identical protein structure to that of potential hosts relying on sulfur oxidation (Figs. 4 and S3). Likewise, Fe/S cluster proteins were reported as abundant AMG in viruses infecting symbiont SUP05 sulfur-oxidizing bacteria70. In that model, it has been proposed that this viral AMG encodes a Fe-S cluster proteins that supplement the assembly of Fe-S clusters in SUP05, thereby increasing the efficiency of energy conservation from sulfur oxidation. Similarly, we propose that viruses beneath the RIS potentially boost sulfur metabolism of chemolithoautotrophs, as in the SUP05 bacterial group by providing an auxiliary gene encoding for proteins with high affinity to sulfur. In other sulfur oxidizing bacteria, Fe-S cluster proteins (i.e. encoded by gene IscA) resulted to be essential for sulfur oxidation71. Indeed, when an homologs Fe-S gene was experimentally deleted in Allochromatium vinosum, the sulfur oxidation rates were significantly reduced71. These proteins are essential in supporting bacterial life as they are involved in three vital processes such as, photosynthesis, nitrogen fixation and oxidation/respiration, and hydrogen/sulfur metabolism72,73. Considering that photosynthesis and nitrogen fixation were absent under RIS5, it is reasonable to speculate that this Fe-S cluster proteins boost sulfur metabolism as shown in other sulfur-oxidizing bacterial models71. This could be potentially equivalent to viral photosynthesis performed by cyanophages in sunlit waters74–76. Interestingly, ferredoxin, which is also involved in Fe/S transportation, was another highly abundant viral AMG found in several viruses (Figs. 4 and S4 and Suppl Dataset), likely involved in cell respiration and electron transport chain. Moreover, considering the ubiquity of Fe-S cluster proteins detected in viruses infecting sulfur-oxidizing bacteria, it is also possible that these proteins may also participate in other house-keeping processes beyond sulfur metabolism63.
Furthermore, Caudoviricetes viruses beneath the RIS showed the potential to facilitate phosphorous acquisition, since PhoH and acid phosphatase genes were detected in viruses putatively infecting Marinisomataceae (formerly named as SAR406) and Nitrosphirales (SAG_4 and SAG_57), respectively (see Fig. S4 and Source data file). In addition, a glutamine synthetase AMG involved in nitrogen metabolism and biosynthesis was found in viruses (e.g. virus “metag_c_000000198991”, ORF_7) putatively infecting SAR86 (SAG_12 and MAG_167). Interestingly, both genomes had homologs glutamine synthetase. This enzyme is fundamental to the nitrogen metabolism of bacteria by catalyzing the condensation of glutamate and ammonia to form glutamine. Ammonia, which is abundant in this habitat, can be used both as an electron donor by chemolithoautotrophs and/or as a nitrogen source for nitrogen assimilatory pathway by different microbial groups5. Likewise, although speculative, isocitrate lyase involved in anaplerotic fixation in planktonic marine bacteria77, was found in viruses infecting Pelagibacter spp. and Woesiales (Fig. S2). Finally, as expected multiple AMGs homologs to host genes involved nucleotide/nucleoside synthesis and recycling and DNA processing were detected (Figs. 4 and S2 and Source data file). Although the discovery of AMGs are solely based on genome analysis, it is reasonable to consider our hypotheses plausible. Our findings are supported by previous AMG reports from different reference marine viral models, such as viral photosynthesis78, which were later confirmed through experimental studies75.
In conclusion, we have characterized the viral diversity, biogeography, and activity (i.e. transcriptional activity) inhabiting beneath the Antarctic Ice Shelf. Our results unveiled a unique viroplankton diversity and community composition, mostly comprised of unknown, endemic, and novel taxa. Consistent with the RIS prokaryoplankton, the viroplankton community was more related to open ocean polar and mesopelagic communities. The virus-host dynamics revealed insights into the ecology of these viruses and did not support a Piggyback-the-Winner, consistent with a very low productive environment. The analysis of viral transcripts also revealed an active viral community that was more actively infecting key active prokaryotes driving elemental cycling (e.g. Nitrosopumilus spp, Thioglobus spp.), supporting a kill-the-winner dynamic. Moreover, genome analysis of these viruses showed the presence of specific AMGs putatively involved in nitrogen, sulfur, and phosphorus acquisition. Altogether, most of the viruses below the RIS are novel and have the potential to impact the cycling of nitrogen, phosphorus, and sulfur in this ecosystem, consequently influencing the global biogeochemical cycles. Further mechanistic studies will help to provide more biological insights into the role of viruses in one of the most underexplored ecosystems on Earth.
Methods
Sampling, assembly, and viral contig detection
All details regarding site selection, ice drilling and sampling are described in ref. 5. Briefly, hot drilling and seawater sampling was conducted from the sub-shelf water column in the central region of the RIS (Latitude −80.6577 N, Longitude 174.4626 W). The sampling site was located ≈300 km from the shelf front. A borehole (30 cm diameter) conducted by hot water drilling was used for direct sampling of seawater from three depths (400 m, 550 m, and 700 m from the top of the shelf, which correspond to 30 m, 180 m, and 330 m from the bottom of the ice shelf, respectively). Seawater samples were processed accordingly for single cell genomics, metagenomics, and transcriptomics as described5, and the resulting assembled and co-assembled contigs (min. length 1 kb) from single-amplified genomes, bins and transcriptomics were mined for detecting viral contigs. Since viral metagenomes were not available for these collected samples nor conceived at the moment of sampling, we applied here a rather conservative method for detecting and considering a bona fide viral genome fragment as follows. Initially, a first round of viral contig detection was performed with Virsorter2.029 and CheckV30 for the presence of RNA viruses, Lavidaviridae, NCVLD, dsDNA and ssDNA viruses. A total of 37,674 putative viral contigs were detected (contig length ≥ 1 kb). Then, the dataset was filtered out by contig length criteria, considering only contigs ≥10 kb for those potentially classified as dsDNA viruses, NCVLD, and Lavidaviridae. For RNA and ssDNA viruses we applied a lower cut-off, 1.5 and 2.5 kb, respectively. For the latter, very small genomes from marine ssDNA and RNA viruses have been described79,80. Then, the resulting viral contig dataset was filtered out in a second round using several parameters obtained from the program CheckV as follows: only bona fide viral genome fragments were considered when at least 1 or more viral hallmark genes were detected in a contig -non classified as provirus or integrated- as long as the ratio of cell host genes to viral genes were <1. For those viral contigs classified as provirus integrated in the host genome, the criterium was more flexible, and a viral contig was considered as long as 1 viral hallmark was detected. Viral contigs with 0 host genes and 0 viral hallmark genes were not further considered. This resulted in a total of 607 bona fide viral contigs. Finally, in a third round, the curated viral contig dataset was again independently screened with the program PPR-meta31 which showed that more than 94% of these contigs were of viral origin and only in a few cases the source was unclear.
Genome classification, annotation, viral genome network, and host assignment
Putative viral contigs were initially classified into five groups according to Virsorter2.0 five groups (dsDNA, ssDNA, and RNA viruses, Lavidaviridae, and NCLDV). Then, they were formally classified with the current most updated classification recently released by ICTV using the program geNomad with the following parameters: genomad end-to-end --min-score 0.7 –cleanup. Viral contigs were annotated with program DRAM-v with default parameters81. Genomic comparison of all viral contigs was performed with the VIRIDIC program82, with default parameters in order to estimate the genetic relatedness. Viral network interaction showing taxonomic assignment and relatedness of viruses from the RIS with other viruses in databases was performed with vConTACT v.2.0 as described46,83,84. For this analysis, a total of 5461 viruses were compared including reference viruses recovered from Tara and Malaspina expeditions20,42, isolated reference viruses from Genbank and ICTV, and marine viruses from different samples of the Southern Ocean45. Viral proteins were predicted and compared through all-verses-all BLASTP with an E-value threshold of 10−5 and 50 for bit score46. Viral protein clusters (PCs) were then defined using Markov Clustering Algorithm (MCL)85, using default parameters and 2 for an inflation value. vContact (https://bitbucket.org/MAVERICLab/vcontact) was then used to estimate a similar score between every pair of viral genomes based on the number of PCs shared between two sequences and all pairs using the hypergeometric similarity, as previously described20,86. MCL was applied to the similarity scores using a threshold of 1 and MCL inflation of 2 to generate viral clusters (VCs, ≥2 sequences). Sequences were analyzed to identify highly similar VCs from the dataset using the Jaccard similarity as described46, predicted taxonomy using reference sequences present within the VCs, and constructed a network87 using the similarity scores generated by vContact between each genome pair. The final dataset was exported to Cytoscape (v3.3.0)87 and images were post-processed.
Comparison of virus abundance and viral protein in GOV 2.0/Tara and RIS datasets
Abundances in cell metagenomes and transcriptomes of the recovered viral contigs beneath the RIS were estimated in silico by metagenomic fragment recruitment. In addition, abundances in marine Tara viromes from different samples were also estimated. This analysis was performed as previously described in20,46,61, employing the following two identity thresholds (query coverage ≥85% plus nucleotide identity cutoff ≥95% and query coverage ≥50% plus nucleotide identity cutoff ≥70%) in order to estimate the normalized abundance of the viral populations at the species and genus level, respectively46, and expressed as recruited kilobases per a genome kilobase and a metagenome (or Metatranscriptome) gigabase (KPKG). To complement these analyses, viral proteins predicted from the RIS viruses were compared against the whole GOV 2.0 protein database and proteome from the Southern Ocean viruses45 using blastp with the following cut-offs: e-value better than 0.00005 and query coverage and identity values ≥ 50%. In addition, we also searched specifically for the presence of hallmark genes and orthologues genes of virus vSAG 37-F6 using the previously mentioned parameters in the detected viral contigs.
Detection of auxiliary metabolic genes in viral contigs and virus-host assignment
To search for the presence of auxiliary metabolic genes (AMG) in viral contigs, all the predicted viral proteins, regardless if the function was known or not (e.g. hypothetical proteins), were queried against all viral proteins obtained from binned MAGs and SAGs obtained from the same samples5. For that, BLASTp was used with the following criteria: e-value better than 0.00005 and query coverage and identity values ≥ 50%.
Detection of CRISPR arrays in MAGs and SAGs was performed with CRT Tool88 and a search of match between host spacer and viral protospacer was performed with BLASTn adjusting parameters to short sequences according to program´s manual. The obtained viral tRNA were queried against MAG and SAGs database using BLASTn according to the following thresholds previously used (alignment ≥ 60 bp, identity ≥ 97%, mismatches <10 search89). Codon usage was calculated using the on-line Sequence Manipulation Suite90. WisH program was also used for assigning virus to host considering only those pairs that showed a Log Likelihood value between −1.30 and −1.20 according to reported data91. The presence of provirus was conducted with CheckV program30. It is important to remark that sometimes binner softwares could wrongly place a viral contig in a MAG based simply on similar %GC or sequencing coverage. In our analysis, we did not assume that the simple presence of a viral contig in a MAG or bin was enough to conclude such host assignment but other complementary and more unequivocal proofs were needed, such as a match of CRISPR spacer-protospacer or tRNA, and/or presence of a host homologs auxiliary metabolic gene in the viral genome. In some virus-host pairs, multiple coincident proofs were obtained.
The hydropathicity index of proteins was calculated using Protscale Expasy tool (https://web.expasy.org/protscale/) according to92, which represents the hydrophobic or hydrophilic properties of a protein. The larger the number is, the more hydrophobic the protein, while the lower the number is, the more hydrophilic the protein. Robetta on-line software (https://robetta.bakerlab.org/submit.php) was used for protein 3D-structure prediction.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
We thank the research grant to MMG funded by the Spanish Ministry of Science and Innovation and Agencia Estatal de Investigación (PID2021-125175OB-I00). We also thank the Victoria University of Wellington Hot Water Drilling Team led by A. Pyne and D. Mendeno. This research was facilitated by the New Zealand Antarctic Research Institute (NZARI) funded Aotearoa New Zealand Ross Ice Shelf Programme, the New Zealand Antarctic Science Platform ANTA1801, the Austrian science fond (FWF) projects OCEANIDES (P34304-B), ENIGMA (TAI534), EXEBIO (P35248), and OCEANBIOPLAST (P35619-B) and a Rutherford Discovery Fellowship from the Royal Society of New Zealand to F.B.
Author contributions
J.L.S. performed experiments, analyzed data, and contributed to write the manuscript. A.R. analyzed data. D.C. analyzed data and edited the manuscript. F.B. conceived and designed experiments, analyzed data, provided funds, and wrote the manuscript. M.M.G. conceived and designed experiments, analyzed data, provided funds, and wrote the main manuscript draft.
Peer review
Peer review information
Nature Communications thanks James Mitchell, and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The original sequenced metagenomic and metatranscriptomic raw and assembled data including contigs mined and identified here as viruses along with single-amplified genomes and metagenome-assembled genomes, were available at EMBL Nucleotide Sequence Database (ENA) database under Bioproject PRJEB35712. Data on viral contigs analysed in this study (fasta nucleotide and amino acid sequences) are also available in the Source Data file that support and complement this manuscript, as well as in Figshare [10.6084/m9.figshare.24581331 and 10.6084/m9.figshare.24581334]. Source Data file provided with this paper contains additionally source data on viral classification, viral network analysis, virus-host assignment, transcriptional activity and abundance of viruses, and data on AMG information. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Federico Baltar, Email: federico.baltar@univie.ac.at.
Manuel Martinez-Garcia, Email: m.martinez@ua.es.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-44028-x.
References
- 1.Rignot E, Jacobs S, Mouginot J, Scheuchl B. Ice-shelf melting around antarctica. Science (80-) 2013;341:266–270. doi: 10.1126/science.1235798. [DOI] [PubMed] [Google Scholar]
- 2.Wadham JL, et al. Ice sheets matter for the global carbon cycle. Nat. Commun. 2019;10:1–17. doi: 10.1038/s41467-019-11394-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Adusumilli S, Fricker HA, Medley B, Padman L, Siegfried MR. Interannual variations in meltwater input to the Southern Ocean from Antarctic ice shelves. Nat. Geosci. 2020;13:616–620. doi: 10.1038/s41561-020-0616-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Oerter H, et al. Evidence for basal marine ice in the Filchner–Ronne ice shelf. Nature. 1992;358:399–401. doi: 10.1038/358399a0. [DOI] [Google Scholar]
- 5.Martínez-Pérez C, et al. Phylogenetically and functionally diverse microorganisms reside under the Ross Ice Shelf. Nat. Commun. 2022;13:1–15. doi: 10.1038/s41467-021-27769-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Holland DM, Jacobs SS, Jenkins A. Modelling the ocean circulation beneath the Ross Ice Shelf. Antarct. Sci. 2003;15:13–23. doi: 10.1017/S0954102003001019. [DOI] [Google Scholar]
- 7.Robinson, N. J., Williams, M. J. M., Barrett, P. J. & Pyne, A. R. Observations of flow and ice-ocean interaction beneath the McMurdo Ice Shelf, Antarctica. J. Geophys. Res.115, C03025 (2010).
- 8.Morán, X. A. G. et al. More, smaller bacteria in response to ocean’s warming? Proc. R. Soc. B Biol. Sci.282, 0150371 (2015). [DOI] [PMC free article] [PubMed]
- 9.Cavicchioli, R. et al. Scientists’ warning to humanity: microorganisms and climate change. Nat. Rev. Microbiol.17, 569–586 (2019). [DOI] [PMC free article] [PubMed]
- 10.Ibarbalz FM, et al. Global trends in marine plankton diversity across kingdoms of life. Cell. 2019;179:1084–1097.e21. doi: 10.1016/j.cell.2019.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hellmer HH, Kauker F, Timmermann R, Hattermann T. The fate of the Southern Weddell sea continental shelf in a warming climate. J. Clim. 2017;30:4337–4350. doi: 10.1175/JCLI-D-16-0420.1. [DOI] [Google Scholar]
- 12.Brum, J. R. & Sullivan, M. B. Rising to the challenge: accelerated pace of discovery transforms marine virology. Nat. Rev. Microbiol.13, 147–159 (2015). [DOI] [PubMed]
- 13.Breitbart, M., Bonnain, C., Malki, K. & Sawaya, N. A. Phage puppet masters of the marine microbial realm. Nat. Microbiol.1, 754–766 (2018). [DOI] [PubMed]
- 14.Suttle CA. Viruses in the sea. Nature. 2005;437:356–361. doi: 10.1038/nature04160. [DOI] [PubMed] [Google Scholar]
- 15.Fuchsman CA, et al. Cyanobacteria and cyanophage contributions to carbon and nitrogen cycling in an oligotrophic oxygen-deficient zone. ISME J. 2019;13:2714–2726. doi: 10.1038/s41396-019-0452-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kaneko H, et al. Eukaryotic virus composition can predict the efficiency of carbon export in the global ocean. iScience. 2021;24:102002. doi: 10.1016/j.isci.2020.102002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lara E, et al. Unveiling the role and life strategies of viruses from the surface to the dark ocean. Sci. Adv. 2017;3:e1602565. doi: 10.1126/sciadv.1602565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Obtaining targeted metagenomes from uncultivated environmental Synechococcus using flow cytometry-based viral tagging: Protocol Exchange. http://www.nature.com/protocolexchange/protocols/3355#/procedure. Accessed 18 Sep 2015.
- 19.Guidi L, et al. Plankton networks driving carbon export in the oligotrophic ocean. Nature. 2016;532:465–470. doi: 10.1038/nature16942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Roux S, et al. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature. 2016;537:689–693. doi: 10.1038/nature19366. [DOI] [PubMed] [Google Scholar]
- 21.Martinez-Hernandez F, Fornas O, Martinez-Garcia M. Into the dark: exploring the deep ocean with single-virus genomics. Viruses. 2022;14:1589. doi: 10.3390/v14071589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jian H., et al. Diversity and distribution of viruses inhabiting the deepest ocean on Earth. ISME J15, 3094–3110 (2021). [DOI] [PMC free article] [PubMed]
- 23.Yoshida T, et al. Locality and diel cycling of viral production revealed by a 24 h time course cross-omics analysis in a coastal region of Japan. ISME J. 2018;12:1287–1295. doi: 10.1038/s41396-018-0052-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Roux S, et al. Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta- genomics. Elife. 2014;3:e03125. doi: 10.7554/eLife.03125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Martinez-Hernandez F, et al. Single-cell genomics uncover Pelagibacter as the putative host of the extremely abundant uncultured 37-F6 viral population in the ocean. ISME J. 2018;13:232–236. doi: 10.1038/s41396-018-0278-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Jarett, J. K. et al. Insights into the dynamics between viruses and their hosts in a hot spring microbial mat. ISME J.14, 2527–2541 (2020). [DOI] [PMC free article] [PubMed]
- 27.Zaragoza-Solas, A., Rodriguez-Valera, F. & López-Pérez, M. Metagenome mining reveals hidden genomic diversity of pelagimyophages in aquatic environments. mSystems5, e00905-19 (2020). [DOI] [PMC free article] [PubMed]
- 28.Dutilh BE, et al. A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes. Nat. Commun. 2014;5:4498. doi: 10.1038/ncomms5498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Guo J, et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome. 2021;9:1–13. doi: 10.1186/s40168-020-00990-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nayfach S, et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol. 2020;39:578–585. doi: 10.1038/s41587-020-00774-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Fang Z. et al. PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning. Gigascience8, giz066 (2019). [DOI] [PMC free article] [PubMed]
- 32.Hopkins M, et al. Diversity of environmental single-stranded DNA phages revealed by PCR amplification of the partial major capsid protein. ISME J. 2014;8:2093–2103. doi: 10.1038/ismej.2014.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dolja VV, Koonin EV. Metagenomics reshapes the concepts of RNA virus evolution by revealing extensive horizontal virus transfer. Virus Res. 2018;244:36–52. doi: 10.1016/j.virusres.2017.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zayed AA, et al. Cryptic and abundant marine viruses at the evolutionary origins of Earth’s RNA virome. Science. 2022;376:156–162. doi: 10.1126/science.abm5847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Roux S, et al. Towards quantitative viromics for both double-stranded and single-stranded DNA viruses. PeerJ. 2016;4:e2777. doi: 10.7717/peerj.2777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Cohen LJ, et al. Commensal bacteria make GPCR ligands that mimic human signalling molecules. Nature. 2017;549:48–53. doi: 10.1038/nature23874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Roux S, Emerson JB, Eloe-Fadrosh EA, Sullivan MB. Benchmarking viromics: an in silico evaluation of metagenome-enabled estimates of viral community composition and diversity. PeerJ. 2017;5:e3817. doi: 10.7717/peerj.3817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Steward GF, et al. Are we missing half of the viruses in the ocean? ISME J. 2013;7:672–679. doi: 10.1038/ismej.2012.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kolundžija, S., Cheng, D. Q. & Lauro, F. M. RNA viruses in aquatic ecosystems through the lens of ecological genomics and transcriptomics. Viruses14, 702 (2022). [DOI] [PMC free article] [PubMed]
- 40.Hillary LS, Adriaenssens EM, Jones DL, McDonald JE. RNA-viromics reveals diverse communities of soil RNA viruses with the potential to affect grassland ecosystems across multiple trophic levels. ISME Commun. 2022;2:1–10. doi: 10.1038/s43705-022-00110-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Vila-Nistal, M., Maestre-Carballa, L., Martinez-Hernández, F. & Martinez-Garcia, M. Novel RNA viruses from the Atlantic Ocean: ecogenomics, biogeography, and total virioplankton mass contribution from surface to the deep ocean. Environ. Microbiol.10.1111/1462-2920.16502 (2023). [DOI] [PubMed]
- 42.Gregory AC, et al. Marine DNA Viral Macro- and Microdiversity from Pole to Pole. Cell. 2019;177:1109–1123.e14. doi: 10.1016/j.cell.2019.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Brum JR, et al. Patterns and ecological drivers of ocean viral communities. Science (80-) 2015;348:1261498–1261498. doi: 10.1126/science.1261498. [DOI] [PubMed] [Google Scholar]
- 44.Acinas SG, et al. Deep ocean metagenomes provide insight into the metabolic architecture of bathypelagic microbial communities. Commun. Biol. 2021;4:604. doi: 10.1038/s42003-021-02112-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Alarcón-Schumacher, T., Guajardo-Leiva, S., Martinez-Garcia, M. & Díez, B. Ecogenomics and adaptation strategies of southern ocean viral communities. mSystems6, e0039621 10.1128/msystems.00396-21 (2021). [DOI] [PMC free article] [PubMed]
- 46.Martinez-Hernandez F, et al. Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nat. Commun. 2017;8:15892. doi: 10.1038/ncomms15892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Simmonds P, et al. Virus taxonomy in the age of metagenomics. Nat. Rev. Microbiol. 2017;15:161–168. doi: 10.1038/nrmicro.2016.177. [DOI] [PubMed] [Google Scholar]
- 48.Terhaar J, Frölicher TL, Joos F. Southern Ocean anthropogenic carbon sink constrained by sea surface salinity. Sci. Adv. 2021;7:5964–5992. doi: 10.1126/sciadv.abd5964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Knowles B, et al. Lytic to temperate switching of viral communities. Nature. 2016;531:466–470. doi: 10.1038/nature17193. [DOI] [PubMed] [Google Scholar]
- 50.Shkoporov AN, Hill C. Bacteriophages of the human gut: the ‘known unknown’ of the microbiome. Cell Host Microbe. 2019;25:195–209. doi: 10.1016/j.chom.2019.01.017. [DOI] [PubMed] [Google Scholar]
- 51.López-García P, et al. Metagenome-derived virus-microbe ratios across ecosystems. ISME J. 2023;2023:1–12. doi: 10.1038/s41396-023-01431-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Paterson, J. S. et al. A hydrocarbon-contaminated aquifer reveals a Piggyback-the-Persistent viral strategy. FEMS Microbiol. Ecol.95, fiz116 (2019). [DOI] [PubMed]
- 53.De La Cruz Peña, M. J. et al. Deciphering the human virome with single-virus genomics and metagenomics. Viruses10, 113 (2018). [DOI] [PMC free article] [PubMed]
- 54.Rodriguez-Brito B, et al. Viral and microbial community dynamics in four aquatic environments. ISME J. 2010;4:739–751. doi: 10.1038/ismej.2010.1. [DOI] [PubMed] [Google Scholar]
- 55.Thingstad TF. Elements of a theory for the mechanisms controlling abundance, diversity, and biogeochemical role of lytic bacterial viruses in aquatic systems. Limnol. Oceanogr. 2000;45:1320–1328. doi: 10.4319/lo.2000.45.6.1320. [DOI] [Google Scholar]
- 56.Weitz JS, et al. Phage-bacteria infection networks. Trends Microbiol. 2013;21:82–91. doi: 10.1016/j.tim.2012.11.003. [DOI] [PubMed] [Google Scholar]
- 57.Martinez-Hernandez F, et al. Droplet digital PCR for estimating absolute abundances of widespread pelagibacter viruses. Front. Microbiol. 2019;10:1226. doi: 10.3389/fmicb.2019.01226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Martinez-Garcia, M., Martinez-Hernandez, F. & Martínez, J. M. Single-virus genomics: studying uncultured viruses, one at a time. 4th edition Encyclopedia of Virology Vol. 1, 184–190 10.1016/B978-0-12-809633-8.21497-0 (Academic Press, 2020).
- 59.Martinez‐Hernandez F, et al. Diel cycling of the cosmopolitan abundant Pelagibacter virus 37‐F6: one of the most abundant viruses on earth. Environ. Microbiol. Rep. 2020;12:214–219. doi: 10.1111/1758-2229.12825. [DOI] [PubMed] [Google Scholar]
- 60.Martínez Martínez J, Martinez-Hernandez F, Martinez-Garcia M. Single-virus genomics and beyond. Nat. Rev. Microbiol. 2020;18:705–716. doi: 10.1038/s41579-020-00444-0. [DOI] [PubMed] [Google Scholar]
- 61.Mizuno CM, Rodriguez-Valera F, Kimes NE, Ghai R. Expanding the marine virosphere using metagenomics. PLoS Genet. 2013;9:e1003987. doi: 10.1371/journal.pgen.1003987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Eggleston EM, Hewson I. Abundance of two pelagibacter ubique bacteriophage genotypes along a latitudinal transect in the North and South Atlantic Oceans. Front. Microbiol. 2016;7:1534. doi: 10.3389/fmicb.2016.01534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Alonso-Sáez L, Morán XAG, Clokie MR. Low activity of lytic pelagiphages in coastal marine waters. ISME J. 2018;12:2100–2102. doi: 10.1038/s41396-018-0185-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yang Y, Yokokawa T, Motegi C, Nagata T. Large-scale distribution of viruses in deep waters of the Pacific and Southern Oceans. Aquat. Micro. Ecol. 2014;71:193–202. doi: 10.3354/ame01677. [DOI] [Google Scholar]
- 65.De Corte D, Sintes E, Yokokawa T, Reinthaler T, Herndl GJ. Links between viruses and prokaryotes throughout the water column along a North Atlantic latitudinal transect. ISME J. 2012;6:1566–1577. doi: 10.1038/ismej.2011.214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Parada V, Sintes E, van Aken HM, Weinbauer MG, Herndl GJ. Viral abundance, decay, and diversity in the meso- and bathypelagic waters of the north atlantic. Appl. Environ. Microbiol. 2007;73:4429–4438. doi: 10.1128/AEM.00029-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Jian H, et al. Diversity and distribution of viruses inhabiting the deepest ocean on Earth. ISME J. 2021;15:3094–3110. doi: 10.1038/s41396-021-00994-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Trubl G. et al. Soil viruses are underexplored players in ecosystem carbon processing. mSystems3, e00076-18 (2018). [DOI] [PMC free article] [PubMed]
- 69.Crummett LT, Puxty RJ, Weihe C, Marston MF, Martiny JBH. The genomic content and context of auxiliary metabolic genes in marine cyanomyoviruses. Virology. 2016;499:219–229. doi: 10.1016/j.virol.2016.09.016. [DOI] [PubMed] [Google Scholar]
- 70.Zhou K, et al. Potential interactions between clade SUP05 sulfur-oxidizing bacteria and phages in hydrothermal vent sponges. Appl Environ. Microbiol. 2019;85:1–20. doi: 10.1128/AEM.00992-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Grimm F, Cort JR, Dahl C. DsrR, a novel IscA-like protein lacking iron- and Fe-S-binding functions, involved in the regulation of sulfur oxidation in allochromatium vinosum. J. Bacteriol. 2010;192:1652. doi: 10.1128/JB.01269-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Loiseau L, et al. ErpA, an iron-sulfur (Fe-S) protein of the A-type essential for respiratory metabolism in Escherichia coli. Proc. Natl Acad. Sci. USA. 2007;104:13626–13631. doi: 10.1073/pnas.0705829104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Esquilin-Lebron, K., Dubrac, S., Barras, F. & Boyd, J. M. Bacterial approaches for assembling iron-sulfur proteins. MBio12, e0242521 (2021). [DOI] [PMC free article] [PubMed]
- 74.Puxty RJ, Evans DJ, Millard AD, Scanlan DJ. Energy limitation of cyanophage development: implications for marine carbon cycling. ISME J. 2018;12:1273–1286. doi: 10.1038/s41396-017-0043-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Thompson, L. R. et al. Phage auxiliary metabolic genes and the redirection of cyanobacterial host carbon metabolism. Proc. Natl Acad. Sci. USA108, E757–E764 (2011). [DOI] [PMC free article] [PubMed]
- 76.Puxty RJ, Millard AD, Evans DJ, Scanlan DJ. Shedding new light on viral photosynthesis. Photosynth. Res. 2014;126:71–97. doi: 10.1007/s11120-014-0057-x. [DOI] [PubMed] [Google Scholar]
- 77.Palovaara, J. et al. Stimulation of growth by proteorhodopsin phototrophy involves regulation of central metabolic pathways in marine planktonic bacteria. Proc. Natl Acad. Sci. USA111 (2014). [DOI] [PMC free article] [PubMed]
- 78.Lindell D, Jaffe JD, Johnson ZI, Church GM, Chisholm SW. Photosynthesis genes in marine viruses yield proteins during host infection. Nature. 2005;438:86–89. doi: 10.1038/nature04111. [DOI] [PubMed] [Google Scholar]
- 79.Zhan Y. & Chen F. The smallest ssDNA phage infecting a marine bacterium. Environ. Microbiol.21, 1916–1928 (2018). [DOI] [PubMed]
- 80.Lang AS, Rise ML, Culley AI, Steward GF. RNA viruses in the sea. FEMS Microbiol. Rev. 2009;33:295–323. doi: 10.1111/j.1574-6976.2008.00132.x. [DOI] [PubMed] [Google Scholar]
- 81.Shaffer M, et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 2020;48:8883–8900. doi: 10.1093/nar/gkaa621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Moraru C., Varsani A. & Kropinski A. M. VIRIDIC-a novel tool to calculate the intergenomic 1 similarities of prokaryote-infecting viruses 2. bioRxiv10.1101/2020.07.05.188268 (2020). [DOI] [PMC free article] [PubMed]
- 83.Bolduc B, et al. vConTACT: an iVirus tool to classify double-stranded DNA viruses that infect Archaea and Bacteria. PeerJ. 2017;5:e3243. doi: 10.7717/peerj.3243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Bin Jang H, et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 2019;37:632–639. doi: 10.1038/s41587-019-0100-8. [DOI] [PubMed] [Google Scholar]
- 85.Enright AJ, Van Dongen S, Ouzounis CA. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30:1575–1584. doi: 10.1093/nar/30.7.1575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Lima-Mendez G, Van Helden J, Toussaint A, Leplae R. Reticulate representation of evolutionary and functional relationships between phage genomes. Mol. Biol. Evol. 2008;25:762–777. doi: 10.1093/molbev/msn023. [DOI] [PubMed] [Google Scholar]
- 87.Shannon P, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Bland C, et al. CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinform. 2007;8:1–8. doi: 10.1186/1471-2105-8-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Coutinho FH, et al. RaFAH: host prediction for viruses of Bacteria and Archaea based on protein content. Patterns. 2021;2:100274. doi: 10.1016/j.patter.2021.100274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Stothard P. The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and dna sequences. BioTechniques10.2144/00286ir01 (2018). [DOI] [PubMed]
- 91.Galiez, C., Siebert, M., Enault, F., Vincent, J. & Söding, J. WIsH: Who is the host? Predicting prokaryotic hosts from metagenomic phage contigs. Bioinformatics.33, 3113–3114 (2017). [DOI] [PMC free article] [PubMed]
- 92.Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982;157:105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original sequenced metagenomic and metatranscriptomic raw and assembled data including contigs mined and identified here as viruses along with single-amplified genomes and metagenome-assembled genomes, were available at EMBL Nucleotide Sequence Database (ENA) database under Bioproject PRJEB35712. Data on viral contigs analysed in this study (fasta nucleotide and amino acid sequences) are also available in the Source Data file that support and complement this manuscript, as well as in Figshare [10.6084/m9.figshare.24581331 and 10.6084/m9.figshare.24581334]. Source Data file provided with this paper contains additionally source data on viral classification, viral network analysis, virus-host assignment, transcriptional activity and abundance of viruses, and data on AMG information. Source data are provided with this paper.