Skip to main content
iScience logoLink to iScience
. 2020 Aug 6;23(9):101439. doi: 10.1016/j.isci.2020.101439

Adsorption Sequencing as a Rapid Method to Link Environmental Bacteriophages to Hosts

Patrick A de Jonge 1,2, FA Bastiaan von Meijenfeldt 1, Ana Rita Costa 2, Franklin L Nobrega 2, Stan JJ Brouns 2, Bas E Dutilh 1,3,
PMCID: PMC7452251  PMID: 32823052

Summary

An important viromics challenge is associating bacteriophages to hosts. To address this, we developed adsorption sequencing (AdsorpSeq), a readily implementable method to measure phages that are preferentially adsorbed to specific host cell envelopes. AdsorpSeq thus captures the key initial infection cycle step. Phages are added to cell envelopes, adsorbed phages are isolated through gel electrophoresis, after which adsorbed phage DNA is sequenced and compared with the full virome. Here, we show that AdsorpSeq allows for separation of phages based on receptor-adsorbing capabilities. Next, we applied AdsorpSeq to identify phages in a wastewater virome that adsorb to cell envelopes of nine bacteria, including important pathogens. We detected 26 adsorbed phages including common and rare members of the virome, a minority being related to previously characterized phages. We conclude that AdsorpSeq is an effective new tool for rapid characterization of environmental phage adsorption, with a proof-of-principle application to Gram-negative host cell envelopes.

Subject Areas: Ecology, Environmental Science, Microbiology, Techniques in Genetics

Graphical Abstract

graphic file with name fx1.jpg

Highlights

  • AdsorpSeq allows rapid determination of bacteria-bacteriophage interactions

  • Model phages can be differentially sequenced based on binding ability with AdsorpSeq

  • With AdsorpSeq 26 new phage-host interactions were determined in hospital wastewater


Ecology; Environmental Science; Microbiology; Techniques in Genetics

Introduction

Bacteriophages (viruses that infect bacteria) are omnipresent and impact every ecosystem (Cobián Güemes et al., 2016). Their impact on microbial communities makes phages both useful and detrimental. On the one hand, they are potential bioengineered drug delivery systems (Karimi et al., 2016) and alternatives to antimicrobials (Nobrega et al., 2015). On the other hand, they spread bacterial pathogenicity (Chen and Novick, 2009) and disrupt food production chains like milk fermentations used in the dairy industry (Marcó et al., 2012). Phages also affect ecosystems at larger scales by controlling bacterial evolution and community structure, affecting, e.g., our microbiomes (De Sordi et al., 2019; Manrique et al., 2016), marine nutrient cycling through bacterial lysis (Corinaldesi et al., 2012; Danovaro et al., 2008), and global oxygen production by encoding photosynthesis genes that are expressed in cyanobacterial hosts (Sharon et al., 2009). Although their global importance makes understanding phage-host interactions crucial, most remain undetermined (Cobián Güemes et al., 2016).

A major reason for the mass of undetermined phage-host interactions is a shortage of readily applicable viromics techniques that can simultaneously (1) identify phages in an environmental sample and (2) link them to host(s). Unstudied phage genomes can be identified with metagenomics, but despite constant improvements (e.g., Ahlgren et al., 2017; Galiez et al., 2017; Liu et al., 2019; Mihara et al., 2016; Villarroel et al., 2016; Zhang et al., 2017) it remains challenging to predict to which hosts these phages adsorb, especially at low taxonomic levels (Edwards et al., 2016). Although CRISPR-Cas memory (spacers) and prophage regions can result in reliable host predictions (Edwards et al., 2016), many bacterial lineages do not have CRISPR-Cas systems (Burstein et al., 2016) and not all phages form prophages. Beyond computational approaches, phages can be linked to their host with isolation techniques like double-layer agar plates. Such techniques depend on lytic phages to form visible plaques (Abedon and Yin, 2009; Łoś et al., 2008; Serwer et al., 2007) and can be biased to phages with narrow host ranges (Guyader and Burch, 2008; Kauffman et al., 2018). These assays furthermore often employ a few highly related hosts, each needing a separate assay (Hyman and Abedon, 2010). Thus, available information on phage host range is limited. Other proposed methods include meta3C, which infers interactions based on physical proximity of phage and host DNA (Marbouty et al., 2017), and those summarized in an earlier review (Edwards et al., 2016). However, such methods are generally cumbersome and thereby hard to implement.

An alternative approach to determining phage-host interactions is by focusing on the first step of the phage infection cycle, the adsorption of the phage to bacterial surface receptors. Although phage adsorption is not always followed by successful phage infection, it is a crucial step for successful infections and often specific (de Jonge et al., 2019a). Utilizing phage adsorption specificity could thus allow studies of phage-host interactions in environmental samples. This was recently shown through viral tagging (Deng et al., 2012; Džunková et al., 2019) where fluorescently labeled phages are added to bacteria. Bacteria bound by fluorescently labeled phages are isolated with fluorescence-activated cell sorting, and phage-bacterium pairs are sequenced. This approach allows abundant viruses to be linked to hosts, but it can remain challenging to identify phage-host links for rare members of the virome. Finally, viral tagging requires a specialized experimental setup.

Here, we rapidly identify phage-host pairings by linking cell envelope adsorption to phage sequencing and statistical analysis (adsorption sequencing or AdsorpSeq). AdsorpSeq allows identification of novel phages and their host interactions by exploiting differential migration of phages bound to host receptors and unbound phages in agarose gel electrophoresis. This enables selective sequencing of phages based on their interaction with cell envelopes of a specific host. Thereby multiple phages that interact with a given host can be rapidly and simultaneously identified. We show that model phages can be differentially identified based on the presence of their receptor molecule. Subsequently, we apply AdsorpSeq on a hospital wastewater virome and the cell envelopes of nine taxonomically distinct Gram-negative bacteria, uncovering 26 novel phage-host interactions with a range of abundances in the virome.

Results and Discussion

Identification of Model Phages Based on Adsorption to Their Hosts

AdsorpSeq aims to selectively sequence phages based on their adsorption to bacterial cell envelopes. This is achieved by a five-step process (steps 1–5 in Figure 1A). First, a phage mixture is added to a cell envelope suspension that was isolated from a bacterium of interest (step 1 in Figure 1A). An incubation then allows phage adsorption to their receptors (2). Next, agarose gel electrophoresis separates bound and unbound phages (3). Unlike unbound phages, phages bound to cell envelope suspensions will migrate slower into agarose gels owing to the larger size and altered charge of the adsorption complex. The result is rapid separation of phages based on adsorption abilities. Finally, genomic material of bound phages is isolated from the gel (4) and sequenced (5). To validate AdsorpSeq, we tested the method with Escherichia phage λ and Salmonella phage P22 as two model phages with well-described adsorption properties. These two phages differ in host, morphology, and the receptor type. The receptor of Siphoviridae phage λ is the E. coli maltose pore protein LamB (Wang et al., 2000), whereas the S. enterica subspecies enterica (hereafter: S. enterica) lipopolysaccharide O-antigen chain serves as receptor for Podoviridae phage P22 (Andres et al., 2010). Because absorption to the bacterial cell envelope does not guarantee successful infection (de Jonge et al., 2019a) AdsorpSeq may detect phage-bacteria interactions beyond infecting host range. In addition, the isolation of bacterial membrane fractions may alter the interactions of the viruses with these membranes. For example, membrane fragments can form vesicles both in normal and inverted conformations (Coakley et al., 1977; Poole, 1993), therewith supplying the phage with the ability to bind to the cell envelope side that is normally pointed inward. However, as both phages are specific to their respective receptors (Andres et al., 2010; Randall-Hazelbauer and Shwartz, 1973), we could gauge preservation of adsorption specificity in AdsorpSeq.

Figure 1.

Figure 1

AdsorpSeq Allows the Selective Sequencing of Model Phages Based on Adsorption

(A) Schematic of AdsorpSeq. It shows the main steps of (1) mixing phages with bacterial cell envelopes, (2) allowing phages to adsorb to cell envelopes, (3) separating phages using agarose gel electrophoresis based on adsorbing capability, (4) isolating the genomes of adsorbed phages, and (5) sequencing genomes of adsorbed phages isolated from gels.

(B) Adsorption of phages λ and P22 to host cell envelopes hinders their migration into agarose gels. Agarose gels of phages λ and P22 after being added to cell envelope suspensions of E. coli K12 and S. enterica S1400, and bar graphs showing DNA quantities that were isolated from the gel slots at the top of the gels. Arrows indicate the location of free phages (migrated into the gel) and adsorbed phages (in the gel slot) in the first gel. This is identical in the other gels.

(C) AdsorpSeq maintains receptor molecule specificity of phage λ. Agarose gel of phage λ after being added to E. coli strain K12, to which it can adsorb, and E. coli ΔLamB, to which it cannot adsorb. Bar graph depicts DNA isolated from gel slots at the top of the gel. Note: although the smear seems visually stronger in the K12 lane, significantly more DNA was retained in the well containing the K12 envelope fraction than in the ΔLamB envelope fraction (see bar graphs).

(D) Applying AdsorpSeq to a mixture of phages leads to differentiation based on adsorbing capacity. Stacked bar graph showing the number of reads mapped to phages λ and P22 after AdsorpSeq was applied using an equal mixture of the two phages and cell envelopes from either E. coli K12 or S. enterica S1400. Significance levels according to a paired t test, error bars depict standard deviations, points are biological replicates. ∗p < 0.05, ∗∗p < 0.01.

As a first test, we added either phage λ or P22 to E. coli and S. enterica cell envelope suspensions. Upon adding non-host cell envelopes (e.g., adding S. enterica to λ), phage particles migrated into agarose gels, whereas adding host cell envelopes (e.g., adding E. coli to λ) resulted in phage particle retention around the sample slot at the origin of electrophoresis (Figure 1B). These gel regions consequently contained significantly more DNA when using host cell envelopes than when using non-host cell envelopes (two-tailed t test, p < 0.05). This showed that adsorption of phage particles to host cell envelopes prevented migration into agarose gels. To confirm receptor specificity, we repeated the experiment with a LamB-knockout strain (ΔLamB) (Baba et al., 2006), which resulted in phage λ losing adsorption ability (Figure 1C). These results agree with earlier studies (Andres et al., 2010) showing slower migration of phage particles into agarose gels after addition of purified phage receptor particles.

After establishing that presence of host cell envelopes alters phage migration in agarose gels, we used this property to preferentially sequence phage genomes from a mixture based on the presence of host cell envelopes. We performed AdsorpSeq with a mixture of equal parts phage λ and P22, which we added to cell envelopes of either E. coli or S. enterica. Upon sequencing of genomic material isolated from agarose slices, two to three times more reads mapped to the phage λ genome than the P22 genome when E. coli cell envelopes were added and vice versa (Figure 1D). The resulting difference in phage genome abundance was highly significant (Fisher's exact test, p = 2.5 × 10−16). As AdsorpSeq was thus capable of discerning phage-host associations in a simple phage mixture, we next applied it to a complex environmental phage mixture.

AdsorpSeq Results in Selection of Unique Phage Subsets

We next applied AdsorpSeq to identify phages targeting specific bacterial cell envelopes in a complex virome derived from a hospital wastewater influent pipe (Figure S1A). In such environments, phages can be found at concentrations of 108–1010 particles per milliliter (Otawa et al., 2007; Rosario et al., 2009; Tamaki et al., 2012; Wu and Liu, 2009). Phage adsorption targets consisted of cell envelope suspensions from nine taxonomically diverse Gram-negative bacteria, including three Enterobacterales (Escherichia coli, Citrobacter freundii, and Klebsiella pneumoniae), two Pseudomonadales (Pseudomonas aeruginosa and Acinetobacter baumannii), two Bacteroidales (Bacteroides fragilis and Bacteroides dorei), one Burkholderiales (Ralstonia pickettii), and one Fusobacterales (Fusobacterium necrophorum). All these bacteria are either part of the healthy human gut microbiome (e.g., B. dorei) (Huttenhower et al., 2012) or pathogens linked with hospital infections (e.g., K. pneumoniae) (Rice, 2008). We expect that AdsorpSeq may be most useful for Gram-negative bacteria, because these bacteria possess an outer membrane that is both easily isolated and contains the main receptors targeted during phage infections (Nobrega et al., 2018; Silva et al., 2016). Phage receptors in Gram-positive bacteria are often associated with the thick outer peptidoglycan layer (Dowah and Clokie, 2018) that can be difficult to break and isolate. Thus, in the following validation and application of AdsorpSeq, we focused our efforts on Gram-negative bacteria. Next to the virome treated with the cell envelopes of these nine bacteria, we sequenced the full untreated virome as control. As we increased DNA quantities of all samples by multiple displacement amplification (MDA), which alters apparent viral community compositions (Pinard et al., 2006), the full virome was sequenced both before and after MDA. This allowed us to gauge and correct for MDA effects during data analysis (below).

Sequencing of the nine cell envelope-treated samples and the two viromes (pre- and post-MDA) resulted in 138 Mbp of read-level data. A cross-assembly resulted in 23,373 contigs longer than 2,500 bp, representing 71.4% of the total dataset, as determined by mapping the reads back to the contigs (for annotated contig metadata and contig abundances, see Table S1). Taxonomic classification showed that the cross-assembly contained 1,111 viral contigs and a further 8,921 contigs that were taxonomically unclassified, whereas the remaining 13,341 contigs were mostly derived from bacteria, with a minority of archaeal and eukaryote contigs. The group of unclassified contigs likely reflects the large numbers of unstudied human gut phages (Shkoporov and Hill, 2019). We therefore combined the viral and unclassified sets to create a dataset of 10,032 confirmed and suspected viral contigs, which represented 51.5% of total reads. The percentage of reads represented by selected contigs fluctuated across the cell envelope-treated samples, ranging from 41.0% (E. coli treated) to 81.9% (B. fragilis treated) of reads (median: 70.2%, Figure S1B). These differences between the samples were a first indication that phage adsorption depended on the cell envelope suspension used.

Although there were 842 contigs with identical ends in the selected dataset representing putatively complete and circular genomes, many contigs also represented likely genomic fragments. We therefore assigned contigs with similar tetranucleotide usage patterns and read depth patterns across the nine cell envelope-treated samples to 1,058 viral populations. Binning of contigs is based on nucleotide usage and abundance signals. Thus, the bins do not necessarily represent single phage genomes, but rather groups of contigs with similar characteristics, which we refer to as viral populations. Some viral populations may contain fragments of different phages specifically binding to a host and similar phages that non-specifically bind to the same host. Jaccard distances based on viral population relative abundances showed distinct dissimilarity between the cell envelope-treated samples (Figure S1C). This indicated that each sample contained a unique set of viral populations and thus selected different phages. The viral populations with the highest abundance across the samples were also highly abundant in both virome controls (Figures S2A and S2B) suggesting that some phage particles are retained in the wells through non-specific interactions, as also observed for λ and P22 (Figure 1D). Perhaps some phages are physically prevented from migrating into agarose gels by the vesicles that cell envelope suspensions likely form (Poole, 1993). Although we did not achieve perfect separation between bound and unbound phage particles, we concluded from the evident dissimilarity in composition between the samples (Figure S1C) that selection of phages by AdsorpSeq is dependent on the type of bacterial cell envelope used.

Selection of Phages with Putative Adsorption Activity

To identify phages that specifically adsorb to one or more of the nine bacterial species, we selected viral populations that were overrepresented in one or more of the samples. To focus on the strongest adsorbing phages first, we defined overrepresented viral populations based on outlier analysis (see Methods for details), which meant that we selected viral populations with a relative abundance of at least 1.58 times higher in one sample than in the other eight samples. Relative abundance values from samples of the same bacterial taxonomic order were discounted when determining overrepresented viral populations to allow for phages with a broad adsorption or infection host range. In total, 123 viral populations represented phages that specifically adsorb to the cell envelope fractions of one of the nine bacterial species (Figure S3A).

We next refined the selection of putatively adsorbing viral populations by applying two filters. The first filter removed viral populations that were positively selected for by MDA. MDA can result in efficient rolling circle amplification but has also been shown to lead to a bias for small single-stranded DNA (ssDNA) phage genomes (Kim and Bae, 2011; Probst et al., 2015; Yilmaz et al., 2010). This held true in our dataset, as comparing viral populations in the virome before and after MDA showed 10- to 100-fold higher amplification of ssDNA Microviridae than other phage families (Figure S3B). Note that 23,564 reads from the unamplified virome mapped to Microviridae contigs, showing that these small circular DNA viruses are abundant in the hospital virome and their detection does not fully depend on MDA. To reduce the impact of MDA bias, we thus filtered out 79 viral populations with strong MDA selection, leaving 44 viral populations that passed the MDA selection filter (Figure S3A).

In addition to MDA selection, we tested if certain phages were universally selected for by the AdsorpSeq technique (Figure S3C). This identified 18 putative adsorbing viral populations for which the relative abundance in all nine samples was higher than in post-MDA virome (Figure S3A). This methodological bias was highest among Inoviridae and Microviridae (Figure S3C). Although in silico evidence suggests that some phages may have very broad host ranges (Paez-Espino et al., 2016; Roux et al., 2016), most phages likely have a narrow host range spanning a few closely related strains within the same species or genus (de Jonge et al., 2019a; Džunková et al., 2019). We thus interpreted our findings as a methodological selection bias that may reflect the inability of a phage particle to migrate into agarose gels owing to large size, low charge, or non-specific interactions with bacterial cell envelopes. This may explain the stronger methodological selection pressure on Inoviridae, which have lipid membrane-adsorbing coat proteins (Stopar et al., 2003). The 18 viral populations that were under strong methodological bias were filtered out of the final selection of adsorbing vial populations.

After applying MDA and methodological selection filters, 26 viral populations with predicted adsorbing activity remained (Figure S3A). All 26 selected viral populations were highly specific to cell envelopes of a single bacterium and thus represented phages with a single predicted host. This was despite our allowance of broad host range at the order level in selecting viral populations but agrees with a recent report that found that broad host-range phages are rare in gut viromes based on single-cell viral tagging experiments (Džunková et al., 2019).

Notably, 22 of the 26 selected viral populations putatively adsorb to Proteobacteria cell envelopes, consistent with recent findings that Proteobacteria are the dominant bacterial phylum in global wastewater treatment plant microbiomes and are abundant in wastewater influent (Petrovich et al., 2019; Wu et al., 2019). It is thus likely that Proteobacteria phages are common in wastewater microbiomes, which supports our findings.

Selected Viral Populations Are Rare and Similar to Proteobacteria Phages

Next, we examined the final selection of 26 viral populations with adsorption predictions. Their relative abundance in the virome ranged from 0.0001% to 1%, covering the spectrum from relatively rare to relatively abundant in the virome before and after MDA (Figure 2A). Characterization of the ORFs by direct homology searches showed that the majority of ORFs (64% of total) in selected viral populations had no significant similarity to protein sequences in the National Institute for Biotechnology Information (NCBI) non-redundant (nr) database (Agarwala et al., 2017) (Figure 2B). From these findings we concluded that AdsorpSeq can be used to identify adsorption hosts of both common and rare uncharacterized environmental phages.

Figure 2.

Figure 2

Most Selected Viral Populations Represent Rare and Uncharacterized Viral Sequences

(A) Relative abundance of selected viral populations with adsorption predictions in the virome before and after MDA shows that AdsorpSeq is not biased for abundant or rare phage sequences. Numbers next to data points show the viral population number.

(B) ORF-level taxonomical predictions using CAT show most ORFs from selected viral populations have no similarities in the NCBI nr protein sequence database (dark gray). Some contigs had database hits but could not be classified because the hits involved proteins from different superkingdoms. These are labeled as unclassified (light gray).

(C) The hospital wastewater virome contained a large diversity of uncharacterized phage sequences, as shown by a gene-sharing network of 10,032 viral contigs and all phage genomes in the NCBI viral RefSeq database (Pruitt et al., 2007), made using vContact2 (Bin Jang et al., 2019). Large colored contigs represent those in the final selection of 26 putative adsorbing viral populations. Proteobacteria-infecting characterized phages are orange.

To assess AdsorpSeq host predictions, we performed a whole genome clustering of all viral populations (including those without adsorption predictions) with all characterized phages whose genome sequences were available in the NCBI bacterial and viral RefSeq V85 database (Pruitt et al., 2007) (Figure 2C). First, we observed that the contigs of the viral populations identified with AdsorpSeq clustered together, confirming our contig binning approach. Second, most of the 22 selected viral populations that were predicted to adsorb to Proteobacteria species were similar to characterized Proteobacteria phages. This supports the notion that the viral populations that were detected with AdsorpSeq on the cell envelopes of specific bacterial pathogens reflect naturally occurring phages capable of infecting these bacteria.

Selected Viral Populations with Similarity to Characterized Phages

To further assess the phage-host associations detected by AdsorpSeq, we searched for matches between contigs in selected viral populations and bacterial CRISPR-Cas spacers. We identified 1.4 million spacers predicted from bacteria in the Pathosystems Resource Integration Center (PATRIC) database (Wattam et al., 2014) and queried them against contigs in the selected viral populations. No full-length identical protospacers were detected on any of the viral contigs. Two contigs from viral populations adsorbing to Pseudomonas aeruginosa contained spacer hits with a single mismatch each. These hits originated from CRISPR-Cas arrays encoded on the genomes of Aeromonas caviae and Tolumonas auensis, which are members of the same taxonomical class as P. aeruginosa (Gammaproteobacteria). Several examples have been observed of phage genera that adsorb to differently related bacteria, such as Tequatroviruses (Nolan et al., 2006), Gap227likeviruses (Wang et al., 2016), and Plpelikeviruses (Comeau et al., 2012; Walker et al., 2019). Alternatively, cell envelope adsorption does not necessitate successful infection, as many factors play a role in the completion of the infection cycle (de Jonge et al., 2019a), including differences in transcriptional programs between hosts (Howard-Varona et al., 2018) or the presence of molecular defense systems (Hampton et al., 2020; Labrie et al., 2010). No other selected viral populations contained contigs with spacer hits with fewer than five mismatches. As five mismatches corresponds to a ∼50% false discovery rate at the species level (Edwards et al., 2016), hits with more mismatches were not considered for analysis.

Next, we placed the viral populations identified with AdsorpSeq in the context of characterized phages by using protein sharing networks. We narrowed our search to contigs that contained over five ORFs with known homologs, yielding five selected viral populations with extensive similarity to characterized phages (Figure 3 and Table S3).

Figure 3.

Figure 3

Selected Viral Populations Related to Characterized Phages

Protein-sharing networks of viral populations show their relationships to characterized phages. ORFs from selected viral populations were used in BLASTp searches against proteins of phages in the viral RefSeq database (Pruitt et al., 2007). Bubbles are phages. Edge color and labels show similar protein counts (E-value ≤ 10−5).

(A) P. aeruginosa-adsorbing viral population 292.

(B) P. aeruginosa-adsorbing viral population 447.

(C) C. freundii-adsorbing viral population 4720. One additional contig did not share protein similarity to characterized phages.

(D) E. coli-adsorbing viral population 18.

(E) E. coli-adsorbing viral population 2019.

First, P. aeruginosa-adsorbing viral population 292 contained a single circular contig on which 15/85 ORFs (17.6%) were similar to proteins from the known Pseudomonas phages JBD44, phi297, and YMC11/07/P54_PAE_BP (Figure 3A). Notably, phi297 infects the P. aeruginosa strain used in this study (Bourkal'tseva et al., 2011). The fifteen shared proteins included the often conserved terminase (Low et al., 2019; Serwer et al., 2004) and several adjacent genes (Figure 4A and Table S4), stressing the relatedness between these phages. Following earlier practice (Lavigne et al., 2008), this protein similarity would classify these phages into the same taxonomical family. Although the conservative CAT ORF-level predictions provided no taxonomic classifications for most ORFs in viral population 292, direct homology searches found significant similarity (BLASTp, e-value ≤ 10−5) to proteins from Pseudomonas bacteria for 45 of the 85 ORFs (53%, Figure 4A and Table S4) from viral population 292. This viral population thus represents a novel Pseudomonas phage identified in the hospital sewage inlet by AdsorpSeq with P. aeruginosa cell envelopes as bait.

Figure 4.

Figure 4

Several Viral Populations and their Relations to Characterized Families

(A) Similarity of viral population 447, containing contig 356, to Pseudomonas phages JBD44 and phi297. Depicted is a whole genome comparison made using Easyfig (Sullivan et al., 2011). In the line representing contig 356, the top half shows ORFs with BLASTp hit against Pseudomonas bacteria proteins in the NCBI nr database, whereas the bottom half shows protein function.

(B) Similarity between five contigs from C. freundii-adsorbing viral population 4720 and T4-like Citrobacter phage Margaery, as shown by genome comparisons made using Easyfig (Sullivan et al., 2011). Numbers indicate contig numbers, contig 13,003 was placed below phage Margaery as it overlaps with contig 18,603. Colors indicate tBLASTx hits and use the same legend as (A).

(C) The relation of E. coli-adsorbing viral populations 18 and 2019 to jumbo phages displayed in an unrooted approximate maximum likelihood tree of jumbo phage terminases. Dots on branches represent ultrafast bootstrap support of ≥85 (Hoang et al., 2018).

A second viral population that also adsorbed to P. aeruginosa (447) contained a contig of 38,882 bp that had 5/62 ORFs with similarity to Myxococcus phage Mx8 (Figure 3B), which translates to less than 5% of 85 Mx8 ORFs. The similarity between viral population 447 and phage Mx8 is thus limited to a small number of genes, which, although not adjacent, all belong to the structural section of the Mx8 genome (Table S3). We suggest that these limited shared proteins reflect shared gene cassettes (Hatfull and Hendrix, 2011; Lavigne et al., 2008).

In addition to the Pseudomonas-adsorbing viral populations, viral population 4720 represents a Citrobacter-adsorbing phage. This viral population contains five short (2,974–4,505 bp) contigs. The four longest of these had significant protein sequence similarity (BLASTp, e-value ≤ 10−5) to at least five ORFs from several Citrobacter phages (Figure 3C), whereas all five contigs showed full-length sequence similarity to Citrobacter phage Margaery (tBLASTx, e-value ≤ 0.001, Figure 4B). Combined, they shared similarity to 25/280 (9%) Citrobacter phage Margaery proteins and several other T4-like Citrobacter phages (Figure 4B). As these known Citrobacter phages have genomes over 150,000 bp, we suggest that viral population 4720 may represent fragments of a larger Citrobacter phage genome.

Finally, two selected E. coli-adsorbing viral populations (18 and 2019) shared five to eighteen ORFs with several known jumbo phages (Figures 3D and 3E). These included Yersinia phage phiR1-37 and Erwinia phage Ea35-70, both of which infect bacteria from the same taxonomic order as E. coli (Gammaproteobacteria). Notably, E. coli phages like T4 (Tétart et al., 1996) and T3 (Garcia et al., 2003) need only limited genomic alterations to extend their host range to Yersinia species. To gain added insight into the relation of these phages to other jumbo phages, we built a phylogeny of their terminase genes (Figure 4C). Earlier analysis showed that jumbo phages with phylogenetically closely related terminases often infect related hosts (Yuan and Gao, 2017). For our analysis, we gathered 148 sequences from the NCBI nr database (Agarwala et al., 2017) and 74 jumbo phage proteins from a recent study (Al-Shayeb et al., 2020) that had significant homology to viral population 18 and 2019 terminases (BLASTp, e-value ≤ 10−5). The resulting tree placed viral population 2019 in a branch that mostly held recently described jumbo phages (Al-Shayeb et al., 2020), with hosts predicted to belong to the Firmicutes. Viral population 18 terminase belonged to a more diverse clade with multiple Ralstonia phages, a single Alteromonadaceae bacterial sequence (a γ-proteobacteria species), and multiple jumbo phages previously predicted to infect proteobacterial hosts (Al-Shayeb et al., 2020). These phages might interact with a surface element that is common to all these hosts. This hypothesis is reinforced by the fact that Ralstonia were long assigned to the genus Pseudomonas within the γ-proteobacteria, despite differences in membrane composition (Yabuuchi et al., 1995) and lipopolysaccharide structure (Zdorovenko et al., 2008). Alternatively, in similar fashion to Enterobacteria phage phi92 (Schwarzer et al., 2012), they may harbor multiple receptor-adsorbing proteins in their large genomes. Besides the terminase, the circular contig in viral population 18 notably also contained an FtsZ homolog. In Pseudomonas jumbo phages, FtsZ-proteins are part of a nucleus-like defense mechanism (Chaikeeratisak et al., 2017; Mendoza et al., 2019), together with a nucleus-forming protein. Interestingly, protein homology searches did not identify similar nucleus-forming proteins in viral population 18, although this population consists of a single circular contig. It may thus contain a yet unknown system like the nucleus-like defense mechanism found in Pseudomonas jumbo phages or use the FtsZ homolog in a system that is different altogether.

The results discussed above represent 6 of 26 AdsorpSeq-selected viral populations with at least five protein similarities to characterized phages. The ORFs encoded on the remaining 20 selected viral populations represented novel or highly divergent proteins, unrelated to previously characterized phages. Together, our results underscore the ability of AdsorpSeq to uncover environmental phages and their potential host associations, without bias for known or abundant phages.

Conclusion

Despite recent advances in viromics, linking environmental phage sequences to hosts remains problematic (Edwards et al., 2016). Here, we presented AdsorpSeq, a rapid method for detecting phage adsorption to host cellular envelopes. AdsorpSeq is easily implementable, as besides sequencing it uses only commonly available laboratory methods, such as cell disruption and agarose gel electrophoresis.

In the current study, we validated AdsorpSeq on model bacteriophages. This showed that AdsorpSeq can separate phages based on the presence of a specific host receptor in bacterial cell envelopes. Future improvements to AdsorpSeq could exploit this feature, for instance, through heterologous expression of certain bacterial features of interest and uncovering phages that adsorb to them. Therewith, AdsorpSeq could aid in uncovering interactions between phages and microbial surface molecules like antimicrobial efflux pumps. AdsorpSeq uncouples analysis of phage adsorption and phage infection, potentially allowing for analysis of phage-bacterium interactions that do not result in infection, for instance, owing to intracellular defense systems (Hampton et al., 2020).

The experiments laid out in this study identified 26 phage-host interactions in a hospital wastewater virome. Most of these were predictions of phages adsorbing to Proteobacteria, which is consistent with recent findings that these bacteria are ubiquitous and abundant in global waste water communities (Petrovich et al., 2019; Wu et al., 2019). Several of the putative adsorbing viral populations represented rare members of the sampled hospital wastewater virome. Although rare here, these viruses may still be important in another time and place as they become transiently dominant through ecological dynamics (Arkhipova et al., 2018). Moreover, expanding our knowledge to include rare members of the virosphere is important to address fundamental questions about the evolution of viral genes and genomes (de Jonge et al., 2019b; Mavrich and Hatfull, 2017), to identify candidate viruses with potentially promising genomic properties for phage therapy (Nobrega et al., 2015) and for targeted monitoring (Metsky et al., 2019). This underscores the necessity for novel methods such as AdsorpSeq with which phage-host interactions can be rapidly assessed, even for uncommon phages within a complex environmental mixture.

Limitations of the Study

Here we presented AdsorpSeq as a method to link environmental bacteriophages to their bacterial hosts. An important caveat for future AdsorpSeq applications is that it determined whether phages bind to the cell surface of a host cell, which is not necessarily followed by a successful infection. Investigations on the ability of phages with AdsorpSeq binding predictions to complete infections, such as phage isolations, would thus be a useful addition of the method. Of course, targeted phage isolation is a non-trivial task, and an inability to observe infection in methods such as plaque assays does not necessarily indicate an inability to infect, as phage infections can be (pseudo-)lysogenic or chronic and as not all lytic phages form plaques (Serwer et al., 2007). Additionally, we focused our efforts on Gram-negative bacterial envelopes, whereas the applicability of AdsorpSeq to Gram-positive bacteria remains to be tested.

Resource Availability

Lead Contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Bas E. Dutilh (bedutilh@gmail.com)

Materials Availability

Materials and protocols used in this study are available from the authors upon request. This study did not generate new unique reagents.

Data and Code Availability

The accession number for the sequencing data project reported in this paper is ENA: PRJEB37817. The accession number of sequencing reads reported in this paper are ENA: ERS4427880–ERS4427890, whereas the accession number of cross-assembled contigs reported in this paper is ENA: ERZ1305919.

Methods

All methods can be found in the accompanying Transparent Methods supplemental file.

Acknowledgments

We thank Utrecht Sequencing Facility for providing sequencing service and data. Utrecht Sequencing Facility is subsidized by the University Medical Centre Utrecht, Hubrecht Institute, Utrecht University and The Netherlands X-omics Initiative (Netherlands Organisation for Scientific Research (NWO) project 184.034.019). We further thank Reinier de Graaf hospital in Delft (the Netherlands) for providing wastewater samples and B.E. Estrada Bonilla (TU Delft) for assisting in sample filtration. P.A.d.J., F.A.B.v.M., and B.E.D. were supported by NWO Vidi grant 864.14.004. F.A.B.v.M. and B.E.D. were supported by and ERC Consolidator grant 865694: DiversiPHI. F.L.N. was supported by NWO Veni grant 016.Veni.181.092, and S.J.J.B. was supported by Vici grant VI.C.182.027 and European Research Council (ERC) Stg grant 639707.

Author Contributions

P.A.d.J., F.A.B.v.M., and B.E.D. designed and performed the experiments. P.A.d.J. and F.A.B.v.M. analyzed the data. A.R.C., F.L.N., and S.J.J.B. assisted with experimental design and data interpretation. P.A.d.J. and B.E.D. prepared the figures and wrote the paper with input from all other authors.

Declaration of Interests

The authors declare no competing interests.

Published: September 25, 2020

Footnotes

Supplemental Information can be found online at https://doi.org/10.1016/j.isci.2020.101439.

Supplemental Information

Document S1. Transparent Methods and Figures S1–S4
mmc1.pdf (913KB, pdf)
Table S1. Read Depth Data of All Cross-Assemble Contigs, Related to Figure 1
mmc2.xlsx (1.8MB, xlsx)
Table S2. CAT Classifications of ORFs in Selected Viral Populations, Related to Figures 3 and 4
mmc3.xlsx (59.2KB, xlsx)
Table S3. BLASTp Hits to characterized Phages Among Selected Contigs, Figure 3 and 4
mmc4.xlsx (66.8KB, xlsx)
Table S4. Annotation of Viral Population 292, Related to Figure 3
mmc5.xlsx (12.8KB, xlsx)

References

  1. Abedon S.T., Yin J. Bacteriophage plaques: theory and analysis. In: Clokie M.R.J., Kropinski A., editors. Bacteriophages: Methods and Protocols, Volume 1: Isolation, Characterization, and Interactions. Humana Press; 2009. pp. 161–174. [Google Scholar]
  2. Agarwala R., Barrett T., Beck J., Benson D.A., Bollin C., Bolton E., Bourexis D., Brister J.R., Bryant S.H., Canese K. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2017;45:D12–D17. doi: 10.1093/nar/gkw1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ahlgren N.A., Ren J., Lu Y.Y., Fuhrman J.A., Sun F. Alignment-free d2∗ oligonucleotide frequency dissimilarity measure improves prediction of hosts from metagenomically-derived viral sequences. Nucleic Acids Res. 2017;45:39–53. doi: 10.1093/nar/gkw1002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Al-Shayeb B., Sachdeva R., Chen L.X., Ward F., Munk P., Devoto A., Castelle C.J., Olm M.R., Bouma-Gregson K., Amano Y. Clades of huge phages from across Earth’s ecosystems. Nature. 2020;578:425–431. doi: 10.1038/s41586-020-2007-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Andres D., Hanke C., Baxa U., Seul A., Barbirz S., Seckler R. Tailspike interactions with lipopolysaccharide effect DNA ejection from phage P22 particles in vitro. J. Biol. Chem. 2010;285:36768–36775. doi: 10.1074/jbc.M110.169003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Arkhipova K., Skvortsov T., Quinn J.P., McGrath J.W., Allen C.C.R., Dutilh B.E., McElarney Y., Kulakov L.A. Temporal dynamics of uncultured viruses: a new dimension in viral diversity. ISME J. 2018;12:199–211. doi: 10.1038/ismej.2017.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baba T., Ara T., Hasegawa M., Takai Y., Okumura Y., Baba M., Datsenko K.A., Tomita M., Wanner B.L., Mori H. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2006;2:2006.0008. doi: 10.1038/msb4100050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bin Jang H., Bolduc B., Zablocki O., Kuhn J.H., Roux S., Adriaenssens E.M., Brister J.R., Kropinski A.M., Krupovic M., Lavigne R. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 2019;37:632–639. doi: 10.1038/s41587-019-0100-8. [DOI] [PubMed] [Google Scholar]
  9. Bourkal’tseva M.V., Krylov S.V., Kropinski A.M., Pleteneva E.A., Shaburova O.V., Krylov V.N. Bacteriophage phi297, a new species of Pseudomonas aeruginosa temperate phages with a mosaic genome: potential use in phage therapy. Russ. J. Genet. 2011;47:794–798. [PubMed] [Google Scholar]
  10. Burstein D., Sun C.L., Brown C.T., Sharon I., Anantharaman K., Probst A.J., Thomas B.C., Banfield J.F. Major bacterial lineages are essentially devoid of CRISPR-Cas viral defence systems. Nat. Commun. 2016;7:1–8. doi: 10.1038/ncomms10613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chaikeeratisak V., Nguyen K., Khanna K., Brilot A.F., Erb M.L., Coker J.K.C., Vavilina A., Newton G.L., Buschauer R., Pogliano K. Assembly of a nucleus-like structure during viral replication in bacteria. Science. 2017;355:194–197. doi: 10.1126/science.aal2130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chen J., Novick R.P. Phage-mediated intergeneric transfer of toxin genes. Science. 2009;323:139–141. doi: 10.1126/science.1164783. [DOI] [PubMed] [Google Scholar]
  13. Coakley W.T., Bater A.J., Lloyd D. Disruption of micro-organisms. Adv. Microb. Physiol. 1977;16:279–341. doi: 10.1016/s0065-2911(08)60050-8. [DOI] [PubMed] [Google Scholar]
  14. Cobián Güemes A.G., Youle M., Cantú V.A., Felts B., Nulton J., Rohwer F. Viruses as winners in the game of life. Annu. Rev. Virol. 2016;3:197–214. doi: 10.1146/annurev-virology-100114-054952. [DOI] [PubMed] [Google Scholar]
  15. Comeau A.M., Tremblay D., Moineau S., Rattei T., Kushkina A.I., Tovkach F.I., Krisch H.M., Ackermann H.W. Phage morphology recapitulates phylogeny: the comparative genomics of a new group of myoviruses. PLoS One. 2012;7:1–11. doi: 10.1371/journal.pone.0040102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Corinaldesi C., Dell’Anno A., Danovaro R. Viral infections stimulate the metabolism and shape prokaryotic assemblages in submarine mud volcanoes. ISME J. 2012;6:1250–1259. doi: 10.1038/ismej.2011.185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Danovaro R., Dell’Anno A., Corinaldesi C., Magagnini M., Noble R., Tamburini C., Weinbauer M. Major viral impact on the functioning of benthic deep-sea ecosystems. Nature. 2008;454:1084–1087. doi: 10.1038/nature07268. [DOI] [PubMed] [Google Scholar]
  18. Deng L., Gregory A., Yilmaz S., Poulos B.T., Hugenholtz P., Sullivan M.B. Contrasting life strategies of viruses that infect photo- and heterotrophic bacteria, as revealed by viral tagging. MBio. 2012;3:e00373-12. doi: 10.1128/mBio.00373-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Dowah A.S.A.A., Clokie M.R.J.J. Review of the nature, diversity and structure of bacteriophage receptor binding proteins that target Gram-positive bacteria. Biophys. Rev. 2018;1:1–8. doi: 10.1007/s12551-017-0382-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Džunková M., Low S.J., Daly J.N., Deng L., Rinke C., Hugenholtz P. Defining the human gut host–phage network through single-cell viral tagging. Nat. Microbiol. 2019;4:2192–2203. doi: 10.1038/s41564-019-0526-2. [DOI] [PubMed] [Google Scholar]
  21. Edwards R.A., McNair K., Faust K., Raes J., Dutilh B.E. Computational approaches to predict bacteriophage-host relationships. FEMS Microbiol. Rev. 2016;40:258–272. doi: 10.1093/femsre/fuv048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Galiez C., Siebert M., Enault F., Vincent J., Söding J. WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs. Bioinformatics. 2017;33:3113–3114. doi: 10.1093/bioinformatics/btx383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Garcia E., Elliott J.M., Ramanculov E., Chain P.S.G.G., Chu M.C., Molineux I.J. The genome sequence of Yersinia pestis bacteriophage ϕA1122 reveals an intimate history with the coliphage T3 and T7 genomes. J. Bacteriol. 2003;185:5248–5262. doi: 10.1128/JB.185.17.5248-5262.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Guyader S., Burch C.L. Optimal foraging predicts the ecology but not the evolution of host specialization in bacteriophages. PLoS One. 2008;3:e1946. doi: 10.1371/journal.pone.0001946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hampton H.G., Watson B.N.J., Fineran P.C. The arms race between bacteria and their phage foes. Nature. 2020;577:327–336. doi: 10.1038/s41586-019-1894-8. [DOI] [PubMed] [Google Scholar]
  26. Hatfull G.F., Hendrix R.W. Bacteriophages and their genomes. Curr. Opin. Virol. 2011;1:298–303. doi: 10.1016/j.coviro.2011.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hoang D.T., Chernomor O., von Haeseler A., Minh B.Q., Vinh L.S. UFBoot2: improving the ultrafast bootstrap approximation. Molecular biology and evolution. Mol. Biol. Evol. 2018;35:518–522. doi: 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Howard-Varona C., Hargreaves K.R., Solonenko N.E., Markillie L.M., White R.A., Brewer H.M., Ansong C., Orr G., Adkins J.N., Sullivan M.B. Multiple mechanisms drive phage infection efficiency in nearly identical hosts. ISME J. 2018;12:1605–1618. doi: 10.1038/s41396-018-0099-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Huttenhower C., Fah Sathirapongsasuti J., Segata N., Gevers D., Earl A.M., Fitzgerald M.G., Young S.K., Zeng Q., Alm E.J., Alvarado L. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–214. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hyman P., Abedon S.T. Advances in Applied Microbiology. Elsevier Inc.; 2010. Bacteriophage host range and bacterial resistance; pp. 217–248. [DOI] [PubMed] [Google Scholar]
  31. de Jonge P.A., Nobrega F.L., Brouns S.J.J., Dutilh B.E. Molecular and evolutionary determinants of bacteriophage host range. Trends Microbiol. 2019;27:51–63. doi: 10.1016/j.tim.2018.08.006. [DOI] [PubMed] [Google Scholar]
  32. de Jonge P.A., von Meijenfeldt F.A.B., van Rooijen L.E., Brouns S.J.J., Dutilh B.E. Evolution of BACON domain tandem repeats in crAssphage and novel gut bacteriophage lineages. Viruses. 2019;11:1085. doi: 10.3390/v11121085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Karimi M., Mirshekari H., Moosavi Basri S.M., Bahrami S., Moghoofei M., Hamblin M.R. Bacteriophages and phage-inspired nanocarriers for targeted delivery of therapeutic cargos. Adv. Drug Deliv. Rev. 2016;106:45–62. doi: 10.1016/j.addr.2016.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kauffman K.M., Hussain F.A., Yang J., Arevalo P., Brown J.M., Chang W.K., VanInsberghe D., Elsherbini J., Sharma R.S., Cutler M.B. A major lineage of non-tailed dsDNA viruses as unrecognized killers of marine bacteria. Nature. 2018;554:118–122. doi: 10.1038/nature25474. [DOI] [PubMed] [Google Scholar]
  35. Kim K.H., Bae J.W. Amplification methods bias metagenomic libraries of uncultured single-stranded and double-stranded DNA viruses. Appl. Environ. Microbiol. 2011;77:7663–7668. doi: 10.1128/AEM.00289-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Labrie S.J., Samson J.E., Moineau S. Bacteriophage resistance mechanisms. Nat. Rev. Microbiol. 2010;8:317–327. doi: 10.1038/nrmicro2315. [DOI] [PubMed] [Google Scholar]
  37. Lavigne R., Seto D., Mahadevan P., Ackermann H.W., Kropinski A.M. Unifying classical and molecular taxonomic classification: analysis of the Podoviridae using BLASTP-based tools. Res. Microbiol. 2008;159:406–414. doi: 10.1016/j.resmic.2008.03.005. [DOI] [PubMed] [Google Scholar]
  38. Liu D., Ma Y., Jiang X., He T. Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion. BMC Bioinformatics. 2019;20:1–10. doi: 10.1186/s12859-019-3082-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Low S.J., Džunková M., Chaumeil P.-A., Parks D.H., Hugenholtz P. Evaluation of a concatenated protein phylogeny for classification of tailed double-stranded DNA viruses belonging to the order Caudovirales. Nat. Microbiol. 2019;4:1306–1315. doi: 10.1038/s41564-019-0448-z. [DOI] [PubMed] [Google Scholar]
  40. Łoś J.M., Golec P., Węgrzyn G., Węgrzyn A., Łoś M. Simple method for plating Escherichia coli bacteriophages forming very small plaques or no plaques under standard conditions. Appl. Environ. Microbiol. 2008;74:5113–5120. doi: 10.1128/AEM.00306-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Manrique P., Bolduc B., Walk S.T., van der Oost J., de Vos W.M., Young M.J. Healthy human gut phageome. Proc. Natl. Acad. Sci. U S A. 2016;113:10400–10405. doi: 10.1073/pnas.1601060113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Marbouty M., Baudry L., Cournac A., Koszul R. Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay. Sci. Adv. 2017;3:e1602105. doi: 10.1126/sciadv.1602105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Marcó M.B., Moineau S., Quiberoni A. Bacteriophages and dairy fermentations. Bacteriophage. 2012;2:149–158. doi: 10.4161/bact.21868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Mavrich T.N., Hatfull G.F. Bacteriophage evolution differs by host, lifestyle and genome. Nat. Microbiol. 2017;2:1–9. doi: 10.1038/nmicrobiol.2017.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Mendoza S.D., Nieweglowska E.S., Govindarajan S., Leon L.M., Berry J.D., Tiwari A., Chaikeeratisak V., Pogliano J., Agard D.A., Bondy-Denomy J. A bacteriophage nucleus-like compartment shields DNA from CRISPR nucleases. Nature. 2019;577:244–248. doi: 10.1038/s41586-019-1786-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Metsky H.C., Siddle K.J., Gladden-Young A., Qu J., Yang D.K., Brehio P., Goldfarb A., Piantadosi A., Wohl S., Carter A. Capturing sequence diversity in metagenomes with comprehensive and scalable probe design. Nat. Biotechnol. 2019;37:160–168. doi: 10.1038/s41587-018-0006-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Mihara T., Nishimura Y., Shimizu Y., Nishiyama H., Yoshikawa G., Uehara H., Hingamp P., Goto S., Ogata H. Linking virus genomes with host taxonomy. Viruses. 2016;8:10–15. doi: 10.3390/v8030066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Nobrega F.L., Costa A.R., Kluskens L.D., Azeredo J. Revisiting phage therapy: new applications for old resources. Trends Microbiol. 2015;23:185–191. doi: 10.1016/j.tim.2015.01.006. [DOI] [PubMed] [Google Scholar]
  49. Nobrega F.L., Vlot M., de Jonge P.A., Dreesens L.L., Beaumont H.J.E., Lavigne R., Dutilh B.E., Brouns S.J.J. Targeting mechanisms of tailed bacteriophages. Nat. Rev. Microbiol. 2018;16:760–773. doi: 10.1038/s41579-018-0070-8. [DOI] [PubMed] [Google Scholar]
  50. Nolan J.M., Petrov V., Bertrand C., Krisch H.M., Karam J.D. Genetic diversity among five T4-like bacteriophages. Virol. J. 2006;3:1–15. doi: 10.1186/1743-422X-3-30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Otawa K., Lee S.H., Yamazoe A., Onuki M., Satoh H., Mino T. Abundance, diversity, and dynamics of viruses on microorganisms in activated sludge processes. Microb. Ecol. 2007;53:143–152. doi: 10.1007/s00248-006-9150-9. [DOI] [PubMed] [Google Scholar]
  52. Paez-Espino D., Eloe-Fadrosh E.A., Pavlopoulos G.A., Thomas A.D., Huntemann M., Mikhailova N., Rubin E., Ivanova N.N., Kyrpides N.C. Uncovering Earth’s virome. Nature. 2016;536:425–430. doi: 10.1038/nature19094. [DOI] [PubMed] [Google Scholar]
  53. Petrovich M.L., Ben Maamar S., Hartmann E.M., Murphy B.T., Poretsky R.S., Wells G.F. Viral composition and context in metagenomes from biofilm and suspended growth municipal wastewater treatment plants. Microb. Biotechnol. 2019;12:1324–1336. doi: 10.1111/1751-7915.13464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Pinard R., de Winter A., Sarkis G.J., Gerstein M.B., Tartaro K.R., Plant R.N., Egholm M., Rothberg J.M., Leamon J.H. Assessment of whole genome amplification-induced bias through high-throughput, massively parallel whole genome sequencing. BMC Genomics. 2006;7:216. doi: 10.1186/1471-2164-7-216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Poole R.K. The isolation of membranes from bacteria. Methods Mol. Biol. 1993;19:109–122. doi: 10.1385/0-89603-236-1:109. [DOI] [PubMed] [Google Scholar]
  56. Probst A.J., Weinmaier T., DeSantis T.Z., Santo Domingo J.W., Ashbolt N. New perspectives on microbial community distortion after whole-genome amplification. PLoS One. 2015;10:1–16. doi: 10.1371/journal.pone.0124158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pruitt K.D., Tatusova T., Maglott D.R. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007;35:61–65. doi: 10.1093/nar/gkl842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Randall-Hazelbauer L., Shwartz M. Isolation of the bacteriophage lambda receptor from E. coli. J. Bacteriol. 1973;116:1436–1446. doi: 10.1128/jb.116.3.1436-1446.1973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Rice L.B. Federal funding for the study of antimicrobial resistance in nosocomial pathogens: No ESKAPE. J. Infect. Dis. 2008;197:1079–1081. doi: 10.1086/533452. [DOI] [PubMed] [Google Scholar]
  60. Rosario K., Nilsson C., Lim Y.W., Ruan Y., Breitbart M. Metagenomic analysis of viruses in reclaimed water. Environ. Microbiol. 2009;11:2806–2820. doi: 10.1111/j.1462-2920.2009.01964.x. [DOI] [PubMed] [Google Scholar]
  61. Roux S., Brum J.R., Dutilh B.E., Sunagawa S., Duhaime M.B., Loy A., Poulos B.T., Solonenko N., Lara E., Poulain J. Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses. Nature. 2016;537:689–693. doi: 10.1038/nature19366. [DOI] [PubMed] [Google Scholar]
  62. Schwarzer D., Buettner F.F.R., Browning C., Nazarov S., Rabsch W., Bethe A., Oberbeck A., Bowman V.D., Stummeyer K., Muhlenhoff M. A multivalent adsorption apparatus explains the broad host range of phage phi92: a comprehensive genomic and structural analysis. J. Virol. 2012;86:10384–10398. doi: 10.1128/JVI.00801-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Serwer P., Hayes S.J., Zaman S., Lieman K., Rolando M., Hardies S.C. Improved isolation of undersampled bacteriophages: finding of distant terminase genes. Virology. 2004;329:412–424. doi: 10.1016/j.virol.2004.08.021. [DOI] [PubMed] [Google Scholar]
  64. Serwer P., Hayes S.J., Thomas J.A., Hardies S.C. Propagating the missing bacteriophages: a large bacteriophage in a new class. Virol. J. 2007;4:21. doi: 10.1186/1743-422X-4-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Sharon I., Alperovitch A., Rohwer F., Haynes M., Glaser F., Atamna-Ismaeel N., Pinter R.Y., Partensky F., Koonin E.V., Wolf Y.I. Photosystem I gene cassettes are present in marine virus genomes. Nature. 2009;461:258–262. doi: 10.1038/nature08284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shkoporov A.N., Hill C. Bacteriophages of the human gut: the “known unknown” of the microbiome. Cell Host Microbe. 2019;25:195–209. doi: 10.1016/j.chom.2019.01.017. [DOI] [PubMed] [Google Scholar]
  67. Silva J.B., Storms Z., Sauvageau D., Bertozzi Silva J., Storms Z., Sauvageau D. Host receptors for bacteriophage adsorption. FEMS Microbiol. Lett. 2016;363:1–11. doi: 10.1093/femsle/fnw002. [DOI] [PubMed] [Google Scholar]
  68. De Sordi L., Lourenço M., Debarbieux L. The battle within: interactions of bacteriophages and bacteria in the gastrointestinal tract. Cell Host Microbe. 2019;25:210–218. doi: 10.1016/j.chom.2019.01.018. [DOI] [PubMed] [Google Scholar]
  69. Stopar D., Spruijt R.B., Wolfs C.J.A.M., Hemminga M.A. Protein-lipid interactions of bacteriophage M13 major coat protein. Biochim. Biophys. Acta. 2003;1611:5–15. doi: 10.1016/s0005-2736(03)00047-6. [DOI] [PubMed] [Google Scholar]
  70. Sullivan M.J., Petty N.K., Beatson S.A. Easyfig: a genome comparison visualizer. Bioinformatics. 2011;27:1009–1010. doi: 10.1093/bioinformatics/btr039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Tamaki H., Zhang R., Angly F.E., Nakamura S., Hong P.-Y., Yasunaga T., Kamagata Y., Liu W.-T. Metagenomic analysis of DNA viruses in a wastewater treatment plant in tropical climate. Environ. Microbiol. 2012;14:441–452. doi: 10.1111/j.1462-2920.2011.02630.x. [DOI] [PubMed] [Google Scholar]
  72. Tétart F., Repoila F., Monod C., Krisch H.M. Bacteriophage T4 host range is expanded by duplications of a small domain of the tail fiber adhesin. J. Mol. Biol. 1996;258:726–731. doi: 10.1006/jmbi.1996.0281. [DOI] [PubMed] [Google Scholar]
  73. Villarroel J., Kleinheinz K.A., Jurtz V.I., Zschach H., Lund O., Nielsen M., Larsen M.V. HostPhinder: a phage host prediction tool. Viruses. 2016;8:1–22. doi: 10.3390/v8050116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Walker P.J., Siddell S.G., Lefkowitz E.J., Mushegian A.R., Dempsey D.M., Dutilh B.E., Harrach B., Harrison R.L., Hendrickson R.C., Junglen S. Changes to virus taxonomy and the international code of virus classification and nomenclature ratified by the international committee on taxonomy of viruses (2019) Arch. Virol. 2019;164:2417–2429. doi: 10.1007/s00705-019-04306-w. [DOI] [PubMed] [Google Scholar]
  75. Wang J., Hofnung M., Charbit A. The C-terminal portion of the tail fiber protein of bacteriophage lambda is responsible for binding to LamB, its receptor at the surface of Escherichia coli K-12. J. Bacteriol. 2000;182:508–512. doi: 10.1128/jb.182.2.508-512.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Wang J.-B., Lin N.-T., Tseng Y.-H., Weng S.-F. Genomic characterization of the novel Aeromonas hydrophila phage Ahp1 suggests the derivation of a new subgroup from phiKMV-like family. PLoS One. 2016;11:e0162060. doi: 10.1371/journal.pone.0162060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Wattam A.R., Abraham D., Dalay O., Disz T.L., Driscoll T., Gabbard J.L., Gillespie J.J., Gough R., Hix D., Kenyon R. PATRIC, the bacterial bioinformatics database and analysis resource. Nucleic Acids Res. 2014;42:581–591. doi: 10.1093/nar/gkt1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Wu Q., Liu W.-T. Determination of virus abundance, diversity and distribution in a municipal wastewater treatment plant. Water Res. 2009;43:1101–1109. doi: 10.1016/j.watres.2008.11.039. [DOI] [PubMed] [Google Scholar]
  79. Wu L., Ning D., Zhang B., Li Y., Zhang P., Shan X., Zhang Q., Brown M., Li Z., Van Nostrand J.D. Global diversity and biogeography of bacterial communities in wastewater treatment plants. Nat. Microbiol. 2019;4:1183–1195. doi: 10.1038/s41564-019-0426-5. [DOI] [PubMed] [Google Scholar]
  80. Yabuuchi E., Yano I., Hotta H., Nishiuchi Y., Kosako Y., Yano I., Hotta H., Nishiuchi Y. Transfer of two burkholderia and an alcaligenes species to Ralstonia. Gen. Nov. Microbiol. Immunol. 1995;39:897–904. doi: 10.1111/j.1348-0421.1995.tb03275.x. [DOI] [PubMed] [Google Scholar]
  81. Yilmaz S., Allgaier M., Hugenholtz P. Multiple displacement amplification compromises quantitative analysis of metagenomes. Nat. Methods. 2010;7:943–944. doi: 10.1038/nmeth1210-943. [DOI] [PubMed] [Google Scholar]
  82. Yuan Y., Gao M. Jumbo bacteriophages: an overview. Front. Microbiol. 2017;8:1–9. doi: 10.3389/fmicb.2017.00403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Zdorovenko E.L., Vinogradov E., Wydra K., Lindner B., Knirel Y.A. Structure of the oligosaccharide chain of the SR-type lipopolysaccharide of Ralstonia solanacearum toudk-2. Biomacromolecules. 2008;9:2215–2220. doi: 10.1021/bm800326u. [DOI] [PubMed] [Google Scholar]
  84. Zhang M., Yang L., Ren J., Ahlgren N.A., Fuhrman J.A., Sun F. Prediction of virus-host infectious association by supervised learning methods. BMC Bioinformatics. 2017;18:60. doi: 10.1186/s12859-017-1473-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Transparent Methods and Figures S1–S4
mmc1.pdf (913KB, pdf)
Table S1. Read Depth Data of All Cross-Assemble Contigs, Related to Figure 1
mmc2.xlsx (1.8MB, xlsx)
Table S2. CAT Classifications of ORFs in Selected Viral Populations, Related to Figures 3 and 4
mmc3.xlsx (59.2KB, xlsx)
Table S3. BLASTp Hits to characterized Phages Among Selected Contigs, Figure 3 and 4
mmc4.xlsx (66.8KB, xlsx)
Table S4. Annotation of Viral Population 292, Related to Figure 3
mmc5.xlsx (12.8KB, xlsx)

Data Availability Statement

The accession number for the sequencing data project reported in this paper is ENA: PRJEB37817. The accession number of sequencing reads reported in this paper are ENA: ERS4427880–ERS4427890, whereas the accession number of cross-assembled contigs reported in this paper is ENA: ERZ1305919.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES