Skip to main content
mSystems logoLink to mSystems
. 2022 Sep 19;7(5):e00651-22. doi: 10.1128/msystems.00651-22

A Metagenomic Investigation of Spatial and Temporal Changes in Sewage Microbiomes across a University Campus

Noah Fierer a,b,, Hannah Holland-Moritz c, Alexandra Alexiev a, Harpreet Batther d, Nicholas B Dragone a,b, Liam Friar d, Matthew J Gebert a,b, Sarah Gering a,b, Jessica B Henley b, Sierra Jech a, Emily M Kibby e, Tina Melie a, William B Patterson f, Eric Peterson g, Kyle Schutz a, Elías Stallard-Olivera a,b, John Sterrett f, Corinne Walsh a,b, Cresten Mansfeldt g
Editor: Rachel Poretskyh
PMCID: PMC9599454  PMID: 36121163

ABSTRACT

Wastewater microbial communities are not static and can vary significantly across time and space, but this variation and the factors driving the observed spatiotemporal variation often remain undetermined. We used a shotgun metagenomic approach to investigate changes in wastewater microbial communities across 17 locations in a sewer network, with samples collected from each location over a 3-week period. Fecal material-derived bacteria constituted a relatively small fraction of the taxa found in the collected samples, highlighting the importance of environmental sources to the sewage microbiome. The prokaryotic communities were highly variable in composition depending on the location within the sampling network, and this spatial variation was most strongly associated with location-specific differences in sewage pH. However, we also observed substantial temporal variation in the composition of the prokaryotic communities at individual locations. This temporal variation was asynchronous across sampling locations, emphasizing the importance of independently considering both spatial and temporal variation when assessing the wastewater microbiome. The spatiotemporal patterns in viral community composition closely tracked those of the prokaryotic communities, allowing us to putatively identify the bacterial hosts of some of the dominant viruses in these systems. Finally, we found that antibiotic resistance gene profiles also exhibit a high degree of spatiotemporal variability, with most of these genes unlikely to be derived from fecal bacteria. Together, these results emphasize the dynamic nature of the wastewater microbiome, the challenges associated with studying these systems, and the utility of metagenomic approaches for building a multifaceted understanding of these microbial communities and their functional attributes.

IMPORTANCE Sewage systems harbor extensive microbial diversity, including microbes derived from both human and environmental sources. Studies of the sewage microbiome are useful for monitoring public health and the health of our infrastructure, but the sewage microbiome can be highly variable in ways that are often unresolved. We sequenced DNA recovered from wastewater samples collected over a 3-week period at 17 locations in a single sewer system to determine how these communities vary across time and space. Most of the wastewater bacteria, and the antibiotic resistance genes they harbor, were not derived from human feces, but human usage patterns did impact how the amounts and types of bacteria and bacterial genes we found in these systems varied over time. Likewise, the wastewater communities, including both bacteria and their viruses, varied depending on location within the sewage network, highlighting the challenges and opportunities in efforts to monitor and understand the sewage microbiome.

KEYWORDS: antibiotic resistance genes, bacteriophage, metagenomics, sewage, sewer systems, wastewater

INTRODUCTION

While often hidden from view and rarely attracting our attention (unless they malfunction), municipal sewage systems are important microbial ecosystems. Domestic wastewater harbors large numbers of microorganisms, including bacteria, archaea, protists, and viruses. The microbial taxa found in these wastewaters can be derived from human sources (including fecal, skin, and oral microbes deposited into the system) and external environmental sources (including soil, groundwater, or sediment microbes introduced during passage through the system and microbes introduced directly from tap water) (1). In addition to these more transient inputs, many of the microbes found in wastewater streams are more permanent residents of sewage systems found in the sewer sediments and biofilms (1).

Research into the microbiology of sewage systems can provide valuable insights relevant to a range of basic and applied scientific disciplines, including epidemiology, microbial ecology, environmental engineering, and human microbiome research. For example, previous work has demonstrated that studies of microbes found in sewage can be used to track pathogen prevalence in a population (2), understand pipe corrosion (35), track the presence and dissemination of antibiotic resistance genes (68), characterize fecal microbiomes across human populations (9), and contribute to an understanding of the biogeochemical processes occurring in wastewater (10, 11).

Sewer systems harbor dynamic microbial communities; there is no single type of wastewater microbial community. Rather, like other microbial ecosystems, the microbial communities found in sewage systems can be highly variable across time and space. This is true regardless of the specific aspect of wastewater microbial communities studied. The composition of bacterial, archaeal, viral, and microeukaryotic communities in wastewater communities can exhibit pronounced variation at time scales ranging from hours to months (1214) with appreciable spatial variation within a given sewage network (13) or across different geographic regions (9, 15). The same goes for particular genes or gene categories of interest (including antibiotic resistance genes) which can also vary with respect to their diversity and abundances depending on the sampling location and the timing of sample collection (8, 16, 17).

From those studies that have comprehensively assessed spatial and temporal variation in the wastewater microbiome, we know that a number of factors can influence the amounts and types of microbes found in wastewater. These factors can include temperature, position along a wastewater system, the human populations contributing to a given wastewater system, system materials, or other characteristics (13). This potential for high spatial and temporal variation underlies much of the promise of using wastewater for the epidemiological surveillance of human pathogens (2, 18, 19). To use a recent example, there are many ongoing and published studies focused on detecting SARS-CoV-2 in wastewater samples to detect changes in the prevalence of COVID-19 infections over time or across locations (20, 21). This potential for a high degree of spatial and temporal variation in microbial distributions is not restricted to specific pathogens but can also be observed when looking more broadly at the taxonomic structure of sewage-associated communities (9, 12, 13) and their genomic attributes, as evidenced from recent work using shotgun metagenomic approaches to study wastewater microbial communities (15, 16, 22). Quantifying the spatial and temporal variation in wastewater microbial communities is critical for understanding the factors that influence these microbial communities and their functional capabilities, identifying the likely sources of particular microorganisms (or microbial genes) relevant to public health, monitoring the impacts of changes in the design and operation of sewage networks, and understanding the potential for specific biogeochemical processes to occur in sewage systems, including those associated with nutrient cycling and biocorrosion (1). The value of documenting spatial and temporal variation is not restricted to the study of wastewater microbial communities, but their dynamic nature, their direct and indirect associations with human populations served by sewer systems, and their potential relevance to public health make studies of the “sewer microbiome” broadly relevant.

Despite the large body of previous work on the microbiology of sewage systems, what often remains undetermined is how spatial and temporal variation in the structure of microbial communities and their genomic attributes compares at finer levels of spatiotemporal resolution, i.e., the variation across multiple locations within a single sewage network over days to weeks. Our study was designed to address this knowledge gap. We collected 188 wastewater samples from 17 locations on the University of Colorado campus, with each location representing wastewater outflow from a single building, or cluster of buildings, within the same sewer network (Fig. 1A). Samples were collected from each of the locations up to four times per week for 3 weeks. This sampling intensity allowed us to identify spatial patterns across a sewage network, assess the degree to which the wastewater communities exhibit shared temporal dynamics across the network, and determine whether these spatiotemporal patterns are predictably related to differences in building occupancy, wastewater chemistry, sewer construction, and weather events. Each sample was analyzed using a shotgun metagenomic approach, with the resulting data used to explore spatiotemporal patterns in bacterial and archaeal community composition and their functional genes, with a focus on antibiotic resistance genes given their relevance to public health (23, 24). We also used these data to assess spatiotemporal patterns in viral (phage) communities as they have potentially important, but understudied, contributions to the biology of these systems (15), testing the hypothesis that changes in viral communities would mirror the associated changes in host microbial communities, as has been observed in other systems (2527). As one of the more comprehensive studies of the wastewater microbiome within a single system, this work highlights the utility of understanding spatiotemporal dynamics in these microbial communities.

FIG 1.

FIG 1

(A) Map of the 17 locations across the University of Colorado campus from which sewage samples were collected for this study with each sampling location labeled following t naming convention used previously (50). (B) Summary presentation of the identified clusters of community types across the 17 sampling locations based on the overall degree of dissimilarity in community composition, with results shown using a hierarchical clustering diagram (Ward method), as calculated from average Bray-Curtis dissimilarity scores per location. (C) Average relative abundances (proportions) of the dominant bacterial families across each of the 17 locations. (D) Variation in the three measured variables significantly associated with spatial differences in community composition, across each location and across the five identified clusters. Note that panels B to C do not highlight the temporal variation observed within individual locations since these are just based on average dissimilarities in community composition (panel B) and mean relative abundances within a given sampling location (panel C).

RESULTS AND DISCUSSION

General description of the prokaryotic communities.

Across the entire data set, we identified 1,087 unique prokaryotic taxa using phyloFlash (28). Since eukaryotic taxa represented <0.3% of extracted rRNA gene reads, we focus on the prokaryotic communities here. On average, we identified 357 unique prokaryotic taxa per sample (range, 182 to 530), but nearly all of these were bacterial. We extracted rRNA genes from seven archaeal taxa (all members of the Methanobacteriales, Methanomicrobiales, and Methanosarcinales groups), and these archaeal taxa represented only 0.02% of the rRNA gene reads (range, 0 to 0.4% per sample). The bacterial communities were dominated by members of the following phyla: Proteobacteria, Firmicutes, Bacteroidota, and Campylobacterota. At the family level of resolution, the most abundant bacterial families are highlighted in Fig. 1C. We note that the overall taxonomic composition of the microbial communities is similar to that observed in other wastewater surveys (1), but the composition of the communities varied appreciably across the sample set. For example, the top three most abundant families across the data set Comamonadaceae, Enterobacteriaceae, and Aeromonadaceae ranged in relative abundances from 5 to 51%, 0.3 to 39%, and 0.07 to 35% per sample, respectively. This high degree of variance in the taxonomic composition of the prokaryotic communities is explored in more detail below. Notably, the viral communities were also highly variable in composition across this sample set with these patterns explored in more detail below.

A relatively small fraction of the prokaryotes identified in these wastewater samples appear to be derived from the human microbiome. This is qualitatively evident by examining the abundances of the bacterial families commonly found in human feces (Bacteroidaceae, Ruminococcaceae, Lachnospiraceae, Porphyromonadaceae, Rikenellaceae, and Prevotellaceae [9]), which collectively account for only 13% of the bacterial and archaeal rRNA gene reads recovered from all sewage samples. To investigate these patterns in more detail, we used the indicator taxon approach described by Barberán et al. (29), which identified particular taxa that are consistently found in human feces and either absent, or present in low abundances, in other sample types, including soil and aquatic environments. The summed abundances of these human microbiome “indicator” taxa represent a relatively small percentage of all 16S rRNA gene reads recovered from the wastewater metagenomes (mean, 8.6% of total reads; range, 0.2 to 35% reads per sample). Thus, whereas bacteria derived from human feces are present in the sewage microbiome, the human microbiome is a reasonably small contributor to the wastewater microbiome, confirming results reported previously (9, 12, 13, 30). Together, these results suggest that a large fraction of the bacteria found in the wastewater samples are more permanent residents of the sewage system and likely derived from biofilms, sediments, or other locations within the sewer system itself. The wastewater microbiome is a unique microbial habitat that harbors distinct microbial communities; it is not exclusively a system transporting transient microbes added into the system from human feces.

We calculated Bray-Curtis dissimilarities in overall functional gene profiles across the sample set from the SqueezeMeta output (31), using normalized gene abundances across 168 unique pathways, and found that overall functional gene composition was well-correlated with the corresponding Bray-Curtis dissimilarities in taxonomic composition (Mantel r = 0.82, P < 0.001). Differences in taxonomic composition were closely associated with overall differences in functional gene composition. For this reason, we assume that the observed spatiotemporal patterns in taxonomic composition (described below), mirror observed differences in community-level functional gene composition. However, this is not necessarily true for individual genes or gene categories which could exhibit unique distribution patterns. While there are many individual genes or gene categories worthy of detailed exploration, we focused on one category of genes that are particularly relevant to public health, antibiotic resistance genes, with the spatiotemporal patterns in antibiotic resistance gene profiles described below.

Spatial patterns in the taxonomic composition of the prokaryotic communities.

The 188 sewage samples included in this study were collected from 17 different locations in the same sewage network across an ~1 km2 area (Fig. 1A). Sampling location was a significant predictor of differences in sewage prokaryotic community composition (permutational analysis of variance [PERMANOVA] R2 = 0.38, P < 0.001). In particular, we noted 5 general clusters of community types (Fig. 1B) and the taxa driving differences across the 5 clusters are detailed in Fig. S1 in the supplemental material. These location-specific differences in taxonomic composition do not appear to be strongly related to sewer material, sewer depth, or the resident human population served by each sewer since PERMANOVA R2 values were <0.04 for all of these variables. Rather, the spatial variation was most strongly associated with differences in sample pH (Mantel r = 0.23, P < 0.001) with total suspended solids (TSS) concentrations and sample volume of lesser importance (Mantel r = 0.13 and 0.11, respectively, P = 0.004 for both variables). The importance of sample pH in structuring the sewage microbial communities is similar to other aquatic and terrestrial environments, where pH has also been demonstrated to be closely associated with shifts in bacterial community composition (3234). There are clearly other location-specific variables, including unmeasured variables like temperature (13) or specific organic carbon or nutrient concentrations, that could be contributing to the observed spatial patterns. However, it is important to note that the sewage communities found at individual locations are far from static and exhibit pronounced temporal variation, as discussed in more detail below.

FIG S1

Variation in the 20 dominant bacteria families, across each location and across the five identified clusters. All 20 families are significantly different in relative abundance across clusters (P < 0.01 in all cases, except for Moraxellaceae with P < 0.05). Download FIG S1, PDF file, 0.7 MB (744KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

We next identified which particular taxa were differentially abundant across the location-specific clusters of community types (Fig. 1B). To do so, we used an indicator taxon approach, focusing on the more abundant taxa that were over-represented in clusters 2 to 5 compared to cluster 1 (which included 11 of 17 sampling locations). These results are presented in Fig. S1. Of note, we observed that samples from the location assigned to cluster 2 had higher relative abundances of Pseudomonadaceae than samples from other locations and samples from the three locations assigned to cluster 3 had higher relative abundances of Arcobacteraceae and Selenemonadaceae. Likewise, samples from the cluster 4 location had higher abundances of Weeksellaceae with samples from the cluster 5 location having higher abundances of Lachnospiraceae and Bacteroidaceae than other locations. Interpreting these observed patterns is difficult since these are relatively broad taxonomic groups with diverse physiologies. However, these patterns do suggest that there are location-specific signatures in the sewage bacterial communities, signatures that are most likely a product of the unique biogeochemical conditions found at these sampling locations.

Temporal patterns in the taxonomic composition of the prokaryotic communities.

The communities found in sewage samples collected from a given location were highly variable in taxonomic composition over the 3-week sampling period. This is evident from Fig. 2, left which highlights that the average Bray-Curtis dissimilarity in taxonomic composition within a given sampling location over time (mean = 0.39) was not appreciably lower than the average Bray-Curtis dissimilarity in taxonomic composition between sampling locations (mean = 0.47). In other words, despite observing significant differences in community composition across sampling locations (as detailed above), the variation observed within a given location over time was similar in magnitude to the differences between sites, with some sampling locations (e.g., sites R and O) exhibiting higher variance in taxonomic community composition than others (e.g., sites F and C) (Fig. 2, right). Perhaps more importantly, the temporal patterns observed in the taxonomic composition of the wastewater communities were not consistent across sites. Sampling dates when we observed pronounced changes in the composition of the wastewater communities (compared to the average composition at a given location) were rarely shared across sites (see Fig. S2). We next sought to determine which variables (days since first collection, daily high/low temperatures, the volume of sewage collected, sewage pH, daily precipitation amounts, or total suspended solid concentrations) best explained the temporal variation observed at individual sampling locations. Using Mantel tests, “days since” was significant at 9 of the 15 locations and TSS was significant at 6 of the 15 locations (R > 0.25 and P < 0.05) (see Table S1). No other variable was significant at more than two individual sites. Thus, Mantel tests indicate that “days since” and TSS are the factors most strongly correlated with temporal variation in community composition. These results are consistent with the results from the MRM models which were variable in their overall explanatory power (R2 values from 0.09 to 0.74) with the four most significant scores (R2 ≥ 0.25, P < 0.01) each indicating one or both of “days since” and TSS in the top two variables (see Table S1). Although temperature has been noted previously as being important in structuring wastewater communities (13), changes in daily high and low temperatures were not strongly associated with observed changes in community composition across any of the sites. We note, however, that surface air temperature may not reflect the temperature of the wastewater at these upstream locations.

FIG 2.

FIG 2

Temporal variation in overall community composition within individual sampling locations. We summarized pairwise Bray-Curtis distances in the taxonomic composition of the prokaryotic communities within sites (temporal variation) versus between sites (spatial variation). Also shown are the pairwise Bray-Curtis distances in community composition across sampling dates within each individual sampling location. Note that some of the sampling locations (e.g., sites O and R) show higher temporal variance in taxonomic community composition than others (e.g., sites C and F).

FIG S2

Heatmap showing the Bray-Curtis distance from mean community composition per location and per sampling date, with warmer colors indicating higher dissimilarity in overall community composition. This figure highlights the asynchrony in the temporal patterns in community composition across all sampling locations. Cells that are gray indicate locations and days when samples were not collected. Download FIG S2, PDF file, 0.03 MB (35KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S1

Mantel test and multiple regression on distance matrices (MRM) results for environmental variables associated with temporal patterns in prokaryotic community composition. Download Table S1, PDF file, 0.04 MB (43KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

The associations between temporal changes in community composition and both days from the start of the study and total suspended solid concentrations are most likely driven by differences in usage patterns, most notably temporal changes in human fecal inputs into the sewage system. This is evident from an increase in the relative abundance of human fecal indicator bacteria in week 2 compared to weeks 1 and 3, a pattern that was consistent across most sites (see Fig. S3). We hypothesize that this week 2 increase in human fecal indicator bacteria is a product of the large early-season snow event that occurred 8 and 9 September 2020, which may have increased the ratio of blackwater to graywater by altering residential bathing habits through a notable temperature decrease from 33 to 0°C within 18 h. Together, these results highlight that there is pronounced temporal variation observed at individual sampling locations, and this variation is most likely associated with changes in fecal inputs to the system (although fecal bacteria represent a relatively small proportion of taxa in these communities [see discussion above]), reflecting the variation noted in other studies on short temporal timescales (12). We acknowledge that most of the observed temporal variation at the individual sampling locations remains statistically unexplained, highlighting the importance of collecting additional information on wastewater systems and their usage. There are likely other unmeasured factors, including sewer temperatures, sewage inputs, or specific usage patterns, that could be used to predict when and how wastewater communities will change over time (13). More generally, these results emphasize the importance of considering temporal variation when assessing spatial differences in wastewater communities, even in a given sewer network. A single time point sampling will give a limited, and perhaps even misleading, perspective on the composition of wastewater prokaryotic communities.

FIG S3

Heatmap depicting the mean proportional abundances of fecal indicator taxa each week separated by sampling site. Darker colors indicate higher proportional abundances of fecal indicator taxa, while lighter colors represent a lower proportional abundance. The week 2 samples had higher proportional abundances of fecal indicator taxa across 13 of the 15 sites, a pattern we suggest is associated with increases in sewage inputs during the snowstorm on 8 to 9 September 2020. Download FIG S3, PDF file, 0.1 MB (53KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

An additional factor that may have contributed to the observed temporal variability is the fact the communities may not have reached a steady-state for this university sewer network. Samples were collected within 1 to 2 weeks after students returned to on-campus residences, following a nearly stagnant and unused sewer system from March to August 2020. This flushing of stagnant water over time may explain why “days since the start of sampling” was often a factor associated with the observed temporal variation in bacterial community composition (see Table S1), a result consistent with our hypothesis that flushing of stagnant water, lack of residual disinfectant, and idle plumbing can contribute to changes in wastewater microbiome composition (35). More specifically, we expected that flushing of stagnant water could be associated with increases in the abundances of premise-plumbing associated pathogens such as Legionella pneumophila and Acinetobacter baumannii (3638). For the first 5 days of the sampling period, when stagnant water was likely being flushed through the system, we found that there was an initial spike followed by a consistent decrease in the relative abundances of Legionella at 10 of the 15 sampling locations and of Acinetobacter at 7 of 15 sample locations (see Fig. S4). However, additional work is needed to investigate these dynamics in more detail and to determine the public health relevance of these or other potential pathogens accumulating in plumbing or sewer systems upon prolonged stagnation.

FIG S4

Line plot highlighting changes in the relative abundances of Legionella and Acinetobacter across time. Error bars represent 95% confidence intervals around the mean, as calculated from all sampling locations. The initially high relative abundances, followed by a decrease through day 5, highlight the potential influence of flushing of stagnant water after campus activities resumed and vacant buildings were reoccupied at the start of this study. Download FIG S4, PDF file, 0.03 MB (27.7KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Spatiotemporal patterns in the abundances of antibiotic resistance genes.

Given the public health importance of documenting antibiotic resistance genes (ARGs) in wastewater and other built environments (8, 39, 40), we focused our functional gene analyses of the metagenomic data on those genes presumed to be associated with antibiotic resistance. However, we note that we are likely missing other potentially important ARGs and not all genes annotated as ARGs necessarily confer antibiotic resistance (41). The number of metagenomic sequences classified as ARGs was low, with ARGs representing an average of 0.003% of reads per sample (range, 0.0003 to 0.010%) (Fig. 3A). This low fraction of ARGs detected is similar to what has been observed in comparable metagenomic data sets (16, 42, 43). Even with these low abundances, we detected a broad diversity of ARGs across the sample set, including ARGs associated with resistance to 27 different drug classes, but the proportional abundances of ARG types were reasonably consistent across the locations sampled (Fig. 3B). The most abundant ARGs detected are those associated with resistance to tetracycline, macrolide, and beta-lactam drug classes (Fig. 3B). These dominant classes of ARGs were also recovered in metagenomic analyses of similar sample types (16), but we note that this may be, in part, a product of the reference database being biased toward ARGs that are well characterized.

FIG 3.

FIG 3

(A) Variation in the proportion of reads assigned to antibiotic resistance genes (ARGs) normalized to total read count at each sampling location. We observed a significant effect of sampling location on ARG abundances (one-way ANOVA, F value = 8.06, P < 0.001). (B) Proportional abundances of ARGs assigned to different drug classes, highlighting the consistency in general ARG types across locations.

We did observe pronounced variation in the relative abundance of ARG genes across sampling locations and over time within individual locations (Fig. 3A). Given that ARGs are often considered to be particularly abundant in human fecal material-derived bacteria, we hypothesized that those samples with higher abundances of fecal bacteria would have higher relative abundances of ARGs. This was not the case, as the correlation between ARGs and fecal indicator abundances across the whole sample set was relatively weak (linear regression, R2 = 0.14, P < 0.001). While ARGs are clearly present in wastewater samples, these ARGs are not necessarily indicative of fecal contamination, and the ARGs are often likely to be associated with other bacteria residing in the sewer system. Not all ARGs are necessarily derived from anthropogenic inputs and presumably ARGs would be present in this system even in the absence of appreciable fecal inputs (as evidenced from the abundance of ARGs detected at the “Admin” site which served a building with few if any occupants during the sampling period). More work is needed to identify which bacteria in sewage systems harbor particular ARGs, where these bacteria reside within sewer systems, and the public health relevance of these sewer-associated ARGs.

Spatiotemporal patterns in viral communities.

In addition to investigating the composition of the prokaryotic communities and how they vary across sampling locations or within individual locations over the 3-week period, we also characterized the corresponding spatiotemporal variation in the viral communities. This study was motivated by the likely importance of bacteriophage to the structure of sewage and other microbiomes (15, 44). Moreover, with the growing interest in documenting viral community dynamics in built environments (45), it is important to assess how spatiotemporal variation in viral community structure compares to the corresponding variation in prokaryotic communities which are far more commonly studied. Viral communities were characterized using FastVirome Explorer with the IMG/VR and NCBI RefSeq databases (see Materials and Methods). We start by noting three important caveats associated with the viral analyses. First, as our analyses were restricted to double-stranded DNA (dsDNA), we did not capture RNA or single-stranded DNA (ssDNA) viruses. Second, we are likely identifying only the more abundant viruses in these samples since a deeper sequencing depth would be required to identify relatively rare viral taxa in these samples. Third, since we used a reference viral genome database for classifying viral reads, our analyses are restricted only to viruses included in the reference database, and we would not capture novel viral diversity, which is likely high in these types of environments (46). Despite these caveats, we were able to document pronounced spatiotemporal variation in the viral communities.

The dominant viral taxa identified when using the IMG/VR database were within the order Caudovirales (tailed dsDNA bacteriophages), representing 99.40% of the viral reads that could be classified. While 68.63% of Caudovirales reads were not resolved to family level, three families within this order—Myoviridae, Podoviridae, and Siphoviridae—represented 12.00, 10.86, and 7.49% of total viral reads, respectively. These viral taxa have also been found to be dominant in another metagenomic-based sewage survey (15). Using results from analyses conducted with the NCBI RefSeq database, we recovered sequences assigned to the crAssphage (cross-assembly phage) group (69% of viral reads on average; see Fig. S5A), which is notable given that this viral group is often considered an indicator of fecal contamination (47). Importantly, we observed variation in the proportional abundances of the dominant viral taxa recovered across the sampling locations as annotated with both the NCBI RefSeq database (see Fig. S5A) and the IMG/VR database (see Fig. S5B).

FIG S5

Heatmap of the dominant viral taxa across sampling locations. Viral taxa with relative abundances of >0.5% are included in the heatmap. Download FIG S5, PDF file, 0.2 MB (172.7KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

The observed variation in the composition of the viral communities was reasonably well-correlated with the variation in prokaryotic community composition. This was evident when we compared viral to prokaryotic community composition across all samples (Mantel r = 0.67, P = 0.001) and within individual locations over time (Mantel r = 0.42 to 0.87, P < 0.001 in all cases). This correspondence between viral and prokaryotic community composition is not surprising given the host specificity of many bacteriophage (48, 49) with similarly close associations between sewage phage and prokaryotic community composition having been observed previously (15). Moreover, the close correspondence between viral and prokaryotic community composition indicates that the spatiotemporal patterns in prokaryotic communities (as described above) apply to both components of the sewage microbiome with viral communities also exhibiting pronounced variation in composition across sampling locations and over time within individual locations.

To investigate the associations between viral and prokaryotic communities in more detail, we used network analyses based on cooccurrence patterns to identify viruses (including those with both known and undetermined bacterial hosts) that consistently cooccur with particular prokaryotic taxa. Although cooccurrence patterns provide only putative associations between particular viruses and their potential bacterial hosts, we were able to accurately reconstruct known phage-host relationships (Fig. 4). For example, we identified a number of phage taxa known to infect bacteria within the Enterobacteriaceae and Bacteroidaceae families (Fig. 4) that also cooccurred with these bacterial families across the sample set. However, there were a few viruses that cooccurred with multiple distinct bacterial taxa, a result we might expect given that some phage can infect a broad range of bacterial hosts (48, 49). Likewise, some of the viruses with undetermined host specificity cooccurred with specific bacterial taxa (Fig. 4), highlighting that these cooccurrence analyses can be used to generate hypotheses about the likely host preferences of poorly characterized viral taxa.

FIG 4.

FIG 4

Bipartite cooccurrence network of top viral and bacterial taxa detected across all sewage samples (only viral and bacterial taxa correlated with a Pearson r value > 0.75 are included here). Square nodes are bacterial taxa colored by bacterial family. Circular nodes are viral (phage) nodes colored by putative bacterial host as assigned in the IMG database, illustrating the accuracy of phage host assignments.

Conclusions.

By conducting shotgun metagenomic analyses on 188 wastewater samples collected over a 3-week period from 17 locations within a single sewage network on a university campus, we were able to document the spatial and temporal patterns in prokaryotic and viral communities at reasonably high spatiotemporal resolution. As overall differences in functional gene profiles closely mirrored differences in the taxonomic composition of the prokaryotic communities (see “General Description of the Prokaryotic Communities” above), we focused our functional gene analyses on genes putatively coding for antibiotic resistance functions given their potential importance. Together, the results from these analyses lead us to several broader conclusions. First, sewage microbial communities are not dominated by taxa derived from human fecal inputs. Rather, many of the taxa found in sewage are likely derived from sewer biofilms or accumulated sediments within the sewer system, highlighting the importance of understanding how these resident microbial communities develop over time and across different locations within an individual sewer system. Second, we observed pronounced spatial variation, even across locations within the same network in close proximity, but the spatial variation was often similar in magnitude to the temporal variation observed at a given sampling location, a pattern that was observed for prokaryotic and viral community composition as well as ARG profiles. Just as importantly, the temporal patterns in the sewage communities were often inconsistent across sampling locations with different patterns (and different abiotic variables associated with those patterns) across the sampling locations. This high degree of temporal variation, variation that is often difficult to predict a priori, is relevant to future work since it highlights the importance of collecting additional information to characterize the sewage environment across space and time, including specific usage patterns in buildings served by a given sewer system (e.g., frequency of toilet or appliance usage), sewage nutrient or xenobiotic concentrations, temperature, and other variables that were not directly measured for this study. Presumably, additional information describing conditions in the sewage system, or the buildings feeding the sewage system, would provide more insight into the specific factors associated with the observed variation in sewage microbial communities. More generally, these results highlight that collecting sewage samples from a single time point, or just a few time points, may not adequately capture microbial dynamics at a given location, and studies that do not explicitly characterize potential temporal variation when assessing spatial variation may yield an incomplete perspective on the sewage microbiome.

MATERIALS AND METHODS

Sample collection.

Wastewater samples were collected from 17 locations across the University of Colorado campus in Boulder, CO (40.00°N, 105.26°W). All locations are within the same sewer network with 14 of the 17 sampling locations representing sewage outflow from individual buildings with the remaining locations representing flow from multiple buildings, meaning they were either downstream of several buildings in the sewer network, or regularly received backflow from several buildings (Fig. 1A). Sampling was conducted from surface-accessible manholes with wastewater collected at each location using composite autosamplers (50). Sampling locations are labeled followed the naming convention used previously (50). All samples included in this study were collected over a 3-week period (1 to 20 September 2020) and up to four samples were collected per week (Tuesday, Thursday, Saturday, and Sunday) for a total of 6 to 12 sampling dates per location (13 of the 17 locations had at least 11 sampling time points). Each sample represented a 24-h flow-proportional composite sample (50). In total, we collected 188 sewage samples, with each sample representing an integrated, continuous collection over a 24-h period withdrawn using peristaltic pumps. The total volume of sewage collected from each location on each sampling date (mean, 8 L; range, 0.3 to 16.2 L) represented a proxy for the total volume of sewage moving through each location over each 24-h period, withdrawing lower volumes when the sewer pipe was dry or sampling inlet mispositioned. All samples were stored on ice during the 24 h withdrawal and immediately upon collection. After transportation to the laboratory, 2-mL aliquots were withdrawn and centrifuged at 14,000 × g for 10 min at 4°C, and the decanted pellets with were frozen at −80°C within 4 h of collection for subsequent DNA analyses.

In addition to the shotgun metagenomic analyses described below, we also measured the pH of each sample (mean pH, 8.4; range, 7.0 to 9.6) and the amount of total suspended solids (mean, 80 mg L−1; range, 10 to 300 mg L−1) using standard methods (50). Additional data collected for each sampling location included: sewer material (brick versus concrete), sewer depth (the depth below the ground surface from which samples were collected [mean, 3.0 m; range, 1.5 to 5.5 m]), and the estimated population of full-time residents served by that particular sewer line (mean, 544 individuals; range, 0 to 2,536 individuals; with only one location, a classroom/office building, having no full-time residents). For the temporal analyses, we also included information on daily high/low air temperatures and precipitation amounts with these data compiled from the National Oceanic and Atmospheric Administration Physical Sciences Laboratory.

Shotgun metagenomic sequencing.

DNA was extracted from homogenized 2-mL aliquots of each of the 188 samples with two DNA extraction “blanks” and two no-template control samples processed alongside the samples to check for potential contamination. DNA was extracted using a DNeasy PowerSoil HTP 96 kit (Qiagen, Germantown, MD) according to the manufacturer’s instructions, and libraries for shotgun metagenomic sequencing were prepared using Illumina DNA Prep kits and IDT for Illumina DNA Unique Dual Index Sets (Illumina, San Diego, CA). Briefly, DNA was fragmented and adapter tag sequences were added using bead-linked transposomes, followed by a PCR step to add adapter sequences. The resulting libraries were then cleaned by bead purification, quantified, and pooled in equimolar concentrations. Pooled libraries (which included 188 sewage samples and the 4 “blank” samples) were sequenced on duplicate Illumina NovaSeq lanes running the 2 × 150-bp paired-end sequencing chemistry. This sequencing effort yielded 3.4 to 24 million paired-end reads per sewage sample (mean, 10 million) with the four “blank” samples yielding 0 to 1,700 reads, indicating nonexistent to minimal contamination.

Bioinformatics.

Read quality was assessed using FastQC (51) and adapters were removed using Cutadapt version 2.4 (52). We used Sickle (53) to trim bases of insufficient quality (q-score >20) and reads of insufficient length (<50 bp). After these quality control and filtering steps, a total of 983 million paired-end reads remained across all 188 sewage samples, with a mean of 5.23 million reads per sample (range, 1.66 to 10.64 million).

To assess the taxonomic composition of the communities, we used phyloFlash (28), a tool that extracts small-subunit rRNA gene reads from the shotgun metagenomic data, with taxonomic classification of the reads determined by comparing extracted reads against those in the SILVA reference database, version 138.1 (54). As nearly all of these reads were prokaryotic (bacterial and archaeal), with eukaryotic reads representing only 0.26% of all rRNA gene reads across the data set, we removed reads from the resulting taxon table that were classified as chloroplasts, mitochondria, or eukaryotes (this process removed only 18,475 of the 2,156,483 rRNA gene reads across the entire data set). We also removed those taxa from the table that were represented by fewer than 10 reads across the entire data set to be conservative in our assessment of prokaryotic diversity (removing an additional 4,469 reads in total). The resulting taxon table yielded a mean of 11,370 prokaryotic rRNA gene reads per sample (range, 1,702 to 22,056). To provide a broad assessment of how functional gene profiles may relate to the taxonomic composition of the communities, we annotated functional gene reads with the SqueezeMeta pipeline’s “read-only” analysis script, sqm_reads.pl (31), to obtain KEGG annotations of genes. To determine the presence of unique genes, we then used the KEGG ontology (KO) with gene abundances normalized against the abundances of single-copy genes with MUSiCC (55). We identified >17,000 unique genes (KOs) across the entire data set. These KOs collectively represented 168 unique gene pathways with Bray-Curtis dissimilarities calculated from the assigned normalized abundances of genes per sample.

We quantified the classes and relative abundances of antibiotic resistance genes in each sample by processing trimmed and filtered metagenomic reads through the Resistance Gene Identifier pipeline (56) using default parameters for shotgun metagenomic data and the Comprehensive Antibiotic Resistance Database (CARD; version 3.0.1) as the reference database. After assigning ARGs, we removed “rare” genes (those that occurred in <10 of the 188 samples) and those deemed to be of insufficient quality (<40 MAPQ score).

For the viral community analyses, we used FastViromeExplorer (57) with quality-filtered reads against the IMG/VR database (58) and, separately, the NCBI RefSeq database to detect and identify viruses (dsDNA phages) present in the sewage metagenome samples. We used these two different databases to characterize as many viral reads as possible due to database differences in annotation and taxonomic resolution. Specifically, 1,921,070 of our reads matched against the NCBI database, while 8,961,471 of our reads matched against the IMG/VR database. However, the taxonomic resolution and species identification varied between the databases. For all analyses, with the exception of the calculations of crAssphage abundances, we used the IMG/VR annotation due to the larger amount of reads characterized. As the number of recovered viral reads per sample was correlated with overall per-sample sequencing depth (r2 = 0.72, P < 0.001), we rarefied to 2206 viral reads per sample and removed viral taxa represented by <10 reads across the entire data set prior to downstream analyses. As explained in more detail above (see Results and Discussion), we note that this pipeline would be expected to underestimate the total amount of viral diversity that could be found in these samples. A bipartite cooccurrence network of the more abundant viral and bacterial taxa was constructed in R using the iGraph package (“graph_from_incidence_matrix”) (59). Input for the bipartite network was a correlation table of viral-bacterial abundances across samples (filtered to include only correlations with Pearson r values > 0.75).

Data visualization and statistical analyses.

All downstream analyses were conducted in R (60), unless otherwise noted. For the spatial analyses, we calculated average pairwise Bray-Curtis dissimilarity levels across the 17 sampling locations, with differences across sample categories determined using PERMANOVA as implemented in the R package vegan 2.5.7 (61). Location-specific differences in community composition were visualized via hierarchical clustering (Ward method, R package Stats 4.0.5). To determine what measured sewage variables (including sampling date, weekend versus weekday, location, pH, sewer depth, total suspended solids, sewer age, sewer material, resident population, sample volume, geographic location, and total read counts) were associated with the observed variation in the taxonomic composition of bacterial communities, we used Mantel tests (R package vegan 2.5.7) with Euclidean distances in the measured continuous variables and Bray-Curtis distances in the taxonomic composition of the microbial communities. Analysis of volatility in taxonomic composition was performed in Python 3.6.9, with Bray-Curtis distances calculated via the package scikit-bio version 0.5.6. Analyses of the temporal variation in community composition were restricted to samples from 15 of the 17 sites as these 15 sites had samples from at least 10 of the 12 total sampling times over the 3-week period (2 sites—“K” and “Admin”—were represented by only 9 and 6 samples, respectively). We used both Mantel tests (R package vegan v2.5.7) and multiple regression on distance matrices (MRM, R package ecodist v2.0.7) to identify the specific variables that explain the observed temporal variation in bacterial community composition within sites.

Data availability.

Raw sequence reads for all samples and the associated sample metadata have been uploaded to the NCBI Sequence Read Archive repository under accession number PRJNA875025.

Supplementary Material

Reviewer comments
reviewer-comments.pdf (335KB, pdf)

Contributor Information

Noah Fierer, Email: noahfierer@gmail.com.

Rachel Poretsky, University of Illinois at Chicago.

REFERENCES

  • 1.McLellan SL, Roguet A. 2019. The unexpected habitat in sewer pipes for the propagation of microbial communities and their imprint on urban waters. Curr Opin Biotechnol 57:34–41. doi: 10.1016/j.copbio.2018.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Choi PM, Tscharke BJ, Donner E, O’Brien JW, Grant SC, Kaserzon SL, Mackie R, O’Malley E, Crosbie ND, Thomas KV, Mueller JF. 2018. Wastewater-based epidemiology biomarkers: past, present, and future. Trends Analyt Chem 105:453–469. doi: 10.1016/j.trac.2018.06.004. [DOI] [Google Scholar]
  • 3.Li X, Kappler U, Jiang G, Bond PL. 2017. The ecology of acidophilic microorganisms in the corroding concrete sewer environment. Front Microbiol 8:683. doi: 10.3389/fmicb.2017.00683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Cayford BI, Dennis PG, Keller J, Tyson GW, Bond PL. 2012. High-throughput amplicon sequencing reveals distinct communities within a corroding concrete sewer system. Appl Environ Microbiol 78:7160–7162. doi: 10.1128/AEM.01582-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Vollertsen J, Nielsen AH, Jensen HS, Wium-Andersen T, Hvitved-Jacobsen T. 2008. Corrosion of concrete sewers: the kinetics of hydrogen sulfide oxidation. Sci Total Environ 394:162–170. doi: 10.1016/j.scitotenv.2008.01.028. [DOI] [PubMed] [Google Scholar]
  • 6.Munir M, Wong K, Xagoraraki I. 2011. Release of antibiotic resistant bacteria and genes in the effluent and biosolids of five wastewater utilities in Michigan. Water Res 45:681–693. doi: 10.1016/j.watres.2010.08.033. [DOI] [PubMed] [Google Scholar]
  • 7.LaPara TM, Burch TR, McNamara PJ, Tan DT, Yan M, Eichmiller JJ. 2011. Tertiary-treated municipal wastewater is a significant point source of antibiotic resistance genes into Duluth-Superior Harbor. Environ Sci Technol 45:9543–9549. doi: 10.1021/es202775r. [DOI] [PubMed] [Google Scholar]
  • 8.Auguet O, Pijuan M, Borrego CM, Rodriguez-Mozaz S, Triadó-Margarit X, Giustina SVD, Gutierrez O. 2017. Sewers as potential reservoirs of antibiotic resistance. Sci Total Environ 605–606:1047–1054. doi: 10.1016/j.scitotenv.2017.06.153. [DOI] [PubMed] [Google Scholar]
  • 9.Newton RJ, McLellan SL, Dila DK, Vineis JH, Morrison HG, Eren AM, Sogin ML. 2015. Sewage reflects the microbiomes of human populations. mBio 6:e02574-14. doi: 10.1128/mBio.02574-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Shi X, Ngo HH, Sang L, Jin P, Wang XC, Wang G. 2018. Functional evaluation of pollutant transformation in sediment from combined sewer system. Environ Pollut 238:85–93. doi: 10.1016/j.envpol.2018.03.007. [DOI] [PubMed] [Google Scholar]
  • 11.Nielsen PH, Raunkjær K, Norsker NH, Jensen NA, Hvitved-Jacobsen T. 1992. Transformation of wastewater in sewer systems: a review. Water Sci Technol 25:17–31. doi: 10.2166/wst.1992.0111. [DOI] [Google Scholar]
  • 12.Guo B, Liu C, Gibson C, Frigon D. 2019. Wastewater microbial community structure and functional traits change over short timescales. Sci Total Environ 662:779–785. doi: 10.1016/j.scitotenv.2019.01.207. [DOI] [PubMed] [Google Scholar]
  • 13.LaMartina EL, Mohaimani AA, Newton RJ. 2021. Urban wastewater bacterial communities assemble into seasonal steady states. Microbiome 9:116. doi: 10.1186/s40168-021-01038-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Matus M, Duvallet C, Soule MK, Kearney SM, Endo N, Ghaeli N, Brito I, Ratti C, Kujawinski EB, Alm EJ. 2019. 24-hour multi-omics analysis of residential sewage reflects human activity and informs public health. bioRxiv. https://www.biorxiv.org/content/10.1101/728022v2.
  • 15.Strange JES, Leekitcharoenphon P, Møller FD, Aarestrup FM. 2021. Metagenomics analysis of bacteriophages and antimicrobial resistance from global urban sewage. Sci Rep 11:1600. doi: 10.1038/s41598-021-80990-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hendriksen RS, Munk P, Njage P, van Bunnik B, McNally L, Lukjancenko O, Röder T, Nieuwenhuijse D, Pedersen SK, Kjeldgaard J, Kaas RS, Clausen PTLC, Vogt JK, Leekitcharoenphon P, van de Schans MGM, Zuidema T, de Roda Husman AM, Rasmussen S, Petersen B, Amid C, Cochrane G, Sicheritz-Ponten T, Schmitt H, Alvarez JRM, Aidara-Kane A, Pamp SJ, Lund O, Hald T, Woolhouse M, Koopmans MP, Vigre H, Petersen TN, Aarestrup FM, Global Sewage Surveillance Project Consortium . 2019. Global monitoring of antimicrobial resistance based on metagenomics analyses of urban sewage. Nat Commun 10:1124. doi: 10.1038/s41467-019-08853-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fresia P, Antelo V, Salazar C, Giménez M, D’Alessandro B, Afshinnekoo E, Mason C, Gonnet GH, Iraola G. 2019. Urban metagenomics uncover antibiotic resistance reservoirs in coastal beach and sewage waters. Microbiome 7:35. doi: 10.1186/s40168-019-0648-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bibby K, Peccia J. 2013. Identification of viral pathogen diversity in sewage sludge by metagenome analysis. Environ Sci Technol 47:1945–1951. doi: 10.1021/es305181x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Wigginton KR, Ye Y, Ellenberg RM. 2015. Emerging investigators series: the source and fate of pandemic viruses in the urban water cycle. Environ Sci Water Res Technol 1:735–746. doi: 10.1039/C5EW00125K. [DOI] [Google Scholar]
  • 20.Thompson JR, Nancharaiah YV, Gu X, Lee WL, Rajal VB, Haines MB, Girones R, Ng LC, Alm EJ, Wuertz S. 2020. Making waves: wastewater surveillance of SARS-CoV-2 for population-based health management. Water Res 184:116181. doi: 10.1016/j.watres.2020.116181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bivins A, North D, Ahmad A, Ahmed W, Alm E, Been F, Bhattacharya P, Bijlsma L, Boehm AB, Brown J, Buttiglieri G, Calabro V, Carducci A, Castiglioni S, Cetecioglu Gurol Z, Chakraborty S, Costa F, Curcio S, de Los Reyes FL, III, Delgado Vela J, Farkas K, Fernandez-Casi X, Gerba C, Gerrity D, Girones R, Gonzalez R, Haramoto E, Harris A, Holden PA, Islam MT, Jones DL, et al. 2020. Wastewater-based epidemiology: global collaborative to maximize contributions in the fight against COVID-19. Environ Sci Technol 54:7754–7757. doi: 10.1021/acs.est.0c02388. [DOI] [PubMed] [Google Scholar]
  • 22.Su J-Q, An X-L, Li B, Chen Q-L, Gillings MR, Chen H, Zhang T, Zhu Y-G. 2017. Metagenomics of urban sewage identifies an extensively shared antibiotic resistome in China. Microbiome 5:84. doi: 10.1186/s40168-017-0298-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.World Health Organization. 2014. Antimicrobial resistance: global report on surveillance. World Health Organization, Geneva, Switzerland. [Google Scholar]
  • 24.Hay SI, Rao PC, Dolecek C, Day NPJ, Stergachis A, Lopez AD, Murray CJL. 2018. Measuring and mapping the global burden of antimicrobial resistance. BMC Med 16:78. doi: 10.1186/s12916-018-1073-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Coutinho FH, Silveira CB, Gregoracci GB, Thompson CC, Edwards RA, Brussaard CPD, Dutilh BE, Thompson FL. 2017. Marine viruses discovered via metagenomics shed light on viral strategies throughout the oceans. Nat Commun 8:15955. doi: 10.1038/ncomms15955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Emerson JB, Roux S, Brum JR, Bolduc B, Woodcroft BJ, Jang HB, Singleton CM, Solden LM, Naas AE, Boyd JA, Hodgkins SB, Wilson RM, Trubl G, Li C, Frolking S, Pope PB, Wrighton KC, Crill PM, Chanton JP, Saleska SR, Tyson GW, Rich VI, Sullivan MB. 2018. Host-linked soil viral ecology along a permafrost thaw gradient. Nat Microbiol 3:870–880. doi: 10.1038/s41564-018-0190-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Camarillo-Guerrero LF, Almeida A, Rangel-Pineros G, Finn RD, Lawley TD. 2021. Massive expansion of human gut bacteriophage diversity. Cell 184:1098–1109. doi: 10.1016/j.cell.2021.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gruber-Vodicka HR, Seah BKB, Pruesse E. 2020. phyloFlash: rapid small-subunit rRNA profiling and targeted assembly from metagenomes. mSystems 5:e00920-20. doi: 10.1128/mSystems.00920-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Barberán A, Dunn RR, Reich BJ, Pacifici K, Laber EB, Menninger HL, Morton JM, Henley JB, Leff JW, Miller SL, Fierer N. 2015. The ecology of microscopic life in household dust. Proc R Soc B 282:20151139. doi: 10.1098/rspb.2015.1139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Vandewalle JL, Goetz GW, Huse SM, Morrison HG, Sogin ML, Hoffmann RG, Yan K, McLellan SL. 2012. Acinetobacter, Aeromonas, and Trichococcus populations dominate the microbial community within urban sewer infrastructure. Environ Microbiol 14:2538–2552. doi: 10.1111/j.1462-2920.2012.02757.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tamames J, Puente-Sanchez F. 2019. SqueezeMeta, a highly portable, fully automatic metagenomic analysis pipeline. Front Microbiol 9:3349. doi: 10.3389/fmicb.2018.03349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lauber CL, Hamady M, Knight R, Fierer N. 2009. Pyrosequencing-based assessment of soil pH as a predictor of soil bacterial community structure at the continental scale. Appl Environ Microbiol 75:5111–5120. doi: 10.1128/AEM.00335-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Griffiths RI, Thomson BC, James P, Bell T, Bailey M, Whiteley AS. 2011. The bacterial biogeography of British soils. Environ Microbiol 13:1642–1654. doi: 10.1111/j.1462-2920.2011.02480.x. [DOI] [PubMed] [Google Scholar]
  • 34.Niño-García JP, Ruiz-González C, Del Giorgio PA. 2016. Interactions between hydrology and water chemistry shape bacterioplankton biogeography across boreal freshwater networks. ISME J 10:1755–1766. doi: 10.1038/ismej.2015.226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Bautista-de Los Santos QM, Chavarria KA, Nelson KL. 2019. Understanding the impacts of intermittent supply on the drinking water microbiome. Curr Opin Biotechnol 57:167–174. doi: 10.1016/j.copbio.2019.04.003. [DOI] [PubMed] [Google Scholar]
  • 36.Hozalski RM, LaPara TM, Zhao X, Kim T, Waak MB, Burch T, McCarty M. 2020. Flushing of stagnant premise water systems after the COVID-19 shutdown can reduce infection risk by Legionella and Mycobacterium spp. Environ Sci Technol 54:15914–15924. doi: 10.1021/acs.est.0c06357. [DOI] [PubMed] [Google Scholar]
  • 37.Cullom AC, Martin RL, Song Y, Williams K, Williams A, Pruden A, Edwards MA. 2020. Propensity of premise plumbing pipe materials to enhance or diminish growth of Legionella and other opportunistic pathogens. Pathogens 9:957. doi: 10.3390/pathogens9110957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ley CJ, Proctor CR, Singh G, Ra K, Noh Y, Odimayomi T, Salehi M, Julien R, Mitchell J, Nejadhashemi AP, Whelton AJ, Aw TG. 2020. Drinking water microbiology in a water-efficient building: stagnation, seasonality, and physicochemical effects on opportunistic pathogen and total bacteria proliferation. Environ Sci Water Res Technol 6:2902–2913. doi: 10.1039/D0EW00334D. [DOI] [Google Scholar]
  • 39.Vikesland PJ, Pruden A, Alvarez PJJ, Aga D, Bürgmann H, Li X-D, Manaia CM, Nambi I, Wigginton K, Zhang T, Zhu Y-G. 2017. Toward a comprehensive strategy to mitigate dissemination of environmental sources of antibiotic resistance. Environ Sci Technol 51:13061–13069. doi: 10.1021/acs.est.7b03623. [DOI] [PubMed] [Google Scholar]
  • 40.Nguyen AQ, Vu HP, Nguyen LN, Wang Q, Djordjevic SP, Donner E, Yin H, Nghiem LD. 2021. Monitoring antibiotic resistance genes in wastewater treatment: current strategies and future challenges. Sci Total Environ 783:146964. doi: 10.1016/j.scitotenv.2021.146964. [DOI] [PubMed] [Google Scholar]
  • 41.Sukhum KV, Diorio-Toth L, Dantas G. 2019. Genomic and metagenomic approaches for predictive surveillance of emerging pathogens and antibiotic resistance. Clin Pharmacol Ther 106:512–524. doi: 10.1002/cpt.1535. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Che Y, Xia Y, Liu L, Li A-D, Yang Y, Zhang T. 2019. Mobile antibiotic resistome in wastewater treatment plants revealed by Nanopore metagenomic sequencing. Microbiome 7:44. doi: 10.1186/s40168-019-0663-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yang Y, Li B, Zou S, Fang HHP, Zhang T. 2014. Fate of antibiotic resistance genes in sewage treatment plant revealed by metagenomic approach. Water Res 62:97–106. doi: 10.1016/j.watres.2014.05.019. [DOI] [PubMed] [Google Scholar]
  • 44.Hayes S, Mahony J, Nauta A, van Sinderen D. 2017. Metagenomic approaches to assess bacteriophages in various environmental niches. Viruses 9:127. doi: 10.3390/v9060127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Prussin AJ, II, Belser JA, Bischoff W, Kelley ST, Lin K, Lindsley WG, Nshimyimana JP, Schuit M, Wu Z, Bibby K, Marr LC. 2020. Viruses in the Built Environment (VIBE) meeting report. Microbiome 8:1. doi: 10.1186/s40168-019-0777-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Dion MB, Oechslin F, Moineau S. 2020. Phage diversity, genomics, and phylogeny. Nat Rev Microbiol 18:125–138. doi: 10.1038/s41579-019-0311-5. [DOI] [PubMed] [Google Scholar]
  • 47.Stachler E, Bibby K. 2014. Metagenomic evaluation of the highly abundant human gut bacteriophage crAssphage for source tracking of human fecal pollution. Environ Sci Technol Lett 1:405–409. doi: 10.1021/ez500266s. [DOI] [Google Scholar]
  • 48.Paez-Espino D, Eloe-Fadrosh EA, Pavlopoulos GA, Thomas AD, Huntemann M, Mikhailova N, Rubin E, Ivanova NN, Kyrpides NC. 2016. Uncovering Earth’s virome. Nature 536:425–430. doi: 10.1038/nature19094. [DOI] [PubMed] [Google Scholar]
  • 49.de Jonge PA, Nobrega FL, Brouns SJJ, Dutilh BE. 2019. Molecular and evolutionary determinants of bacteriophage host range. Trends Microbiol 27:51–63. doi: 10.1016/j.tim.2018.08.006. [DOI] [PubMed] [Google Scholar]
  • 50.Reeves K, Liebig J, Feula A, Saldi T, Lasda E, Johnson W, Lilienfeld J, Maggi J, Pulley K, Wilkerson PJ, Real B, Zak G, Davis J, Fink M, Gonzales P, Hager C, Ozeroff C, Tat K, Alkire M, Butler C, Coe E, Darby J, Freeman N, Heuer H, Jones JR, Karr M, Key S, Maxwell K, Nelson L, Saldana E, Shea R, Salveson L, Tomlinson K, Vargas-Barriga J, Vigil B, Brisson G, Parker R, Leinwand LA, Bjorkman K, Mansfeldt C. 2021. High-resolution within-sewer SARS-CoV-2 surveillance facilitates informed intervention. Water Res 204:117613. doi: 10.1016/j.watres.2021.117613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Andrews S. 2010. FastQC: a quality control tool for high throughput sequence data. http://www.bioinformatics.babraham.ac.uk/projects/fastqc.
  • 52.Martin M. 2011. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet j 17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  • 53.Joshi NA, Fass JN. 2011. Sickle: a sliding-window, adaptive, quality-based trimming tool for FastQ files (1.33). https://github.com/najoshi/sickle.
  • 54.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 41:D590–D596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Manor O, Borenstein E. 2015. MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome. Genome Biol 16:53. doi: 10.1186/s13059-015-0610-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Alcock BP, Raphenya AR, Lau TTY, Tsang KK, Bouchard M, Edalatmand A, Huynh W, Nguyen A-LV, Cheng AA, Liu S, Min SY, Miroshnichenko A, Tran H-K, Werfalli RE, Nasir JA, Oloni M, Speicher DJ, Florescu A, Singh B, Faltyn M, Hernandez-Koutoucheva A, Sharma AN, Bordeleau E, Pawlowski AC, Zubyk HL, Dooley D, Griffiths E, Maguire F, Winsor GL, Beiko RG, Brinkman FSL, Hsiao WWL, Domselaar GV, McArthur AG. 2020. CARD 2020: antibiotic resistome surveillance with the comprehensive antibiotic resistance database. Nucleic Acids Res 48:D517–D525. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Tithi SS, Aylward FO, Jensen RV, Zhang L. 2018. FastViromeExplorer: a pipeline for virus and phage identification and abundance profiling in metagenomics data. PeerJ 6:e4227. doi: 10.7717/peerj.4227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Paez-Espino D, Chen I-MA, Palaniappan K, Ratner A, Chu K, Szeto E, Pillay M, Huang J, Markowitz VM, Nielsen T, Huntemann M, K Reddy TB, Pavlopoulos GA, Sullivan MB, Campbell BJ, Chen F, McMahon K, Hallam SJ, Denef V, Cavicchioli R, Caffrey SM, Streit WR, Webster J, Handley KM, Salekdeh GH, Tsesmetzis N, Setubal JC, Pope PB, Liu W-T, Rivers AR, Ivanova NN, Kyrpides NC. 2017. IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses. Nucleic Acids Res 45:D457–D465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Csardi G, Nepusz T, et al. 2006. The igraph software package for complex network research. InterJournal, Complex Systems 1695:1–9. [Google Scholar]
  • 60.R Core Team. 2017. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. [Google Scholar]
  • 61.Oksanen J, et al. 2011. Multivariate analysis of ecological communities in R: vegan tutorial. R Package Version 1:1–43. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

FIG S1

Variation in the 20 dominant bacteria families, across each location and across the five identified clusters. All 20 families are significantly different in relative abundance across clusters (P < 0.01 in all cases, except for Moraxellaceae with P < 0.05). Download FIG S1, PDF file, 0.7 MB (744KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S2

Heatmap showing the Bray-Curtis distance from mean community composition per location and per sampling date, with warmer colors indicating higher dissimilarity in overall community composition. This figure highlights the asynchrony in the temporal patterns in community composition across all sampling locations. Cells that are gray indicate locations and days when samples were not collected. Download FIG S2, PDF file, 0.03 MB (35KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TABLE S1

Mantel test and multiple regression on distance matrices (MRM) results for environmental variables associated with temporal patterns in prokaryotic community composition. Download Table S1, PDF file, 0.04 MB (43KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S3

Heatmap depicting the mean proportional abundances of fecal indicator taxa each week separated by sampling site. Darker colors indicate higher proportional abundances of fecal indicator taxa, while lighter colors represent a lower proportional abundance. The week 2 samples had higher proportional abundances of fecal indicator taxa across 13 of the 15 sites, a pattern we suggest is associated with increases in sewage inputs during the snowstorm on 8 to 9 September 2020. Download FIG S3, PDF file, 0.1 MB (53KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S4

Line plot highlighting changes in the relative abundances of Legionella and Acinetobacter across time. Error bars represent 95% confidence intervals around the mean, as calculated from all sampling locations. The initially high relative abundances, followed by a decrease through day 5, highlight the potential influence of flushing of stagnant water after campus activities resumed and vacant buildings were reoccupied at the start of this study. Download FIG S4, PDF file, 0.03 MB (27.7KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S5

Heatmap of the dominant viral taxa across sampling locations. Viral taxa with relative abundances of >0.5% are included in the heatmap. Download FIG S5, PDF file, 0.2 MB (172.7KB, pdf) .

Copyright © 2022 Fierer et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Reviewer comments
reviewer-comments.pdf (335KB, pdf)

Data Availability Statement

Raw sequence reads for all samples and the associated sample metadata have been uploaded to the NCBI Sequence Read Archive repository under accession number PRJNA875025.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES