Skip to main content
FEMS Microbiology Reviews logoLink to FEMS Microbiology Reviews
. 2023 Jun 7;47(4):fuad028. doi: 10.1093/femsre/fuad028

Have genetic targets for faecal pollution diagnostics and source tracking revolutionized water quality analysis yet?

Katalin Demeter 1,2, Rita Linke 2,2, Elisenda Ballesté 3, Georg Reischer 4,2, René E Mayer 5,6,2, Julia Vierheilig 7,2, Claudia Kolm 8,2, Margaret E Stevenson 9,2, Julia Derx 10,2, Alexander K T Kirschner 11,12,2, Regina Sommer 13,2, Orin C Shanks 14, Anicet R Blanch 15, Joan B Rose 16, Warish Ahmed 17, Andreas H Farnleitner 18,19,2,
PMCID: PMC10368376  PMID: 37286726

Abstract

The impacts of nucleic acid-based methods - such as PCR and sequencing - to detect and analyze indicators, genetic markers or molecular signatures of microbial faecal pollution in health-related water quality research were assessed by rigorous literature analysis. A wide range of application areas and study designs has been identified since the first application more than 30 years ago (>1100 publications). Given the consistency of methods and assessment types, we suggest defining this emerging part of science as a new discipline: genetic faecal pollution diagnostics (GFPD) in health-related microbial water quality analysis. Undoubtedly, GFPD has already revolutionized faecal pollution detection (i.e., traditional or alternative general faecal indicator/marker analysis) and microbial source tracking (i.e., host-associated faecal indicator/marker analysis), the current core applications. GFPD is also expanding to many other research areas, including infection and health risk assessment, evaluation of microbial water treatment, and support of wastewater surveillance. In addition, storage of DNA extracts allows for biobanking, which opens up new perspectives. The tools of GFPD can be combined with cultivation-based standardized faecal indicator enumeration, pathogen detection, and various environmental data types, in an integrated data analysis approach. This comprehensive meta-analysis provides the scientific status quo of this field, including trend analyses and literature statistics, outlining identified application areas, and discusses the benefits and challenges of nucleic acid-based analysis in GFPD.

Keywords: faecal pollution microbiology, DNA/RNA analysis, faecal indicator, faecal and MST markers, microbial source tracking, systematic review


The proposed discipline of genetic faecal pollution diagnostics helps addressing complex water, sanitation, and human health-related issues: this systematic review elucidates application areas, key research questions and study designs from over 1100 scientific articles and provides critical insights and conclusions.


Abbreviations

16S AmpSeq

16S rRNA gene amplicon sequencing

aLOD

assay limit of detection

AMR

antimicrobial resistance

ARB

antibiotic resistant bacteria

ARG

antibiotic resistance gene

CSO

combined sewer overflow

DGGE

denaturing gradient gel electrophoresis

dPCR

digital polymerase chain reaction

ET-qPCR

enzymatic treatment qPCR

FIO

faecal indicator organism

GFPD

genetic faecal pollution diagnostics

HAdV

human adenovirus

HDA

helicase dependent amplification

HRWM

health-related water microbiology

HTS

high-throughput sequencing

LAMP

loop-mediated isothermal amplification

MST

microbial source tracking

mtDNA

(host) mitochondrial DNA

PCR

polymerase chain reaction

PMA

propidium monoazide

PMMoV

pepper mild mottle virus

QMRA

quantitative microbial risk assessment

sLOD

sample limit of detection

USEPA

United States Environmental Protection Agency

qPCR

quantitative polymerase chain reaction

WASH

water, sanitation, and hygiene

WCA

whole chain analysis (sampling, processing, and analysis)

WWTP

wastewater treatment plant

Glossary

General terms

Genetic (method, detection, target, and so on): nucleic acid-based

Microbial source tracking (MST): methods to discriminate between human and various nonhuman sources of faecal contamination using microorganisms as primary diagnostic sources of information. Chemical and other parameters may provide complimentary information.

Terms describing indicator types

General FIO: an intestinal microorganism whose presence in the environment indicates the presence of faecal matter (without discrimination among sources).

Host-associated faecal indicator: an intestinal microorganism, i.e. strongly associated with its particular host species or range of host species. Its presence provides information about the faecal pollution sources in the environment.

Index organism: a microorganism (often a faecal indicator) that indicates the presence of a specific intestinal pathogen or groups of intestinal pathogens.

Risk indicator: a microorganism (often a faecal indicator) for which the correlation to waterborne disease has been clearly demonstrated and quantified. Threshold values are then derived, where a certain concentration of the risk indicator corresponds, in the given exposure scenario, to a given health risk (rate of the selected waterborne disease).

Treatment indicator: a microorganism indicative of the behaviour of a certain pathogen (group) in wastewater treatment and disinfection processes.

Transport surrogate: a microorganism mimicking the behaviour of a certain pathogen (group) in surface and subsurface microbial fate and transport.

Terms related to genetic methods for faecal pollution detection

General faecal marker: a nucleic acid target indicative of total faecal pollution (with no discrimination among sources), including the genetic detection of traditional general faecal indicators also amenable to cultivation-based enumeration (such as E. coli, enterococci) and of abundant intestinal obligate anaerobes [e.g. universal Bacteroidota (formerly Bacteroidetes) markers].

Host-associated faecal marker or MST marker: a nucleic acid target strongly associated with a particular host species or range of host species. Its presence in water provides information about the faecal pollution source(s) in the environment. Prokaryotic MST targets are often host-associated, occurring in nontargets at a lower rate. In contrast, viral MST targets can be host-specific and not detectable in nontargets.

Genetic faecal pollution diagnostics (GFPD): any methodology that relies on the detection and/or quantification of nucleic acid-based targets to detect or characterize microbial faecal pollution in the broadest sense.

Introduction

Safe drinking water, sanitation, and hygiene (WASH) are prerequisites to good health and well-being. Despite considerable global progress in recent decades, ~829 000 people still die each year from diarrheal disease, primarily through faecal–oral pathways, due to unsafe WASH practices (World Health Organisation 2019). While there is clear evidence that safely managed water resources, water supply, and adequate sanitation reduce the health risks related to water exposure and consumption (drinking, recreational activities, household exposure as well as transmission through irrigation, aquaculture, and so on), there is a constant, urgent need for more comprehensive, informative, and rapid microbiological assessment approaches to elucidate intricate WASH-related questions and to clarify complex faecal contamination issues.

For well over 100 years, faecal pollution assessment through the microbiological analysis of water has relied on the cultivation-based detection of facultative anaerobic bacterial colonizers of the animal and human gut, e.g. total coliforms, faecal coliforms, Escherichia coli, and intestinal enterococci. Recent advances in nucleic acid sequencing methods and bioinformatics have revealed the immense richness and diversity of the gut microbiota, opening unprecedented possibilities to develop new microbiological assessment approaches. Given the great diversity of assessment types made possible by genetic detection and analysis methods, we introduce the new term ‘genetic faecal pollution diagnostics (GFPD)’ to cover the entirety of this field, wherein ‘genetic’ means ‘nucleic acid-based’. For terms and definitions, please refer to the 'Glossary'.

Gut microbiotas are profoundly different from free-living microbial communities (e.g. Chen et al. 2018) across the biosphere (Ley et al. 2008). The Human Microbiome Project revealed Bacteroidota and Firmicutes to be the dominant phyla in the human gut, with substantial variability among individuals (The Human Microbiome Project Consortium 2012). The microbiome of municipal wastewater provides a community fingerprint that captures this diversity, with significantly lower community-level variability compared to individuals (Newton et al. 2015). In addition to faecal taxa, the wastewater microbiome also harbours a large proportion of wastewater infrastructure-related microorganisms (Shanks et al. 2013). The within-species variability in the human gut proves to be minor in comparison to the stark differences among other animal species, where both host phylogeny and diet are key drivers (Ley et al. 2008, Youngblut et al. 2019, Mallott and Amato 2021, Youngblut et al. 2021). In addition to the prokaryotic community, the gut also harbours a great diversity of viruses (bacteriophages, viruses of archaea, and of human cells as well as viruses transiently present in food; Liang and Bushman 2021). Novel molecular biological and genetic tools offer fascinating new ways to analyse and track faecal microorganisms or viruses in water. To date, these opportunities have only partially been exploited, and future research is poised to further the discovery and impacts of the GFPD field.

The aim of this work is to assess the impacts of nucleic acid-based methods on faecal pollution detection and analysis in the field of health-related water microbiology (HRWM). For the first time, this review provides a critical analysis of the new possibilities that state-of-the-art genetic methods have opened in a great diversity of application areas. This is accomplished via a systematic literature review to identify GFPD application areas, key research questions, and study designs from more than 1100 peer-reviewed publications, since the very beginning of using such molecular techniques in the environmental water compartment. The review focuses on genetic targets and parameters that take a faecal indication role; therefore, specific pathogen detection is only included if the indicator role is explicitly stated. Furthermore, description of the various methodological developments of molecular methods and their evaluation is outside the scope of this effort (please find a selection of methodological review articles in the section ‘Background information on genetic targets and methods: a historical overview’). The outcomes of the systematic literature review include trend analyses of relevant scientific literature (‘Outcomes of the systematic study design analysis’), followed by the analysis and discussion of seven identified application areas in HRWM (‘In-depth review of the application areas of genetic faecal pollution diagnostic through case studies’). The review concludes with a critical discussion on the benefits and limitations of GFPD in health-related water quality research and management. Figure 1 provides an overview of this article.

Figure 1.

Figure 1.

Overview and structure of this review article.

Background information on genetic targets and methods: a historical overview

Cultivation-based methods for faecal pollution detection: where it all began

The first routine bacteriological analyses of drinking water were initiated by Percy and Grace Frankland in London in 1885, building on the seminal work of Robert Koch and colleagues regarding microbiological media for detecting bacteria (Koch 1881). Around this time, Escherich described the bacterium that was later renamed Escherichia coli, in the faeces of breast-fed children (Escherich 1886, Castellani and Chalmers 1919). E. coli is currently one of the most widely used faecal indicator organisms (FIO; see the section ‘Glossary’) for water quality testing (Levine 1921, Perry and Bayliss 1936, Geldreich 1966), together with intestinal enterococci (Kjellander 1960, Geldreich and Kenner 1969) and their phages, such as somatic coliphages, and F-specific RNA bacteriophages (Grabow 2001, Jofre et al. 2016).

These standardized, cultivation-based FIO parameters have found their way into regulations all over the world and are still the gold standards for monitoring general faecal pollution in most types of water resources. While these FIOs revolutionized water quality testing and public health protection at the end of the 19th century, they also face several limitations. For example, most protocols require more than one working day to produce results, and these FIOs are unable to differentiate between faecal pollution sources (i.e. human, bird, cattle, and so on). It must be mentioned that host-associated cultivable enteric microorganisms, such as human-associated sorbitol-fermenting bifidobacteria are known (Mara and Oragui 1983, Mushi et al. 2010) and have paved the way for the field of microbial source tracking (MST; see the section ‘Glossary’). However, advances in molecular biology offered an unprecedented range of new opportunities to develop genetic technologies that can provide same-day water quality results and characterize key sources of faecal pollution.

The early days of genetic methods for faecal pollution diagnostics

Faecal indicator bacteria often show tremendous genotypic subspecies variation. MST studies in the early 2000s intensively attempted to exploit this strain-level diversity by genetic fingerprinting and -typing methods (e.g. repetitive element PCR, ribotyping, amplified fragment length polymorphism, and pulsed-field gel electrophoresis) to track the origin of E. coli and enterococci isolates (Mott and Smith 2011). Large isolate libraries, covering faecal pollution sources and polluted water bodies in a given catchment of interest, were typed. Band/fingerprint-patterns were statistically analysed to account for the high spatial and temporal variation (classical library-based MST; Domingo et al. 2007, Mott and Smith 2011). Such library-based genotyping strategies were also used to evaluate the general faecal indication capacity of faecal indicator bacteria (Ishii et al. 2006, Ishii and Sadowsky 2008).

Detection and quantification of genetic markers for faecal pollution diagnostics

Genetic characterization has led to the identification of key genes associated with a specific host, which represents a significant source of pollution (Bernhard and Field 2000). With the advent of conventional end-point PCR in the 1990s, the first studies appeared on targeted detection of general and host-associated genetic bacterial and viral targets for water quality monitoring (Bej et al. 1990, Puig et al. 1994, Bernhard and Field 2000), reviewed in Scott et al. (2002) and Noble and Weisberg (2005), which were later adapted to quantitative real-time PCR (qPCR; Seurinck et al. 2005).

The use of conventional PCR for target quantification has many limitations. Thus, qPCR appeared in the field of GFPD in the early 2000s and became the most widespread cultivation-independent technology (Jofre and Blanch 2010). Today, there are numerous qPCR assays for a wide variety of bacterial and viral targets, such as enterococci (USEPA 2012a, 2013), E. coli (Sivaganesan et al. 2019), human- and other animal-associated bacterial markers [original works: (Reischer et al. 2006, Shanks et al. 2008, Mieszkin et al. 2009), large-scale evaluations: (Layton et al. 2013, Reischer et al. 2013, Mayer et al. 2018), and reviews: (Wuertz et al. 2011, García-Aljaro et al. 2018)], viral MST markers including crAssphage [(García-Aljaro et al. 2017, Stachler et al. 2017) reviewed in Bivins et al. (2020)] and pepper mild mottle virus [PMMoV, (Rosario et al. 2009), reviewed in Kitajima et al. (2018), Symonds et al. (2018)] or human enteroviruses (reviewed in Farkas et al. (2020). Archaeal targets (Ufnar et al. 2006) and host mitochondrial DNA targets (Martellini et al. 2005, Schill and Mathes 2008, Malla and Haramoto 2020) have also been proposed as host-associated MST tools. Interestingly, intestinal fungi have not yet been targeted. A good overview of the most useful indicators and MST markers for which qPCR assays are available is provided in the online Global Water Pathogens Project (GWPP) book for bacterial (Harwood et al. 2018) and viral indicators of faecal pollution (Ahmed and Harwood 2017) or in a recent review article (Li et al. 2021a). Many of these methods have been subjected to multiple laboratory performance assessments and shown to be highly reproducible when standardized protocols are used (Ebentier et al. 2013, Shanks et al. 2016). Some human-associated qPCR assays are even available as government agency standardized protocols (USEPA 2019a, b) with certified companion reference materials (Kralj et al. 2021, Sivaganesan et al. 2022, Willis et al. 2022).

More recent research foci of genetic analysis methods include ease of use, rapid field-testing, and more sensitive and reproducible methods. For example, isothermal amplification assays such as LAMP (loop-mediated isothermal amplification; Martzy et al. 2017) or HDA (helicase dependent amplification; Kolm et al. 2017) have been developed for rapid enterococci detection in environmental waters; an overview can be found in Nieuwkerk et al. (2020).

In contrast to qPCR, where quantification of target genes relies on a calibration model, digital PCR (dPCR) allows quantification based on Poisson statistics of presence/absence results from thousands to millions of reaction mixture compartments per sample. Advances in microfabrication technologies in the 2010s have allowed the development of commercial dPCR platforms, making this an emerging and highly promising technology for the GFPD field (Tiwari et al. 2022).

High-throughput DNA sequencing for genetic faecal pollution diagnostics

With the advent of high-throughput DNA sequencing (HTS) in the 2010s, whole-community profiling revolutionized gut microbiome research. This, in turn, has enabled the identification of new host-associated and general faecal pollution targets followed by the development of new qPCR assays (McLellan and Eren 2014, Bibby et al. 2019). Applying HTS to environmental samples stimulated the development of entirely new concepts for the GFPD field. HTS-based approaches have evolved rapidly, concomitant with rising capabilities in computing and bioinformatics (Garner et al. 2021). Currently, the two most widely used methods are 16S rRNA gene amplicon sequencing (16S AmpSeq) providing taxonomic information, and whole metagenome sequencing, allowing, in addition to taxonomic profiling, the identification of functional genes, such as virulence or antibiotic resistance genes (ARGs; Chan et al. 2019). There are two strategies to use HTS for faecal pollution analysis in aquatic environments. One approach works by identifying gut-associated taxa within the complex aquatic microbiome signal and thus identifying the presence of faecal pollution (e.g. Ulrich et al. 2016). The other approach relies on predefined faecal reference sequence libraries, based on a local sample collection and public sequence databases and aims to identify specific sources of faecal pollution. Sophisticated machine learning algorithms such as SourceTracker, FEAST, or FORENSIC are then required for data analysis and interpretation (Tan et al. 2015, Unno et al. 2018, Mathai et al. 2020, Raza et al. 2021). HTS, as currently applied for most applications in microbiomics, only provides relative quantification within the sequence pool recovered (% of target sequences within total recovered sequences). The resolution depends on the applied sequencing depth (i.e. number of total sequence reads per sample). Per se, it does not provide quantitative information on the analysed sequences in relation to their occurrence in the water sample (see the ‘Sensitivity of environmental detection of nucleic acid targets’ section). In its current form of application in GFPD, HTS seems to be of complementary nature to the qPCR/dPCR quantification of genetic fecal markers.

Methods of the systematic study design analysis

Literature database searches

The literature databases Scopus and Web of Science/Core Collection were searched for studies on genetic methods to detect microbial faecal pollution in water. In both cases, the query included the following building blocks: ‘genetic methods’ AND ‘faeces’ AND ‘water quality’, with a suite of related words for each term. ‘Genetic methods’: (genetic OR qPCR OR ddPCR OR PCR OR ribotyp* OR DGGE OR metagenomics OR ‘microbial communit*’ OR ‘bacterial communit*’ OR ‘microbial diversity’ OR (source AND track*)); ‘faeces’: (feces OR faeces OR fecal OR faecal OR wastewater OR sewage OR enteric OR intestinal); and ‘water quality’: ((water* OR freshwater OR seawater) AND (quality OR pollution OR contamination)). Each of the blocks was searched in the title, the abstract and the author keyword fields. The document type was restricted to research articles. The time period covered expanded from the first such article up until the end of 2022. The resulting list included 3112 articles from Web of Science/Core Collection and 3508 articles from Scopus. After removing duplicates and articles with no DOI, the combined list contained 3554 articles (Fig. 2). The search syntax and the retrieved records are available as supplementary data (‘Demeter et al GFPD review Suppl Data.xlsx’).

Figure 2.

Figure 2.

Methodology of the systematic literature analysis.

Article screening

Next, the combined list (titles and abstracts) was screened manually to remove off-topic studies. Only articles that explicitly stated the use of at least one genetic microbial parameter as an indicator for faecal pollution diagnostics (but not if used as e.g. enteric pathogen) were retained. Studies developing and evaluating new methods for GFPD as well as their field application were retained. A total of 1122 articles fulfilled these criteria (‘all genetic studies’, Fig. 2).

Broad categorization of ‘all genetic studies’

The 1122 articles in the ‘all genetic studies’ pool were then categorized based on their broad study aim, as follows: (1) method establishment articles: the research question relates to method development and evaluation/validation (sensitivity/specificity, persistence, resistance, and so on). (2) Application articles: the research question relates to the environment, and the genetic parameter is assumed to have been previously validated. Studies on, e.g. the detection and source tracking of faecal pollution, or the estimation of the associated health risk, belong to this category. (3) Both: articles having both method establishment and application aspects. Since the review aims to assess application areas, articles from (2) and (3) were retained for detailed analysis (‘application studies’, n = 649, Fig. 2).

Systematic analysis of the ‘application studies’

Titles and abstracts from all application studies (n = 649, Fig. 2) were reviewed to extract information on five study elements: (i) genetic faecal parameters, (ii) other types of parameters, (iii) sample type and use, (iv) data analysis approach, and (v) application area. The following section and Table 1 describe the study element definitions.

Table 1.

Systematic study design analysis. Each article in the ‘application studies’ pool was assessed for each study element (columns), with a single or multiple choices from the categories (rows).

Genetic faecal parameters Other types of parameters Data analysis approach Sample Application area
Class Target organism Host Method Class Sample type Use type
General faecal marker, traditional (e.g. E. coli, enterococci) Prokaryotes General faecal PCR Cultivation-based FIO Summary statistics, qualitative data Freshwater Recreational Faecal pollution detection
General faecal marker, new (e.g. general Bacteroidetes) Viruses Human or sewage qPCR/dPCR Cultivation-based MST Summary statistics, quantitative data Seawater Irrigation Source tracking: single source
Microbial source tracking marker (MST marker) Host cell mitochondrial DNA (mtDNA) Nonhuman Sequencing Pathogen Correlations, hypothesis tests, or simple bioinformatics Estuary Drinking Source tracking: multiple sources
Microbial source tracking, other approaches (MST other) Other Multiple hosts Other Epidemiology Multivariate statistics or advanced bioinformatics Domestic water Shellfish-growing Evaluation of treatment processes
Community analysis Chemical tracers QMRA, fate, and transport modelling Groundwater Other Infection and health risk assessment
Other Physicochemistry and nutrients Other data analyses Rainwater N/A Outbreak tracing and wastewater surveillance
Antibiotic resistance Faeces Other applications
Hydrology Sewage
Meteorology Stormwater, CSO
Land use Soil
Other Sediment and sand
Microcosm or spiked water
Other
  1. Genetic faecal parameters: the two selection criteria for microbial parameters included here were (1) detection using genetic methods and (2) an indicator role; pathogens were only included if the indicator role was explicitly stated (e.g. ‘viral indicator’).

    1. Class. Six genetic faecal parameter ‘classes’ were distinguished, where parameter ‘class’ is defined as a group of similar parameters. General faecal markers, indicating faecal pollution in general (covering human and other animal sources), are represented by two classes: ‘traditional general faecal markers’ that target microorganisms or bacteriophages for which the cultivation-based analysis is standardized and widely used (e.g. E. coli, enterococci), and ‘new general faecal markers’ that have been more recently developed and target highly abundant obligate anaerobes of the gut, such as Bacteroides spp. MST methods are divided into two classes: the various host-associated viral, bacterial or mitochondrial DNA-based markers are in the ‘MST markers’ class, while the ‘MST other’ class includes HTS-based as well as classical library-based, genotyping MST approaches. The class ‘community analysis’ covers genotyping- or HTS-based approaches to describe the microbial community. Finally, all other genetic methods for faecal pollution analysis, such as nonlibrary-based genotyping (e.g. E. coli population structure using strains or E. coli phylogroups) or treatment indicators that are not typical faecal indicators (e.g. pathogens), are included in the ‘other’ class.

    2. Target organism. This study element describes taxonomical groupings covering the major target types in genetic faecal pollution analysis, such as ‘prokaryotes’, ‘viruses’, and the mitochondrial DNA of the host animal itself (‘mtDNA’). Other target types, such as eukaryotes using 18S rRNA gene sequencing or if the target organism was not defined, are included in the category ‘other’.

    3. Host. Target organisms, and therefore, nucleic acid targets, may be host-associated, i.e. associated with a particular host species or narrow range of host species or may be general, i.e. associated with a wide range of host species. Four host categories are distinguished, ‘general’, ‘human’ (human- or sewage-associated), ‘nonhuman’ (associated with other animals), and ‘multiple hosts’ (more than one host was targeted). The category ‘not applicable’ was assigned to community analyses (fingerprinting, sequencing, and so on).

    4. Method. The great diversity of genetic methods for the detection of faecal pollution targets were grouped into four categories. Qualitative PCR methods (and cases where it was unclear whether qualitative or quantitative PCR was performed), are included in the category ‘PCR’. Quantitative PCR and dPCR are pooled because of their quantitative aspect in the category ‘qPCR/dPCR’. ‘Sequencing’ covers amplicon sequencing and whole metagenome analysis (shotgun sequencing). Finally, genetic fingerprinting techniques e.g. DGGE or BOX-PCR, hybridization, isothermal amplification, other methods or in case the method was not defined, are pooled in the category ‘other’.

  2. Other types of parameters:

    Class. All other parameters that the analysed articles reported were assessed on the level of parameter ‘class’, allowing an overview of the study design. Table 1 lists the 11 parameter classes that were identified. The class ‘other’ covers diverse parameters with low occurrence, e.g. biological oxygen demand, heterotrophic plate count, observational data on WASH practices.

  3. Sample type and use. A total of 13 categories of ‘sample type’, including various water types, faecal matter, and other materials, were identified. If the authors stated the intended use of the water resource, this was also logged. For a list of ‘sample type’ and ‘use type’ categories, please refer to Table 1.

  4. Data analysis approach. This study element describes how the dataset, characterized by the three study elements explained above, was analysed by the authors. In contrast to the three study elements, where several items could be logged, depending on the study design of the article, here each article was assigned to one of the six categories listed in Table 1. In cases in which only summary statistics were reported, we differentiated between qualitative data (occurrences) and quantitative data (minimun, maximum, median, and so on). Correlation analyses, hypothesis testing and simple bioinformatics such as sequence annotation and community analysis (e.g. Bray–Curtis dissimilarities) were grouped together into the category ‘correlations, hypothesis tests, or simple bioinformatics’. The category ‘multivariate statistics or advanced bioinformatics’ includes multivariate statistics, classification algorithms in the case of classical library-based MST, MST algorithms with HTS data, and HTS-based community analyses involving statistical analysis with metadata. Studies performing Quantitative Microbial Risk Assessment (QMRA) or microbial fate and transport models were grouped together in the category ‘QMRA, fate & transport modelling’. Other data analysis approaches, such as GIS-based data analysis, or, in the case of classical library-based MST, genotyping fingerprints without reporting a statistical classification method were assigned to the category ‘other data analyses’.

  5. Application area. Each article was assigned to one of the seven scientific application areas identified during the study design analysis. The application assignment is based on the predominant research question. For a list of the application areas, please refer to Table 1.

The assessment was performed in MS Excel. The resulting study design database (available as supplementary data, ‘Demeter et al GFPD review Suppl Data.xlsx’) was analysed and visualized in R, using tidyverse (Wickham et al. 2019). Co-occurrence networks were computed and visualized using igraph (Csardi and Nepusz 2006), following Ognyanova (2021). The pie diagrams over the map were created using scatterpie (Yu 2023) and ggplot2 (part of tidyverse). Alluvial diagrams that group and visualize categorical data, were created with ggalluvial (Brunson and Read 2023).

Outcomes of the systematic study design analysis

Broad study design trends across all articles

A systematic scientific literature database search followed by manual screening identified 1122 scientific articles (Fig. 2, ‘all genetic studies’). Research with genetic methods in this field started in the 1990s with a few articles per year, increasing to up to almost 100 articles in 2021 (Fig. 3). The broad categorization of study design types revealed three distinct phases: (i) the emergence of genetic methods in the 1990s with just a handful of articles published yearly; (ii) between ~2003 and 2010, the field started to grow with the main focus of research being on the development and validation (establishment) of new methods, namely, new general and host-associated faecal markers; (iii) since 2011, the field continues to grow, but there is a clear shift from method establishment activities to the implementation across a broad range of applications (Fig. 3). A closer look at the author affiliations reveals that Northern America is the dominant hub of both method establishment and application studies, with Europe and Asia coming second and third, respectively. Cooperation was evident among continents, demonstrating the international and interconnected nature of the GFPD field (Fig. 4).

Figure 3.

Figure 3.

Number of publications in the broad study design types over the years in the ‘all genetic studies’ pool (narticle = 1122, Fig. 2). The stacked bars show the broad categorization. The yellow line represents the percentage of pooled ‘application’ and ‘both’ categories (i.e. the ‘application studies’, Fig. 2) in the ‘all genetic studies’ pool.

Figure 4.

Figure 4.

Distribution of author affiliations among continents, according to the broad study design types. The size of pies indicates the number of articles affiliated on the given continent, the thickness of lines represents the number of coauthored articles (narticle = 1122).

Since the aspects of establishing methods have been duly reviewed elsewhere (see references in the ‘Background information on genetic targets and methods: a historical overview’ section), articles focused on these aspects were excluded from further analyses (Fig. 2).

‘Application studies’ trend analyses

‘Application studies’ (n = 649; Fig. 2) were reviewed to extract defined study elements ranging from parameters measured to ‘application area’ (Table 1, ‘Methods of the systematic study design analysis’). The following sections describe study element assignments and occurrence trends.

Parameter ‘class’ assignment and trends

Parameter ‘class’ assignments were designed to provide a coarse overview of the general experimental study design where parameter ‘class’ was defined as a group of similar parameters. A total of 17 parameter ‘class’ types, including six genetic and eleven other parameter classes, were identified during the systematic review ranging from ‘MST markers’ (measured by n = 434 articles) and ‘cultivation-based FIOs’ (n = 410) to ‘epidemiology’ (n = 13). A total of 468 articles (72% of ‘application studies’) included three or fewer parameter classes. A total of four parameter classes were reported by 116 articles, while complex study designs with five or more parameter classes were rare with only 65 articles. A co-occurrence network analysis indicated that the combination ‘MST markers’ and traditional ‘cultivation-based FIO’ was the most common one (n = 277 articles). In fact, not only were ‘MST markers’ paired often with ‘cultivation-based FIO’, but this was the most common combination for each of the genetic parameter classes. Additionally, ‘MST markers’ were often combined with ‘pathogens’ (n = 126 articles) and ‘physicochemistry and nutrients’ (n = 87 articles, Fig. 5).

Figure 5.

Figure 5.

Network analysis of the parameter ‘class’ assignment occurrence in the genetic faecal and other types of parameters (Table 1) in the ‘application studies’ pool (narticle = 649). The node size is proportional to the number of articles, the line thickness reflects the number of articles for a respective combination. Blue lines mark more than 20 co-occurrences while grey lines show less than 20 co-occurrences.

Genetic parameters: ‘target organism’, ‘host’, and ‘method’ assignments

All ‘application studies’ were mined for detailed information on the genetic parameters. For each parameter reported, the target organism, host organism, and analytical method were recorded, resulting in a total of 952 parameter occurrences across the 649 application studies. The most widely reported target organism was ‘prokaryotes’ (n = 756 parameter occurrences) followed by ‘viruses’ (n = 166). In contrast, ‘host mitochondrial DNA’ and ‘other’ target organisms collectively accounted for 30 parameter occurrences. Host assignments indicated that ‘human’ (n = 322) is the most widely researched host animal followed by ‘multiple hosts’ (n = 209), ‘general’, ‘faecal’ (n = 157), and ‘nonhuman’ (n = 40). Method assignments suggest that PCR-based methods account for the vast majority of parameter occurrences (n = 720), with ‘qPCR/dPCR’ methods used 82% of the time. ‘Sequencing’ was the next most prevalent method assignment group (n = 146). An alluvial plot (Fig. 6) illustrates linkages or lack thereof between class, target organism, host, and method parameters.

Figure 6.

Figure 6.

Alluvial plot showing the occurrence of genetic parameter types in the ‘application studies’ pool (narticle = 649). Each item, i.e. each line corresponds to one parameter measured in one study, so one ‘class’—‘target organism’—‘host’—‘method’ assignment. The thickness of the stratum (ribbon) corresponds to the number of studies that measured that particular class–organism–host–method combination. However, since a study might have measured several genetic parameters, the y-axis does not correspond to the number of articles in the ‘application studies’ pool.

Sample ‘type’ and intended ‘use’ assignments and trends

A total of 14 sample types were identified ranging from ‘freshwater’ (n = 394 articles, 61% of articles) and ‘seawater’ (n = 113) to ‘rainwater’, ‘microcosm’, ‘shellfish’, and ‘biofilm’ (each n ≤ 10, Fig. 7). The most common combinations were ‘freshwater’ and ‘sewage’ (n = 56), ‘freshwater’ and ‘faecal matter’ (n = 56) and ‘freshwater’ and ‘sediments and sand’ (n = 50, Fig. 7). Of the 836 reported sample types, where an intended use would potentially be relevant (i.e. all water types, ‘sewage’ and ‘sediment and sand’), the intended use was reported for 284 sample types, representing 254 articles. ‘Recreational’ and ‘drinking’ water were the most frequently described, accounting for 131 and 108 occurrences, respectively. ‘Irrigation’ and ‘shellfish-growing’ were seldom studied (n = 22, n = 15 occurrences, Fig. 8).

Figure 7.

Figure 7.

Network analysis of the ‘sample type’ assignment occurrence in the ‘application studies’ pool (narticle = 649). The node size is proportional to the number of articles, the line thickness reflects the number of articles for a respective combination. Blue lines mark more than 10 co-occurrences while grey lines show less than 10 co-occurrences. CSO denotes combined sewer overflow.

Figure 8.

Figure 8.

Alluvial plot showing the ‘sample type’—‘sample use’ combinations in the subpopulation of ‘application studies’ that reported this information (narticle = 254 articles). Since a study might have analysed several ‘sample types’, or indicated several ‘water uses’, the y-axis does not correspond to the total number of articles.

‘Data analysis approach’ assignment and trends

While 171 articles (26%) only report summary statistics (qualitative and quantitative), the majority report more sophisticated data analysis approaches such as ‘correlation analyses, hypothesis tests or simple bioinformatics’ (n = 214, 33%) and ‘multivariate statistics or advanced bioinformatics’ (n = 177, 27%). ‘QMRA or fate & transport modelling’ were found to be conducted only by a small portion of the articles (n = 29, 4%, Fig. 9).

Figure 9.

Figure 9.

Data analysis approach in each article in the ‘application studies’ pool (narticle = 649).

‘Application’ type assignment and trends

A total of seven genetic method application areas were identified in this systematic literature review (Fig. 10). In addition to faecal pollution detection using general faecal indicators (‘Application 1’, 91 articles), MST was the predominant use of genetic faecal markers (‘Application 2’ and ‘Application 3’, 356 articles in total). Most of these studies performed MST in the classical sense, investigating several potential sources (‘Application 3’, 230 articles), while 126 articles targeted just one source type, mainly human (‘Application 2’). To a much smaller extent, genetic faecal markers were found in performance assessments of (waste)water treatment and in studies of microorganism fate and transport in groundwater as transport surrogates (‘Application 4’, 44 articles). An equally small, but emerging field is health and infection risk assessment, where genetic methods have been found to be employed as risk indicators, or as support in selected steps of QMRA (‘Application 5’, 26 articles). Host-associated faecal indicators have also been used to trace the origin of waterborne outbreaks, elucidate pathogen transmission routes and support the interpretation of SARS-CoV-2 wastewater surveillance data (‘Application 6’, 25 articles). Apart from these core application areas, GFPD tools have also been found to support other scientific disciplines, such as the tracking of the source of nutrients or ARGs, as well as archaeology. The section ‘Application 7’ provides an overview of these additional areas (107 articles).

Figure 10.

Figure 10.

Assigned application areas in the ‘application studies’ pool (narticle = 649).

In-depth review of the application areas of genetic faecal pollution diagnostic through case studies

The following sections demonstrate the successful implementation of GFPD in the seven identified application areas of water quality research (Fig. 11). To do so, trend analyses of selected study elements for a given application area are presented at the beginning of each section, followed by an illustration of these findings through a collection of cutting-edge case studies.

Figure 11.

Figure 11.

Overview of the application areas of the genetic faecal pollution diagnostics field.

Application 1: faecal pollution detection

In general, there are two approaches to detect faecal pollution using genetic methods, and the 91 articles in this application category can be divided along these lines, with just a small overlap: (i) the targeted detection of traditional or new general faecal markers, mostly using qPCR (for definitions, see the section ‘Systematic analysis of the ‘application studies’, n = 36 articles); (ii) the nontargeted detection of faeces-related taxa using HTS (n = 50); and (iii) five articles measuring both. ‘Traditional general faecal markers’ were used more often than ‘new general faecal markers’ (n = 37 and n = 9 articles, respectively). In most instances, ‘traditional general faecal markers’ were measured in parallel with the corresponding ‘cultivation-based FIO’ parameter (28 out of 37 articles). The dominant method for community composition analysis was 16S AmpSeq (45 articles). ‘Freshwater’, ‘seawater’, and ‘sediments and sand’ were the most common sample types while ‘recreational’ and ‘drinking’ were the most frequently observed intended use types.

Targeted detection of general faecal indicators

Regulatory agencies, such as the United States Environmental Protection Agency (USEPA) have begun to capitalize on the potential of qPCR as a rapid monitoring solution for recreational waters, providing same-day results (< 4 h). In 2012, water quality beach action values for qPCR measurements of enterococci were included in the U.S. Recreational Water Quality Criteria (USEPA 2012b). This addition was based upon epidemiological studies conducted at freshwater and marine beaches that provided evidence that enterococci levels measured by qPCR are predictive for swimmer-related illness ((Wade et al. 2008, 2010), see details in the section ‘Application 5’).

Since then, enterococci qPCR (USEPA Methods 1611 and 1609.1) has been applied in several beach monitoring demonstration and implementation programs (Ferretti et al. 2013, Dorevitch et al. 2017, Byappanahalli et al. 2018). In one of the largest studies, nine Chicago beaches were monitored over the course of 894 beach-days in 2015 and 2016, resulting in 1796 water samples that were analysed by enterococci qPCR while maintaining standard E. coli cultivation testing, which is typically used at the Great Lakes (Dorevitch et al. 2017). Side-by-side comparison of the two approaches showed that enterococci qPCR beach action values were exceeded 3.4 times less frequently than E. coli cultivation beach action values (6.6% vs. 22.6% of beach-days) (Dorevitch et al. 2017). However, generalizations—such as that qPCR testing necessarily leads to fewer beach action value exceedances than cultivation-based testing—cannot be made. Several prior studies have found varying levels of agreement between E. coli cultivation and enterococci qPCR beach action value exceedances (Haugland et al. 2014, Byappanahalli et al. 2018). Moreover, data analysis of this large multibeach, multiyear evaluation study found that prior-day E. coli cultivation results are no better than chance alone at predicting current-day water quality at Chicago beaches (Dorevitch et al. 2017). Based upon these findings, enterococci qPCR testing was expanded by the local authority at up to 20 Lake Michigan beach locations from 2017 onwards and E. coli cultivation-based testing was discontinued (Shrestha and Dorevitch 2020).

More recently, the USEPA developed a draft standard method for qPCR testing of E. coli (‘Draft Method C’; Sivaganesan et al. 2019) driven by the need for rapid E. coli testing. In a large-scale method comparison effort, data from 101 Michigan (USA) recreational beaches from more than 6000 samples showed 91.5% agreement in beach notification outcomes between the cultivation-based standard of 300 MPN or CFU/100 ml and a putative threshold of 1.863 log10 gene copies/reaction, estimated in this study (Haugland et al. 2021). A strong correlation was observed between cultivation and qPCR results, with a Pearson R-squared value of 0.641 for the pooled data of the 39 sites passing the data eligibility criteria (sample n = 2092) (Haugland et al. 2021).

The universal Bacteroidales marker BacUni, a new general faecal marker, was evaluated together with three cultivation-based FIOs as a predictor of protozoan and bacterial pathogens in samples from rivers and estuaries in California, USA (Schriewer et al. 2010). The universal Bacteroidales marker was detected in all water samples at concentrations two orders of magnitude higher than cultivation-based FIOs. The results also showed the universal Bacteroidales marker to have a comparable or higher mean predictive potential than cultivation-based FIOs (Schriewer et al. 2010). The high abundance of new general faecal markers is certainly an asset, as sensitivity can become a challenging aspect for genetic faecal pollution detection in water resources with low faecal pollution levels (for details, see ‘Sensitivity of environmental detection of nucleic acid targets’ in the ‘Discussion’).

Nontargeted detection of faeces-related taxa using high-throughput sequencing

HTS approaches have emerged in microbial water quality monitoring allowing for new opportunities. From a public health perspective, HTS surveys have been shown to identify faecal taxa (e.g. Bacteroides) in aquatic microbial communities (Ulrich et al. 2016, Vadde et al. 2019). For instance, Ulrich et al. (2016) tracked changes in bacterial community composition in a riverine system during and after Superstorm Sandy (a 100-year storm event in 2012) using HTS and traditional cultivation-based faecal indicator testing. Bioinformatic analyses of 16S AmpSeq data showed a drastic restructuring of the bacterial community, associated with hydrological dynamics. The relative abundances of sequences matching faecal bacteria (Bacteroides, Clostridium, and Blautia genera) and potentially pathogenic populations (Campylobacter and Helicobacter) were observed to increase after the peak of the storm (Ulrich et al. 2016). Given that HTS applications can provide profiles of microbial communities and information on faeces-associated taxa, such genetic approaches may become useful as a screening tool in the future for identifying potential health risks and for prioritizing sites for follow-up analysis of water samples using targeted quantitative PCR approaches (Vadde et al. 2019, Jiang et al. 2020).

Application 2: MST of faecal pollution from a single source type

Faecal pollution may originate from a multitude of point and nonpoint sources. The need to identify the sources of faecal pollutions arose years ago, and since then, many different approaches have been developed and validated (‘Background information on genetic targets and methods: a historical overview’). Focusing the investigation on a single type of faecal source often happens (i) if there is evidence regarding the dominant source of pollution such that neglecting other sources is acceptable or (ii) the investigation specifically addresses one source type because, e.g. some faecal sources could represent a higher public health risk than others. In any case, the need to validate the hypothesis of the origin of contamination using a reliable analytical tool exists, since scientific evidence facilitates posterior effective measures.

Of the 126 articles in this application area, the single source was ‘human’ in 113 cases and only a handful of articles focused on ‘nonhuman’ sources such as ruminants, gulls, ducks, chickens, or dogs. The majority, 73%, of the articles combine ‘MST markers’ with the measurement of traditional ‘cultivation-based FIOs’. Other parameter classes that often appeared were ‘pathogens’, ‘traditional general faecal markers’, ‘physicochemistry and nutrients’ and ‘chemical tracers’ (n = 31 to n = 15 articles). ‘Freshwater’ was most often sampled (n = 80 articles), followed by ‘seawater’ (n = 23) and ‘sewage’ (n = 18). A total of 44 articles reported ‘summary statistics’ (qualitative or quantitative), while 48 articles performed ‘correlations, hypothesis testing or simple bioinformatics’. A smaller set of articles performed more advanced data analyses, such as ‘multivariate statistics or advanced bioinformatics’ (n = 23) or ‘QMRA, fate & transport modelling’ (n = 4).

Human sources: decentralized wastewater systems

The interpretation of MST results is greatly enhanced by cultivation-based FIO and land-use data or additional parameters that can help to explain the origin, fate and transport of a specific pollution source. For example, in watersheds with more than 1621 septic systems in Michigan, USA, higher concentrations of Bacteroides thetaiotaomicron (human-associated marker) were detected under baseflow conditions suggesting that control measures should include septic system maintenance and construction in the area (Verhougstraete et al. 2015). In this study, analyses were performed using a classification regression tree including riparian buffers, septic tanks, and physicochemical data. Beyond chronic pollution scenarios, rainfall events can impair water quality through combined sewer overflows, septic tank seepages, agricultural runoff or other events governed by precipitation. A similar study found that three human-associated Bacteroides markers correlated positively with septic tank density during wet weather, suggesting that septic tanks are a significant source (Peed et al. 2011). Since there was no correlation with FIO during baseflow conditions, the authors postulate that other sources might be implicated in chronic pollution.

Human sources: centralized wastewater systems

In some cases, genetic MST markers can be combined with other types of tracers to strengthen the interpretations and to overcome markers’ limitations, such as low specificity, differing decay rates or different transport. For example, the detection of the human-associated genetic marker HF183 and optical brighteners in private drinking water supplies in rural areas of Virginia, USA, revealed sewage as a potential pollution source. However, only a few samples showed E. coli together with the optical brighteners, suggesting a different fate and transport of these indicators within the aquifer (Smith et al. 2014). In Montreal, Canada, a study applied a multiparameter source tracking toolbox combining chemical source tracking markers for sewage (caffeine, theophylline, and carbamazepine) together with the human-associated genetic markers HF183 and mitochondrial DNA to detect illicit wastewater discharges into storm sewers during dry weather (Hachad et al. 2022). The authors used a composite index of the different markers together with the levels of E. coli to identify household cross connections or indirect illicit discharges and verified them successfully with dye tracing.

Hydrological and meteorological data are often indispensable to understand the fate of faecal microorganisms in the environment. For example, hydrological and meteorological data combined with the human-associated marker HMBif, cultivation-based MST parameters and FIO allowed modelling the self-depuration distance of a small Mediterranean river (Pascual-Benito et al. 2020). The obtained models provided information about the recuperation of the river’s initial conditions after receiving treated sewage discharge. MST tools are also useful after extreme meteorological events. For example, after Hurricane Harvey, the detection of the human-associated markers HF183 and BacHum and their correlation with FIO indicated a large input of sewage through sewage overflows and stormwater in two catchments in Texas, USA (Kapoor et al. 2018).

HTS applications have also been reported. After the pioneering work of Unno et al. (2010) in South Korea, the study by Newton et al. (2013) was one of the first large-scale studies that also demonstrated the complex challenges in data interpretation. The authors examined chronic human faecal pollution at an urban site in Lake Michigan, USA, and set out to identify its sources and delivery routes. By identifying the relative abundance of sewer infrastructure-associated, faecal and human faecal signatures in lake water samples, they identified combined sewer overflows as the dominant pollution source during heavy rainfall events, whereas nonhuman faecal sources exhibited the highest relative abundance during dry weather and noncombined sewer overflow producing rain events. More recently, Zimmer-Faust et al. (2021) tracked the plume of a wastewater treatment plant (WWTP) outfall in the coastal Pacific Ocean on the USA/Mexico border and showed that its behaviour differs depending on oceanic and meteorological conditions. They used a human-associated MST marker and 16S AmpSeq together with the algorithm SourceTracker, with pristine marine water, WWTP discharge and a nearby river as potential sources to derive the spatial extent and concentration gradient of human pollution.

Recreational waters

Coastal waters have important value for leisure, tourism, and coastal ecosystems including shellfish harvesting areas; therefore, MST tools have been extensively tested in these areas (Korajkic et al. 2009, González-Fernández et al. 2021). In Thailand, Kongprajug et al. (2021) used two genetic viral MST markers, crAssphage and HPyV, at various beaches during dry and wet seasons to verify human waste practices as the main faecal source. Their results reported temporal variability but not spatial variability, thus recommending a future monitoring strategy based on more frequent sampling at a unique sentinel site. Other studies include environmental data such as precipitation and solar radiation, oceanographic data like tides and currents, and use correlations or more complex models to be able to predict a potential pattern. For example, at different sites in San Francisco, USA, the human-associated marker HF183 was found to correlate mainly with 72 h precipitation but also water temperature, tides or insolation (Jennings et al. 2018). Cao et al. (2018) sought to develop a standardized data analysis approach that incorporates all qPCR measurements from a defined group of samples (i.e. nondetections, detections, and measurements in the range of quantification) to assess average human faecal pollution levels at recreational water sites. The authors proposed a metric, the human faecal score, that combines the results of the human-associated qPCR marker HF183/BacR287 with a defined sampling strategy (sampling intensity and number of replicates) and a Bayesian weighted average approach. The score can be used to prioritize sites for remediation and has more recently been used to compare source-associated impacts under wet and dry conditions (Shrestha et al. 2020) and identify trends with cultured FIO paired measurements (Li et al. 2021b). In addition to human sources, wild animals can also contribute to faecal indicator bacterial loads in coastal areas with large gull colonies. The presence of the gull-associated bacterium Catellicoccus marimammalium in 58% of the water samples and at all sampling sites as well as their correlation with faecal indicators suggested a chronic impact of gull faeces on the water quality in southern Ontario, USA (Lu et al. 2011). The same marker showed a decrease together with faecal indicators and bacterial pathogens after gull removal in Lake Michigan, USA (Converse et al. 2012).

Rural areas, domestic animals

Single source characterization is also relevant in rural areas with high agricultural pressure where tracking animals such as swine, ruminants, or poultry can be of interest (Weidhaas et al. 2011, Heaney et al. 2015, Wiesner-Friedman et al. 2021). These studies include, in addition to the relevant genetic faecal marker, data on land uses, land-applied manure, and/or animal feeding operations. For example, after testing for a ruminant-associated marker, BoBac, and including data on animal feeding operations, the authors found that applying manure in the fields implied an increase in faecal indicators in riverbed sediments (Wiesner-Friedman et al. 2021).

Application 3: MST of faecal pollution from multiple sources

Many impaired water bodies are polluted by more than one source. Thus, it is important to characterize key sources because the corresponding health risk as well as the mitigation steps may differ by source. Nevertheless, study design and choice of methods are highly dependent on the water resource type, the intended water use and other factors.

Of the 230 articles with a focus on multisource MST, MST was achieved predominantly using ‘MST markers’ (n = 180, 78% of articles) followed by classical library-based MST (n = 33, mostly published before 2015) or HTS (n = 21, mostly published after 2015). In multisource MST articles, FIOs are measured predominantly with cultivation-based methods (‘cultivation-based FIO’, n = 163 articles). In contrast, ‘traditional’ and ‘new genetic faecal markers’ played a minor role (n = 17 and n = 20, respectively). The most common parameter combination was ‘MST markers’ with ‘cultivation-based FIO’ (n = 132, 57% of articles). Other common parameter classes were ‘physicochemistry and nutrients’, ‘pathogens’, ‘meteorology’, and ‘land use’ (n = 50 to n = 25 articles). The proportion of articles with four or more parameter classes was higher than in single-source MST (31% in multisource MST and 28% in single-source MST, ‘Application 2’). This higher study design complexity was reflected in the data analysis approach: 35% of articles performed ‘multivariate statistics or advanced bioinformatics analyses’ (18% in single-source MST, ‘Application 2’).

Elevated pollution levels on a watershed scale

The starting point in watershed studies is usually elevated levels of cultivation-based FIOs in rivers, lakes, or coastal waters. Often, the spatial scale is relatively large and there are multiple potential sources ranging from human faeces (via leaky infrastructure, treated, or untreated wastewater or combined sewer overflows) to livestock (grazing or stabled), pets as well as avian and mammalian wildlife. Often, there is limited knowledge on hydrology, meteorology, and land use. An illustrative example is given by three studies conducted over a span of 16 years in the Tillamook Bay catchment in Oregon, USA demonstrating how state-of-the-art genetic MST applications have evolved over time. Bernhard et al. (2003) and Shanks et al. (2006) compared PCR-based ruminant and human marker frequencies with faecal pollution levels considering rainfall patterns and seasonal pollution dynamics to identify pollution sources. Much more recently, Li et al. (2019) used quality-controlled and, in several cases, standardized qPCR assays for five faecal sources, and high-resolution GIS for land-use and meteorological data to not only identify but also quantify and locate pollution sources and patterns to guide remediation efforts and risk assessment. In a similar approach, Bushon et al. (2017) ranked tributaries to the Little Blue River catchment in Missouri, USA, based on estimated contributions to water quality impairment. The studies by Nguyen et al. (2018) and Yamahara et al. (2020) demonstrate how hypothesis-formulation can support study design for GFPD. Both studies also try to shed light on the potentially confounding role that soil and sediments might have on MST applications, especially in tropical waters. To elucidate the relative roles of human and other animal sources polluting the Danube River and its tributaries, Kirschner et al. (2017) used a combination of longitudinal survey along more than 2500 km of river and a temporal survey over the course of a year at three sites successfully identifying human waste as the dominant source. Bambic et al. (2015) encountered difficulties segregating pollution sources due to the confounding influence of disinfected municipal wastewater. Separating wet from dry weather based on meteorological data allowed data interpretation, with municipal wastewater (human) being the dominant dry-weather pollution source, while during wet weather, agricultural runoff, and stormwater (ruminant and dog) dominate. Using bacterial and viral markers allowed the authors to demonstrate the difficulty to detect the presence of viral pathogens when only using bacterial indicators. The authors used cutting edge data handling methods, including statistical methods to account for the large proportion of nondetects, and an estimation of spatial and temporal variations of same-host contribution using ratios between given Bacteroidales MST markers and a general Bacteroidales marker (Bambic et al. 2015). Separating the sample set into dry and wet periods allowed Liang et al. (2021) to reveal differing pollution pathways. The results of MST markers agreed with those from 16S AmpSeq and the FEAST algorithm: humans were the main pollution source in the dry season, and ruminant and swine were the main pollution sources in the wet season at this river site near Beijing, China. MST methods have also been used to more generally identify factors and features that promote or reduce watershed faecal pollution rather than just identifying pollution sources. As an example, Green et al. (2021) used MST and cultivation-based FIO in an investigation of 68 streams in New York State, USA, to identify stream features, land use practices and meteorological patterns that drive faecal pollution levels from multiple sources.

Recreational waters

In contrast to general watershed pollution scenarios, bathing water studies are usually triggered by persistently elevated FIO levels at public beaches directly threatening the health of visitors, necessitating beach closures and inflicting considerable economic damage. Study areas are often smaller, and the potential sources are less diverse (e.g. sewage discharges, birds, and pets) (Staley et al. 2018). Prudently, studies often make efforts to consider the influence of hydrology (flows, tides, and so on) and the effect of precipitation and solar radiation on water quality changes and to resolve faecal source contributions (Williams et al. 2022). In a proof of concept study in Xiamen, China, An et al. (2020) used high-throughput qPCR for a large number of assays targeting multiple faecal sources and pathogens to investigate bathing waters.

Drinking water

Impairment of drinking water quality is one of the most pressing issues worldwide. The specific challenge in this application field is that low levels of pollution already pose relevant health risks. For example, elevated FIO levels observed in karst and fractured aquifers after precipitation were the starting point for several MST studies. The problem of highly variable pollution dynamics in the course of very short time periods can be approached by linking sampling to hydrological dynamics (Reischer et al. 2008) and nested sampling with higher sampling frequencies during periods of hydrological fluctuations and during/after rainfall events (Reischer et al. 2011). The very short residence times of faecal pollution in the studied springs also allowed direct source apportionment based on MST marker concentrations in spring water because differential persistence can be disregarded when measuring very recent pollution. To determine the source and risk factors for nitrate and microbial pollution in private dolomite karst wells, Borchardt et al. (2021) used multivariate regression models with potential drivers such as land use, precipitation, hydrogeology, and well construction.

Aquaculture and irrigation water

Shellfish harvesting areas in coastal waters and aquaculture in general are also under a large amount of anthropogenic pressure often resulting in the contamination of products with FIOs and pathogens. The applicability of MST approaches to identify and prioritize pollution sources has been demonstrated in shellfish harvesting waters and products such as oysters (Mieszkin et al. 2013). Klase et al. (2019) integrated MST markers, ARG assays, and pathogen detection with bacterial community-based analysis to broadly investigate the potential public health risks associated with pollution of fishponds. Similarly, faecal pollution levels, ARG and pathogen occurrence were investigated in irrigation waters used for fresh produce to determine sources of pollution and risk factors (Weller et al. 2020).

Application 4: evaluation of treatment processes

Pathogen removal is one of the primary functions of wastewater and drinking water treatment. However, relying on direct pathogen determination only is not practicable due to the low and varying concentrations in raw water as well as the high number of different pathogens potentially occurring. Thus, treatment performance assessment often relies on treatment indicators used as representative surrogates for pathogen removal (see the section ‘Glossary’; Momba et al. 2019). While cultivation-based microbial parameters are the most commonly employed treatment indicators (Jofre et al. 2016, Momba et al. 2019), the systematic literature review revealed 44 articles that used genetic markers as treatment indicators. In this article pool, ‘MST markers’ and ‘traditional general faecal markers’ were the most often measured genetic parameter classes (23 and 18 articles), whereas ‘cultivation-based FIO’ and ‘pathogens’ were the most common other parameter classes (22 and 15 articles). For the treatment type, 36 articles dealt with engineered treatment processes, with the majority, 24 studies, focusing on wastewater treatment and water reuse. The various steps of drinking water treatment, as well as stormwater and greywater treatment, were the topics of the other 12 articles. The attenuation of microorganisms during groundwater transport was the focus of eight studies. In total, five of these involved natural tracers, and three involved injected tracers. Riverbank filtration, managed aquifer recharge and the drinking water treatment step of slow sand filtration were found to be the main processes studied. Investigations of microorganism attenuation express changes in treatment indicator concentration during a treatment step as percentage reduction or as log10 reduction values (LRV, the difference in log10-transformed concentrations before and after the treatment step; Momba et al. 2019).

In summary, the identified studies using GFPD, as representatively shown below, predominantly focus on nucleic acid target concentration changes, as an indication for the decrease of cell and virus concentrations during biological wastewater treatment or aquifer transport. Importantly, investigation of water treatment processes often also determine disinfection efficacies by characterizing the microbicidal and virucidal effects on FIO and pathogens (section ‘Generating viability – and infectious status information by molecular tools’ in the section ‘Discussion’). Viability PCR and enzymatic treatment PCR (ET-qPCR) are molecular techniques used to assess the viability and infectious status of microorganisms. In our systematic search, three articles were identified that employed these methods.

Evaluating microorganism removal during engineered treatment processes

Detection of nucleic acids

For the characterization of the removal of pathogens, such as viruses, through wastewater treatment, viral qPCR MST markers have been increasingly used and offer some advantages over traditional indicator viruses such as phages. The most important aspect of qPCR MST markers is that their concentrations in untreated wastewater are expected to be far greater than those of most viral pathogens (Hughes et al. 2017, Kitajima et al. 2018). This is particularly important because an indicator whose concentration is high can be detected consistently and more easily in different stages of treatment processes. The concentrations of coliphages in wastewater were found to be 7-log10 PFU/l, while the concentrations of enteric viruses such as human adenovirus and human polyomaviruses were variable and reported to be on the scale of 6 to 9-log10 copies/l (reviewed in Ahmed et al. 2020). Several studies have reported high numbers of PMMoV, crAssphage, Bacteroides (HF183) and Lachnospiraceae (Lachno3) and other qPCR MST markers in untreated wastewater (Rosario et al. 2009, Hughes et al. 2017, Ahmed et al. 2018, 2019). Furthermore, qPCR MST markers show little variation in untreated wastewater, and the concentrations range between 8 and 10 log10 copies/l (Hughes et al. 2017, Ahmed et al. 2019).

Several studies determined the log reduction values of human MST qPCR markers such as crAssphage and PMMoV in full-scale WWTPs (reviewed in Ahmed et al. 2020, Sabar et al. 2022). For example, Hamza et al. (2011) reported an ~3-log10 reduction in PMMoV concentrations in a conventional activated sludge treatment plant in Germany, which was similar to the reduction in polyomavirus and torque teno virus. Hughes et al. (2017) reported an ~1.1-log10 reduction in PMMoV concentrations in an activated sludge WWTP, which was less than those of HAdV and HPyV but similar to those of norovirus and enterovirus. Similar log reduction value of PMMoV was reported by Kuroda et al. (2015) in a WWTP in Vietnam. Schmitz et al. (2016) reported < 1-log10 reduction of PMMoV during activated sludge and biological trickling filter and the reduction rate was similar to aichivirus, norovirus, sapovirus, adenovirus, and polyomavirus. Based on the log reduction values reported in the literature PMMoV appears to be a conservative viral indicator for the reduction of pathogenic viruses in WWTPs. Several studies reported the reduction of crAssphage ‘the most abundant [known] virus’ in the human gut in WWTPs with activated sludge. Tandukar et al. (2020) reported a log reduction of 3.3 log10, while (Farkas et al. 2018) reported 1.0–2.0 log10 reduction.

Asami et al. (2016) determined the log10 reduction of PMMoV and JC polyomavirus for coagulation-sedimentation and rapid sand filtration processes in a drinking water treatment plant (DWTP) in Bangkok, Thailand using qPCR. The observed removal efficiencies varied depending on treatment step, season, and raw water quality, with LRVs ranging between 0.4 and 1.6 for PMMoV and between 0.5 and 1.9 for JC polyomavirus.

Molecular strategies to indicate the viability and the infectious status

The original idea of applying viability qPCR to bacterial MST markers was to gain information on recent faecal pollution events in water resources (Bae and Wuertz 2012, 2015). More recently, Jager et al. (2018) used qPCR with and without propidium monoazide (PMA) pretreatment as well as cultivation-based methods for E. coli, enterococci and P. aeruginosa to evaluate the removal efficiency of wastewater ozonation, a tertiary treatment step. PMA is an intercalating DNA dye that penetrates cells with impaired membranes and prevents PCR-based amplification (Nocker et al. 2006). It thus allows for selective detection of viable cells. PMA-qPCR is, therefore, also known as viability qPCR. FIO removal rate estimates were ranked in the following order: cultivation-based > viability qPCR > qPCR (Jager et al. 2018), emphasizing the differences among the culturable population, the viable but not culturable population and the total bacterial DNA. Viability qPCR, in comparison with qPCR, has also been applied to FIO (E. coli) and bacterial and viral MST markers (crAssphage, JC, and BK polyomavirus, human adenovirus, human-associated Bacteroides HF183) in sewage sludge flocs to assess their removal and inactivation during potassium ferrate treatment (Wang et al. 2023). Spatial distribution and movement resulting from the potassium ferrate treatment of the FIO and MST markers could be analysed in different compartments of the sludge flocs, encompassing various extracellular polymeric substance fractions. The reduction of the MST marker determined by qPCR was up to two orders of magnitude lower than the reduction determined by viability qPCR (Wang et al. 2023).

Similarly, enzymatic treatment qPCR (ET-qPCR), which applies enzymatic treatment using proteinase K and RNase, was used to estimate infectivity of bacteriophage MS2 in water (Pecson et al. 2009). By utilizing multiple-PCR-amplicons (providing whole genome coverage) and partial inactivation using different virucidal agents (such as heat, UV-B light, and singlet oxygen), the authors demonstrated that genome damage does not fully explain viral inactivation. Therefore, PCR-based assays would never yield results equivalent to infectivity assays. These assays fail to completely account for specific false positives that may arise when testing for MS2 bacteriophages. Consequently, to effectively monitor MS2 infectivity using ET-qPCR, it becomes crucial to determine a statistical ratio of total inactivation by cell culture in advance. Therefore, this calculation should be established beforehand for the applied treatment conditions and the given virus, but culture methods are not available for all human pathogenic viruses (Pecson et al. 2009). A follow-up study investigating UV-C treatment and relying on qPCR without pretreatment demonstrated that viral inactivation may be estimated in conjunction with mathematical models for JC polyomavirus and HAdV (Calgua et al. 2014). For more discussion on this topic, please refer to the section 'Direct detection of nucleic acids: characteristics and challenges' in the 'Discussion'.

Evaluating microorganism attenuation in groundwater

Pathogen removal during subsurface passage may be studied by investigating infiltrated faecal pollution (e.g. managed aquifer recharge), and monitoring the removal of pathogenic or indicator microorganisms. One way to investigate pathogen removal is to analyse water samples for naturally present microorganisms along a transect. Another way is with tracer tests using an injected target microorganism or surrogates. This can be done either as a laboratory experiment, using columns packed with aquifer material, or in the field.

The vast majority of such transport studies quantify microbial targets with microscopy or cultivation-based methods. Using genetic tools to quantify surrogate or pathogenic organisms (i.e. bacteriophages and enteric viruses) for groundwater transport studies is a relatively novel application of this technology, and therefore, limited literature exists. These approaches allow innovative analyses such as the quantification of multiple microorganism cotransport using multiplex qPCR and differentiating between infectious and inactivated viruses, when qPCR is used together with culture techniques (Betancourt et al. 2014, Bellou et al. 2015, Wang et al. 2022). In addition, genetic methods are a reliable way to enumerate microorganisms attached to particles, such as sediment and microplastics (Hassard et al. 2016). Genetic tools can also be used to confirm possible false-negatives derived from microscopy or cultivation-based methods. This is especially useful, as field tests are often expensive and labour intensive, and practical (small) sampling volumes often yield negative results. Low concentrations of target organisms require sampling larger volumes, which often presents additional challenges (Haramoto et al. 2018, Forés et al. 2022).

Natural tracers

Managed aquifer recharge involves natural subsurface processes to treat intentionally infiltrated surface water or wastewater effluent. In a study of the treatment efficiencies of three such systems in the USA, (Betancourt et al. 2014) measured viral pathogens and PMMoV, a human-associated viral marker, by qPCR in the infiltrated water and in a series of wells, providing the log reduction rates over given distances. Near the highly polluted Rocha River in Bolivia, surface water and riverbank filtrate are often used for irrigation, another example of indirect wastewater reuse (Verbyla et al. 2016). The removal (log reduction) during riverbank filtration was assessed for this study using reference pathogens recommended for wastewater reuse, PMMoV, as well as a human-associated bacterial indicator, and a QMRA of the consumption of the irrigated lettuce was performed.

Injected tracers

If the aim is to study the transport of pathogenic microorganisms in field tests, a surrogate is often used as a tracer, that mimics the pathogen in size and surface characteristics, while die-off rates are determined separately using batch tests. The transport of the surrogate can be compared to the pathogenic microorganism in small column tests in the laboratory, using aquifer material, while the surrogate is injected or applied at a field site. In this way, it is possible to upscale the transport of dangerous substances using transport models. With this goal in mind, Stevenson et al. (2015) used qPCR to quantify the transport and removal of HAdV and its surrogate, PRD1 phages, in small column tests. In regards to water treatment, the removal of Cryptosporidium parvum and its surrogate Clostridium perfringens by slow sand filtration was evaluated by Hijnen et al. (2007) as the last step in drinking water treatment using water taken from the Rhine River and spiked with the microorganisms. C. perfringens was enumerated using cultivation, and the colonies identified with PCR. Bauer et al. (2011) used qPCR to analyse enteric adenoviruses to evaluate the efficiency of slow sand filtration and river bank filtration as drinking water treatment steps. Wang et al. (2022) investigated the transport of MS2 phages, a surrogate for enteric viruses, from a surface water pond to groundwater via riverbank filtration. The authors differentiated between infectious phages by plaque assay versus the total number of phages detected by qPCR.

Synthetic tracers

A unique application of genetic tools is using synthetic DNA as a tracer which can be employed as multipoint tracers thanks to the practically unlimited sequence options and their specific quantification using qPCR (Dahlke et al. 2015, Pang et al. 2022). Another innovative idea is the use of DNA-labelled microspheres as surrogates for pathogenic microorganisms (Pang et al. 2014). This enables the enumeration of the pathogen and its surrogate by the same analytical procedure, qPCR, allowing more direct comparability.

Application 5: infection and health risk assessment

GFPD are increasingly applied to support infection- and health risk estimation regarding human usage of water and water resources. The range of applications is very broad and includes guidance in hazard identification (e.g. reference pathogen selection), calibration of fate, transport, and QMRA models targeted to specific sources, and the genetic detection of risk indicators and markers, as alternative to cultivation-based enumeration techniques.

The study design analysis found 26 articles that estimated health risk by the support of GFPD: seven epidemiology studies at ‘recreational’ water sites and 19 QMRA studies, most of which were conducted in ‘recreational’ waters, five focussed on ‘drinking’ water and one on ‘irrigation’ water. The epidemiology studies compare ‘traditional general faecal markers’ with illness rate, while the QMRA studies apply ‘MST markers’ to QMRA, using one of the above approaches. The most prominent parameter classes are ‘traditional general faecal markers’ and ‘MST markers’ (n = 7 and n = 21, respectively), measured by qPCR. The relevance of obtaining information on the viability- or infectious status for infection and health risk assessment is addressed in the ‘Discussion’ section (‘Direct detection of nucleic acids: characteristics and challenges’).

Guidance in hazard identification for QMRA

Host-associated faecal marker quantification in water resources can guide reference pathogen selection for QMRA. This concept has been included in the framework of integrated faecal pollution analysis and management (‘3-step approach’) of karstic drinking water resources (Farnleitner et al. 2018, Savio et al. 2018). The three steps involve (1) catchment pollution source profiling, (2) monitoring of general faecal pollution, and finally, (3) hypothesis-guided qPCR MST marker enumeration in spring water. At a large, complex and hardly accessible alpine karstic spring water catchment with importance for public water supply in Austria, the results pointed at zoonotic pathogens from ruminants, including cattle as the priority QMRA reference targets (Reischer et al. 2011, Savio et al. 2018). The approach introduced by Farnleitner et al. (2018) was later extended to urban river catchments using probabilistic modelling to simulate the occurrence and extent of faecal pollution sources in parallel with zoonotic pathogens from direct human as well as indirect livestock and wildlife faecal pollution sources (Derx et al. 2023). The probabilistic estimates from the catchments and the direct measurements in the river indicated that combined sewer overflows and communal WWTPs were the largest contributors to faecal pollution at the studied site. The developed approach was indicated to be a robust basis for microbial fate and transport modelling and for QMRA (Derx et al. 2023).

MST qPCR marker analysis was also used to associate cases of human illness predicted by QMRA with bovine, human, or unknown sources in contaminated private wells in Wisconsin, USA. Although some of the cases of illness were indicated to be of human pollution origin, the results suggested that most of the cases were caused by bovine faecal pollution. This outcome had important implications for land use and water safety and health risk management of the fractured aquifers (Burch et al. 2021). In a study in the Netherlands, MST qPCR marker analysis was applied to trace back the origin of infection risks from Campylobacter sp. at a stormwater collection site (water plaza). The presence of human MST markers indicated a cross-connection with the combined sewer system (Sales-Ortells and Medema 2015).

Importantly, the performance characteristics (e.g. faecal sensitivity and specificity) of MST markers as well as their application design have to match the infection and health risk characteristics of the human and zoonotic pathogens considered (e.g. specific infectivity, specific health burden) to avoid masking of faecal hazards and their associated risk levels (Table 2).

Table 2.

Overview of essential biological–diagnostic attributes of faecal indicators and associated genetic targets. The overview shows the ‘big five’: sensitivity, specificity, persistence, resistance, and mobility. Most of the shown attributes can be divided into subcharacteristics. Various methods and specification metrics have been suggested for determination.

Characteristic Basic definition Remarks Methods/metrics
Faecal sensitivity Occurrence of faecal indicator or genetic target in faecal pollution source(s) to be indicated. Should be ubiquitous and abundant in targeted pollution source(s).
Indicator/genetic target of total faecal pollution in human and vertebrate animal faecal pollution sources (primary enteric habitats). MST indicator/genetic marker only in faecal pollution-source-groups to be indicated (human, ruminant, pig, and so on).
Incidence or (binary) faecal sensitivity (% presence in targeted source; e.g. Ahmed et al. 2009, Farnleitner et al. 2010, Shanks et al. 2010a, b);
Abundance in target excreta or wastewater (conc. per volume/mass; Farnleitner et al. 2010, Ervin et al. 2013, Mayer et al. 2018)
Faecal specificity Nonoccurrence of faecal indicator or genetic target in the pristine environment or nontargeted compartment Indicators/targets of total faecal pollution should be absent in pristine environments not polluted with faeces.
MST indicator/marker also absent in nontargeted faecal pollution sources (e.g. human faecal marker not in ruminant excreta).
False-positive occurrence in pristine habitats (concentration per volume or mass; Vierheilig et al. 2012)
(Binary) faecal specificity
(% presence in nontargeted source groups; e.g. Shanks et al. 2010a,b, Linke et al. 2021)
25th/75th percentile discrimination metric (25th percentile of target minus 75th percentile of nontarget, log-transformed concentrations (Reischer et al. 2013)
Persistence Extent of survival (i.e. viability) of indicator or molecular detectability (i.e. nondegraded amenable nucleic acids) of genetic target in the (aquatic) environment. Persistence varies widely among microorganisms and genetic targets and is influenced by many potential abiotic and biotic ecological factors, such as sunlight, temperature, salinity, grazing, and so on. T90, T99 [time in days needed for a 1 log10 (T90) or 2 log10 (T99) reduction in indicator or genetic target concentration (Mitchell and Akram 2017)]
Decay rate coefficient k of, e.g. a first-order decay model (Chick 1908, Balleste and Blanch 2010)
Resistance Extent of survival (i.e. viability, proliferation, and infectivity) or molecular detectability (i.e. nondegraded amenable nucleic acids) of indicator or genetic target, respectively, towards chemical substances (e.g. metals, antibiotics) and during technical treatment and disinfection processes. Resistance varies widely among microorganisms and genetic targets and is influenced by many chemical and physical factors, such as type of chemicals (chlorine, ozone,), concentration and contact time (ct-value), temperature and time in thermal processes, and fluence in UV irradiation. Inactivation rate and kinetics obtained under carefully controlled conditions (Hoff and Akin 1986)
Inactivation rate constants
Log reduction of the concentration of microorganisms/pathogens;
a measure for the effect of a substance or for the efficacy of the process (Guerrero-Latorre et al. 2016). The log reduction to be achieved for a target is determined by risk assessment.
Mobility Transport characteristics of the indicator or genetic target in the (aquatic) environment Mobility is influenced by many factors, such as mass and size of the microorganism/phage, its attachment and aggregation behaviour (electrostatic and hydrophobic forces), its detachment behaviour, as well as the motility of certain microorganisms. Mobility characteristics may change as the microorganism decays. Sedimentation onto the river bed applies to larger-sized microorganisms (protozoa) or microorganisms attached to sediment (Jiang et al. 2015, Wu et al. 2019)
Resuspension of microorganisms attached to the riverbed sediments (Jamieson et al. 2005, Kim et al. 2010, Park et al. 2017)
Straining due to microorganism size and aquifer material grain size distribution e.g. (Bradford et al. 2003, Tufenkji et al. 2004)
Attachment/detachment (Schijven and Hassanizadeh 2000)
Motility (Becker et al. 2004)

Calibration of catchment models to estimate pathogen concentrations for QMRA

Genetic faecal marker quantification also proved valuable for catchment-based QMRA modelling of faecal pollution sources. One of the principles of the ‘QMRAcatch’ philosophy is the catchment-specific calibration of microbial transport (i.e. dilution, advection, and dispersion) and fate (i.e. decay/persistence) models for specific faecal pollution sources by the use of MST markers. The calibrated and verified models can be used to derive pollution and management scenarios for given points of interest (e.g. drinking water abstraction sites) based on pathogen transport/fate simulations. Reference pathogens are quantified in pollution sources or derived from epidemiological data and the literature (Schijven et al. 2015, Derx et al. 2016).

In a scenario analysis considering river water as a raw water source for drinking water production, the authors calibrated QMRAcatch for human faecal pollution pathways, such as from communal wastewater disposal, using human-associated MST qPCR marker data for the Austrian section of the Danube River (Demeter et al. 2021). By use of a conceptual semidistributed hydrological model and regional climate model outputs, the authors simulated the interplay of future changes (e.g. climate change, population) and wastewater management measures (enhanced WWTP treatment, prevention of combined sewer overflows) with respect to the infection risks for viral and bacterial reference pathogens (Demeter et al. 2021). The study demonstrated that the degree to which future changes affect drinking water safety strongly depends on the type and magnitude of faecal pollution sources, and is thus highly site- and scenario-specific.

More recently, the modelling approach was extended towards source-specific calibration to multiple faecal pollution sources, using MST markers for humans, ruminants, pigs, and birds. An improved hydrological module (2D hydrodynamic flow, rainfall-runoff, and differential MST decay) allowed comparing external (allochthonous) and internal (autochthonous) faecal pollution sources and their associated infection risks from zoonotic parasites (Giardia, Cryptosporidium) for the Danube River (human wastewater input) and its floodplains (animal sources) downstream of Vienna (Derx et al. 2021). An important result for best management practices is that autochthonous and allochthonous faecal sources during flood and rainfall events contributed pathogen loads with similar orders of magnitude.

Infection and health risk indicator role trough epidemiological studies

The traditional method of recreational water quality monitoring of surface waters has been based on the application of cultivation-based FIOs. For example, the relative risk of illness for swimmers and nonswimmers in recreational waters was estimated based on cultivation-based enterococci levels (USEPA 1986). However, a revision of these guidelines in 2012 (‘NEEAR study’) reported that qPCR measurements of general enterococci concentrations are better predictors of the rate of gastrointestinal illness among swimmers in recreational waters compared to cultivation-based enterococci levels (USEPA 2012a). This study established a combined approach, using cultivation-based E. coli enumeration (beach action value of 235 CFU per 100 ml of water) and genetic enterococci qPCR quantification (beach action value of 1000 calibrator cell equivalents per 100 ml) with a health-based compliance target of 36 cases of gastrointestinal illnesses per 1000 swimmers (USEPA 2012b).

MST marker quantification by qPCR has also been incorporated in epidemiological studies. For example, Griffith et al. (2016) applied several bacterial and viral indicators to predict gastrointestinal illness in three Californian beaches (n = 10 785 swimmers) by comparing qPCR and cultivation-based methods. At one beach, human-associated genetic MST marker levels displayed the highest associations with gastrointestinal illness. The authors concluded that performance of a selected parameter is likely site-specific. Napier et al. (2017) conducted a prospective cohort study also using human-associated genetic MST markers in water (self-reported gastrointestinal illness among 12 060 swimmers at six beaches across USA). Inconsistent associations were noted between results; however, the authors concluded that qPCR MST marker data may be useful in assessing human health risks in recreational water bodies.

Infection and health risk indicator role through indicator to pathogen ratio and QMRA

An increasing number of studies have attempted to establish a link between genetic MST marker concentrations and infection risks in recreational waters using a QMRA modelling framework. One of the first studies of this type was conducted to estimate the risk of gastrointestinal illness for adults swimming in waters contaminated with untreated sewage (Staley et al. 2012). In this study, norovirus was selected as the reference pathogen. The HF183 marker was detected in sewage dilutions indicating gastrointestinal illness risks greater than or equal to the benchmark value of 10/1000 primary contact recreators in several sampling sites based on the 1986 Ambient Water Quality Criteria (USEPA, 1986). Boehm et al. (2015) established a relationship between concentrations of the human-associated qPCR markers HF183 and HumM2 and gastrointestinal illness risk of swimmers in recreational waters using a QMRA approach. The authors noted that the benchmark gastrointestinal illness rate of 30/1000 primary contact recreators occurred when the median concentrations of HF183 and HumM2 marker genes were 4200 and 2800 GC/100 ml of water, respectively. In a subsequent study, Boehm et al. (2018) incorporated the decay of both human faecal-associated markers and norovirus in the model to determine the risk associated with scenarios in which the age of contamination is unknown or water is contaminated by fresh untreated sewage. When an untreated sewage contamination scenario was considered, the risk-based threshold was ∼9700 GC/100 ml. The analysis suggested that a risk-based threshold of 4100 GC/100 ml is warranted for the HF183 marker gene when the age of contamination is unknown. Schoen et al. (2020) modelled risk-based thresholds across different mixture and sewage-age scenarios for crAssphage, HF183 and polyomavirus using QMRA. The authors concluded that genetic markers may not be effective when aged sewage contributes most pathogens relative to fresh contamination. Similar risk-based MST marker thresholds have also been estimated for gull Catellicoccus, human Bacteroides, and human Lachnospiraceae markers (Brown et al. 2017, Boehm et al. 2018, McLellan et al. 2018).

Such information can be extremely valuable to regulators in interpreting quantitative MST marker data concerning potential human health risk and developing plans for faecal pollution mitigation and to assess human health risks more accurately (Zhang et al. 2019).

Application 6: outbreak tracing and wastewater surveillance

The GFPD toolbox has also proved useful in fields that traditionally focus on the detection and characterization of pathogens, such as waterborne disease outbreaks or pathogen transmission route characterization. Twenty outbreak and pathogen transmission tracing articles were retrieved, predominantly employing ‘MST markers’ with paired measurements of ‘pathogens’ and ‘cultivation-based FIO’. Additionally, five of the retrieved articles applied MST markers in wastewater surveillance for SARS-CoV-2. Given the importance of this topic, additional literature searches were performed and revealed three different roles in which MST markers may be implemented for wastewater surveillance.

Outbreak tracing, disease transmission routes, and sanitation trials

Waterborne disease outbreaks occur worldwide and may be caused by several factors, e.g. in the case of drinking water, these may include raw water contamination, treatment deficiencies, and drinking water distribution network failures. Tracing an outbreak is done predominantly by tracking the pathogen strain from patients through the transmission routes back to the exposure source by genetic typing and sequencing (molecular epidemiology, e.g. Popa et al. 2021). Alternatively, host-associated genetic faecal indicators can help identify the source for contamination and support the elucidation of disease or pathogen transmission routes. While they provide less specific outbreak-related information compared to pathogen typing, these markers are much more abundant than the pathogen in question, making them easier to detect in the environment. For example, host-associated markers were used in outbreak studies in Finland with ~450 illness cases to identify the source of pollution and to ensure the success of contaminant removal from the drinking water distribution system (Kauppinen et al. 2019). A novel approach used the human-associated genetic marker HF183 in a norovirus outbreak involving 179 cases in Pennsylvania, USA. It was applied as a microbial tracer to demonstrate the hydrogeological connection between a malfunctioning septic system, drinking water well, and recreational water area and, therefore, helped inform outbreak prevention strategies in the area (Mattioli et al. 2021). The coastal Biobío Region of Chile had been affected by repeated hepatitis A outbreaks. Human mitochondrial DNA, faecal coliforms, and live microbial biomass correlation was investigated and the concordance between human faecal pollution in the coastal waters and a seasonal hepatitis A outbreak strongly suggests that the investigated parameters can be used as a proxy to evaluate the risk of outbreaks of thalassogenic diseases (González-Saldía et al. 2019). During a large Campylobacter outbreak in Norway with over 2000 cases and 76 hospitalizations, an old cave used as a drinking water pool was identified to be faecally contaminated as indicated by the presence of E. coli. Host-associated genetic markers for humans, ruminants, horses, pigs, and other animals were applied to generate a faecal source distribution profile. This revealed that the faecal contamination was likely zoogenic in origin (horses) (Paruch et al. 2020).

In settings with poor sanitation facilities and practices, pathogen transmission routes can be multiple, therefore, planning WASH interventions to reduce pathogen exposure is challenging. A study in an urban slum in Nairobi, Kenya, set out to separate two types of human faecal waste, originating from children and from adults, because mitigation steps to reduce contamination could differ (Bauza et al. 2019). Using 16S AmpSeq analysis of faeces from both cohorts and various surfaces and waters, as well as the algorithm SourceTracker, the authors identified child faeces as the dominant pollution source inside households, whereas faecal pollution from adults was more prevalent outside households.

GFPD tools can also be used to evaluate WASH interventions. A controlled, before-and-after trial was performed in neighbourhoods of Maputo, Mozambique to estimate the potential health impacts of a sanitation intervention (installation of improved pit latrines). The authors first assessed the transmission routes through a comprehensive sanitary, environmental, and socioeconomic survey, including the measurement of a set of general and host-associated faecal indicators. They found widespread faecal contamination in soil, water, and food preparation surfaces, including from human sources. However, faecal contamination levels were largely disconnected from these analysed factors (Holcomb et al. 2020). In the before-and-after trial, the authors used a Bayesian hierarchical modelling approach to account for MST marker performance. Bootstrap estimates found no effect of the sanitation intervention on the prevalence of general and human-associated indicators, which highlights the complexity of the system and the need for multisectorial, ‘transformative’ WASH interventions (Holcomb et al. 2021).

Wastewater surveillance

Wastewater surveillance, also called wastewater-based epidemiology, seeks to relate the occurrence of a public health target of interest measured in wastewater to the public health of a respective population (e.g. Choi et al. 2018, Lorenzo and Picó 2019). COVID-19 gave a strong boost to the field, where SARS-CoV-2 RNA occurrence in wastewater is used as a proxy for the prevalence and dynamics of the infection in the population (Ahmed et al. 2022). In contrast to HRWM, which focuses on the users of the water (e.g. drinking water, recreation, and irrigation), wastewater surveillance is an ‘upstream approach’, looking back at the population’s health. Samples for wastewater surveillance are taken from raw wastewater collected by centralized sewer systems. Surface waters heavily contaminated by sewage may also exhibit an epidemiological indicator function in terms of wastewater surveillance (e.g. (Kolarević et al. 2022, Maidana-Kulesza et al. 2022).

Successful wastewater surveillance applications require the accurate measurement of public health targets in wastewater. However, this can be challenging because the proportion of human waste in a wastewater sample can be highly variable in time and space (i.e. between/within sampling site variability). In addition, the sample matrix may be challenging from an analytical point of view. In response, many scientists have suggested to use faecal markers (e.g. PMMoV, crAssphage, HF183) to support sample characterization and provide quality control in wastewater surveillance.

One application category is the characterization of surveillance samples, which mainly aims to quantify the human faecal levels in (waste)water but could also be used to characterize other animal sources. One study examining the epidemiological indicator function of SARS-CoV-2 in surface waters for countries with poor wastewater treatment, e.g. applied an advanced sampling site characterization approach including measurement of human- (BacHum), ruminant- (BacR), and pig- (Pig-2-Bac) associated genetic faecal markers. By using this approach, they could trace and identify sites with significant raw sewage influence from human populations, which may serve as sampling locations for wastewater surveillance where no obvious sewage outlets exist (Kolarević et al. 2022).

In addition, MST methods have also been used as internal process controls within wastewater surveillance investigations, either as a proxy for the public health target of interest to ensure adequate recovery and/or as performance metrics of sampling/sample processing protocols. In a monitoring study of SARS-CoV-2 in the wastewater and rivers of Tapachula (southern Mexico), e.g. PMMoV was not only used as a faecal pollution marker but also as an analytical control to confirm RNA extraction and amplification (Zarza et al. 2022). In another study investigating the intraday variability in 1-h and 24-h composite wastewater samples, the concentrations of the human viral indicators crAssphage and PMMoV were monitored in addition to the less prevalent human pathogen adenovirus (HAdV) to inform the design of appropriate wastewater sampling strategies for wastewater surveillance (Ahmed et al. 2021).

The most widely observed use of faecal markers for wastewater surveillance was the normalization of pathogen occurrence data. In this context, different MST markers were used either to describe spatial and temporal trends of the public health target of interest or to support the prediction of community infection trends. For example, Wolfe et al. (2021) describe how normalizing SARS-CoV-2 concentrations from multiple WWTPs with PMMoV can be used to compare the incidence of laboratory-confirmed new COVID-19 cases by accounting for variability in recovery and differences in human faecal loads within or between WWTPs. Another study investigated the suitability and performance of various normalization parameters and how well they correlated with local clinical cases. Normalization by crAssphage and PMMoV (amongst others) was found to show varying performance for different sampling sites (Mitranescu et al. 2022). Similar findings were described for PMMoV in a study by Nagarkar et al. (2022) suggesting that the most suitable faecal marker for normalization may vary by site and wastewater management practices.

Wastewater surveillance represents an exciting new application for GFPD. However, additional research is warranted, especially in areas highly relevant for wastewater surveillance, such as the behaviour of MST targets in sewer systems, distribution between hosts, or protocol performance assessments with wastewater sample processing methods. Although genetic faecal markers have already proven to be valuable, it remains unclear which of the many available methods are most suitable. Optimal method selection will likely vary by use scenario, surveillance target, and geographic location. In addition, applications will likely not be restricted to MST markers, but will use the entire methodological capacity of GFPD.

Application 7: other applications

Assessing water resources for the possible presence of faecal pathogens is the foundation of GFPD. However, these tools have also proven useful in other arenas. For example, 48 out of the 107 articles in this category had antibiotic resistance as the primary research focus, complemented with a GFPD method, mostly MST pertaining to markers. In total, 12 articles used MST markers to trace nutrient inputs into ambient waters. Interestingly, three articles were observed from the archaeology field, and employed genetic methods for faecal bacteria. These three disciplines are further discussed below.

Identification of the sources of ARGs

Antimicrobial resistance (AMR) is one of the top 10 global public health threats (World Health Organisation 2021). The spread of antibiotic resistant bacteria (ARB) and their ARG from hotspots such as WWTPs or agricultural run-off into freshwater and coastal ecosystems is of growing concern (Gao et al. 2018). Identifying such hotspots is, therefore, a pressing issue. Beyond the monitoring of a large panel of ARB and ARG targets of concern and the genotyping of ARGs (similar to pathogen typing), two additional approaches have been established that allow tracking their source.

The first relies on the differing AMR patterns of the gut microbiota of various host species, reflecting the differing antibiotic usage in human and veterinary medicine. This differing pattern is exploited for MST, where the pattern of the environmental samples of unknown pollution profile is compared to a library of known faecal sources. In the early 2000s, this ‘antibiotic resistance analysis’ relied on the phenotypic AMR characterization of E. coli or enterococci isolates (see also the section ‘The early days of genetic methods for faecal pollution diagnostics’; Mott and Smith 2011). More recently, Li et al. (2018) adapted the Bayesian source tracking tool SourceTracker, originally relying on 16S AmpSeq data, to ARG data from whole metagenome sequencing. At two rivers in China with dense human and livestock populations and with excess nutrient levels, this tool identified WWTPs as the major source of ARG at the majority of sites (Hu et al. 2020). At one site, nonhuman animal faeces proved to be the major pollutant. Correlations with host-associated faecal indicator genera, identified based on 16S AmpSeq data, helped identify swine manure as the main nonhuman faecal input.

The second approach relies on the co-occurrence of host-associated faecal microorganisms and ARG and/or ARB, because of a common source. Williams et al. (2022) studied persistent faecal pollution in an urban coastal bay in Sydney, Australia. qPCR MST and 16S AmpSeq together with SourceTracker were employed to pinpoint which stormwater drains drive dry-weather or wet-weather faecal pollution. Significant correlations between ARGs and the human-associated MST marker HF183 showed that the same stormwater drains were the main sources of ARG and of human faecal pollution. The Bolivian Andes is an intense mining area, and heavy metals exert selective pressure for the coselection of ARGs. Through multiple linear regression between the first principal component of a PCA of ARG data as dependent variable and metals, the human-associated viral marker crAssphage and physicochemical parameters as independent variables, Agramont et al. (2020) demonstrated that it is likely that human wastewater inputs, rather than heavy metals, drive ARG concentrations in the three rivers studied.

Identification of the sources of nutrient inputs

Nutrients, such as nitrite (NO2), nitrate (NO3), and phosphate (PO43−), are essential for plant life. However, excess concentrations can lead to eutrophication and harmful algal blooms (Kendall et al. 2007, Fenech et al. 2012). In addition, ingestion of high amounts of nitrate, e.g. through drinking water, may have serious health consequences such as methemoglobinemia of infants (blue baby syndrome), colorectal cancer, and thyroid disease (Ward et al. 2018). The World Health Organisation Drinking Water Guidelines recommend setting thresholds of nitrate and nitrite concentrations in drinking water (WHO 2017) and some countries also regulate surface water and groundwater (European Union: 91/676/EEC and 2006/118/EC, within the frame of 2000/60/EC). Mitigation of excessive nutrient inputs is, therefore, a key water quality management task. Tracing nutrient inputs relies on the fact that ratios of rare to abundant isotopes of certain elements differ among environmental and biological compartments, due to isotopic fractionation during physiochemical and biochemical reactions. As a typical example, nitrate sources can be tracked using δ15N and δ18O isotopes (Kendall et al. 2007, Fenech et al. 2012). Since nitrate has numerous biotic and abiotic sources and isotope tracing cannot separate all source types, a toolbox approach is often useful, which can include MST markers (Fenech et al. 2012).

One of the early studies combining δ15N and δ18O isotope tracing with MST was conducted along the Sava River, a tributary of the Danube River, that crosses Slovenia, Croatia, Bosnia and Herzegovina, and Serbia. The combined results indicated that soil nitrification and human wastewater were the primary nitrate sources in the Sava River, and the latter was also the main faecal pollution source (Vrzel et al. 2016). Carrey et al. (2021) assessed the main sources of nitrate pollution in surface water and groundwater across Catalonia, Spain in a government-led effort to review vulnerable zones as defined by the European Union Nitrates Directive (91/676/EEC). Nearly 200 samples were analysed for multiple isotopes (δ15N, δ18O, δ2H, and δ11B from various molecules), viral and bacterial FIO, human-, ruminant-, and swine-associated MST markers and complemented by land use data. Each sampling location was interpreted individually. The conclusions from multi-isotopic and MST data agreed or partially agreed in 79% of the samples. The authors offered detailed discussion on the complementary nature of the two approaches and the possible sources of disagreement (Carrey et al. 2021). In the coastal areas of Southwest Florida, harmful algal blooms caused by elevated nutrient levels are a recurring problem. Malfunctioning septic tanks were suspected to be the source of nutrients. Brewton et al. (2022) applied δ15N and δ13C isotope tracing, elemental composition of particulate matter (C:N:P), a panel of nutrients, chemical tracers, cultivation-based FIO and human-, bird-, and gull-associated MST markers to tackle the complex challenge. These multiple lines of evidence pointed to a link between septic systems, groundwater, and surface water, ultimately resulting in harmful algal blooms. Additionally, chemical tracers and bird- and gull-associated MST markers indicated rainfall runoff to be a contributing factor (Brewton et al. 2022). The Changle River catchment in China has a high human population, intensive livestock farming (swine), and agricultural activities, all of which potentially contribute to the high nutrient levels of the river. A Bayesian isotopic mixing model using data from the nitrate dual stable isotope technique (δ15N- NO3 and δ18O- NO3) suggested manure and sewage to be the dominant pollution sources (Cao et al. 2022). Since nitrate isotopes cannot differentiate between manure and sewage, Cao et al. (2022) applied MST using 16S AmpSeq together with the algorithm SourceTracker, which suggested untreated and treated domestic wastewater as the main sources. Redundancy analysis brought all lines of evidence (isotopes, MST, land use, and various ions) together to reveal domestic wastewater as a probable cause of nutrient pollution (Cao et al. 2022).

Archaeology

Genetic markers can remain detectable much longer in sediments than in the overlaying water column (Korajkic et al. 2019). Sediments may therefore offer time-integrated information on faecal pollution. In a tidal freshwater marsh in South Carolina, USA, the ruminant-associated MST marker BoBac was found in all sections of a soil core, the deepest section of which dated to 1961 (Drexler et al. 2014). While in this hydrogeological system the bacterial community of fresh pollution might migrate through the layers, the findings provide evidence of at least recent, but potentially long-term faecal pollution likely from deer and/or cow manure. On much larger timescales, lake sediments may act as biological archives of sedimentary ancient DNA from autochthonous (in-lake) and allochthonous (from the catchment and beyond) sources (Capo et al. 2021). Among other tools, palaeoenvironmental enquiries into ancient human presence and pastoral activities may also use MST markers or DNA sequencing techniques (Capo et al. 2021). In a study in Northern France, the authors documented a shift from agro-pastoral practices to forested landscapes during the Roman period. Testing for ovine and bovine mtDNA markers revealed sheep as the dominant livestock before the transition (Etienne et al. 2015).

Discussion

Emergence of a new field in health-related water quality analysis

The advent of genetic faecal pollution diagnostics (GFPD)

Our search for peer-reviewed science regarding the analysis of faecal pollution-associated nucleic acid targets in water demonstrates the rapid development of genetic diagnostics within the field of HRWM since the start of the new millennium. The meta-analysis of the currently existing application types also highlights that this novel scientific discipline extends far beyond the enumeration of genetic MST markers. Many traditional HRWM aspects, such as treatment and microbial transport indications, infection risk assessment and QMRA, as well as integration into modelling and simulations were found to be supplemented by GFPD (sections ‘Application 1’ through ‘Application 7’). In addition, several novel aspects such as the support of epidemiological outbreak tracing, wastewater surveillance, and supplementing ABR research, have also been developed.

The emerging scientific field of GFPD still grows; no plateau phase is in view (Fig. 3). In the past decade, the focus of research has shifted from method establishment to the implementation of these methods in scientific field research. An emphasis on field implementation is also indicated by the frequent use of certain genetic faecal markers, with some of them already standardized at the national level (section ‘Application 1’). However, method development has not halted, and it is very likely that expected future technological developments in molecular biological analytics, sequencing and bioinformatics (e.g. Callaway 2022) will further promote diversification within the field of GFPD research.

It thus seems justified to define this emerging part of science as a new discipline: genetic faecal pollution diagnostics in health-related microbial water quality analysis (see the section ‘Glossary’). The aim of GFPD is to open up the ‘black box’ of microbial faecal pollution of water resources to support problem-oriented water safety management, covering aspects such as catchment protection and management, water quality monitoring, health risk management, and treatment requirement evaluation. Additionally, GFPD can be applied to areas outside the water sector, as exemplarily indicated by its use in archaeology (section ‘Application 7’).

GFPD analyses distinct nucleic acid-based faecal pollution signatures

Vertebrate gut microbial communities fundamentally differ from environmental ‘nondigestive’ microbial communities (e.g. water, sediment, soil, plant, and nonvertebrate), as first demonstrated by the meta-analysis of 16S AmpSeq data by Ley et al. (2008). Long coevolution between host vertebrate animals (including humans) and their intestinal microbiomes, driven by many selective forces (e.g. adaptive immune system, host selection pressure, and unique biochemical environment), is likely responsible for this clear distinction (Ley et al. 2008). Although cosmopolitan populations do occur, strong vertebrate gut-associations also exist on the individual taxa level of microorganisms (McLellan and Eren 2014, Youngblut et al. 2019, 2020, 2021). This clear intestinal versus nonintestinal microbial community dichotomy forms the essential basis of specific detection of faecal pollution in water, targeting nucleic acid-based signatures from gut-associated bacteria, archaea and viruses. Similarly, evolutionary adaptations between macro- and intestinal microorganisms also exist on the host level, providing the basis for MST (section ‘Introduction’).

GFPD of today primarily focuses on the cultivation-independent detection of nucleic acid-based targets in the environment. The literature analysis highlighted that GFPD thus far predominantly relies on targeted analysis, where faecal pollution-associated sequences are directly detected by amplification methods (e.g. PCR, qPCR, and dPCR), using specific primers and probes. Owing to the enormous technological developments in HTS, nontargeted approaches, using broader taxonomic sequencing and subsequent specific in silico sequence alignment to faecal-associated signatures, have substantially improved during the past decade (Fig. 12, section ‘Outcomes of the systematic study design analysis’).

Figure 12.

Figure 12.

Analytical methods over the years in the ‘application studies’ pool, with the bar chart of all papers per year in the background (narticle = 649).

Advances in intestinal microbiomics will certainly further benefit GFPD, expanding our understanding of ecophylogenetics and providing access to representative sequence databases to support, (i) in silico design and evaluation of molecular assays, and (ii) bioinformatic analysis of big data from HTS (Fig. 13). Human and other animal intestinal microbiome research, with the greatest relevance in life sciences and medicine, is a very young discipline, and much is expected to be achieved in the future.

Figure 13.

Figure 13.

Conceptual framework of study design in the genetic faecal pollution diagnostics field.

Identified revolutionary aspects of GFPD for HRWM research

Genetic faecal pollution detection and MST: a methodological quantum leap

The use of GFPD has fundamentally changed the way scientific questions on faecal pollution problems in the environment can be addressed and answered (Malakoff 2002). MST using genetic methods has opened the way to identify and quantify many different pollution sources that cultivation-based methods do not allow. Approximately, half of the identified GFPD studies (356 out of 649 articles) dealt with MST, i.e. the characterization and origin determination of faecal pollution. Many novel cutting edge GFPD studies, covering single and multiple sources in differing types of water resources, including elevated faecal pollution levels in watersheds, recreational waters, groundwater resources, aquaculture and others, could be successfully realized (sections ‘Application 2’ and ‘Application 3’).

Biobanking: a new key element in HRWM research

Traditional cultivation-based FIO analysis requires sample transport, processing, and subsequent cultivation within a short time period (usually <1 working day). This often significantly constrains the possibilities and extent of research. In contrast, GFPD enables long-term nucleic acid preservation (>1 year) before performing the diagnostic analysis (De Paoli 2005, Jackson et al. 2011, Cary and Fierer 2014).

The possibility of storing nucleic acids for posterior analysis has several essential implications for HRWM research. Assuming that there is sufficient capacity to establish a representative sample bank over time and space, researchers can (i) focus on selected samples of interest (e.g. pollution event-based analysis), (ii) focus on the parameters appearing most appropriate at the time of analysis, and (iii) extend the investigation to other samples and/or genetic parameters at any time, if sufficient analyte is available. In hydrological sciences, this type of sample archiving for posterior analysis (e.g. isotopes) has already been a standard practice for decades.

Nucleic acid sample transfer supports the globalization of HRWM research

Nucleic acid sample conservation during field work also opens the way to international network structures, useful for performing centralized analysis in specialized laboratories (Layton et al. 2013, Reischer et al. 2013, Mayer et al. 2018). This point is especially interesting for developing regions that lack the infrastructure for advanced GFPD. During the COVID-19 pandemic, infrastructures for molecular biological analysis were established in many urban centres throughout the globe and will likely contribute to centralized GFPD activities in the future. Thus, even advanced GFPD will not be limited to certain regions of the world but will be accessible from any remote location, provided that basic infrastructure for sample collection, processing, storage, and transfer, as well as standard operating procedures, are available.

Direct detection of nucleic acids: characteristics and challenges

Characteristics of DNA/RNA-based target analysis

The literature analysis highlighted that GFPD targeting of prokaryotic microbiota (bacteria and archaea) has almost exclusively relied on DNA analysis, with the 16S rRNA gene as the most frequently used diagnostic region. In addition, alternative targets, such as gene regions for protein coding parts, have also been used (Shanks et al. 2008, Green et al. 2014a). The primary aim of tracing intestinal DNA signatures in the environment is the sensitive detection and characterization of faecal pollution. Such DNA analysis does not give any information about the physiological status of the targeted microbiota in the analysed water. Active, inactive, starving, viable but not culturable, or dead microbial populations are often detected equally. Depending on the applied extraction procedure, DNA attached to cells, organic debris, biofilms, or sediments, and even freely suspended DNA, is also detectable (Carini et al. 2016). The same is true for viral targets. Detecting viral DNA or RNA does not provide information on the infectious or noninfectious status of the targeted populations.

Notably, it was reported, that the application of ribosomal RNA via RT-qPCR for bacterial general faecal markers and MST markers increases the sensitivity and frequency of faecal pollution detection for several water resource types (Pitkänen et al. 2013). In addition, rRNA analysis may also be interesting for viability investigations (section ‘Generating viability- and infectious status information by molecular tools’).

Relevance of viability- or infectious status information

While the majority of genetic detection methods available do not account for information on the viability or infectivity status of the microorganisms or viruses from which the nucleic acids originate, it is important to note, that this is not the main purpose for many GFPD applications. For example, this is clearly the case for most of the identified faecal pollution detection and MST studies throughout the literature analysis (sections ‘Application 1’ to ‘Application 3’). Nevertheless, as outlined below, robust information on the persistence and resistance properties of the genetic targets is essential for the correct selection and application of genetic MST markers and for the appropriate data interpretation (section ‘GFPD (MST) application frame: status quo and research needs’). Other identified GFPD application areas, such as the support of outbreak tracing or wastewater surveillance, do not rely on the viability status of the microbial targets either (section ‘Application 6’).

Even the use in recreational water quality monitoring seems to be a realistic exercise, without the need for a viability endpoint (section ‘Application 5’). For example, a recent investigation on swimming-associated health risks, including 80 000 beachgoers at 13 beaches (pooled data), revealed the strongest associations between gastrointestinal symptoms and qPCR-quantified enterococci, but not with cultivation-based enumeration (Wade et al. 2022). It was previously hypothesized, that enterococci DNA, as quantifiable by qPCR, better reflects the survival of resistant pathogens during wastewater treatment (e.g. resistant enteric viruses) than cultivable enterococci concentrations (Wade et al. 2006, Srinivasan et al. 2011). Obviously, it is desirable for pathogen die-off kinetics to match the decay kinetics of the analysed indicator signals, irrespective of whether viability- or nonviability-based parameters are considered. Undoubtedly, more research is needed to better understand the principles behind these important relationships in GFPD and health risk assessment. However, the extent of already existing innovative research by nucleic acid-based qPCR analysis for infection- and health risk indication holds great promise for the future (section ‘Application 5’).

Information on viability or infectious status becomes an essential criterion when microbicidal and virucidal treatments are to be characterized. In particular, the efficacy assessment of disinfection, including all technologies (e.g. by heat, chlorine, ozone, UV light, and so on), requires the application of representative and reliable indicators for viability and especially infectivity, often supplemented by selected reference pathogens. The assessment is historically based on cultivation methods, the considered lege artis gold standard, especially when disinfection processes and log-reduction targets are to be monitored, validated or verified. For example, a recent European Union regulation requires the cultivation-based validation monitoring of reclaimed water for agricultural irrigation (class A) using E. coli, somatic coliphages and C. perfringens spores, with defined performance targets of ≥5, ≥6, and ≥4 log10 reductions within the treatment chain, respectively (European Union 2020).

Generating viability- and infectious status information by molecular tools

In addition to cultivation-based enumeration, cultivation-independent, molecular strategies for viability- and infectious status analysis are also increasingly applied in research. For prokaryotes, a vast array of different techniques, including RNA-based methods (rRNA, messenger RNA), membrane integrity (e.g. viability stains, viability PCR), cellular metabolism (e.g. ATP, respiration, isotope labelling), protein-based methods (e.g. BONCAT), and microcalorimetry, have been suggested within the broad field of microbial ecology (Emerson et al. 2017). However, the delineation of dead versus viable microbial cells is complex and still under debate (Davey 2011, Kirschner et al. 2021). There is consensus that living microbial cells should have, (i) intact functional cell membranes, (ii) intact cellular- and energy metabolism, and (iii) the capability to reproduce (i.e. intact transcription/translation mechanisms). Straightforward determination strategies frequently address only one of these aspects of microbial viability (e.g. ‘live/dead’ protocols), leaving room for uncertainty (Emerson et al. 2017). Thus, (more time-consuming) multiple criteria are to be applied simultaneously, if precise viability characterization of the target microbiota is required (Kirschner et al. 2021). Detection of infectious viruses is equally challenging, and no single method is available to detect all infectious viruses in water (Gerba et al. 2018). At least three criteria must be fulfilled for infectious viruses, (i) sufficient genomic integrity to produce the required proteins for replication and to provide an accurate genetic template for subsequent generations, (ii) protection of the genome from degradation, and, (iii) the ability of the virus to recognize and infect the host cell (Pecson et al. 2009, Gerba et al. 2018).

Viability PCR and a similar approach, ET-qPCR, were introduced to the field of GFPD more than a decade ago (Bae and Wuertz 2009, Pecson et al. 2009) and have been increasingly applied in HRWM research in recent years. The original idea of applying viability PCR to bacterial MST markers was to gain information on recent faecal pollution events in water resources (Bae and Wuertz 2012, 2015). Viability PCR relies on the pretreatment of the sample with an intercalating dye, PMA, that penetrates cells with impaired membranes and prevents PCR-based amplification (Nocker et al. 2006), thus allowing the selective detection of cells with an intact membrane. Virus capsid integrity may also be assessed using the same principles (reviewed in Leifels et al. 2021) or using ET-PCR (Pecson et al. 2009). However, several authors note challenges related to conditions of procedure confounding the results and emphasize that experimental conditions need to be optimized and validated for the microorganism under investigation (Fittipaldi et al. 2012, Lazou et al. 2019, Leifels et al. 2021). The application of viability PCR now extends to the assessment of microorganism attenuation during treatment processes (section ‘Application 4’).

In summary, molecular tools to generate information on viability and infectious state constitute a novel and innovative area of research in GFPD. Relatively little experience exists in comparison to traditional PCR and qPCR analysis (section ‘Application 4’). Many challenges are still associated with their application, such as problems with methodical reproducibility, cross-reaction with background- or free nucleic acids, selection of optimal reagents, and experimental conditions and protocols (Gerba et al. 2018, Codony et al. 2020). Furthermore, the success of these methods often depends on the particular mechanism of inactivation (e.g. chemical vs. physical agents). Nonetheless, further development activities in the future will likely open new windows of opportunity in HRWM as well as in complementing cultivation-based standards. In addition, many potential areas within the range of these available molecular tools have not yet been exploited (Emerson et al. 2017). For example, and in contrast with viability PCR applications, RNA-based methods have only very rarely been applied and evaluated in GFPD (Pitkänen et al. 2013). As successfully demonstrated in other fields of environmental microbiology, RNA analysis may significantly contribute to information on the activity status of microbial populations (Gourse et al. 1996, Amann and Ludwig 2000, Deutscher 2006).

Sensitivity of environmental detection of nucleic acid targets

A common narrative is that molecular DNA/RNA diagnostics are highly specific and sensitive. This may be true for theoretical considerations. For ‘real world’ applications, this dictum, especially in relation to sensitivity, must be considered in the context of the overall analytical measurement challenge (Wintzingerode et al. 1997). For example, an optimally designed qPCR test should be able to detect, in theory, one target molecule of DNA/RNA, if present in a single reaction unit. However, as target molecules follow a stochastic distribution during analyte dilution for parallel analysis, the assay limit of detection (aLOD) cannot be less than three target molecules for a 95% detection probability per qPCR analysis, even with perfect PCR kinetics (Bustin et al. 2009). However, overall considerations require whole chain analysis (WCA), including sampling, recovered sampling volume, filtration- and enrichment-, nucleic acid extraction-, and purification efficacies, and finally, the amount of nucleic acid analysed (Table S2, Supporting Information). The resulting overall WCA sensitivity, reported for instance as the sample limit of detections (sLOD), can be quite elevated (Domingo et al. 2007). To illustrate, sLOD or alternative estimates on WCA sensitivity for qPCR DNA/RNA target enumeration were reported to be in the range of log10 1.5–3.9 genetic targets per 100 ml sample (Pitkänen et al. 2013).

Selected genetic targets for GFPD often target highly abundant intestinal bacterial and viral populations as occurring in faecal excreta or wastewater, to compensate for the abovementioned WCA sensitivity issues. This fundamental design criterion is achieved by almost all top performing qPCR assays of genetic faecal markers (Layton et al. 2013, Reischer et al. 2013, Green et al. 2014b, Mayer et al. 2018, Sabar et al. 2022). Less abundant intestinal targets, such as traditional E. coli or enterococci (Farnleitner et al. 2010) can still be detected using genetic methods, if faecal pollution levels are elevated, as frequently observed for surface waters under communal and agricultural influence. However, in situations with low to very low faecal pollution levels, such as groundwater and drinking water resources, the sensitivity issues of genetic faecal markers can be very limiting. High-volume sampling, specific enrichment or alternative amplification systems may bring improved sensitivity and thus extend the possibilities of GFDP to such situations (Min and Baeumner 2002, Heijnen and Medema 2009, Rhodes et al. 2011, Liu et al. 2012).

In scenarios of low faecal pollution levels, it is common for a large proportion (>50%) of measurements to be below a GFPD method limit of quantification. For these censored data, the true genetic target concentration cannot be firmly established and can represent a significant source of bias in downstream statistical analyses. While it may be convenient to ignore censored data, these measurements offer important information. As a result, there is a growing interest in the development and use of statistical methods that can responsibly incorporate censored data into concentration estimates, hypotheses tests, regressions, and other analyses to help minimize potential bias and maximize faecal pollution trend insights. For example, Cao et al. (2018) developed a qPCR censored data faecal score approach to estimate a weighted-average genetic marker concentration from a defined group of samples using all measurements (e.g. nondetection, below the limit of quantification, or within the range of quantification). Additional research is needed to further advance censored data analysis methodologies custom designed for GFPD applications.

HTS applications as identified in our literature analysis (section ‘Outcomes of the systematic study design analysis’), face challenges in addition to WCA. In fact, the achievable sensitivity of 16S AmpSeq applications, applying general primers for broad taxonomic detection, such as kingdom and phylum level, strongly depends on the relative abundance of faecal pollution-associated intestinal microbiota compared to nonfaecal pollution associated microbiota (i.e. environmental ‘background microbiome’). Water resources, showing low to moderate faecal pollution levels and abundant aquatic microbiomes (e.g. 109–1011 cells/l for lakes or rivers; Kirschner et al. 2004, Velimirov et al. 2011), become problematic, even when applying high amplicon sequencing-depth (Vierheilig et al. 2015). Consequently, identified studies have most frequently focused on water resources with significant municipal and agricultural faecal pollution levels (section ‘In-depth review of the application areas of genetic faecal pollution diagnostic through case studies’).

GFPD (MST) application frame: status quo and research needs

Cutting-edge solutions require in-depth expert knowledge

The ability of GFPD methods to detect (is there a pollution problem?), quantify (what is the extent of pollution?), and allocate (what are the sources of pollution?) faecal pollution in water and water resources has undoubtedly revolutionized this area of HRWM research during the last two decades (Malakoff 2002). However, the application of general and host-associated faecal markers to generate accurate information on the responsible faecal pollution sources is not trivial. For example, the available genetic faecal marker targets as well as their quantification systems, differ in pollution source abundance and environmental persistence. Therefore, differences in these characteristics may severely compromise or prevent meaningful interpretation of results. Box 1 (upper panel, nonoptimal parameter setup) shows a hypothetical MST situation to illustrate the confusing effects that differential abundance and persistence of MST markers can impose for correct indication. Quantitative comparisons of MST results, or the more complex task of source apportionment (i.e. computation of faecal loads from the various sources), solely based on qPCR results, may therefore, only be achievable for a limited ‘diagnostic space’ (see the examples t0, t1, and t2 in Box 1, and ‘A toolbox approach with case-dependent selection criteria’ below). Having sound expert knowledge on the potentials and limits of GFPD is thus an essential prerequisite for correct application of GFPD in the field.

Box 1:

Microbial source tracking markers: diagnostic scenarios

graphic file with name fuad028ufig1.jpg

Simple hypothetical MST situation with two different point sources of pollution [e.g. human wastewater and animal (pig) manure] of equivalent discharge and contamination load for a small river. For reasons of simplicity, only dilution at the time of contamination (t0) and decay of the MST markers is considered (i.e. batch-reactor system with complete mixing and no sedimentation). Three time slots (t0, t1, and t2) are chosen to illustrate the different ‘diagnostic windows’ of MST indications at the given detection limit (sample limit of detection, sLOD).

Nonoptimal toolbox. All four applied MST markers show different abundance in their respective faecal excreta and persistence in the water body. At t0, all four MST markers allow correct qualitative detection of both sources (differential persistence insignificant). Due to the differential abundances of MST markers, no direct estimation on the relative importance of PS1/PS2 is possible. However, mathematical corrections of concentration differences in excreta would make this possible. At t1, MST marker 1B leads to false negative detection of PS1, due to differential persistence. Even in the case of accounting for differential abundance, only MST markers 1A and 2A can be used to estimate the relative importance of pollution of PS1/PS2 thanks to their similar persistence. At t2, only PS2 is detectable by MST marker 2B, thus the diagnosis would miss PS1 (false negative detection at the given sLOD).

Optimal toolbox. Both selected pairs of MST markers show comparable pollution source abundance in faecal excreta and persistence in the water body. The MST markers pair 1A–2A allow the estimation of the relative contribution of PS1 and PS2 at all times (t0–t2). Due to lower source abundance, the MST marker pair 1B–2B only allow detection and comparison at t0 and t1, but not at t2.

A toolbox approach with case-dependent selection criteria

No method comes without limitations, and no single method can have a universal application. Each genetic faecal parameter has specific biological–diagnostic and technical–analytical attributes (Table 2; Table S1, Supporting Information). The selection of diagnostic tools, as well as the chosen field investigation strategy, should therefore, be designed to best suit the given faecal pollution problem (Schoen et al. 2020), including a sound knowledge of the catchment characteristics and hydrological regime (Reischer et al. 2008). A basic catchment survey or pollution source profile can substantially improve the understanding of the situation and guide the selection of GFPD parameters and methods with appropriate performance characteristics (Reischer et al. 2011, Derx et al. 2023).

In addition to persistence, it is equally essential for MST to have appropriate (binary) faecal sensitivity and specificity of the selected genetic marker (Table 2). The minimum acceptable levels of faecal sensitivity and specificity depend on the faecal pollution scenario under investigation (such as the relative abundance of the diagnosed faecal pollution sources). These levels can be determined through statistical considerations or catchment-based scenario simulations (Kildare et al. 2007, Reischer et al. 2011, Derx et al. 2021). A well-selected combination of markers, along with an algorithm that considers the sensitivity and specificity characteristics of the markers, enables more confident source identification compared to an individual marker (Ballesté et al. 2020). Faecal specificity is also important for general faecal markers (should be absent in a pristine environment, Table 2) and, in analogy with MST markers, should be evaluated in the studied catchments (Vierheilig et al. 2012). There are significant knowledge gaps regarding the mobility of indicator microorganisms and viruses detected by GFPD (Table 2). Mobility can be an essential factor in almost any natural and technical aquatic compartment. For example, mobility may codetermine the fate of MST markers, (i) in deposited fresh cow pats on pastures (e.g. activation tendency and run-off during rainfall; Devane et al. 2022); (ii) during wastewater treatment (e.g. attachment to sewage sludge fraction or dispersion in the water phase; Wang et al. 2023); (iii) in surface water transport processes, such as river water (e.g. attachment to settling particles and sedimentation or transport in suspended fraction; Fauvel et al. 2017); or (iv) during river bank filtration (e.g. straining or attachment in the aquifer or aquifer transport; Wang et al. 2022). Mobility characteristics are often complex, as they are potentially influenced by physical, chemical and biological processes, depending on the aquatic scenario. Finally, the resistance of genetic markers to technical treatment is an additional important biological–diagnostic attribute (Table 2). Different resistance is expected for cultivation-based parameters than for DNA/RNA-based parameters (section ‘Generating viability- and infectious status information’). For instance, in contrast to cultivation-based FIO concentrations, almost no reduction was observed for qPCR-based prokaryotic MST markers during UV-treatment of wastewater and drinking water (Steinbacher et al. 2021).

Prokaryotic targets dominate, but the importance of viral targets increases

Key biological elements in the current state-of-the-art toolbox are the various general- and host-associated faecal markers, quantified by qPCR or dPCR assays (section ‘Background information on genetic targets and methods: a historical overview’). The systematic literature analysis revealed that prokaryotic faecal markers have dominated GFPD up to now (Fig. 14). However, viral faecal markers have seen an increase in the past 10 years, while mitochondrial markers have been applied to a far lesser extent (Fig. 14). The combined use of selected MST markers, with adequate performance characteristics, hold promise for detecting and allocate faecal pollution with increased confidence, even under challenging faecal pollution scenarios (e.g. undiluted vs. diluted, fresh versus aged, untreated vs. treated faecal pollution). In this respect, complementing prokaryotic GFPD applications with viral faecal markers can be especially important to account for the increased persistence, resistance and mobility characteristics of such types of intestinal contaminants (‘Application 4: Microorganisms attenuation during treatment’ and ‘Application 5: Estimating of infection and health risk’). Cultivation-based FIO (see discussion ‘hybrid application’ below), pathogen detection and antibiotic resistance analysis complements the current array of biological elements in GFPD (Fig. 5).

Figure 14.

Figure 14.

Timeseries of microorganisms targeted by genetic methods, with the bar chart of all papers per year in the background (narticle = 649). mtDNA indicates host mitochondrial DNA.

The GFPD toolbox is steadily growing, although now at a slower pace than during the first pioneering decade of the new millennium (Fig. 3, number of establishment/application studies). New genetic faecal markers and/or improved detection systems are, without any doubt, essential for the further development of the discipline. However, it should be kept in mind that providing detailed information on their environmental behaviour and application characteristics (Table 2; Tables S1 and S2, Supporting Information) is equally important, if not more, to successfully implement them in HRWM research (Fig. 13). There is a disparity between the availability of genetic markers and detection systems compared to the availability of information on their biological–diagnostic characteristics and on their applicability in different types of water resources and subcompartments (under given biotic and abiotic conditions, e.g. (Boehm et al. 2019, Korajkic et al. 2019, Lu and Imlay 2021)).

Integrated data analysis and modelling

The inclusion of other microbiological and environmental parameters into the study design can greatly enhance the information gained from GFPD investigations. Remarkably, almost two-thirds of the identified source tracking studies (single and multiple sources) simultaneously applied genetic MST-marker qPCR quantification and traditional cultivation-based enumeration of FIOs (Fig. 3), determined by standardized parameters such as those for E. coli (ISO 1998a, 2012, 2014, 2018) or intestinal enterococci (ISO 1998b, 2000). The need to determine the causes responsible for faecal pollution in water obviously promotes this most popular ‘hybrid application’ (sections ‘Application 2’ and ‘Application 3’). In addition, data on pathogen occurrence and physicochemical water quality were used to complement the investigation, although to a far lesser extent. In contrast, the identified GFPD studies hardly utilized information on hydrology, meteorology, land use or epidemiology for statistical data analysis (Fig. 5). This is contrary to expectations, as environmental data, such as data on catchment hydrology and land use (GIS, mapping), have proven essential for an improved application, understanding and interpretation of GFPD in water quality research (Reischer et al. 2008, Peed et al. 2011, Bambic et al. 2015, Verhougstraete et al. 2015, Frick et al. 2020, Green et al. 2021, Diedrich et al. 2023). Without a doubt, there is significant potential to better utilize and integrate environmental data in GFPD analysis in future HRWM research (Fig. 13).

Data from GFPD, together with FIO and pathogens, are increasingly used in modelling and simulation. Potential areas of interest include all issues and scales of HRWM research (ranging from faecal marker persistence/dilution models to catchment-based source/sink transport simulations) as well as application types (such as faecal pollution, MST, treatment, and infection- and health risk assessment) as covered in this literature analysis (Dorner et al. 2006, Sokolova et al. 2012, Boehm et al. 2015, Pascual-Benito et al. 2020, Derx et al. 2021). To highlight the importance, modelling becomes essential, e.g. to estimate the required microbial/viral log reduction targets for wastewater or drinking water treatment or to determine the appropriate setback distances during riverbank filtration. Importantly, modelling and simulations also allow the assessment of future scenarios and even the prediction of the management measures that will be required considering future climate and global change phenomena (Demeter et al. 2021). One of the big challenges of modelling and simulation in health-related water quality research and GFPD is to provide all the data and data collections required (Fig. 13).

Conclusions

  • The tools and approaches developed for GFPD have revolutionized HRWM research in the last two decades in terms of faecal pollution detection and microbial source tracking, the current core areas of application. Together with nucleic acid extract biobanking, GFPD represents a new level of methodological possibilities in health-related water quality research in the 21st century, even in remote or less developed regions.

  • GFPD is ready to expand to many other application areas within and outside the field of HRWM. For instance, it will further gain importance in infection and health risk assessment (e.g. recreational water quality monitoring) and will increasingly support the evaluation and verification of water treatment and disinfection processes, in combination with standardized treatment indicators and cultivation-based enumeration.

  • The COVID-19 pandemic gave a strong boost to the field of wastewater surveillance. Wastewater surveillance for SARS-CoV-2 is currently transforming into a global early warning disease monitoring system. GFPD will likely increasingly support wastewater surveillance in data generation, pollution source characterization, normalization, and quality assurance. Since both ‘sister’ disciplines use the same molecular biological framework and infrastructure, potential synergies are significant. In general, GFPD has the potential to support any environmental global infectious disease surveillance system, covering human and other animal populations.

  • As demonstrated by the many identified studies, internationally accepted, cultivation-based water quality parameters, such as E. coli or intestinal enterococci, can be effectively complemented with GFPD, thus significantly expanding the methodical possibilities in water quality monitoring and management, when needed (e.g. MST to trace the origin of cultivation-based FIO). GFPD constitutes a toolbox approach. Tailor-made scientific investigation and monitoring solutions can be rapidly established by experts.

  • The current century is ‘the Century of Life Sciences’, especially considering how molecular biology and bioinformatics rapidly transform health sciences and medicine. It is also the era of information technology, artificial intelligence, and automatization. These driving forces will certainly promote further innovation within genetic faecal pollution detection. Many technological breakthroughs are expected.

  • From science to practice. The water management sector increasingly needs the tools and approaches offered by GFD to solve future challenges (e.g. challenges related to SDG6). The translation of such tools to practice has to be paralleled by standardization efforts. While some countries have already started such activities (e.g. three assays are standardized in the USA), international standards are still lacking. These needs will have to be defined by the water management sector and translated to future GFPD guidelines and standards by global panels of experts.

  • This meta-analysis provides the scientific status quo of the field of GFPD. It should promote further research to advance the scientific field and serve as a condensed information source for the wider audience, including microbiologists, water hygienists, water management professionals, and public health experts.

Supplementary Material

fuad028_Supplemental_Files

Acknowledgements

The authors acknowledge TU Wien Bibliothek for financial support through its Open Access Funding Programme. The authors are thankful to Andreas Pacher from TU Wien Bibliothek for help with database searches and for critical discussion on paper evaluation metrics. We thank also Dr. Mats Leifels for providing useful discussion and references. Information has been subjected to U.S. EPA peer and administrative review and has been approved for external publication. Any opinions expressed in this paper are those of the authors and do not necessarily reflect the official positions and policies of the U.S. EPA. Any mention of trade names or commercial products does not constitute endorsement or recommendation for use. This is a joint research effort of the Interuniversity Cooperation Centre for Water & Health (www.waterandhealth.at) and the Global Water Pathogen Program (GWPP) initiative (www.waterpathogens.org).

Contributor Information

Katalin Demeter, Institute of Chemical, Environmental and Bioscience Engineering 166, TU Wien, Gumpendorferstrasse 1A, 1060 Vienna, Austria.

Rita Linke, Institute of Chemical, Environmental and Bioscience Engineering 166, TU Wien, Gumpendorferstrasse 1A, 1060 Vienna, Austria.

Elisenda Ballesté, Departament de Genètica, Microbiologia I Estadística, Universitat de Barcelona, Av/ Diagonal 643, Barcelona, Spain.

Georg Reischer, Institute of Chemical, Environmental and Bioscience Engineering 166, TU Wien, Gumpendorferstrasse 1A, 1060 Vienna, Austria.

René E Mayer, Institute of Chemical, Environmental and Bioscience Engineering 166, TU Wien, Gumpendorferstrasse 1A, 1060 Vienna, Austria; Division Water Quality and Health, Department of Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health Sciences, Dr.-Karl-Dorrek-Straße 30, 3500 Krems an der Donau, Austria.

Julia Vierheilig, Institute of Water Quality and Resource Management , TU Wien, Lilienthalgasse 21 OD, 1030 Vienna, Austria.

Claudia Kolm, Division Water Quality and Health, Department of Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health Sciences, Dr.-Karl-Dorrek-Straße 30, 3500 Krems an der Donau, Austria.

Margaret E Stevenson, Institute of Hydraulic Engineering and Water Resources Management, TU Wien, Karlsplatz 13, 1040 Vienna, Austria.

Julia Derx, Institute of Hydraulic Engineering and Water Resources Management, TU Wien, Karlsplatz 13, 1040 Vienna, Austria.

Alexander K T Kirschner, Division Water Quality and Health, Department of Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health Sciences, Dr.-Karl-Dorrek-Straße 30, 3500 Krems an der Donau, Austria; Institute for Hygiene and Applied Immunology, CePII, Medical University of Vienna, Kinderspitalgasse 15, A-1090 Vienna, Austria.

Regina Sommer, Institute for Hygiene and Applied Immunology, CePII, Medical University of Vienna, Kinderspitalgasse 15, A-1090 Vienna, Austria.

Orin C Shanks, U.S. Environmental Protection Agency, Office of Research and Development, Cincinnati, USA.

Anicet R Blanch, Departament de Genètica, Microbiologia I Estadística, Universitat de Barcelona, Av/ Diagonal 643, Barcelona, Spain.

Joan B Rose, College of Agriculture and Natural Resources, Michigan State University, East Lansing, USA.

Warish Ahmed, CSIRO Environment, Ecosciences Precinct, 41 Boggo Rd, Dutton Park, QLD 4102, Australia.

Andreas H Farnleitner, Institute of Chemical, Environmental and Bioscience Engineering 166, TU Wien, Gumpendorferstrasse 1A, 1060 Vienna, Austria; Division Water Quality and Health, Department of Pharmacology, Physiology and Microbiology, Karl Landsteiner University of Health Sciences, Dr.-Karl-Dorrek-Straße 30, 3500 Krems an der Donau, Austria.

Conflict of interest statement

The authors declare no conflict of interest.

Funding

This work was supported by the Gesellschaft für Forschungsförderung Niederösterreich m.b.H. (GFF) (grant number LSC 19-016 ‘Future Danube’), by the Vienna Science and Technology Fund (WWTF) (grant number ESR17-070), and by the research project ‘Vienna Water Resource Systems 2020+’ (ViWa2020+), in cooperation with the City of Vienna (Vienna Water, MA31). The content does not necessarily represent the views of the Province of Lower Austria or the Gesellschaft für Forschungsförderung Niederösterreich as the funding agency. Neither the province of Lower Austria nor the funding agency can, therefore, be held responsible for the content.

References

  1. Agramont J, Gutierrez-Cortez S, Joffre Eet al. Fecal pollution drives antibiotic resistance and class 1 integron abundance in aquatic environments of the Bolivian Andes impacted by mining and wastewater. Microorganisms. 2020;8:1122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ahmed W, Bivins A, Bertsch PMet al. Intraday variability of indicator and pathogenic viruses in 1-h and 24-h composite wastewater samples: implications for wastewater-based epidemiology. Environ Res. 2021;193:110531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ahmed W, Goonetilleke A, Powell Det al. Evaluation of multiple sewage-associated Bacteroides PCR markers for sewage pollution tracking. Water Research. Vol. 43, 2009, 4872–7. ISSN 0043-1354, 10.1016/j.watres.2009.08.042. [DOI] [PubMed] [Google Scholar]
  4. Ahmed W, Gyawali P, Feng Set al. Host specificity and sensitivity of established and novel sewage-associated marker genes in human and nonhuman fecal samples. Appl Environ Microbiol. 2019;85:e00641–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ahmed W, Harwood VJ. Human and animal enteric viral markers for tracking the sources of faecal pollution. In: Rose JB, Jimenez Cisneros BE (eds), Global Water Pathogen Project. Lansing: Michigan State University, UNESCO, 2017. [Google Scholar]
  6. Ahmed W, Kitajima M, Tandukar Set al. Recycled water safety: current status of traditional and emerging viral indicators. Curr Opin Environ Sci Health. 2020;16:62–72. [Google Scholar]
  7. Ahmed W, Lobos A, Senkbeil Jet al. Evaluation of the novel crAssphage marker for sewage pollution tracking in storm drain outfalls in Tampa, Florida. Water Res. 2018;131:142–50. [DOI] [PubMed] [Google Scholar]
  8. Ahmed W, Simpson SL, Bertsch PMet al. Minimizing errors in RT-PCR detection and quantification of SARS-CoV-2 RNA for wastewater surveillance. Sci Total Environ. 2022;805:149877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Amann R, Ludwig W. Ribosomal RNA-targeted nucleic acid probes for studies in microbial ecology. FEMS Microbiol Rev. 2000;24:555–65. [DOI] [PubMed] [Google Scholar]
  10. An X-L, Wang J-Y, Pu Qet al. High-throughput diagnosis of human pathogens and fecal contamination in marine recreational water. Environ Res. 2020;190:109982. [DOI] [PubMed] [Google Scholar]
  11. Asami T, Katayama H, Torrey JRet al. Evaluation of virus removal efficiency of coagulation-sedimentation and rapid sand filtration processes in a drinking water treatment plant in Bangkok, Thailand. Water Res. 2016;101:84–94. [DOI] [PubMed] [Google Scholar]
  12. Bae S, Wuertz S. Decay of host-associated Bacteroidales cells and DNA in continuous-flow freshwater and seawater microcosms of identical experimental design and temperature as measured by PMA-qPCR and qPCR. Water Res. 2015;70:205–13. [DOI] [PubMed] [Google Scholar]
  13. Bae S, Wuertz S. Discrimination of viable and dead fecal Bacteroidales bacteria by quantitative PCR with propidium monoazide. Appl Environ Microbiol. 2009;75:2940–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bae S, Wuertz S. Survival of host-associated Bacteroidales cells and their relationship with Enterococcus spp., Campylobacter jejuni, Salmonella enterica Serovar Typhimurium, and Adenovirus in freshwater microcosms as measured by propidium monoazide-quantitative PCR. Appl Environ Microbiol. 2012;78:922–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Ballesté E, Belanche-Muñoz LA, Farnleitner AHet al. Improving the identification of the source of faecal pollution in water using a modelling approach: from multi-source to aged and diluted samples. Water Res. 2020;171:115392. [DOI] [PubMed] [Google Scholar]
  16. Balleste E, Blanch AR. Persistence of Bacteroides species populations in a river as measured by molecular and culture techniques. Appl Environ Microbiol. 2010;76:7608–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Bambic DG, Kildare-Hann BJ, Rajal VBet al. Spatial and hydrologic variation of Bacteroidales, adenovirus and enterovirus in a semi-arid, wastewater effluent-impacted watershed. Water Res. 2015;75:83–94. [DOI] [PubMed] [Google Scholar]
  18. Bauer R, Dizer H, Graeber Iet al. Removal of bacterial fecal indicators, coliphages and enteric adenoviruses from waters with high fecal pollution by slow sand filtration. Water Res. 2011;45:439–52. [DOI] [PubMed] [Google Scholar]
  19. Bauza V, Madadi V, Ocharo RMet al. Microbial source tracking using 16S rRNA amplicon sequencing identifies evidence of widespread contamination from young children's feces in an urban slum of Nairobi, Kenya. Environ Sci Technol. 2019;53:8271–81. [DOI] [PubMed] [Google Scholar]
  20. Becker MW, Collins SA, Metge DWet al. Effect of cell physicochemical characteristics and motility on bacterial transport in groundwater. J Contam Hydrol. 2004;69:195–213. [DOI] [PubMed] [Google Scholar]
  21. Bej AK, Steffan RJ, DiCesare Jet al. Detection of coliform bacteria in water by polymerase chain reaction and gene probes. Appl Environ Microbiol. 1990;56:307–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Bellou MI, Syngouna VI, Tselepi MAet al. Interaction of human adenoviruses and coliphages with kaolinite and bentonite. Sci Total Environ. 2015;517:86–95. [DOI] [PubMed] [Google Scholar]
  23. Bernhard AE, Field KG. A PCR assay to discriminate human and ruminant feces on the basis of host differences in Bacteroides-Prevotella genes encoding 16S rRNA. Appl Environ Microbiol. 2000;66:4571–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Bernhard AE, Goyard T, Simonich MTet al. Application of a rapid method for identifying fecal pollution sources in a multi-use estuary. Water Res. 2003;37:909–13. [DOI] [PubMed] [Google Scholar]
  25. Betancourt WQ, Kitajima M, Wing ADet al. Assessment of virus removal by managed aquifer recharge at three full-scale operations. J Environ Sci Heal A. 2014;49:1685–92. [DOI] [PubMed] [Google Scholar]
  26. Bibby K, Crank K, Greaves Jet al. Metagenomics and the development of viral water quality tools. npj Clean Water. 2019;2:13. [Google Scholar]
  27. Bivins A, Crank K, Greaves Jet al. Cross-assembly phage and pepper mild mottle virus as viral water quality monitoring tools—potential, research gaps, and way forward. Curr Opin Environ Sci Health. 2020;16:54–61. [Google Scholar]
  28. Boehm AB, Graham KE, Jennings WC. Can we swim yet? Systematic review, meta-analysis, and risk assessment of aging sewage in surface waters. Environ Sci Technol. 2018;52:9634–45. [DOI] [PubMed] [Google Scholar]
  29. Boehm AB, Silverman AI, Schriewer Aet al. Systematic review and meta-analysis of decay rates of waterborne mammalian viruses and coliphages in surface waters. Water Res. 2019;164:13. [DOI] [PubMed] [Google Scholar]
  30. Boehm AB, Soller JA, Shanks OC. Human-associated fecal quantitative polymerase chain reaction measurements and simulated risk of gastrointestinal illness in recreational waters contaminated with raw sewage. Environ Sci Technol Lett. 2015;2:270–5. [Google Scholar]
  31. Borchardt MA, Stokdyk JP, Kieke BAet al. Sources and risk factors for nitrate and microbial contamination of private household wells in the fractured dolomite aquifer of northeastern Wisconsin. Environ Health Perspect. 2021;129:67004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Bradford SA, Simunek J, Bettahar Met al. Modeling colloid attachment, straining, and exclusion in saturated porous media. Environ Sci Technol. 2003;37:2242–50. [DOI] [PubMed] [Google Scholar]
  33. Brewton RA, Kreiger LB, Tyre KNet al. Septic system–groundwater–surface water couplings in waterfront communities contribute to harmful algal blooms in Southwest Florida. Sci Total Environ. 2022;837:155319. [DOI] [PubMed] [Google Scholar]
  34. Brown KI, Graham KE, Boehm AB. Risk-based threshold of gull-associated fecal marker concentrations for recreational water. Environ Sci Technol Lett. 2017;4:44–48. [Google Scholar]
  35. Brunson JC, Read QD. ggalluvial: alluvial Plots in ‘ggplot2’. CRAN, 2023. [Google Scholar]
  36. Burch TR, Stokdyk JP, Spencer SKet al. Quantitative microbial risk assessment for contaminated private wells in the fractured dolomite aquifer of Kewaunee county, Wisconsin. Environ Health Perspect. 2021;129:067003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Bushon RN, Brady AMG, Christensen EDet al. Multi-year microbial source tracking study characterizing fecal contamination in an urban watershed. Water Environ Res. 2017;89:127–43. [DOI] [PubMed] [Google Scholar]
  38. Bustin SA, Benes V, Garson JAet al. The MIQE Guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009;55:611–22. [DOI] [PubMed] [Google Scholar]
  39. Byappanahalli MN, Nevers MB, Shively DAet al. Real-time water quality monitoring at a Great Lakes national park. J Environ Qual. 2018;47:1086–93. [DOI] [PubMed] [Google Scholar]
  40. Calgua B, Carratalà A, Guerrero-Latorre Let al. UVC inactivation of dsDNA and ssRNA viruses in water: UV fluences and a qPCR-based approach to evaluate decay on viral infectivity. Food Environ Virol. 2014;6:260–8. [DOI] [PubMed] [Google Scholar]
  41. Callaway E. ‘The entire protein universe’: AI predicts shape of nearly every known protein. DeepMind’s AlphaFold tool has determined the structures of around 200 million proteins. Nature. 2022;608:15–6. [DOI] [PubMed] [Google Scholar]
  42. Cao M, Hu A, Gad Met al. Domestic wastewater causes nitrate pollution in an agricultural watershed, China. Sci Total Environ. 2022;823:153680. [DOI] [PubMed] [Google Scholar]
  43. Cao YP, Sivaganesan M, Kelty CAet al. A human fecal contamination score for ranking recreational sites using the HF183/BacR287 quantitative real-time PCR method. Water Res. 2018;128:148–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Capo E, Giguet-Covex C, Rouillard Aet al. Lake sedimentary DNA research on past terrestrial and aquatic biodiversity: overview and recommendations. Quaternary. 2021;4:6. [Google Scholar]
  45. Carini P, Marsden PJ, Leff JWet al. Relic DNA is abundant in soil and obscures estimates of soil microbial diversity. Nat Microbiol. 2016;2:16242. [DOI] [PubMed] [Google Scholar]
  46. Carrey R, Ballesté E, Blanch ARet al. Combining multi-isotopic and molecular source tracking methods to identify nitrate pollution sources in surface and groundwater. Water Res. 2021;188:116537. [DOI] [PubMed] [Google Scholar]
  47. Cary SC, Fierer N. The importance of sample archiving in microbial ecology. Nat Rev Microbiol. 2014;12:789–90. [DOI] [PubMed] [Google Scholar]
  48. Castellani A, Chalmers AJ. Manual of Tropical Medicine. London: Baillière, Tindall and Cox, 1919. [Google Scholar]
  49. Chan AWY, Naphtali J, Schellhorn HE. High-throughput DNA sequencing technologies for water and wastewater analysis. Sci Prog. 2019;102:351–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Chen W, Wilkes G, Khan IUHet al. Aquatic bacterial communities associated with land use and environmental factors in agricultural landscapes using a metabarcoding approach. Front Microbiol. 2018;9:2301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Chick H. An investigation of the laws of disinfection. J Hyg. 1908;8:92–158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Choi PM, Tscharke BJ, Donner Eet al. Wastewater-based epidemiology biomarkers: past, present and future. TrAC Trends Anal Chem. 2018;105:453–69. [Google Scholar]
  53. Codony F, Dinh-Thanh M, Agustí G. Key factors for removing bias in viability PCR-based methods: a review. Curr Microbiol. 2020;77:682–7. [DOI] [PubMed] [Google Scholar]
  54. Converse RR, Kinzelman JL, Sams EAet al. Dramatic improvements in beach water quality following gull removal. Environ Sci Technol. 2012;46:10206–13. [DOI] [PubMed] [Google Scholar]
  55. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Syst. 2006;1695:1–9. [Google Scholar]
  56. Dahlke HE, Williamson AG, Georgakakos Cet al. Using concurrent DNA tracer injections to infer glacial flow pathways. Hydrol Process. 2015;29:5257–74. [Google Scholar]
  57. Davey HM. Life, death, and in-between: meanings and methods in microbiology. Appl Environ Microbiol. 2011;77:5571–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. De Paoli P. Bio-banking in microbiology: from sample collection to epidemiology, diagnosis and research. FEMS Microbiol Rev. 2005;29:897–910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Demeter K, Derx J, Komma Jet al. Modelling the interplay of future changes and wastewater management measures on the microbiological river water quality considering safe drinking water production. Sci Total Environ. 2021;768:144278. [DOI] [PubMed] [Google Scholar]
  60. Derx J, Demeter K, Linke Ret al. Genetic microbial source tracking support QMRA modeling for a riverine wetland drinking water resource. Front Microbiol. 2021;12:668778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Derx J, Kılıç HS, Linke Ret al. Probabilistic fecal pollution source profiling and microbial source tracking for an urban river catchment. Sci Total Environ. 2023;857:159533. [DOI] [PubMed] [Google Scholar]
  62. Derx J, Schijven J, Sommer Ret al. QMRAcatch: human-associated fecal pollution and infection risk modeling for a river/floodplain environment. J Environ Qual. 2016;45:1205–14. [DOI] [PubMed] [Google Scholar]
  63. Deutscher MP Degradation of RNA in bacteria: comparison of mRNA and stable RNA. Nucleic Acids Res. 2006;34:659–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Devane M, Dupont PY, Robson Bet al. Mobilization of Escherichia coli and fecal source markers from decomposing cowpats. Sci Total Environ. 2022;853:158509. [DOI] [PubMed] [Google Scholar]
  65. Diedrich A, Sivaganesan M, Willis JRet al. Genetic fecal source identification in urban streams impacted by municipal separate storm sewer system discharges. PLoS ONE. 2023;18:e0278548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Domingo JWS, Bambic DG, Edge TAet al. Quo vadis source tracking? Towards a strategic framework for environmental monitoring of fecal pollution. Water Res. 2007;41:3539–52. [DOI] [PubMed] [Google Scholar]
  67. Dorevitch S, Shrestha A, DeFlorio-Barker Set al. Monitoring urban beaches with qPCR vs. culture measures of fecal indicator bacteria: implications for public notification. Environ Health. 2017;16:45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Dorner SM, Anderson WB, Slawson RMet al. Hydrologic modeling of pathogen fate and transport. Environ Sci Technol. 2006;40:4746–53. [DOI] [PubMed] [Google Scholar]
  69. Drexler JZ, Johnson HE, Duris Jet al. Marsh soils as potential sinks for bacteroides fecal indicator bacteria, Waccamaw National Wildlife Refuge, Georgetown, SC, USA. Water Air Soil Pollut. 2014;225:1861. [Google Scholar]
  70. Ebentier DL, Hanley KT, Cao YPet al. Evaluation of the repeatability and reproducibility of a suite of qPCR-based microbial source tracking methods. Water Res. 2013;47:6839–48. [DOI] [PubMed] [Google Scholar]
  71. Emerson JB, Adams RI, Román CMBet al. Schrödinger’s microbes: tools for distinguishing the living from the dead in microbial ecosystems. Microbiome. 2017;5:86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Ervin JS, Russell TL, Layton BAet al. Characterization of fecal concentrations in human and other animal sources by physical, culture-based, and quantitative real-time PCR methods. Water Research. Vol. 47, 2013, 6873–82. ISSN 0043-1354, 10.1016/j.watres.2013.02.060. [DOI] [PubMed] [Google Scholar]
  73. Escherich T Die Darmbakterien des Säuglings und ihre Beziehungen zur Physiologie der Verdauung. Stuttgart: Verlag von Ferdinand Enke, 1886. [Google Scholar]
  74. Etienne D, Destas M, Lyautey Eet al. Two thousand–year reconstruction of livestock production intensity in France using sediment-archived fecal Bacteroidales and source-specific mitochondrial markers. The Holocene. 2015;25:1384–93. [Google Scholar]
  75. European Union . Regulation 2020/741 of the European Parliament and of the Council of 25 May 2020 on minimum requirements for water reuse. 2020. [Google Scholar]
  76. Farkas K, Cooper DM, McDonald JEet al. Seasonal and spatial dynamics of enteric viruses in wastewater and in riverine and estuarine receiving waters. Sci Total Environ. 2018;634:1174–83. [DOI] [PubMed] [Google Scholar]
  77. Farkas K, Walker DI, Adriaenssens EMet al. Viral indicators for tracking domestic wastewater contamination in the aquatic environment. Water Res. 2020;181:115926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Farnleitner AH, Ryzinska-Paier G, Reischer GHet al. Escherichia coli and enterococci are sensitive and reliable indicators for human, livestock and wildlife faecal pollution in alpine mountainous water resources. J Appl Microbiol. 2010;109:1599–608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Farnleitner AH, Savio D, Sommer Ret al. Integrated strategy to guide health-related microbial quality management at alpine karstic drinking water resources. In: White WB, Herman JS, Herman EK, Rutigliano M (eds), Karst Groundwater Contamination and Public Health. Berlin: Springer, 2018. [Google Scholar]
  80. Fauvel B, Ogorzaly L, Cauchie HMet al. Interactions of infectious F-specific RNA bacteriophages with suspended matter and sediment: towards an understanding of FRNAPH distribution in a river water system. Sci Total Environ. 2017;574:960–8. [DOI] [PubMed] [Google Scholar]
  81. Fenech C, Rock L, Nolan Ket al. The potential for a suite of isotope and chemical markers to differentiate sources of nitrate contamination: a review. Water Res. 2012;46:2023–41. [DOI] [PubMed] [Google Scholar]
  82. Ferretti JA, Tran HV, Peterson SJet al. Rapid method demonstration project at four New Jersey marine beaches using real time quantitative polymerase chain reaction (qPCR). Mar Pollut Bull. 2013;71:51–63. [DOI] [PubMed] [Google Scholar]
  83. Fittipaldi M, Nocker A, Codony F. Progress in understanding preferential detection of live cells using viability dyes in combination with DNA amplification. J Microbiol Methods. 2012;91:276–89. [DOI] [PubMed] [Google Scholar]
  84. Forés E, Rusiñol M, Itarte Met al. Evaluation of a virus concentration method based on ultrafiltration and wet foam elution for studying viruses from large-volume water samples. Sci Total Environ. 2022;829:154431. [DOI] [PubMed] [Google Scholar]
  85. Frick C, Vierheilig J, Nadiotis-Tsaka Tet al. Elucidating fecal pollution patterns in alluvial water resources by linking standard fecal indicator bacteria to river connectivity and genetic microbial source tracking. Water Res. 2020;184:12. [DOI] [PubMed] [Google Scholar]
  86. Gao H, Zhang L, Lu Zet al. Complex migration of antibiotic resistance in natural aquatic environments. Environ Pollut. 2018;232:1–9. [DOI] [PubMed] [Google Scholar]
  87. García-Aljaro C, Ballesté E, Muniesa Met al. Determination of crAssphage in water samples and applicability for tracking human faecal pollution. Microb Biotechnol. 2017;10:1775–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. García-Aljaro C, Blanch AR, Campos Cet al. Pathogens, faecal indicators and human-specific microbial source-tracking markers in sewage. J Appl Microbiol. 2018;126:701–17. [DOI] [PubMed] [Google Scholar]
  89. Garner E, Davis BC, Milligan Eet al. Next generation sequencing approaches to evaluate water and wastewater quality. Water Res. 2021;194:116907. [DOI] [PubMed] [Google Scholar]
  90. Geldreich EE, Kenner BE. Concepts of fecal streptococci in stream pollution. J Water Pollut Control Fed. 1969;41:R336–52. [PubMed] [Google Scholar]
  91. Geldreich EE. Sanitary significance of fecal coliforms in the environment. Water Pollution Control Research Series Publication WP-20–3. Columbus: US Department of the Interior Cincinnati, Ohio Federal Water Pollution Control Administration, 1966. [Google Scholar]
  92. Gerba CP, Betancourt WQ, Kitajima Met al. Reducing uncertainty in estimating virus reduction by advanced water treatment processes. Water Res. 2018;133:282–8. [DOI] [PubMed] [Google Scholar]
  93. González-Fernández A, Symonds EM, Gallard-Gongora JFet al. Relationships among microbial indicators of fecal pollution, microbial source tracking markers, and pathogens in Costa Rican coastal waters. Water Res. 2021;188:116507. [DOI] [PubMed] [Google Scholar]
  94. González-Saldía RR, Pino-Maureira NL, Muñoz Cet al. Fecal pollution source tracking and thalassogenic diseases: the temporal-spatial concordance between maximum concentrations of human mitochondrial DNA in seawater and Hepatitis A outbreaks among a coastal population. Sci Total Environ. 2019;686:158–70. [DOI] [PubMed] [Google Scholar]
  95. Gourse RL, Gaal T, Bartlett MSet al. rRNA transcription and growth rate-dependent regulation of ribosome synthesis in Escherichia coli. Annu Rev Microbiol. 1996;50:645–77. [DOI] [PubMed] [Google Scholar]
  96. Grabow W. Bacteriophages: update on application as models for viruses in water. Water SA. 2001;27:251–68. [Google Scholar]
  97. Green H, Wilder M, Wiedmann Met al. Integrative survey of 68 non-overlapping upstate New York watersheds reveals stream features associated with aquatic fecal contamination. Front Microbiol. 2021;12:684533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Green HC, Haugland RA, Varma Met al. Improved HF183 quantitative real-time PCR assay for characterization of human fecal pollution in ambient surface water samples. Appl Environ Microbiol. 2014a;80:3086–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Green HC, White KM, Kelty CAet al. Development of rapid canine fecal source identification PCR-based assays. Environ Sci Technol. 2014b;48:11453–61. [DOI] [PubMed] [Google Scholar]
  100. Griffith JF, Weisberg SB, Arnold BFet al. Epidemiologic evaluation of multiple alternate microbial water quality monitoring indicators at three California beaches. Water Res. 2016;94:371–81. [DOI] [PubMed] [Google Scholar]
  101. Guerrero-Latorre L, Gonzales-Gustavson E, Hundesa Aet al. UV disinfection and flocculation-chlorination sachets to reduce hepatitis E virus in drinking water. International Journal of Hygiene and Environmental Health. Vol. 219, 2016, 405–411. ISSN 1438-4639, 10.1016/j.ijheh.2016.04.002. [DOI] [PubMed] [Google Scholar]
  102. Hachad M, Lanoue M, Vo Duy Set al. Locating illicit discharges in storm sewers in urban areas using multi-parameter source tracking: field validation of a toolbox composite index to prioritize high risk areas. Sci Total Environ. 2022;811:152060. [DOI] [PubMed] [Google Scholar]
  103. Hamza IA, Jurzik L, Uberla Ket al. Evaluation of pepper mild mottle virus, human picobirnavirus and Torque teno virus as indicators of fecal contamination in river water. Water Res. 2011;45:1358–68. [DOI] [PubMed] [Google Scholar]
  104. Haramoto E, Kitajima M, Hata Aet al. A review on recent progress in the detection methods and prevalence of human enteric viruses in water. Water Res. 2018;135:168–86. [DOI] [PubMed] [Google Scholar]
  105. Harwood VJ, Shanks O, Koraijkic Aet al. General and host-associated bacterial indicators of faecal pollution. In: Rose JB, Jimenez Cisneros BE (eds), Global Water Pathogen Project. Lansing: Michigan State University, UNESCO, 2018. [Google Scholar]
  106. Hassard F, Gwyther CL, Farkas Ket al. Abundance and distribution of enteric bacteria and viruses in coastal and estuarine sediments-a review. Front Microbiol. 2016;7:31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Haugland R, Oshima K, Sivaganesan Met al. Large-scale comparison of E. coli levels determined by culture and a qPCR method (EPA Draft Method C) in Michigan towards the implementation of rapid, multi-site beach testing. J Microbiol Methods. 2021;184:106186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Haugland RA, Siefring SD, Varma Met al. Standardization of enterococci density estimates by EPA qPCR methods and comparison of beach action value exceedances in river waters with culture methods. J Microbiol Methods. 2014;105:59–66. [DOI] [PubMed] [Google Scholar]
  109. Heaney CD, Myers K, Wing Set al. Source tracking swine fecal waste in surface water proximal to swine concentrated animal feeding operations. Sci Total Environ. 2015;511:676–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Heijnen L, Medema G. Method for rapid detection of viable Escherichia coli in water using real-time NASBA. Water Res. 2009;43:3124–32. [DOI] [PubMed] [Google Scholar]
  111. Hijnen WAM, Dullemont YJ, Schijven JFet al. Removal and fate of Cryptosporidium paruum, Clostridium perfringens and small-sized centric diatoms (Stephanodiscus hantzschii) in slow sand filters. Water Res. 2007;41:2151–62. [DOI] [PubMed] [Google Scholar]
  112. Hoff JC, Akin EW. Microbial resistance to disinfectants: mechanisms and significance. Environ Health Perspect. 1986;69:7–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Holcomb DA, Knee J, Capone Det al. Impacts of an urban sanitation intervention on fecal indicators and the prevalence of human fecal contamination in Mozambique. Environ Sci Technol. 2021;55:11667–79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Holcomb DA, Knee J, Sumner Tet al. Human fecal contamination of water, soil, and surfaces in households sharing poor-quality sanitation facilities in Maputo, Mozambique. Int J Hyg Environ Health. 2020;226:113496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Hu A, Wang H, Li Jet al. Homogeneous selection drives antibiotic resistome in two adjacent sub-watersheds, China. J Hazard Mater. 2020;398:122820. [DOI] [PubMed] [Google Scholar]
  116. Hughes B, Dennis P, Beale Det al. Cross-comparison of human wastewater-associated molecular markers in relation to fecal indicator bacteria and enteric viruses in recreational beach waters. Appl Environ Microbiol. 2017;83:e00028–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Ishii S, Ksoll WB, Hicks REet al. Presence and growth of naturalized Escherichia coli in temperate soils from Lake Superior watersheds. Appl Environ Microbiol. 2006;72:612–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Ishii S, Sadowsky MJ. Escherichia coli in the environment: implications for water quality and human health. Microbes Environ. 2008;23:101–8. [DOI] [PubMed] [Google Scholar]
  119. ISO . ISO 16649-1:2018: microbiology of food and animal feeding stuffs – horizontal method for the enumeration of beta-glucuronidase-positive Escherichia coli – Part 1: colony-count technique at 44 degrees C using membranes and 5-bromo-4-chloro-3-indolyl beta-D-glucuronide. 2018, 9.
  120. ISO . ISO 7899-1:1998 water quality—detection and enumeration of intestinal enterococci in surface and waste water—Part 1: miniaturized method (most probable number) by inoculation in liquid medium. 1998a.
  121. ISO . ISO 9308-3:1998 water quality—detection and enumeration of Escherichia coli and coliform bacteria – Part 3: miniaturized method (most probable number) for the detection and enumeration of E. coli in surface and waste water. 1998b.
  122. ISO . ISO 7899-2:2000 water quality—detection and enumeration of intestinal enterococci—Part 2: membrane filtration method. 2000.
  123. ISO . ISO 9308-1:2014 water quality—enumeration of Escherichia coli and coliform bacteria—Part 1: membrane filtration method for waters with low bacterial background flora. 2014.
  124. ISO . ISO 9308-2:2012 water quality—enumeration of Escherichia coli and coliform bacteria—Part 2: most probable number method. 2012.
  125. Jackson J, Laikre L, Baker Cet al. Guidelines for collecting and maintaining archives for genetic monitoring. Conser Genet Resour. 2011;4:527–56. [Google Scholar]
  126. Jager T, Alexander J, Kirchen Set al. Live-dead discrimination analysis, qPCR assessment for opportunistic pathogens, and population analysis at ozone wastewater treatment plants. Environ Pollut. 2018;232:571–9. [DOI] [PubMed] [Google Scholar]
  127. Jamieson RC, Joy DM, Lee Het al. Resuspension of sediment-associated Escherichia coli in a natural stream. J Environ Qual. 2005;34:581–9. [DOI] [PubMed] [Google Scholar]
  128. Jennings WC, Chern EC, O'Donohue Det al. Frequent detection of a human fecal indicator in the urban ocean: environmental drivers and covariation with enterococci. Environ Sci-Proc Imp. 2018;20:480–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Jiang L, Chen YC, Zhu DJet al. Faecal coliform attachment to settleable suspended sediments in fresh surface waters: linear partition model validation and sediment concentration effects. Water Supply. 2015;15:864–70. [Google Scholar]
  130. Jiang SC, Han M, Chandrasekaran Set al. Assessing the water quality impacts of two category-5 hurricanes on St. Thomas, Virgin Islands. Water Res. 2020;171:115440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  131. Jofre J, Blanch AR. Feasibility of methods based on nucleic acid amplification techniques to fulfil the requirements for microbiological analysis of water quality. J Appl Microbiol. 2010;109:1853–67. [DOI] [PubMed] [Google Scholar]
  132. Jofre J, Lucena F, Blanch ARet al. Coliphages as model organisms in the characterization and management of water resources. Water-Sui. 2016;8:199. [Google Scholar]
  133. Kapoor V, Gupta I, Pasha ABMTet al. Real-time quantitative PCR measurements of fecal indicator bacteria and human-associated source tracking markers in a Texas river following Hurricane Harvey. Environ Sci Technol Lett. 2018;5:322–8. [Google Scholar]
  134. Kauppinen A, Pitkänen T, Al-Hello Het al. Two drinking water outbreaks caused by wastewater intrusion including sapovirus in Finland. Int J Env Res Pub He. 2019;16:4376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  135. Kendall C, Elliott EM, Wankel SD. Tracing anthropogenic inputs of nitrogen to ecosystems. In: Michener R, Lajtha K (eds), Stable Isotopes in Ecology and Environmental Science. 2nd edn. Oxford: Blackwell Publishing Ltd, 2007, 375–449. [Google Scholar]
  136. Kildare BJ, Leutenegger CM, McSwain BSet al. 16S rRNA-based assays for quantitative detection of universal, human-, cow-, and dog-specific fecal Bacteroidales: a Bayesian approach. Water Res. 2007;41:3701–15. [DOI] [PubMed] [Google Scholar]
  137. Kim J-W, Pachepsky YA, Shelton DRet al. Effect of streambed bacteria release on E. coli concentrations: monitoring and modeling with the modified SWAT. Ecol Model. 2010;221:1592–604. [Google Scholar]
  138. Kirschner AKT, Reischer GH, Jakwerth Set al. Multiparametric monitoring of microbial faecal pollution reveals the dominance of human contamination along the whole Danube River. Water Res. 2017;124:543–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  139. Kirschner AKT, Vierheilig J, Flemming HCet al. How dead is dead? Viable but non-culturable versus persister cells. Environ Microbiol Rep. 2021;13:243–5. [DOI] [PubMed] [Google Scholar]
  140. Kirschner AKT, Zechmeister TC, Kavka GGet al. Integral strategy for evaluation of fecal indicator performance in bird-influenced saline inland waters. Appl Environ Microbiol. 2004;70:7396–403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  141. Kitajima M, Sassi HP, Torrey JR. Pepper mild mottle virus as a water quality indicator. npj Clean Water. 2018;1:9. [Google Scholar]
  142. Kjellander J. Enteric streptococci as indicators of fecal contamination of water. Acta Pathol Microbiol ScandSuppl. 1960;48:3–124. [PubMed] [Google Scholar]
  143. Klase G, Lee S, Liang Set al. The microbiome and antibiotic resistance in integrated fishfarm water: implications of environmental public health. Sci Total Environ. 2019;649:1491–501. [DOI] [PubMed] [Google Scholar]
  144. Koch R. Zur Untersuchung von pathogenen Organismen (1881). In: Gradmann C (ed.), Robert Koch: Zentrale Texte. Berlin, Heidelberg: Springer, 1881, 45–111. [Google Scholar]
  145. Kolarević S, Micsinai A, Szántó-Egész Ret al. Wastewater-based epidemiology in countries with poor wastewater treatment—epidemiological indicator function of SARS-CoV-2 RNA in surface waters. Sci Total Environ. 2022;843:156964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  146. Kolm C, Martzy R, Brunner Ket al. A complementary isothermal amplification method to the U.S. EPA quantitative polymerase chain reaction approach for the detection of Enterococci in environmental waters. Environ Sci Technol. 2017;51:7028–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Kongprajug A, Chyerochana N, Rattanakul Set al. Integrated analyses of fecal indicator bacteria, microbial source tracking markers, and pathogens for Southeast Asian beach water quality assessment. Water Res. 2021;203:117479. [DOI] [PubMed] [Google Scholar]
  148. Korajkic A, Badgley BD, Brownell MJet al. Application of microbial source tracking methods in a Gulf of Mexico field setting. J Appl Microbiol. 2009;107:1518–27. [DOI] [PubMed] [Google Scholar]
  149. Korajkic A, Wanjugi P, Brooks Let al. Persistence and decay of fecal microbiota in aquatic habitats. Microbiol Mol Biol Rev. 2019;83:26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  150. Kralj J, Servetas S, Hunter Met al. Certification of Standard Reference Material® 2917: Plasmid DNA for Fecal Indicator Detection and Identification. Special Publication (NIST SP). Gaithersburg: National Institute of Standards and Technology, 2021. [Google Scholar]
  151. Kuroda K, Nakada N, Hanamoto Set al. Pepper mild mottle virus as an indicator and a tracer of fecal pollution in water environments: comparative evaluation with wastewater-tracer pharmaceuticals in Hanoi, Vietnam. Sci Total Environ. 2015;506-507:287–98. [DOI] [PubMed] [Google Scholar]
  152. Layton BA, Cao YP, Ebentier DLet al. Performance of human fecal anaerobe-associated PCR-based assays in a multi-laboratory method evaluation study. Water Res. 2013;47:6897–908. [DOI] [PubMed] [Google Scholar]
  153. Lazou TP, Iossifidou EG, Gelasakis AIet al. Viability quantitative PCR utilizing propidium monoazide, spheroplast formation, and Campylobacter coli as a bacterial model. Appl Environ Microbiol. 2019;85:e01499–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  154. Leifels M, Cheng D, Sozzi Eet al. Capsid integrity quantitative PCR to determine virus infectivity in environmental and food applications – a systematic review. Water Res X. 2021;11:100080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  155. Levine M. Bacteria fermenting lactose and their significance in water analysis. In: Iowa State College of Agriculture and Mechanic Arts Official Publication. 20: Bulletin 62. Ames: Iowa State University, 1921. [Google Scholar]
  156. Ley RE, Lozupone CA, Hamady Met al. Worlds within worlds: evolution of the vertebrate gut microbiota. Nat Rev Microbiol. 2008;6:776–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  157. Li EZ, Saleem F, Edge TAet al. Biological indicators for fecal pollution detection and source tracking: a review. Processes. 2021a;9:2058. [Google Scholar]
  158. Li L-G, Yin X, Zhang T. Tracking antibiotic resistance gene pollution from different sources using machine-learning classification. Microbiome. 2018;6:93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  159. Li X, Kelty CA, Sivaganesan Met al. Variable fecal source prioritization in recreational waters routinely monitored with viral and bacterial general indicators. Water Res. 2021b;192:116845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  160. Li X, Sivaganesan M, Kelty CAet al. Large-scale implementation of standardized quantitative real-time PCR fecal source identification procedures in the Tillamook Bay watershed. PLoS ONE. 2019;14:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  161. Liang G, Bushman FD. The human virome: assembly, composition and host interactions. Nat Rev Microbiol. 2021;19:514–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  162. Liang H, Yu Z, Wang Bet al. Synergistic application of molecular markers and community-based microbial source tracking methods for identification of fecal pollution in river water during dry and wet seasons. Front Microbiol. 2021;12:660368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  163. Linke RB, Kebede G, Mushi Det al. Assessing the faecal source sensitivity and specificity of ruminant and human genetic microbial source tracking markers in the central Ethiopian highlands. Lett Appl Microbiol. 2021;72:458–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  164. Liu P, Hill VR, Hahn Det al. Hollow-fiber ultrafiltration for simultaneous recovery of viruses, bacteria and parasites from reclaimed water. J Microbiol Methods. 2012;88:155–61. [DOI] [PubMed] [Google Scholar]
  165. Lorenzo M, Picó Y. Wastewater-based epidemiology: current status and future prospects. Curr Opin Environ Sci Health. 2019;9:77–84. [Google Scholar]
  166. Lu JR, Ryu HD, Hill Set al. Distribution and potential significance of a gull fecal marker in urban coastal and riverine areas of southern Ontario, Canada. Water Res. 2011;45:3960–8. [DOI] [PubMed] [Google Scholar]
  167. Lu Z, Imlay JA. When anaerobes encounter oxygen: mechanisms of oxygen toxicity, tolerance and defence. Nat Rev Microbiol. 2021;19:774–785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  168. Maidana-Kulesza MN, Poma HR, Sanguino-Jorquera DGet al. Tracking SARS-CoV-2 in rivers as a tool for epidemiological surveillance. Sci Total Environ. 2022;848:157707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  169. Malakoff D. Water quality. microbiologists on the trail of polluting bacteria. Science. 2002;295:2352–3. [DOI] [PubMed] [Google Scholar]
  170. Malla B, Haramoto E. Host-specific mitochondrial DNA markers for tracking the sources of fecal pollution. Curr Opin Environ Sci Health. 2020;16:34–46. [Google Scholar]
  171. Mallott EK, Amato KR. Host specificity of the gut microbiome. Nat Rev Microbiol. 2021;19:639–53. [DOI] [PubMed] [Google Scholar]
  172. Mara D, Oragui J. Sorbitol-fermenting bifidobacteria as specific indicators of human faecal pollution. J Appl Bacteriol. 1983;55:349–57. [DOI] [PubMed] [Google Scholar]
  173. Martellini A, Payment P, Villemur R. Use of eukaryotic mitochondrial DNA to differentiate human, bovine, porcine and ovine sources in fecally contaminated surface water. Water Res. 2005;39:541–8. [DOI] [PubMed] [Google Scholar]
  174. Martzy R, Kolm C, Brunner Ket al.  A loop-mediated isothermal amplification (LAMP) assay for the rapid detection of Enterococcus spp. in water. Water Research. Vol. 122, 2017, 62–69. ISSN 0043-1354, 10.1016/j.watres.2017.05.023. [DOI] [PubMed] [Google Scholar]
  175. Mathai PP, Staley C, Sadowsky MJ. Sequence-enabled community-based microbial source tracking in surface waters using machine learning classification: a review. J Microbiol Methods. 2020;177:106050. [DOI] [PubMed] [Google Scholar]
  176. Mattioli MC, Benedict KM, Murphy Jet al. Identifying septic pollution exposure routes during a waterborne norovirus outbreak – a new application for human-associated microbial source tracking qPCR. J Microbiol Methods. 2021;180:106091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  177. Mayer RE, Reischer GH, Ixenmaier SKet al. Global distribution of human-associated fecal genetic markers in reference samples from six continents. Environ Sci Technol. 2018;52:5076–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  178. McLellan SL, Eren AM. Discovering new indicators of fecal pollution. Trends Microbiol. 2014;22:697–706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  179. McLellan SL, Sauer EP, Corsi SRet al. Sewage loading and microbial risk in urban waters of the Great Lakes. Elementa. 2018;6:46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  180. Mieszkin S, Caprais MP, Le Mennec Cet al. Identification of the origin of faecal contamination in estuarine oysters using Bacteroidales and F-specific RNA bacteriophage markers. J Appl Microbiol. 2013;115:897–907. [DOI] [PubMed] [Google Scholar]
  181. Mieszkin S, Furet JP, Corthier Get al. Estimation of pig fecal contamination in a river catchment by real-time PCR using two pig-specific Bacteroidales 16S rRNA genetic markers. Appl Environ Microbiol. 2009;75:3045–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  182. Min J, Baeumner AJ. Highly sensitive and specific detection of viable Escherichia coli in drinking water. Anal Biochem. 2002;303:186–93. [DOI] [PubMed] [Google Scholar]
  183. Mitchell J, Akram S.Pathogen Specific Persistence Modeling Data. In: Rose JB, Jiménez-Cisneros B, (eds). Water and Sanitation for the 21st Century: Health and Microbiological Aspects of Excreta and Wastewater Management (Global Water Pathogen Project). MI, UNESCO: Michigan State University, E. Lansing, 2017. [Google Scholar]
  184. Mitranescu A, Uchaikina A, Kau A-Set al. Wastewater-Based Epidemiology for SARS-CoV-2 Biomarkers: Evaluation of Normalization Methods in Small and Large Communities in Southern Germany. Washington: ACS ES&T Water, 2022. [DOI] [PubMed] [Google Scholar]
  185. Momba M, Edbon J, Kamika Iet al. Using indicators to assess microbial treatment and disinfection efficacy. In: Rose JB, Jimenez Cisneros BE (eds), Global Water Pathogen Project. Lansing: Michigan State University, UNESCO, 2019. [Google Scholar]
  186. Mott J, Smith A. Library-Dependent Source Tracking Methods. Microbial Source Tracking: Methods, Applications, and Case Studies. Hagedorn C, Blanch AR, Harwood V (eds.), Berlin: Springer, 2011, 480. [Google Scholar]
  187. Mushi D, Byamukama D, Kivaisi AKet al. Sorbitol-fermenting Bifidobacteria are indicators of very recent human faecal pollution in streams and groundwater habitats in urban tropical lowlands. J Water Health. 2010;8:466–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  188. Nagarkar M, Keely SP, Jahne Met al. SARS-CoV-2 monitoring at three sewersheds of different scales and complexity demonstrates distinctive relationships between wastewater measurements and COVID-19 case data. Sci Total Environ. 2022;816:151534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  189. Napier MD, Haugland R, Poole Cet al. Exposure to human-associated fecal indicators and self-reported illness among swimmers at recreational beaches: a cohort study. Environ Health. 2017;16:103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  190. Newton RJ, Bootsma MJ, Morrison HGet al. A microbial signature approach to identify fecal pollution in the waters off an urbanized coast of Lake Michigan. Microb Ecol. 2013;65:1011–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  191. Newton RJ, McLellan SL, Dila DKet al. Sewage reflects the microbiomes of human populations. Mbio. 2015;6:e02574–02514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  192. Nguyen KH, Senay C, Young Set al. Determination of wild animal sources of fecal indicator bacteria by microbial source tracking (MST) influences regulatory decisions. Water Res. 2018;144:424–34. [DOI] [PubMed] [Google Scholar]
  193. Nieuwkerk DM, Korajkic A, Valdespino ELet al. Critical review of methods for isothermal amplification of nucleic acids for environmental analysis. J Microbiol Methods. 2020;179:106099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  194. Noble RT, Weisberg SB. A review of technologies for rapid detection of bacteria in recreational waters. J Water Health. 2005;3:381–92. [DOI] [PubMed] [Google Scholar]
  195. Nocker A, Cheung CY, Camper AK. Comparison of propidium monoazide with ethidium monoazide for differentiation of live vs. dead bacteria by selective removal of DNA from dead cells. J Microbiol Methods. 2006;67:310–20. [DOI] [PubMed] [Google Scholar]
  196. Ognyanova K. Network visualization with R. 2021. https://www.kateto.net/network-visualization (24 July 2022, date last accessed).
  197. Pang L, Heiligenthal L, Premaratne Aet al. Degradation and adsorption of synthetic DNA water tracers in environmental matrices. Sci Total Environ. 2022;844:157146. [DOI] [PubMed] [Google Scholar]
  198. Pang LP, Farkas K, Bennett Get al. Mimicking filtration and transport of rotavirus and adenovirus in sand media using DNA-labeled, protein-coated silica nanoparticles. Water Res. 2014;62:167–79. [DOI] [PubMed] [Google Scholar]
  199. Park Y, Pachepsky Y, Hong E-Met al. Escherichia coli release from streambed to water column during baseflow periods: a modeling study. J Environ Qual. 2017;46:219–26. [DOI] [PubMed] [Google Scholar]
  200. Paruch L, Paruch AM, Sørheim R. DNA-based faecal source tracking of contaminated drinking water causing a large Campylobacter outbreak in Norway 2019. Int J Hyg Environ Health. 2020;224:113420. [DOI] [PubMed] [Google Scholar]
  201. Pascual-Benito M, Nadal-Sala D, Tobella Met al. Modelling the seasonal impacts of a wastewater treatment plant on water quality in a Mediterranean stream using microbial indicators. J Environ Manage. 2020;261:110220. [DOI] [PubMed] [Google Scholar]
  202. Pecson BM, Martin LV, Kohn T. Quantitative PCR for determining the infectivity of bacteriophage MS2 upon inactivation by heat, UV-B radiation, and singlet oxygen: advantages and limitations of an enzymatic treatment to reduce false-positive results. Appl Environ Microbiol. 2009;75:5544–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  203. Peed LA, Nietch CT, Kelty CAet al. Combining land use information and small stream sampling with PCR-based methods for better characterization of diffuse sources of human fecal pollution. Environ Sci Technol. 2011;45:5652–9. [DOI] [PubMed] [Google Scholar]
  204. Perry CA, Bayliss M. Escherichia coli as an indicator of fecal pollution in oysters and oyster waters. Am J Pub Health Nations Health. 1936;26:406–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  205. Pitkänen T, Ryu H, Elk Met al. Detection of fecal bacteria and source tracking identifiers in environmental waters using rRNA-based RT-qPCR and rDNA-based qPCR assays. Environ Sci Technol. 2013;47:13611–20. [DOI] [PubMed] [Google Scholar]
  206. Popa LI, Gheorghe I, Barbu ICet al. Multidrug resistant Klebsiella pneumoniae ST101 clone survival chain from inpatients to hospital effluent after chlorine treatment. Front Microbiol. 2021;11:610296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  207. Puig M, Jofre J, Lucena Fet al. Detection of adenoviruses and enteroviruses in polluted waters by nested PCR amplification. Appl Environ Microbiol. 1994;60:2963–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  208. Raza S, Kim J, Sadowsky MJet al. Microbial source tracking using metagenomics and other new technologies. J Microbiol. 2021;59:259–69. [DOI] [PubMed] [Google Scholar]
  209. Reischer GH, Ebdon JE, Bauer JMet al. Performance characteristics of qPCR assays targeting human- and ruminant-associated Bacteroidetes for microbial source tracking across sixteen countries on six continents. Environ Sci Technol. 2013;47:8548–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  210. Reischer GH, Haider JM, Sommer Ret al. Quantitative microbial faecal source tracking with sampling guided by hydrological catchment dynamics. Environ Microbiol. 2008;10:2598–608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  211. Reischer GH, Kasper DC, Steinborn Ret al. Quantitative PCR method for sensitive detection of ruminant fecal pollution in freshwater and evaluation of this method in alpine karstic regions. Appl Environ Microbiol. 2006;72:5610–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  212. Reischer GH, Kollanur D, Vierheilig Jet al. Hypothesis-driven approach for the identification of fecal pollution sources in water resources. Environ Sci Technol. 2011;45:4038–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  213. Rhodes ER, Hamilton DW, See MJet al. Evaluation of hollow-fiber ultrafiltration primary concentration of pathogens and secondary concentration of viruses from water. J Virol Methods. 2011;176:38–45. [DOI] [PubMed] [Google Scholar]
  214. Rosario K, Symonds Erin M, Sinigalliano Cet al. Pepper mild mottle virus as an indicator of fecal pollution. Appl Environ Microbiol. 2009;75:7261–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  215. Sabar MA, Honda R, Haramoto E. CrAssphage as an indicator of human-fecal contamination in water environment and virus reduction in wastewater treatment. Water Res. 2022;221:118827. [DOI] [PubMed] [Google Scholar]
  216. Sales-Ortells H, Medema G. Microbial health risks associated with exposure to stormwater in a water plaza. Water Res. 2015;74:34–46. [DOI] [PubMed] [Google Scholar]
  217. Savio D, Stadler P, Reischer GHet al. Opening the black box of spring water microbiology from alpine karst aquifers to support proactive drinking water resource management. Wiley Interdisc Rev Water. 2018;5:e1282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  218. Schijven J, Derx J, Husman AMDet al. QMRAcatch: microbial quality simulation of water resources including infection risk assessment. J Environ Qual. 2015;44:1491–502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  219. Schijven JF, Hassanizadeh SM. Removal of viruses by soil passage: overview of modeling, processes, and parameters. Crit Rev Env Sci Tec. 2000;30:49–127. [Google Scholar]
  220. Schill WB, Mathes MV. Real-time PCR detection and quantification of nine potential sources of fecal contamination by analysis of mitochondrial cytochrome b targets. Environ Sci Technol. 2008;42:5229–34. [DOI] [PubMed] [Google Scholar]
  221. Schmitz BW, Kitajima M, Campillo MEet al. Virus reduction during advanced Bardenpho and conventional wastewater treatment processes. Environ Sci Technol. 2016;50:9524–32. [DOI] [PubMed] [Google Scholar]
  222. Schoen ME, Boehm AB, Soller Jet al. Contamination scenario matters when using viral and bacterial human-associated genetic markers as indicators of a health risk in untreated sewage-impacted recreational waters. Environ Sci Technol. 2020;54:13101–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  223. Schriewer A, Miller WA, Byrne BAet al. Presence of Bacteroidales as a predictor of pathogens in surface waters of the central California coast. Appl Environ Microbiol. 2010;76:5802–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  224. Scott TM, Rose JB, Jenkins TMet al. Microbial source tracking: current methodology and future directions. Appl Environ Microbiol. 2002;68:5796–803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  225. Seurinck S, Defoirdt T, Verstraete Wet al. Detection and quantification of the human-specific HF183 Bacteroides 16S rRNA genetic marker with real-time PCR for assessment of human faecal pollution in freshwater. Environ Microbiol. 2005;7:249–59. [DOI] [PubMed] [Google Scholar]
  226. Shanks OC, Atikovic E, Blackwood ADet al. Quantitative PCR for detection and enumeration of genetic markers of bovine fecal pollution. Appl Environ Microbiol. 2008;74:745–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  227. Shanks OC, Kelty CA, Oshiro Ret al. Data acceptance criteria for standardized human-associated fecal source identification quantitative real-time PCR methods. Appl Environ Microbiol. 2016;82:2773–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  228. Shanks OC, Newton RJ, Kelty CAet al. Comparison of the microbial community structures of untreated wastewaters from different geographic locales. Appl Environ Microbiol. 2013;79:2906–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  229. Shanks OC, Nietch C, Simonich Met al. Basin-wide analysis of the dynamics of fecal contamination and fecal source identification in Tillamook Bay, Oregon. Appl Environ Microbiol. 2006;72:5537–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  230. Shanks OC, White K, Kelty CAet al. Performance assessment PCR-based assays targeting Bacteroidales genetic markers of bovine fecal pollution. Appl Environ Microbiol. 2010a;76:1359–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  231. Shanks OC, White K, Kelty CAet al. Performance of PCR-based assays targeting Bacteroidales genetic markers of human fecal pollution in sewage and fecal samples. Environ Sci Technol. 2010b;44:6281–8. [DOI] [PubMed] [Google Scholar]
  232. Shrestha A, Dorevitch S. Slow adoption of rapid testing: beach monitoring and notification using qPCR. J Microbiol Methods. 2020;174:105947. [DOI] [PubMed] [Google Scholar]
  233. Shrestha A, Kelty CA, Sivaganesan Met al. Fecal pollution source characterization at non-point source impacted beaches under dry and wet weather conditions. Water Res. 2020;182:116014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  234. Sivaganesan M, Aw TG, Briggs Set al. Standardized data quality acceptance criteria for a rapid Escherichia coli qPCR method (Draft Method C) for water quality monitoring at recreational beaches. Water Res. 2019;156:456–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  235. Sivaganesan M, Willis JR, Karim Met al. Interlaboratory performance and quantitative PCR data acceptance metrics for NIST SRM (R) 2917. Water Res. 2022;225:119162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  236. Smith T, Krometis LAH, Hagedorn Cet al. Associations between fecal indicator bacteria prevalence and demographic data in private water supplies in Virginia. J Water Health. 2014;12:824–34. [DOI] [PubMed] [Google Scholar]
  237. Sokolova E, Astrom J, Pettersson TJRet al. Estimation of pathogen concentrations in a drinking water source using hydrodynamic modelling and microbial source tracking. J Water Health. 2012;10:358–70. [DOI] [PubMed] [Google Scholar]
  238. Srinivasan S, Aslan A, Xagoraraki Iet al. Escherichia coli, enterococci, and Bacteroides thetaiotaomicron qPCR signals through wastewater and septage treatment. Water Res. 2011;45:2561–72. [DOI] [PubMed] [Google Scholar]
  239. Stachler E, Kelty C, Sivaganesan Met al. Quantitative CrAssphage PCR assays for human fecal pollution measurement. Environ Sci Technol. 2017;51:9146–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  240. Staley C, Gordon KV, Schoen MEet al. Performance of two quantitative PCR methods for microbial source tracking of human sewage and implications for microbial risk assessment in recreational waters. Appl Environ Microbiol. 2012;78:7317–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  241. Staley ZR, Boyd RJ, Shum Pet al. Microbial source tracking using quantitative and digital PCR to identify sources of fecal contamination in stormwater, river water, and beach water in a Great Lakes area of concern. Appl Environ Microbiol. 2018;84:e01634–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  242. Steinbacher SD, Savio D, Demeter Ket al. Genetic microbial faecal source tracking: rising technology to support future water quality testing and safety management. Österreichische Wasser- und Abfallwirtschaft. 2021;73:468–81. [Google Scholar]
  243. Stevenson ME, Sommer R, Lindner Get al. Attachment and detachment behavior of human adenovirus and surrogates in fine granular limestone aquifer material. J Environ Qual. 2015;44:1392–401. [DOI] [PubMed] [Google Scholar]
  244. Symonds EM, Nguyen KH, Harwood VJet al. Pepper mild mottle virus: a plant pathogen with a greater purpose in (waste)water treatment development and public health management. Water Res. 2018;144:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  245. Tan B, Ng C, Nshimyimana JPet al. Next-generation sequencing (NGS) for assessment of microbial water quality: current progress, challenges, and future opportunities. Front Microbiol. 2015;6:20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  246. Tandukar S, Sherchan SP, Haramoto E. Applicability of crAssphage, pepper mild mottle virus, and tobacco mosaic virus as indicators of reduction of enteric viruses during wastewater treatment. Sci Rep. 2020;10:3616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  247. The Human Microbiome Project Consortium . Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  248. Tiwari A. Ahmed W, Oikarinen Set al. Application of digital PCR for public health-related water quality monitoring. Science of The Total Environment. Vol. 837, 2022, 155663, ISSN 0048-9697. [DOI] [PubMed] [Google Scholar]
  249. Tufenkji N, Miller GF, Ryan JNet al. Transport of Cryptosporidium oocysts in porous media: role of straining and physicochemical filtration. Environ Sci Technol. 2004;38:5932–8. [DOI] [PubMed] [Google Scholar]
  250. Ufnar JA, Wang SY, Christiansen JMet al. Detection of the nifH gene of Methanobrevibacter smithii: a potential tool to identify sewage pollution in recreational waters. J Appl Microbiol. 2006;101:44–52. [DOI] [PubMed] [Google Scholar]
  251. Ulrich N, Rosenberger A, Brislawn Cet al. Restructuring of the aquatic bacterial community by Hydric dynamics associated with superstorm sandy. Appl Environ Microbiol. 2016;82:3525–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  252. Unno T, Jang J, Han Det al. Use of barcoded pyrosequencing and shared OTUs to determine sources of fecal bacteria in watersheds. Environ Sci Technol. 2010;44:7777–82. [DOI] [PubMed] [Google Scholar]
  253. Unno T, Staley C, Brown CMet al. Fecal pollution: new trends and challenges in microbial source tracking using next-generation sequencing. Environ Microbiol. 2018;20:3132–40. [DOI] [PubMed] [Google Scholar]
  254. USEPA . Bacteriological Ambient Water Quality Criteria for Marine and Fresh Recreational Waters. United States of America. 1986. [Google Scholar]
  255. USEPA . Method 1696: characterization of human fecal pollution in water by HF183/BacR287 TaqMan® quantitative polymerase chain reaction (qPCR) assay. EPA 821-R-19-002. 2019a.
  256. USEPA . Method 1697: characterization of human fecal pollution in water by HumM2 TaqMan® quantitative polymerase chain reaction (qPCR) Assay (c). EPA 821-R-19-003.2019b.
  257. USEPA . Recreational water quality criteria. EPA820-F-12-061. 2012a.
  258. USEPA . USEPA Method 1609: enterococci in water by TaqMan quantitative polymerase chain reaction (qPCR) with internal amplification control (IAC) assay. Technical Report EPA-820-R-13-005. Washington: Office of Water, US Environmental Protection Agency, 2013. [Google Scholar]
  259. USEPA . USEPA Method 1611: enterococci in Water by TaqMan quantitative polymerase chain reaction (qPCR) assay. Technical Report EPA-821-R-12-008. Washington: Office of Water, US Environmental Protection Agency, 2012b. [Google Scholar]
  260. Vadde KK, Feng Q, Wang Jet al. Next-generation sequencing reveals fecal contamination and potentially pathogenic bacteria in a major inflow river of Taihu Lake. Environ Pollut. 2019;254:113108. [DOI] [PubMed] [Google Scholar]
  261. Velimirov B, Milosevic N, Kavka GGet al. Development of the bacterial compartment along the Danube River: a continuum despite local influences. Microb Ecol. 2011;61:955–67. [DOI] [PubMed] [Google Scholar]
  262. Verbyla ME, Symonds EM, Kafle RCet al. Managing microbial risks from indirect wastewater reuse for irrigation in urbanizing watersheds. Environ Sci Technol. 2016;50:6803–13. [DOI] [PubMed] [Google Scholar]
  263. Verhougstraete MP, Martin SL, Kendall ADet al. Linking fecal bacteria in rivers to landscape, geochemical, and hydrologic factors and sources at the basin scale. Proc Natl Acad Sci USA. 2015;112:10419–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  264. Vierheilig J, Farnleitner AH, Kollanur Det al. High abundance of genetic Bacteroidetes markers for total fecal pollution in pristine alpine soils suggests lack in specificity for feces. J Microbiol Methods. 2012;88:433–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  265. Vierheilig J, Savio D, Ley REet al. Potential applications of next generation DNA sequencing of 16S rRNA gene amplicons in microbial water quality monitoring. Water Sci Technol. 2015;72:1962–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  266. Vrzel J, Vuković-Gačić B, Kolarević Set al. Determination of the sources of nitrate and the microbiological sources of pollution in the Sava River Basin. Sci Total Environ. 2016;573:1460–71. [DOI] [PubMed] [Google Scholar]
  267. Wade TJ, Arnold BF, Schiff Ket al. Health risks to children from exposure to fecally-contaminated recreational water. PLoS ONE. 2022;17:e0266749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  268. Wade TJ, Calderon RL, Brenner KPet al. High sensitivity of children to swimming-associated gastrointestinal illness: results using a rapid assay of recreational water quality. Epidemiology. 2008;20:375–83. [DOI] [PubMed] [Google Scholar]
  269. Wade TJ, Calderon RL, Sams Eet al. Rapidly measured indicators of recreational water quality are predictive of swimming-associated gastrointestinal illness. Environ Health Perspect. 2006;114:24–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  270. Wade TJ, Sams E, Brenner KPet al. Rapidly measured indicators of recreational water quality and swimming-associated illness at marine beaches: a prospective cohort study. Environ Health. 2010;9:66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  271. Wang H, Kaletta J, Kaschuba Set al. Attachment, re-mobilization, and inactivation of bacteriophage MS2 during bank filtration following simulation of a high virus load and an extreme rain event. J Contam Hydrol. 2022;246:103960. [DOI] [PubMed] [Google Scholar]
  272. Wang Y, Fang W, Wang Xet al. Spatial distribution of fecal pollution indicators in sewage sludge flocs and their removal and inactivation as revealed by qPCR/viability-qPCR during potassium ferrate treatment. J Hazard Mater. 2023;443:130262. [DOI] [PubMed] [Google Scholar]
  273. Ward MH, Jones RR, Brender JDet al. Drinking water nitrate and human health: an updated review. Int J Environ Res Public Health. 2018;15:1557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  274. Weidhaas JL, Macbeth TW, Olsen RLet al. Correlation of quantitative PCR for a poultry-specific Brevibacterium marker gene with bacterial and chemical indicators of water pollution in a watershed impacted by land application of poultry litter. Appl Environ Microbiol. 2011;77:2094–102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  275. Weller D, Belias A, Green Het al. Landscape, water quality, and weather factors associated with an increased likelihood of foodborne pathogen contamination of New York streams used to source water for produce production. Front Sustain Food Syst. 2020;3:32440656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  276. WHO . Guidelines for Drinking-Water Quality: Fourth Edition Incorporating the First Addendum. 2017. [PubMed] [Google Scholar]
  277. Wickham H, Averick M, Bryan Jet al. Welcome to the Tidyverse. J Open Source Software. 2019;4:1686. [Google Scholar]
  278. Wiesner-Friedman C, Beattie RE, Stewart JRet al. Microbial find, inform, and test model for identifying spatially distributed contamination sources: framework foundation and demonstration of ruminant bacteroides abundance in river sediments. Environ Sci Technol. 2021;55:10451–61. [DOI] [PubMed] [Google Scholar]
  279. Williams NLR, Siboni N, Potts Jet al. Molecular microbiological approaches reduce ambiguity about the sources of faecal pollution and identify microbial hazards within an urbanised coastal environment. Water Res. 2022;218:118534. [DOI] [PubMed] [Google Scholar]
  280. Willis JR, Sivaganesan M, Haugland RAet al. Performance of NIST SRM (R) 2917 with 13 recreational water quality monitoring qPCR assays. Water Res. 2022;212:118114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  281. Wintzingerode FV, Göbel UB, Stackebrandt E. Determination of microbial diversity in environmental samples: pitfalls of PCR-based rRNA analysis. FEMS Microbiol Rev. 1997;21:213–29. [DOI] [PubMed] [Google Scholar]
  282. Wolfe MK, Archana A, Catoe Det al. Scaling of SARS-CoV-2 RNA in settled solids from multiple wastewater treatment plants to compare incidence rates of laboratory-confirmed COVID-19 in their sewersheds. Environ Sci Technol Lett. 2021;8:398–404. [DOI] [PubMed] [Google Scholar]
  283. World Health Organisation . Fact sheet: antimicrobial resistance. 2021.
  284. World Health Organisation . WHO WASH strategy 2018-2025. Vol. 64, 2019. [Google Scholar]
  285. Wu T, Zhai C, Zhang Jet al. Study on the attachment of Escherichia coli to sediment particles at a single-cell level: the effect of particle size. Water-Sui. 2019;11:819. [Google Scholar]
  286. Wuertz S, Wang D, Reischer GHet al. Library-independent bacterial source tracking methods. In: Hagedorn C, Blanch AR, Harwood VJ (eds), Microbial Source Tracking: Methods, Applications, and Case Studies. New York: Springer, 2011, 61–112. [Google Scholar]
  287. Yamahara KM, Keymer DP, Layton BAet al. Application of molecular source tracking and mass balance approach to identify potential sources of fecal indicator bacteria in a tropical river. PLoS ONE. 2020;15:e0232054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  288. Youngblut ND, dl Cuesta-Zuluaga J, Reischer GHet al. Large-scale metagenome assembly reveals novel animal-associated microbial genomes, biosynthetic gene clusters, and other genetic diversity. Msystems. 2020;5:e01045–01020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  289. Youngblut ND, Reischer GH, Dauser Set al. Vertebrate host phylogeny influences gut archaeal diversity. Nat Microbiol. 2021;6:1443–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  290. Youngblut ND, Reischer GH, Walters Wet al. Host diet and evolutionary history explain different aspects of gut microbiome diversity among vertebrate clades. Nat Commun. 2019;10:2200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  291. Yu G. scatterpie: Scatter Pie Plot. CRAN, 2023. [Google Scholar]
  292. Zarza E, Diego-García E, García LVet al. Monitoring SARS-CoV-2 in the wastewater and rivers of Tapachula, a migratory hub in southern Mexico. Food Environ Virol. 2022;14:199–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  293. Zhang Q, Gallard J, Wu BLet al. Synergy between quantitative microbial source tracking (qMST) and quantitative microbial risk assessment (QMRA): a review and prospectus. Environ Int. 2019;130:10. [DOI] [PubMed] [Google Scholar]
  294. Zimmer-Faust AG, Steele JA, Xiong Xet al. A combined digital PCR and next generation DNA-sequencing based approach for tracking nearshore pollutant dynamics along the southwest United States/Mexico border. Front Microbiol. 2021;12:674214. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

fuad028_Supplemental_Files

Articles from FEMS Microbiology Reviews are provided here courtesy of Oxford University Press

RESOURCES