Skip to main content
Water Research X logoLink to Water Research X
. 2021 Feb 12;11:100093. doi: 10.1016/j.wroa.2021.100093

Predictive water virology using regularized regression analyses for projecting virus inactivation efficiency in ozone disinfection

Syun-suke Kadoya a, Osamu Nishimura a, Hiroyuki Kato b, Daisuke Sano a,c,
PMCID: PMC7903012  PMID: 33665597

Abstract

Wastewater reclamation and reuse have been practically applied to water-stressed regions, but waterborne pathogens remaining in insufficiently treated wastewater are of concern. Sanitation Safety Planning adopts the hazard analysis and critical control point (HACCP) approach to manage human health risks upon exposure to reclaimed wastewater. HACCP requires a predetermined reference value (critical limit: CL) at critical control points (CCPs), in which specific parameters are monitored and recorded in real time. A disinfection reactor of a wastewater treatment plant (WWTP) is regarded as a CCP, and one of the CCP parameters is the disinfection intensity (e.g., initial disinfectant concentration and contact time), which is proportional to the log reduction value (LRV) of waterborne pathogens. However, the achievable LRVs are not always stable because the disinfection intensity is affected by water quality parameters, which vary among WWTPs. In this study, we established models for projecting virus LRVs using ozone, in which water quality and operational parameters were used as explanatory variables. For the model construction, we used five machine learning algorithms and found that automatic relevance determination with interaction terms resulted in better prediction performances for norovirus and rotavirus LRVs. Poliovirus and coxsackievirus LRVs were predicted well by a Bayesian ridge with interaction terms and lasso with quadratic terms, respectively. The established models were relatively robust to predict LRV using new datasets that were out of the range of the training data used here, but it is important to collect LRV datasets further to make the models more predictable and flexible for newly obtained datasets. The modeling framework proposed here can help WWTP operators and risk assessors determine the appropriate CL to protect human health in wastewater reclamation and reuse.

Keywords: Log reduction value, Ozone disinfection, Waterborne viruses, Regularized regression analyses, Hierarchical bayesian modeling

Graphical abstract

Image 1

Highlights

  • The ozone inactivation models to predict virus LRVs are established using regularized regressions.

  • The models are applicable to the variety of water quality among wastewater treatment plants.

  • The models are relatively strong for extrapolation.

  • To develop more robust models, inactivation data with water quality information need to be accumulated.

1. Introduction

Water shortages are a critical global concern and endanger human lives all over the world. Wastewater reclamation and reuse have significantly contributed to the improvement of the accessibility of safe water, but waterborne viruses such as norovirus, rotavirus, adenovirus and enterovirus have been detected from effluents of wastewater treatment plants (WWTPs) (Subahir et al., 2019; Hoque et al., 2018; Croker et al., 2018; Bauri et al., 2019; Li et al., 2018; Silva-Sales et al., 2020; Katayama et al., 2008; Adeifisove et al., 2016; Shlindwein et al., 2010). The risk of viral infection to humans, including users of reclaimed wastewater, needs to be addressed because waterborne viruses in wastewater have imposed a significant disease burden on human society (Utsumi et al., 2020; Subahir et al., 2019; Hoque et al., 2018; Croker et al., 2018; Bauri et al., 2019; Li et al., 2018).

International and domestic guidelines for wastewater reclamation and reuse employ the multiple-barrier concept to address the waterborne disease risk in the usage of reclaimed wastewater, in which the target log reduction value (LRV) of waterborne pathogens is determined beforehand, and the target LRV is allocated to each step at a wastewater treatment plant (WWTP) (Ito et al., 2017; Sano et al., 2016). The World Health Organization (WHO) has recommended applying Sanitation Safety Planning (SSP) to the safe use of excreta, wastewater and greywater, and the hazard analysis and critical control point (HACCP) approach is employed in SSP in order to manage human health risks upon exposure to reclaimed wastewater (WHO, 2006 and 2015). The HACCP approach includes the identification of critical control points (CCPs), which are important operational steps to determine the magnitude of hazardous factors in the final products (U.S. FDA, 1997). The parameters at the CCP are monitored and recorded in real time. In addition, the parameters are compared to reference values called critical limit (CL) to maintain the safety of final products (U.S. FDA, 1997). The corrective action needs to take place when the parameters monitored at CCP deviate from CL. The disinfection intensity, determined by some operational parameters such as Ct-value in a disinfection reactor, is suitable as a CCP parameter at WWTPs since it determines the LRV of waterborne pathogens at a disinfection reactor.

However, an application of SSP to the WWTP operation is difficult at present since a universal method for CL determination has not yet been developed. In addition, the difference in water quality among WWTPs makes it difficult to establish a universal approach that determines the CLs appropriate for respective WWTPs, because chemical disinfectants are consumed by contaminants in wastewater, and the decay of disinfectant concentration results in deviation from the expected disinfection intensity. Thus, the disinfection intensity that achieves the target LRV needs to be determined by taking water quality and operational information at each WWTP into account. Previously, we proposed the concept of predictive water virology, in which the models predicted virus LRV by using water quality and operational parameters as explanatory variables (Kadoya et al., 2019). The predictive water virology enables us to derive the appropriate CL to attain the target LRV under site-specific water quality. Kadoya et al. (2019 and 2020) previously proposed the concept of predictive water virology and reported that the predictive inactivation models, based on hierarchical Bayesian modeling (HBM) and regularized regressions, flexibly correspond to the changes and variety in water quality parameters among WWTPs (e.g., seasonal variation). Predictive water virology can help us understand the required operational conditions to achieve the target LRV, in which there is no gap between the predicted and target LRVs. However, these previous studies were limited in their predictions of newly obtained datasets that are not used for model construction, in other words, we need to know the predictable range of present models and identify which algorithms are more robust to predict LRVs using new datasets.

Basic regularized regression analyses are ridge, lasso and elastic net regressions, in which the multicollinearity problem is solved, and thus all explanatory variables can be used. All these regression approaches are extended versions of the least square method by adding a regularization term corresponding to each algorithm. The ridge regression shrinks absolute values of the coefficients of model variables (Hoerl and Kennard, 1970), whereas the lasso regression makes the coefficients of inessential variables zero with a decrease in the magnitude of model variables (Tibshirani, 1996). The elastic net regression combines regularized terms used in both the ridge and lasso regressions (Zou and Hastie, 2006). The lasso and elastic net regressions are classified into the sparse estimation, which selects the plausibly essential variables. In Bayesian regularized regression (Bayesian ridge and automatic relevance determination (ARD)), an objective variable is assumed to be expressed as Gaussian functions, and regularization terms are then estimated as hyperparameters by Bayes rule. A Bayesian ridge employs all explanatory variables as well as basic ridge regression, whereas ARD is also one of the sparse estimations (Tipping, 2001). Previous studies have applied several machine learning algorithms to establish predictive models for water quality and rainfall runoff (Park et al., 2018; Lu and Ma, 2020; Safari et al., 2020), but the predictive inactivation models for waterborne viruses, which are based on Bayesian regularized regression analyses using water quality and operational parameters, have not been reported.

In this study, we introduced the Bayesian regularized regression analysis into the predictive water virology to explore appropriate algorithms, which indicated the best prediction performance and were capable of predicting newly obtained datasets and avoiding overfitting to training datasets. We here have focused on inactivation datasets related to ozone, which is one of disinfectants effective for inactivation of waterborne viruses (Roy et al., 1981; Kim et al., 1980) and is utilized at many WWTPs (Hembach et al., 2019; Kharel et al., 2020; Wang et al., 2018). We first collected peer-reviewed articles following a systematic review method and extracted LRV datasets with water quality and operational information from water disinfection experiments using ozone. We applied three regularized regressions (ridge, lasso, and elastic net) and two Bayesian regularized regressions (Bayesian ridge and ARD) to construct the predictive inactivation models, and then verified the prediction performance and overfitting problem based on two validation approaches: (1) the models constructed by training datasets (80% of total dataset) predicted test datasets (20% of total dataset) for evaluation of the prediction performance, and (2) one article was selected and its dataset was used as a test dataset, which had no relation to the training datasets, to confirm whether the model overfitted. We then tried to combine the best regularized regression-based models with HBM, which took experimental errors and virus type-dependent sensitivity into account (Kadoya et al., 2019). Finally, we provided the applicable range of explanatory variables to help readers obtain more precise prediction values and the required disinfection conditions under the water quality level to achieve 4 LRV, which has been recommended by the USEPA (USEPA, 1999).

2. Methods

2.1. Systematic Review and data collection

Peer-reviewed articles relating to virus inactivation using ozone were collected using Google Scholar from April 2019 to June 2020, following PRISMA guidelines (Moher et al., 2009). The keywords input to Google Scholar were “disinfection” and “ozone” combined with keywords of each virus species (“norovirus”, “rotavirus”, “poliovirus”, “adenovirus”, “hepatitis A virus”, “coxsackievirus” and “echovirus”), and we checked all records published from 1950 to 2020. Dissertations, book chapters, reviews, conference reports and non-English articles were eliminated from the collected articles (first screening). We read whole texts in the first-screened records, and the articles relevant to this study aim, which included the information about the virus LRV and ozone disinfection conditions such as contact time, initial concentration and Ct-value, were then selected from the first-screened records. To establish models using the highly qualified datasets, we assumed that LRVs must be differentiated under varied conditions of disinfection in well-performed experiments, in which water quality and operational parameters were strictly controlled. If the LRVs were same in spite of distinct disinfection conditions, the datasets were excluded. The process of systematic review was fulfilled by three persons.

The LRV was expressed as below:

LRV=log10(N0/Nt) (1)

where N0 is virus concentration at time 0 and Nt is that at time t. ImageJ software was used to extract numerical datasets of the LRV from figures, such as an inactivation curve (Shneider et al., 2012). Additionally, we obtained water quality and operational parameters such as initial ozone concentration (I) [mg/L], decay rate of ozone concentration (k) [min-1], contact time (t) [min], Ct-value (C) [min mg/L], pH (p) [-], temperature (T) [°C], assays for measuring virus concentration (A1: qPCR or infectivity, A2: plaque assay or MPN method when A1 is infectivity), types of water used for the disinfection experiment (W: purified or environmental water) and log10 initial virus concentration (V [log10 (genome copy number, PFU or MPN/mL)]). Virus types (S) were also extracted from the collected articles (Table S1), of which rotavirus and coxsackievirus were used as categorical variables because they are the only two types that were tested in the articles. When datasets did not include Ct-values, we calculated them by an integration of the disinfectant decay formula (Eq. (2)):

C(t)=Iexp(kt) (2)

where C(t) is ozone concentration at time t [min], I is initial concentration of ozone and k is the decay constant of ozone concentration in water.

2.2. Basic regularized regression analysis

Explanatory variables in the regularized regression analyses were I, k, t, C, p, T, A1, A2, W, V and S, indexed above. The variables A1, A2, W and S were binary ones (0: genome copy (in A1), PFU (in A2), purified water and one virus type, 1: infectivity (in A1), MPN (in A2), environmental water, another type). Interaction and interaction-quadratic terms were added in a stepwise procedure using the “PolynomialFeatures” function of the scikit-learn library (Pedregosa et al., 2011) in Python 3.7. Regularized regression analysis requires a standardization of explanatory variables (mean = 0 and standard deviation = 1), which was performed by the function “StandardScaler” in scikit-learn library.

The response variable LRVs (y) were expressed as a linear function:

y =  + ε (3)

where y is a matrix of response variables of (y1, …, yρ), β is a coefficient matrix (β1, …, βρ), ε is the matrix of observation errors of (ε1,..., ερ), X is the design matrix of model variables of (x(1), ..., x(ρ)), x(j) is a matrix of (x1j, ..., xnj), ρ is the number of datasets, and n is the number of model variables). Minimizing the sum of squares error (Sλ) derives the estimates of coefficient:

minβSλ=minβ{12nyXβ22+λR(β)} (4)

where minβ is a minimization function for β, λ is the regularization parameter and a regularization term R(β) determines which regularization methods are used for constructing models. R(β) for ridge regression is expressed as 1/2β22 (β22=i=1nβi2) (Hoerl and Kennard, 1970) while for lasso regression β1 (β1=i=1n|βi|) is applied (Tibshirani, 1996). Elastic net regression combines the R(β) of ridge and lasso regressions (αβ22+(1α)β1, where α determines the proportion of the ridge to the lasso regression) (Zou and Hastie, 2006). Suitable values of two parameters (λ and α) were concurrently searched from 10-6 to 102 and from 0 to 1, respectively, using the grid search method that enumerated all combinations of those parameters and identify an appropriate combination. The standardization of variables and the addition of regularization term can ensure the homoscedasticity of errors.

2.3. Bayesian regularized regression analysis

Regularization parameters are determined not by fitting to data but by the random procedure in the grid search method as described above. Bayesian regularized regression analysis enables us to estimate regularization parameters with coefficients of regression models by fitting to data, and can avoid overfitting by making the model less complex (Tipping, 2001).

The coefficient β follows a spherical Gaussian distribution in Bayesian ridge regression analysis, and two regularization parameters (f and σ2) are given by gamma distributions as prior distributions (Tipping, 2001; Bishop, 2006). In this case, the likelihood of y is expressed as:

p(y|β,σ2)=(2πσ2)ρ/2exp{12σ2yβX2} (5)

The prior for β is described as:

p(β|f)=i=0ρN(βi|0,fi1) (6)

and the prior distributions for f and σ2 are:

p(f)=i=0ρGamma(fi|a,b) (7)
p(σ2)=Gamma(σ2|c,d) (8)

where a and c are shape parameters, and b and d are rate parameters of each gamma distribution. These parameters (a, b, c and d) were determined from non-informative distributions. Maximizing the logarithmic posterior distribution (ln p(β|y)) is equal to the minimization of a regularization term. Automatic relevance determination (ARD) employs the elliptical Gaussian distribution for a prior distribution of β, and follows the same procedure of Bayesian ridge regression (Tipping, 2001; Bishop, 2006). These Bayesian regularized regression analyses were also conducted by the scikit-learn library in Python 3.7; codes and source data are available in the supplementary materials.

2.4. Hierarchical bayesian modeling

We adopted probability distributions specific to the objective variables’ LRV based on Akaike’s information criterion (AIC) using the “fitdistrplus” packages of R software (Delignette-Muller and Dutang, 2015). The population parameter shaping the probability distribution (Y) was described as a logarithmic link function because the LRV datasets followed a Weibull or gamma distribution (Table S2). The logarithmic link function is expressed as below:

logeY=β0+βEX (9)

where β0 is an intercept, βE is a matrix of coefficients and X is a design matrix. We assumed that LRVs involved an experimental error, and the matrix of coefficients was indexed “E”. The βE was assumed to follow a normal distribution because of experimental error:

βENormal(μST[i],σE) (10)

where σE is a standard deviation generating variations among inactivation experiments, and μST[i] is a mean of virus type i, which implies the type-dependent sensitivity to ozone. The virus type-dependent parameter μST[i] also follows a normal distribution:

μST[i]Normal(μO,σST) (11)

where μO is the averaged mean among all virus types and σST is the standard deviation bearing the type-dependent sensitivity. The hierarchical Bayesian models were constructed by two approaches that employed explanatory variables selected by the best regularized regression models or variance inflation factor (VIF). A multicollinearity problem possibly remained in the HBM, so the models that removed an explanatory variable having more than 10 VIF values were prepared. HBM was conducted on R software version 3.5.0 (https://www.r-project.org/) and Stan (https://mc-stan.org/).

2.5. Model validation

We conducted two trials to determine which regularized regression algorithms exhibited the best prediction performance and escaped overfitting to training datasets. In Trial 1, we randomly separated total datasets into training (80%) and test datasets (20%) to evaluate the prediction powers for interpolation. In Trial 2, the datasets derived from an article were used as test datasets, which resulted in no relationship between training and test datasets and enabled us to understand the predictability for extrapolation. Both trials adopted leave-one-out (LOO) cross-validation, in which temporal models were repeatedly generated using N-1 training datasets (N is the number of training datasets), and a remaining dataset was used for model validations. The coefficients were the averaged values of the generated N models. To evaluate the prediction performance, we calculated mean squared values (MSEs). We also generated a number of the predicted LRVs by changing, stepwise, some explanatory variables having a higher coefficient to explore the out-of-predictable range in which a model generated obviously wrong prediction values such as a negative LRV.

3. Results

3.1. Article collection and data characteristics

We first identified 10102 records according to a systematic review method. Dissertations, book chapters, reviews and non-English articles were eliminated from the list of the first-collected articles, which resulted in a decrease to 4058 articles. After excluding the articles not relevant to our study (e.g., no information about the LRV and no water disinfection), 21 articles remained. In detail, the numbers of screened articles for norovirus, rotavirus, poliovirus, coxsackievirus, echovirus and adenovirus were 6, 4, 12, 4, 3 and 2, respectively. The numbers of LRV data point were 68 (norovirus), 65 (rotavirus), 182 (poliovirus), 111 (coxsackievirus), 29 (echovirus) and 29 (adenovirus), respectively (Lim et al., 2010; Brie et al., 2018; Hirneisen et al., 2011; Tondera et al., 2015; Shin and Sobsey, 2003; Thurston-Enriquez et al., 2005; Harakeh and Bulter, 1984a; Harakeh and Bulter, 1984b; Meunier et al., 2006; Vaughn et al., 1987; Finch and Fairbairin, 1991; Farooq and Akhlaque, 1983; Katzenelson and Biedermann, 1976; Moore and Magolin, 1994; Roy et al., 1981; Roy et al., 1982; Engelbrecht et al., 1980; Katzenelson et al., 1979; Emerson et al., 1982; Wang et al., 2018; Wolf et al., 2018). The Scikit-learn library (https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html) and previous studies suggest that more than 50 datasets should be analyzed (Cui and Gong, 2018; Leonard et al., 2017), so we excluded echovirus and adenovirus data from our analyses.

Water quality and operational parameters were also extracted from the screened articles. Initial concentrations of ozone (I) are mostly less than 5 mg/L (Fig. 1, I), and the decay constant (k) is described only in articles for norovirus (Fig. 1, k). Data points of the contact time (t) gather around lower values, but a part of the data for rotavirus and poliovirus was distributed around 10 min (Fig. 1, t). Ct-values (C) were not listed in articles about rotavirus and poliovirus (Fig. 1, C). The articles about coxsackievirus listed C but included no descriptions of I. The values of pH (p) are centered around 7 ± 1 (Fig. 1, p), and almost all disinfection experiments are conducted at room temperature (around 20 °C) or 4 °C (Fig. 1, T). Purified water, like a buffer, was mainly used for disinfection tests (Fig. 1, W). Infectivity assay (plaque assay or MPN method) is one of the main methods to measure virus concentration (Fig. 1, A1 and A2). Initial log virus concentration ranges from approximately four to eight (Fig. 1, V). Since the collected studies used only two types of rotavirus and coxsackievirus (Table 1 and Table S1), the types were regarded as an explanatory variable in the regularized regression analyses. Taken together, the number of explanatory variables was 9 (norovirus), 8 (rotavirus), 7 (poliovirus) and 7 (coxsackievirus) (Table 1).

Fig. 1.

Fig. 1

Distribution of operational and water quality parameters. Water quality and operational parameters except for strain types were plotted for each virus species as swarm, box (median, 25 and 75 percentile) and violin plots. “Assay” is classified as A1 (genome or infectivity) or A2. (If classifying as infectivity, the variable is also classified as plaque forming unit (PFU) or most probable number (MPN) method.).

Table 1.

Information about datasets used for the model construction (NLRV: the number of log reduction values, NE: the number of articles, NS: the number of virus types, I: initial ozone concentration, k: decay constant, C: Ct-value, p: pH, T: temperature, A1: infectivity or genome, A2: plaque forming unit or most probable number method, W: water types, V: log10 initial virus concentration).

3.2. The prediction performance of regularized regression models in trial 1

Based on training datasets randomly split from the total dataset, predictive inactivation models for four virus species were constructed. MSEs for both training and test datasets were calculated to identify the best algorithms for the LRV prediction and the required polynomial terms (Fig. 2). The addition of higher terms to the norovirus inactivation model made prediction performances for both datasets better than only linear terms and decreased the differences of MSEs between training and test data (ΔMSE = |MSEtrain - MSEtest|) except for ridge regression. Also, MSEs decreased by adding interaction terms to the rotavirus inactivation models, whereas ΔMSEs in the models with only linear terms were lower. Among models for predicting poliovirus LRV, the ridge regression with polynomial terms indicated the best prediction performance, and Bayesian regularized regression-based models with higher terms also showed smaller MSEs. The coxsackievirus inactivation models were improved very slightly by the addition of polynomial terms, and the ridge-based model with linear terms had the smallest ΔMSE. The algorithms providing better prediction performance were ARD with interaction terms (norovirus and rotavirus) and ridge with interaction terms (poliovirus and coxsackievirus) at this time.

Fig. 2.

Fig. 2

Mean squared errors (MSEs) in Trial 1.Total datasets are randomly divided into test and training datasets in Trial 1. MSEs are compared between test (solid) and training (dashed) datasets (left), and the absolute values of difference between test and training data are also displayed (right).

3.3. The prediction performance for new datasets (trial 2)

In Trial 1, there could be relationships between training and test datasets, which made it difficult to confirm the ability to predict newly obtained datasets. Therefore, we selected datasets extracted from an article as test datasets in Trial 2 to validate the models appropriate for predicting new datasets not related to training datasets. The model indicating smaller MSE and ΔMSE is preferable for the predictive inactivation model. In norovirus models, lasso and elastic net with linear terms and ARD with interaction terms indicated smaller MSE and ΔMSE (Fig. 3). Only ARD with interaction terms in rotavirus models seemed to avoid the overfitting problem. All poliovirus models had very small ΔMSE and smaller MSEs, and Bayesian regularized regressions with polynomial terms slightly improved prediction performances. The coxsackievirus model based on lasso regression with quadratic terms had lower MSEs and ΔMSE.

Fig. 3.

Fig. 3

Mean squared errors (MSEs) in Trial 2.Datasets from an article are used as test datasets in Trial 2. MSEs are compared between test (solid) and training (dashed) datasets (left), and the absolute values of difference between test and training data are also displayed (right).

3.4. The best predictive inactivation models and their characteristics

Taken together with the results from Trial 1 and 2, the best predictive inactivation models were ARD with interaction terms for norovirus and rotavirus, Bayesian ridge with interaction terms for poliovirus and lasso with quadratic terms for coxsackievirus (Table 3). Comparative results for observed and predicted LRVs indicated that not only training datasets but also many of the test datasets were predicted well by the best models, but some predicted values (two test data of norovirus in Trial 2, both training and test data of coxsackievirus in Trial 1 and 2) largely deviated from the observed values (Fig. 4). Many of prediction values tended to be on the y = x line and some were randomly deviated from the line (i.e., residuals are distributed randomly), which indicates the independence of residuals. The estimated coefficients of explanatory variables in the best prediction models are displayed in Fig. 5. Except for the poliovirus model, the number of explanatory variables decreased because of the sparse estimations. All models had the variables directly related to the disinfection condition, such as I, C and t, indicating that they are essential to explain LRVs. Other water quality parameters are also included in the best models although some of them are interaction terms.

Table 3.

The best predictive inactivation models.

Algorithm Polynomial terms
Norovirus Automatic relevance determination Interaction
Rotavirus Automatic relevance determination Interaction
Poliovirus Bayesian ridge Interaction
Coxsackievirus Lasso Quadratic

Fig. 4.

Fig. 4

Comparison of observed data with predicted values by the best predictive models.Prediction values in the best predictive inactivation models are compared with observed data in the condition of Trial 1 (a) and 2 (b).Triangles and circles are coordinates (observed, predicted) for training and test datasets, respectively. Gray dotted lines indicate a predicted value completely matches an observed point. Error bar means a standard deviation.

Fig. 5.

Fig. 5

Model structures for the best predictive inactivation models. Coefficients of the best models (red: positive, pale blue: negative). Letters indicate explanatory variables (I: initial ozone concentration, k: decay constant, t: contact time, C: Ct-value, A: assays to measure virus concentration, W: water types, p: pH, T: temperature, V: initial virus concentration). (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)

3.5. No improvement of the prediction performance by HBM

We designed VIF-based and regularized-regression-based hierarchical Bayesian models (Table S3) and compared MSE values among them (Table 2). The hierarchical Bayesian models based on regularized regression analyses showed higher prediction performances than those based on VIF. However, the prediction performances in HBM for test datasets of norovirus, poliovirus and coxsackievirus were lower than the best regularized regression models, indicating that the HBM did not contribute to an improvement of the predictive inactivation models using ozone (Figs. S1 and 2).

Table 2.

Prediction performance of regularized regression and two types of hierarchical Bayesian models (regularized regression and variance inflation factor (VIF)-based).

The best regularized regression model
Hierarchical Bayesian model (Regularized regression base)
Hierarchical Bayesian model (VIF base)
train test train test train test
Norovirus 0.35 8.58 0.55 9.51 0.58 17.8
Rotavirus 0.51 0.73 0.67 0.33 1.34 0.31
Poliovirus 0.54 0.51 0.79 2.24
Coxsackievirus 0.98 1.06 1.16 4.85 1.17 3921

3.6. Identification of disinfection conditions achieving 4LRV and unpredictable ranges

The LRVs were generated using the major variables indicated in Fig. 5 to identify the predictable ranges in the models constructed and the disinfection conditions that achieved 4LRV. For all simulations, we set fixed values based on a practical WWTP in Sendai City, Japan, according to the seasonal prevalence of the viruses (summer: poliovirus and coxsackievirus, winter: norovirus and rotavirus). The p slightly affected the 4LRV achievement (Fig. 6a), but as the k increased, the 4LRV was achieved easily (Fig. 6b), which does not seem to be intuitive since a high value of k reduced the effectiveness of ozone disinfection. The rotavirus model clearly demonstrated the out-of-predictable ranges in certain I values (Fig. 6c, d). As well as norovirus, the p had a small effect on the LRV in the poliovirus model (Fig. 7a), but the higher T made it difficult to achieve the 4LRV in a disinfection reactor (Fig. 7b). The linear relationship between I and t was likely to be found (Fig. 7c). The coxsackievirus model also indicated unpredictable regions, and higher p and T made it easy to achieve 4LRV (Fig. 7d, e).

Fig. 6.

Fig. 6

Simulation of norovirus and rotavirus LRVs.Heatmaps indicate the LRVs by changing specific explanatory variables, and the white dotted lines are the boundary of 4LRV. Gray cell means the out of predictable range (negative value) in the models. Temperature T and log10 virus concentration V correspond to winter conditions. (I: initial ozone concentration, k: decay constant, t: contact time, C: Ct-value, A: assays to measure virus concentration, W: water types, p: pH, T: temperature, V: initial virus concentration).

Fig. 7.

Fig. 7

Simulation of poliovirus and coxsackievirus LRVs. Heatmaps indicate the LRVs by changing specific explanatory variables, and the white dotted lines are the boundary of 4LRV. The gray cell means the out-of-predictable-range value (negative value) in the models. Temperature T and log10 virus concentration V correspond to summer conditions. (I: initial ozone concentration, k: decay constant, t: contact time, C: Ct-value, A: assays to measure virus concentration, W: water types, p: pH, T: temperature, V: initial virus concentration).

4. Discussion

In this study, we constructed ozone inactivation models predicting virus LRV by regularized regression analyses. The best performances for predicting norovirus and rotavirus LRVs were found in ARD with interaction terms, in which overfitting to training datasets could be avoided (Fig. 2, Fig. 3, Fig. 4 and Table 3). Bayesian ridge with interaction terms and lasso with quadratic terms indicated the best prediction performance for poliovirus and coxsackievirus, respectively, and had the ability to predict new datasets not relevant to the training dataset (Fig. 2, Fig. 3, Fig. 4 and Table 3). However, some test and training datasets have still not been predicted well (Fig. 4), and the HBM did not improve the prediction performances (Table 2). We also provided information about the unpredictable range of some variables and the desirable disinfection condition for the achievement of 4LRV (Fig. 6, Fig. 7).

The conventional kinetics of disinfection for viruses, such as the Chick-Watson, Hom and efficiency factor Hom (EFH) models, use only disinfection parameters (disinfectant concentration, contact time and decay constant) as model parameters (Chick, 1908; Watson, 1908; Hom, 1972; Haas and Joffe, 1994). Boehm et al. established the predictive model for the natural decay of enteric viruses, in which several water quality parameters were used as explanatory variables, and found the larger effects of water quality on natural decay rates (Boehm et al., 2019). The disinfection intensity of ozone is also affected by water quality parameters as the decay rate of ozone increases at higher temperatures (Sohn et al., 2004). Classical models are likely to be inadequate to predict LRVs under varied water quality status, but predictive water virology provides more practical models for LRV prediction using disinfectants because some operational and water quality information can be incorporated into the model (Fig. 5).

The selection of explanatory variables is critical for model construction (Abdul-Wahab et al., 2005). In fact, sparse estimation, which is applied for the predictive inactivation models except for poliovirus, implies a mitigation of overfitting to training datasets (Fig. 3), although an addition of polynomial terms is required to improve prediction performances. Ozone is generally consumed by organic contaminants in water (Park et al., 2001), and the established models lack variables related to disinfectant-consuming substances (e.g., DOC and SS) (Sohn et al., 2004; Du et al., 2020), so the polynomial terms might compensate for the missing but essential variables. However, in this study, only the small number of variables was available and thus an addition of polynomial terms does not always support the prediction, which might attribute to the lack of fit to some datasets. To predict LRVs that are not predicted well with current models (Fig. 4), other types of water quality information needs to be incorporated into our models. The collection of datasets that completely involve water quality information is required to improve the prediction performance, which may be achievable by establishing a widely accessible database that includes datasets provided from laboratories and WWTPs, such as Global Water Pathogen Project (GWPP; https://www.waterpathogens.org/). If water quality information is difficult to prepare, the Ct-value that is calculated using a decay constant can be a substitution of disinfectant consuming matters (Kadoya et al., 2019). Additionally, we are able to construct the robust regularized regression-based models that exclude outliers from an estimation of model coefficients by using other machine learning algorithms (e.g., Random Sample Consensus, Theil-Sen Estimator) (Fischler and Bolles, 1981; Dang et al., 2008).

Predictive water virology helps us identify the water quality and operational conditions that are required for 4LRV, but the models established here have to be updated continuously because the out-of-predictable range has remained large (Fig. 6, Fig. 7). The LRVs for norovirus, rotavirus and poliovirus are relatively stable under near-neutral pH levels (Fig. 6, Fig. 7), which is consistent with previous reports (Lin et al., 2014; Meng et al., 1987; Ward and Ashley, 1979). On the other hand, coxsackievirus LRVs are positively proportional to the pH-value since coxsackievirus is easily inactivated by higher pH (McGeady et al., 1979). Although waterborne viruses are easily inactivated with higher temperatures (Moresco et al., 2016; Biziagos et al., 1988), the model implies that poliovirus is inactivated more easily with low temperatures, which may be attributable to a data bias (e.g., small datasets using low temperature, in which higher LRVs are observed). Also, as shown in Fig. 6b, an unintuitive result that the LRV became high as decay rate k increased is likely to be explained by the data bias owing to the small number of low k values (Fig. 1). Thus, virus LRV datasets obtained under various water quality conditions need to be collected to eliminate the data bias and expand the predictable range of models.

The predictive inactivation models established here are mostly based on datasets obtained in bench-scale experiments, so the models need to be carefully applied to LRV prediction at real-scale plants. In practice, there can be uncertainty related to hydraulics. For example, heterogeneity of DOC concentration, which is caused by the degree of incomplete mixing, could affect the decay of ozone concentration and the Ct-value. Also, under the mixing effect at a WWTP, water quality parameters need to be expressed using probability distributions (Sin et al., 2009 and 2011). Again, the model limitation must be solved by preparing a high-quality database of virus LRV, in which water quality parameters at WWTP are deposited. Alternatively, the prediction performance is developed by making the regularized regression-based models with explanatory variables that are expressed using probability distributions.

Several descriptions about the CL determination at WWTPs are available in the literature, but we do not have a common and flexible approach for such a determination (Hallowell, 2014). The current models for the LRV prediction help us determine the CL at the disinfection step at any WWTPs. The accepted concentration of virus in effluent is determined so as to correspond to the reference values of tolerable infectious risk and disease burden (e.g., DALY loss per person per year) (USEPA, 1990; WHO, 2017). When a probability distribution of virus concentration in influent is known, the accepted virus concentration in effluent derives the target LRV (Ito et al., 2017). The operational and water quality parameters at WWTPs are monitored, and then we can apply the predictive inactivation models established in this study to determine the CL, which retains the target LRV.

5. Conclusions

Under the concept of predictive water virology, we provided the ozone inactivation models for waterborne viruses, which enable operators at a WWTP and risk assessors to determine the CL at the disinfection reactor. The proposed models are useful for predicting LRVs that are affected by some water quality parameters within the predictable range, and to make the models predictable, we need to continue to collect water quality information with LRV data and then update the models.

Author agreement

SK and DS designed the present study. SK performed this study and prepared this article. ON, HK and DS checked the results of the analyses and modified the draft version of the article. All authors approved the final article.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This study was supported by Gesuido Academic Incubation to Advanced Project, Japan Ministry of Land, Infrastructure, Transport and Tourism, and “The Sanitation Value Chain: Designing Sanitation Systems as Eco-Community Value System” Project, Research Institute for Humanity and Nature (RIHN, Project No. 14200107).

Footnotes

Appendix A

Supplementary data to this article can be found online at https://doi.org/10.1016/j.wroa.2021.100093.

Appendix A. Supplementary data

The following are the Supplementary data to this article:

Multimedia component 1
mmc1.zip (3.5KB, zip)
Multimedia component 2
mmc2.xlsx (30.3KB, xlsx)
Multimedia component 3
mmc3.docx (9.6MB, docx)

References

  1. Abdul-Wahab S., Bakheit C.S., Al-Alawi M. Principal component and multiple regression analysis in modeling of ground-level ozone and factors affecting its concentrations. Environ. Model. Software. 2005;20:1263–1271. [Google Scholar]
  2. Adeifisove M.A., Nwodo U.U., Green E., Okoh A.I. Quantitative PCR detection and characterization of human adenovirus, rotavirus and hepatitis A virus in discharged effluents of two wastewater treatment facilities in the Eastern Cape, South Africa. Food. Environ. Virol. 2016;8:262–274. doi: 10.1007/s12560-016-9246-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bauri M., Wilkinson A.L., Ropa B., Feldon K., Snider C.J., Anand A., Tallis G., Boualam L., Grabovac V., Avagyan T., Reze S.M., Mekonnen D., Zhang Z., Thorley B.R., Shimizu H., Apostol L.N.G., Takashima Y. Circulating vaccine-derived poliovirus type 1 and outbreak response – Papua New Guinea, 2018. MMWR. 2019;69(5):119–120. doi: 10.15585/mmwr.mm6805a6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bishop C.M. Springer-Verlag; New York, LLC: 2006. Pattern Recognition and Machine Learning. [Google Scholar]
  5. Biziagos E., Passagot J., Crance J.-M., Deloince R. Long-term survival of hepatitis A virus and poliovirus type 1 in mineral water. Appl. Environ. Microbiol. 1988;54(11):2705–2710. doi: 10.1128/aem.54.11.2705-2710.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Boehm A.B., Silverman A.I., Schriewer A., Goodwin K. Systematic review and meta-analysis of decay rates of waterborne mammalian viruses and coliphages in surface waters. Wat. Res. 2019;164(1):114898. doi: 10.1016/j.watres.2019.114898. [DOI] [PubMed] [Google Scholar]
  7. Brie A., Boudaud N., Mssihid A., Loutreul J., Bertrand I., Gantzer C. Inactivation of murine norovirus and hepatitis A virus on fresh raspberries by gaseous ozone treatment. Food Microbiol. 2018;70:1–6. doi: 10.1016/j.fm.2017.08.010. [DOI] [PubMed] [Google Scholar]
  8. Chick H. An investigation of the laws of disinfection. J. Hyg. 1908;8(1):92–158. doi: 10.1017/s0022172400006987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Croker C., Hathaway S., Marutani S., Hernandez M., Cadavid C., Rajagopalan S. A virus infection among adult patients of a mental hospital – los Angeles County, 2017. Infect. Control Hosp. Epidemiol. 2018;39(7):881–882. doi: 10.1017/ice.2018.77. [DOI] [PubMed] [Google Scholar]
  10. Cui Z., Gong G. The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features. Neuroimage. 2018;178:622–637. doi: 10.1016/j.neuroimage.2018.06.001. [DOI] [PubMed] [Google Scholar]
  11. Dang X., Peng H., Wang X., Zhang H. Olemiss Edu; 2008. Theil-Sen Estimators in a Multiple Linear Regression Model.http://home.olemiss.edu/∼xdang/papers/MTSE.pdf 2008. [Google Scholar]
  12. Delignette-Muller M.L., Dutang C. fitdistrplus: an R package for fitting distributions. J. Stat. Software. 2015;64(4):1–34. [Google Scholar]
  13. Du H., Wu Y., Wu H., Li F. Effect of ozone pretreatment on characteristics of dissolved organic matter formed in aerobic and anaerobic digestion of waste-activated sludge. Environ. Sci. Pollut. Res. 2020 doi: 10.1007/s11356-020-10596-4. [DOI] [PubMed] [Google Scholar]
  14. Emerson M.A., Sproul O.J., Buck C.E. Ozone inactivation of cell-associated viruses. Appl. Environ. Microbiol. 1982;43(3):603–608. doi: 10.1128/aem.43.3.603-608.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Engelbrecht R.S., Weber M.J., Salter B.L., Schmidt C.A. Comparative inactivation of viruses by chlorine. Appl. Environ. Microbiol. 1980;40(2):249–256. doi: 10.1128/aem.40.2.249-256.1980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Farooq S., Akhlaque S. vol. 17. 1983. pp. 809–812. (Comparative Response of Mixed Cultures of Bacteria and Virus to Ozonation). (7) [Google Scholar]
  17. Finch G.R., Fairbairin N. Comparative inactivation of poliovirus type 3 and MS2 coliphage in demand-free phosphate buffer by using ozone. Appl. Environ. Microbiol. 1991;57(11):3121–3126. doi: 10.1128/aem.57.11.3121-3126.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fischler M.A., Bolles R.C. Random Sample Consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM. 1981;24(6):381–395. [Google Scholar]
  19. Haas C.N., Joffe J. Disinfection under dynamic conditions: modification of Hom’s model for decay. Environ. Sci. Technol. 1994;28:1367–1369. doi: 10.1021/es00056a028. [DOI] [PubMed] [Google Scholar]
  20. Hallowell D. WateReuse Foundation; 2014. Utilization of Hazard Analysis and Critical Control Points Approach for Evaluating Integrity of Treatment Barriers for Reuse. WRRF-09-03 PRoject Synopsis. [Google Scholar]
  21. Harakeh M.S., Bulter M. Factors increasing the ozone inactivation of enteric viruses in effluent. Ozone Sci. Eng. 1984;6(4):235–243. [Google Scholar]
  22. Harakeh M.S., Bulter M. Inactivation of human rotavirus, SA11 and other enteric viruses in effluent by disinfectants. J. Hyg. Camb. 1984;93:157–163. doi: 10.1017/s0022172400061040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hembach N., Alexander J., Hiller C., Wieland A., Schwartz T. Dissemination prevention of antibiotic resistant and facultative pathogenic bacteria ultrafiltration and ozone treatment at an urban wastewater treatment plant. Sci. Rep. 2019;9:12843. doi: 10.1038/s41598-019-49263-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hirneisen K.A., Markland S.M., Kniel K.E. Ozone inactivation of norovirus surrogates on fresh produce. J. Food Protect. 2011;74(5):836–839. doi: 10.4315/0362-028X.JFP-10-438. [DOI] [PubMed] [Google Scholar]
  25. Hoerl A.E., Kennard R.W. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970;12(1):55–67. [Google Scholar]
  26. Hom L.W. Kinetics of chlorine disinfection in an ecosystem. J. Stat. Eng. Duv. 1972;98(1):183–193. [Google Scholar]
  27. Hoque S.A., Kobayshi M., Takanashi S., Anwar K.S., Watanabe T., Khamrin P., Okitsu S., Hayakawa S., Ushijima H. Role of rotavirus vaccination on an emerging G8P[8] rotavirus strain causing an outbreak in central Japan. Vaccine. 2018;36:43–49. doi: 10.1016/j.vaccine.2017.11.056. [DOI] [PubMed] [Google Scholar]
  28. Ito T., Kitajima M., Kato T., Ishii S., Segawa T., Okabe S., Sano D. Target virus log10 reduction values determined for two reclaimed wastewater irrigation scenarios in Japan based on tolerable annual disease burden. Wat. Res. 2017;125(15):438–448. doi: 10.1016/j.watres.2017.08.057. [DOI] [PubMed] [Google Scholar]
  29. Kadoya S., Nishimura O., Kato H., Sano D. Predictive water virology: hierarchical Bayesian modeling for estimating virus inactivation curve. Water. 2019;11(10):2187. [Google Scholar]
  30. Kadoya S., Nishimura O., Kato H., Sano D. Regularized regression analyses for the prediction of virus inactivation efficiency by chloramine disinfection. Environ. Sci. Wat. Res. Technol. 2020;11:2187. doi: 10.1016/j.wroa.2021.100093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Katayama H., Haramoto E., Oguma K., Yamashita H., Tajima H., Nakajima H., Ohgaki S. One-year monthly quantitative survey of noroviruses, enteroviruses, and adenoviruses in wastewater collected from six plants in Japan. Wat. Res. 2008;42:1441–1448. doi: 10.1016/j.watres.2007.10.029. [DOI] [PubMed] [Google Scholar]
  32. Katzenelson E., Biedermann N. Disinfection of viruses in sewage by ozone. Wat. Res. 1976;10(7):629–631. [Google Scholar]
  33. Katzenelson E., Koerner G., Biedermann N., Peleg M., Shuval H.I. Measurement of the inactivation kinetics of poliovirus by ozone in a fast-flow mixer. Appl. Environ. Microbiol. 1979;37(4):715–718. doi: 10.1128/aem.37.4.715-718.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kharel S., Stapf M., Miehe U., Ekblad M., Cimbritz M., Falas P., Nilsson J., Sehlen R., Bester K. Ozone dose dependent formation and removal of ozonation products of pharmaceutical in pilot and full-scale municipal wastewater treatment plants. Sci. Total Environ. 2020;731:139064. doi: 10.1016/j.scitotenv.2020.139064. [DOI] [PubMed] [Google Scholar]
  35. Kim C.K., Gentile D.M., Sproul O.J. Mechanism of ozone inactivation of bacteriophage f2. Appl. Environ. Microbiol. 1980;39(1):210–218. doi: 10.1128/aem.39.1.210-218.1980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Leonard J., Flournoy J., Los angeles C.P.L., Whitaker K. How much motion is too much motion? Determining motion thresholds by sample size for reproducibility in developmental resting-state MRI. Res. Ideas. Outcomes. 2017;3 [Google Scholar]
  37. Li J., Lu X., Sun Y., Lin C., Li F., Yang Y., Liang Z., Jia L., Chen L., Jiang B., Wang Q. A swimming pool-associated outbreak of pharyngoconjunctival fever caused by human adenovirus type 4 in Beijing, China. Int. J. Infect. Dis. 2018;75:89–91. doi: 10.1016/j.ijid.2018.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lim M.Y., Kim J.-M., Lee J.E., Ko G.P. Characterization of ozone disinfection of murine norovirus. Appl. Environ. Microbiol. 2010;76(4):1120–1124. doi: 10.1128/AEM.01955-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lin Y., Fengling Li, Lianzhu W., Yuxiu Z., Yanhua J. Function of VP2 protein in the stability of the secondary structure of virus-like particles of genogroup II norovirus at different pH levels: function of VP2 protein in the stability of NoV VLPs. J. Microbiol. 2014;52(11):970–975. doi: 10.1007/s12275-014-4323-6. [DOI] [PubMed] [Google Scholar]
  40. Lu H., Ma X. Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere. 2020;249:126169. doi: 10.1016/j.chemosphere.2020.126169. [DOI] [PubMed] [Google Scholar]
  41. McGeady M.L., Siak J.-S., Crowell R.L. Survival of coxsackievirus B3 under diverse environmental conditions. Appl. Environ. Microbiol. 1979;37(5):972–977. doi: 10.1128/aem.37.5.972-977.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Meng Z.-D., Birch C., Heath R., Gust I. Physicochemical stability and inactivation of human and simian rotaviruses. Appl. Environ. Microbiol. 1987;53(4):727–730. doi: 10.1128/aem.53.4.727-730.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Meunier L., Canonica S., von Gunten U. Implications of sequential use of UV and ozone for drinking water quality. Wat. Res. 2006;40:1864–1876. doi: 10.1016/j.watres.2006.02.030. [DOI] [PubMed] [Google Scholar]
  44. Moher D., Libeerati A., Tetzlaff J., Altman D.G. Preferred reporting items for systematic reviews and meta-analysis: the PRISMA statement. PLoS Med. 2009;6 doi: 10.1371/journal.pmed.1000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Moore N.J., Margolin A.B. Efficacy of nucleic acid probes for detection of poliovirus in water disinfected by chlorine, chlorine dioxide, ozone, and UV radiation. Appl. Environ. Microbiol. 1994;60(11):4189–4191. doi: 10.1128/aem.60.11.4189-4191.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Moresco V., Damazo N.A., Baradi C.R.M. Thermal and temporal stability on the enteric viruses infectivity in surface freshwater. Wat. Supply. 2016;16(3):620–627. [Google Scholar]
  47. Park H.-S., Hwang T.-M., Kang J.-W., Choi H., Oh H.-J. Characterization of raw water for the ozone application measuring ozone consumption rate. Wat. Res. 2001;35(11):2607–2614. doi: 10.1016/s0043-1354(00)00564-9. [DOI] [PubMed] [Google Scholar]
  48. Park Y., Kim M., Pachepsky Y., Choi S.-H., Cho J.-G., Jeon J., Cho K.H. Development of a nowcasting system using machine learning approaches to predict fecal contamination levels as recreational beaches in Korea. J. Environ. Qual. 2018;47:1094–1102. doi: 10.2134/jeq2017.11.0425. [DOI] [PubMed] [Google Scholar]
  49. Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dobourg V., Vanderplas J., Passos A., Cournapeau D., Brucher M., Perrot M., Duchesnay E. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
  50. Roy D., Wong P.K.Y., Engelbrecht R.S., Chian E.S.K. Mechanisms of enteroviral inactivation by ozone. Appl. Environ. Microbiol. 1981;41(3):718–723. doi: 10.1128/aem.41.3.718-723.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Roy D., Chian E.S.K., Engelbrecht R.S. Mathematical model for enterovirus inactivation by ozone. Wat. Res. 1982;16:667–673. [Google Scholar]
  52. Safari M.J.S., Arashloo S.R., Mehr A.D. Rainfall-runoff modeling through regression in the reproducing kernel Hilbert space algorithm. J. Hydrol. 2020;587:125014. [Google Scholar]
  53. Sano D., Amarasiri M., Hata A., Watanabe T., Katayama H. Risk management of viral infectious diseases in wastewater reclamation and reuse: Review. Environ. Int. 2016;91:220–229. doi: 10.1016/j.envint.2016.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Shin G.-A., Sobsey M.D. Reduction of Norwalk virus, poliovirus 1, and bacteriophage MS2 by ozone disinfection of water. Appl. Environ. Microbiol. 2003;69(7):3975–3978. doi: 10.1128/AEM.69.7.3975-3978.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Shlindwein A.D., Rigotto C., Simoes C.M.O., Baradi C.R.M. Detection of enteric viruses in sewage sludge and treated wastewater effluent. Water Sci. Technol. 2010;61(2):537–544. doi: 10.2166/wst.2010.845. [DOI] [PubMed] [Google Scholar]
  56. Shneider C.A., Rasband W.S., Eliceiri K.W. NIH image to ImageJ: 25 years of image analysis. Nat. Methods. 2012;9(7):671–675. doi: 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Silva-Sales M., Martinez-Puchol S., Gonzales-Gustavsonm E., Hundesa A., Girones R. High prevalence of rotavirus A in raw sewage from northeast Spain. Viruses. 2020;12(3):318. doi: 10.3390/v12030318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Sin G., Gernaey K.V., Neumann M.B., van Loosdrecht M.C.M., Gujer W. Uncertainty analysis in WWTP model applications: a critical discussion using an example from design. Wat. Res. 2009;43:2894–2906. doi: 10.1016/j.watres.2009.03.048. [DOI] [PubMed] [Google Scholar]
  59. Sin G., Gernaey K.V., Neumann M.B., van Loosdrecht M.C.M., Gujer W. Global sensitivity analysis in wastewater treatment plant model applications: prioritizing sources of uncertainty. Wat. Res. 2011;45:639–651. doi: 10.1016/j.watres.2010.08.025. [DOI] [PubMed] [Google Scholar]
  60. Sohn J., Amy G., Cho J., Lee Y., Yoon Y. Disinfectant decay and disinfection by-products formation model development: chlorination and ozonation by-products. Wat. Res. 2004;38:2461–2478. doi: 10.1016/j.watres.2004.03.009. [DOI] [PubMed] [Google Scholar]
  61. Subahir M.N., Jeffree M.S., Hassan M., Mohamad S.M.G., Fong S.Y., Ahmed K. Norovirus outbreak among students of a boarding school in Kluang, Johor, Malaysia. J. Infect. Dec. Ctries. 2019;13(4):274–277. doi: 10.3855/jidc.11199. [DOI] [PubMed] [Google Scholar]
  62. Thurston-Enriquez J.A., Haas C.N., Jacangelo J., Gerba C.P. Inactivation of enteric adenovirus and feline calicivirus by ozone. Wat. Res. 2005;39:3650–3656. doi: 10.1016/j.watres.2005.06.006. [DOI] [PubMed] [Google Scholar]
  63. Tibshirani R. Regression shrinkage and selection via the lasso. J. Royal. Stat. Soc. Series. B. 1996;36:117–147. [Google Scholar]
  64. Tipping M.E. Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 2001;1:211–244. [Google Scholar]
  65. Tondera K., Klaer K., Gebhardt J., Wingender J., Koch C., Horstkott M., Strathmann M., Jurzik L., Hamza I.A., Pinnekamp J. Reducing pathogens in combined sewer overflows using ozonation or UV irradiation. Int. J. Hyg Environ. Health. 2015;218(8):731–741. doi: 10.1016/j.ijheh.2015.09.002. [DOI] [PubMed] [Google Scholar]
  66. U. S. Environmental Protection Agency . USEPA; Washington, DC: 1990. [Google Scholar]
  67. U. S. Environmental Protection Agency . 1999. Disinfection Profiling and Benchmarking Technical Guidance Manual. [Google Scholar]
  68. U.S. Food & Drug Administration HACCP principles and application guidelines. 1997. https://www.fda.gov/food/hazard-analysis-critical-control-point-haccp/haccp-principles-application-guidelines#:%7E:text=Seven%20basic%20principles%20are%20employed,and%20record%2Dkeeping%20and%20documentation
  69. Utsumi T., Wahyuni R.M., Dinana Z., Gunawan E., Putra A.S.D., Mubawadi T., Soetjipto, Lusida M.I., Shoji I. G2P[4] rotavirus outbreak in belu, east nusa tenggara province, Indonesia, 2018. J. Int. Public. Health. 2020 doi: 10.1016/j.jiph.2020.05.002. (in press) [DOI] [PubMed] [Google Scholar]
  70. Vaughn J.M., Chen Y.S., Lindburg K., Morales D. Inactivation of human and simian rotaviruses by ozone. Appl. Environ. Microbiol. 1987;53(9):2218–2221. doi: 10.1128/aem.53.9.2218-2221.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Wang H., Siroka P., Rutgersson C., Lindh M., Brodin T., Bjorlenius B., Larsson D.G., Norder H. Differential removal of human pathogenic viruses from sewage by conventional and ozone treatment. Int. J. Hyg Environ. Health. 2018;221:479–488. doi: 10.1016/j.ijheh.2018.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Ward R.L., Ashley C.S. pH modification of the effects of detergents on the stability of enteric viruses. Appl. Environ. Microbiol. 1979;38(2):314–322. doi: 10.1128/aem.38.2.314-322.1979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Watson, H. E., A note on the variation of the rate of disinfection with change in the concentration of the disinfectant. J. Hyg. 8(4), 536-542. [DOI] [PMC free article] [PubMed]
  74. WHO . third ed. World Health Organization; Geneva: 2006. Guidelines for the Use of Wastewater, Excreta and Greywater in Agriculture and Aquaculture. [Google Scholar]
  75. WHO . World Health Organization; Geneva: 2015. Sanitation Safety Planning: Manual for Safe Use and Disposal of Wastewater, Greywater and Excreta. [Google Scholar]
  76. WHO . World Health Organization; Geneva: 2017. WHO Methods and Data Sources for Global Burden of Disease Estimates 2000-2015. [Google Scholar]
  77. Wolf C., von Gunten U., Kohn T. Kinetics of inactivation of waterborne enteric viruses by ozone. Environ. Sci. Technol. 2018;52:2170–2177. doi: 10.1021/acs.est.7b05111. [DOI] [PubMed] [Google Scholar]
  78. Zou H., Hastie T. Regularization and variable selection via the elastic net. J. Royal. Stat. Soc. Series. B. 2006;67:301–320. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Multimedia component 1
mmc1.zip (3.5KB, zip)
Multimedia component 2
mmc2.xlsx (30.3KB, xlsx)
Multimedia component 3
mmc3.docx (9.6MB, docx)

Articles from Water Research X are provided here courtesy of Elsevier

RESOURCES