Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Feb 5.
Published in final edited form as: J Hazard Mater. 2021 Sep 11;423(Pt B):127141. doi: 10.1016/j.jhazmat.2021.127141

Machine Learning-based Biomarkers Identification from Toxicogenomics - Bridging to Regulatory Relevant Phenotypic Endpoints

Sheikh Mokhlesur Rahman 1,2, Jiaqi Lan 1,3, David Kaeli 4, Jennifer Dy 4, Akram Alshawabkeh 1, April Z Gu 1,5,*
PMCID: PMC9628282  NIHMSID: NIHMS1742899  PMID: 34560480

Abstract

One of the major challenges in realization and implementations of the Tox21 vision is the urgent need to establish quantitative link between in-vitro assay molecular endpoint and in-vivo regulatory-relevant phenotypic toxicity endpoint. Current toxicomics approach still mostly rely on large number of redundant markers without pre-selection or ranking, therefore, selection of relevant biomarkers with minimal redundancy would reduce the number of markers to be monitored and reduce the cost, time, and complexity of the toxicity screening and risk monitoring. Here, we demonstrated that, using time series toxicomics in-vitro assay along with machine learning-based feature selection (maximum relevance and minimum redundancy (MRMR)) and classification method (support vector machine (SVM)), an “optimal” number of biomarkers with minimum redundancy can be identified for prediction of phenotypic toxicity endpoints with good accuracy. We included two case studies for in-vivo carcinogenicity and Ames genotoxicity prediction, using 20 selected chemicals including model genotoxic chemicals and negative controls, respectively. The results suggested that, employing the adverse outcome pathway (AOP) concept, molecular endpoints based on a relatively small number of properly selected biomarker-ensemble involved in the conserved DNA-damage and repair pathways among eukaryotes, were able to predict both Ames genotoxicity endpoints and in-vivo carcinogenicity in rats. A prediction accuracy of 76% with AUC = 0.81 was achieved while predicting in-vivo carcinogenicity with the top-ranked five biomarkers. For Ames genotoxicity prediction, the top-ranked five biomarkers were able to achieve prediction accuracy of 70% with AUC = 0.75. However, the specific biomarkers identified as the top-ranked five biomarkers are different for the two different phenotypic genotoxicity assays. The top-ranked biomarkers for the in-vivo carcinogenicity prediction mainly focused on double strand break repair and DNA recombination, whereas the selected top-ranked biomarkers for Ames genotoxicity prediction are associated with base- and nucleotide-excision repair The method developed in this study will help to fill in the knowledge gap in phenotypic anchoring and predictive toxicology, and contribute to the progress in the implementation of tox 21 vision for environmental and health applications.

Keywords: machine learning, biomarker, genotoxicity, toxicogenomics, phenotypic anchor

Graphical abstract

graphic file with name nihms-1742899-f0005.jpg

INTRODUCTION

Genotoxicity is of great concern because of its link to mutagenicity, carcinogenicity as well as cancer, and there is urgent demand for genotoxicity screening and risk assessment for various environmental and health applications.14 In the absence of, or combined with, in-vivo carcinogenicity data, in-vitro or cell-based genotoxicity assays provide supporting data for cancer risk assessment.5 Recently, toxicogenomics has emerged to be a promising technology that reveals molecular-level activities, at the gene, protein, or metabolite level of organisms, in response to environmental contaminants and may represent the underlying cellular network mechanisms of toxicity responses.2,6 This also responds to the Tox21 vision that promotes a systematic transit from current in-vivo whole animal-based testing, to more in-vitro mechanistic pathway-based assays using high-throughput screening and tiered testing.78 However, one of the major challenges in realization and implementations of the Tox21 vision is the urgent need to establish quantitative link between in-vitro toxicogenomic assay molecular endpoint and in-vivo phenotypic regulatory relevant endpoints.

Establishing quantitative causal relationships between in-vitro assay endpoints to regulatory-relevant apical endpoints holds the key to the realization of predictive toxicology through practical and widespread implementation of in-vitro assay-based toxicity screening schemes and strategies for environmental and health applications.916 Adverse-outcome pathway (AOP) framework is the state-of-the-art approach to link mechanistic toxicity mechanisms with the phenotypic adverse outcome that would enable the assessment of health risk as well as ecotoxicological risks from exposure to pollutants and their mixtures.1721 Coalesce of effective biomarkers and proper predicting framework would enable more cost-effective and wider implementation of toxicomics in monitoring of genotoxicity and predict adverse toxic responses.2224 Subsequently, proper selection and validation of predicative biomarkers plays a crucial role in our ability to link molecular-level effects recorded in in-vitro assays to the in-vivo regulatory relevant phenotypic endpoints, or system-level impacts in many fields such as, environmental toxicity, disease prediction and health risk identification.2528

The rapid advancement in bioinformatics and machine learning methods enables more sophisticated biomarkers identification.10,2930 Current biomarker identification from toxicomics data employs feature selection and classification methods. Two general approaches of feature selection include filter and wrapper methods. The filter methods often provide relatively simpler and faster alternatives to select the most important features and the features are selected or filtered based on their relevance to differentiate a target outcome from others.31 The wrapper methods combine feature selection along with the classification method, where the features are judged based on their ability to increase the accuracy of the classification models.3233 However, the wrapper methods are often associated with extensive computational cost, and prone to possible overfitting when the sample size is relatively small.3435 In addition, since the selected features of the filter methods are independent of the classification method, they often have higher relevance to the target outcome than those derived from a wrapper method.31,34 Filter based feature selection methods that have been applied to toxicomics data (e.g., gene and protein expression data) include mutual information, statistical tests (t-test, F-test, chi-square), information gain,36 gainratio,30 and ReliefF36 among others. Though most of these algorithms find the important biomarkers based on their relevance and correlation to the target outcome, they do not address redundancy and overfitting issues. The maximum relevance and minimum redundancy (MRMR) algorithm aims to reduce the redundancy in datasets, while also identifying the most relevant features and biomarkers to predict the outcome.31,34 Furthermore, using the right classification algorithm for the problem is important in order to avoid overfitting by the model.37 The classification algorithms that have been used in the past to classify toxicogenomics data include k-nearest neighbor, naïve-Bayes, and support vector machines (SVMs). SVMs have been shown to yield reliable and efficient classification performances, while limiting overfitting, particularly for cases where the number of features is higher than the number of samples (as often seen with toxicogenomic data).29

Although a few isolated biomarkers have been used for genotoxicity detection in both environmental and human health applications, such as CYP1A1 and CYP1B1,38 RAD54,39 CYPR,38 A2m, Ca3, Cxcl1, and Cyp8b1,40 their correlation with phenotypic genotoxicity endpoints or carcinogenicity has not been quantified. Furthermore, the temporal dependencies of toxicogenomics responses have also not been considered in most cases, since most studies record a snapshot of the responses.38 It is still an open research area to identify relevant toxicogenomic-based biomarkers that quantitatively link in-vitro responses to regulatory relevant in-vivo toxicity endpoints, utilizing the temporal molecular response patterns.

In this study, we applied MRMR feature selection and SVM classification algorithm, to identify an ensemble of biomarkers from temporal toxicogenomic assays, for genotoxicity and carcinogenicity prediction and for bridging to regulatory relevant phenotypic endpoints via AOP. As per the AOP framework, molecular initiating event for DNA damage would link to an adverse outcome of genotoxicity at organism or population level that is relevant to risk assessment.13,19,41 We proposed and developed a novel quantitative toxicogenomics assay to evaluate mechanistic genotoxicity through detect and quantifying molecular level changes in proteins involved in known DNA damage repair pathways, to comply with the AOP concept.2,4243 The selected key proteins involved in all known DNA damage and repair stress response pathways are conserved among yeast and other eukaryotes including human, therefore is expected to capture AOP molecular effects at sub-cytotoxic dose levels that lead to phenotypic changes and adverse outcome.2,19,44 The protein expression changes, in exposure to each chemical, are monitored by employing a genotoxicity assay using GFP-tagged yeast reporter stains, covering 38 selected protein biomarkers indicative of all the seven known DNA damage repair pathways as discussed above.2,45 Two separate case studies — i) in-vivo rodent carcinogenicity and ii) Ames test-based genotoxicity prediction — are performed to identify the biomarker-ensembles for chemically-induced genotoxicity and carcinogenicity endpoint prediction. For each case study, six concentrations of 20 chemicals are selected that include model genotoxic compounds with reported endpoints and negative control without any reported genotoxic effects. Both in-vivo rodent carcinogenicity and Ames based genotoxicity are among the most widely used endpoints for genotoxicity assessment and they are being used in the National Toxicology Program46 and to prepare toxicity databases such as the Carcinogenic Potency Database (CPDB)47. The performance of the prediction models is evaluated by estimating the area under the receiver operating characteristics curve (AUC), as well as the classification accuracy, sensitivity, and specificity. The relationship between the number and identities of top-ranked biomarker selection and the prediction performances are assessed.

METHODOLOGY

Materials

A time-series toxicogenomic assay of 20 chemicals is evaluated in the current study, including model genotoxic compounds and negative control without any reported genotoxic effect. Details of the chemicals are provided in Table S1. Two types of genotoxicity endpoints are investigated, including i) in-vivo rodent carcinogenicity and ii) Ames genotoxicity assay. Both carcinogenicity and genotoxicity endpoint data is collected from the existing literature and are summarized in Table S1.2,48

Quantitative Toxicogenomic Assay for Genotoxicity Assessment

The proteomic based toxicogenomic assay employs a library of 38 in-frame GFP-fused reporter strains (key proteins) of Saccharomyces cerevisiae (Invitrogen, no. 95702, ATCC 201388), covering all known recognized DNA damage repair pathways, as reported in our previous work (Table S2).2,49 The reporter strains are constructed by oligonucleotide-directed homologous recombination to tag each open reading frame (ORF) with Aequrea victoria GFP (S65T) in its chromosomal location at the 3’ end. The assay library measures the expression of full-length, chromosomally tagged green fluorescent protein fusion proteins,50 where the GFP signal represents the protein expression, directly. The altered expression level is measured for 2-hr at every 5 minute, which is then integrated over the full exposure period to obtain the quantitative toxicity index — Protein Expression Level Index (PELI). The details of the proteomics assay, when using GFP-tagged yeast cells, were described in the Supporting Information (Text S1) and also in our previous reports.2,49,51

Scoring Criteria to Rank the Biomarkers

The selection of biomarkers is based on their contribution to differentiate protein expression level among the genotoxicity-negative and -positive chemicals. The maximum relevance is used as a scoring measure, which is quantified by estimating t-stat (Text S1) to find the relevance of biomarkers to a target outcome. The second scoring measure, maximum relevance minimum redundancy (MRMR), is applied to reduce the redundancy from the relevant biomarkers and quantified via penalizing the relevance for collinearity with other biomarkers in the library.

Maximum Relevance

Maximum relevance measures how the biomarkers are relevant to identify the genotoxicity-positive chemicals. t-stat criterion is used as a measure of the relevance.31 t-stat measures the difference between the mean of the features in genotoxicity-negative, and genotoxicity-positive class. The higher the t-stat value the higher the differences in average protein expression level between the two classes — genotoxicity-positive and -negative categories. Hence, the objective function of selecting the most significant biomarker is to maximize the relevance, Vt.

Vt=1|s|Σisti2;with the objective function:max(Vt) [1]

where, Vt measures the relevance of feature i,

ti is the t-stat of the protein i (Text S2), and

S is the number of features in the dataset.

Maximum Relevance Minimum Redundancy

Application of the maximum relevance criteria may identify multiple biomarkers that are correlated with each other. To reduce the redundancy, it is expected that, the selected biomarkers should contain only the uncorrelated proteins since multiple correlated proteins do not provide any additional information regarding the chemical class.31,52 Co-linearity can be measured using Pearson correlation between the genes that quantify redundancy, as described in the following.

Minimum redundancy:

The redundancy (Wc) of the biomarkers can be measured by the Pearson correlation of a biomarker with the rest of the biomarkers.

Wc=1|s|2Σi,j|c(i,j)|;with the objective function:min(Wc) [2]

where, c(i, j) is the correlation between proteins i and j (Text S2).

By combining the maximum relevance (Equation 1) and minimum redundancy (Equation 2), in two separate ways, two different MRMR scoring criteria can be computed. They are:

  1. MRMR-TCD (t-stat correlation difference criterion): In MRMR-TCD scoring method the relevance and redundancy are combined using difference, where the objective function is: maxiΩs{VtWc}. and

  2. MRMR-TCQ (t-stat correlation quotient criterion): MRMR-TCQ score is measured by combining the relevance and redundancy using quotient and the objective function is: maxiΩs{VtWc}. Since MRMR-TCQ apply greater penalty on redundancy than the MRMR-TCD, it has the potential to select the biomarkers that can generate models with better prediction accuracies.31

By utilizing the scoring measures, mentioned here, three scoring criteria — t-stat, MRMR-TCD, and MRMR-TCQ — are adopted to find the ranks of the biomarkers. The rank measures of all the scoring methods are calculated using a 10-fold cross-validation (CV). The dataset, which is composed of 120 samples resulting from toxicogenomic responses of 20 chemicals with six concentrations each, is randomly divided into 10 equal sized subsets and nine of the ten subsets are used as the training data to get the score. This process is repeated 10 times with different training dataset, achieved by leaving out each of the ten subsets exactly once from the training dataset. The overall score, which is used as the ranking measure, is calculated as the average of the scores from the ten folds.

Classification Algorithm

The prediction ability of the selected top-ranked biomarkers is evaluated by fitting a classification model, using SVM with linear kernel as the classification algorithm. SVM is chosen due to its power of avoiding overfitting when the feature space is very large.29,53 Since the number of estimation parameters in SVM is dependent on number of samples rather than features, SVM is more suitable for dataset with low sample-to-feature ratio.29 Performance of SVM is also less susceptible to noisy data, as it is known to reach the global minimum of objective function.54 To further control the overfitting problem, we have conducted the classification analysis using 10-fold CV approach, which is a widely used approach in classification problems as it creates a balance between the bias and variances in the error as well as makes the computation efficient.5556 In a 10-fold CV, the data are randomly divided into 10 equal sized folds, and among them 9 folds are used as the training dataset to generate classification model with the remaining fold as prediction test.57 The classification is conducted for 10 times by using each subset as the test dataset exactly once and the remaining nine subsets as training dataset. For estimating classification models, six concentrations of each chemicals are considered as separate samples. Thus, resulted sample size is 120 for 20 chemicals with six concentrations each. We have further conducted permutation test to assess the statistical significance of the classification models, where the null hypothesis of features and the labels being independent is tested.5859 Permutation test is also used to assess the overfitting issue as used in the literature.60 The class labels are altered randomly and classification models are built following the same setup of 10-fold CV with a linear SVM. Classification performances are measured for 1000 random iterations. From these performance measures, p-value of the selected model is estimated by calculating the fraction of the models for dataset with randomized class labels that performed better or equivalent to the selected model.58

Classification Performance Measuring Criteria

Receiver operating characteristics (ROC)61 curve of the classification models are obtained from the fitted models, and the area under the ROC curve (AUC) is estimated to evaluate the classification model efficiency.6263 The classification accuracy, sensitivity, and specificity are also calculated as additional performance measures. Accuracy is defined as the proportion of all the samples (both true positives and true negatives) that are identified correctly among the total number of samples.64 Sensitivity measures the proportion of positives that are correctly identified and specificity measures the proportion of negatives that are correctly identified as such.65 The accuracy, sensitivity, and specificity of the models are estimated from the test datasets of all 10 folds, mentioned above, and average of the classification models of the 10 folds are reported as the CV-accuracy, -sensitivity, and -specificity. Both MRMR feature selection and SVM classification algorithm are implemented in MATLAB 2017a (Mathworks, Natick, MA).

Gene Ontology

Gene ontology (GO) analysis is performed using the Functional Specification resource, FunSpec,66 to determine the significantly represented functional categories that are associated with the selected top-ranked biomarkers. The significant biological categories are obtained using a hypergeometric distribution with a p-value threshold of 0.01 — which represent that, the association between a given gene set and a given functional category does not occur at random.67

RESULTS AND DISCUSSION

Identification of Toxicogenomics-based Biomarkers for In-vivo Carcinogenicity Prediction

The most relevant protein biomarkers are identified based on their rank measures and ability to differentiate the altered expression level between the carcinogenicity-positive and -negative compounds. Three separate scores according to the three ranking criteria, namely t-stat, MRMR-TCD and MRMR-TCQ, as described in the methods section, were used. Figure 1 shows the most relevant biomarkers that have higher scores and assumingly higher relevancy to in-vivo carcinogenicity. The higher t-stat score represents a larger difference of the altered expression level between the average biomarker-responses in the carcinogenicity-positive and -negative chemicals (Figure 1-a). However, presence of collinearity with other potential biomarkers may result in redundancy among the top-ranked biomarkers, with higher scores. The MRMR scoring criteria helps to eliminate redundancy by penalizing the co-linearity for redundancy elimination. Two MRMR-based ranking criteria (MRMR-TCD, and MRMR-TCQ) rank the proteins after penalizing the t-stat score for the presence of collinearity, which is used as a measure of redundancy (Figure 1-b, c). Due to the penalization for redundancy, the rank of a few of the biomarkers are different for the MRMR-based ranking methods than those for the t-stat-based ranking. As the relevance is divided by the redundancy in the MRMR-TCQ, the penalty for collinearity is high in this case. Subsequently, the margin between the high ranked biomarkers and low ranked biomarkers are higher in the case of MRMR-TCQ ranking than the results obtained from MRMR-TCD score. Disregard the ranking algorithms used, the top-ranked five protein biomarkers are identified to be the same for all three scoring methods, although their rank sequences varied. These five proteins, namely NTG2, RAD34, RAD27, MSH2, and YKU70, can be considered to have high relevance with very little redundancy, which make them suitable as potential biomarkers for in-vivo carcinogenicity prediction.

Figure 1.

Figure 1.

Scores of the biomarker based on different scoring criteria for Carcinogenicity endpoint prediction. The biomarkers are sorted in descending order of their score. The higher the score, the more important the biomarker is. a) Ranking criterion: t-stat. This ranking method is based on maximum relevance alone. The t-stat measures the significant differences between the mean of the biomarker feature (in this specific case: PELI) in the positive and negative carcinogenic chemicals. The higher the difference in mean of the biomarkers in the two classes, the more important the biomarker is. b) Ranking criterion: MRMR-TCD. This ranking criterion is based on the maximum relevance and minimum redundancy. The t-stat measures the relevance to the endpoint. Collinearity among different biomarkers can measure the redundancy of the biomarker. In the MRMR-TCD ranking method, average correlation of the biomarker is subtracted from the t-stat score to remove the redundancy. c) Ranking criterion: MRMR-TCQ. This criterion is also based on the maximum relevance and minimum redundancy. The t-stat is divided by the average correlation, to get the maximum relevance and minimum score.

A heat-map of the molecular genotoxicity quantifier, PELI (protein effect level index) values, shows the differential expression pattern of the top-ranked biomarkers among carcinogenicity-positive and -negative chemicals (Figure 2). The preferred top-ranked biomarker(s) exhibited significantly higher altered protein expression level in carcinogenicity-positive chemicals, whereas significantly less or negligible (below detection limit PELI < 1.5)2,49 altered expression level for the carcinogenicity-negative chemicals. In contrast, the bottom-ranked protein had similar average PELI values in both carcinogenicity-positive and -negative chemicals.

Figure 2.

Figure 2.

Heatmap of the PELI values (colormap as indicated by the color bar) of all the protein biomarkers covering all known DNA damage and repair pathways in yeast (Saccharomyces cerevisiae), upon exposure to 20 chemicals (rows). For each chemical, responses to 6 doses are presented with concentrations vary from lowest to highest from top to bottom (noted on the right-side of vertical axis, all concentrations are listed in Table S1). The Biomarkers (columns) are sorted (from left to right) as per their ranks based on the MRMR-TCQ score (Figure 1-c). The carcinogenicity-positive chemicals (rows) are labeled in red color, and the carcinogenicity-negative chemicals are labeled in black color.

Performance evaluation of the top-ranked biomarkers for in-vivo carcinogenicity prediction

We evaluate the number and the ability of the top-ranked protein biomarkers as biomarker-ensemble to predict chemical-induced in-vivo carcinogenicity. SVM is used to classify and predict the carcinogenetic compounds with varying number of top-ranked biomarkers. The AUC is used as the classifier performance evaluation criterion, which indicates the stability of classification model with AUC value closer to 1 representing a stable classifier.68 For the in-vivo carcinogenicity prediction, AUC is 0.81 with SVM model using the top-five ranked biomarkers (Table 1). The increase in the number of biomarkers in the classifier model, as expected, leads to increase in the AUC. The maximum AUC value of the models is 0.88, when all the 38 proteins in the library are used.

Table 1.

Prediction model performance of the two case-studies. The model performance measures include AUC, classification accuracy, sensitivity, and specificity. The prediction models are generated with top-ranked biomarkers as well as with all the proteins in the assay library.

Number of Biomarkers utilized in the classification model AUC Classification Performance Metric
Accuracy Sensitivity Specificity
In-vivo carcinogenicity prediction models
Top Five 0.81 76% 74% 79%
All 0.88 83% 81% 86%
Ames genotoxicity prediction models
Top Five 0.75 70% 63% 81%
All 0.84 78% 79% 75%

Furthermore, the cross-validated accuracy, sensitivity, and specificity of the 10-fold models are estimated as additional performance measuring criteria. Top-ranked five biomarkers can obtain 76% accuracy for the carcinogenicity prediction, while the classification model with all the proteins can achieve 83% accuracy. The sensitivity and specificities also increase with the increase of the number of biomarkers in the classifier models. For example, sensitivity of the classification model is 74% for the prediction model with top five biomarkers and it reached 81% with all the proteins. Specificity increases from 79%, for the model with the top five biomarkers, to 86% for the model where all the proteins are used as features. Relatively high CV-accuracies, which are measured from the prediction of the test dataset, indicate the robustness of the prediction model obtained through the SVM classification model. Although, as expected, higher number of features in the classification models improves the accuracy and the stability, the top five identified most relevant biomarkers can achieve classification accuracy of ∼75%, implying a possible cost- and time-effective in-vivo carcinogenicity screening tool with the time-series toxicogenomic assay.

Permutation test is conducted by randomly reordering the class label and generating classification models. The process is repeated for 1000 times with both top-five biomarkers and all 38 proteins as features. AUC and accuracies are measured for all of these iterations. AUC and accuracies of the models with randomized datasets are less than the AUC and accuracy of the selected models (Figure S1). Subsequently, resulting p-values for the AUC and accuracy of the selected models, with top-five biomarkers as well as all proteins, are less than 0.001. This indicates that the selected classification models are significant, and overfitting may not be present in the selected models for in-vivo carcinogenicity prediction.

We have compared the performance of the SVM classification algorithm with performance of other classification algorithms namely, decision tree, random forest, and Gaussian naïve Bayes. We have estimated in-vivo carcinogenicity classification models with different algorithms, where all 38 proteins are used as the features. All models are generated using 10-fold cross validation, and performance metrics are calculated for each model (Table S4). Among four classification models, model generated by SVM performs the best. AUC of the SVM-derived model is the highest. Similarly, accuracy, sensitivity, and specificity of the models derived by the SVM models are higher than all three other models derived by decision tree, random forest, and Gaussian naïve Bayes algorithms. We have also compared the impact of selecting different number of features as biomarker for carcinogenicity classification. The results suggest that with varying numbers of features as biomarkers model performances are different (Table S5). Model AUC and accuracy improve slightly with higher number biomarkers (with Top 15 and Top 20 biomarkers). However, with Top-10 biomarkers both AUC and accuracy are less than that of model with Top-five biomarkers. This further suggests that, while higher number of biomarkers may provide slightly better prediction, considering the goal of choosing minimal biomarkers to reduce cost and increase throughput, selection of the top-five biomarkers would be sufficient to provide robust, quick, and cost-effective tool for in-vivo carcinogenicity prediction for larger number of environmental samples.

Identification of Toxicogenomics-based Biomarkers for Ames Genotoxicity Prediction

We performed a second case study, where we attempted to identify the protein biomarkers for the Ames genotoxicity endpoint prediction. The most important biomarkers are identified from their ranking scores, which are based on their difference in altered protein expression level between the Ames genotoxicity-positive and -negative compounds. The three separate scores of each biomarker, obtained from the three ranking criteria (t-stat, MRMR-TCD and MRMR-TCQ), are presented in Figure 3. The higher difference between the altered expression level of a protein in the genotoxicity-positive and -negative chemicals results in relatively higher t-stat scores (Figure 3-a). Hence, the top-ranked proteins, with higher t-stat scores, have the higher relevance to differentiate Ames genotoxicity-positive chemicals from the genotoxicity-negative chemicals. The MRMR-TCD and MRMR-TCQ based scores differ from the t-stat score, especially for the proteins that possess collinearity with other proteins in the assay (Figure 3-b, c). However, the identified top-ranked five protein biomarkers remain the same irrespective of the ranking methods employed, although their rank sequences varied. These five proteins, namely APN2, RFA2, NTG2, RAD2, and MSH6, are considered as potential biomarkers for the Ames genotoxicity prediction.

Figure 3.

Figure 3.

Scores of the biomarker based on different scoring criteria for Ames-genotoxicity assay prediction. The biomarkers are sorted in descending order of their score. The higher the score, the more important the biomarker is. a) Ranking criterion: t-stat. This ranking method is based on the maximum relevance. The t-stat measures the significant differences between the mean of the biomarker feature (in this specific case: PELI) in the chemicals with Ames assay positive and negative responses. The higher the difference in mean of the biomarkers in the two classes, the more important the biomarker is. b) Ranking criterion: MRMR-TCD. This ranking criterion is based on the maximum relevance and minimum redundancy. The t-stat measures the relevance to the endpoint. Collinearity among different biomarkers can measure the redundancy of the biomarker. In the MRMR-TCD ranking method, average correlation of the biomarker is subtracted from the t-stat score to remove the redundancy. c) Ranking criterion: MRMR-TCQ. This criterion is also based on the maximum relevance and minimum redundancy. The t-stat is divided by the average correlation, to get the maximum relevance and minimum score.

A heat-map of the molecular genotoxicity quantifier, PELI, further shows the distinct expression profiles of the top ranked biomarkers in the genotoxicity-positive and -negative chemicals (Figure 4). The top-ranked biomarkers show relatively higher protein expression level in genotoxicity-positive chemicals than the expression level in genotoxicity-negative chemicals that are often below toxicity threshold (PELI < 1.5).2,49 For the bottom-ranked proteins, the difference in expression levels across genotoxicity-positive and -negative chemicals decreases, which further suggests their inadequacy to separate genotoxicity-positive chemicals from the genotoxicity-negative chemicals.

Figure 4.

Figure 4.

Heatmap of the PELI values (colormap as indicated by the color bar) of all the protein biomarkers covering all known DNA damage and repair pathways in yeast (Saccharomyces cerevisiae), upon exposure to 20 chemicals (rows). For each chemical, responses to 6 doses are presented with concentrations vary from lowest to highest from top to bottom (noted on the right-side of vertical axis, all concentrations are listed in Table S1).The Biomarkers (columns) are sorted (from left to right) as per their ranks based on the MRMR-TCQ score (Figure 3-c). The Ames genotoxicity-positive chemicals (rows) are labeled in red color, and the genotoxicity-negative chemicals are labeled in black color.

Performance evaluation of the top-ranked biomarkers for Ames genotoxicity prediction

The prediction ability of the top-ranked biomarker-ensemble is evaluated by using the SVM to classify and predict compounds with Ames genotoxicity with varying number of top-ranked biomarkers. The classifier performances are evaluated by estimating the AUC of the SVM models (Table 1). AUC of the model with top-ranked five biomarkers is 0.75 and that of the model with all 38 proteins is 0.84. AUC values of both the models represent stable classification models, since the AUC value equal to 0.5 indicates a random classification model and closer to 1 represents a stable model68. As expected, the prediction model with all the proteins has higher AUC than the model with the top-five biomarkers.

The estimated model accuracy, sensitivity, and specificity provide additional measures of the prediction performance. Using only the top-ranked five biomarkers can achieve 70% accuracy, which increased to 78% for the model with all 38 proteins. The sensitivity (63% for top-five biomarkers and 79% for all 38 proteins) of the prediction model also increases with the increase in number of biomarkers in the classification model. Though the specificity of the prediction model with top-ranked five biomarkers (81%) is higher than that of the model with all 38 proteins (75%), the specificity of the model with all the protein biomarkers are closer to its accuracy representing relatively low model-bias. It can be inferred that, though higher number of biomarkers results in improvement of prediction, the top-ranked five biomarkers can achieve relatively high prediction accuracy (70%). Therefore, possible application of time-series toxicogenomic assay with the most relevant top-ranked biomarkers may provide a balance of cost- and time-effective Ames genotoxicity screening for Ames effect assessment and for further in-depth studies.

Permutation test is conducted by randomly reordering the class label and generating classification models for 1000 times. AUC and accuracies of models with both top-five biomarkers and all 38 proteins as features are measured for all iterations. AUC and accuracies of the models with randomized datasets are less than the AUC and accuracy of the selected models for Ames genotoxicity prediction (Figure S2). Subsequently, resulting p-values for the AUC and accuracy of the selected models, with top-five biomarkers as well as all 38 proteins, are less than 0.001. This indicates that the selected classification models are significant, and overfitting may not be present in the selected models.

Similar to in-vivo carcinogenicity prediction, SVM performs better than other three classification algorithms (decision tree, random forest, and Gaussian naïve Bayes) in classifying Ames genotoxicity, when 38 proteins are used as the features (Table S4). AUC, accuracy, sensitivity, and specificity of the SVM-derived model is the highest for Ames genotoxicity prediction models. Comparison of models with different number of features as biomarker shows that Top-five biomarkers produce a model with higher AUC and comparable accuracy (Table S5) than models with higher number of biomarkers (Top 10, 15 and 20 biomarkers). Addition of features on top of the highest ranked five biomarkers does not necessarily lead to the significant improvement of the prediction performances, except when all 38 proteins are used as the features. With the balance of both the consideration of prediction accuracy and the desire for high throughput and cost-effective assay for environmental applications, we suggest the selection of top-five biomarkers that would provide a quick, and cost-effective tool for Ames genotoxicity prediction. The user may choose different number of biomarkers for specific accuracy target.

Gene Ontology Analysis Reveals the Biological Processes and Molecular Functions of the Top-Ranked Biomarkers

Gene ontology (GO)67 analysis is conducted to reveal the biological processes and molecular functions of the top-ranked biomarkers. Biological processes and molecular functions that are significantly associated with the top-ranked five biomarkers are determined using the hypergeometric distribution with a threshold p-value of 0.01, which indicates that the associations between the biomarkers and the GO terms are not at random.6667

Biological processes and molecular functions of the biomarkers for carcinogenicity prediction

GO analysis reveals that, the most significantly enriched biological processes of the top-ranked five biomarkers (RAD34, RAD27, YKU70, NTG2, MSH2), for the in-vivo carcinogenicity prediction, are associated with DNA repair and response to DNA damage stimulus (Table S3). Additionally, none of the biomarkers is annotated with the functional categories that represent other types of response pathways. RAD27 and YKU70 are associated with double strand break repair. DNA recombination involves YKU70 and MSH2. The biomarkers are also relevant to the base-excision repair and double-strand break.

The significantly enriched molecular functions associated with the top-ranked five biomarkers are also relevant to the DNA damage and repair activities. Damaged DNA binding is associated with RAD34 and YKU70. Endonuclease activity is associated with RAD27, and NTG2. The other relevant molecular functions, associated with at least one of the top-ranked five biomarkers, include single base insertion or deletion binding, DNA binding, DNA insertion of deletion binding, double-strand/single-strand DNA junction binding, and mismatched DNA binding (Table S3). The top-ranked five biomarkers for the in-vivo carcinogenicity prediction mainly focused on double strand break repair and DNA recombination, which result in a severe type of DNA damage and may lead to genomic instability and cancer.6970 This suggests that the selected biomarker-ensemble have the potential to identify severe DNA damage and relevant carcinogenic potency of a chemical.

Biological processes and molecular functions of the biomarkers for Ames genotoxicity prediction

The most significantly enriched biological processes of the top-ranked five biomarkers for the Ames genotoxicity prediction are also revealed via GO analysis. These biological functional categories are also associated with DNA damage and repair activities, and do not show any association with other stress response pathways. The top-ranked five biomarkers (APN2, RFA2, NTG2, RAD2, MSH6) are associated with the DNA repair process and four of them (APN2, NTG2, RAD2, MSH6) are related to the response to DNA damage stimulus biological process. Other biological processes, which show significant association with at least one of the top-ranked biomarkers, include base-excision repair, nucleotide-excision repair, DNA unwinding involved in replication, and double-strand break repair via homologous recombination (Table S3).

The most significantly enriched molecular functions of the top-ranked five biomarkers are also related to DNA damage and repair activities. Endonuclease activity is associated with three of the top-five biomarkers (APN2, RAD2, NTG2). DNA binding molecular function is also associated with three biomarkers, which are APN2, MSH6, and RAD2. Other relevant molecular functions that are significantly associated with at least one of the top-five biomarkers include DNA (apurinic or apyrimidinic site) lyase activity, single-stranded DNA binding, double-stranded DNA specific 3’−5’ exodeoxyribonuclease activity, single base insertion or deletion binding, four-way junction DNA binding, DNA binding, and mismatched DNA binding. The selected top-ranked biomarkers for Ames genotoxicity prediction are associated with base- and nucleotide-excision repair. Since Ames genotoxicity test only detects the frame-shift mutation and base-pair substitution,7172 it captures certain genotoxicity effects, and results in the selection of different biomarker-ensemble than those in the in-vivo carcinogenicity prediction case-study.

Environmental Implications

Presence of numerous contaminants from various (e.g., wastewater effluent, industrial wastewater discharge, oil and chemical spills, urban runoff, agricultural runoff) sources make the risk assessment and monitoring challenging.17 Monitoring of chemical risks in environmental samples by targeted chemical analysis often becomes ineffective because of the presence of complex mixture of chemicals of various origins and their transformation products.7374 High-throughput in vitro assays and AOP-based approach is promising for the assessment of health and ecotoxicological risks from exposure to pollutants and their mixtures, , as the bioassays can capture the responses due to integral effect of a complex mixture.1721,7576 In complement to the chemical analysis, in vitro bioassays are used as a tool for assessment of risks from environmental samples including monitoring of drinking water quality,73 assessment of river water quality,77 effects of leachates from microplastics78. Multiple bioassays covering different stages of cellular toxicity pathways covering various modes of action were used for risk assessment of these environmental samples, which were also recommended in the literature.74 The markers comprising various bioassays are mostly selected based on their association with relevant toxicity response pathways, without any quantitative association. Quantitative correlation between in vitro bioassays and in vivo phenotypic endpoints are particularly important for translation of relevant in vitro responses into regulatory relevant endpoints.79 Establishment of a quantitative AOP framework through integration of computational modeling with in-vitro assays would require identification and selection of relevant biomarkers from appropriate bioassays that link to risk assessment pathways and phenotypic impacts.18,21,80 Current toxicomics approach still mostly rely on large number of redundant markers without pre-selection or ranking;3,51,81 therefore, selection of relevant biomarkers with minimal redundancy would reduce the number of markers to be monitored and reduce the cost, time, and complexity of the toxicity risk monitoring. As discussed earlier, since carcinogenicity involve many pathways and relevant biomarkers, it is preferred to employ combined biomarkers approach such as the biomarkers ensemble in this study to improve the reliability of the biomarkers-based approach. With selective chemicals covering various types of toxicity range, the present study focuses on developing a methodology that can be employed to effectively find relevant biomarkers combination to improve the overall accuracy for predicting regulatory-relevant phenotypic endpoints. It is to be noted, the accuracy reported in this study could vary depending on the selection of chemicals for training dataset. This study demonstrated that the proposed method can effectively identify a small set of relevant biomarkers, provided the chemicals in the dataset are sufficiently large.

The acceptable sample size for using SVM analysis varied widely. In this study, we applied the developed methods to a dataset with 120 samples and 38 features. In the literature, the cases that employed SVM had sample size varying from approximately 20 to as high as thousands. For example, several studies were conducted for biomarker discovery from microarray and gene-expression data with sample size of around 100 or less.29,31,34,8283 Some of those study used datasets with number of features higher (multiple order) than number of samples.29 Furthermore, SVM is selected as the classification algorithm because of its power of avoiding overfitting when the feature space is very large29,53 and its less susceptibility to noisy data54. However, SVM may fail to obtain a best possible model due to the presence of imbalance data. Furthermore, SVM may become complex and computation-heavy for multi-class data.84 Though this should be taken into account while applying this method for a multiclass dataset, SVM works quite well for binary class dataset as used in this study. Choosing an advanced kernel, adjusting bias, providing weight to class and prediction errors may help to improve the accuracy, sensitivity, and specificity of the SVM classifier.8586 In this study, we observed clear change in the slopes in the score values of biomarkers (Figure 1 and Figure 3), and the top-ranked proteins biomarkers, with higher MRMR scores, were therefore recommended based on this transition point. Higher relevance of these top-ranked biomarkers with phenotypic toxicity enabled the differentiation of toxicity-positive chemicals from the genotoxicity-negative chemicals. Note that there is no absolute proof of a threshold effect, so the selection and the number of top-ranked biomarkers could vary depending on the data and users’ decision. Though the method developed in this study for prediction of chemical toxicity, a similar approach can be used to assess the toxicity in environmental samples using toxicogenomic responses. The method developed in this study will help to fill in the knowledge gap in phenotypic anchoring and predictive toxicology, and contribute to the progress in the implementation of tox 21 vision for environmental and health applications.

Supplementary Material

Supplementary

Highlights.

  • Feature selection method, MRMR, was employed to identify most relevant biomarkers for genotoxicity prediction

  • Machine learning-based classification method enabled phenotypic toxicity prediction using selected optimal biomarkers

  • Key biomarkers associated with DNA-damage and repair pathways could predict in-vivo carcinogenicity and Ames genotoxicity

  • Molecular endpoints were validated and correlated with regulatory relevant phenotypic endpoints

ACKNOWLEDGEMENTS

This study is funded by National Science Foundation (NSF, CBET-1437257/ and IIS-1546428), National Institute of Environmental Health Sciences (NIEHS) grants P42ES017198 and P50ES026049, and U.S. Environmental Protection Agency (EPA) grant R83615501. The authors would like to thank Guangyu Li for his contribution to this study.

Footnotes

SUPPORTING INFORMATION

Tables listing the selected concentration ranges and sources of the chemicals, in-vivo carcinogenicity and Ames genotoxicity endpoints along with their sources, description and data processing steps of the yeast proteomic library used in this study, comparative performances of prediction models with different classification algorithms and number of features, and the summary of gene ontology analysis of the selected biomarkers are provided in the Supporting Information (SI). The data that we used for analysis is available upon request. The data is also hosted to a GitHub repository (available at https://github.com/mokhles002/biomarkerSelection_project).

CRediT authorship contribution statement

Sheikh Mokhlesur Rahman: Conceptualization, Methodology, Data curation, Formal analysis, Validation, Visualization, Writing - original draft, Writing - review & editing; JiaQi Lan: Methodology, Data curation, Formal analysis, Validation, Writing - review & editing; David Kaeli: Data processing and anlysis, Formal analysis; Jennifer Dy: data analysis methods, Formal analysis, Visualization, writing-review and editing ; Akram Alshawabkeh: Funding acquisition, Project administration, Resources, Supervision, Validation, April Z. Gu: Supervision, Conceptualization, Methodology, Formal analysis, Funding acquisition, Project administration, Resources, Validation, Writing - review & editing.

Declaration of Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCE

  • (1).USEPA TSCA chemical substance inventory. https://www.epa.gov/tsca-inventory/abouttsca-chemical-substance-inventory (accessed November, 2016).
  • (2).Lan J; Gou N; Rahman SM; Gao C; He M; Gu AZ, A quantitative toxicogenomics assay for high-throughput and mechanistic genotoxicity assessment and screening of environmental pollutants. Environ. Sci. Technol. 2016, 50 (6), 3202–3214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (3).Gou N; Yuan S; Lan J; Gao C; Alshawabkeh AN; Gu AZ, A quantitative toxicogenomics assay reveals the evolution and nature of toxicity during the transformation of environmental pollutants. Environ. Sci. Technol. 2014, 48 (15), 8855–8863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (4).Gou N; Gu AZ, A new transcriptional effect level index (TELI) for toxicogenomics-based toxicity assessment. Environ. Sci. Technol. 2011, 45 (12), 5410–5417. [DOI] [PubMed] [Google Scholar]
  • (5).Ahn J-M; Hwang ET; Youn C-H; Banu DL; Kim BC; Niazi JH; Gu MB, Prediction and classification of the modes of genotoxic actions using bacterial biosensors specific for DNA damages. Biosens. Bioelectron. 2009, 25 (4), 767–772. [DOI] [PubMed] [Google Scholar]
  • (6).Altenburger R; Scholz S; Schmitt-Jansen M; Busch W; Escher BI, Mixture toxicity revisited from a toxicogenomic perspective. Environ. Sci. Technol. 2012, 46 (5), 2508–2522. [DOI] [PubMed] [Google Scholar]
  • (7).National Research Council Toxicity testing in the 21st century: A vision and a strategy; National Academies Press, Washington DC, 2007. [Google Scholar]
  • (8).Andersen ME; Krewski D, Toxicity testing in the 21st century: Bringing the vision to life. Toxicol. Sci. 2009, 107 (2), 324–330. [DOI] [PubMed] [Google Scholar]
  • (9).Benigni R, Predictive toxicology today: The transition from biological knowledge to practicable models. Expert Opinion on Drug Metabolism & Toxicology 2016, 12 (9), 989–992. [DOI] [PubMed] [Google Scholar]
  • (10).Thomas RS; Bao W; Chu T-M; Bessarabova M; Nikolskaya T; Nikolsky Y; Andersen ME; Wolfinger RD, Use of short-term transcriptional profiles to assess the long-term cancer-related safety of environmental and industrial chemicals. Toxicol. Sci. 2009, 112 (2), 311–321. [DOI] [PubMed] [Google Scholar]
  • (11).Thomas RS; Black MB; Li L; Healy E; Chu T-M; Bao W; Andersen ME; Wolfinger RD, A comprehensive statistical analysis of predicting in vivo hazard using high-throughput in vitro screening. Toxicol. Sci. 2012, 128 (2), 398–417. [DOI] [PubMed] [Google Scholar]
  • (12).Ankley GT; Daston GP; Degitz SJ; Denslow ND; Hoke RA; Kennedy SW; Miracle AL; Perkins EJ; Snape J; Tillitt DE; Tyler CR; Versteeg D, Toxicogenomics in regulatory ecotoxicology. Environ. Sci. Technol. 2006, 40 (13), 4055–4065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (13).Kramer VJ; Etterson MA; Hecker M; Murphy CA; Roesijadi G; Spade DJ; Spromberg JA; Wang M; Ankley GT, Adverse outcome pathways and ecological risk assessment: Bridging to population-level effects. Environ. Toxicol. Chem. 2011, 30 (1), 64–76. [DOI] [PubMed] [Google Scholar]
  • (14).Muth-Köhne E; Westphal-Settele K; Brückner J; Konradi S; Schiller V; Schäfers C; Teigeler M; Fenske M, Linking the response of endocrine regulated genes to adverse effects on sex differentiation improves comprehension of aromatase inhibition in a fish sexual development test. Aquat. Toxicol. 2016, 176 (Supplement C), 116–127. [DOI] [PubMed] [Google Scholar]
  • (15).Bradbury SP; Feijtel TCJ; Leeuwen CJV, Meeting the scientific needs of ecological risk assessment in a regulatory context. Environ. Sci. Technol. 2004, 38 (23), 463A–470A. [DOI] [PubMed] [Google Scholar]
  • (16).Conolly RB; Ankley GT; Cheng W; Mayo ML; Miller DH; Perkins EJ; Villeneuve DL; Watanabe KH, Quantitative adverse outcome pathways and their application to predictive toxicology. Environ. Sci. Technol. 2017, 51 (8), 4661–4672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (17).Blalock BJ; Robinson WE; Loguinov A; Vulpe CD; Krick KS; Poynton HC, Transcriptomic and Network Analyses Reveal Mechanistic-Based Biomarkers of Endocrine Disruption in the Marine Mussel, Mytilus edulis. Environ. Sci. Technol. 2018, 52 (16), 9419–9430. [DOI] [PubMed] [Google Scholar]
  • (18).Groh KJ; Carvalho RN; Chipman JK; Denslow ND; Halder M; Murphy CA; Roelofs D; Rolaki A; Schirmer K; Watanabe KH, Development and application of the adverse outcome pathway framework for understanding and predicting chronic toxicity: I. Challenges and research needs in ecotoxicology. Chemosphere 2015, 120, 764–777. [DOI] [PubMed] [Google Scholar]
  • (19).Ankley GT; Bennett RS; Erickson RJ; Hoff DJ; Hornung MW; Johnson RD; Mount DR; Nichols JW; Russom CL; Schmieder PK; Serrrano JA; Tietge JE; Villeneuve DL, Adverse outcome pathways: A conceptual framework to support ecotoxicology research and risk assessment. Environ. Toxicol. Chem. 2010, 29 (3), 730–741. [DOI] [PubMed] [Google Scholar]
  • (20).Ankley GT; Edwards SW, The adverse outcome pathway: A multifaceted framework supporting 21st century toxicology. Current Opinion in Toxicology 2018, 9, 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (21).Carusi A; Davies MR; De Grandis G; Escher BI; Hodges G; Leung KMY; Whelan M; Willett C; Ankley GT, Harvesting the promise of AOPs: An assessment and recommendations. Sci. Total Environ. 2018, 628–629, 1542–1556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (22).Harrill AH; Ross PK; Gatti DM; Threadgill DW; Rusyn I, Population-based discovery of toxicogenomics biomarkers for hepatotoxicity using a laboratory strain diversity panel. Toxicol. Sci. 2009, 110 (1), 235–243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (23).Stahl SH; Yates JW; Nicholls AW; Kenna JG; Coen M; Ortega F; Nicholson JK; Wilson ID, Systems toxicology: modelling biomarkers of glutathione homeostasis and paracetamol metabolism. Drug Discovery Today: Technologies 2015, 15, 9–14. [DOI] [PubMed] [Google Scholar]
  • (24).Angrish MM; Pleil JD; Stiegel MA; Madden MC; Moser VC; Herr DW, Taxonomic applicability of inflammatory cytokines in adverse outcome pathway (AOP) development. J. Toxicol. Environ. Health, A 2016, 79 (4), 184–196. [DOI] [PubMed] [Google Scholar]
  • (25).Garcia-Reyero N. l.; Poynton HC; Kennedy AJ; Guan X; Escalon BL; Chang B; Varshavsky J; Loguinov AV; Vulpe CD; Perkins EJ, Biomarker discovery and transcriptomic responses in Daphnia magna exposed to munitions constituents. Environ. Sci. Technol. 2009, 43 (11), 4188–4193. [DOI] [PubMed] [Google Scholar]
  • (26).Strimbu K; Tavel JA, What are biomarkers? Current Opinion in HIV and AIDS 2010, 5 (6), 463–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (27).Christin C; Hoefsloot HC; Smilde AK; Hoekman B; Suits F; Bischoff R; Horvatovich P, A critical assessment of feature selection methods for biomarker discovery in clinical proteomics. Molecular & Cellular Proteomics 2013, 12 (1), 263–276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (28).Ma S; Huang J, Regularized ROC method for disease classification and biomarker selection with microarray data. Bioinformatics 2005, 21 (24), 4356–4362. [DOI] [PubMed] [Google Scholar]
  • (29).Abeel T; Helleputte T; Van de Peer Y; Dupont P; Saeys Y, Robust biomarker identification for cancer diagnosis with ensemble feature selection methods. Bioinformatics 2010, 26 (3), 392–398. [DOI] [PubMed] [Google Scholar]
  • (30).Wei X; Ai J; Deng Y; Guan X; Johnson DR; Ang CY; Zhang C; Perkins EJ, Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles. BMC Genomics 2014, 15 (1), 248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (31).Ding C; Peng H, Minimum redundancy feature selection from microarray gene expression data. J. Bioinf. Comput. Biol. 2005, 03 (02), 185–205. [DOI] [PubMed] [Google Scholar]
  • (32).Xiong M; Fang X; Zhao J, Biomarker identification by feature wrappers. Genome Res. 2001, 11 (11), 1878–1887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (33).Kohavi R; John GH, Wrappers for feature subset selection. Artif. Intell. 1997, 97 (12), 273–324. [Google Scholar]
  • (34).Radovic M; Ghalwash M; Filipovic N; Obradovic Z, Minimum redundancy maximum relevance feature selection approach for temporal gene expression data. BMC Bioinformatics 2017, 18 (1), 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (35).Suto J; Oniga S; Sitar PP Comparison of wrapper and filter feature selection algorithms on human activity recognition, 2016 6th International Conference on Computers Communications and Control (ICCCC), 10–14 May 2016; 2016; pp 124–129. [Google Scholar]
  • (36).Bolón-Canedo V; Sánchez-Maroño N; Alonso-Betanzos A; Benítez JM; Herrera F, A review of microarray datasets and applied feature selection methods. Information Sciences 2014, 282 (Supplement C), 111–135. [Google Scholar]
  • (37).Statnikov A; Aliferis CF; Tsamardinos I; Hardin D; Levy S, A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 2005, 21. [DOI] [PubMed] [Google Scholar]
  • (38).Ellinger-Ziegelbauer H; Aubrecht J; Kleinjans JC; Ahr H-J, Application of toxicogenomics to study mechanisms of genotoxicity and carcinogenicity. Toxicol. Lett. 2009, 186 (1), 36–44. [DOI] [PubMed] [Google Scholar]
  • (39).Walmsley R; Billinton N; Heyer W, Green fluorescent protein as a reporter for the DNA damage-induced gene RAD54 in Saccharomyces cerevisiae. Yeast 1997, 13 (16), 1535–1545. [DOI] [PubMed] [Google Scholar]
  • (40).Huang S-H; Tung C-W, Identification of consensus biomarkers for predicting non-genotoxic hepatocarcinogens. Scientific Reports 2017, 7, 41176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (41).Brockmeier EK; Hodges G; Hutchinson TH; Butler E; Hecker M; Tollefsen KE; Garcia-Reyero N; Kille P; Becker D; Chipman K; Colbourne J; Collette TW; Cossins A; Cronin M; Graystock P; Gutsell S; Knapen D; Katsiadaki I; Lange A; Marshall S; Owen SF; Perkins EJ; Plaistow S; Schroeder A; Taylor D; Viant M; Ankley G; Falciani F, The role of omics in the application of adverse outcome pathways for chemical risk assessment. Toxicol. Sci. 2017, 158 (2), 252–262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (42).Milanowska K; Krwawicz J; Papaj G; Kosiński J; Poleszak K; Lesiak J; Osińska E; Rother K; Bujnicki JM, REPAIRtoire—a database of DNA repair pathways. Nucleic Acids Res. 2010, 39 (suppl_1), D788–D792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (43).Hohmann S; Mager WH, Yeast stress responses. Springer Science & Business Media: 2007; Vol. 1. [Google Scholar]
  • (44).Simmons SO; Fan C-Y; Ramabhadran R, Cellular stress response pathway system as a sentinel ensemble in toxicological screening. Toxicol. Sci. 2009, 111 (2), 202–225. [DOI] [PubMed] [Google Scholar]
  • (45).O’Connor S; Lan J; North M; Loguinov A; Zhang L; Smith M; Gu A; Vulpe C, Genome-wide functional and stress response profiling reveals toxic mechanism and genes required for tolerance to benzo[a]pyrene in S. cerevisiae. Frontiers in Genetics 2013, 3 (316). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (46).Bucher JR, The National Toxicology Program rodent bioassay: Designs, interpretations, and scientific contributions. Ann. N.Y. Acad. Sci. 2002, 982 (1), 198–207. [DOI] [PubMed] [Google Scholar]
  • (47).Gold LS; Slone T; Manley N; Garfinkel G; Ames B, The carcinogenic potency database (CPDB). University of Berkeley: 2000. [Google Scholar]
  • (48).Richardson SD; Plewa MJ; Wagner ED; Schoeny R; DeMarini DM, Occurrence, genotoxicity, and carcinogenicity of regulated and emerging disinfection by-products in drinking water: A review and roadmap for research. Mutation Research/Reviews in Mutation Research 2007, 636 (1–3), 178–242. [DOI] [PubMed] [Google Scholar]
  • (49).Lan J; Gou N; Gao C; He M; Gu AZ, Comparative and mechanistic genotoxicity assessment of nanomaterials via a quantitative toxicogenomics approach across multiple species. Environ. Sci. Technol. 2014, 48 (21), 12937–12945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (50).Huh W-K; Falvo JV; Gerke LC; Carroll AS; Howson RW; Weissman JS; O’Shea EK, Global analysis of protein localization in budding yeast. Nature 2003, 425, 686. [DOI] [PubMed] [Google Scholar]
  • (51).Lan J; Rahman SM; Gou N; Jiang T; Plewa MJ; Alshawabkeh A; Gu AZ, Genotoxicity Assessment of Drinking Water Disinfection Byproducts by DNA Damage and Repair Pathway Profiling Analysis. Environ. Sci. Technol. 2018, 52 (11), 6565–6575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (52).Peng H; Long F; Ding C, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence 2005, 27 (8), 1226–1238. [DOI] [PubMed] [Google Scholar]
  • (53).Burbidge R; Buxton B, An introduction to support vector machines for data mining. Keynote papers, young OR12 2001, 3–15. [Google Scholar]
  • (54).Zanaty EA, Support Vector Machines (SVMs) versus Multilayer Perception (MLP) in data classification. Egyptian Informatics Journal 2012, 13 (3), 177–183. [Google Scholar]
  • (55).James G; Witten D; Hastie T; Tibshirani R, An introduction to statistical learning. 1st ed.; Springer: 2013. [Google Scholar]
  • (56).Kuhn M; Johnson K, Applied predictive modeling. 1st Edition ed.; Springer: 2013. [Google Scholar]
  • (57).Kohavi R A study of cross-validation and bootstrap for accuracy estimation and model selection, Ijcai, Stanford, CA: 1995; pp 1137–1145. [Google Scholar]
  • (58).Ojala M; Garriga GC, Permutation tests for studying classifier performance. Journal of Machine Learning Research 2010, 11 (6). [Google Scholar]
  • (59).Golland P; Liang F; Mukherjee S; Panchenko D Permutation tests for classification, International conference on computational learning theory, Springer: 2005; pp 501–515. [Google Scholar]
  • (60).de Andrade BM; de Gois JS; Xavier VL; Luna AS, Comparison of the performance of multiclass classifiers in chemical data: Addressing the problem of overfitting with the permutation test. Chemometrics Intellig. Lab. Syst. 2020, 201, 104013. [Google Scholar]
  • (61).Hanley JA; McNeil BJ, The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982, 143 (1), 29–36. [DOI] [PubMed] [Google Scholar]
  • (62).Flach PA; Hernández-Orallo J; Ramirez CF A coherent interpretation of AUC as a measure of aggregated classification performance, ICML, 2011; pp 657–664. [Google Scholar]
  • (63).Hsu M-J; Chang Y-CI; Hsueh H-M, Biomarker selection for medical diagnosis using the partial area under the ROC curve. BMC Research Notes 2014, 7 (1), 25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (64).Metz CE, Basic principles of ROC analysis. Semin. Nucl. Med. 1978, 8 (4), 283–298. [DOI] [PubMed] [Google Scholar]
  • (65).Altman DG; Bland JM, Statistics notes: Diagnostic tests 1: sensitivity and specificity. BMJ 1994, 308 (6943), 1552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (66).Robinson MD; Grigull J; Mohammad N; Hughes TR, FunSpec: A web-based cluster interpreter for yeast. BMC Bioinformatics 2002, 3 (1), 35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (67).Ashburner M; Ball C; Blake J; Botstein D; Butler H; Cherry J; Davis A; Dolinski K; Dwight S; Eppig J; Harris M; Hill D; Issel-Tarver L; Kasarskis A; Lewis S; Matese J; Richardson J; Ringwald M; Rubin G; Sherlock G, Gene Ontology: Tool for the unification of biology. Nat Genet 2000, 25, 25 – 29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (68).Fawcett T, An introduction to ROC analysis. Pattern Recog. Lett. 2006, 27 (8), 861–874. [Google Scholar]
  • (69).Tubbs A; Nussenzweig A, Endogenous DNA damage as a source of genomic instability in cancer. Cell 2017, 168 (4), 644–656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (70).Srivastava M; Raghavan, Sathees C., DNA double-strand break repair inhibitors as cancer therapeutics. Chem. Biol. 2015, 22 (1), 17–29. [DOI] [PubMed] [Google Scholar]
  • (71).Ames BN; Durston WE; Yamasaki E; Lee FD, Carcinogens are mutagens: A simple test system combining liver homogenates for activation and bacteria for detection. Proceedings of the National Academy of Sciences 1973, 70 (8), 2281–2285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (72).Zeinvand-Lorestani H; Sabzevari O; Setayesh N; Amini M; Nili-Ahmadabadi A; Faramarzi MA, Comparative study of in vitro prooxidative properties and genotoxicity induced by aflatoxin B1 and its laccase-mediated detoxification products. Chemosphere 2015, 135, 1–6. [DOI] [PubMed] [Google Scholar]
  • (73).Neale PA; Feliers C; Glauch L; König M; Lecarpentier C; Schlichting R; Thibert S; Escher BI, Application of in vitro bioassays for water quality monitoring in three drinking water treatment plants using different treatment processes including biological treatment, nanofiltration and ozonation coupled with disinfection. Environmental Science: Water Research & Technology 2020. [Google Scholar]
  • (74).Neale PA; Escher BI, In vitro bioassays to assess drinking water quality. Current Opinion in Environmental Science & Health 2019, 7, 1–7. [Google Scholar]
  • (75).Ankley G; Escher BI; Hartung T; Shah I, Pathway-Based Approaches for Environmental Monitoring and Risk Assessment. Environ. Sci. Technol. 2016, 50 (19), 10295–10296. [DOI] [PubMed] [Google Scholar]
  • (76).Altenburger R; Brack W; Burgess RM; Busch W; Escher BI; Focks A; Mark Hewitt L; Jacobsen BN; de Alda ML; Ait-Aissa S; Backhaus T; Ginebreda A; Hilscherová K; Hollender J; Hollert H; Neale PA; Schulze T; Schymanski EL; Teodorovic I; Tindall AJ; de Aragão Umbuzeiro G; Vrana B; Zonja B; Krauss M, Future water quality monitoring: improving the balance between exposure and toxicity assessments of real-world pollutant mixtures. Environmental Sciences Europe 2019, 31 (1), 12. [Google Scholar]
  • (77).Müller ME; Escher BI; Schwientek M; Werneburg M; Zarfl C; Zwiener C, Combining in vitro reporter gene bioassays with chemical analysis to assess changes in the water quality along the Ammer River, Southwestern Germany. Environmental Sciences Europe 2018, 30 (1), 20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (78).Rummel CD; Escher BI; Sandblom O; Plassmann MM; Arp HPH; MacLeod M; Jahnke A, Effects of leachates from UV-weathered microplastic in cell-based bioassays. Environ. Sci. Technol. 2019, 53 (15), 9214–9223. [DOI] [PubMed] [Google Scholar]
  • (79).Villeneuve DL; Coady K; Escher BI; Mihaich E; Murphy CA; Schlekat T; Garcia‐Reyero N, High‐throughput screening and environmental risk assessment: State of the science and emerging applications. Environ. Toxicol. Chem. 2019, 38 (1), 12–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (80).Jungnickel H; Potratz S; Baumann S; Tarnow P; von Bergen M; Luch A, Identification of Lipidomic Biomarkers for Coexposure to Subtoxic Doses of Benzo[a]pyrene and Cadmium: The Toxicological Cascade Biomarker Approach. Environ. Sci. Technol. 2014, 48 (17), 10423–10431. [DOI] [PubMed] [Google Scholar]
  • (81).Stalter D; O’Malley E; von Gunten U; Escher BI, Fingerprinting the reactive toxicity pathways of 50 drinking water disinfection by-products. Water Res. 2016, 91, 19–30. [DOI] [PubMed] [Google Scholar]
  • (82).Saeys Y; Abeel T; Van de Peer Y, Robust feature selection using ensemble feature selection techniques. In Machine learning and knowledge discovery in databases, Springer: 2008; pp 313–325. [Google Scholar]
  • (83).Fortino V; Wisgrill L; Werner P; Suomela S; Linder N; Jalonen E; Suomalainen A; Marwah V; Kero M; Pesonen M; Lundin J; Lauerma A; Aalto-Korte K; Greco D; Alenius H; Fyhrquist N, Machine-learning–driven biomarker discovery for the discrimination between allergic and irritant contact dermatitis. Proceedings of the National Academy of Sciences 2020, 117 (52), 33474–33485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • (84).Cervantes J; Garcia-Lamont F; Rodríguez-Mazahua L; Lopez A, A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing 2020, 408, 189–215. [Google Scholar]
  • (85).Moraes D; Wainer J; Rocha A, Low false positive learning with support vector machines. Journal of Visual Communication and Image Representation 2016, 38, 340–350. [Google Scholar]
  • (86).Davenport MA; Baraniuk RG; Scott CD Controlling False Alarms With Support Vector Machines, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 14–19 May 2006; 2006; pp V-V. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary

RESOURCES