Skip to main content
. 2019 Apr 9;127(4):047002. doi: 10.1289/EHP3986

Figure 2.

Figure 2A is a predictive model of carcinogenicity which consists of the following: a tabular representation; a box and whisker plot showing A U C scores (y-axis) across four T A S subsets (x-axis); and a line graph plotting average TPR (y-axis) across FPRs (x-axis) for the four T A S subsets. Figure 2B is a predictive model of genotoxicity which consists of the following: a tabular representation; a box and whisker plot showing A U C scores (y-axis) across four T A S subsets (x-axis); and a line graph plotting average TPR (y-axis) across FPRs (x-axis) for the four T A S subsets.

Performance of classifiers in predictive models of (A) carcinogenicity, and (B) genotoxicity. From left to right: a) Summary statistics tables of area under the ROC curve (AUC) for each transcriptional activity score (TAS) subsets; data represented are the median, mean, and SE (standard error) of the AUC scores; and b) box plots of AUC across resamples (n=25) for each TAS subset with the lower, middle, and upper hinges corresponding to the 25th, 50th (median), and 75th percentiles, respectively, the upper and lower whiskers extending to the smaller and largest value at most 1.5 × IQR (interquartile range) from the hinge, and data points beyond the whiskers represented as dots. Dotted line at 0.5 represents the expected AUC of a random classifier. Labels in each TAS group (“n=”) represent the number of unique chemicals in the model training and validation step. c) Receiver operating characteristic (ROC) curves [false positive rate (FPR) vs. average true positive rate (TPR)]. Thick lines represent vertical averaging of ROC curves across resamples in each TAS group shown with bars denoting the standard errors. Thin, semitransparent lines represent ROC curves of individual resamples in each TAS group.