Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2019 Apr 9;127(4):047002. doi: 10.1289/EHP3986

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

EHP is an open-access journal published with support from the National Institute of Environmental Health Sciences, National Institutes of Health. All content is public domain unless otherwise noted.

PMC Copyright notice

Figure 2A is a predictive model of carcinogenicity which consists of the following: a tabular representation; a box and whisker plot showing A U C scores (y-axis) across four T A S subsets (x-axis); and a line graph plotting average TPR (y-axis) across FPRs (x-axis) for the four T A S subsets. Figure 2B is a predictive model of genotoxicity which consists of the following: a tabular representation; a box and whisker plot showing A U C scores (y-axis) across four T A S subsets (x-axis); and a line graph plotting average TPR (y-axis) across FPRs (x-axis) for the four T A S subsets. — Performance of classifiers in predictive models of (A) carcinogenicity, and (B) genotoxicity. From left to right: a) Summary statistics tables of area under the ROC curve (AUC) for each transcriptional activity score (TAS) subsets; data represented are the median, mean, and SE (standard error) of the AUC scores; and b) box plots of AUC across resamples ( $n = 25$ ) for each TAS subset with the lower, middle, and upper hinges corresponding to the 25th, 50th (median), and 75th percentiles, respectively, the upper and lower whiskers extending to the smaller and largest value at most 1.5 × IQR (interquartile range) from the hinge, and data points beyond the whiskers represented as dots. Dotted line at 0.5 represents the expected AUC of a random classifier. Labels in each TAS group (“ $n =$ ”) represent the number of unique chemicals in the model training and validation step. c) Receiver operating characteristic (ROC) curves [false positive rate (FPR) vs. average true positive rate (TPR)]. Thick lines represent vertical averaging of ROC curves across resamples in each TAS group shown with bars denoting the standard errors. Thin, semitransparent lines represent ROC curves of individual resamples in each TAS group.