The study presented in Bercu et al. (2021) was designed to understand whether (Quantitative) Structure-Activity Relationship ((Q)SAR) models are fit-for-purpose to use as part of classification and labelling. To test this hypothesis, proprietary and marketed data on rat oral acute toxicity from 10 organizations representing chemicals typically assessed was compiled and run through (Q)SAR models developed by Leadscope (an Instem company). The experimental and prediction data from all collaborators was then combined and an assessment of whether these models are fit-for-purpose was made based on their performance. In addition, an expert review was performed and documented on a subset of the chemicals. Based on this information, a decision tree was presented that showed how these models could be used to support classification and labelling decisions. A reassessment of the results from one of the collaborators was recently performed because: (a) some compounds were identified as belonging to the (Q)SAR training sets; (b) some compounds were found to be duplicates following computation of InChIs; (c) integration with a more highly curated set of experimental data led to the experimental labels being updated for some compounds. The revised results were then combined with other collaborators’ results. The revised results do not change the conclusions or recommendations of the paper based on both the overall and subset specific balanced statistics for the consensus predictions. For example, the original abstract stated that approximately 95% of chemicals were either correctly predicted or predicted in a more conservative GHS category, after removing a small fraction of inconclusive - meaning indeterminate or out of domain - predictions. In the original analysis this value was 94.82% whereas in the revised analysis the figure is 94.84%. Similarly, in the original analysis, the average percentage of these compounds, across all well-defined experimental categories, which were assigned to a correct or more conservative category was around 80%. Excluding the two GHS category 1 compounds, since two compounds are too few to obtain robust statistics, the average percentage of these compounds which are assigned to a correct or more conservative category is 78%. The following figures and tables have been updated to reflect these changes: Fig. 4, Tables 2–6, supplemental materials Tables s1–s22. These figures and tables are included in the supplemental material.
Supplementary Material
Footnotes
Appendix A. Supplementary data
Supplementary data to this article can be found online at https://doi.org/10.1016/j.yrtph.2022.105165.
Reference
- Bercu J, Masuda-Herrera MJ, Trejo-Martin A, Hasselgren C, Lord J, Graham J, Schmitz M, Milchak L, Owens C, Hari Lal S, Marchese Robinson R, Whalley S, Bellion P, Vuorinen A, Gromek K, Hawkins WA, van de Gevel I, Vriens K, Kemper R, Naven R, Ferrer P, Myatt GJ, 2021. A cross-industry collaboration to assess if acute oral toxicity (Q)SAR models are fit-for-purpose for GHS classification and labelling. Regul. Toxicol. Pharmacol 120, 104843 10.1016/j.yrtph.2020.104843. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.