Abstract
An essential aspect of the EU’s Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) regulation is the European Chemicals Agency’s (ECHA) evaluation of testing proposals submitted by registrants to address data gaps. Registrants may propose adaptations, such as read-across, to waive standard testing; however, it is widely believed that ECHA often finds justifications for read-across hypotheses inadequate. From 2008 to August 2023, 2,630 testing proposals were submitted to ECHA; of these, 1,538 had published decisions that were systematically evaluated in this study. Each document was manually reviewed and information extracted for further analyses, focusing on 17 assessment elements (AEs) from the Read-Across Assessment Framework (RAAF) and testing proposal evaluations (TPE). Each submission was classified as to the AEs relied upon by the registrants and by ECHA. Data was analyzed for patterns and associations. Adaptations were included in 23% (350) of proposals, with analogue (168) and group (136) read-across being most common. Of the 304 read-across hypotheses, 49% were accepted, with group read-across showing significantly higher odds of acceptance. Data analysis examined factors such as tonnage band (Annex), test guidelines, hypothesis AEs, and structural similarities of target and source substances. While decisions were often context-specific, several significant associations influencing acceptance emerged. Overall, this analysis provides a comprehensive overview of 15 years of experience with testing proposal-specific read-across adaptations by both registrants and ECHA. These data will inform future submissions as they identify most critical AEs to increase the odds of read-across acceptance.
Plain language summary
The European Union’s law requires companies to provide safety data on chemicals they produce or import. To avoid unnecessary testing in animals, companies can propose alternatives like “read-across”, which uses data from similar substances. However, the European Chemicals Agency often rejects these proposals if the reasoning is not convincing. 1,538 decisions on testing proposals submitted between 2008 and 2023 were reviewed for this study. We analyzed the factors that influenced the agency’s decisions, focusing on 17 criteria. About 23% of proposals included read-across adaptations, with nearly half of these (49%) found acceptable. Group read-across proposals were more likely to succeed than analogue approaches. The study identified patterns and key factors, such as the similarity between substances and the type of tests proposed, that affected approval rates. This research offers valuable insights to help companies improve their proposals and increase the chances of approval for alternatives to animal testing.
1. Introduction
The European Union’s (EU) Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) regulation (EC, 2006, Article 40(2)) aims to ensure that all chemical substances manufactured in and/or imported into the European Economic Area in the annual amount of greater than one ton are safe for human health and the environment. The manufacturers and importers (called “registrants”) have a responsibility to manage risks of chemicals. They are required to submit a dossier containing specific data based on the annual amount of the substance as specified in Annexes VII to X of REACH. Registration dossiers are evaluated by the European Chemicals Agency (ECHA) (Fig. 1). A data gap analysis is a necessary step in dossier preparation; if the registrants deem their dossier to be without data gaps, it is submitted to ECHA. ECHA may examine any registration dossier at any time to verify if the information submitted by registrants is compliant with the requirements of REACH. The legal requirement for ECHA’s compliance check was increased in 2019 from 5% to 20% of the dossiers; in fact, between 2009 and 2023 ECHA performed compliance checks of approximately 15,000 registrations, representing 21% of full registrations (ECHA, 2024). Dossier selection for compliance check is either random (comprehensive) or concern-based (targeted). No public record is created upon a compliance check if a dossier is found to be satisfactory. A compliance check (CCH) decision document is published if deficiencies were identified.
Fig. 1: A simplified illustration of a workflow of substance registration under REACH.

The two sides of the diagram represent the difference between “complete” dossiers that contain all information as required for a substance based on its annual tonnage (left side) and cases where gap analysis by the registrant identified missing information that results in the submission of a testing proposal to ECHA (right side). The compliance checks (CCH) and testing proposal evaluations (TPE) result in separate decisions by ECHA; in this study, only published TPE decisions were reviewed.
If the registrant identifies data gaps during preparation of the registration dossier, they have several options: (i) for information requirements in Annexes VII and VIII, a new study may be performed to fill data gaps; (ii) an adaptation of a standard required test(s) could be included with a justification in the dossier based on Column 2 of Annexes VII-X, or Annex XI provisions (EC, 2006); or (iii) a testing proposal must be submitted to ECHA to request permission to conduct a new study to address information requirements in Annex IX and X (ECHA, 2011). Testing proposals are a critical element of REACH insofar they are a mechanism to eliminate duplicative or otherwise unnecessary studies to fulfill data requirements, including those requiring testing in animals and other model organisms. They also provide ECHA the opportunity to comment on the proposed studies and to suggest refinements, so the information obtained from any new testing is most informative for hazard and risk characterization.
REACH provides for a range of possible adaptations to the standard information requirements that can be used either in registration dossiers or in testing proposals; these adaptations are detailed in Annex XI of REACH. Among potential adaptations, read-across is one of the major methods used to fulfill information requirements on any endpoint in REACH (ECHA, 2023a). The Organisation for Economic Co-operation and Development (OECD) and ECHA published guidance on read-across (ECHA, 2008, 2015, 2017; OECD, 2017), and there are many authoritative commentaries from diverse stakeholders on ways to improve read-across in regulatory submissions (Ball et al., 2016; Pestana et al., 2022; Patlewicz et al., 2014; Blackburn and Stuard, 2014; Rovida et al., 2020).
Few studies systematically evaluated regulatory decisions and the reasoning that regulators used to accept or reject read-across hypotheses. A review of published decisions by ECHA that were available as of July 2015 was a product of a multi-stakeholder collaboration (Ball et al., 2016). Both CCH (524 documents) and testing proposal evaluations (TPE, 388 documents) were examined with regard to the relative successes and pitfalls of different scientific arguments used in proposed read-across hypotheses. The timing of that publication coincided with the publication of the first edition of ECHA’s Read-Across Assessment Framework (RAAF) (ECHA, 2015); additional guidance (ECHA, 2017, 2020, 2022) and recommendations1 have been provided by ECHA. The study by Ball et al. (2016) summarized the state of the art in read-across based on their analysis of the data available then, and highlighted the areas where improvements were needed in refining justifications and informing registrants about best practices and successful cases. It was also acknowledged that RAAF guidance was likely to have a major impact on read-across adaptations and that additional systematic analyses would be needed.
Another example is a recent systematic analysis of read-across in REACH registration dossiers; data was extracted for target-source analogue pairs for mono-constituent substances from the information in IUCLID (International Uniform Chemical Information Database) (Patlewicz et al., 2024). The authors looked only at the data provided in the submission dossier and did not consider if ECHA had evaluated these through the process of TPE or CCH. They also focused on the entries where read-across was used to satisfy information requirements for repeated dose toxicity or developmental toxicity studies – standard test requirements for high-tonnage substances that require the use of a large number of animals (Taylor et al., 2014b; Rovida et al., 2023). The analyses were restricted to substances with defined organic structures; the final dataset comprised 270 target substances and 259 source substances. The study focused on examining physicochemical, structural, and metabolic similarity between source and target substances, as well as on the analysis of dose-response data from the animal studies – comparing the data from IUCLID to predictions using the United States Environmental Protection Agency (US EPA) generalized read-across (GenRA) approach (Helman et al., 2019). This study concluded that identification of suitable analogues for read-across is not only a challenge with respect to defining a similarity cutoff but also with respect to finding the substances that already have data that satisfy standard test requirements. Collectively, the study found that low structural similarity was a common occurrence in REACH submissions based on read-across. They also concluded that GenRA provides more conservative estimates for dose-response analysis of chemical effects.
Overall, discussions on the best practices to perform and evaluate read-across are critical to increase the familiarity with this approach and for strengthening the justifications for read-across adaptations in regulatory submissions. Concomitantly, it is important to determine what do successful examples of accepted read-across adaptations look like and are there common themes in accepted and rejected submissions to ECHA. An extensive database of both CCH and TPE decisions now exists because nearly 15 years have elapsed since the first review of submissions under the REACH regulation. The current study focused on TPE decisions as they present both provisional regulatory acceptance and rejection prior to generating data. From 2008 to August 11, 2023 (the cutoff date for this analysis), a total of 2,630 testing proposals had been submitted to ECHA; 1,538 of these had published TPE decisions, and they were evaluated herein. Each document was manually reviewed, and information was extracted in a systematic manner. We defined 17 assessment elements (AEs) based on RAAF guidance and both the robust summaries of the testing proposals and the TPE decisions. Data was analyzed for patterns, including Annex (i.e., tonnage band), test guideline study, read-across hypothesis AEs, and the structural similarity of target and source substances.
2. Methods
Review of ECHA TPE decisions and data extraction
ECHA publishes its decisions on the testing proposals on its website2. On August 11, 2023, we exported information on the identities of the substances listed (i.e., the target substance), stage of their evaluation process, and the date the decision (if any) was issued. Among all downloaded records, 2,630 records were identified as TPEs, of which 1,538 included a link to a publicly available decision. This information is listed as part of Table S13.
All TPEs with available decisions (n = 1,538; Fig. 2) were downloaded from the abovementioned website as PDF (portable document format) files and manually evaluated by one reviewer, with secondary review by additional individuals when necessary. It should be noted that while PDF files are machine-readable, about half of the files we examined were scanned images of varying quality. Attempts to perform optical character recognition processing of these files resulted in poor text quality that would make machine reading difficult to impossible. In addition, the terminology used by ECHA, as well as document formats, have changed over time. Therefore, attempts to automatically process these documents for information retrieval were vacated in favor of manual examination of each document.
Fig. 2: A diagram illustrating categorization of the publicly available TPE (testing proposal evaluation) documents with and without adaptations, as well as the acceptance and rejection rates of testing proposals with read-across (RA) adaptations.

Numbers indicate the number of submissions with a published TPE as substances may have multiple submitted testing proposals throughout the registration period. Asterisks (*) indicate categories where individual testing proposal submissions may have contained several read-across hypotheses.
Information extracted from each PDF at this stage included the Annex, the proposed tests to fulfill the data gap, and ECHA’s decision on the proposal. This step included identifying documents in which adaptations (e.g., read-across) were proposed; ECHA’s decision(s) on the proposed adaptation was also recorded. Of the 1,538 TPEs examined, 309 decision documents were identified as containing read-across adaptations. However, when evaluating these TPEs, five documents were found to include more than one read-across decision. In two cases, the same read-across hypothesis was proposed for two different required tests. In another two cases, a large category was proposed where the initial source substance was rejected in favor of another source substance in the same category. In the other case, both group and analogue read-across were proposed for the same required test with separate decisions. In addition, 10 testing proposals containing read-across were declared redundant by ECHA without evaluation of the read-across because the tests were not required; these were also not evaluated. Therefore, in total, 304 read-across hypotheses were included in the final analysis.
Next, the documents that included read-across adaptations were evaluated in greater detail, and additional information was extracted. First, we recorded the type of read-across, analogue or group/category, as well as the EC number for each source substance(s). The EC numbers for the target and source substances were used to search the ECHA database to identify the type of each substance from the registration dossier. ECHA classifies substances into three primary types – mono-constituent, multi-constituent, or substances of unknown or variable composition, complex reaction products, or biological (UVCB). There were instances where the substance type could not be ascertained from the TPE because it was redacted, or the registration dossier was no longer publicly accessible during the time of analysis. We also recorded separately what OECD test(s) were proposed by the registrants to be adapted through read-across and ECHA’s decision on each adaptation. Second, to enable evaluation of the read-across hypotheses in each TPE, we defined 17 AEs (Tab. 1) based on (i) information in ECHA’s RAAF guidance documents (ECHA, 2015, 2017) and (ii) additional considerations presented in TPEs that were not part of RAAF. Thus, it should be noted that the proposed AEs were defined by the authors and are not ECHA-specified. The AEs were grouped into three categories: those based on toxicodynamic considerations, on toxicokinetic considerations, and other assessment considerations. Additional details on each AE, including corresponding explanations from the RAAF and/or TPE decisions (where applicable) can be found in Table S24. For each TPE, AEs were recorded separately as those proposed by the registrant as part of their read-across hypothesis and those used by ECHA in justifying their decision. Third, for each read-across decision, we noted whether submission was based on analogue or group/category and whether the proposed standard test requirement adaptation was found by ECHA to be satisfactory (termed “accepted”) or not (termed “rejected”).
Tab. 1:
Assessment elements (AE) used in testing proposal submissions and decisions
| AEs based on toxicodynamic and other considerations | AEs based on toxicokinetic considerations | ||
|---|---|---|---|
| AE 1 | Structure/physicochemical properties | AE 9 | Formation of common (identical) compounds |
| AE 2 | Common biological targets with common metabolites | AE 10 | Exposure of biological targets to common compounds |
| AE 3 | Common biological targets without common metabolites (qualitative) | AE 11 | Extent of the bioavailability of the parent compound |
| AE 4 | Common biological targets without common metabolites (quantitative) | AE 12 | Formation and impact of non-common compounds/exposure to other compounds than those linked to the prediction |
| AE 5 | Environmental degradation to non-common compounds | AE 13 | Metabolites of source and target have been identified |
| AE 6 | Environmental bioaccumulation of potential non-common compounds | AE 14 | Potential presence of uncharacterized metabolites or impurities |
| AE 7 | Common environmental degradation pathways | Other AEs | |
| AE 8 | Toxicodynamic similarity based on the data from a bridging (OECD TG) study | AE 15 | Characterization of constituents (for multi-constituent or UVCB substances) |
| AE 16 | Toxicodynamic similarity based on the data from a bridging (not a TG) study | ||
| AE 17 | Lack of observed adverse effects | ||
Statistical data analysis
Statistical analyses were performed using R 4.40 and GraphPad Prism (v. 10.2.2, GraphPad Software, La Jolla, CA, USA). Using predictors such as OECD test and the proposed AEs in testing proposals and decisions, we constructed 2×2 tables (acceptance/rejection of a testing proposal vs other binary predictors). These tables were analyzed using Fisher’s exact test (fisher.test) for p-values, odds ratios, and associated confidence intervals. As these predictors were considered to be of independent interest, nominal confidence intervals and p-values are reported. Multiple logistic regression using glm()function in R was performed using submission year, decision year, group vs. analogue-based read-across, and each AE. For several AEs, the small number of associated testing proposals could produce unstable estimates resulting in infinite standard errors, and these AEs were removed in a single iterative step. Reported p-values for the multiple logistic regression were corrected using the Benjamini-Hochberg q-value via p.adjust and q < 0.05 was considered significant.
Analysis of chemical similarity
For testing proposals with analogue read-across for mono-constituent target and source substances, the EC number of each substance was used to obtain simplified molecular input line entry system (SMILES) identifiers for subsequent analysis of structural similarity. SMILES for each mono-constituent substance was identified based on the CAS number included in a decision document and using the CompTox dashboard batch search (Lowe and Williams, 2021). Then, SMILES strings were used to generate Morgan fingerprint information using the get.fingerprint command from the rcdk R package (Guha, 2007). The fingerprints were of the “standard” type, with a binary format and a length of 1024 bits. A t-test was then performed to compare the Jaccard similarity scores between accepted and rejected cases. To account for the possibility that acceptance rates might differ for scores greater than a threshold, a rigorous cutpoint analysis was performed to find an “optimal” threshold, while accounting for the implicit multiple testing in such an analysis. Here, permutation-based significance testing was employed as follows. First, Jaccard similarity scores were screened to identify an optimal cutpoint. Each possible cutpoint (defined as an observed Jaccard similarity score in the dataset) was evaluated, and the cutpoint that returned the smallest nominal p-value using Fisher’s exact test (acceptance/rejection vs. Jaccard similarity < or ≥ cutpoint) was identified as optimal. Second, a null population was constructed by permuting the accepted and rejected labels 100,000 times, and for each permutation these labels were combined with the observed Jaccard similarity scores. Finally, an empirical p-value was determined by computing the proportion of permuted minimum nominal p-values that were ≤ the minimum nominal p-value from the first step, and the final odds ratio using the associated optimal cutpoint.
3. Results
We first examined all 1,538 publicly accessible TPE decisions by ECHA as of August 11, 2023 with respect to whether any adaptations were proposed (Fig. 2). We found that 77% of TPEs (1,188) did not include adaptations, i.e., the registrants acknowledged that they needed to perform additional tests to address the data gap(s) in standard information requirements for their substance(s). The remaining 350 testing proposals (23% of total) contained one or more adaptations, and most of these (89%) contained some form of read-across reasoning for one or more tests. Ten of these testing proposals contained read-across adaptations but were not examined herein because ECHA rejected these submissions as redundant (e.g., required data was already available in another registration or endpoint proposed was not necessary based on the tonnage band) without weighing in on the registrant-proposed read-across arguments. Among the remaining testing proposals, 40 (11%) contained adaptations other than read-across, such as QSAR or weight of evidence; these were not evaluated because they were not based on read-across (for details, see Tab. S35). Among 304 examined read-across adaptations, more than half (55%) were of the analogue type – ECHA accepted read-across justifications for 39% of these (for details, see Tab. S46). Among testing proposals that contained the group type of read-across, a larger fraction (62%) was found to be acceptable. Overall, based on the published TPE decisions from 2008 to 2023, the odds that a testing proposal with read-across hypothesis would be found adequate by ECHA for group read-across submissions were 2.6 times as large as those for analogue read-across (p < 0.05, Fisher’s exact test). It should be noted, however, that this result should be interpreted with caution because if a read-across hypothesis/justification was found to be satisfactory for a group of similar substances, then several “positive” decisions might ensue, thus amplifying the numbers as compared to analogue-type read-across submissions.
The process of evaluation of testing proposals by ECHA has been characterized as “lengthy and bureaucratic” (Taylor et al., 2014b), yet ECHA had to rapidly implement the substantial new regulatory framework of REACH as well as establish internal expertise and competencies for evaluating a large number of testing proposals and registrations. Figure 3A shows the time trends in the public release of TPE decisions (bars) and the fraction of TPEs with read-across adaptations (line). Starting in 2012, when the final decision document formats became more uniform, between 58 and 194 (126 ± 42, mean ± SD) TPE decisions were published by ECHA each calendar year. The fraction of TPEs with read-across adaptations varied even more widely, from 6.2% to 42% (22.5 ± 11.2%). Figure 3B shows the data for published TPEs with read-across adaptations. The stacked bars show the numbers of proposals with analogue and group read-across types. The line shows the fraction of read-across-containing submissions that were deemed acceptable by ECHA. There appears to be a large difference in the rate of acceptance, with much higher acceptance rates in 2012–2015, before ECHA published final RAAF guidance (ECHA, 2015). For example, in 2014, ECHA released decisions on a group of “higher olefins”, which contained 21 substances.
Fig. 3: Time-trend plots for when TPE decisions were published.

(A) The number of all published TPE decisions per calendar year (bars, left y-axis) and the fraction of TPEs that contained read-across (RA) adaptations (line, right y-axis). (B) The number of published TPE decisions per calendar year indicating the type of read-across (stacked bar plots, left y-axis) and the fraction of these that was accepted (line, right y-axis).
The decrease in acceptance of testing proposals between 2015 and 2017 may reflect the time needed to standardize the evaluation process according to RAAF. Overall, the time-dependent trends for the rate of testing proposal acceptance when either proposal submission year or decision publication year is considered, are slightly negative (slopes of −0.078 and −0.101, respectively) but not significant.
There are three main substance types recognized under REACH – mono-constituent, multi-constituent, and UVCBs (ECHA, 2023b). Figure 4A shows that over 90% of published TPE decisions were on mono-constituent and UVCB substances. While the relative proportion of accepted read-across adaptations is about 50% for both, analogue submissions were most common for mono-constituent substances, while for UVCBs group-type read-across dominated. While the numbers for multi-constituent substances are relatively low, most submissions contained analogue type read-across, and the majority of these were not accepted. Figure 4B shows what source substance types were used by registrants in read-across adaptations for each target substance type. While it is not surprising that most (79% for mono-constituent and 75% for UVCB) of the time the target and source substances were of the same type, there were some instances registrants attempted to read across from a UVCB to a mono-constituent substance. It also appears that when a UVCB substance was used as a source for another UVCB, more instances of the read-across adaptation were deemed inadequate as opposed to when a mono-constituent substance was used as the source; however, the differences were not significant between source/target pairs for either substance type.
Fig. 4: Analysis of publicly available TPE decisions by substance type.

(A) Stacked bar plots show the number of substances for which read-across adaptations were accepted or rejected, separated into substance categories. The total number of substances in each category is shown. Within each stacked bar plot, submissions that used analogue (lighter shade) or group (darker shade) read-across are shown. In some instances of rejected read-across, the type of read-across could not be determined (white). (B) For each target substance type, different types of source substances were used as indicated. The outcome of the read-across evaluation is shown by the adjacent bars.
The REACH regulation established information requirements for substance registration (ECHA, 2011); these are based on the annual tonnage produced or imported into the European Union – the higher the tonnage, the greater the number of studies required (Botham et al., 2023). Figure 5 shows the number of read-across adaptations across tonnage bands, from Annex VII (1–10 tons) to Annex X (over 1,000 tons). Recent registration data from ECHA (ECHA, 2023a) shows that among ~12,500 registered substances most (39%) are of the lowest tonnage/data requirements. Substances that are subject to most comprehensive testing are in Annexes IX and X; these comprise about 19% of the total for each Annex (2,346 and 2,335, respectively). When substances with testing proposals were considered (Fig. 5A), we found that few Annex VII (3.1%) and VIII (9.8%) substances had published TPE decisions, while the bulk of the evaluations were for Annex IX (52.5%) and Annex X (34.6%) substances. These trends are not surprising because all tests required by Annexes IX and X but not yet available require a testing proposal to be submitted, while Annexes VII and VIII only do under certain circumstances (ECHA, 2011). Most testing proposals with read-across adaptations were for Annex IX; however, the relative proportion of such submissions is similar across four tonnage bands. When TPEs with read-across adaptations are compared by tonnage band, read-across type, and ECHA decision (Fig. 5B,C), it is evident that most rejected read-across adaptations for substances in Annexes VIII and IX were of the analogue type. The analysis of the odds that a proposed read-across type would favor a positive outcome shows that indeed, the odds of acceptance are significantly greater for group read-across adaptation for Annex VIII and IX substances (Fig. 5D). The odds were highest for Annex VIII substances, but the 95% confidence interval was also wide, owing to a relatively low number of observations.
Fig. 5: Analysis of publicly available TPE decisions by REACH Annex (tonnage band).

(A) Stacked bar plots show the number of substances in each Annex for which publicly available TPEs were examined. Gray color represents TPEs without read-across adaptations, and orange represents those with read-across adaptations. (B,C) The number of substances with accepted (B) and rejected (C) read-across adaptations. Within each stacked bar plot, submissions that used analogue (lighter shade) or group (darker shade) read-across are shown. In some instances of rejected read-across, the type of read-across could not be determined (white). (D) For each Annex number, odds ratios (OR) and 95% confidence intervals for acceptance in group vs analogue (i.e., an OR > 1.0 corresponds to greater odds of acceptance for group). The intervals for Annex IX and VIII do not contain the null OR = 1.0, corresponding to significantly greater odds of acceptance for group (p < 0.05).
We also stratified the data by the type of guideline test that was considered a data gap and where a read-across adaptation was proposed to fulfill data requirement(s) for registration under REACH. Most published TPE decisions concerned tests of health effects in mammalian systems (Fig. 6A). Among the OECD Guidelines for the Testing of Chemicals, Section 4 assays, which are designed to evaluate “health effects”, two assays predominated – TG 414 (Prenatal Developmental Toxicity Study) and TG 408 (Repeated Dose 90-Day Oral Toxicity Study in Rodents). Figure 6B shows the types of tests in OECD Guidelines for the Testing of Chemicals that are listed in Sections 1 (physical-chemical properties), 2 (effects on biotic systems), and 3 (environmental fate and behavior). Among these, the largest number of submissions was for TG 211 (Daphnia magna Reproduction Test). The fraction of submissions that involved read-across adaptations ranged from 14% to 29%. Figures 6C,D show the proportions of accepted and rejected testing proposals for tests with read-across adaptations; most accepted read-across adaptations were of the group type. When the odds of acceptance for group vs analogue read-across were calculated, the significantly greater odds were for any test and TG 414 and TG 408 – a group read-across argument favors acceptance of an adaptation for these tests (for details, see Tab. S46).
Fig. 6: Analysis of publicly available TPE decisions by OECD TG study.

(A,B) Stacked bar plots show the number of substances (i.e., testing proposals) for each TG, separated into “human health” (A), and other (B) OECD test categories. Gray color represents TPEs without read-across adaptations, and orange represents those with read-across adaptations. (C,D) The number of substances with accepted (C) and rejected (D) read-across adaptations. Within each stacked bar plot, submissions that used analogue (lighter shade) or group (darker shade) read-across are shown. In some instances of rejected read-across, the type of read-across could not be determined (white). (E) For the overall data and when split by OECD guideline type, odds ratios (OR) and 95% confidence intervals for acceptance in group vs analogue (i.e., an OR > 1.0 corresponds to greater odds of acceptance for group). The intervals for overall, TG 414 and TG 408 do not contain the null OR = 1.0, corresponding to significantly greater odds of acceptance for group (p < 0.05).
The type of read-across adaptation (analogue or group) may not be a matter of choice for each testing proposal depending on what source substance(s) with appropriate supporting data are available. However, registrants have options in building scientific support for their read-across hypothesis and can present a number of justifications based on ECHA guidance (ECHA, 2015, 2017, 2020, 2022) or other considerations (Ball et al., 2016; Beal et al., 2022; Berggren et al., 2015). It is generally accepted that several AEs, based on the considerations for how “similarity” is established between the target and source compounds, are needed to build the overall argument and justify the proposed adaptation. Because the TPEs we evaluated spanned almost 15 years, during which the best practices for read-across justifications have been evolving, we decided to craft AEs based on both formal ECHA guidance as well as the arguments that were used in the robust summaries of the test proposals or in TPE decisions. The short list of AEs we used in the systematic analysis is shown in Table 1; additional information and example text from ECHA guidelines and TPE decisions can be found in Table S24. Each TPE decision document with read-across adaptation was examined with respect to what AEs were used by the registrants in the test proposals and, separately, by ECHA in their TPE decision. Such analysis allowed us to determine not only the frequency of use for each AE, but also the patterns of whether successful proposals have relied on a particular element or combination thereof.
Figures 7A,B show that among 17 possible AEs we used in this analysis, several had been used far more often than others. For example, arguments in support of structural/physicochemical similarity between target and source substances (AE 1) were included invirtually every testing proposal submission with read-across adaptation; however, there was no difference in whether ECHA accepted this reasoning or not (for details, see Tab. S46). Likewise, a large proportion of submissions relied on AE 16 (bridging studies from assays other than OECD TGs), information that could aid in rationalization of the similarity argument; however, both successful and unsuccessful testing proposals had them at the same rate. However, what is also evident from these figures is that the relative proportion of analogue vs group read-across was not identical between accepted and rejected testing proposals. When odds of acceptance were analyzed for each AE with sufficient data (Fig. 7C), we found that group-based read-across hypotheses that included AE 1, AE 8 (bridging studies that are OECD TG-type), and AE 15 (characterization of constituents for multi-constituent or UVCB substances) were significantly more likely to be accepted. Figure 7D shows the results from multiple logistic regressions, where submission year, decision year, group vs. analogue-based RA, and AEs were included in the same model. Overall, for a testing proposal with read-across to be successful, inclusion of AE 3 (qualitative identification of common biological targets without common metabolites) and AE 15 (characterization of constituents) significantly increased the odds. By contrast, inclusion of information in support of AE 13 (identification of the metabolites of source and target) significantly increased the odds that the read-across would not be accepted. When similar analyses were done separately for either mono-constituent substances or UVCB and multi-constituent substances combined, the patterns were mostly similar but the significance for each AE was different. For mono-constituent substances, the overall direction was mostly the same as for all substances combined. However, the odds of rejection were considerable for UVCBs when AE 13 and AE 16 were part of a read-across hypothesis. Inclusion of data showing quantitative evidence for common biological targets without common metabolites (AE 4) appeared to have a significant positive impact on the odds of acceptance, but this result is based on only two observations within UVCB testing proposals.
Fig. 7: Analysis of publicly available TPE decisions by assessment elements (AEs).

(A,B) Stacked bar plots show the number of substances overall and for each AE, separated into TPEs that were accepted (A) and rejected (B). For AE 15 (marked with *), the fraction shown is for UVCB substances only. (C) For the overall data and for each AE, odds ratios (OR) and 95% confidence intervals for acceptance for group vs analogue (i.e. an OR > 1.0 corresponds to greater odds of acceptance for group). The intervals for overall, AE 1, AE 8, and AE 15 do not contain the null OR = 1.0, corresponding to significantly greater odds of acceptance for group (p < 0.05). (D) ORs from multiple logistic regression analyses with predictors submission year, decision year, group vs. analogue-based RA, and all AEs. For each column (all TPEs, mono-constituent, and UVCB), the ORs are displayed on a color scale, where red indicates predictor increases the chance of acceptance, blue indicates predictor decreases the chance of acceptance, and * denotes coefficients that have false discovery rates of q < 0.05.
We also considered whether in unsuccessful testing proposals with read-across adaptations (Fig. 7B), the AE reasoning was presented by the registrants but not accepted or ECHA pointed out that certain types of data/reasoning were needed to accept the proposed adaptation. Figure 8 shows the outcome of this analysis (for details, see Tab. S57). In many instances, ECHA did not agree that the argumentation for AE 1 was sufficient even though submissions contained information on this topic. The largest fraction of rejected submissions needed additional guideline bridging studies (AE 8), characterization of constituents (AE 15), or additional characterization of metabolites and/or impurities (AE 14). Characterization of the bioavailability of the parent molecule (AE 11) and quantitative analysis of common biological targets (AE 4) were frequently desired by ECHA but not provided by the registrants.
Fig. 8: Reasons that were stated by ECHA for the rejection of a testing proposal with read-across.

The light bar represents when a registrant proposed an AE that ECHA disagreed with and/or interpreted differently. The dark bar represents the registrant failing to take an AE into account in the read-across hypothesis.
Because AE 1 is the core element of building a similarity argument, we examined structural similarity between target and source compounds in testing proposals with read-across adaptations. This analysis was restricted to mono-constituent substances for which chemical descriptors can be calculated, similar to the approach by Patlewicz et al. (2024). In these analyses (Fig. 9) we used structural similarity based on Morgan fingerprints, and calculated Jaccard distance between target and source compounds as a metric for chemical similarity. Similar analyses were conducted using another type of chemical descriptor, the extensible chemistry-aware substructures called Saagar (Sedykh et al., 2021), and the results were nearly identical; therefore, we present here only the results of Morgan fingerprint analyses. For chemical pairs in testing proposals with accepted read-across adaptations (Fig. 9A), the greatest number were highly similar; however, there were many compounds that spanned the entire range. For example, the compounds with lowest similarity scores but with accepted read-across (Tab. S68) were metal-organic compounds such as Co2+ or Sr2+ 2-ethylhexanoate, Ba2+ or Co2+ carbonate, Pa2+ acetate ammoniate (1:2:4), and Co2+ acetate.
Fig. 9: Source-to-target Jaccard similarity values for read-across proposals.

(A) Accepted and (B) rejected testing proposals. Mean similarity values were not significantly different. However, a rigorous cutpoint analysis revealed that testing proposals with read-across for source and target substances with very high similarity (> 0.9) were significantly more likely to be accepted (p = 0.0003, OR = 7.0).
For substances in testing proposals with rejected read-across adaptations, the distribution was much wider, but still right-skewed (Fig. 9B). When the overall distributions in Jaccard similarity were compared between TPEs with accepted and rejected read-across adaptations, the difference in means was not significant (data not shown). However, when the analyses comparing Jaccard similarity between groups was restricted to substances with nearly perfect similarity based on Morgan fingerprints (score = 1) and those with scores that were < 1, then the chance of acceptance of the read-across adaptation was over 75% with a highly significant odds ratio of 7.0 (p = 0.0003 via rigorous cutpoint analysis). When the similarity was not as high, the chance of acceptance was only 31%.
4. Discussion
The goals of this study were three-fold – (i) to examine the overall success rate of testing proposals that contained read-across adaptations to standard information requirements, (ii) to examine whether particular AEs made proposed read-across more or less successful, and (iii) to improve future read-across submissions by providing a structured database of information based on the actual ECHA decisions so that future proposals can identify relevant examples to “learn” from. As we mentioned in the Introduction, there have been many suggestions on how to “improve” read-across, often lamenting conservatism of the decision-makers (Pestana et al., 2022; Patlewicz et al., 2014; Blackburn and Stuard, 2014; Rovida et al., 2020). However, relatively few studies (Ball et al., 2016) attempted to analyze the actual regulatory decisions on read-across submissions and identify specific areas where the registrants may need to do a better job in articulating their read-across justifications, or where the available information and source substances do not sufficiently support the proposed read-across.
The study presented herein is a follow-up on the latter analysis, but almost a decade later and after numerous guidelines and recommendations were issued by ECHA in an attempt to interpret REACH and explain what justifications are necessary for read-across hypotheses (ECHA, 2015, 2017, 2020, 2022). As our time-trend analysis shows, it does not appear that the success rates for read-across-based testing proposal submissions have been improving; therefore, the availability of guidance documents alone may not be sufficient to result in successful adaptations. Similarly, we found that for 77% of testing proposals, registrants did not request adaptations and opted to perform the studies that were required. This can be either because there was no viable source compound(s) or because the registrants chose certainty with respect to REACH compliance. It is possible that registrants were not interested in “trying” the adaptation route only to find that their submission was not acceptable and that additional testing would still be necessary. Therefore, we reason that by examining the decisions and cataloging successful and failed read-across hypotheses it is possible to identify factors associated with positive decisions on read-across adaptations and to improve the outcomes of such submissions in the future.
4.1. Overall trends in read-across adaptations in testing proposals and decisions
There appear to be distinct trends in testing proposals with read-across – a rapid growth of read-across-containing adaptations as a fraction of all evaluated testing proposals before 2015, when the first RAAF guidance was published (ECHA, 2015), a precipitous decline to near zero in 2015, and another steep incline in the 5 years after 2015. On the decision side, a high proportion (60–90%) of read-across-containing TPEs submitted before 2014 was accepted; this was followed by a precipitous drop in acceptance since 2016. There are likely many reasons for these notable swings – they can be attributed to the registration deadlines, internal considerations at ECHA to deal with a large number of initial submissions under REACH, mutual learning of what acceptable read-across is, and intense advocacy from the industry to provide clear guidance on interpretation of vague language in REACH. While certainly interesting from an historical perspective, and despite clear evidence that RAAF guidance did make an impact, perhaps not as intended, it is unlikely that a deeper dive into potential specifics of these trends is possible or will yield instructive learnings.
Our observation that the overall number of proposed adaptations was considerably greater for Annex IX and X substances is likely a direct result of standard testing requirements that escalate with higher tonnage (Botham et al., 2023). Similarly, the finding that most common data gaps pertain to a few tests (e.g., TG 414 and 408) is not surprising because these studies carry considerable cost and use large numbers of animals (Tab. 2). For Annex X substances, the main human health information requirements are TG 443 (Extended One Generation Reproduction Study) and TG 414 in a second species. Where a registrant required both, as well as the studies in Annex IX, it was common for several years for the testing proposals to be split, with a submission on TG 408 and the first species in TG 414, and then a follow-up submission on the remaining studies once the results of the TG 408 and 414 were available. This may explain why in general there were not as many TPE decisions published covering these additional higher tier studies – many decisions on the follow-up studies may still have been under consideration and not publicly available before this study’s cutoff of August 2023.
Tab. 2:
Animal count for representative tests in animals that had proposed read-across adaptations in testing proposals to satisfy REACH substance registration requirements (see Fig. 6 for details)
| OECD TG studies that were the subject of read-across adaptations in testing proposals | Accepted testing proposals with read-across: number (% of total) | Not accepted testing proposals with read-across: number (% of total) | Number of animals required for each test | ||
|---|---|---|---|---|---|
| Minimum number according to each OECD TG | Average number according to (Taylor, 2018) | Average number according to (Knight et al., 2023) | |||
| TG 408: Repeated Dose 90-Day Oral Toxicity Study in Rodents (OECD, 2018a) | 74 (49.7%) | 69 (41.8%) | 100 | 100 | 122 |
| TG 414: Prenatal Developmental Toxicity Study (Rat) (OECD, 2018b) | 88 (59.1%) | 101 (61.2%) | 100a | 900 | 1,459 |
| TG 443: Extended One-Generation Reproductive Toxicity Study (OECD, 2018c) | 10 (7.1%) | 19 (11.5%) | 580 | 960 | 2,733/1,830b |
| TG 489: In Vivo Mammalian Alkaline Comet Assay (OECD, 2016) | 6 (4.0%) | 4 (2.4%) | 25 | —c | 50 |
Not including the number of animals in each litter.
Reflecting the use of mated/unmated animals.
Information was not included in the publication.
It is noteworthy that the apparent “success” rate of read-across adaptations was significantly different between analogue and group/category submissions. The odds for a successful adaptation through group read-across were almost 3-times as high as those for the analogue read-across. These odds were most pronounced for substances in Annexes VIII and IX, and for TG 414 and TG 408 studies. Even though this finding indicates that if the registrant had a choice between analogue and group, they may wish to opt for a group approach, there will be many instances where the choice of group or analogue read-across is dictated by the type of substance and availability of the standard test data to which to read across. In addition, one needs to keep in mind that for an analogue approach, one argument to use read-across covers one substance, and so the ECHA decision covers just one substance. For the groups/categories, the same argument may cover multiple substances, but there will be a separate TPE decision for each one, hence the appearance of a larger number of successful submissions may be misleading. Analogue-type submissions must prove that a source substance represents equal or worse case than target, while group-type submissions need to demonstrate a trend in the effect; therefore, it is generally more intuitive that read-across within a group/category would be more acceptable compared to an analogue approach as there are more substances, and often more data from which to derive trends.
4.2. Read-across assessment elements – Examples
Similarity in structure/physicochemical properties (AE 1) is widely regarded as the foundational element for any read-across hypothesis, and it is not surprising that this AE was part of virtually every read-across-containing testing proposal. There were 3 TPE decisions where read-across hypothesis justification documents were missing (so no AE was included, including AE 1), all of these were rejected as incomplete. For example, ECHA’s RAAF states that “structural similarity is a pre-requisite for any grouping and read-across approach under REACH” (ECHA, 2015). Similarly, this principle is central to the application of read-across to decisions by other agencies (Helman et al., 2019; Lizarraga et al., 2023), and studies of the utility of various descriptors of chemical structure for prediction and characterization of chemical toxicity span several decades (McKinney et al., 2000). While structure-activity predictions are very useful, they also suffer from a number of pitfalls (Zvinavashe et al., 2008) and often need other data to increase confidence (Rusyn et al., 2012).
Our analysis of testing proposals found that AE 1 was included in both accepted and rejected testing proposals, essentially to the same degree; however, very high chemical similarity (i.e., Jaccard similarity score of 1 based on Morgan fingerprints) was a strong predictor of acceptable read-across. Still, testing proposals with read-across for substances with very low structural similarity were also accepted. For example, there were several metal-organic compounds with acceptable read-across but very low structural similarity based on Morgan fingerprints. One example is a group of “cobalt-containing compounds” that included Co2+ 2-ethylhexanoate, Co2+ carbonate, and Co2+ diacetate. The registrants reasoned that a common metal cation, not the organic counterions, was the driver of any adverse health effects; the metal cation rapidly dissociates from the organic counterion in biological fluids. These submissions also included AE 9 (formation of common/identical compounds) and AE 12 (formation and impact of non-common compounds/exposure to other compounds than those linked to the prediction) as part of their read-across hypothesis. The overall rationale presented in these testing proposals was found acceptable by ECHA. It is interesting that decisions on other metalorganic compounds that used similar arguments were published at different times between 2013 to 2023, demonstrating that well-rationalized reasoning based on AE 9 and 12 can overcome low similarity in AE 1 in specific cases.
It should be noted that while considerations of toxicokinetics (AE 9 through 14) are widely regarded as important for “good” read-across (Ball et al., 2016; Rovida et al., 2020), relatively few submissions included AEs other than 9 and 12, as discussed above. In fact, inclusion of AE 13 (metabolites of source and target have been identified) to argue for exposure to structurally similar metabolites or different compounds that cause the same effect appeared to greatly diminish the odds of acceptance. AE 13 was present in 4% of accepted and 26% of rejected testing proposals with read-across. ECHA used AE 13 as a reason to reject proposed read-across in 4% of all unfavorable decisions – 3% were a difference of opinion on how the data was interpreted or whether the data were supportive of the read-across hypothesis, and 1% was the lack of discussion of this AE by the registrant. For example, when computational predictors of metabolism were used without corresponding analytical evidence, ECHA did not find those arguments satisfactory. ECHA also frequently noted the lack of discussion of other metabolites that could be formed (i.e., AE 12) and whether those may be associated with adverse health effects. Irrespective of the apparent negative association, the results of this analysis should not deter practitioners from including data to address AE 13 as part of their read-across justifications. Rather, it should be recognized that this AE is often a core part of a read-across justification, particularly one relying on a metabolism hypothesis. Therefore, where it is relevant, it is recommended to address it comprehensively with experimental data, accounting for any differences in metabolism that could introduce uncertainty. This will provide a stronger foundation for the overall hypothesis and improve its likelihood of acceptance.
The second most common element in proposed read-across hypotheses was AE 16 (toxicodynamic similarity based on the data from a bridging (not a guideline) study). This AE includes any non-TG data submitted to support a read-across, such as in vitro methods, QSAR, and non-guideline animal studies. It was proposed in 77% of accepted and 74% of rejected testing proposals with read-across. It is even more noteworthy that ECHA used this AE to reject a proposed read-across adaptation in 86% of all unfavorable decisions. Among these, ECHA disagreed with the strengths of the registrant’s reasoning based on the available data in 66% of decisions; in 20% of decisions, ECHA pointed out that such data would be needed to strengthen the read-across hypothesis. For example, the use of short-term studies to justify read-across adaptation of chronic or prenatal developmental toxicity studies or presenting the data for only the target or source substance or for a different endpoint, was not deemed to be a satisfactory justification. Similarly, when target and source substances did not demonstrate similar effects in non-guideline studies, or when observations in non-animal (e.g., in vitro test) studies were not substantiated by data from in vivo studies, the read-across hypotheses were found to be not acceptable. For example, the data from in vitro mouse lymphoma assay, without support from in vivo studies, is not adequate as a justification for adaptations of higher-tier endpoints (e.g., TG 408).
A related read-across hypothesis element was AE 8 (bridging (guideline) study), for which registrants submitted data derived from a test conducted using an OECD TG. AE 8 was part of 39% of accepted and 43% of rejected testing proposals with read-across; odds of acceptable read-across adaptation were 4.3 for group-type submissions that had this AE as part of their hypothesis. TPE decisions invoked AE 8 in 76% of unfavorable decisions; half of these cases were a disagreement with the registrant as to whether such data were supportive of the read-across hypothesis, and another half were cases where ECHA pointed out that such data had not been provided but would be needed to justify read-across. For example, when testing proposals for higher-tier endpoints (TG 408 and TG 414) included data from TG 422 (Combined Repeated Dose Toxicity with the Reproduction/Developmental Toxicity Screening Test) in support of read-across and showed similar effects (e.g., same target organ, magnitude of effects, etc.) for both source and target substance, ECHA generally agreed with the registrant’s read-across hypothesis. It is also worth noting that AE 8 was frequently a focus of “third party” comments on testing proposals. Specifically, third parties reasoned that the findings of low toxicity in a 28-day oral study (TG 407) can be used as “bridging” evidence in support of read-across for adaptation of the TG 408 (90-day study) (Taylor and Andrew, 2017; Taylor et al., 2014a). ECHA considered this reasoning in their decisions but stated that it is the registrant’s (i.e., not a third party’s) responsibility to provide such reasoning and any other justifications when proposing adaptations.
Two AEs pertain to the argument that compounds may have common biological effects even if they do not form identical metabolites. Specifically, AE 3 (common biological targets without common metabolites (qualitative)) and AE 4 (common biological targets without common metabolites (quantitative)) address this type of a rationale in read-across. While AE 3 was included in a fairly large number of testing proposals with read-across, AE 4 was only found in 2 submissions. While it is impossible to extrapolate from only 2 submissions with AE 4, both were accepted by ECHA, which may very well indicate that quantitative arguments (i.e., the magnitude of the effect) are impactful. The latter conclusion is also supported by the fact that ECHA pointed out the lack of information pertaining to AE 4 in many unfavorable decisions on testing proposals with read across (13%). Reasoning related to AE 3 was included in 29% of accepted and 17% of rejected testing proposals with read-across. Among these, ECHA disagreed with the strength of the registrant’s reasoning based on the available data in 11% of decisions; in 5% of decisions ECHA pointed out that such data would be needed to strengthen the read-across hypothesis. For example, submissions based on large groups (e.g., higher olefins and resin acids) relied on arguments pertaining to AE 3 when reasoning that “different compounds have the same effect.” The inclusion of AE 3 significantly increased the odds of acceptance for group-type submissions.
Substances that are classified as multi-constituent or UVCB present several additional challenges with respect to the need for establishing both substance identity and to characterize the chemical composition to support read-across. ECHA published separate guidance for these substances (ECHA, 2017); in addition, the chasm between the regulator’s expectations and the realities of analytical chemistry solutions for identifying and quantifying the constituents in highly complex substances has been documented (Roman-Hubers et al., 2023). It is not surprising, therefore, that AE 15 (characterization of multi-/UVCB substances) was commonly included in testing proposals with read-across adaptations for these types of substances. In submissions concerning multi-constituent/UVCB substances, AE 15 was present in 76% of accepted and 32% of rejected testing proposals with read-across. ECHA used this AE in 43% of unfavorable decisions; in 15% it was a difference of opinion on the interpretation of the available data and in 28% because the registrant did not address this AE. The odds of acceptance were significantly higher (4.2) for group-type submissions when this AE was part of the read-across hypothesis. In the cases of an unfavorable decision, ECHA stated that insufficient characterization was provided by the registrant for target or source (or both) substances, which means that there was no way to compare their similarity or lack thereof. While this reasoning is similar to that of AE 1, it is far less clear how to define “broad similarity” for UVCBs. Case examples of petroleum substances were recently published with respect to the use of other supporting data types from in vitro studies to justify grouping (Tsai et al., 2023; House et al., 2021, 2022); however, ECHA did not find these non-guideline “bridging” studies (AE 16) satisfactory for a number of reasons. Many of their questions related to chemical characterization of the substances and/or their extracts that were used for in vitro testing, challenges that need to be addressed by additional research (Roman-Hubers et al., 2022; Cordova et al., 2022).
4.3. Using a database of TPE read-across adaptations to construct future submissions
Even though our analysis showed that about 50% of testing proposal submissions that relied on read-across adaptations were found to be acceptable by ECHA, improvements are needed to increase success. We found that over the past decade the fraction of accepted proposals has stayed relatively constant, and the efforts by all parties involved in the registration process have yet to produce a measurable improvement. Therefore, we reason that the data we extracted from the decisions may provide instructive examples of successful submissions that could be emulated in the near future. It is also clear that by improving the success rate of testing proposal submissions with read-across, measurable impact can be made in terms of reduction in animal use for chemical registrations under REACH. Table 2 shows that the number of animals that will be required to meet information requirements if new testing is performed is substantial. Similarly, if more registrants consider read-across-based or other adaptations, rather than defaulting to new testing, the reduction in animal use will be even more pronounced.
While it is without doubt that registrants and ECHA are committed to sharing encouraging examples and mutually developing best practices for read-across, we reason that the data collected in this project will prove useful to both registrants and regulators (for full database, see Tab. S13, S3–S55,6,7). On the one hand, to achieve greater chances of success, registrants will be able to identify specific aspects of read-across justifications that merit most attention and improve these in the testing proposals they are working on. Our results are also likely to encourage consideration of a group versus analogue approach, although the latter may not be possible for many submissions. We also found that read-across between different substance types should be discouraged, particularly reading across from UVCBs to mono-constituent substances. On the other hand, the trends identified in our study could indicate changes in ECHA’s approach to interpretation of read-across and suggest that additional efforts may be needed by ECHA to improve consistency in the application of their guidance over time. In addition, ECHA and other agencies may use our results and the database to identify areas where they can provide more granular advice to improve how read-across arguments are presented and justified (for details, see Tab. S46, S57).
4.4. Limitations of the study
While our analysis and interpretation of the findings may prove instructive, we also acknowledge that our results may not be taken as definitive and deterministic. First, we note that our dataset may have limitations as it had a cutoff date of August 2023, and the trends in most recent submissions, and thus decisions, may be different based on the attention given to the efforts to improve read-across (Pestana et al., 2022; Patlewicz et al., 2014; Blackburn and Stuard, 2014; Rovida et al., 2020). Second, we examined read-across cases that pertain to testing proposals, and there is likely a much greater number of cases of read-across decisions by ECHA in compliance checks, a dataset that is under evaluation. Third, we relied on the robust summaries of the testing proposals as included in ECHA’s decisions which vary in the level of detail presented; we did not examine the testing proposal submissions. While it is possible that some misinterpretation could have occurred, we reason that robust summaries in TPEs may be more informative in terms of categorizing the elements of the proposed read-across into standardized categories (i.e., AEs, Tab. 1). Even though the AEs were defined by the authors, we reason that no single guidance, or combination of different documents released by ECHA over time, clearly defines read-across AEs. While this is another limitation, we point out that we have made every effort to assure that the AEs used herein were comprehensive and consistent with both RAAF and the actual TPE decisions (Tab. S24). Along the same line, we cannot confirm how representative the TPE decisions are of the entire ECHA database because that information is not available.
Fourth, a large difference in the success rate of read-across adaptations between testing proposals and compliance checks (Ball et al., 2016) indicates that the “bar” may be lower on the former because the decision to allow for the use of read-across as part of a testing proposal is provisional. Whether the use of read-across is ultimately acceptable or not for the registration can only be decided once the registration dossier undergoes a compliance check. Consequently, there may be cases where the use of read-across has been accepted for the testing proposal but is subsequently challenged or rejected. For example, the registrants of decan-4-olide (EC: 211–892-8) submitted a testing proposal with read-across adaptation for TG 211 (long-term toxicity to aquatic invertebrates), and this submission was found to be acceptable by ECHA in 2015. However, a compliance check decision by ECHA (published in 2023) concluded a read-across adaptation to standard information requirement was insufficient and that TG 211 data were needed to justify proposed classification. Even though this caveat does not invalidate the results presented in this manuscript, it does highlight that even “final” decisions on read-across can be subject to change.
Finally, we need to point out that although our analysis has identified some specific areas that were influential in read-across acceptance, it is rare that ECHA decisions focus on just one aspect; often, multiple reasons may lead to the ultimate decision to accept or not accept. Furthermore, the submission may be accepted not because ECHA agreed with the rationale presented in a testing proposal, but because the combination of the information included in the submission was deemed to be sufficient. For example, for the asphalt UVCB (EC: 232–490-4) submission, the registrant proposed a read-across adaptation of TG 414 to another substance in the group (residues (petroleum), thermal cracked vacuum; CAS: 92062–05-0) on which this test was to be performed. The principal reasoning by the registrant was that the target and source substances belong to a group formed based on the refining process and on similarity in carbon number distribution and hydrocarbon class profiles. Furthermore, the registrant hypothesized that one of the hydrocarbon classes – polycyclic aromatic hydrocarbons containing 4 or more aromatic rings – is the putative reproductive toxicant. While ECHA found that the proposed justification for the overall group was insufficient, it agreed that a read-across to the proposed source substance selected based on the PAH with 4+ aromatic rings was “appropriate”. While such cases complicate the overall analysis, as they make it difficult to be confident that one or two specific AEs had an impact on each decision, our data still identified several “critical” AEs as detailed above.
Ultimately, we encourage registrants to focus their efforts on improving the rationales for their read-across hypotheses to stand a better chance of acceptance. Even though each read-across submission is unique in terms of the type of substance, availability of the data, and the endpoint, we show that there are several general trends and encourage registrants to consult case examples in the database when crafting their submissions.
Supplementary Material
Acknowledgements
The authors are grateful to ECHA Hazard Assessment Directorate staff for general advice and encouragement on this project. The authors would also like to thank Drs Lauren Zeise, Kannan Krishnan, and Anatoly Soshilov at OEHHA for useful discussions.
Funding
This work was supported, in part, by grants from the National Institute of Environmental Health Sciences (P42 ES027704 and T32 ES026568) and a contract from California Environmental Protection Agency Office of Environmental Health Hazard Assessment (OEHHA).
Footnotes
Disclaimer
This publication’s contents are solely the responsibility of the grantee and do not necessarily represent the official views of the funding agencies.
Conflict of interest
N. Ball is an employee of the Dow Chemical Company, which submitted several registrations and testing proposals for evaluation by ECHA. Other authors declare no conflicts of interest.
Data availability
Data extracted from publicly available testing proposal evaluations by ECHA are included as Tables S3–S65–8.
References
- Ball N, Cronin MT, Shen J et al. (2016). Toward good read-across practice (GRAP) guidance. ALTEX 33, 149–166. doi: 10.14573/altex.1601251 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beal MA, Gagne M, Kulkarni SA et al. (2022). Implementing in vitro bioactivity data to modernize priority setting of chemical inventories. ALTEX 39, 123–139. doi: 10.14573/altex.2106171 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berggren E, Amcoff P, Benigni R et al. (2015). Chemical safety assessment using read-across: Assessing the use of novel testing methods to strengthen the evidence base for decision making. Environ Health Perspect 123, 1232–1240. doi: 10.1289/ehp.1409342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blackburn K and Stuard SB (2014). A framework to facilitate consistent characterization of read across uncertainty. Regul Toxicol Pharmacol 68, 353–362. doi: 10.1016/j.yrtph.2014.01.004 [DOI] [PubMed] [Google Scholar]
- Botham P, Cronin MTD, Currie R et al. (2023). Analysis of health concerns not addressed by REACH for low tonnage chemicals and opportunities for new approach methodology. Arch Toxicol 97, 3075–3083. doi: 10.1007/s00204-023-03601-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cordova AC, Ford LC, Valdiviezo A et al. (2022). Dosing methods to enable cell-based in vitro testing of complex substances: A case study with a PAH mixture. Toxics 11, 19. doi: 10.3390/toxics11010019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- EC – European Council (2006). Regulation (EC) No 1907/2006 of the European Parliament and the Council of 18 December 2006 concerning the Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH), establishing a European Chemicals Agency, amending Directive 1999/45/EC and repealing Council Regulation (EEC) No 793/93 and Commission Regulation (EC) No 1488/94 as well as Council Directive 76/769/EEC and Commission Directives 91/155/EEC, 93/105/EC and 2000/21/EC. 1–849. https://eur-lex.europa.eu/eli/reg/2006/1907/oj
- ECHA (2008). Guidance on information requirements and chemical safety assessment. Chapter R.6: QSARS and grouping of chemicals. https://echa.europa.eu/documents/10162/13632/information_requirements_r6_en.pdf
- ECHA (2011). Guidance on information requirements and chemical safety assessment. https://echa.europa.eu/guidance-documents/guidance-on-information-requirements-and-chemical-safety-assessment (accessed 21.07.2024)
- ECHA (2015). Read-Across Assessment Framework (RAAF). doi: 10.2823/546436 [DOI]
- ECHA (2017). Read-Across Assessment Framework (RAAF) – Considerations on multi-constituent substances and UVCBs. https://echa.europa.eu/documents/10162/13630/raaf_uvcb_report_en.pdf/3f79684d-07a5-e439-16c3-d2c8da96a316 (acessed 25.08.2020)
- ECHA (2020). Dossier Evaluation Process – Principles applied by ECHA in reviewing read-across and category approaches. https://echa.europa.eu/documents/10162/17220/echa_testing_strategies_under_reach_en.pdf/6ffd4ed1-8f14-6326-8142-2c881846fdac (accessed 26.07.2024)
- ECHA (2022). Advice on using read-across for UVCB substances – Obligations arising from Commission Regulation 2021/979, amending REACH annexes. https://www.echa.europa.eu/documents/10162/11395738/advice_uvcb_read-across_en.pdf/ac1f64a6-9ee5-441e-cf1c-92914b843b4e?t=1651665130365 (accessed 19.05.2022)
- ECHA (2023a). The use of alternatives to testing on animals for the REACH Regulation: Fifth report under Article 117(3) of the REACH Regulation. doi: 10.2823/805454 (accessed 21.07.2024) [DOI]
- ECHA (2023b). Guidance for identification and naming of substances under REACH and CLP. https://echa.europa.eu/documents/10162/2324906/substance_id_en.pdf/ee696bad-49f6-4fec-b8b7-2c3706113c7d (accessed 21.07.2024)
- ECHA (2024). ECHA checked over 20% of REACH registration dossiers for compliance. https://echa.europa.eu/-/echa-checked-over-20-of-reach-registration-dossiers-for-compliance-1 (accessed 11.11.2024)
- Guha R (2007). Chemical informatics functionality in R. J Stat Softw 18, 1–16. doi: 10.18637/jss.v018.i05 [DOI] [Google Scholar]
- Helman G, Shah I, Williams AJ et al. (2019). Generalized read-across (GenRA): A workflow implemented into the EPA CompTox chemicals dashboard. ALTEX 36, 462–465. doi: 10.14573/altex.1811292 [DOI] [PMC free article] [PubMed] [Google Scholar]
- House JS, Grimm FA, Klaren WD et al. (2021). Grouping of UVCB substances with new approach methodologies (NAMs) data. ALTEX 38, 123–137. doi: 10.14573/altex.2006262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- House JS, Grimm FA, Klaren WD et al. (2022). Grouping of UVCB substances with dose-response transcriptomics data from human cell-based assays. ALTEX 39, 388–404. doi: 10.14573/altex.2107051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knight J, Hartung T and Rovida C (2023). 4.2 million and counting… The animal toll for REACH systemic toxicity studies. ALTEX 40, 389–407. doi: 10.14573/altex.2303201 [DOI] [PubMed] [Google Scholar]
- Lizarraga LE, Suter GW, Lambert JC et al. (2023). Advancing the science of a read-across framework for evaluation of data-poor chemicals incorporating systematic and new approach methods. Regul Toxicol Pharmacol 137, 105293. doi: 10.1016/j.yrtph.2022.105293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lowe CN and Williams AJ (2021). Enabling high-throughput searches for multiple chemical data using the US-EPA CompTox chemicals dashboard. J Chem Inf Model 61, 565–570. doi: 10.1021/acs.jcim.0c01273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKinney JD, Richard A, Waller C et al. (2000). The practice of structure activity relationships (SAR) in toxicology. Toxicol Sci 56, 8–17. doi: 10.1093/toxsci/56.1.8 [DOI] [PubMed] [Google Scholar]
- OECD (2016). Test No. 489: In Vivo Mammalian Alkaline Comet Assay. OECD Guidelines for the Testing of Chemicals, Section 4. OECD Publishing, Paris. doi: 10.1787/9789264264885-en [DOI] [Google Scholar]
- OECD (2017). Guidance on Grouping of Chemicals, Second Edition. OECD Series on Testing and Assessment, No. 194. OECD Publishing, Paris. doi: 10.1787/9789264274679-en [DOI] [Google Scholar]
- OECD (2018a). Test No. 408: Repeated Dose 90-Day Oral Toxicity Study in Rodents. OECD Guidelines for the Testing of Chemicals, Section 4. OECD Publishing, Paris. doi: 10.1787/9789264070707-en [DOI] [Google Scholar]
- OECD (2018b). Test No. 414: Prenatal Developmental Toxicity Study. OECD Guidelines for the Testing of Chemicals, Section 4. OECD Publishing, Paris. doi: 10.1787/9789264070820-en [DOI] [Google Scholar]
- OECD (2018c). Test No. 443: Extended One-Generation Reproductive Toxicity Study. OECD Guidelines for the Testing of Chemicals, Section 4. OECD Publishing, Paris. doi: 10.1787/9789264185371-en [DOI] [Google Scholar]
- Patlewicz G, Ball N, Becker RA et al. (2014). Read-across approaches – Misconceptions, promises and challenges ahead. ALTEX 31, 387–396. doi: 10.14573/altex.1410071 [DOI] [PubMed] [Google Scholar]
- Patlewicz G, Karamertzanis P, Friedman KP et al. (2024). A systematic analysis of read-across within REACH registration dossiers. Comput Toxicol 30, 1–15. doi: 10.1016/j.comtox.2024.100304 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pestana C, Enoch SJ, Firman JW et al. (2022). A strategy to define applicability domains for read-across. Comput Toxicol 22, 100220. doi: 10.1016/j.comtox.2022.100220 [DOI] [Google Scholar]
- Roman-Hubers AT, Cordova AC, Rohde AM et al. (2022). Characterization of compositional variability in petroleum substances. Fuel 317, 123547. doi: 10.1016/j.fuel.2022.123547 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roman-Hubers AT, Cordova AC, Barrow MP et al. (2023). Analytical chemistry solutions to hazard evaluation of petroleum refining products. Regul Toxicol Pharmacol 137, 105310. doi: 10.1016/j.yrtph.2022.105310 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rovida C, Barton-Maclaren T, Benfenati E et al. (2020). Internationalization of read-across as a validated new approach method (NAM) for regulatory toxicology. ALTEX 37, 579–606. doi: 10.14573/altex.1912181 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rovida C, Busquet F, Leist M et al. (2023). REACH outnumbered! The future of REACH and animal numbers. ALTEX 40, 367–388. doi: 10.14573/altex.2307121 [DOI] [PubMed] [Google Scholar]
- Rusyn I, Sedykh A, Low Y et al. (2012). Predictive modeling of chemical hazard by integrating numerical descriptors of chemical structures and short-term toxicity assay data. Toxicol Sci 127, 1–9. doi: 10.1093/toxsci/kfs095 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sedykh AY, Shah RR, Kleinstreuer NC et al. (2021). Saagar – A new, extensible set of molecular substructures for QSAR/QSPR and read-across predictions. Chem Res Toxicol 34, 634–640. doi: 10.1021/acs.chemrestox.0c00464 [DOI] [PubMed] [Google Scholar]
- Taylor K, Andrew DJ and Rego L (2014a). The added value of the 90-day repeated dose oral toxicity test for industrial chemicals with a low (sub)acute toxicity profile in a high quality dataset. Regul Toxicol Pharmacol 69, 320–332. doi: 10.1016/j.yrtph.2014.04.008 [DOI] [PubMed] [Google Scholar]
- Taylor K, Stengel W, Casalegno C et al. (2014b). Experiences of the REACH testing proposals system to reduce animal testing. ALTEX 31, 107–128. doi: 10.14573/altex.1311151 [DOI] [PubMed] [Google Scholar]
- Taylor K and Andrew DJ (2017). The added value of the 90-day repeated dose oral toxicity test for industrial chemicals with a low (sub)acute toxicity profile in a high quality dataset: An update. Regul Toxicol Pharmacol 90, 258–261. doi: 10.1016/j.yrtph.2017.09.018 [DOI] [PubMed] [Google Scholar]
- Taylor K (2018). Ten years of REACH – An animal protection perspective. Altern Lab Anim 46, 347–373. doi: 10.1177/026119291804600610 [DOI] [PubMed] [Google Scholar]
- Tsai HD, House JS, Wright FA et al. (2023). A tiered testing strategy based on in vitro phenotypic and transcriptomic data for selecting representative petroleum UVCBs for toxicity evaluation in vivo. Toxicol Sci 193, 219–233. doi: 10.1093/toxsci/kfad041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zvinavashe E, Murk AJ and Rietjens IM (2008). Promises and pitfalls of quantitative structure-activity relationship approaches for predicting metabolism and toxicity. Chem Res Toxicol 21, 2229–2236. doi: 10.1021/tx800252e [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data extracted from publicly available testing proposal evaluations by ECHA are included as Tables S3–S65–8.
