Abstract
Mechanistic toxicology provides a powerful approach to inform on the safety of chemicals and the development of safe‐by‐design compounds. Although toxicogenomics supports mechanistic evaluation of chemical exposures, its implementation into the regulatory framework is hindered by uncertainties in the analysis and interpretation of such data. The use of mechanistic evidence through the adverse outcome pathway (AOP) concept is promoted for the development of new approach methodologies (NAMs) that can reduce animal experimentation. However, to unleash the full potential of AOPs and build confidence into toxicogenomics, robust associations between AOPs and patterns of molecular alteration need to be established. Systematic curation of molecular events to AOPs will create the much‐needed link between toxicogenomics and systemic mechanisms depicted by the AOPs. This, in turn, will introduce novel ways of benefitting from the AOPs, including predictive models and targeted assays, while also reducing the need for multiple testing strategies. Hence, a multi‐step strategy to annotate AOPs is developed, and the resulting associations are applied to successfully highlight relevant adverse outcomes for chemical exposures with strong in vitro and in vivo convergence, supporting chemical grouping and other data‐driven approaches. Finally, a panel of AOP‐derived in vitro biomarkers for pulmonary fibrosis (PF) is identified and experimentally validated.
Keywords: adverse outcome pathways, biomarkers, new approach methodologies, toxicogenomics
Comprehensive molecular annotation of adverse outcome pathways (AOPs) supports the development of new approach methodologies (NAMs) and the integration of toxicogenomic evidence into chemical safety assessment. Annotated AOPs can be used to interpret complex omics data, identify relevant biomarkers, and to gain information on potential systemic adverse outcomes with high in vitro to in vivo convergence.
1. Introduction
Mechanistic aspects of chemical exposures have been long exploited in the context of academic research, resulting in the emergence of toxicogenomics and systems toxicology as independent fields.[ 1 , 2 ] Although the mechanistic insight gained through the technologies employed in academia has been valued as supporting evidence in the regulatory setting, its incorporation into the regulatory framework is to date hindered by concerns related to the robustness and reproducibility of such data and its analysis.[ 3 ] At the same time, the growing need for faster, cheaper, and more ethical approaches for chemical safety assessment have made mechanistic toxicology central for clarifying aspects important to regulatory decision making. Furthermore, uncovering exposure related mechanistic properties is emerging as a fundamental approach for the design of new drugs and chemicals.[ 4 , 5 ] Hence, multiple high‐end research initiatives are underway to drive the shift from traditional animal‐based assessment of apical toxicity endpoints toward in vitro and in silico approaches supported by mechanistic evidence.[ 6 , 7 , 8 ]
Adverse outcome pathways (AOP) emerged as models to organize biological mechanisms into causally linked sequences of multi‐scale events to support chemical risk assessment.[ 9 ] AOPs have since expanded beyond the limits of toxicology, showing their applicability in organizing mechanisms of disease progression and adverse health outcomes,[ 10 , 11 ] and could even be applied to assess beneficial effects of therapies. The mechanisms depicted by AOPs comprise a sequence of events that progress from the molecular initiating event (MIE) toward an adverse outcome (AO) through intermediate steps, key events (KEs), with biological complexity increasing as the AOP progresses. Individual KEs are connected by key event relationships (KER) that verbally explain the causal link between the events and provide context for the pathway.
The AOP concept quickly attracted attention due to its potential in tackling one of the major challenges in the shift away from traditional toxicology: deciphering systemic and long‐term outcomes of chemical exposures without the use of animal experiments. While significant efforts still need to be made toward this goal, AOPs encompass the means to systematically guide the integration of in vitro‐based evidence into the risk assessment framework.[ 12 ] AOPs provide the grounds for various predictive approaches, read‐across, and the development of targeted assays and new approach methodologies (NAMs), as also suggested by regulatory agencies and international organizations, such as the OECD.[ 8 ] Furthermore, the construction of AOPs can help identify gaps in knowledge and guide resources toward mechanisms in need of further investigation, or alternatively, reveal connections that have not been previously characterized.[ 13 ]
Concurrently with the development of the AOP framework, the role of omics data in elucidating biomarkers and mechanisms of action (MOA) of chemical exposures and diseases has become more prominent.[ 14 , 15 , 16 , 17 , 18 ] Omics data have been used to support the development of AOPs, especially through the identification of molecular targets and mechanisms.[ 19 , 20 , 21 , 22 , 23 ] However, full exploitation of omics‐based evidence in the context of AOPs is hindered by the complication of linking molecular data to complex biological events, affecting both the development and the application of AOPs. Furthermore, while the value of omics data in answering questions of regulatory importance is recognized, the complexity of its interpretation and the lack of standardization in analysis and reporting have hampered widespread regulatory acceptance of omics‐based evidence.[ 24 ] Bypassing these challenges could broaden the application of AOPs, support the interpretation of complex omics data, and further aid in the development of the concept toward quantitative models and assays. While molecular assays based on arbitrarily selected reporter genes have been proposed (e.g., ToxCast assays), there is an urgent need to develop new data‐driven unbiased molecular assays for reliable and efficient mechanistic safety assessment of chemicals.
Here we hypothesized that rigorous curation of molecular events associated with AOPs could facilitate the implementation of omics‐based evidence to 1) guide the interpretation of omics data readout, 2) support the development of new AOPs, 3) identify and fill gaps in knowledge, and 4) transfer AOP‐based knowledge into robust assays to support chemical safety assessment.
Well‐curated gene ontologies, pathways, and biological processes are used to interpret omics results and their translation into biologically relevant information. While some KEs can be easily crosslinked with such terms and their associated genes, the annotation of complex KEs taking place at a higher level of biological organization (e.g., at the tissue‐ or organism‐level) is a more demanding task. This requires knowledge regarding ontologies and the biological events themselves. For instance, generic annotations are helpful for categorizing KEs, but without the intention of modeling KEs using the associated gene sets, they will likely not reach the level of granularity required for such a task. This is currently reflected in the annotations provided in the AOP‐Wiki repository (aopwiki.org). The annotation of KEs to selected ontologies is included as an option in AOP‐Wiki. However, the coverage of the annotations is currently low and has not been intended for modeling the KEs using the gene sets associated with their annotations.
Previous efforts to curate external annotation have shown the potential of the approach.[ 25 , 26 ] However, these have either remained at the level of abstract associations or focused on individual examples.[ 27 , 28 ] Hence, systematic, fit‐for‐purpose, and up‐to‐date annotation linking KEs to curated gene sets has not yet been established. To this end, we applied an integrated strategy for defining gene‐KE‐AOP associations through systematic curation. We show the applicability of our strategy for evaluating potential AOs of chemical exposures, and for the identification of AOP‐driven biomarkers that can inform the development of target assays and novel approaches to chemical hazard characterization.
2. Results and Discussion
We developed an integrated approach to systematically associate curated gene sets to the KEs and AOPs. Our approach combines natural language processing (NLP) techniques with manual curation to link relevant biological processes and pathways, as well as their associated genes, to KEs of AOPs relevant for human health risk assessment. The resulting gene‐KE‐AOP connections enable the modeling of KEs and AOPs through gene‐level data, which further introduces novel ways to benefit from the AOP concept. We applied this approach to generate an AOP fingerprint for a known profibrotic exposure in vivo and in vitro and finally combined the annotation to a framework for prioritizing KE‐ and AOP‐associated genes to guide the discovery of biomarkers and reporter genes. The complete approach described in the following sections is summarized in Figure 1 .
2.1. The Majority of KEs can be Successfully Annotated to Curated Gene Sets
At the time of retrieving the data from the AOP‐Wiki repository (November 2020), a total of 289 AOPs and 1131 distinct KEs were identified. However, after eliminating the AOPs for which taxonomic applicability was either not available nor in the scope of our investigation, 176 AOPs and 856 unique KEs remained, forming a total of 1245 unique AOP‐KE pairs (specific KEs). Although the AOP‐Wiki houses selected annotations for some of the KEs, majority of them were considered not to be specific enough for our purpose (i.e., KEs describing the dysregulation of a specific gene annotated to terms such as “gene expression”). Additionally, as the existing annotations only cover a part of the KEs, we decided to consistently curate the annotation of all KEs. As a result, 799 unique KEs mapped to 175 AOPs received a curated annotation. The KEs were treated as individual instances, hence the same KE mapped to multiple AOPs was always annotated to the same term(s). A summary of the number of terms annotated to the KEs is presented in Figure 2A along with the proportions of the different term sources (Figure 2B). GO biological processes (GO_BP) represent most of the mapped annotations, followed by GO molecular functions (GO_MF) and human phenotype ontology (HPO). Since up to five annotations were provided for the KEs, the final gene sets used from herein comprise the union of the genes mapped to each annotated term. This structure allowed improved specificity, while also providing the possibility to further refine the gene sets using the hierarchical order implemented where applicable. The size of the gene sets associated to each KE range from one to 5990 genes, with a median value of 81 genes. Consequently, when AOPs are modeled by combining the gene sets associated to their KEs, the gene set sizes range from 15 to 5992 with the median size being 752 genes.
In total, the annotations comprise 15 825 genes. While the majority of genes are annotated to less than 5 KEs (9044 genes), 1434 genes have more than 20 KEs associated to them, and 50 genes have more than 80 associated KEs (Figure 2C). Although these numbers can be affected by annotation bias, for example, certain genes are better researched and annotated than others, they can also guide the selection of AOP‐driven biomarkers when specificity is of importance.
2.2. AOP Enrichment Highlights Relevant Adverse Outcomes Associated to Chemicals
We tested the ability of our novel annotations to highlight relevant AOPs by analyzing a set of curated reference chemicals as defined by EU Reference Laboratory for alternatives to animal testing (ECVAM) and National Toxicology Program Interagency Center for the Evaluation of Alternative Toxicological Methods (NICEATM). We focused on four categories of chemicals defined by their toxicity properties to include hepatotoxic and carcinogenic agents as well as thyroid disruptors and sex hormone receptor (estrogen receptor—ER, and androgen receptor—AR) agonists. For each of the selected chemicals, we retrieved a list of associated genes from the Comparative Toxicogenomics Database (CTD),[ 29 ] resulting in a final set of 75 chemicals (File S1, Supporting Information).
First, we identified AOPs related to each of the selected categories (i.e., AOPs related to carcinogenesis, hepatotoxicity, sex hormones, and thyroid disruption) among all the AOPs. We then evaluated the prevalence of these relevant AOPs among the five most significantly enriched AOPs for each chemical. The results suggest that the enrichment approach successfully highlights AOPs of relevance for each group of chemicals (Figure 3 ). All sex hormone receptor agonists had at least one relevant AOP among the top five enriched, while the proportions varied from 43% (thyroid disrupters) to 93% (carcinogens) in the other categories (Figure 3).
In the group of carcinogenic chemicals, 93% of the compounds evaluated had cancer‐related adverse outcomes among the top enriched AOPs. In fact, the group of carcinogens had the highest proportion of relevant AOPs at the top as compared to the others (median four out of five compared to the median of two out of five in the other groups). However, it should be noted that AOPs related to cancer are among the most represented group of AOPs, and cancer‐related genes are generally highly researched and annotated, which may introduce a level of annotation bias that should be recognized.
The remaining four carcinogenic chemicals (7%) that showed no cancer AOPs among the top enriched AOPs were N‐nitrosodiethanolamine, N‐nitrosomorpholine, phenacetin, and tetrachloroethylene. N‐nitrosomorpholine and N‐nitrosodiethanolamine are both nitrosamines whose suspected AOs besides carcinogenesis include non‐alcoholic steatohepatitis.[ 30 ] Indeed, both compounds contained hepatic steatosis related AOPs among the top five enriched AOPs (File S1, Supporting Information). Tetrachloroethylene (perchloroethylene, PCE) is a chlorocarbon solvent used in dry‐cleaning and other degreasing applications.[ 31 ] AOPs with the most significant enrichment for PCE were also related to hepatic adverse outcomes. Although neurotoxicity is one of the most frequent AOs associated with PCE exposure, hepatotoxicity has also been reported.[ 31 ] Our results documenting liver steatosis are supported by biopsy‐based evidence of liver disease, both in human and animal models, in settings of high occupational exposures.[ 32 ] Last, phenacetin is a drug that was widely used as pain medication until it was withdrawn from the market across the globe due to increasing evidence of carcinogenicity and renal toxicity.[ 33 ] The most enriched AOPs for phenacetin included immune related AOPs “Immune mediated hepatitis” (Aop:362) and Aop:277 titled “Inhibition of IL‐1 binding to IL‐1 receptor leading to increases susceptibility to infection”. Although there is no described association between phenacetin and IL‐1 or immuno‐toxicity, it is known that they both play a role in paracetamol‐associated liver toxicity, which is the main metabolite of phenacetin.[ 33 , 34 ]
In the case of the known liver toxicants, hexaconazole was the only compound not highlighting AOPs associated with liver toxicity among the top enriched AOPs. Hexaconazole is a widely used triazole fungicide. It acts by blocking sterol biosynthesis via inhibition of cytochrome P450.[ 35 ] Hexaconazole was considered as a Group C‐Possible Human Carcinogen by the US EPA due to increased incidence of benign Leydig cell tumors in rats (https://www3.epa.gov/pesticides/chem_search/hhbp/R000356.pdf). Moreover, it was found to affect the reproduction of female rats.[ 35 ] The top enriched AOPs correctly identify this signature. Furthermore, the top two pathways “HMG‐CoA reductase inhibition leading to decreased fertility” and “Modulation of adult Leydig cell function subsequent to decreased cholesterol synthesis or transport in the adult Leydig cell” both suggest a decrease in cholesterol levels by inhibition of the HMG‐CoA reductase. Drugs inhibiting this enzyme, such as statins, are known to possibly cause liver damage.[ 36 ]
Known thyroid toxicants performed poorest in our analysis. Bifenthrin, malathion, permethrin, and simazine did not capture thyroid related AOPs among the top five enriched. All these compounds have been widely used in agriculture as herbicides or pesticides. Agrochemicals represent a significant class of endocrine disrupting chemicals, albeit through varying mechanisms. It is now accepted that many of these molecules may mimic the interaction of endogenous hormones with nuclear receptors, such as estrogen, androgen, and thyroid hormone receptors.[ 37 ] Indeed, bifenthrin has already been reported as an endocrine‐disrupting compound by blocking the binding of endogenous hormones.[ 38 ] In our framework, its anti‐estrogenic activity emerges as the most enriched AOP (File S1, Supporting Information). Malathion is an organophosphate pesticide that is known for its low acute toxicity and rapid degradation.[ 39 ] In this light, it is not listed as a primary thyroid disrupting chemical, and its toxicity has been associated with the inhibition of acetylcholinesterase activity on nerve impulse.[ 39 ] Recent studies, however, demonstrated that malathion acts as an endocrine disruptor, both in vitro and in vivo.[ 40 , 41 ] Our results support these findings, highlighting the Aop:165: “Anti‐estrogen activity leading to ovarian adenomas and granular cell tumors in the mouse” as well as Aop:112: “Increased dopaminergic activity leading to endometrial adenocarcinomas.” Furthermore, Moore et al. demonstrated that malathion exposure at higher concentrations induces cytotoxic and genotoxic effects in HepG2 through oxidative stress, which can finally lead to liver cancer.[ 39 ] Similarly, our framework highlights both the “PPARalpha‐dependent liver cancer” and “Cyp2E1 activation leading to liver cancer” AOPs. Simazine is a triazine herbicide whose use has been banned in most European countries for nearly two decades.[ 42 ] Simazine has now been recognized, similarly to the other compounds, as an endocrine disrupter.[ 42 ] Interestingly, the enrichment analysis for simazine highlighted AOPs related to the development of adenomas and carcinomas through endocrine disrupting activities (e.g., Aop:107 titled “Constitutive androstane receptor activation leading to hepatocellular adenomas and carcinomas in the mouse and the rat”) as well as direct disruption of the GnRH pulse (File S1, Supporting Information). Although multiple in vivo and in silico evidence also indicate permethrin as possible endocrine disruptor,[ 43 , 44 ] no endocrine related pathways are present in the top enriched AOPs. However, this framework was able to highlight the modulating effect of permethrin on the lipid metabolism. It has been demonstrated that in HepG2 cells, permethrin increases lipogenesis and decreases beta oxidation, possibly contributing to the development of NAFLD.[ 45 ] Indeed, the “Inhibition of fatty acid beta oxidation leading to nonalcoholic steatohepatitis (NASH)” AOP is statistically enriched in our results.
Together, these results highlight relevant AOPs modeled by our curated gene sets to be enriched by the genes associated to the compounds, suggesting that our framework is able to support robust mechanistic and data‐driven chemical grouping as well as the identification of potential AOs using chemical‐gene associations.
2.3. Our Annotation Enables Grouping of KEs Resulting in Improved Modeling of the AOP Network
In order to fully unleash the potential of mechanistic toxicology, more informative testing strategies need to be developed that can monitor specific phases of the exposure‐bio interactions and mechanisms. To this end, we defined accurate sets of genes capable of modeling specific KEs and AOPs. However, one of the challenges observed in the AOP‐Wiki is the redundant semantics in the naming of KEs. While creating a new KE can be meaningful in many cases (e.g., the same biological process taking place in a distinct organ or a tissue), unnecessary redundancy can lead to challenges in the application of the AOP‐based knowledge. This is especially true when modeling AOPs as a network and using such representation to identify hidden connections and to perform read‐across analysis.[ 10 , 46 , 47 , 48 , 49 , 50 , 51 ]
Hence, we hypothesized that KEs could be grouped based on the degree of similarity of their associated gene lists. We calculated the similarity of the KEs based on their annotated gene sets and grouped together those mapped to identical sets of genes (Jaccard Index (JI) = 1). This resulted in the identification of 637 groups of varying sizes. These groups were characterized by four main concepts: 1) truly duplicated KEs due to distinct semantics; 2) same biological event in multiple biological systems; 3) subsequent KEs mapped to the same terms due to inadequate specificity; and 4) opposite regulation of the same biological event (i.e., increased vs decreased signaling).
Here, the grouping based on identical gene sets was selected due to the nature of the downstream application and statistical considerations (i.e., to avoid multiple testing against the same gene set in enrichment analysis). However, a parallel approach with varying cut‐off values for similarity could be implemented to cluster KEs more roughly and to define specific categories of events. Similarly, further refinement of the KE clusters could help to enhance the AOP network by removing redundant nodes which, in turn, could reveal hidden links.
The potential of the KE grouping was showcased using a subgraph formed by considering the AOPs related to pulmonary fibrosis (PF). PF is a chronic lung disease characterized by tissue damage and scarring that impairs lung function.[ 52 ] A range of environmental exposures, including certain chemicals, drugs, radiation, and nanomaterials (most notably carbon nanotubes), as well as infectious diseases have been identified as causative agents for PF.[ 52 , 53 , 54 ] Moreover, the COVID‐19 pandemic has raised concerns about increasing rates of PF.[ 55 , 56 , 57 ] Understanding the disease mechanisms can help in the development of strategies to treat and prevent the disease, and to control and modulate the exposures that contribute to its pathogenesis and progression. Furthermore, it can serve as the foundation for developing targeted assays for evaluating profibrotic potential of chemical exposures.
Six AOPs related to PF were available in the AOP‐Wiki at the time of data retrieval (Figure 4A). These distinct AOPs characterize multiple pathways leading to the same AO. Together, these AOPs comprise 30 KEs, which form a connected graph when modeled as a network (Figure 4C). However, several redundancies were observed among the KEs. For instance, the AO was expressed either as lung fibrosis (Event:1276) or pulmonary fibrosis (Event:1458). Hence, the application of the similarity‐based grouping resulted in 23 distinct KEs (Figure 4B) that were then used as the basis for merging the KE nodes in the PF network (Figure 4D).
The PF AOPs formed a connected network, indicating that each of the individual AOPs shared at least one KE with one or more of the other AOPs. However, as the duplicated KEs were merged, the similarities between the AOPs became more evident. This is evidenced by the increasing number of shared KEs in the graph after merging (the gray nodes in Figure 4D) as compared to the original graph (Figure 4C). Furthermore, the merging revealed Aop:206 to be fully contained within the other AOPs.
The refinement of the AOP network through KE grouping simplifies the network while also enhancing the robustness of the KE relationships, depicted by the connections between the nodes. This process, in fact, removes redundant nodes, which supports the application of AOP networks in AOP research and risk assessment. Furthermore, as duplicated events are removed, the true influence of each node can be assessed more robustly through network analytics.
This example demonstrates the effect of KE redundancy and the potential of data‐driven grouping of the KEs. While manual assessment and grouping would be achievable for a limited number of AOPs at a time, doing it AOP‐Wiki wide would be a massive undertaking. Here, we show how our curated gene‐KE‐AOP connections can help guide the grouping and hence enhance network‐based approaches in AOP research. Furthermore, our results suggest that it is often possible to identify genes that can successfully represent multiple similar key events.
2.4. The AOP Fingerprint of Multi‐Walled Carbon Nanotubes Converges In Vitro and In Vivo
Toxicogenomics has supported the development of mechanistic toxicology and further enhanced the possibility to obtain relevant information from in vitro studies, which could reduce the need for animal experimentation.[ 58 , 59 , 60 ] Here, we tested the hypothesis that toxicogenomic data generated in two independent in vitro and in vivo exposure models would converge on a robust set of relevant AOPs. We focused on Mitsui‐7, a known hazardous long and rigid multi‐walled carbon nanotube (MWCNT). The airways provide the most prominent route of exposure to this nanomaterial, and it is best characterized for its lung‐related AOs, including PF.[ 61 , 62 , 63 , 64 ] Hence, we selected data derived from a lung exposure to the MWCNT in mice,[ 65 ] and an in vitro dataset with exposures on four cell lines representative of the human airways.[ 59 , 66 ] These cell lines include differentiated THP‐1 cells as a model of macrophages, A549 representing alveolar basal epithelial cells, BEAS‐2B as bronchial epithelial cells, and MRC9 as a model of lung fibroblasts. Differentially expressed genes (DEGs) from all experimental conditions in vivo and each cell line in vitro were obtained from Saarimäki et al.[ 67 ] and merged into a single MOA in vivo and in vitro, respectively.
We then performed enrichment analysis against both the AOPs and the KEs separately in order to evaluate the coverage of distinct KEs. We used the proportion of significantly enriched KEs to further filter the significant AOPs. This led us to identify 33 significant AOPs from the in vivo data, while 12 resulted significant from the in vitro exposure. These results were defined as the specific AOP fingerprint for the exposures, and it is presented in Figure 5 .
Despite the distinct sizes of the AOP fingerprints, ten of the 12 AOPs enriched in vitro were also included in the in vivo fingerprint. Moreover, the top enriched AOPs were shared and ranked similarly between in vivo and in vitro when ranked by the smallest adjusted p‐value. The AOP enriched with the most significant p‐value in both instances was Aop:173 titled “Substance interaction with the lung resident cell membrane components leading to lung fibrosis” (Figure 5). The in vivo data set was able to capture seven of the eight KEs of the AOP as significantly enriched, while three out of the eight KEs were enriched in vitro. Interestingly, Aop:173 has been specifically developed with robust evidence from MWCNT exposures, and multiple types of carbon nanotubes are listed as known stressors for the AOP (https://aopwiki.org/aops/173). The second AOP (Aop:171), on the other hand, describes the induction of pleural/peritoneal mesotheliomas by chronic cytotoxicity in rats. Like most AOPs used in this study, Aop:171 is still under development and lacks information on potential stressors. However, mesothelioma is a well‐known AO of asbestos exposure, a fibrous silicate mineral whose adverse effects have often been used as a warning example for MWCNTs.[ 68 ] Indeed, similarities in their MOA have been extensively investigated.[ 61 , 69 , 70 ]
The in vitro AOP fingerprint captures effects such as frustrated phagocytosis, oxidative stress, cytotoxicity, and immune activation, which have all been reported as consequences of this type of exposure and contribute to the pathogenic nature of Mitsui‐7.[ 61 , 62 , 64 , 71 ] Similarly, the profibrotic effects are highlighted with the multiple PF AOPs enriched. These effects are also observed in the in vivo AOP fingerprint. However, the in vivo fingerprint further highlights various AOPs outside the respiratory system, which is less apparent in vitro. While AOs beyond the immediate exposure site are feasible, many of these could likely be accounted for by the different effects of similar transcriptomic signatures in different biological systems (e.g., multiple AOPs related to gastric ulcer formation could reflect similar mechanisms of surfactant disturbance in two distinct exposure sites). On the other hand, the AOPs unique to the in vitro fingerprint, Aop:277 and Aop:263 (Figure 5), reflect the specific effects of the Mitsui‐7 exposure on the immune system. Such specific signals can be easily masked in the in vivo system, where a large array of cell types is affected by the exposure.
It is worth noting that the exposures selected for the analysis had diverse set ups and a notable difference in the size of the combined MOA (863 DEGs in vitro versus 3540 in vivo). While data from multiple cell lines were selected to capture effects besides immune cell activation in vitro, we were not able to match the dose and time course set up present in the in vivo dataset. However, we wanted to include this long‐term exposure to evaluate whether it would result in broader coverage over the KEs of AOPs. Furthermore, histopathological evaluation from the same in vivo exposure set up has shown fibrosis in the lung from the day 7 onward,[ 72 ] suggesting that a whole PF AOP could be covered with this data. Indeed, all but the MIE (Event:1495) of Aop:173 were enriched in vivo. The high proportion of enriched KEs in the in vivo data supports the modeling of KEs with relevant gene sets and the use of toxicogenomic evidence for the development of AOPs, as well as the evaluation of potential AOs of chemical exposures. Likewise, we show that the analysis of toxicogenomic data against robustly annotated AOP framework supports a high degree of in vitro to in vivo extrapolation and further supports the inclusion of toxicogenomics‐based evidence for regulatory purposes. The concept of the AOP fingerprint can be easily adapted to evaluate other chemical exposures and AOs. With nearly 16 000 genes mapped to the KEs in our curation, they are expected to cover most of the human genome. Hence, the AOP fingerprint provides a robust framework for meaningful interpretations also for chemicals and phenotypes that may be less characterized.
2.5. KE‐Associated Gene Sets Guide the Selection of Biomarkers
We showed that our KE‐linked gene sets provide a robust way of evaluating potential outcomes of chemical exposures from transcriptomics data. This observation alone can help to guide chemical testing and grouping. However, to support the development of target assays and integrated approaches, specific reporter genes and markers need to be identified.
Selection of transcriptional biomarkers and reporter genes only based on differential expression from experimental data gives little context or reference to the AO of interest. Even if a certain exposure is known to induce a specific endpoint, there is no indication whether the measured deregulation could be associated with the phenotype of interest. On the other hand, prioritizing the features in the context of the KEs or whole AOPs could shed light on the importance and specificity of the feature regarding the phenotype. This, in turn, can guide the selection of potential biomarkers even in the absence of experimental data. Hence, we implemented a universal and customizable framework for the prioritization of the KE‐associated genes to drive the identification of AOP‐informed biomarkers and used it to identify AOP‐driven biomarkers for PF. The shortlisted marker genes were then screened by RT‐qPCR in an in vitro model of human macrophages exposed to bleomycin, a well‐known profibrotic chemical.[ 73 ]
First, we defined characteristics for optimal biomarkers based on the Bradford Hill criteria, originally defined to evaluate causality in epidemiological research,[ 74 ] but later adopted to other research fields as well.[ 75 ] Our newly defined characteristics, their Bradford Hill counterparts, and short descriptions of the consideration of each step in the selection process are summarized in Table 1 .
Table 1.
Bradford Hill | Our characteristic | Method/Assessment |
---|---|---|
Consistency (reproducibility) | Reproducibility | Selection considers evidence from previous profibrotic exposures |
Strength (effect size) | Amplitude | Significant alteration of the expression as compared to control |
Experiment | Measurable | Transcriptional biomarkers measurable by qPCR; selected genes need to be expressed in the model |
Biological gradient (dose‐response relationship) | Dose‐responsive | Benchmark dose modeling to evaluate dose‐response |
Coherence | In vitro to in vivo extrapolation | Experimental evidence from in vitro and in vivoa) |
Analogy | Predictive (of the outcome of interest) | Selection based on the KEs preceding the AO of interest |
Specificity | Specificity | Gene ranking based on the specificity score |
Plausibility | (Biological) plausibility | The AOP framework provides a plausible context; supporting evidence; selection of the organism |
Temporality | Temporality | Transcriptional alteration follows the exposure; selection of the model organismb) |
– | GLP‐method | RT‐qPCR |
– | Influence | Centrality measures from human protein‐protein interaction and gene regulatory networks |
The biomarkers selected here are targeted for the development of non‐animal assays for toxicological assessment. Hence the coherence to in vivo set ups is not evaluated experimentally. However, in vivo data was used for the selection of the markers to provide context of the systemic response
Temporality in the Bradford Hill criteria refers to a clear distinction of the exposure happening prior to the outcome. Here, we considered temporality by observing transcriptional changes post exposure as well as in the selection of the model organism. Macrophages have a crucial role in the initiation of the profibrotic response preceding the outcome, fibrosis.
The prioritization and selection of the candidate biomarkers considered three main aspects: 1) the social life of genes, that is, some genes (gene products) are more influential than others; 2) specificity regarding the endpoint of interest; and 3) experimental evidence suggesting the genes respond to a relevant exposure. The ranking of the genes was based on the first two considerations, while the experimental evidence was included to guide the selection of candidate genes for RT‐qPCR validation from the ranked list. This enabled a flexible selection process that would be applicable even in the absence of experimental data. At this stage, we also considered the biological feasibility of the target genes given the selected macrophage model as well as a broad coverage over the PF KEs.
As a result, we obtained a list of 25 candidates out of the original 2075 genes related to PF (Table 2 ). Although we focused on the genes in the top 10%, we further included genes ranking lower to obtain a broader coverage over the PF KEs. Genes that are specific to individual KEs might rank low when the individual lists are combined. Hence, we considered the expression patterns from the experimental data as well as the specificity scores and ranks in the individual KEs. This also allowed us to evaluate whether genes ranked higher would perform better than others.
Table 2.
Gene (rank) | Time point | Detected (Amplification) | Deregulated (ANOVA) | Dose‐dependent |
---|---|---|---|---|
SMAD7 (1) | 6 h | |||
24 h | ||||
72 h | ||||
CXCL2 (3) | 6 h | |||
24 h | ||||
72 h | ||||
SPP1 (18) | 6 h | |||
24 h | ||||
72 h | ||||
CCL2 (19) | 6 h | |||
24 h | ||||
72 h | ||||
TGFB1 (23) | 6 h | |||
24 h | ||||
72 h | ||||
IL8 (33) | 6 h | |||
24 h | ||||
72 h | ||||
LOX (48) | 6 h | |||
24 h | ||||
72 h | ||||
PLOD2 (74) | 6 h | |||
24 h | ||||
72 h | ||||
MMP7 (80) | 6 h | |||
24 h | ||||
72 h | ||||
CXCL10 (91) | 6 h | |||
24 h | ||||
72 h | ||||
CCL7 (93) | 6 h | |||
24 h | ||||
72 h | ||||
MMP9 (105) | 6 h | |||
24 h | ||||
72 h | ||||
LTBP4 (112) | 6 h | |||
24 h | ||||
72 h | ||||
FN1 (116) | 6 h | |||
24 h | ||||
72 h | ||||
GDF15 (153) | 6 h | |||
24 h | ||||
72 h | ||||
MMP19 (179) | 6 h | |||
24 h | ||||
72 h | ||||
PTX3 (220) | 6 h | |||
24 h | ||||
72 h | ||||
TGFB3 (297) | 6 h | |||
24 h | ||||
72 h | ||||
LTBP3 (335) | 6 h | |||
24 h | ||||
72 h | ||||
TWIST1 (609) | 6 h | |||
24 h | ||||
72 h | ||||
TGFBI (727) | 6 h | |||
24 h | ||||
72 h | ||||
CTSK (759) | 6 h | |||
24 h | ||||
72 h | ||||
RCN3 (1592) | 6 h | |||
24 h | ||||
72 h | ||||
RSAD2 (1596) | 6 h | |||
24 h | ||||
72 h | ||||
PLK3 (2027) | 6 h | |||
24 h | ||||
72 h |
We could detect the expression of 22 of the candidate genes at one or more of the evaluated time points, and six of the detected genes showed significant alteration as compared to the unexposed control samples (Table 2). Finally, five of these genes were altered in a dose‐dependent manner: CXCL2 and CCL7 at 24 h, IL8 (CXCL8) at 24 and 72 h, and MMP19 and TGFBI at 72 h. All but TGFBI of these genes were among the top 10% in the global PF rank (Table 2). Although we were not able to fit a dose‐dependent model on the highest ranked gene, SMAD7, a suggestive trend could be appreciated in its expression pattern (Figure S4, Supporting Information, panel 18/6H). The expression of each of these genes was upregulated as compared to the controls (Figure S4, Supporting Information).
The central role of TGF‐beta signaling is well‐established in PF,[ 76 ] but neither of the TGFB genes tested (TGFB1 and TGFB3) showed significant change in expression in our setup. Indeed, TGF‐beta is activated through a complex cascade of events, where the inactive form of the protein is activated by other effectors post‐translationally,[ 77 ] making members of the TGF‐beta family a more attractive target for protein‐based biomarker assessment over gene expression. At the same time, we did observe upregulation of SMAD7 and TGFBI which are both activated by TGF‐beta,[ 78 , 79 ] suggesting the induction of TGF‐beta signaling. The protein encoded by TGFBI is involved in the extracellular matrix (ECM), and it has been shown to bind type I collagen, resulting in thicker fibers and further affecting macrophage polarization toward the M2 type.[ 80 ] Indeed, bleomycin has been suggested to polarize macrophages toward M2 (often referred to as the anti‐inflammatory type), which have been shown to drive the development of PF through their ability to promote myofibroblast differentiation.[ 81 , 82 ] Many of our suggested biomarkers are chemokines that mediate immune responses. IL8 and CXCL2 are best characterized as neutrophil attractants, while CCL7 targets a wide variety of leukocytes.[ 83 , 84 , 85 ] Indeed, prolonged inflammation, combined with persistent M2 macrophage activation, supports pathogenesis of fibrosis,[ 86 ] and a mixed status of M1/M2 macrophage activation has been previously associated with carbon nanotube‐induced PF in vivo.[ 87 ] Similarly, multi‐walled carbon nanotubes have been shown to induce the polarization of macrophages toward such mixed status of M1/M2 polarization.[ 88 , 89 ] MMP19 is a member of the matrix metalloproteinase family involved in ECM remodeling.[ 90 ] MMPs have been extensively characterized in the context of PF,[ 91 , 92 ] and MMP19 specifically has been suggested as a key regulator of PF in mice and humans.[ 93 ]
Although macrophages alone cannot capture all the KEs of PF, our model is able to highlight the key steps of macrophage involvement in PF. The temporality of the expression of our suggested biomarkers is supportive of the events leading to the development of fibrosis, where the initial inflammation is followed by type M2 macrophage activation that together contribute to the development of a profibrotic microenvironment and responses in other cells in the tissue.[ 86 ]
NAMs are urgently needed to reduce animal testing while providing robust evidence to support chemical safety assessment. Although alternative methods have been successfully developed to capture acute effects, modeling long‐term outcomes of the exposures, such as fibrosis, in vitro is still a challenge. Here, we propose a panel of five genes CXCL2, CCL7, IL8, MMP19, and TGFBI as AOP‐derived robust biomarkers of PF to be successfully measured in a model of human macrophages in vitro after short exposure time.
3. Conclusion
Mechanistic toxicology encompasses the means for faster, cheaper, and more ethical chemical safety assessment. However, to unleash the full potential of mechanistic evidence also in the regulatory framework, robust approaches to build confidence toward toxicogenomics are urgently needed. Here, we presented an integrated approach that links toxicogenomics with the concept of AOPs and proved its applicability to chemical grouping and development of data‐driven NAMs. We introduced the AOP fingerprint, a concept for evaluating potential systemic outcomes of chemical exposures through unbiased interpretation of toxicogenomics data. Our analysis points to a consistent AOP fingerprint of MWCNTs extrapolated from both in vitro and in vivo experiments. Finally, we identified and experimentally validated a panel of robust AOP‐derived in vitro biomarkers for PF.
Our results suggest that combining the regulatory‐supported AOP framework with toxicogenomics through a rigorous mapping of the MOA of chemicals into KEs/AOPs can facilitate the inclusion of omics derived evidence in regulatory evaluations. The outcome of our analysis in the form of the AOP fingerprint provides a clear and easily understandable way to summarize complex omics data while providing robust statistical evaluation that can support regulatory decisions. Moreover, the possibility to use the framework suggested in this manuscript as the foundation for developing data‐driven molecular assays further opens new possibilities for faster regulatory acceptance of novel alternative methods and NAMs.
4. Experimental Section
Definition of Knowledge Graph‐Based Data Structure
A knowledge graph‐based data structure was established by expanding the previously introduced framework, the Unified Knowledge Space (UKS) by the authors.[ 94 , 95 ] A detailed description and a full list of integrated data sources are provided in the Supporting Information. The so formed data structure was utilized throughout the study as described in the following sections.
Annotation of Key Events
A multi‐step strategy comprising NLP and manual curation was applied to annotate KEs to established gene sets through pathways, phenotypes, and gene ontologies. The annotation strategy is summarized in Figure S2, Supporting Information.
Computational prioritization of KE annotations: To match the descriptions of key events and gene sets an NLP pipeline (Figure S2, Supporting Information) was developed. The pipeline performed several operations to extract the informative terms from both the descriptions of a key event and a gene set that were scored to assess the degree of matching between the two entities. In detail, first, the raw text was converted to lower case and all punctuation symbols were removed. Second, concepts that span multiple words in the text description were replaced by a single word expressing the same concept to strengthen the matching quality (e.g., the concept “positive regulation” was replaced with the single word “upregulated”). Third, the text was split into tokens which were further processed one by one. Fourth, each token corresponding to a stop word in the English language was dropped. Stop words refer to the most common words in a language that do not bring additional meaning (e.g., for the English language common stop words include “in”, “the”, “of”, “from”). Fifth, different declinations of the same concept were mapped to their root term (e.g., plurals were converted to singulars, the terms “increased” and “increasing” were both mapped to “increase”). This same procedure was also used to standardize several styles to write the same symbol (e.g., “pparα” and “pparalpha” map both to “ppar‐alpha”). After these preprocessing steps, each gene set and key event was represented by a set of token words, for example, {upregulate, ppar‐alpha}. However, the frequency of each token word across the descriptions of genes and key events was not the same, and hence, the informative value of rare terms was higher than the informative value of more common tokens. This was taken into account by weighting each token by its inverse document frequency (IDF), that is, the weight was inversely proportional to the number of gene sets and key events that contain that token. Finally, a weighted version of the Jaccard Index (JIW) was employed to match gene sets and the key events, using the IDF as weights (i.e., each token that was shared between a gene and a key event did not account for 1 as in the standard JI, but it contributed its IDF weight to the matching score) and the matching gene sets for each key event was sorted in descending order.
Manual curation and refinement of annotations: Next, the results of the computational prioritization were manually evaluated for correct context and accuracy. Manual curation was used for gap filling and refinement of the annotations. In detail, the top five matches retained from the NLP‐based approach were evaluated, and inaccurate or spurious matches were discarded. In case no matches from the computational prioritization were deemed suitable, a manual search related to the name of the KE was performed on relevant databases (WikiPathways,[ 96 ] HPO,[ 97 ] KEGG,[ 98 ] Reactome,[ 99 ] and GO [ 100 ]). For molecular level KEs, where the alteration of an individual gene was described, either the main functions of the gene were selected, or the gene was directly annotated to the ensemble identifier of the said gene. More generic annotations (i.e., annotation of a KE describing the alteration of a gene to a functional term tightly related to that gene) were prioritized to increase the size of the relevant gene sets.
The matches for KEs expressing the increase/activation or decrease/repression of a biological progress were further organized based on the hierarchy of the terms by prioritizing the most generic but suitable term followed by increasing specificity when multiple annotations of various specificities were available. For instance, Ke:1457 called “induction, epithelial mesenchymal transition” was annotated to the following terms: 1) epithelial to mesenchymal transition (GO:0 001837); 2) regulation of epithelial to mesenchymal transition (GO:00 10717); and 3) positive regulation of epithelial to mesenchymal transition (GO:00 10718). The curated KE‐gene set links were added to the UKS so that for each key event entity its top five matches were added, while the matching level was stored as an edge attribute. This allowed to either combine multiple mappings for a key event or to filter for specific mapping levels. Since the KE‐gene set mappings were always the same for the same KE, these relationships were added to the Key Event entities and not to the Specific Key Event entities, which reduced complexity of the knowledge graph as well as reduced needed storage space. The information, however, could still be retrieved from the UKS via its connecting paths.
Gene set retrieval: The genes corresponding to the matched terms were retrieved by matching the term names to their exact identifiers and querying the UKS for human genes associated with the terms. For phenotypes (HPO and KEGG disease), only genes with a link in the original database were included by filtering by the source for the connection. In cases where no human genes were linked to the annotated GO term, the mouse and rat genes associated and converted them to human orthologs using Ensembl,[ 101 ] which were then used as the corresponding gene sets. When no genes of human, mouse, or rat were associated with the original term, the annotation match was discarded and considered unsuccessful. Once gene sets to all original terms were defined, the gene sets were merged to obtain the final set of genes corresponding to each KE in this study.
Enrichment Analysis of Reference Chemical‐Associated Gene Sets
To evaluate the ability of this framework to highlight relevant adverse outcomes from chemical associated gene signatures, lists of reference chemicals were retrieved from the ECVAM reference chemical library[ 102 ] and the NICEATM website (https://ntp.niehs.nih.gov/whatwestudy/niceatm/resources‐for‐test‐method‐developers/refchem/index.html). From the resources provided by ECVAM, a hepatotoxic chemical list that had clear distinctions between positive and negative compounds was selected. This list was based on the work from EPA's Virtual Liver project (https://cfpub.epa.gov/si/si_public_record_report.cfm?dirEntryId =166616&Lab=NCCT), and was provided as a downloadable Excel‐file (./CHELIST/CheLIST__EPA_VLIVER.xlsx) by ECVAM. From NICEATM, the list of chemicals with characterized thyroid activity (specified as “ACTIVE” in the listing produced based on a previous publication by Wegner et al.) was selected.[ 103 ] AR and ER agonists were selected from the lists of in vitro reference chemicals provided on the website. These listings had been previously published in Kleinstreuer et al.[ 104 ] and Browne et al.,[ 105 ] respectively. Finally, carcinogenic compounds were identified from the list containing chemicals that were either known carcinogens or reasonably anticipated to be human carcinogens (RAHC) based on the 14th report on Carcinogens (RoC classifications) provided by NICEATM. The chemicals from each of the reference lists were then matched to the list of chemicals obtained from the CTD[ 29 ] through name‐based matching or by the provided CAS identifiers, resulting in the final lists of reference chemicals for each endpoint used in this study. Chemical‐gene links originating from the CTD were retrieved from the UKS and only chemicals with 50–1000 associated genes were included in the enrichment analysis. This filter was applied to minimize the false discovery rate and spurious matches in the enrichment analysis.
Enrichment analysis was performed using the Fisher's exact test as implemented in the function enrich from R package bc3net[ 106 ] for each chemical associated gene set against the list of AOP‐related genes (i.e., the union of all the genes associated to all the KEs of the AOP). Enrichment p‐values were adjusted using the false discovery rate (FDR) correction. AOP was considered significantly enriched with FDR‐corrected p‐value < 0.01.
KE Clustering and Construction of the Pulmonary Fibrosis Network
Similarities between the gene sets associated to each KE were evaluated by calculating the JI between all pairs of KEs (size of the intersection divided by the size of the union of the gene sets). The resulting similarity matrix was then transformed into a distance matrix and used to group the KEs using hierarchical clustering as implemented in the function hclust in R package stats, specifying the agglomeration method as “complete.” The number of clusters was defined so that only KEs with the same gene sets associated to them (JI = 1) were assigned to the same group. The grouping obtained in this manner was used to perform the enrichment against KEs to avoid multiple testing against the same gene set as well as to enhance the network presentation of the PF AOP network. The unweighted PF AOP network was generated using gephi[ 107 ] by importing a graphml file generated with the function graph_from_edgelist from R package igraph.[ 108 ] KE groups from the clustering were added as attributes to the nodes and used to merge redundant nodes in gephi. Similarly, AOPs each KE is associated to were added as attributes and used to color the nodes.
Characterization of the AOP Fingerprints
Transcriptomics data: In vivo and in vitro transcriptomics data from MWCNT (Mitsui‐7) exposures were selected from a previously published collection by Saarimäki et al.[ 67 ] The original data sets are available under GEO accession number GSE29042 (in vivo) and ArrayExpress entry EMTAB6396 (in vitro), while the preprocessed data is available on Zenodo (https://doi.org/10.5281/zenodo.6425445). The in vivo data set comprised multiple doses and time points, while the in vitro data contained a single dose and time point exposure on four distinct cell lines representative different cell types of the lung. In each case, DEGs (filtered by an absolute fold change > 1.5 and Benjamini and Hochberg adjusted p‐value < 0.05) obtained from Zenodo (https://doi.org/10.5281/zenodo.6425445) for each distinct comparison (i.e., combination of each dose and time point versus control in vivo and separate cell lines in vitro) were pooled together to obtain a distinct MOA of the exposure in vivo and in vitro, respectively.
AOP fingerprints: To produce the AOP fingerprint for the MWCNT exposures, enrichment analysis was performed using the Fisher's exact test as implemented in the function enrich from R package bc3net[ 106 ] separately against the AOP‐associated gene lists and the KE‐associated gene lists (KEs linked to the same set of genes were grouped to avoid multiple tests against the same set). An AOP was considered significantly enriched when the AOP itself and at least 33% (or minimum of 2 KEs when the length of the AOP was less than six) of its KEs were enriched with an FDR‐corrected p‐value < 0.05.
Selection and Testing of AOP‐Driven Biomarkers
Gene prioritization: All human protein‐protein interaction (PPI) edges were extracted from the UKS and used to create a robust gene–gene network. PPI/gene (product) interaction information could vary across data sources as well as the covered genes may differ. In addition, there was an innate bias in the data, where more data sources were available for “more investigated” genes and gene products. Because of this, it was decided not to apply a global threshold on how many sources need to support an edge,[ 109 ] but instead, a local threshold was applied. This ensured that also less investigated genes and gene products will be retained in the final robust gene–gene network, but their edges were less penalized by the number of supporting edges, than highly covered gene (product) nodes. For each node, the mean number of sources supporting all its connecting edges was estimated and only edges with at least the “mean number of sources” for a node were retained, which needed to be true for at least one of the nodes making up an edge. This was only done for GENE nodes, which were flagged as protein coding in Ensembl. The final robust human gene–gene network, consisted of 20 260 nodes, 806 250 edges, a network density of 0.0039. Due to the significant lower number of available sources for transcription factor—gene (product) data, all available sources were kept and scored equally. These edges were used to create a directed gene–gene network, consisting of 18 754 nodes, 363 649 edges and a network density of 0.001. On the so created gene–gene networks, for each node its degree, betweenness, eigenvector, and closeness centrality were estimated with NetworkX.[ 110 ] These measures were then used to rank the genes linked to the KEs in the context of individual KEs. The gene list for each KE was ranked based on each of the centrality measures (degree, betweenness, closeness, and eigenvector centrality) individually from most central to the least. The ranked lists were combined using the Borda method as implemented in the function Borda in R package TopKLists.[ 111 ]
Biomarker selection: The gene centrality‐based ranking was then supplemented by a specificity ranking for the KEs of AOPs related to PF. A specificity score for the genes in the context of the KEs by dividing the occurrence of the gene in the KEs of PF AOPs by their occurrence in the KEs of other AOPs were calculated. A similar score was calculated at the level of AOPs (occurrence in PF AOPs/occurrence in other AOPs), as universal PF biomarkers were the target of the identification (i.e., prioritizing those that would be present in as many of the six PF AOPs as possible) while also being as specific as possible to PF. These ranks were again combined by the function Borda from R package TopKLists,[ 111 ] and a final round of the Borda method was applied to combine the lists of genes from each KE into one PF rank. The final rank was complemented with experimental evidence. It was assessed whether the genes were differentially expressed in the Mitsui‐7 exposures in vivo and in vitro. It was also evaluated whether they were dose‐dependently altered in the in vivo data as well as in an additional in vitro data set on Mitsui‐7 exposure of a THP‐1 macrophage model (originally published in Saarimäki et al.[ 58 ] and the preprocessed data available as GSE146708 in the previously published collection[ 67 ] available in https://doi.org/10.5281/zenodo.6425445). The dose‐response modeling of the in vivo (GSE29042) and in vitro (GSE146708) datasets was performed by following the strategy implemented in the BMDx tool.[ 112 ] Particularly, for each gene present in the dataset, multiple models were fitted including linear, second order polynomial, hill, power, and exponential model. For each gene, the optimal model was selected as the one with the lowest Akaike Information Criterion (AIC). Genes with an optimal model with lack‐of‐fit p‐values lower than 0.1 were removed from the analysis. The effective doses (BMD, BMDL, and BMDU) were estimated under the assumption of constant variance and by using a BMRF factor of 1.349 (corresponding to a minimum of 10% of difference with respect to the controls). Genes were further filtered based on the predicted doses. Genes with BMD or BMDU values extrapolated higher than the highest exposure dose were filtered. Moreover, genes whose ratio between the predicted doses was higher than the suggested values (BMD/BMDL> 20, BMDU/BMD> 20, and BMDU/BMDL> 40) were removed from the analysis. Genes passing the filters were considered to be dose‐dependently altered. At this stage, the measurability and feasibility of the gene in the selected macrophage model was also considered. For instance, numerous collagen‐encoding genes were ranked high, but would not be a meaningful target in a macrophage model. Moreover, a high coverage of PF KEs and the selection of genes with high specificity scores were emphasized. With these considerations, a subset of the genes with the following priority was selected: 1) genes that were deregulated both in vivo and in vitro, with most emphasis on dose‐dependency; 2) genes that were deregulated in vitro, with most emphasis on dose‐dependency; and 3) genes that were not significantly differentially expressed but were dose‐dependent. Finally, after this initial selection driven by the rank and experimental evidence, additional candidate biomarkers that had a lower rank but were specific to KEs that would otherwise not have been covered by the selected candidates were included.
Cell culture: THP‐1 cells (DSMZ no.: ACC 16) were grown in RPMI 1640 (Gibco, #21 875) + 10% inactivated FBS (Gibco, #10 270). Cells were cultivated in 75 cm2 culture flasks at 37 °C with a humidified atmosphere of 5% CO2. For all experiments, cells were seeded at a density of 1 × 105 cells mL−1 in 96 well plates and differentiated for 48 h with 25 nM PMA (phorbol‐12‐myristate‐13‐acetate, Sigma‐Aldrich, #P1585). Cells were then left to rest for 24 h in fresh media containing no PMA prior to bleomycin exposures.
Cell viability assay: THP‐1 cells were exposed to 0–10 µg mL−1 of bleomycin ready‐made solution (Sigma‐Aldrich, #B7216) and 0–100 mg mL−1 of Triclosan (Sigma‐Aldrich, #72 779), for 6, 24 and 72 h. A WST‐1 assay was then used to measure cell viability. Briefly, 10 µL of cell proliferation reagent WST‐1 (Roche, #11 644 807 001) was added to each well. Cells were left to incubate with WST‐1 for 3 h in a 37 °C, 5% CO2 incubator. Absorbance at 450 nm was then measured with a Spark microplate reader (Tecan). Results of the cell viability assay are available in File S2 and Figure S3, Supporting Information.
RT‐qPCR: For each time point of 6, 24 and 72 h, THP‐1 cells were exposed to 0, 2.5, 5, 10 and 100 µg mL−1 of bleomycin ready‐made solution (Sigma‐Aldrich, #B7216). Media was removed and cells were washed briefly with 50 µL of PBS. 100 µL of lysis buffer from the QIAGEN RNeasy mini kit (Qiagen, #74 104) was added to each well to lyse the cells. Three wells (300 µL) were pooled to create one sample, and there were five samples for each concentration at each time point. Total RNA was then extracted from these samples using the QIAGEN RNeasy mini kit (Qiagen, #74 104). DNase treatment was performed using TURBO DNA‐free Kit (ThermoFisher, #AM1907) according to the manufacturer's protocol. cDNA was synthesized from 100 ng of RNA, using the high‐capacity cDNA reverse transcription kit (Thermo Fisher Scientific, #4 368 813), according to manufacturer's instructions. Expression levels of target genes were determined by qRT‐PCR using CFX96 Touch Real‐Time PCR Detection System (BioRad) with 10 µL of iQ Multiplex Powermix (Bio‐Rad, #1 725 849), 5 µL of cDNA diluted fivefold, 2.5 µL of nuclease‐free (NF) water (not DEPC‐Treated, ThermoFisher, #AM9930) in a 20 µL reaction, together with 2.5 µL of single (1 µL assay + 1.5 µL NF water) or multiplexed (0.5 µL of each assay) PrimePCR Probe Assays (Bio‐Rad) as followed with single or multiplex reactions grouped in parentheses and formatted as Gene/UniqueAssayID: (ACTB/qHsaCEP0036280), (SMAD7/qHsaCEP0050142, MMP9/qHsaCIP0028098, GDF15/qHsaCEP0051579, CTSK/qHsaCIP0030907, PLOD2/qHsaCEP0052848), (CXCL2/qHsaCEP0058163, LTBP4/qHsaCEP0024931, TGFB3/qHsaCEP0058244, RCN3/qHsaCEP0057804, MMP7/qHsaCEP0052037), (SPP1/qHsaCEP0058179, FN1/qHsaCEP0050873, LTBP3/qHsaCEP0053782, RSAD2/qHsaCIP0031596, CCL7/qHsaCEP0058033), (IL8/qHsaCEP0053894, MMP19/qHsaCEP0051244, TWIST1/qHsaCEP0051221, PLK3/qHsaCIP0027687, CXCL10/qHsaCEP0053880), (LOX/qHsaCEP0050731, PTX3/qHsaCEP0033071, TGFBI/qHsaCEP0058394, CCL2/qHsaCIP0028103, TGFB1/qHsaCIP0030973).
Fold change (FC) values from RT‐qPCR data were calculated using the comparative CT(2−(ddCt)) method.[ 113 ] The FC values were log2 transformed (log2(FC)). For each gene and for each concentration, an outlier detection was performed by removing all the samples with log2(FC) values above or below the 75th and 25th percentiles of the distribution. C t values, dC t values, FC values and log2(FC) values are available in File S2, Supporting Information, along with ANOVA tables and tukey HSD posthoc test results. The log2FC expression of the genes as compared to the untreated controls are plotted in Figure S4, Supporting Information.
Dose‐dependent modeling: A dose‐response analysis of the log2(FC) values derived from the PCR experiments was performed. For each gene, multiple models were fitted, including linear, hill, power, polynomial, exponential, log‐logistic, Weibull, and Michaelis–Mentel models. The optimal model was selected as the one with the lowest AIC. The BMD estimation was performed under the assumption of constant variance. The BMR was identified by means of the standard deviation approach with a BMRF of 1.349. Only genes with lack‐of‐fit p‐value >0.10 and with estimated BMD, BMDL and BMDU values were considered relevant.
Conflict of Interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
This work received funding from the EU Horizon 2020 project NanoSolveIT (grant agreement no. 814572), the Academy of Finland (grant agreement no. 322761), and the European Research Council (ERC) programme, Consolidator project ARCHIMEDES (grant agreement no. 101043848). L.A.S. was supported by the Emil Aaltonen Foundation and A.S. by the Tampere Institute for Advanced Study.
Saarimäki L. A., Morikka J., Pavel A., Korpilähde S., del Giudice G., Federico A., Fratello M., Serra A., Greco D., Toxicogenomics Data for Chemical Safety Assessment and Development of New Approach Methodologies: An Adverse Outcome Pathway‐Based Approach. Adv. Sci. 2023, 10, 2203984. 10.1002/advs.202203984
Data Availability Statement
The data that support the findings of this study are available in the supplementary material of this article.
References
- 1. Waters M. D., Fostel J. M., Nat. Rev. Genet. 2004, 5, 936. [DOI] [PubMed] [Google Scholar]
- 2. Liu Z., Huang R., Roberts R., Tong W., Trends Pharmacol. Sci. 2019, 40, 92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Marx‐Stoelting P., Braeuning A., Buhrke T., Lampen A., Niemann L., Oelgeschlaeger M., Rieke S., Schmidt F., Heise T., Pfeil R., Solecki R., Arch. Toxicol. 2015, 89, 2177. [DOI] [PubMed] [Google Scholar]
- 4. Paananen J., Fortino V., Brief Bioinf. 2020, 21, 1937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Mech A., Gottardo S., Amenta V., Amodio A., Belz S., Bøwadt S., Drbohlavová J., Farcal L., Jantunen P., Małyska A., Rasmussen K., Riego Sintes J., Rauscher H., Regul. Toxicol. Pharmacol. 2022, 128, 105093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Richard A. M., Huang R., Waidyanatha S., Shinn P., Collins B. J., Thillainadarajah I., Grulke C. M., Williams A. J., Lougee R. R., Judson R. S., Houck K. A., Shobair M., Yang C., Rathman J. F., Yasgar A., Fitzpatrick S. C., Simeonov A., Thomas R. S., Crofton K. M., Paules R. S., Bucher J. R., Austin C. P., Kavlock R. J., Tice R. R., Chem. Res. Toxicol. 2021, 34, 189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Krewski D., Acosta D., Andersen M., Anderson H., Bailar J. C., Boekelheide K., Brent R., Charnley G., Cheung V. G., Green S., Kelsey K. T., Kerkvliet N. I., Li A. A., Mccray L., Meyer O., Patterson R. D., Pennie W., Scala R. A., Solomon G. M., Stephens M., Yager J., Zeise L., Staff Of Committee On Toxicity Test , J. Toxicol. Environ. Health, Part B 2010, 13, 51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. OECD , Guidance Document for the Use of Adverse Outcome Pathways in Developing Integrated Approaches to Testing and Assessment (IATA), OECD Series on Testing and Assessment , Vol. 260, OECD Publishing, Paris: 2017. [Google Scholar]
- 9. Ankley G. T., Bennett R. S., Erickson R. J., Hoff D. J., Hornung M. W., Johnson R. D., Mount D. R., Nichols J. W., Russom C. L., Schmieder P. K., Serrrano J. A., Tietge J. E., Villeneuve D. L., Environ. Toxicol. Chem. 2010, 29, 730. [DOI] [PubMed] [Google Scholar]
- 10. Clerbaux L.‐A., Amigó N., Amorim M. J., Bal‐Price A., Batista Leite S., Beronius A., Bezemer G. F. G., Bostroem A.‐C., Carusi A., Coecke S., Concha R., Daskalopoulos E. P., De Bernardi F., Edrosa E., Edwards S. W., Filipovska J., Garcia‐Reyero N., Gavins F. N. E., Halappanavar S., Hargreaves A. J., Hogberg H. T., Huynh M. T., Jacobson D., Josephs‐Spaulding J., Kim Y. J., Kong H. J., Krebs C. E., Lam A., Landesmann B., Layton A., et al., ALTEX 2022, 39, 322.35032963 [Google Scholar]
- 11. Nymark P., Sachana M., Leite S. B., Sund J., Krebs C. E., Sullivan K., Edwards S., Viviani L., Willett C., Landesmann B., Wittwehr C., Front. Public Health 2021, 9, 638605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Tollefsen K. E., Scholz S., Cronin M. T., Edwards S. W., De Knecht J., Crofton K., Garcia‐Reyero N., Hartung T., Worth A., Patlewicz G., Regul. Toxicol. Pharmacol. 2014, 70, 629. [DOI] [PubMed] [Google Scholar]
- 13. Halappanavar S., Van Den Brule S., Nymark P., Gaté L., Seidel C., Valentino S., Zhernovkov V., HøGh Danielsen P., De Vizcaya A., Wolff H., Stöger T., Boyadziev A., Poulsen S. S. S., Sørli J. B., Vogel U., Part. Fibre Toxicol. 2020, 17, 16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Fortino V., Kinaret P. A. S., Fratello M., Serra A., Saarimã¤Ki L. A., Gallud A., Gupta G., Vales G., Correia M., Rasool O., Ytterberg J., Monopoli M., Skoog T., Ritchie P., Moya S., VáZquez‐Campos S., Handy R., Grafström R., Tran L., Zubarev R., Lahesmaa R., Dawson K., Loeschner K., Larsen E. H., Krombach F., Norppa H., Kere J., Savolainen K., Alenius H., Fadeel B., et al., Nat. Commun. 2022, 13, 3798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Montaner J., Ramiro L., Simats A., Tiedt S., Makris K., Jickling G. C., Debette S., Sanchez J.‐C., Bustamante A., Nat. Rev. Neurol. 2020, 16, 247. [DOI] [PubMed] [Google Scholar]
- 16. Federico A., Serra A., Ha M.y K., Kohonen P., Choi J.‐S., Liampa I., Nymark P., Sanabria N., Cattelani L., Fratello M., Kinaret P. A. S., Jagiello K., Puzyn T., Melagraki G., Gulumian M., Afantitis A., Sarimveis H., Yoon T.‐H., Grafström R., Greco D., Nanomaterials (Basel) 2020, 10, 903.32397130 [Google Scholar]
- 17. Kinaret P. A. S., Serra A., Federico A., Kohonen P., Nymark P., Liampa I., Ha M.y K., Choi J.‐S., Jagiello K., Sanabria N., Melagraki G., Cattelani L., Fratello M., Sarimveis H., Afantitis A., Yoon T.‐H., Gulumian M., Grafström R., Puzyn T., Greco D., Nanomaterials 2020, 10, 750.32326418 [Google Scholar]
- 18. Serra A., Fratello M., Cattelani L., Liampa I., Melagraki G., Kohonen P., Nymark P., Federico A., Kinaret P. A. S., Jagiello K., Ha M.y K., Choi J.‐S., Sanabria N., Gulumian M., Puzyn T., Yoon T.‐H., Sarimveis H., Grafström R., Afantitis A., Greco D., Nanomaterials 2020, 10, 708.32276469 [Google Scholar]
- 19. Brockmeier E. K., Hodges G., Hutchinson T. H., Butler E., Hecker M., Tollefsen K. E., Garcia‐Reyero N., Kille P., Becker D., Chipman K., Colbourne J., Collette T. W., Cossins A., Cronin M., Graystock P., Gutsell S., Knapen D., Katsiadaki I., Lange A., Marshall S., Owen S. F., Perkins E. J., Plaistow S., Schroeder A., Taylor D., Viant M., Ankley G., Falciani F., Toxicol. Sci. 2017, 158, 252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Perkins E. J., Woolard E. A., Garcia‐Reyero N. L., Front. Toxicol. 2022, 4, 786057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Labib S., Williams A., Yauk C. L., Nikota J. K., Wallin H., Vogel U., Halappanavar S., Part. Fibre Toxicol. 2016, 13, 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Jin Y., Qi G., Shou Y., Li D., Liu Y., Guan H., Zhang Q., Chen S., Luo J., Xu L., Li C., Ma W., Chen N., Zheng Y., Yu D., J. Hazard. Mater. 2022, 425, 128041. [DOI] [PubMed] [Google Scholar]
- 23. Guan R., Li N., Wang W., Liu W., Li X., Zhao C., Ecotoxicol. Environ. Saf. 2022, 234, 113387. [DOI] [PubMed] [Google Scholar]
- 24. Saarimäki L. A., Melagraki G., Afantitis A., Lynch I., Greco D., Nat. Nanotechnol. 2022, 17, 17. [DOI] [PubMed] [Google Scholar]
- 25. Martens M., Verbruggen T., Nymark P., Grafström R., Burgoon L. D., Aladjov H., Andón F. T., Evelo C. T., Willighagen E. L., Front. Genet. 2018, 9, 661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Martens M., Evelo C. T., Willighagen E. L., Appl. In Vitro Toxicol. 2022, 8, 2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Nymark P., Rieswijk L., Ehrhart F., Jeliazkova N., Tsiliki G., Sarimveis H., Evelo C. T., Hongisto V., Kohonen P., Willighagen E., Grafström R. C., Toxicol. Sci. 2018, 162, 264. [DOI] [PubMed] [Google Scholar]
- 28. Jagiello K., Halappanavar S., Rybińska‐Fryca A., Willliams A., Vogel U., Puzyn T., Small 2021, 17, 2003465. [DOI] [PubMed] [Google Scholar]
- 29. Davis A. P., Grondin C. J., Johnson R. J., Sciaky D., Wiegers J., Wiegers T. C., Mattingly C. J., Nucleic Acids Res. 2021, 49, D1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Tong M., Neusner A., Longato L., Lawton M., Wands J. R., de la Monte S. M., J. Alzheimers Dis. 2009, 17, 827. [PMC free article] [PubMed] [Google Scholar]
- 31. Ceballos D. M., Fellows K. M., Evans A. E., Janulewicz P. A., Lee E. G., Whittaker S. G., Front. Public Health 2021, 9, 638082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Cave M., Falkner K. C., McClain C., in Zakim and Boyer's Hepatology: A Textbook of Liver Disease, Elsevier, Amsterdam: 2012, pp. 476–492. [Google Scholar]
- 33. Chow W.‐H., Dong L. M., Devesa S. S., Nat. Rev. Urol. 2010, 7, 245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Krenkel O., Mossanen J. C., Tacke F., Hepatobiliary Surg. Nutr. 2014, 3, 331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Jia M., Wang Y., Wang D., Teng M., Yan J., Yan S., Meng Z., Li R., Zhou Z., Zhu W., Chemosphere 2019, 226, 520. [DOI] [PubMed] [Google Scholar]
- 36. Lee W.‐S., Kim J., Mol. Cell. Toxicol. 2019, 15, 9. [Google Scholar]
- 37. Warner G. R., Mourikes V. E., Neff A. M., Brehm E., Flaws J. A., Mol. Cell. Endocrinol. 2020, 502, 110680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Brander S. M., He G., Smalling K. L., Denison M. S., Cherr G. N., Environ. Toxicol. Chem. 2012, 31, 2848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Moore P. D., Yedjou C. G., Tchounwou P. B., Environ. Toxicol. 2010, 25, 221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Xiong J., Tian L., Qiu Y., Sun D., Zhang H., Wu M., Wang J., Drug Chem. Toxicol. 2018, 41, 501. [DOI] [PubMed] [Google Scholar]
- 41. Lal B., Sarang M. K., Kumar P., Gen. Comp. Endocrinol. 2013, 181, 139. [DOI] [PubMed] [Google Scholar]
- 42. Grasselli F., Bussolati S., Ramoni R., Grolli S., Basini G., Anim. Reprod. 2018, 15, 3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Tu W., Xu C., Jin Y., Lu B., Lin C., Wu Y., Liu W., Aquat. Toxicol. 2016, 175, 39. [DOI] [PubMed] [Google Scholar]
- 44. Kim S.‐S., Lee R.‐D.a, Lim K.‐J.o, Kwack S.‐J., Rhee G.‐S., Seok J.i‐H., Lee G.‐S., An B.‐S., Jeung E.‐B., Park K.‐L., J. Reprod. Dev. 2005, 51, 201. [DOI] [PubMed] [Google Scholar]
- 45. Yang J. S., Qi W., Farias‐Pereira R., Choi S., Clark J. M., Kim D., Park Y., Food Chem. Toxicol. 2019, 125, 595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Ravichandran J., Karthikeyan B. S., Samal A., Sci. Total Environ. 2022, 826, 154112. [DOI] [PubMed] [Google Scholar]
- 47. Spînu N., Cronin M. T. D., Lao J., Bal‐Price A., Campia I., Enoch S. J., Madden J. C., Mora Lagares L., Novič M., Pamies D., Scholz S., Villeneuve D. L., Worth A. P., Comput. Toxicol. 2022, 21, 100206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zhang T., Wang S., Li L., Zhu A.n, Wang Q.i, Sci. Total Environ. 2022, 824, 153932. [DOI] [PubMed] [Google Scholar]
- 49. Arnesdotter E., Spinu N., Firman J., Ebbrell D., Cronin M. T. D., Vanhaecke T., Vinken M., Toxicology 2021, 459, 152856. [DOI] [PubMed] [Google Scholar]
- 50. Knapen D., Angrish M. M., Fortin M. C., Katsiadaki I., Leonard M., Margiotta‐Casaluci L., Munn S., O'brien J. M., Pollesch N., Smith L. C., Zhang X., Villeneuve D. L., Environ. Toxicol. Chem. 2018, 37, 1723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Villeneuve D. L., Angrish M. M., Fortin M. C., Katsiadaki I., Leonard M., Margiotta‐Casaluci L., Munn S., O'brien J. M., Pollesch N. L., Smith L. C., Zhang X., Knapen D., Environ. Toxicol. Chem. 2018, 37, 1734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Thannickal V. J., Toews G. B., White E. S., Lynch Iii J. P., Martinez F. J., Annu. Rev. Med. 2004, 55, 395. [DOI] [PubMed] [Google Scholar]
- 53. Bonner J. C., Toxicol. Pathol. 2007, 35, 148. [DOI] [PubMed] [Google Scholar]
- 54. Wynn T., J. Pathol. 2008, 214, 199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Mcgroder C. F., Zhang D., Choudhury M. A., Salvatore M. M., D'souza B. M., Hoffman E. A., Wei Y., Baldwin M. R., Garcia C. K., Thorax 2021, 76, 1242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Kinaret P. A. S., Del Giudice G., Greco D., Nano Today 2020, 35, 100945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Hama Amin B. J., Kakamad F. H., Ahmed G. S., Ahmed S. F., Abdulla B. A., Mohammed S. H., Mikael T. M., Salih R. Q., Ali R. K., Salh A. M., Hussein D. A., Ann. Med. Surg. 2022, 77, 103590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Saarimäki L. A., Kinaret P. A. S., Scala G., Del Giudice G., Federico A., Serra A., Greco D., NanoImpact 2020, 20, 100274. [Google Scholar]
- 59. Scala G., Kinaret P., Marwah V., Sund J., Fortino V., Greco D., NanoImpact 2018, 11, 99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Kinaret P., Marwah V., Fortino V., Ilves M., Wolff H., Ruokolainen L., Auvinen P., Savolainen K., Alenius H., Greco D., ACS Nano 2017, 11, 3786. [DOI] [PubMed] [Google Scholar]
- 61. Rydman E. M., Ilves M., Vanhala E., Vippola M., Lehto M., Kinaret P. A. S., Pylkkänen L., Happo M., Hirvonen M.‐R., Greco D., Savolainen K., Wolff H., Alenius H., Toxicol. Sci. 2015, 147, 140. [DOI] [PubMed] [Google Scholar]
- 62. Kinaret P., Ilves M., Fortino V., Rydman E., Karisola P., LäHde A., Koivisto J., Jokiniemi J., Wolff H., Savolainen K., Greco D., Alenius H., ACS Nano 2017, 11, 291. [DOI] [PubMed] [Google Scholar]
- 63. Poulsen S. S., Jacobsen N. R., Labib S., Wu D., Husain M., Williams A., BøGelund J. P., Andersen O., KøBler C., Mølhave K., Kyjovska Z. O., Saber A. T., Wallin H., Yauk C. L., Vogel U., Halappanavar S., PLoS One 2013, 8, e80452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Rahman L., Jacobsen N. R., Aziz S. A., Wu D., Williams A., Yauk C. L., White P., Wallin H., Vogel U., Halappanavar S., Mutat. Res., Genet. Toxicol. Environ. Mutagen. 2017, 823, 28. [DOI] [PubMed] [Google Scholar]
- 65. Guo N. L., Wan Y.‐W., Denvir J., Porter D. W., Pacurari M., Wolfarth M. G., Castranova V., Qian Y., J. Toxicol. Environ. Health, Part A 2012, 75, 1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Scala G., Marwah V., Kinaret P., Sund J., Fortino V., Greco D., Data Brief 2018, 19, 1046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Saarimäki L. A., Federico A., Lynch I., Papadiamantis A. G., Tsoumanis A., Melagraki G., Afantitis A., Serra A., Greco D., Sci. Data 2021, 8, 49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Barbarino M., Giordano A., Cancers (Basel) 2021, 13, 1318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Rittinghausen S., Hackbarth A., Creutzenberg O., Ernst H., Heinrich U., Leonhardt A., Schaudien D., Part. Fibre Toxicol. 2014, 11, 59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Gupta S. S., Singh K. P., Gupta S., Dusinska M., Rahman Q., Nanomaterials (Basel) 2022, 12, 1708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Boyles M. S. P., Young L., Brown D. M., Maccalman L., Cowie H., Moisala A., Smail F., Smith P. J. W., Proudfoot L., Windle A. H., Stone V., Toxicol. In Vitro 2015, 29, 1513. [DOI] [PubMed] [Google Scholar]
- 72. Porter D. W., Hubbs A. F., Mercer R. R., Wu N., Wolfarth M. G., Sriram K., Leonard S., Battelli L., Schwegler‐Berry D., Friend S., Toxicology 2010, 269, 136. [DOI] [PubMed] [Google Scholar]
- 73. Izbicki G., Segel M. J., Christensen T. G., Conner M. W., Breuer R., Int. J. Exp. Pathol. 2002, 83, 111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Hill A. B., Proc. R. Soc. Med. 1965, 58, 295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Fedak K. M., Bernal A., Capshaw Z. A., Gross S., Emerg. Themes Epidemiol. 2015, 12, 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Yue X., Shan B., Lasky J. A., Curr. Enzyme Inhib. 2010, 6, 67. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Annes J. P., Munger J. S., Rifkin D. B., J. Cell Sci. 2003, 116, 217. [DOI] [PubMed] [Google Scholar]
- 78. Nummela P., Lammi J., Soikkeli J., Saksela O., Laakkonen P., Hölttä E., Am. J. Pathol. 2012, 180, 1663. [DOI] [PubMed] [Google Scholar]
- 79. Yan X., Liu Z., Chen Y., Acta Biochim. Biophys. Sin. 2009, 41, 263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Bachy S., Wu Z., Gamradt P., Thierry K., Milani P., Chlasta J., Hennino A., iScience 2022, 25, 103758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Hou J., Shi J., Chen L., Lv Z., Chen X., Cao H., Xiang Z., Han X., Cell Commun. Signaling 2018, 16, 89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Wang H., Gao Y., Wang L., Yu Y., Zhang J., Liu C., Song Y., Xu H., Wang J., Lou H., Dong T., J. Adv. Res. 2022, 10.1016/j.jare.2022.04.012. [DOI] [Google Scholar]
- 83. Hammond M. E., Lapointe G. R., Feucht P. H., Hilt S., Gallegos C. A., Gordon C. A., Giedlin M. A., Mullenbach G., Tekamp‐Olson P., J. Immunol. 1995, 155, 1428. [PubMed] [Google Scholar]
- 84. De Filippo K., Dudeck A., Hasenberg M., Nye E., Van Rooijen N., Hartmann K., Gunzer M., Roers A., Hogg N., Blood 2013, 121, 4930. [DOI] [PubMed] [Google Scholar]
- 85. Cheng J. W., Sadeghi Z., Levine A. D., Penn M. S., Von Recum H. A., Caplan A. I., Hijaz A., Cytokine 2014, 69, 277. [DOI] [PubMed] [Google Scholar]
- 86. Braga T. T., Agudelo J. S. H., Camara N. O. S., Front. Immunol. 2015, 6, 602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Dong J., Ma Q., Nanotoxicology 2018, 12, 153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Kinaret P. A. S., Scala G., Federico A., Sund J., Greco D., Small 2020, 16, 1907609. [DOI] [PubMed] [Google Scholar]
- 89. Meng J., Li X., Wang C., Guo H., Liu J., Xu H., ACS Appl. Mater. Interfaces 2015, 7, 3180. [DOI] [PubMed] [Google Scholar]
- 90. Loffek S., Schilling O., Franzke C.‐W., Eur. Respir. J. 2011, 38, 191. [DOI] [PubMed] [Google Scholar]
- 91. Pardo A., Cabrera S., Maldonado M., Selman M., Respir. Res. 2016, 17, 23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Craig V. J., Zhang L., Hagood J. S., Owen C. A., Am. J. Respir. Cell Mol. Biol. 2015, 53, 585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Yu G., Kovkarova‐Naumovski E., Jara P., Parwani A., Kass D., Ruiz V., Lopez‐OtãN C., Rosas I. O., Gibson K. F., Cabrera S., RamãRez R., Yousem S. A., Richards T. J., Chensny L. J., Selman M., Kaminski N., Pardo A., Am. J. Respir. Crit. Care Med. 2012, 186, 752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Federico A., Fratello M., Scala G., Möbus L., Pavel A., Del Giudice G., Ceccarelli M., Costa V., Ciccodicola A., Fortino V., Serra A., Greco D., Cancers (Basel) 2022, 14, 2043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Pavel A., Del Giudice G., Federico A., Di Lieto A., Kinaret P. A. S., Serra A., Greco D., Brief Bioinf. 2021, 22, 1430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Martens M., Ammar A., Riutta A., Waagmeester A., Slenter D. N., Hanspers K., Â Miller R. A., Digles D., Lopes E. N., Ehrhart F., Dupuis L. J., Winckers L. A., Coort S. L., Willighagen E. L., Evelo C. T., Pico A. R., Kutmon M., Nucleic Acids Res. 2021, 49, D613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97. Köhler S., Gargano M., Matentzoglu N., Carmody L. C., Lewis‐Smith D., Vasilevsky N. A., Danis D., Balagura G., Baynam G., Brower A. M., Callahan T. J., Chute C. G., Est J. L., Galer P. D., Ganesan S., Griese M., Haimel M., Pazmandi J., Hanauer M., Harris N. L., Hartnett M. J., Hastreiter M., Hauck F., He Y., Jeske T., Kearney H., Kindle G., Klein C., Knoflach K., Krause R., et al., Nucleic Acids Res. 2021, 49, D1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98. Kanehisa M., Nucleic Acids Res. 2000, 28, 27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Gillespie M., Jassal B., Stephan R., Milacic M., Rothfels K., Senff‐Ribeiro A., Griss J., Sevilla C., Matthews L., Gong C., Deng C., Varusai T., Ragueneau E., Haider Y., May B., Shamovsky V., Weiser J., Brunson T., Sanati N., Beckman L., Shao X., Fabregat A., Sidiropoulos K., Murillo J., Viteri G., Cook J., Shorser S., Bader G., Demir E., Sander C., et al., Nucleic Acids Res. 2022, 50, D687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100. Ashburner M., Ball C. A., Blake J. A., Botstein D., Butler H., Cherry J. M., Davis A. P., Dolinski K., Dwight S. S., Eppig J. T., Harris M. A., Hill D. P., Issel‐Tarver L., Kasarskis A., Lewis S., Matese J. C., Richardson J. E., Ringwald M., Rubin G. M., Sherlock G., Nat. Genet. 2000, 25, 25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101. Cunningham F., Allen J. E., Allen J., Alvarez‐Jarreta J., Amode M.â R., Armean I. M., Austine‐Orimoloye O., Azov A. G., Barnes I., Bennett R., Berry A., Bhai J., Bignell A., Billis K., Boddu S., Brooks L., Charkhchi M., Cummins C., Daâ Rinâ Fioretto L., Davidson C., Dodiya K., Donaldson S., Elâ Houdaigui B., Elâ Naboulsi T., Fatima R., Giron C. G., Genez T., Martinez J. G., Guijarro‐Clarke C., Gymer A., et al., Nucleic Acids Res. 2022, 50, D988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102. Sund J., Deceuninck P., EURL ECVAM Library of Reference Chemicals. European Commission, Joint Research Centre (JRC), Brussels, Belgium 2021, http://data.europa.eu/89h/92614229‐d020‐4d96‐941c‐c9604e525c9e
- 103. Wegner S., Browne P., Dix D., Reprod. Toxicol. 2016, 65, 402. [DOI] [PubMed] [Google Scholar]
- 104. Kleinstreuer N. C., Ceger P., Watt E. D., Martin M., Houck K., Browne P., Thomas R. S., Casey W. M., Dix D. J., Allen D., Sakamuru S., Xia M., Huang R., Judson R., Chem. Res. Toxicol. 2017, 30, 946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105. Browne P., Judson R. S., Casey W. M., Kleinstreuer N. C., Thomas R. S., Environ. Sci. Technol. 2015, 49, 8804. [DOI] [PubMed] [Google Scholar]
- 106. De Matos Simoes R., Emmert‐Streib F., PLoS One 2012, 7, e33624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107. Bastian M., Heymann S., Jacomy M., Proc. Int. AAAI Conf. on Web Soc. Media 2009, 3, 361. [Google Scholar]
- 108. Csárdi G., Nepusz T., InterJournal, Complex Syst. 2006, 1695, 1. [Google Scholar]
- 109. Pavel A., Serra A., Cattelani L., Federico A., Greco D., Methods Mol. Biol. 2022, 2401, 161. [DOI] [PubMed] [Google Scholar]
- 110. Hagberg A. A., Schult D. A., Swart P. J., in Proc. of the 7th Python in Science Conf, (Eds: Varoquaux G., Vaught T., Millman J.), SciPy 2008, Pasadena, CA: 2008, pp. 11–15. [Google Scholar]
- 111. Schimek M. G., Budinskã¡ E., Kugler K. G., Å Vendovã¡ V., Ding J., Lin S., Stat. Appl. Genet. Mol. Biol. 2015, 14, 311. [DOI] [PubMed] [Google Scholar]
- 112. Serra A., Saarimã¤Ki L. A., Fratello M., Marwah V. S., Greco D., Bioinformatics 2020, 36, 2932. [DOI] [PubMed] [Google Scholar]
- 113. Livak K. J., Schmittgen T. D., Methods 2001, 25, 402. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are available in the supplementary material of this article.