Summary
The (developmental) neurotoxicity hazard is still unknown for most chemicals. Establishing a test battery covering most of the relevant adverse outcome pathways may close this gap without requiring a huge animal experimentation program. Ideally, each of the assays would cover multiple mechanisms of toxicity. One candidate test is the human LUHMES cell-based NeuriTox test. To evaluate its readiness for larger-scale testing, a proof of concept library, assembled by the U.S. National Toxicology Program (NTP), was screened. Out of the 75 unique compounds, seven were defined as specifically neurotoxic after the hit-confirmation phase and ten further compounds were generally cytotoxic within the concentration range of up to 20 μM. As complementary approach, the library was screened in the PeriTox test, which identifies toxicants affecting the human peripheral nervous system. Of the eight PeriTox hits, five were similar to the NeuriTox hits: rotenone, colchicine, diethylstilbestrol, berberine chloride, and valinomycin. The unique NeuriTox hit, methyl-phenylpyridinium (MPP+), is known from in vivo studies to affect only dopaminergic neurons (which LUHMES cells are). Conversely, the known peripheral neurotoxicant acrylamide was picked up in the PeriTox, but not in the NeuriTox assay. All of the five common hits had also been identified in the published neural crest migration (cMINC) assay, while none of them emerged as a cardiotoxicant in a previous screen using the same library. These comparative data suggest that complementary in vitro tests can pick up a broad range of toxicants, and that multiple test results might help to predict organ specificity patterns.
Keywords: neurite outgrowth inhibition, cytotoxicity, neurotoxicity, high content imaging, developmental toxicity
1. Introduction
The transition from method development to actual tests for screening and prioritization is an important advance for in vitro toxicology (Crofton et al., 2011; Daneshian et al., 2016; Leist et al., 2008a,b). While this has been achieved in areas like genotoxicity or topical toxicity to skin and eyes (Basketter et al., 2012; Ezendam et al., 2016; Kirkland et al., 2011; Leist et al., 2014; Prinsen et al., 2017), the fields of organ toxicity and developmental toxicity still represent a major challenge (Marx et al., 2016). Especially for neurotoxicity (NT) and developmental neurotoxicity (DNT), multiple tests have been developed, but their comparison using larger compound libraries still lags (Fritsche et al., 2017; Aschner et al., 2017; Smirnova et al., 2014). Developmental neurotoxicity results from gestational or peripartum disturbances of neural cells that eventually lead to an altered connectivity of the neuronal system. For instance, toxicants may inhibit proliferation, differentiation or migration of neural cells (Bal-Price et al., 2015; Aschner et al., 2017). The toxicological manifestation of disturbed key developmental processes is a structural or functional defect of the nervous system.
Current regulatory procedure hardly evaluates DNT for industrial chemicals, while about 100 pesticides have been tested according to OECD TG 426 (OECD, 2007), which requires repeated dosing during pregnancy and lactation (Smirnova et al., 2014; van Thriel et al., 2012; Schmidt et al., 2017; Makris et al., 2009). This test is highly costly (ca. $1–2 million per compound), and its sensitivity has been questioned (Fritsche et al., 2017).
At present, more than 100 compounds (including several drugs) have been found to trigger DNT in animals, while there is strong epidemiological evidence for such effects in humans for only about a dozen compounds (Aschner et al., 2017; Mundy et al., 2015; Grandjean and Landrigan, 2006). The majority of industrial chemicals, and even of drugs, have never been evaluated for DNT (Grandjean and Landrigan, 2006, 2014; Bennett et al., 2016; Crofton et al., 2012). Thus, there is an enormous need for high quality and high throughput in vitro testing (Bal-Price et al., 2015; Crofton et al., 2014; Smirnova et al., 2014).
During the last decade, in vitro tests have been developed to fill the testing gap (Fritsche et al., 2017; Schmidt et al., 2017). They assess, for instance, the proliferation and differentiation of neuronal precursor cells (Baumann et al., 2016; Culbreth et al., 2012; Fritsche et al., 2005; Hogberg et al., 2010, 2009; Schmuck et al., 2017; Shinde et al., 2017; Balmer et al., 2012), the loss of certain neuronal (sub)populations (Baumann et al., 2016; Culbreth et al., 2012; Pamies et al., 2016; Schmuck et al., 2017; Zimmer et al., 2011b,a), or the impairment of migration (Nyffeler et al., 2017b; Schmuck et al., 2017; Zimmer et al., 2012), neurite outgrowth (Harrill et al., 2011; Krug et al., 2013) or neuronal network formation (Brown et al., 2017; Pamies et al., 2016; Schmuck et al., 2017). Since a single in vitro assay cannot cover the complexity of in vivo development, plans for test strategies are built on compiling data from a battery of assays that cover all relevant processes (Fritsche et al., 2017; Zimmer et al., 2014).
A cell source of high quality and quantity is a crucial factor for robust tests. One such option is provided by LUHMES cells. They are of human origin and represent extensively characterized neuronal progenitors that can be differentiated within six days into dopaminergic neurons (Smirnova et al., 2016; Krug et al., 2014; Scholz et al., 2011; Schildknecht et al., 2013). In the process of establishing and validating an in vitro method, its robustness and suitability for high throughput testing has to be assessed (Leist et al., 2012, 2010). The U.S. National Toxicology Program (NTP) compiled a collection of 80 compounds (herein called NTP80 collection) made up of 75 unique chemicals and internal controls. The focus of this library is on known or suspected DNT/NT compounds as well as compounds of significant interest to the NTP (e.g., flame retardants, PAHs) that have not been tested for DNT/NT activity. This library has been made available for all interested test developers with the vision to generate a comparable data matrix across many DNT and neurotoxicity assays. Initial data on the interference with iPSC differentiation, neurite outgrowth, neural crest cell migration, and cardiotoxicity have been published (Pei et al., 2016; Nyffeler et al., 2017a; Ryan et al., 2016; Sirenko et al., 2017).
In our study, we used the NTP80 collection to evaluate the throughput and quality of the LUHMES cell-based developmental neurotoxicity assay (NeuriTox). The results from the screen were validated in a hit confirmation phase. As follow-up, the library was also screened for peripheral nervous system toxicity (PeriTox assay; Hoelting et al., 2016), and the data were put into context of published screens and of data available from the Tox21 program.
2. Material and methods
Screen library handling
The compound library was received as a 96-well “master plate” filled with compounds, i.e., a collection of drug/drug-like compounds, PAH, pesticides, flame retardants, and others (Fig. 1), (Nyffeler et al., 2017a; Ryan et al., 2016; Sirenko et al., 2017). In order to reduce freeze-thaw cycles, save compounds, and test them always after the same number of freeze-thaw cycles, sets of five compounds were transferred from the “master plate” to each “dilution plate” and diluted in DMSO. Subsequently, the dilutions were aliquoted into a “treatment plate” that was equipped with DMSO-solvent controls and narciclasine positive control (50 nM final concentration on cells, Sigma, CAS 2947783–6), sealed, and stored at −80°C until use. This procedure ensured that cells were always treated with 0.1% DMSO (Fig. S31).
Cell culture and differentiation
For the NeuriTox test (= UKN4), LUHMES (Lund human mesencephalic) cells were characterized and cultured as described in detail earlier (Krug et al., 2013; Lotharius et al., 2005; Scholz et al., 2011). Briefly, cells were maintained in proliferation medium (PM: AdvDMEM/F12 supplemented with 2 mM glutamine, 1x N2 supplement and 40 ng/ml fibroblast growth factor-2) in PLO/fibronectin coated flasks (50 μg/ml poly-L-ornithine (PLO) and 1 μg/ml fibronectin). For differentiation, cells were seeded at a density of 46,000 cells/cm2 in PM. After 24 h, they were switched to differentiation medium (DM) containing AdvDMEM/F12 supplemented with 2 mM glutamine, 1x N2 supplement, 2.25 μM tetracycline, 1 mM dibutyryl 3’,5’-cyclic adenosine monophosphate (cAMP), and 2 ng/ml recombinant human glial cell derived neurotrophic factor (GDNF).
The PeriTox test (= UKN5) is based on cells differentiated from the H9 human embryonic stem cell (hESC) line (WA09 line), which was obtained from WiCell (Madison, WI, USA). The import of cells and the experiments were authorized under license no. 170–79-1–4-27 (Robert Koch Institute, Berlin, Germany). The stem cells were cultured according to standard protocols (Thomson et al., 1998) and differentiated into immature dorsal root ganglia neurons, as described previously: after eight days of differentiation with noggin and SB-431542 (dual SMAD inhibition), dorsomorphin (BMP4 signaling inhibitor), DAPT (γ-secretase inhibitor), CHIR (Wnt antagonist) and SU (VEGF, FGF and PDGF signaling inhibitor), the neuronal precursors were cryopreserved for later use in the peripheral neurotoxicity test (PeriTox), which is described in detail in Hoelting et al. (2016).
Neurotoxicity assays based on neurite outgrowth dynamics
For the NeuriTox test, LUHMES cells were differentiated for 48 h and seeded into 96 well plates at a density of 100,000 cells/cm2 in a volume of 90 μl DM without cAMP and GDNF. Treatment was initiated applying 10 μl of a 10x concentrated treatment solution one hour after seeding. At 24 h after treatment, staining mix (SM) was applied (final concentrations: 1 μg/ml H-33342, 1 μM calcein-AM).
For the PeriTox test method, the cells were thawed and seeded at a density of 100,000 cells/cm2 in 75 μl PeriTox differentiation medium (PDM) consisting of 25% KSR-S and 75% N2-S media supplemented with 1.5 μM CHIR99021, 1.5 μM SU5402, and 5 μM DAPT on matrigel-coated plates (KSR-S: knockout DMEM with 15% serum replacement, 1 x Glutamax, 1 x nonessential amino acids and 50 mM beta-mercaptoethanol; N2-S: DMEM/F12, with 2 mM Glutamax, 0.1 mg/ml apotransferrin, 1.55 mg/ml glucose, 25 μg/ml insulin, 100 mM putrescine, 30 nM selenium, and 20 nM progesterone). After one hour, 25 μl PDM with 4x concentrated serial dilutions of the test compounds was added to the cells. At 23 h after treatment, the cells were stained with SM and incubated for one additional hour at 37°C.
Image acquisition was performed with an ArrayScan VTI HCS (high content imaging) microscope (Cellomics, Waltham, MA, USA).
Neurite outgrowth image analysis
The procedure was applied as detailed earlier (Hoelting et al., 2016; Stiegler et al., 2011). After automated imaging, an algorithm was applied that identified the neuronal somata (based on identified nuclei) and subtracted the somatic area from the total neuronal area to obtain the neurite area (NA), i.e., the number of pixels per field covered by neurites. Viability analysis was performed on the same pictures using combined information from H-33342 and calcein channels. Cells with normal-sized nuclei that were calcein-positive were counted as live cells, whereas H-33342 single-positive cells were counted as dead cells. Viability (V) was expressed as percentage of live cells relative to control.
Data analysis: curve fitting and deriving BMC and EC values
Neurite outgrowth assays were performed at least three times (biological replicates), each run evaluating three technical replicates, i.e., different wells with similar treatment. Neurite area (NA) and viability (V) were always expressed as percentage of the DMSO control. In a first step, matched technical replicates were averaged. Subsequently, these data were averaged across the different experiments. Curve fitting was performed employing a 4-parameter log-logistic function with least squares fit. The upper asymptote of the fit was forced to 100%, the lower asymptote was variable. The variation of DMSO controls was calculated from pooled values of DMSO controls over several experiments.
For calculation of the benchmark concentration values (BMC), a benchmark response (BMR) of three standard deviations of DMSO solvent controls of all assay plates (= “3 x noise level”) was used. EC50 concentrations were calculated as the concentration at which the parameter measured (neurite area or viability) declined to 50% of the DMSO-control level. To identify specific effects on neurite outgrowth, the EC50 ratio of EC50(viability)/EC50(neurite area) was calculated. The NeuriTox test system has a specificity threshold ratio of 4, the PeriTox test system has one of 3 (Hoelting et al., 2016; Krug et al., 2013).
In some cases (e.g., non-toxic compounds), no EC50 value could be calculated for viability (V). If V was not affected at the highest tested concentration (HTC), then 4 x HTC was used as surrogate EC50(V) (marked with ♦; for EC25 calculation, the HTC was doubled in this case) (Krug et al., 2013). If the V was affected significantly, but by < 50%, the highest tested concentration was taken as surrogate EC50(V) (marked with °). If no EC50 for neurite area could be calculated, but neurite outgrowth was inhibited significantly, the highest tested concentration was taken as surrogate EC50(NA).
Analysis of the chemical space
The physicochemical characteristics of the ToxCast + Tox21 (called Tox21), DrugBank, and NTP80 collection chemicals were analyzed as described earlier (Nyffeler et al., 2017a). The structures of the compounds were obtained from the SMILES provided in the original sources, converted to SDFile format (RDKit version 0.9.22) and protonated to pH 7.4 using Moka version 1.1 (Milletti et al., 2007). The molecular weight (MW) and octanol-water distribution coefficient (logP) were obtained using RDKit.
The structures were normalized using standardizer3 and converted to 3D using Corina version 3.494 (Sadowski et al., 1994). These were then used to generate GRIND2 descriptors (Pastor et al., 2000; Duran et al., 2009) using Pentacle software version 1.0.64, with default settings. The resulting molecular descriptors were then projected into the principal component analysis (PCA) scores obtained for a collection of ca. 9000 Tox21 and DrugBank compounds (5; Wishart et al., 2008) following a similar procedure (supplementary Excel file6).
Of the original 75 NTP80 collection compounds, the following four had to be removed from physicochemical analyses because they are salts or contain metallic elements not supported by our methods: (i) methylmercury (II) chloride (MeHgCl), (ii) acetic acid manganese (2+) salt, (iii) bis(tributyltin)oxide and (iv) methylcyclopentadienyl manganese tricarbonyl. For the remaining 71, logP and MW values were obtained. In the process of computing the GRIND2 molecular descriptors, two more compounds had to be removed: saccharin sodium salt hydrate and benzo(b)fluoranthene. Thus, the final series projected in the ToxCast and Tox21 space contained 69 compounds.
Statistical analysis and data mining
The Tox21 database (retrieved via NTP Sandbox7) was mined for all compounds that were active in the NeuriTox test and for which a BMC value could be calculated.
Statistics were performed using GraphPad Prism 5. Neurite outgrowth data were tested for significance by one-way ANOVA followed by a Dunnett’s post-hoc test at the significance level of p < 5%. The summaries displayed are based on independent experiments (different cell lots) unless specified otherwise and are termed “biological replicates”.
3. Results and discussion
3.1. Characterization of the chemical properties of the screened library
The library (NTP80 collection) used for screening consisted of 75 different compounds of which five were present as independent duplicates. The latter were intended as internal consistency-controls, so that there were “80 compounds” to be tested. The test items were classified into groups according to their main use or chemical structure: drug/drug-like, polycyclic aromatic hydrocarbons (PAH), pesticides and plasticizers made up 72.5% of all compounds. The remaining compounds were environmental and industrial chemicals, as wells as heavy metals (Fig. 1A and Fig. S11). For basic characterization of the library structure, the distribution of physicochemical properties was visualized. A plot of hydrophobicity (logP value) vs molecular weight showed that the library compounds spread out over a sizable part of the plot space defined by other compounds of toxicological concern (here exemplified by ca. 10,000 compounds from the ToxCast/Tox21 and DrugBank databases5 (Fig. 1B) (Wishart et al., 2008). However, the NTP80 collection covered only parts of the relevant chemical space, and it was over-represented in the “lower-right part” of the logP-MW plot, where the PAHs clustered.
For broader characterization of the chemical space occupied by the NTP80 collection, a large set of GRIND2 molecular descriptors was calculated for each compound. These descriptors sum up multiple chemical properties and can be considered as a comprehensive approach to characterize a compound. The same descriptors were derived for the compounds of the Tox21 and DrugBank libraries, and the latter data were displayed as a principal component analysis (PCA) scores plot. The NTP80 compounds were then projected into the same PCA scores space (Fig. 1C). Altogether, they were spread out over a sizable area of this PCA scores map. However, it was also clear that larger screens are required to cover the chemical space more comprehensively.
3.2. Assay features and performance
In order to run the NeuriTox test in high-throughput mode, proliferating LUHMES cells were switched to medium favoring neuronal differentiation (Fig. 2A), and treated with the compounds. Each potential toxicant was tested at 10 concentrations that were logarithmically spaced. To monitor assay quality, several solvent control (0.1% DMSO) and positive control (50 nM narciclasine) wells were included on each test plate (Fig. S31). After 24 h of exposure, cells were live-stained to assess neurite outgrowth and viability at the same time. LUHMES treated with solvent control (DMSO) grew neurites, which were longer than the diameter of their somata, and the total area covered by neurites within each imaging field was quantified. Cells treated with 50 nM narciclasine generated a neurite area that was significantly reduced (Fig. 2B). As part of the standard operating procedure (SOP), acceptance criteria were defined that described the limit of variation acceptable within a plate for inclusion into data analysis. These were (i) a neurite area of at least 45,000 pixels per well (cell number and frames per well were constant) in the solvent control wells, (ii) a viability (viable cells = double positive for calcein and Hoechst) of > 90% in solvent control wells, (iii) a reduction of neurite area in cells treated with narciclasine (50 nM) by at least 25% relative to DMSO control and (iv) viability of the narciclasine-treated cells greater 90% relative to DMSO control. Analysis of these acceptance criteria across 36 test plates, run on 12 different days, showed a robust performance of the assay (Fig. 2C, D).
3.3. Workflow for screening and data analysis of the NeuriTox test
For the actual screening procedure, a defined workflow was established that contained several pre-determined decision points. All compounds were tested at 1:1000 dilutions of their stock concentration, and at subsequent 3-fold dilution levels. For most compounds, the stock was 20 mM, so that the test concentrations were 20 μM, 6.7 μM, 2.2 μM, 0.7 μM, 0.25 μM, 82 nM, 27 nM, 9 nM, 3 nM, and 1 nM. For 2,2’,4,4’,5,5’-hexabromodiphenyl-ether, chrysene, dibenz(a,h)anthracene, bis(tributyltin) oxide, benzo[g,h,i]perylene, and 2,3,7,8-tetrachlorodibenzo-p-dioxin lower stock concentrations were used.
Data, expressed relative to DMSO solvent control, were used for log-logistic 4-parameter curve fitting (11 data points for each of the two endpoints: neurite area (NA) and viability (V)). Subsequently, hit identification was conducted in a two-step process. Initially, compounds were classified as “active” if they affected NA or V significantly (one-way ANOVA and Dunnett’s post-hoc test) at any test concentration, or when they reduced NA or V by ≥ 20%. Otherwise, they were classified as “inactive compounds”. For active compounds, it was examined in a second step whether there was at least one concentration tested that affected NA significantly, but did not decrease V significantly. If this was the case, the compound was classified as a “hit compound”. Compounds that affected NA and V similarly at all concentrations, were classified as unspecific “cytotoxic compounds”.
For the primary hits (= “hit compound”), hit confirmation testing was conducted. For that, the tested concentration range was adjusted to optimally span the estimated EC10, and three new independent experiments were performed. EC50 values were calculated according to the same procedure as for the screen data. Then, the EC50 ratio of V and NA, i.e., the offset of neurite outgrowth vs viability decrease, was calculated. These data were used as basis for the assay prediction model. If that ratio was ≥ 4, the compound was classified as a “specific (developmental) neurotoxicant”, while the compound was graded as a “cytotoxic compound” if the ratio was < 4 (Fig. 3).
3.4. Overview of NeuriTox screen results
After testing of all “80 compounds” in three independent experiments in the NeuriTox test, concentration-response graphs were produced for subsequent data analysis. Seven compounds (valinomycin, berberine chloride, colchicine, carbaryl, diethylstilbestrol, rotenone and MPP+) caused a significant decrease in neurite area at concentrations that did not affect viability (Fig. 4A). Therefore, these compounds were classified as “active hit compounds”. The lowest tested concentration that evoked an adverse effect (i.e., statistically significant reduction in neurite area compared to control) ranged between 27 nM (colchicine) and 20 μM (carbaryl and berberine chloride) (Fig. 4B).
Most cytotoxic compounds affected the neurite area and cell viability to about the same extent at any tested concentration (Fig. 4C, Fig. S51). As methylmercury (II) chloride was present as duplicate on the master plate, it was tested twice (as compound #69 and #77). The resulting curves overlapped to a large extent, indicating assay robustness and reproducibility. EC50 values were calculated for clear cytotoxicants directly from the screen results.
Unclear screen results were obtained only for some compounds (Fig. 4D). In these cases, the available data were not considered sufficient for classification of the respective chemical (e.g., as a neurotoxicant). Therefore, three compounds were re-tested. Captan proved to be a cytotoxicant with relatively low potency, while tebuconazole and triphenylphosphate (the latter was present twice in the library) were clearly non-toxic at concentrations up to 20 μM (Fig. 4D, Fig. S61), resulting in a toxicity EC50 of 238 μM for tebuconazole and > 200 μM for triphenylphosphate, when an extended concentration range was examined.
3.5. Hit confirmation testing and hit definition in the NeuriTox test
For hit confirmation testing, the compounds were re-purchased, and new stocks were prepared. Re-testing was performed using the same method, but within an optimized concentration range to facilitate EC50 calculation. The limits to this were compound solubility and a maximum DMSO concentration of 0.33% (v/v) in the NeuriTox test. All compounds that were identified as “active hit compound” in the screen were confirmed (Fig. 5A). The EC50 ratio of the compounds classified as “active hit compounds” ranged between 4.1 (diethylstilbestrol) and 620 (valinomycin) in the hit confirmation. Therefore, these seven compounds were classified as specific (developmental) neurotoxicants (Fig. 5B). Concerning the potency of the hit compounds, the EC50 values for reduction of neurite area ranged from 10 nM (valinomycin) up to 80 μM (MPP+). Even lower potencies were seen for some compounds, e.g., for tebuconazole (142 μM), but the latter compound was not a specific neurotoxicant according to our prediction model.
We were interested to what extent the hit definition depended on the prediction model. Here, we used the well-established prediction model of the NeuriTox test (based on EC50 values) as the default method (Krug et al., 2013; Stiegler et al., 2011). It was developed specifically for this assay, and it differs (i) from that used for other assays of neurite growth (Harrill et al., 2013; Harrill and Mundy, 2011; Flaskos et al., 2011; Frimat et al., 2010; Howard et al., 2005; Yang et al., 2014), (ii) from that of other assays that screened the same library (Ryan et al., 2016; Sirenko et al., 2017), and (iii) even from other assays developed in our own laboratory (Nyffeler et al., 2017a). If multiple assays are to be compared, as in the Tox21 program (Judson et al., 2014, 2016, 2015; Tice et al., 2013) or in screening the NTP80 collection, it may be advantageous to use a more generalized algorithm for hit definition. One such approach, taken by the NTP, uses the concept of benchmark concentrations (BMC). The underlying idea is that hits are defined by their distance from the background noise of a given assay. In more mathematical terms, the following steps are taken: (i) the standard deviation of negative controls is determined (= background noise level, BN); (ii) this information is used to define a benchmark response (BMR), which follows the same rule for each assay (e.g., BMR = 3 x BN); (iii) a concentration-response curve is fitted through the test compound data; (iv) the intersection of this curve with the BMR level is determined; (v) the concentration of the compound corresponding to this intersection-point is determined as the BMC. This procedure was applied here both for the viability and for the neurite area data obtained in the NeuriTox screen (Fig. S4A1).
The identification of active compounds obtained by this BMC method was largely similar to the prediction model of the NeuriTox assay (Fig. S4B1). Differences were only observed for the classification of “borderline compounds” into cytotoxic or specific developmental neurotoxicants. In such cases, one or the other approach may be more sensitive or specific (depending on variations and type of uncertainty of the test data). It was obvious that also the setting of the specificity-thresholds affected hit identification. For instance, if specificity of a compound was classified by the BMC ratio (BMCV/BMCNA), and the threshold was set at 3.16 (Ryan et al., 2016), valinomycin and carbaryl were classified as cytotoxic. If that threshold was changed to 2, the BMC method classified the same compounds as specific DNT-compounds, in agreement with the EC50 method (Fig. S4B1).
Three compounds, bisphenol A (BPA), 2,2’,4,4’-tetrabromodiphenylether (TBDE), and 2,2’,4,4’,5’-pentabromodiphenylether (PBDE) showed larger than normal data variation in the screen. According to the NeuriTox data analysis workflow, all three were initially classified as “active”. These compounds were also active according to the BMC method. However, re-testing of BPA showed that it actually has no effect up to 100 μM (data not shown), while the others retained the classification as “unspecific cytotoxicants”.
In summary, this comparison of entirely different hit definition approaches showed that they mostly lead to similar results. This suggests that the test method is robust. Moreover, it shows that both approaches may be useful, depending on the intended use: the BMC method may be more sensitive (fewer false negatives), and it is less dependent on the part of the curve that reflects high toxicant concentrations. On the other hand, it depends greatly on the quality of the data in the low toxicity range.
3.6. Chemical characterization of specific hit compounds (= specific (developmental) neurotoxicants)
To elucidate whether the NeuriTox test has a bias to detect compounds with certain physicochemical properties, these were investigated for the set of tested compounds. Compounds that were identified as specific neurite outgrowth inhibitors in the NeuriTox test were analyzed regarding their hydrophobicity and molecular weight (Fig. S2A1). While there was no bias for a certain molecular weight detectable, all identified specific neurite outgrowth inhibitors were located in a medium hydrophobicity range (logP values 0–5). A more generalized approach, using hundreds of chemical descriptors (GRIND2 physicochemical descriptors), showed that the specific neurite outgrowth inhibitors were evenly distributed within the physicochemical properties of the NTP80 collection compounds, and even within the chemical space of the large libraries Tox21 and DrugBank (Fig. 1C and Fig. S2B1). These data suggest that the NeuriTox test has no obvious classification bias with respect to physicochemical properties.
3.7. NeuriTox hits in light of Tox21 data on these compounds
For all active compounds identified from the NTP80 collection by the EC50 and BMC method, available data were extracted from the Tox21 database. In order to compare data from different assays, the BMC value for the most sensitive measured endpoint was used. On this basis, the impairment of LUHMES neurite outgrowth was compared with all viability data in the Tox21 library (boxes and whiskers, n = 168 viability endpoints in total, 7–28 per compound) (Fig. 6A). For 11 of the 13 compared compounds, inhibition of LUHMES neurite outgrowth was more sensitive than the median response of the Tox21 assays; for 10 of the 13 compounds, LUHMES cells were even more sensitive than the 25th percentile fraction of the Tox21 viability results. No Tox21 results were available for valinomycin and MPP+.
Furthermore, LUHMES neurite outgrowth was compared against functional endpoints (e.g., receptor activation or stress response signaling (n = 123 specific endpoints in total, 8–16 per compound) measured in the Tox21 set up, excluding viability measurements. Data from a recently published neurite outgrowth test method were included in this comparison (Ryan neurites, Ryan viability, Fig. 6B) (Ryan et al., 2016).
For 9 of the 13 compounds, LUHMES neurite outgrowth was a more sensitive endpoint than the median of all functional Tox21 data. For all compounds, except for berberine chloride, the NeuriTox test was more sensitive than the alternative neurite outgrowth test method used by Ryan. The parkinsonian toxicant MPP+ was only detected in the NeuriTox test. This is consistent with its known mode of action, which requires the dopamine transporter. The latter is expressed in LUHMES cells (Lotharius et al., 2002; Schildknecht et al., 2009), but not in the mixed neuronal cultures used by Ryan et al. (2016).
3.8. Re-testing of the NTP collection in the PeriTox test
We were interested in how far hits in the NeuriTox screen (hazard of compounds to central nervous system neurons) would overlap with the activity of compounds of the NTP80 collection in a recently established (Hoelting et al., 2016) test of peripheral neurotoxicity (PeriTox test). This method uses human immature dorsal root ganglia neurons (iDRG) that are produced from pluripotent stem cells that are still in a phase of neurite growth. Like in the NeuriTox assay, exposure to toxicants in this test is for 24 h, and readouts for viability (V) and neurite area (NA) are also conducted in a similar way (Fig. 7A).
Three independent screen runs were performed, and eight compounds were identified as “active hit compounds” according to the evaluation algorithm specified above for the NeuriTox test. These compounds (berberine chloride, carbaryl, colchicine, diethylstilbestrol, rotenone, valinomycin, iodocarb, and methylmercury chloride) underwent subsequent hit confirmation testing. Seven of the eight compounds were confirmed as specific hits according to the published prediction model (EC50 ratio viability/neurite area ≥ 3; Hoelting et al., 2016). Carbaryl failed this verification step (Fig. 7B, Fig. S71). After the screen, we included acrylamide in the group of hits. This is known from a former publication (Hoelting et al., 2016) to be a specific and active compound in the PeriTox assay, but at concentrations ≥ 20 μM (Fig. 7B). As further post-testing step, valproic acid (VPA) was classified as cytotoxic. This was done on the basis of previously obtained data in a much higher concentration range than used in the screen (Fig. S71). For carbaryl and VPA, the ratio of the EC50 values for NA and V was > 2 but < 3. We investigated alternative prediction models (BMCs, EC30, EC25 and EC20 ratios, Fig. S71) to explore whether they would indicate a specific effect of the toxicants. However, a ratio > 3 was reached by neither approach. Thus, the default prediction model appears to yield a robust definition of hits and non-hits.
For the PeriTox hits, the potency spanned a range from 20 nM (EC50(NA)) for valinomycin to 2.3 mM (EC50(NA)) for acrylamide. The offset between adverse effects on NA and V (EC50 ratios) ranged from 3.2 (acrylamide) up to 330 (rotenone). These ratios were useful ranking measures within the PeriTox assay. However, they are based on various assumptions (e.g., curve shape and steepness) and they were therefore considered problematic for comparisons with other assays (e.g., the NeuriTox test). In order to directly compare the effects of the same set of compounds on different tests (NeuriTox vs PeriTox), the BMC(NA) values (referring more to the onset of toxicity) were plotted for compounds which were identified as specific compounds in at least one of the tests (Fig. 7C). This approach allowed the comparison of the hazard to the central nervous system (NeuriTox) vs the peripheral nervous system (PeriTox). The NeuriTox assay showed a tendency to be affected at lower compound concentrations when the compound was a hit. The PeriTox had a higher hit-rate (detection of acrylamide, iodocarb and methylmercury chloride). The PeriTox detected acrylamide, a well-known peripheral neurotoxicant (Cavanagh, 2000; Spencer and Schaumburg, 1975), whereas the NeuriTox assay identified MPP+ as a hit, well in accordance with the known central nervous toxicity (Schildknecht et al., 2017) of this compound (Fig. 7C).
For comparison of the specificity (V/NA ratios) of the tests, the default prediction models have disadvantages (rules dealing, for example, with viability curves that did not drop to a 50% level). Thus, BMC values were used. This comparison shows that there are indeed some drastic differences (e.g., for MPP+). It also demonstrates that some differences in the identification of specific hits are not very robust. For instance, acrylamide, re-tested at high concentration in the PeriTox and NeuriTox assays, was a specific PeriTox hit according to the individual test prediction models. Comparison of BMC ratios suggests however, that the differences between the tests are minor. In such a borderline situation, a compound may end up by chance on either side of the hit threshold, and for some purposes, it would be useful to introduce a third category (besides hits and non-hits) of “borderline compounds” (Fig. 7D).
The comparison also clearly shows the advantage of using two complementary assays for the same type of endpoint if sensitivity of compound identification (e.g., for further testing) is a major issue. The combination of both tests had a higher sensitivity for detection of potentially hazardous compounds.
3.9. Comparison of data from neurite toxicity assays with other published DNT tests
Hazardous effects of the NTP80 collection have so far been described in four publications, which span a pure cytotoxicity assessment of cells in varying neural differentiation states (Pei et al., 2016), an alternative neurite outgrowth model (Ryan et al., 2016), and highly function-based studies focusing on the migration of neural crest cells (Nyffeler et al., 2017a) or adverse effects on cardiomyocyte function (Sirenko et al., 2017). These data were synoptically compared to the results of our study (Fig. 8).
In the first published screen (Pei et al., 2016), cytotoxicity was assessed after exposure to the NTP80 collection compounds at two different concentrations (10 and 100 μM) for 72 h. In this study, many compounds appeared cytotoxic to neural cells, but hit confirmation was not performed. On the other hand, the cardiotoxicity screen (Sirenko et al., 2017) addressed a broad set of endpoints and more than half of the 69 tested compounds affected cardiomyocytes in some way. A prediction model still also needs to be developed for that test method. For our comparison, we ranked only those compounds as potentially cardiotoxic, which i) affected cardio-physiologic parameters after 30 min treatment at a three-fold lower concentration than viability and ii) if they had no effect on viability after 24 h. For the Ryan et al. (2016) neurite outgrowth model, we adopted the classification suggested by the authors: a specific neurotoxin had a BMC for neurite outgrowth that was at least 3.16-fold (= one half-log dilution step) lower than for general cytotoxicity. For the neural crest migration (cMINC) (Nyffeler et al., 2017a) as well as for the NeuriTox and PeriTox tests, the published prediction models were used (Hoelting et al., 2016; Krug et al., 2013).
Limiting the comparison to compounds selectively active for neuro or cardio effects, the cMINC test and the cardiotoxicity assay classified the highest number of compounds (23 and 32, respectively). Further analysis of the specific compounds showed that none of the tested compounds evoked a specific response in all assays. However, seven of the 69 compounds (rotenone, diethylstilbestrol, berberine chloride, valinomycin, carbaryl, methylmercury(II)chloride, and iodocarb) were active (not necessarily specific) in all test methods when full concentration responses were considered.
Comparing which compounds were classified as specific between neural (NeuriTox, PeriTox, cMINC) cell based tests and the cardiotoxicity test method showed that many compounds that were specific in the neuronal system were generally cytotoxic in the cardiomyocyte-based test method (e.g., rotenone, diethylstilbestrol, berberine chloride, valinomycin), whereas compounds that were specific in the cardiotoxicity test method were inactive or generally cytotoxic in the neuronal-based tests (e.g., carbaryl, hydroxydopamine). From this initial comparison of tests, the PeriTox and NeuriTox tests appear to have a largely overlapping specificity range. Moreover, most hits of the neurite assays are also identified by the cMINC test. However, the latter test identifies a large group of additional compounds. The cardiotoxicity test method seems to be largely complementary.
4. Conclusion and outlook
Our comparative compilation of screen data shows where gaps remain to be filled in data generation and interpretation. For instance, strong developmental toxicants, such as thalidomide and 5-fluorouracil, were not detected by any of the published screens. This pinpoints the need for supplementing the test battery with other complementary tests.
Our comparison also revealed some technical issues that need to be addressed:
The definition of non-actives is difficult, especially if the highest tested concentration differs between screens.
The use of nominal concentrations for comparisons poses problems, as the compound concentration that impacts the cell directly is influenced by its physicochemical parameters (e.g., extent of adhesion to plastic and serum binding), and the properties of the test system (cell density, medium type). The free soluble concentration in the culture medium or the intracellular concentration would provide more comparable measures.
Different concentrations of solvent (e.g., 0.1% (v/v) DMSO (= 14 mM) or 0.5% (v/v) DMSO (= 70 mM)) can affect screen results.
Fixed concentration range screens that limit the highest possible concentration prevent testing of low potency but highly abundant compounds at relevant exposure concentrations. Examples here were VPA and acrylamide, where clinical and accidental exposure can be higher than the highest tested concentration used in our screen. The issue of test concentrations is also important in another context: how do the concentrations at which hits are observed relate to relevant in vivo concentrations? This point was neglected here, since the screen is designed to create alerts, and the follow-up evaluation would then prioritize them, e.g., taking various exposure scenarios and related estimates of human brain, plasma or fetal concentrations into account.
The different false-positive rates of screens are important for comparison of screen hits or for subsequent toxicological evaluations (e.g., for QSAR or read-across approaches). In order to obtain a good sensitivity (low number of false-negatives), hit definitions of screens are set in a way to allow many false-positives. For instance, if the significance level is set to 0.1, then a screen of 80 compounds will result in 8 false-positives. This number can subsequently be drastically reduced by secondary re-testing of hits.
One of the most pertinent issues of hit definition is the test prediction model. Most screens, including NeuriTox and PeriTox, use a binary model (hit/non-hit). In such cases, threshold setting requires a large learning and training set of negative, unspecific and positive control compounds (Crofton et al., 2011; Leist et al., 2010; Schmidt et al., 2017). For the NeuriTox and PeriTox tests, prediction models have been established based on the evaluation of the EC50 ratio of viability and neurite area (Hoelting et al., 2016; Krug et al., 2013; Stiegler et al., 2011). These prediction models are designed in a way that compounds that affect neurites more potently than viability (EC50(V/NA) ratio > 4 for NeuriTox and EC50(V/NA) ratio > 3 for PeriTox) are considered specific neurotoxicants. It is important to note that the prediction model only makes a statement on positives (= neurotoxicants). The model does by no means imply that compounds with a low EC50 ratio (= non-hits) are non-toxicants. This potential fallacy must be strictly avoided. For instance, strong cytotoxicity under the given in vitro test conditions may mask a potential specific in vivo neurotoxic effect.
Furthermore, it has to be considered which curve-fitting approach and constraints were applied to yield summary data from the curve-fit to enter them into the prediction model. For instance, EC50 values are relatively robust against baseline fluctuations, but they depend strongly on the shape of the concentration response curve (shallow vs steep) and on the lower part (higher toxicity range) of the curve. In contrast, BMC values better define the actual onset of toxicity, but they depend strongly on the low-concentration data and baseline fluctuations. If the focus on data analysis is strong sensitivity (low false-negative rate) or comparison across many different models, the BMC is a very useful method.
An issue that may also need to be revisited in the future is the classification of so-called “borderline compounds”, where the EC50 ratio is close to the specificity threshold. Following the classical binary prediction model of, e.g., the NeuriTox test, a compound with an EC50 ratio of 3.9 is classified as “cytotoxic”, whereas a compound with a ratio of 4.1 is a “neurotoxicant”. This sharp distinction contrasts with the statistical variation of data, e.g., in different screen runs. Therefore, it might be helpful to introduce a third category of “borderline compounds”, which comprises the range around the threshold (Leontaridou et al., 2017). Alternatively, a probability-based prediction model could be developed which is not based on distinct hazard classes (i.e., “non-specific”, “borderline”, “specific”) (Leontaridou et al., 2017), but which identifies the compound’s hazard potential (Paparella et al., 2017, 2013; Leist et al., 2014; Basketter et al., 2012; Jaworska and Hoffmann, 2010; Hartung et al., 2013; Judson et al., 2015). An example for such an approach is given in Fig. S81, but considerable further work is required to refine this approach.
Further issues are acceptance criteria for test data and resultant curve shapes. Here, we used “inspection by the human eye” to ensure some plausibility (e.g., monotonic curve shapes). This procedure may introduce bias, and it is difficult to apply to large screens. An example from the NeuriTox screen is the concentration response curve for tebuconazole (Fig. 4D). This compound never affected viability or neurite area more than 20% and it would therefore be classified as a non-hit. However, visual inspection showed a non-monotonic concentration-response curve. It was therefore re-tested and was indeed identified as an active cytotoxic compound (at high concentrations) (Fig. S61).
Of toxicological concern are false negatives due to biological differences of the screen system vs the in vivo situation. A typical example here is the non-toxicity of hexane, a known neurotoxicant. In vivo, hexane is activated by P450 enzymes to hexanedione and this metabolite subsequently causes neurotoxicity. The lack of a metabolite activation system prevents the detection of such toxicants. Similarly, the in vitro system may lack important toxicant targets or the readout used can be independent of a certain target activity. An example here is acetylcholinesterase (AChE), which does not play a role for the assay readout, and thus, typically neurotoxic AChE inhibitors are not detected.
In order to transit from screen hits to more definite toxicological statements, it is usually necessary to test whether a similar adverse effect can be confirmed with another test method (= secondary hit follow-up) (Nyffeler et al., 2017a). Here, the PeriTox test was used to gain further information on potential neurotoxic hazard. It needs to be noted that the NeuriTox and PeriTox assays do not test for identical toxicities but are complementary to a certain extent. While the NeuriTox test was designed to identify toxicants that disrupt neurites of the central nervous system neurons, the PeriTox test was set up to detect peripheral neurotoxicants. This complementarity is exemplified by the fact that the parkinsonian toxicant MPP+ is only picked up by the NeuriTox test, while, e.g., acrylamide was only identified as specific hit by the PeriTox test. However, since both tests are based on the impairment of neuronal structures, a certain degree of convergence is expected as well. Both tests identified the same five compounds (out of 7 or 8 specific hit compounds in the NeuriTox and PeriTox test, respectively) as specifically neurotoxic.
Knowing for which hazard assessment scenarios these tests can be applied, rationally structured test batteries can be built in an efficient (minimal overlap of tests) and sufficient (broad coverage of biological endpoints) manner.
Supplementary Material
Acknowledgements
This work was supported by the Land BW, the Doeren-kamp-Zbinden Foundation, the DFG (RTG1331, KoRS-CB), the BMBF (NeuriTox) and the European Project EU-ToxRisk.
Abbreviations:
- BMC
benchmark concentration
- BN
background noise level
- BPA
bisphenol A
- BR
benchmark response
- calcein-AM
calcein acetoxymethyl ester
- cAMP
N6,2′-O-dibutyryladenosine 3′,5′-cyclic monophosphate
- cMINC
neural crest cell migration test
- DM
differentiation medium
- DMSO
dimethyl sulfoxide
- DNT
developmental neurotoxicity
- EC
effective concentration
- FGF
fibroblast growth factor
- GDNF
glial derived neurotrophic factor
- log
logarithm of the partition-coefficient
- LUHMES
Lund human mesencephalic
- MW
molecular weight
- NA
neurite area
- NT
neurotoxicity
- NTP
National Toxicology Program of the USA
- OECD
Organisation for Economic Co-operation and Development
- PAH
polycyclic aromatic hydrocarbons
- PCA
principal component analysis
- PDM
PeriTox differentiation medium
- PLO
poly-L-ornithine
- PM
proliferation medium
- SEM
standard error of the mean
- UKN2
cMINC
- UKN4
NeuriTox
- UKN5
PeriTox
- V
viability
- VPA
valproic acid
Footnotes
Disclaimer: The views expressed in this paper are those of the authors and do not necessarily reflect the statements, opinions, views, conclusions, or policies of the National Institutes of Health or the United States government. Mention of trade names or commercial products does not constitute endorsement or recommendation for use.
Conflict of interest
The authors declare no conflict of interest.
https://github.com/flatkinson/standardiser (accessed 21.11.2016)
https://www.epa.gov/chemical-research/toxicity-forecaster-toxcasttm-data (retrieved on 29.07.2016; data released 19.10.2015)
https://ntp.niehs.nih.gov/sandbox/tox21-activity-browser/ (accessed 28.06.2016)
References
- Aschner M, Ceccatelli S, Daneshian M. et al. (2017). Reference compounds for alternative test methods to indicate developmental neurotoxicity (DNT) potential of chemicals: Example lists and criteria for their selection and use. ALTEX 34, 49–74. doi: 10.14573/altex.1604201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bal-Price A, Crofton KM, Leist M. et al. (2015). International stakeholder network (ISTNET): Creating a developmental neurotoxicity (DNT) testing road map for regulatory purposes. Arch Toxicol 89, 269–287. doi: 10.1007/s00204-015-1464-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balmer NV, Weng MK, Zimmer B. et al. (2012). Epigenetic changes and disturbed neural development in a human embryonic stem cell-based model relating to the fetal valproate syndrome. Hum Mol Genet 21, 4104–4114. doi: 10.1093/hmg/dds239 [DOI] [PubMed] [Google Scholar]
- Basketter DA, Clewell H, Kimber I. et al. (2012). A roadmap for the development of alternative (non-animal) methods for systemic toxicity testing – t4 report*. ALTEX 29, 3–91. doi: 10.14573/altex.2012.1.003 [DOI] [PubMed] [Google Scholar]
- Baumann J, Gassmann K, Masjosthusmann S. et al. (2016). Comparative human and rat neurospheres reveal species differences in chemical effects on neurodevelopmental key events. Arch Toxicol 90, 1415–1427. doi: 10.1007/s00204-015-1568-8 [DOI] [PubMed] [Google Scholar]
- Bennett D, Bellinger DC, Birnbaum LS et al. (2016). Project TENDR: Targeting environmental neuro-developmental risks the TENDR consensus statement. Environ Health Perspect 124, A118–122. doi: 10.1289/ehp358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown JP, Lynch BS, Curry-Chisolm IM et al. (2017). Assaying spontaneous network activity and cellular viability using multi-well microelectrode arrays. Methods Mol Biol 1601, 153–170. doi: 10.1007/978-1-4939-6960-9_13 [DOI] [PubMed] [Google Scholar]
- Cavanagh JB (2000). Experimental and clinical neurotoxicology. Second edition. Brain 123, 2571–2573. doi: 10.1093/brain/123.12.2571 [DOI] [Google Scholar]
- Crofton KM, Mundy WR, Lein PJ et al. (2011). Developmental neurotoxicity testing: Recommendations for developing alternative methods for the screening and prioritization of chemicals. ALTEX 28, 9–15. doi: 10.14573/altex.2011.1.009 [DOI] [PubMed] [Google Scholar]
- Crofton KM, Mundy WR and Shafer TJ (2012). Developmental neurotoxicity testing: A path forward. Congenit Anom (Kyoto) 52, 140–146. doi: 10.1111/j.1741-4520.2012.00377.x [DOI] [PubMed] [Google Scholar]
- Crofton K, Fritsche E, Ylikomi T. et al. (2014). International stakeholder network (ISTNET) for creating a developmental neurotoxicity testing (DNT) roadmap for regulatory purposes. ALTEX 31, 223–224. doi: 10.14573/altex.1402121 [DOI] [PubMed] [Google Scholar]
- Culbreth ME, Harrill JA, Freudenrich TM et al. (2012). Comparison of chemical-induced changes in proliferation and apoptosis in human and mouse neuroprogenitor cells. Neurotoxicology 33, 1499–1510. doi: 10.1016/j.neuro.2012.05.012 [DOI] [PubMed] [Google Scholar]
- Daneshian M, Kamp H, Hengstler J. et al. (2016). Highlight report: Launch of a large integrated European in vitro toxicology project: EU-ToxRisk. Arch Toxicol 90, 1021–1024. doi: 10.1007/s00204-016-1698-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duran A, Zamora I. and Pastor M. (2009). Suitability of GRIND-based principal properties for the description of molecular similarity and ligand-based virtual screening. J Chem Inf Model 49, 2129–2138. doi: 10.1021/ci900228x [DOI] [PubMed] [Google Scholar]
- Ezendam J, Braakhuis HM and Vandebriel RJ (2016). State of the art in non-animal approaches for skin sensitization testing: From individual test methods towards testing strategies. Arch Toxicol 90, 2861–2883. doi: 10.1007/s00204-016-1842-4 [DOI] [PubMed] [Google Scholar]
- Flaskos J, Nikolaidis E, Harris W. et al. (2011). Effects of sub-lethal neurite outgrowth inhibitory concentrations of chlorpyrifos oxon on cytoskeletal proteins and acetylcholinesterase in differentiating N2a cells. Toxicol Appl Pharmacol 256, 330–336. doi: 10.1016/j.taap.2011.06.002 [DOI] [PubMed] [Google Scholar]
- Frimat JP, Sisnaiske J, Subbiah S. et al. (2010). The network formation assay: A spatially standardized neurite outgrowth analytical display for neurotoxicity screening. Lab Chip 10, 701–709. doi: 10.1039/b922193j [DOI] [PubMed] [Google Scholar]
- Fritsche E, Cline JE, Nguyen N-H et al. (2005). Polychlorinated biphenyls disturb differentiation of normal human neural progenitor cells: Clue for involvement of thyroid hormone receptors. Environ Health Perspect 113, 871–876. doi: 10.1289/ehp.7793 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fritsche E, Crofton KM, Hernandez AF et al. (2017). OECD/EFSA workshop on developmental neurotoxicity (DNT): The use of non-animal test methods for regulatory purposes. ALTEX 34, 311–315. doi: 10.14573/altex.1701171 [DOI] [PubMed] [Google Scholar]
- Grandjean P. and Landrigan PJ (2006). Developmental neurotoxicity of industrial chemicals. Lancet 368, 2167–2178. doi: 10.1016/S0140-6736(06)69665-7 [DOI] [PubMed] [Google Scholar]
- Grandjean P. and Landrigan PJ (2014). Neurobehavioural effects of developmental toxicity. Lancet Neurol 13, 330–338. doi: 10.1016/S1474-4422(13)70278-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harrill JA and Mundy WR (2011). Quantitative assessment of neurite outgrowth in PC12 cells. Methods Mol Biol 758, 331–348. doi: 10.1007/978-1-61779-170-3_23 [DOI] [PubMed] [Google Scholar]
- Harrill JA, Freudenrich TM, Robinette BL and Mundy WR (2011). Comparative sensitivity of human and rat neural cultures to chemical-induced inhibition of neurite outgrowth. Toxicol Appl Pharmacol 256, 268–280. doi: 10.1016/j.taap.2011.02.013 [DOI] [PubMed] [Google Scholar]
- Harrill JA, Robinette BL, Freudenrich T. and Mundy WR (2013). Use of high content image analyses to detect chemical-mediated effects on neurite sub-populations in primary rat cortical neurons. Neurotoxicology 34, 61–73. doi: 10.1016/j.neuro.2012.10.013 [DOI] [PubMed] [Google Scholar]
- Hartung T, Luechtefeld T, Maertens A. and Kleensang A. (2013). Food for thought … Integrated testing strategies for safety assessments. ALTEX 30, 3–18. doi: 10.14573/altex.2013.1.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoelting L, Klima S, Karreman C. et al. (2016). Stem cell-derived immature human dorsal root ganglia neurons to identify peripheral neurotoxicants. Stem Cells Transl Med 5, 476–487. doi: 10.5966/sctm.2015-0108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hogberg HT, Kinsner-Ovaskainen A, Hartung T. et al. (2009). Gene expression as a sensitive endpoint to evaluate cell differentiation and maturation of the developing central nervous system in primary cultures of rat cerebellar granule cells (CGCs) exposed to pesticides. Toxicol Appl Pharmacol 235, 268–286. doi: 10.1016/j.taap.2008.12.014 [DOI] [PubMed] [Google Scholar]
- Hogberg HT, Kinsner-Ovaskainen A, Coecke S. et al. (2010). mRNA expression is a relevant tool to identify developmental neurotoxicants using an in vitro approach. Toxicol Sci 113, 95–115. doi: 10.1093/toxsci/kfp175 [DOI] [PubMed] [Google Scholar]
- Howard AS, Bucelli R, Jett DA et al. (2005). Chlorpyrifos exerts opposing effects on axonal and dendritic growth in primary neuronal cultures. Toxicol Appl Pharmacol 207, 112–124. doi: 10.1016/j.taap.2004.12.008 [DOI] [PubMed] [Google Scholar]
- Jaworska J. and Hoffmann S. (2010). Integrated testing strategy (ITS) – Opportunities to better use existing data and guide future testing in toxicology. ALTEX 27, 231–242. doi: 10.14573/altex.2010.4.231 [DOI] [PubMed] [Google Scholar]
- Judson R, Houck K, Martin M. et al. (2014). In vitro and modelling approaches to risk assessment from the U.S. Environmental protection agency toxcast programme. Basic Clin Pharmacol Toxicol 115, 69–76. doi: 10.1111/bcpt.12239 [DOI] [PubMed] [Google Scholar]
- Judson RS, Magpantay FM, Chickarmane V. et al. (2015). Integrated model of chemical perturbations of a biological pathway using 18 in vitro high-throughput screening assays for the estrogen receptor. Toxicol Sci 148, 137–154. doi: 10.1093/toxsci/kfv168 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Judson R, Houck K, Martin M. et al. (2016). Analysis of the effects of cell stress and cytotoxicity on in vitro assay activity across a diverse chemical and assay space. Toxicol Sci 152, 323–339. doi: 10.1093/toxsci/kfw092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirkland D, Reeve L, Gatehouse D. et al. (2011). A core in vitro genotoxicity battery comprising the Ames test plus the in vitro micronucleus test is sufficient to detect rodent carcinogens and in vivo genotoxins. Mutat Res 721, 27–73. doi: 10.1016/j.mrgentox.2010.12.015 [DOI] [PubMed] [Google Scholar]
- Krug AK, Balmer NV, Matt F. et al. (2013). Evaluation of a human neurite growth assay as specific screen for developmental neurotoxicants. Arch Toxicol 87, 2215–2231. doi: 10.1007/s00204-013-1072-y [DOI] [PubMed] [Google Scholar]
- Krug AK, Gutbier S, Zhao L. et al. (2014). Transcriptional and metabolic adaptation of human neurons to the mitochondrial toxicant MPP+. Cell Death Dis 5, e1222. doi: 10.1038/cddis.2014.166 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leist M, Hartung T. and Nicotera P. (2008a). The dawning of a new age of toxicology. ALTEX 25, 103–114. doi: 10.14573/altex.2008.2.103 [DOI] [PubMed] [Google Scholar]
- Leist M, Kadereit S. and Schildknecht S. (2008b). Food for thought ... On the real success of 3R approaches. ALTEX 25, 17–32. doi: 10.14573/altex.2008.1.17 [DOI] [PubMed] [Google Scholar]
- Leist M, Efremova L. and Karreman C. (2010). Food for thought ... Considerations and guidelines for basic test method descriptions in toxicology. ALTEX 27, 309–317. doi: 10.14573/altex.2010.4.309 [DOI] [PubMed] [Google Scholar]
- Leist M, Hasiwa N, Daneshian M. and Hartung T. (2012). Validation and quality control of replacement alternatives – Current status and future challenges. Toxicol Res 1, 8–22. doi: 10.1039/c2tx20011b [DOI] [Google Scholar]
- Leist M, Hasiwa N, Rovida C. et al. (2014). Consensus report on the future of animal-free systemic toxicity testing. ALTEX 31, 341–356. doi: 10.14573/altex.1406091 [DOI] [PubMed] [Google Scholar]
- Leontaridou M, Urbisch D, Kolle SN et al. (2017). The borderline range of toxicological methods: Quantification and implications for evaluating precision. ALTEX 34, 525–538. doi: 10.14573/altex.1606271 [DOI] [PubMed] [Google Scholar]
- Lotharius J, Barg S, Wiekop P. et al. (2002). Effect of mutant alpha-synuclein on dopamine homeostasis in a new human mesencephalic cell line. J Biol Chem 277, 38884–38894. doi: 10.1074/jbc.M205518200 [DOI] [PubMed] [Google Scholar]
- Lotharius J, Falsig J, van Beek J. et al. (2005). Progressive degeneration of human mesencephalic neuron-derived cells triggered by dopamine-dependent oxidative stress is dependent on the mixed-lineage kinase pathway. J Neurosci 25, 6329–6342. doi: 10.1523/JNEUROSCI.1746-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Makris SL, Raffaele K, Allen S. et al. (2009). A retrospective performance assessment of the developmental neurotoxicity study in support of OECD test guideline 426. Environ Health Perspect 117, 17–25. doi: 10.1289/ehp.11447 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marx U, Andersson TB, Bahinski A. et al. (2016). Biology-inspired microphysiological system approaches to solve the prediction dilemma of substance testing. ALTEX 33, 272–321. doi: 10.14573/altex.1603161 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Milletti F, Storchi L, Sforna G. and Cruciani G. (2007). New and original pKa prediction method using grid molecular interaction fields. J Chem Inf Model 47, 2172–2181. doi: 10.1021/ci700018y [DOI] [PubMed] [Google Scholar]
- Mundy WR, Padilla S, Breier JM et al. (2015). Expanding the test set: Chemicals with potential to disrupt mammalian brain development. Neurotoxicol Teratol 52, 25–35. doi: 10.1016/j.ntt.2015.10.001 [DOI] [PubMed] [Google Scholar]
- Nyffeler J, Dolde X, Krebs A. et al. (2017a). Combination of multiple neural crest migration assays to identify environmental toxicants from a proof-of-concept chemical library. Arch Toxicol 91, 3613–3632. doi: 10.1007/s00204-017-1977-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nyffeler J, Karreman C, Leisner H. et al. (2017b). Design of a high-throughput human neural crest cell migration assay to indicate potential developmental toxicants. ALTEX 34, 75–94. doi: 10.14573/altex.1605031 [DOI] [PubMed] [Google Scholar]
- OECD (2007). Test No. 426: Developmental Neurotoxicity Study. Paris, France: OECD Publishing. doi: 10.1787/9789264067394-en [DOI] [Google Scholar]
- Pamies D, Barreras P, Block K. et al. (2016). A human brain microphysiological system derived from induced pluripotent stem cells to study neurological diseases and toxicity. ALTEX 34, 362–376. doi: 10.14573/altex.1609122 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paparella M, Daneshian M, Hornek-Gausterer R. et al. (2013). Uncertainty of testing methods – What do we (want to) know? ALTEX 30, 131–144. doi: 10.14573/altex.2013.2.131 [DOI] [PubMed] [Google Scholar]
- Paparella M, Colacci A. and Jacobs MN (2017). Uncertainties of testing methods: What do we (want to) know about carcinogenicity? ALTEX 34, 235–252. doi: 10.14573/altex.1608281 [DOI] [PubMed] [Google Scholar]
- Pastor M, Cruciani G, McLay I. et al. (2000). Grid-independent descriptors (GRIND): A novel class of alignment-independent three-dimensional molecular descriptors. J Med Chem 43, 3233–3243. doi: 10.1021/jm000941m [DOI] [PubMed] [Google Scholar]
- Pei Y, Peng J, Behl M. et al. (2016). Comparative neurotoxicity screening in human iPSC-derived neural stem cells, neurons and astrocytes. Brain Res 1638, 57–73. doi: 10.1016/j.brainres.2015.07.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prinsen MK, Hendriksen CF, Krul CA and Woutersen RA (2017). The isolated chicken eye test to replace the Draize test in rabbits. Regul Toxicol Pharmacol 85, 132–149. doi: 10.1016/j.yrtph.2017.01.009 [DOI] [PubMed] [Google Scholar]
- Ryan KR, Sirenko O, Parham F. et al. (2016). Neurite outgrowth in human induced pluripotent stem cell-derived neurons as a high-throughput screen for developmental neurotoxicity or neurotoxicity. Neurotoxicology 53, 271–281. doi: 10.1016/j.neuro.2016.02.003 [DOI] [PubMed] [Google Scholar]
- Sadowski J, Gasteiger J. and Klebe G. (1994). Comparison of automatic three-dimensional model builders using 639 x-ray structures. J Chem Inf Comput Sci 34, 1000–1008. doi: 10.1021/ci00020a039 [DOI] [Google Scholar]
- Schildknecht S, Poltl D, Nagel DM et al. (2009). Requirement of a dopaminergic neuronal phenotype for toxicity of low concentrations of 1-methyl-4-phenylpyridinium to human cells. Toxicol Appl Pharmacol 241, 23–35. doi: 10.1016/j.taap.2009.07.027 [DOI] [PubMed] [Google Scholar]
- Schildknecht S, Karreman C, Poltl D. et al. (2013). Generation of genetically-modified human differentiated cells for toxicological tests and the study of neurodegenerative diseases. ALTEX 30, 427–444. doi: 10.14573/altex.2013.4.427 [DOI] [PubMed] [Google Scholar]
- Schildknecht S, Di Monte DA, Pape R. et al. (2017). Tipping points and endogenous determinants of nigrostriatal degeneration by MPTP. Trends Pharmacol Sci 38, 541–555. doi: 10.1016/j.tips.2017.03.010 [DOI] [PubMed] [Google Scholar]
- Schmidt BZ, Lehmann M, Gutbier S. et al. (2017). In vitro acute and developmental neurotoxicity screening: An overview of cellular platforms and high-throughput technical possibilities. Arch Toxicol 91, 1–33. doi: 10.1007/s00204-016-1805-9 [DOI] [PubMed] [Google Scholar]
- Schmuck MR, Temme T, Dach K. et al. (2017). Omnisphero: A high-content image analysis (HCA) approach for phenotypic developmental neurotoxicity (DNT) screenings of organoid neurosphere cultures in vitro. Arch Toxicol 91, 2017–2028. doi: 10.1007/s00204-016-1852-2 [DOI] [PubMed] [Google Scholar]
- Scholz D, Poltl D, Genewsky A. et al. (2011). Rapid, complete and large-scale generation of post-mitotic neurons from the human LUHMES cell line. J Neurochem 119, 957–971. doi: 10.1111/j.1471-4159.2011.07255.x [DOI] [PubMed] [Google Scholar]
- Shinde V, Hoelting L, Srinivasan SP et al. (2017). Definition of transcriptome-based indices for quantitative characterization of chemically disturbed stem cell development: Introduction of the STOP-Toxukn and STOP-Toxukk tests. Arch Toxicol 91, 839–864. doi: 10.1007/s00204-016-1741-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sirenko O, Grimm FA, Ryan KR et al. (2017). In vitro cardiotoxicity assessment of environmental chemicals using an organotypic human induced pluripotent stem cell-derived model. Toxicol Appl Pharmacol 322, 60–74. doi: 10.1016/j.taap.2017.02.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smirnova L, Hogberg HT, Leist M. and Hartung T. (2014). Developmental neurotoxicity – Challenges in the 21st century and in vitro opportunities. ALTEX 31, 129–156. doi: 10.14573/altex.1403271 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smirnova L, Harris G, Delp J. et al. (2016). A LUHMES 3D dopaminergic neuronal model for neurotoxicity testing allowing long-term exposure and cellular resilience analysis. Arch Toxicol 90, 2725–2743. doi: 10.1007/s00204-015-1637-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spencer PS and Schaumburg HH (1975). Nervous system degeneration produced by acrylamide monomer. Environ Health Perspect 11, 129–133. doi: 10.1289/ehp.7511129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stiegler NV, Krug AK, Matt F. and Leist M. (2011). Assessment of chemical-induced impairment of human neurite outgrowth by multiparametric live cell imaging in high-density cultures. Toxicol Sci 121, 73–87. doi: 10.1093/toxsci/kfr034 [DOI] [PubMed] [Google Scholar]
- Thomson JA, Itskovitz-Eldor J, Shapiro SS et al. (1998). Embryonic stem cell lines derived from human blastocysts. Science 282, 1145–1147. doi: 10.1126/science.282.5391.1145 [DOI] [PubMed] [Google Scholar]
- Tice RR, Austin CP, Kavlock RJ and Bucher JR (2013). Improving the human hazard characterization of chemicals: A Tox21 update. Environ Health Perspect 121, 756–765. doi: 10.1289/ehp.1205784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Thriel C, Westerink RH, Beste C. et al. (2012). Translating neurobehavioural endpoints of developmental neurotoxicity tests into in vitro assays and readouts. Neurotoxicology 33, 911–924. doi: 10.1016/j.neuro.2011.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wishart DS, Knox C, Guo AC et al. (2008). Drugbank: A knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 36, D901–D906. doi: 10.1093/nar/gkm958 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang D, Kania-Korwel I, Ghogha A. et al. (2014). PCB 136 atropselectively alters morphometric and functional parameters of neuronal connectivity in cultured rat hippocampal neurons via ryanodine receptor-dependent mechanisms. Toxicol Sci 138, 379–392. doi: 10.1093/toxsci/kft334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zimmer B, Kuegler PB, Baudis B. et al. (2011a). Coordinated waves of gene expression during neuronal differentiation of embryonic stem cells as basis for novel approaches to developmental neurotoxicity testing. Cell Death Differ 18, 383–395. doi: 10.1038/cdd.2010.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zimmer B, Schildknecht S, Kuegler PB et al. (2011b). Sensitivity of dopaminergic neuron differentiation from stem cells to chronic low-dose methylmercury exposure. Toxicol Sci 121, 357–367. doi: 10.1093/toxsci/kfr054 [DOI] [PubMed] [Google Scholar]
- Zimmer B, Lee G, Balmer NV et al. (2012). Evaluation of developmental toxicants and signaling pathways in a functional test based on the migration of human neural crest cells. Environ Health Perspect 120, 1116–1122. doi: 10.1289/ehp.1104489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zimmer B, Pallocca G, Dreser N. et al. (2014). Profiling of drugs and environmental chemicals for functional impairment of neural crest migration in a novel stem cell-based test battery. Arch Toxicol 88, 1109–1126. doi: 10.1007/s00204-014-1231-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.