Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Jan 16.
Published in final edited form as: SLAS Discov. 2019 Sep 9;25(1):9–20. doi: 10.1177/2472555219873068

Cytotoxic Profiling of Annotated and Diverse Chemical Libraries Using Quantitative High Throughput Screening

Olivia W Lee 1, Shelley Austin 1, Madison Gamma 1, Dorian M Cheff 1, Tobie D Lee 1, Kelli M Wilson 1, Joseph Johnson 1, Jameson Travers 1, John C Braisted 1, Rajarshi Guha 1, Carleen Klumpp-Thomas 1, Min Shen 1,*, Matthew D Hall 1,*
PMCID: PMC10791069  NIHMSID: NIHMS1861520  PMID: 31498718

Abstract

Cell-based phenotypic screening is a commonly used approach to discover biological pathways, novel drug targets, chemical probes and high-quality hit-to-lead molecules. Many hits identified from high-throughput screening campaigns are ruled out through a series of follow-up potency, selectivity/specificity, and cytotoxicity assays. Prioritization of molecules with little or no cytotoxicity for downstream evaluation can influence the future direction of projects, so cytotoxicity profiling of screening libraries at an early stage is essential for increasing the likelihood of candidate success. In this study, we assessed cell-based cytotoxicity of nearly 10,000 compounds in NCATS annotated libraries, and over 100,000 compounds in a diversity library, against four ‘normal’ cell lines (HEK 293, NIH 3T3, CRL-7250 and HaCat) and one cancer cell line (KB 3–1, a HeLa subline). This large-scale library profiling was analyzed for overall screening outcomes, hit rates, pan-activity and selectivity. For the annotated library, we also examined the primary targets and mechanistic pathways regularly associated with cell death. To our knowledge, this is the first study to use high-throughput screening to profile a large screening collection (>100,000 compounds) for cytotoxicity in both normal and cancer cell lines. The results generated here constitutes a valuable resource for the scientific community and provides insight on the extent of cytotoxic compounds in screening libraries, identifying and avoiding compounds with cytotoxicity during high-throughput screening campaigns.

Introduction

The development of new chemical probes enables therapeutic target validation, hypothesis testing, and new insight into the biological role of genes and proteins 13. At the NIH National Center for Advancing Translational Sciences (NCATS), the development of small molecule probes for the scientific community allows understanding of rare and neglected diseases, novel targets, and enables basic biological understanding of the “undrugged” genome. This is accomplished through a team science approach that begins with assay development and automated quantitative high-throughput screening (qHTS) with a small molecule library to identify active hits 4. One or more chemotypes that emerge in a qHTS campaign may progress to medicinal chemistry to develop a small molecule probe with strong biochemical and/or cell-based activity, specificity, and optimized properties to enable use in in vivo models. To accommodate unbiased qHTS discovery for medicinal chemistry, large libraries of small molecules are created and curated, containing molecules that capture diverse chemical space that are ideally synthetically tractable. These large libraries of small molecules are generally referred to ‘diversity’ libraries or collections 5.

A second significant discovery screening strategy involves the creation of libraries of ‘annotated’ small molecules. Annotated libraries contain drugs, probes and tool molecules with one or more known mechanisms of action 6. They have emerged as information-rich databases to integrate both biological and chemical data. These can be screened in (primarily) cell or organism-based assays to identify targets relevant to a phenotype, or for potential drug repurposing 7. While compounds in a diversity library are expected to demonstrate weak biological activity, annotated libraries by definition are medicinal chemistry-optimized products with known activity and in many cases, known mechanism of actions. Both diversity collection and annotated library screening are important components of the NCATS Chemical Genomics Center (NCGC) program 8.

Throughout a qHTS campaign, the activities of hits are confirmed in a re-test, and a number of orthogonal and counter-assays are performed to confirm that the observed modulatory activity of is ‘on-target’. This is also to ensure that compounds demonstrating artefactual activity in an assay are triaged. For example, biochemical and cell-based assays that utilize firefly luciferase (fLuc) are sensitive to compounds that modulate luciferase activity 9. To ensure this is not the case, the screening libraries at NCATS are ‘profiled’ for inhibitory activity of fLuc, allowing compounds that interfere with luciferase to be automatically triaged from hit lists without the need for additional screening. In one study it was shown that ~5% of compounds in a qHTS library inhibit fLuc 10. Perhaps the most notorious example of the need to perform counter-assays is the drug ataluren (PTC124) approved for treatment of patients with Duchenne muscular dystrophy, that may have been discovered due to fLuc inhibition rather than on-target activity 11.

A resurgence in cell-based screening, both target-based and phenotypic, means that an increasing number of cell-based qHTS campaigns are performed. Although a number of phenotypic screening approaches have been taken to library profiling 1223, the profiling of a big library using direct cytotoxicity read-out is less frequently reported. An early report of assessing compound cytotoxicity across a library of 1,408 compounds appears in the literature 24, and more recently, two studies examining cell killing with multiplexed assays were reported; one using a high-content assay against ~12,000 screening library (‘diversity’) compounds 25, and the other using a multiplex assay against ~10,000 environmental toxins 26. A significant collection of disease-agnostic ‘annotated’ small molecules exist in libraries at NCATS, including a collection of drugs approved by American, European and Japanese therapeutic regulatory agencies (the ‘NCATS Pharmaceutical Collection’, NPC 27), and many small molecules reported as tools, probes, or clinical/pre-clinical candidates. Understandably, a significant proportion of these molecules arose from oncology programs, and a portion of NCATS’ collaborations are oncology-related phenotypic screens. We were motivated to profile the activity of our annotated libraries for two reasons: first, to allow scientists to have a reference dataset for discerning compounds whose activity is ‘selective’ for cancer cell lines versus a set of non-cancer ‘normal’ cell lines; and second, to enable scientists to discriminate promiscuous/cytotoxic compounds when reviewing data from cell-based/phenotypic assays and provide valuable input for prioritizing compounds for further evaluation.

To this end, we assessed the cytotoxicity of NCATS annotated libraries of nearly 10,000 compounds against four ‘normal’ cell lines (HEK 293, NIH 3T3, CRL-7250 and HaCat) and a cancer cell line (KB 3–1, a HeLa subline), and examined the hit rates and mechanistic pathways regularly associated with cell death. Furthermore, we assessed that activity of a diversity library (>100,000 compounds) against two ‘normal’ cell lines (HEK 293 and NIH 3T3), assessed hit rate, and compared active compounds against a cancer cell line (KB 3–1). This study provides insight into the extent of cell-based killing activity in annotated and diversity libraries, and the importance of confirming that active compounds in phenotypic screens are not cytotoxic.

Materials and Methods

Profiling annotated/diversity libraries and cherry-picked compounds.

HEK 293, NIH 3T3, CRL-7250, HaCat and KB 3–1 cells were seeded into white 1536-well plates using a Multidrop Combi peristaltic dispenser (ThermoFisher, Waltham, MA) at a density of 250, 400, 500, 500, 500 cells/well in 5 μL of medium respectively. A pintool (Kalypsys) was used to transfer 23 nL of compound solution to the 1536-well assay plates. After 48 hr incubation at 37 °C, 5% CO and 85% humidity, 2.5 μL of CellTiter-Glo (Promega) was dispensed into each well using a dispenser (Aspect Automation, St. Paul, MN) with solenoid valves (Lee Valves, Westbrook CT). Plates were left at room temperature for 10 min before imaging the ATP-coupled luminescence using a ViewLux microplate imager (PerkinElmer, Waltham, MA).

Luciferase assay protocol.

Assays determining firefly luciferase inhibition were performed as previously described 28. Briefly, 3 μL of luciferase substrate solution (10 μM ATP, 10 μM D-Luciferin, 10 mM Mg-Acetate, 0.01 % Tween-20, 0.05 % BSA, 50 mM Tris Acetate, pH 7.6, in final 4 μL volume) was dispensed into each well of white, solid bottom, 1536-well plates using a dispenser. A pintool (Kalypsis) was used to transfer 23 nL of compound solution to the assay plates. Following a 15 minute incubation at room temperature protected from light, 1 μL of purified luciferase enzyme solution was added to a final concentration of 10 nM Photinus pyralis luciferase (Sigma). Luminescence was detected by a Viewlux (Perkin-Elmer, Waltham, MA) by using a 10 s exposure time and 2X binning.

Data analysis and clustering of compounds by activity outcomes.

To determine compound activity in the qHTS assay, the concentration-response data for each sample was plotted and modeled by a four parameter logistic fit yielding EC50 and efficacy (maximal response) values as previously described 29. Raw plate reads for each titration point were first normalized relative to positive control (9.2 μM Bortezomib, −100% activity, full inhibition) and DMSO only wells (basal, 0% activity). Data normalization and curve fitting were performed using in-house informatics tools. Compounds were designated as Class 1–4 according to the type of concentration–response curve (CRC) observed. In brief, Class −1.1 and −1.2 were the highest-confidence complete CRCs containing upper and lower asymptotes with efficacies ≥ 80% and < 80%, respectively. Class −2.1 and −2.2 were incomplete CRCs having only one asymptote with efficacy ≥ 80% and < 80%, respectively. Class −3 CRCs showed activity at only the highest concentration or were poorly fit. Class 4 CRCs were inactive having a curve-fit of insufficient efficacy or lacking a fit altogether.

Compounds were further clustered hierarchically using TIBCO Spotfire 6.0.0 (Spotfire Inc., Cambridge, MA. https://spotfire.tibco.com/) based on their activity outcomes from the primary or follow up screen across different cell lines. Compound’s AUC calculated based on the qHTS data analysis and curve fittings were utilized for clustering. In the heatmap, darker color indicates compounds that are more potent and efficacious, i.e. high-quality actives, and lighter color indicates less potent and efficacious compounds. If a compound didn’t show any activity in an assay, it was highlighted as white in the heatmap.

Statistical analysis.

In order to determine whether the hits predominantly identified from the specific therapeutic categories were over-represented in the chemical library, an enrichment analysis was implemented against the drug library. 9,893 compounds in the annotated library were broken down to different therapeutic categories based on their primary mechanisms of action and pharmaceutical indications, the enrichment was calculated from the following formula: E = a/n, given a is the number of actives and n is the total number of drugs in each therapeutic category. Fisher’s exact test was used as a measure of the consensus cytotoxicity potential of compounds in each MOA; all calculations were performed in R statistical computing software (https://www.r-project.org/).

Results

Profiling of an annotated library

Annotated libraries were profiled for cell viability by performing a primary screen against four normal cell lines using CellTiter-Glo (CTG) as the assay read-out (screening assay protocol displayed in Supplementary Table 1). Three of the cell lines are immortalized (HEK 293, NIH 3T3 and HaCat), while CRL-7250 is a primary cell line. A total of 9,893 compounds were tested in either an 8-pt concentration-response ranging from 0.6 nM to 46 μM (1:5 dilution) or an 11-pt concentration-response ranging from 0.8 nM to 46 μM (1:3 dilution) according to different compound plating mechanisms.

Variation of sensitivity was observed across all four cell lines as shown in Supplementary Table 2. The HEK 293 cell line was more sensitive to compound treatment (Figure 1a), with 41% of compounds eliciting a dose-response cell killing effect (16% high-quality actives, 25% low-quality actives), while the three remaining cell lines demonstrated reduced sensitivity (7.9 – 10.6% high-quality actives, 14.2–18.1% low-quality actives), where the ‘high-quality’ actives were defined as compounds in class −1.1, −1.2, −2.1, and −2.2, with maximal response ≤ −50% and EC50 values less than 10 μM. The detailed information for data analysis and curve class definition is provided in Methods section. Comparison of compounds active against the four cell lines (Figure 1b, Venn diagram) revealed 496 compounds active only against HEK 293 cells (consistent with its greater sensitivity to compounds). 409 compounds demonstrated pan-activity against all four cell lines, which corresponds to a consensus hit rate of 4.1% for the entire annotated library.

Figure 1.

Figure 1.

a) Pie chart distribution of high-quality actives (orange), low-quality actives (yellow), and inactives (green) identified from the annotated compound library qHTS against the four normal cell lines for 48 hr incubation condition. High-quality actives: compounds in class −1.1, −1.2, −2.1, and −2.2, maximum response ≤ −50%, and EC50 value ≤ 10 μM. Inactives: compounds in curve class 4; low-quality actives: compounds with shallow curves, single-point activity or inconclusive activity. b) Compound overlapping Venn diagram for HEK 293, HaCat, NIH 3T3, and CRL-7250 cell lines. Number of high-quality actives in each cell line and number of compounds overlapped were calculated.

Comparison of the AUC (Area Under the dose-response Curve) values of each compound using unbiased hierarchical clustering across four cell lines revealed some cross-correlation of activity, with the HEK 293 cells clustering away from the other cell lines (Figure 2a). Given the long-term intention of utilizing profiling data to make comparisons with activity against cancer cell lines, we also tested the library against KB 3–1 human adenocarcinoma cells (KB 3–1 cells are a HeLa sub-clone). Intriguingly, the KB 3–1 cell line did not demonstrate remarkably different sensitivity compared to the normal cells, but surprisingly its sensitivity clustered close with the NIH 3T3 murine fibroblast line. The pair-wise AUC correlation with R2 for all actives among five cell lines were calculated and shown in Figure 2b, the R2 ranges from 0.31 to 0.77. NIH 3T3 and KB 3–1 showed the strongest correlation with R2 = 0.77. Violin plots of the distribution of sensitivity to compounds across the libraries reinforce that the majority of compounds were not cytotoxic in cell lines (Figure 2c, pink area), while focused plots of only high-quality actives (Figure 2c, cyan area) show that only a small number of compounds are highly cytotoxic (AUC < −300) against cell lines.

Figure 2.

Figure 2.

a) The comparison of AUC values of each compound in four normal cell lines and KB 3–1 human adenocarcinoma cells. In the heat map each row corresponds to a compound and each column to a cell line. Darker red color indicates more potent and efficacious compound. b) Pair-wise R2 correlation matrix among five cell lines. Only high-quality actives were included in this analysis and the Pearson correlation was based on AUC values. c) Split violin plot showing the distribution of AUC values for all compounds screened and high-quality actives across four normal cell lines. The lines within each distribution area represent the 0.25, 0.5 and 0.75 quantiles.

One strength of annotated compound libraries is the ability to integrate compound mechanism of action (MOA) and targets into downstream analysis. Each compound that met the criteria for high-quality active in each cell line were aggregated by mechanism of action. The treemap shown in Figure 3 contains 1 box for each MOA containing three or more compounds. The size of each box is representative of the number of compounds with that MOA. The MOAs with the most compounds were antibacterial agents and PI3 Kinase inhibitors with 42 compounds present in each. We also assessed average AUC for each MOA which is represented in Figure 3 by the color gradient for each box. Kinesin like Spindle Protein (KIF11) inhibitors and proteasome inhibitors had the lowest average AUC of all MOAs.

Figure 3.

Figure 3.

A treemap representation of the mechanisms of action (MOA) of all high-quality actives from the annotated library compounds screened. Box size represents the total number of compounds representing each MOA (bigger box size indicates more compounds present in the high-quality actives). Color represents the average AUC from the cytotoxicity screen in four normal cell lines (darker red indicates a lower AUC meaning a more potent and efficacious hits). 1 = Lineage specific differentiation; 2 = RNA polymerase; 3 = Immuno-suppressant.

There were several instances where multiple molecules with the same target were screened. To examine which MOAs were enriched among molecules active against cell lines, the proportion of active drugs to total drugs for each MOA target were used to derive an ‘enrichment ratio’ (Figure 4a, where ratio = 1 represents 100% active, ratio = 0 represents none active). All classes of MOA targets with at least five compounds presented in the annotated library were included in the enrichment analysis. The top five target classes that resulted in consensus cell killing against the normal cell lines were proteasome, heat shock protein 90 (HSP90), anaplastic lymphoma kinase (ALK), mammalian target of rapamycin (mTOR), and cyclin-dependent kinase (CDK), with the enrichment ratio greater than 50%. To examine the significance of the association between consensus activity outcomes with the primary MOA, Fisher’s exact test was applied on the entire data set and the statistical significance was reported in Supplementary Table 3. Eight out of eleven highly enriched MOA classes have P-value < 0.05, and seven classes with P-value < 0.001, indicating that compounds with similar activity profiles as determined in normal cell line screens tend to share similar annotated MOAs. These highly enriched MOAs (along with many other targets shown in Figure 4a) are generally associated with anti-cancer cell killing by targeting essential cellular processes. For example, nine of ten proteasome inhibitors tested resulted in acute cytotoxicity across all four cell lines (see proteasome inhibitor delanzomib, Figure 4b). As a reference point, the cancer cell line KB 3–1 did not demonstrate hypersensitivity compared with the normal cell lines, this is also consistent to the results we discussed in the heatmap clustering analysis (Figure 2a). The global cytotoxicity of proteasome inhibitors is displayed by radar plot (Figure 4c) and is extended to other mechanisms of actions (supplementary Figure 1).

Figure 4.

Figure 4.

a) Enrichment analysis of active agents in each drug category. Enrichment ratio = the number of actives/the total number of drugs in each drug category. b) Dose-response curves for delanzomib, a representative proteasome inhibitor, in cytotoxicity screens. EC50 HEK 293 = 148 nM, EC50 NIH 3T3 = 82 nM, EC50 CRL-7250 = 205 nM, EC50 HaCat = 326 nM, EC50 KB 3–1 = 186nM. c) A radar plot displaying Log EC50 of all active proteasome inhibitors in four normal cell lines. d) Dose-response curves for TC-S7004, a potent and selective DYRK1A/B inhibitor, in cytotoxicity screens, including KB 3–1 cells. EC50 KB 3–1 = 3.2 μM. e) Dose-response curves for LDN-212854, an ALK2 inhibitor, in cytotoxicity screens, including KB 3–1 cells. EC50 HEK 293 = 16.4 μM, EC50 KB 3–1 = 4.1 μM.

As outlined, one use of cell profiling data for mechanistically-annotated screening libraries is to enable the identification of targeted agents selectively killing cancer cells. Here we defined that a selective agent should show strong cell killing to KB 3–1 cells with EC50 ≤ 10 μM and have no effect to normal cell lines or at least 10-fold of EC50 shift in normal cell lines. Two such examples of selective killing are displayed in Figures 4d and 4e. First, the dual specificity tyrosine phosphorylation regulated kinase 1A and 1B (DYRK1A and DYR1B) inhibitor TC-S7004 demonstrated complete killing of KB 3–1 cells (EC50 = 3.2 μM), with no effect on any of the four normal cell lines. Second, the potent and selective BMP receptor inhibitor LDN-212854 demonstrated complete killing of KB 3–1 cells (EC50 = 4.1 μM), with no effect on NIH 3T3 and CRL-7250 cells, and only partial killing of one normal line (HEK 293, EC50 = 16.4 μM).

Data for annotated library screen was deposited in PubChem with AID 1296008, and you can access this assay with the following link: https://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid=1296008

Profiling of a diversity library

The diversity library was profiled for cell viability by performing a primary screen against HEK 293 and NIH 3T3 cell lines, using CTG as the assay read-out (screening assay protocol displayed in Supplementary Table 1). A total of 102,726 compounds were tested in a 4-pt dose response manner at varied concentrations with nearly 80% of the compounds tested at 115, 58, 12 and 2.3 μM. 30.0% and 34.6% of the library was cytotoxic to cell lines NIH 3T3 and HEK 293 respectively, with 1.2% of the library classified as ‘high-quality’ actives in each of the cell lines (Figures 5a and 5b). Of these high-quality actives, there were 285 compounds overlapped and shared similar activity in both NIH 3T3 and HEK 293 (Figure 5 c).

Figure 5.

Figure 5.

a-b) Pie chart distribution of high-quality actives (orange), low-quality actives (yellow), and inactives (green) identified from the qHTS against diversity collection compound library in NIH 3T3 and HEK 293 cell line, respectively. c) Compound overlapping Venn diagram for NIH 3T3 and HEK 293 cell lines. Number of high-quality actives in each cell line was calculated and number of compounds overlapped were labeled. d) The comparison of AUC values of 588 cherry-picked compounds in HEK 293, NIH 3T3 and KB 3–1 cell lines. In the heat map each row corresponds to a compound and a column to a cell line. Darker red color indicates more potent and efficacious compound.

After cherry-picking 588 hits based on their potency, maximal response in primary qHTS assay and structural features, confirmation of activity was performed in 12 dose-point testing for all the cherry-picked compounds against HEK 293, NIH 3T3, along with an orthogonal test against the KB 3–1 cell line. The final concentration of the compounds in the 5 μl assay volume ranged from 0.3 nM to 46 μM. To analyze the data, we generated a heat map with hierarchical clustering analysis (dendrogram) of compound activities based on their activity outcomes, which shows that NIH 3T3 clusters away from HEK 293 and KB 3–1 (Figure 5d).

The Promega CTG reagent utilizes an engineered version of firefly luciferase to measure ATP concentrations. As such, we tested compounds that appeared cytotoxic against recombinant fLuc in a biochemical assay to triage any compounds that were inhibiting fLuc rather than reducing cell viability. A number of compounds inhibited fLuc, and two examples from a novel fLuc inhibitor chemotype are shown in Figures 6a and 6b. In both cases, the compounds appeared equally active against all three cell lines in the CTG assay and demonstrated potent fLuc inhibition. The Promega CTG reagent contains a proprietary engineered fLuc that claims to be resistant to inhibition compared with wild-type fLuc 30. This indeed appears to be the case, as a counter-assay testing hits in cell-free media with CTG and ATP added as substrate demonstrated at least a 10-fold lower sensitivity to inhibition. Bright-field microscopy of cells grown in the presence of NCGC00413522 and NCGC00413607 (Figure 6c) demonstrated unaffected cell growth at 48 hr at concentrations suggested to be cytotoxic by CTG dose-response data (DMSO and 500 nM bortezomib as negative and positive control respectively).

Figure 6.

Figure 6.

a) Dose-response curves for NCGC00413522 in cytotoxicity screens, luciferase inhibition assay and CellTiter-Glo luciferase assay. EC50 HEK 293 = 8.1 μM, EC50 NIH 3T3 = 9.1 μM, EC50 KB 3–1 = 8.1 μM. EC50 Luciferase = 0.89 μM. EC50 CTG Luciferase = 26.6 μM. b) Dose-response curves for NCGC00413607 in cytotoxicity screens, luciferase inhibition assay and CellTiter-Glo luciferase assay. EC50 HEK 293 = 3.2 μM, EC50 NIH 3T3 = 2.3 μM, EC50 KB 3–1 = 2.3 μM. EC50 Luciferase = 0.08 μM. EC50 CTG Luciferase = 7.3 μM. c) Representative bright field images of HEK293 cells after 48 hr treatment.

Moreover, comparing the AUC values of high-quality actives in the three cell lines, we found that they moderately correlated: NIH 3T3 vs HEK 293, R2 = 0.77; NIH 3T3 vs KB 3–1, R2 = 0.59; KB 3–1 vs HEK 293, R2 = 0.61 (Supplementary Figures 2 a, b, c). Sixty-one of the top-ranking hits were active across all cell lines (pan-active), whereas others were more selective toward a specific cell line or cell lines. If only taking the pan-actives that showed strong cytotoxicity against all three cell lines in the analysis, it was clearly shown that most of the compounds have very good EC50 agreement with low standard deviation, except for few outlier compounds that showed differential cell killing effect in three cell lines (Supplementary Figures 2d). These outlier compounds had greater cell killing against two normal cell lines (HEK 293 and NIH 3T3) although the selectivity window to the cancer cell line KB-3–1 was not significant, individual dose-response curves were shown in Supplementary Figure 3. Among these five compounds, NCGC00420512, NCGC00420455 and NCGC00420435 belonged to the same structural chemotype with a tetrahydropyrazolo-pyrimidine core. To examine if the observed selectivity is a chemotype-specific outcome, we further analyzed the screening data through substructure search and found 172 compounds sharing the same tetrahydropyrazolo-pyrimidine scaffold presented in our diverse library, but none of the remaining 169 compounds showed similar differential cell killing effect, indicating that the selectivity is irrelevant to the specific structural features.

The remaining 56 pan-actives were cytotoxic across all three cell lines lacking any selectivity toward either normal or cancer lines. Representatives of the top ranking pan-actives were shown in Supplementary Figure 2e. Pan-actives were further clustered using fingerprint structural keys and Tanimoto coefficient as similarity metric in Molecular Operating Environment software (MOE, https://www.chemcomp.com/). We identified three enriched clusters based on their structural similarity using MACCS fingerprint and Tanimoto coefficient (Supplementary Table 4). The most significant enriched scaffolds are in Clusters 1 and 2. Members in Cluster 1 shared the same pyrimidine-quinuclidine core, with comparable EC50 values across all three cell lines ranging from 7 to 12 μM, except one compound NCGC00421344 which was > 10-fold more potent in HEK 293 cell than the other two cell lines. Cluster 2 had the spiro structural feature and it was the most cytotoxic cluster among all four clusters discussed in Supplementary Table 4. Compounds in this cluster showed almost identical activity across three cell lines, with EC50 values ranging from 0.5 to 3 μM for most of the compounds. For example, NCGC00419015 was the most cytotoxic agent in Cluster 2 with EC50 values for HEK 293, NIH 3T3 and KB 3–1 being 0.51, 0.72 and 0.64 μM, respectively. Furthermore, Cluster 3 with tetrahydro-imidazo-diazepine core was a less enriched cluster but also showed as non-selective cluster in which compounds were lacking selectivity against a specific cell line. We also ran the pan-actives against multiple toxicity datasets including Bursi Mutagenicity dataset 31 (4,337 compounds with mutagenicity data) and National Toxicology Program dataset 32 (503 structures with carcinogenicity data for male/female mouse and male/female rat), but found no overlapping structures which indicated that the chemotypes identified from our diverse library profiling are new additions to the well-known toxic collections.

Data for diverse chemical library screen was deposited in PubChem with AID 1345083 for HEK 293 cell line and AID 1345082 for NIH 3T3 cell line. You can access the assay data with the following links: https://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid=1345083 https://pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid=1345082

Discussion

Here, we describe our effort to profile the cytotoxicity of two distinct libraries: 1. our annotated libraries (nearly 10,000 compounds of known MOAs or therapeutic indications) against four ‘normal’ cell lines (HEK 293, NIH 3T3, CRL-7250 and HaCat) and a cancer cell line (KB 3–1, a HeLa sub-line); and 2. a diversity library (>100,000 compounds) against two ‘normal’ cell lines (HEK 293 and NIH 3T3) and a cancer cell line (KB 3–1, a HeLa sub-line). The assays were all performed in quantitative HTS format. Hit rates and mechanistic pathways regularly associated with cell death are described. As one might reasonably anticipate, ‘annotated’ (or mechanistic) libraries containing small molecules developed against targets or known to possess phenotypic activity have a high hit rate (7.91–16.05%) against the cell lines, especially for many annotated molecules were developed against oncology targets.

The large diversity collection revealed a low rate of profound cytotoxicity (1.2%), and this is consistent with the fact that the majority are not optimized against targets, but reinforces the need to confirm toxicity of such compounds through counter-assays as part of screening for compound prioritization and triage. In our case, this profiling data is now used at NCATS to allow for rapid cross-referencing of active compounds in a screen against our dataset of general cytotoxicity in normal cell lines. To facilitate the use of these data by the broader research community, we have also made a significant amount of this data available through PubChem.

What is a ‘normal’ cell line, and what are their limitations? The four ‘normal’ cell lines used in this study were selected for a number of reasons. HEK 293 and NIH 3T3 cell lines are very commonly employed in the scientific literature as ‘control’, ‘normal’, or ‘comparator’ cell lines, along with conventional research uses. A common criticism of these lines is how normal they are given their capacity for uncontrolled cell growth. The HEK 293 cell line is one of the most utilized tool cell lines. It is a neuronal lineage line, generated in 1973 from embryonic kidney cells from an aborted fetus immortalized by transformation with adenovirus 33. The NIH 3T3 murine fibroblast cell line was generated in 1962 from Swiss albino mouse embryo, and spontaneously immortalized after multiple passages 34. The HaCat human keratinocyte cell line spontaneously immortalized in culture and was first reported in 1988 35. All three of the aforementioned immortalized cell lines are adherent, and all are non-diploid despite being non-cancer non-tumorigenic cells. The CRL-7250 cell line differs from the others in that it is a primary, non-immortalized fibroblast cell line generated from human foreskin 36. The HEK 293 cell line demonstrated greater sensitivity than the other three ‘normal’ cell lines in the annotated library screen.

One potential use for normal cell line profiling data is to enable the identification of compounds selectively active against cancer cell lines based on their target biology. A well-known example are inhibitors of MAPK/ERK kinase (MEK), that elicit acute cell killing in cells harboring somatic activating mutations of Ras, but not in cells expressing wild-type Ras (such as the ‘normal’ cells discussed here 37). As a proof-of-concept comparison, we tested the KB 3–1 adenocarcinoma cells against the annotated libraries. KB 3–1 cells are a sub-clone of HeLa cells 38, originally called KB squamous cell carcinoma before it was identified as a HeLa contaminant 39and has subsequently been studied and acknowledged as such. KB cells do not possess activating Ras mutations, but selective activity was seen by a DYRK1A/B inhibitor that killed KB 3–1 cells without affecting the normal cells, demonstrating the utility. This profiling data has already been applied to analysis of multiple oncology-related screens across NCATS.

Perhaps the key result (reassuring from a screening perspective) from cytotoxicity profiling of >100,000 diversity library compounds was the very low rate of cytotoxicity observed, though some chemotype-related activity was observed. Cell lines like HEK 293 (along with cell lines such as Chinese hamster ovary, CHO, cell line) are commonly used to engineer reporter cell lines for cell target-based and high-content assays, and the relative insensitivity to diversity libraries supports their utilization. The only prior study we have identified (also from NCATS but not involving any of the current authors) tested 1,408 compounds against thirteen cell lines, including a number of cancer cell lines along with HEK 293 and NIH 3T3 cells 24. While the compound number was relatively modest in scale, the normal HEK 293 and NIH 3T3 cells were reported to be among the least sensitive, supporting the basis for utilization of normal cell lines rather than cancer cell lines for designing cell-based assays for discovery screening.

The data reported here is made available through PubChem, and it is hoped this data acts as a general guide for normal cell line sensitivity to killing and assists in guiding others in the design of counter-assays for high-throughput screening of cell-based assays.

Supplementary Material

Paper Suppl

Acknowledgements

We thank Dr. Aleksandra Michaloa for advice regarding data analysis. This work was supported by the NCATS Division of Pre-Clinical Innovation Intramural Program.

Footnotes

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

References

  • 1.Schreiber SL; Kotz JD; Li M; et al. Advancing Biological Understanding and Therapeutics Discovery with Small-Molecule Probes. Cell 2015, 161, 1252–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Moustakim M; Felce SL; Zaarour N; et al. Target Identification Using Chemical Probes. Methods Enzymol 2018, 610, 27–58. [DOI] [PubMed] [Google Scholar]
  • 3.Arrowsmith CH; Audia JE; Austin C; et al. The promise and peril of chemical probes. Nat Chem Biol 2015, 11, 536–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Michael S; Auld D; Klumpp C; et al. A robotic platform for quantitative high-throughput screening. Assay Drug Dev Technol 2008, 6, 637–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Renner S; Popov M; Schuffenhauer A; et al. Recent trends and observations in the design of high-quality screening collections. Future Med Chem 2011, 3, 751–66. [DOI] [PubMed] [Google Scholar]
  • 6.Wassermann AM; Camargo LM; Auld DS Composition and applications of focus libraries to phenotypic assays. Front Pharmacol 2014, 5, 164. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Coussens NP; Braisted JC; Peryea T; et al. Small-Molecule Screens: A Gateway to Cancer Therapeutic Agents with Case Studies of Food and Drug Administration-Approved Drugs. Pharmacol Rev 2017, 69, 479–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Thomas CJ; Auld DS; Huang R; et al. The pilot phase of the NIH Chemical Genomics Center. Curr Top Med Chem 2009, 9, 1181–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Auld DS; Lovell S; Thorne N; et al. Molecular basis for the high-affinity binding and stabilization of firefly luciferase by PTC124. Proc Natl Acad Sci U S A 2010, 107, 4878–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Auld DS; Thorne N; Nguyen DT; et al. A specific mechanism for nonspecific activation in reporter-gene assays. ACS Chem Biol 2008, 3, 463–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Auld DS; Thorne N; Maguire WF; et al. Mechanism of PTC124 activity in cell-based luciferase assays of nonsense codon suppression. Proc Natl Acad Sci U S A 2009, 106, 3585–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Quereda V; Hou S; Madoux F; et al. A Cytotoxic Three-Dimensional-Spheroid, High-Throughput Assay Using Patient-Derived Glioma Stem Cells. SLAS Discov 2018, 23, 842–849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hou S; Tiriac H; Sridharan BP; et al. Advanced Development of Primary Pancreatic Organoid Tumor Models for High-Throughput Phenotypic Drug Screening. SLAS Discov 2018, 23, 574–584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Joslin J; Gilligan J; Anderson P; et al. A Fully Automated High-Throughput Flow Cytometry Screening System Enabling Phenotypic Drug Discovery. SLAS Discov 2018, 23, 697–707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ding M; Clark R; Bardelle C; et al. Application of High-Throughput Flow Cytometry in Early Drug Discovery: An AstraZeneca Perspective. SLAS Discov 2018, 23, 719–731. [DOI] [PubMed] [Google Scholar]
  • 16.Young J; Margaron Y; Fernandes M; et al. MyoScreen, a High-Throughput Phenotypic Screening Platform Enabling Muscle Drug Discovery. SLAS Discov 2018, 23, 790–806. [DOI] [PubMed] [Google Scholar]
  • 17.Giuliano KA; Wachi S; Drew L; et al. Use of a High-Throughput Phenotypic Screening Strategy to Identify Amplifiers, a Novel Pharmacological Class of Small Molecules That Exhibit Functional Synergy with Potentiators and Correctors. SLAS Discov 2018, 23, 111–121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Collia D; Bannister TD; Tan H; et al. A Rapid Phenotypic Whole-Cell Screening Approach for the Identification of Small-Molecule Inhibitors That Counter beta-Lactamase Resistance in Pseudomonas aeruginosa. SLAS Discov 2018, 23, 55–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Madoux F; Tanner A; Vessels M; et al. A 1536-Well 3D Viability Assay to Assess the Cytotoxic Effect of Drugs on Spheroids. SLAS Discov 2017, 22, 516–524. [DOI] [PubMed] [Google Scholar]
  • 20.Booij TH; Bange H; Leonhard WN; et al. High-Throughput Phenotypic Screening of Kinase Inhibitors to Identify Drug Targets for Polycystic Kidney Disease. SLAS Discov 2017, 22, 974–984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang SS; Ehrlich DJ Image-Based Phenotypic Screening with Human Primary T Cells Using One-Dimensional Imaging Cytometry with Self-Tuning Statistical-Gating Algorithms. SLAS Discov 2017, 22, 985–994. [DOI] [PubMed] [Google Scholar]
  • 22.Cao Q; Yao J; Li H; et al. Cellular Phenotypic Analysis of Macrophage Activation Unveils Kinetic Responses of Agents Targeting Phosphorylation. SLAS Discov 2017, 22, 51–57. [DOI] [PubMed] [Google Scholar]
  • 23.Boehnke K; Iversen PW; Schumacher D; et al. Assay Establishment and Validation of a High-Throughput Screening Platform for Three-Dimensional Patient-Derived Colon Cancer Organoid Cultures. J Biomol Screen 2016, 21, 931–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Xia M; Huang R; Witt KL; et al. Compound cytotoxicity profiling using quantitative high-throughput screening. Environ Health Perspect 2008, 116, 284–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Chiaravalli J; Glickman JF A High-Content Live-Cell Viability Assay and Its Validation on a Diverse 12K Compound Screen. SLAS Discov 2017, 2472555217724745. [DOI] [PubMed] [Google Scholar]
  • 26.Hsieh JH; Huang R; Lin JA; et al. Correction: Real-time cell toxicity profiling of Tox21 10K compounds reveals cytotoxicity dependent toxicity pathway linkage. PLoS One 2017, 12, e0181291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Huang R; Southall N; Wang Y; et al. The NCGC pharmaceutical collection: a comprehensive resource of clinically approved drugs enabling repurposing and chemical genomics. Sci Transl Med 2011, 3, 80ps16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Auld DS; Southall NT; Jadhav A; et al. Characterization of chemical libraries for luciferase inhibitory activity. J Med Chem 2008, 51, 2372–86. [DOI] [PubMed] [Google Scholar]
  • 29.Inglese J; Auld DS; Jadhav A; et al. Quantitative high-throughput screening: a titration-based approach that efficiently identifies biological activities in large chemical libraries. Proc Natl Acad Sci U S A 2006, 103, 11473–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hook B; Schagat T. Compound Interference of CellTiter-Glo® vs PE ATPlite 1 step. Accessed 2018 Aug 23. https://www.promega.com/-/media/files/resources/posters/compound-interference-of-celltiter-glo-vs-pe-atplite-1-step-poster.pdf [Google Scholar]
  • 31.Kazius J; McGuire R; Bursi R. Derivation and Validation of Toxicophores for Mutagenicity Prediction. Journal of Medicinal Chemistry 2005, 48, 312–320. [DOI] [PubMed] [Google Scholar]
  • 32.US National Toxicology Program data set. 2001.
  • 33.Graham FL; Smiley J; Russell WC; et al. Characteristics of a human cell line transformed by DNA from human adenovirus type 5. J Gen Virol 1977, 36, 59–74. [DOI] [PubMed] [Google Scholar]
  • 34.Todaro GJ; Green H. Quantitative studies of the growth of mouse embryo cells in culture and their development into established lines. J Cell Biol 1963, 17, 299–313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Boukamp P; Petrussevska RT; Breitkreutz D; et al. Normal keratinization in a spontaneously immortalized aneuploid human keratinocyte cell line. J Cell Biol 1988, 106, 761–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Skroder HM; Hamadani JD; Tofail F; et al. Selenium status in pregnancy influences children’s cognitive function at 1.5 years of age. Clin Nutr 2015, 34, 923–30. [DOI] [PubMed] [Google Scholar]
  • 37.Zhao Y; Adjei AA The clinical development of MEK inhibitors. Nat Rev Clin Oncol 2014, 11, 385–400. [DOI] [PubMed] [Google Scholar]
  • 38.Cornwell MM; Gottesman MM; Pastan IH Increased vinblastine binding to membrane vesicles from multidrug-resistant KB cells. J Biol Chem 1986, 261, 7921–8. [PubMed] [Google Scholar]
  • 39.Vaughan L; Glanzel W; Korch C; et al. Widespread Use of Misidentified Cell Line KB (HeLa): Incorrect Attribution and Its Impact Revealed through Mining the Scientific Literature. Cancer Res 2017, 77, 2784–2788. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Paper Suppl

RESOURCES