Abstract
Computational methods for image-based profiling are under active development, but their success hinges on assays that can capture a wide range of phenotypes. We have developed a multiplex cytological profiling assay that “paints the cell” with as many fluorescent markers as possible without compromising our ability to extract rich, quantitative profiles in high throughput. The assay detects seven major cellular components. In a pilot screen of bioactive compounds, the assay detected a range of cellular phenotypes and it clustered compounds with similar annotated protein targets or chemical structure based on cytological profiles. The results demonstrate that the assay captures subtle patterns in the combination of morphological labels, thereby detecting the effects of chemical compounds even though their targets are not stained directly. This image-based assay provides an unbiased approach to characterize compound- and disease-associated cell states to support future probe discovery.
Introduction
Gene-expression profiling, the most established unbiased profiling method, has been used to support small-molecule discovery in number of ways. For example, gene expression has been used to define disease states, such as those caused by genomic alterations in cancer, thereby enabling identification of compounds that reverse the cellular phenotype to a preferable state [1]. Gene expression has also been used to infer compound mechanism of action by revealing that previously unconnected compounds yield similar profiles in cells, or by revealing that sets of genes enriched for those having specific functions are regulated in a concerted manner [2,3]. Microscopy images of cells are increasingly being used for profiling [4,5] because they contain a large amount of quantitative information about a wide range of complex phenotypes, and because image-based assays can be scaled to medium and high throughput with relative ease. It has for some time been possible to measure hundreds of properties of individual cells in microscopy images [6] and to find nonlinear combinations of features that can identify complex phenotypes [7]. Computational methods for image-based profiling are under active development [8-13], but have largely been applied to assays that model particular phenotypes of interest with minimal numbers of labels. Applying these methods in a more unbiased manner to, for example, discover new phenotypes of interest, requires development of an assay that can capture a much wider range of phenotypes.
Results
We sought to develop an assay that “paints the cell” with as many fluorescent morphological labels as possible without compromising our ability to extract quantitative image-based profiles in high throughput. We present a multiplex cytological profiling assay that allows detection of seven major cell components (Figure 1A ), and we demonstrate its ability to capture a wide range of cellular phenotypes induced by small molecules (Figure 1B ). Further, we demonstrate the ability of the profiling data to connect compounds with similar mechanisms of action (Figure 2 ). Because the profiles capture subtle patterns in the combination of morphological labels, the assay can detect the effects of chemical compounds even though their targets are not stained directly.
We considered only well-characterized, fluorescent, non-antibody dyes suitable for high-throughput application. We first screened a number of potential dyes for those with high signal, low background, assay buffer compatibility, fixation and permeabilization condition compatibility, staining time, and optical spectra. To ensure compatibility with commonly available microscopes, we limited the protocol to detecting stains in five channels. Within that constraint, we increased the degree of multiplexing by including two dyes for a given optimal spectrum if they stained spatially distinct cellular components that could be distinguished during analysis. The staining protocol was optimized largely based on qualitative assessment of cellular features of interest. Particular attention was paid to the relative concentration of WGA and phalloidin to allow visualization of the Golgi apparatus, but not at the expense of detection of actin filaments. Pilot plates were assayed with varying concentrations of WGA and phalloidin. Images were examined by eye to select the optimal concentrations.
The final protocol involves imaging five channels to detect seven cell components using six stains (Table 1 , Figure 1A ), which were significantly optimized for dye concentration, buffer composition, staining time, and permeabilization, blocking, and washing conditions. The protocol is readily transferable to multiple adherent cell lines (Figure S1 ).
Table 1. Multiplex cytological profiling assay components.
Cellular component(s) | Stain | Detection (ex/em) |
---|---|---|
nucleus | Hoechst 33342 | 387/447 nm |
endoplasmic reticulum | concanavalin A (con A) AlexaFluor488 conjugate | 472/520 nm |
nucleoli | SYTO 14 green fluorescent nucleic acid stain | 531/593 nm |
Golgi apparatus and plasma membrane | wheat germ agglutinin (WGA) AlexaFluor594 conjugate | 562/642 nm |
F-actin | phalloidin AlexaFluor594 conjugate | 562/642 nm |
mitochondria | MitoTracker Deep Red | 628/692 nm |
We validated the assay by profiling 1600 commercially available bioactive compounds (Table S1 ) spanning a range of mechanisms of action. Briefly, U2OS cells were plated in quadruplicate in 384-well plates, incubated for 24 h to allow cells to adhere and resume growth, and then treated with compounds for 48 h (typical concentration 10 µM). Following the multiplex cytological profiling protocol, images were captured at 20x magnification with an automated epifluorescent microscope. We extracted 824 morphological features (Table S2 ) from each cell using the open-source software CellProfiler [6]. A number of cellular phenotypes could be detected by eye (Figure 1B ). The profiles of the 64 mock-treated wells on each plate vary little over the course of the experiment (Figure S2 , Table S3 ), although some positional effects are evident (Figure S3 , Table S4 ). Roughly half of the features showed significant response to one or more compounds (Figure S4 ). The group of features that were the least useful for this assay were the Zernike shape features (Table S5 ).
To determine whether image-based profiles derived from the multiplex assay are useful for studying compound mechanism-of-action, we examined whether clustering compounds according to image-based profile similarity would group compounds with similar annotated protein targets or chemical structure. After clustering hierarchically the 75 active compounds for which we had annotations and ranking the clusters' enrichment of annotation terms, we found that several of the most enriched clusters were convincing mechanistic groups (Figure 2 ). For example, cluster A contains both structurally related and distinct modulators of tubulin (fenbendazole; oxibendazole; taxol), which lead to large multinucleated cells with fused nucleoli. The promotion of polyploidization and multinucleation by tubulin modulators has been long recognized [14,15]. Cluster B contains modulators of neuronal receptors, all of which lead to enhanced Golgi staining and some cells with fused nucleoli: fluphenazine (D1 and D2 dopamine receptor antagonist), metoclopramide (D2 dopamine antagonist; muscarinic M1 receptor antagonist; 5-hydroxytramine 4 receptor agonist), as well as procaine (sodium channel antagonist), a structural analog of metaclopramide (DrugBank [16] acc. DB01233). It is worth noting that all three compounds contain a basic tertiary amine, which has been linked to compound accumulation in acidic cellular compartments, such as the lysosome and Golgi, with effects on their shape and function [17]. It is possible that this chemical feature and cellular mechanism underlie the shared effect of these compounds on morphology rather than channel inhibition. Cluster C contains a number of structurally related cardenolide glycosides (digoxin; lanatoside C; peruvoside; neriifolin; digitoxin), characterized by reduced cell size, condensed nuclei, plasma membrane blebbing, reduced nucleolar staining, and significant cytotoxicity (Text S2 ). While compounds of this class are thought to affect a range of biological processes, their effects on morphology are consistent with their reported ability to cause cell death [18,19].
Discussion
A rich multiplex assay, such as our cell-painting assay, is a necessary step towards productively profiling a large collection of small molecules. Profiles from such an experiment could be mined to identify regulators of dozens of different phenotypes without having to design and optimize specific assays for each phenotype. Rather, a large, unbiased profiling experiment could be performed once and then efficiently and inexpensively mined for multiple patterns, including unexpected patterns associated with a perturbation of interest. The rich patterns in the profiles could also be used to group small molecules based on their similarity to generate hypotheses about which small molecules share a common mechanisms of action.
Cellular morphology is affected by a number of factors, such as the genetic and epigenetic state of the cell, physiologic processes such as cell division or metabolism, and changes in environmental cues that alter cell signaling. Extensive measurement of morphological features, treated as a profile, can be applied to study the response of cells to diverse perturbations or to characterize the differences between cells from disease and non-disease states. The multiplex assay described here increases the number of morphological features that can be quantified by microscopy and image analysis to create image-based profiles. We anticipate the assay will be useful for characterizing perturbations whose effects are poorly understood, such as novel small molecules or disease-associated variants emerging in genome-wide association studies. We provide the complete set of images from our experiment as well as source code for computer programs that reproduce our results (Text S1 ).
Materials and Methods
Plating
U2OS cells (#HTB-96, ATCC) were plated at the density of 1500–2000 cells per well in 384-well imager quality black/clear plates (Aurora Biotechnologies/Nexus Biosystems) in 50 µL DMEM supplemented with 10% fetal bovine serum, and 1% penicillin/streptomycin. Cells were grown for 24 h at 37°C.
Compound Pinning
Compounds were pin-transferred to cells using a CyBi-Well robot (CyBio, Inc.). Cells were treated for 48 h at 37°C.
Staining
The samples were stained as follows.
Step 1: MitoTracker and Wheat Germ Agglutinin staining.
MitoTracker Deep Red (#M22426, Invitrogen) was dissolved in DMSO to 1 mM. Wheat Germ Agglutinin (WGA) Alexa594 conjugate (#W11262, Invitrogen) was dissolved in dH2O to 1 mg/mL. A 500 nM MitoTracker, 60 µg/mL WGA solution was prepared in prewarmed media (DMEM, 10% FBS, 1% penicillin/streptomycin). Media was removed from plates; residual volume was 10 µL in each well. 30 µL of staining solution was added to wells and incubated for 30 min at 37 °C.
Step 2: Fixation.
10 µL of 16% methanol-free paraformaldehyde (#15710-S, Electron Microscopy Services) was added to wells for a final concentration of 3.2%. The plates were then incubated at room temperature for 20 min. Wells were washed once with 70 µL 1xHBSS (#14065-056, Invitrogen).
Step 3: Permeabilization.
A 0.1% solution of Triton X-100 (T8787-100mL, Sigma) was prepared in 1x HBSS. 30 µL of the solution was added to the wells and incubated for 10–20 min. Wells were washed twice with 70 µL 1x HBSS.
Step 4: Phalloidin, ConcanavalinA, Hoechst, and SYTO 14 staining.
Concanavalin A Alexa488 conjugate (#C11252, Invitrogen) was dissolved to 1 mg/mL in 0.1 M sodium bicarbonate (SH30033.01, HyClone), and Phalloidin Alexa594 conjugate (#A12381, Invitrogen) was dissolved in 1.5 mL methanol (67-56-1, BDH) per vial. A 0.025 µL phalloidin/µL solution, 100 µg/mL ConcanavalinA, 5 µg/mL Hoechst33342 (#H3570, Invitrogen), and 3 µM SYTO14 green fluorescent nucleic acid stain (#S7576, Invitrogen) solution was prepared in 1x HBSS, 1% BSA. 30 µL of staining solution was added to wells and incubated for 30 min. Wells were washed three times with 70 µL 1xHBSS, no final aspiration. Plates were sealed with blue Remp thermal seal, at 171 °C for 4 s.
Imaging
Images were captured at 20x magnification in 5 fluorescent channels, DAPI (387/447 nm), GFP (472/520 nm), Cy3 (531/593 nm), TexasRed (562/642 nm), Cy5 (628/692 nm) on an ImageXpress Micro epifluorescent microscope (Molecular Devices), 9 sites per well, with laser based autofocus in the DAPI channel, first site of each well.
Image analysis
Version 2.0.9925 of the image-analysis software CellProfiler [6] was used to locate and segment the cells and measure many features of each cell (Table S2 ) using the pipelines provided (Text S1 ). After correcting for uneven illumination, the pipeline identifies the nuclei from the DAPI channel and uses the nuclei as seeds to help a segmentation algorithm identify the cytoplasm[20,21]. The pipeline measure size, shape, texture, intensity statistics, and local density of the nuclei, cytoplasms, and entire cells.
Annotation
We used annotations that have previously been collected and curated over the course of several projects. Many of the annotations have been deposited into ChemBank [22], but the annotation work has continued after ChemBank became static. The annotations we used are included as supplementary data.
The annotations covered 649 of the 1600 compounds in the experiment (Table S6 ). Some annotations were from the Gene Ontology [23] (including GOMF, GOBP, and GOCC). Others were medical subject headings (MeSH) or product use/class fields from the compounds’ material safety data sheets. There were also a small number of protein targets (Entrez GeneIDs) among the annotations.
The annotation terms had been “slimmed,” replacing excessively detailed terms with more general terms that give a broader overview. The GO annotations were slimmed using GO slim [23], whereas MeSH and product use/class terms were slimmed by manual inspection. The protein targets were slimmed by assigning the appropriate GOMF, then applying GO slim.
Finding term-enriched clusters
We identified clusters and scored them for enrichment for annotation terms as follows.
-
1
Computed a profile for each of the 7680 samples (20 plates with 384 wells per plate) by averaging each CellProfiler-generated feature across the cells in the well. Averaging has been effective for profiling even though it does not explicitly model heterogeneity among cells [4,10]. The entire CellProfiler feature set was used for the analysis; while feature reduction techniques may result in incremental improvements in performance, we chose to transform the data as little as possible in order to focus the evaluation on the assay itself rather than advanced data-analysis methods. For the same reason, we also chose well-known and transparent methods for the subsequent steps of the analysis.
-
2
Aggregated the 7680 per-sample profiles into 1601 per-compound profiles by computing the element-wise median. The 1601 per-compound profiles include the median mock profile, i.e., the median profile of all DMSO-treated samples.
-
3
Excluded compounds that were inactive in the assay. Compounds were deemed to be active if their profiles’ Euclidean distance to the median mock profile was above a cutoff. The cutoff was the 95th percentile of the distances from the mock-treated wells to the median mock profile. Of the 1600 compounds, 203 (13%) were active.
-
4
Excluded compounds that were unannotated. Of the 203 active compounds, 75 were annotated by one or more of 96 slimmed terms (Table S7).
-
5
Performed hierarchical clustering of the compound profiles of the 75 compounds that were active and annotated, using the cosine distance and single linkage.
-
6
Assessed whether each possible cluster is enriched by each annotation term (Table S8). There were 74 possible clusters, one for each non-leaf subtree of the dendrogram produced by the hierarchical clustering. The assessment was by permutation testing: we measured the fraction of random clusters of the same size that had at least the same number of compounds annotated with the term in question. When constructing random clusters for permutation testing, the cluster members were drawn from a uniform distribution over the compounds. It was not necessary to correct for multiple testing because the fractions were only used for ranking and not interpreted as p-values. Enrichment in GO terms has also recently been used to validate clusters of profiles generated from HTS experiments [24]. Table S8 shows the clusters ranked by permutation-testing score, i.e., the fraction of random clusters that had at least the same number of compounds annotated with the term in question. For each cluster, it shows the number of compounds in the cluster, the number of times the enriched term occurs in the cluster, and the number of times the enriched term occurs in the entire dataset. For each compound in the cluster, the table shows whether the compound has the enriched term, as well as the compound’s name and Broad ID (internal identifier from our compound-management department).
Reproducibility
We provide (Text S1 ) the complete image set, the CellProfiler pipelines used to identify and measure the cells, the database of cellular features, and the source code for the programs that analyze the features and produce the figures and tables in this article.
Supporting Information
Acknowledgments
We thank Thomas P Hasaka for technical assistance, Cindy Hon for project management, Bridget Wagner for providing the U2OS cells, and Mathias Wawer for assistance with presenting chemical structures.
Funding Statement
This work was supported by National Science Foundation CAREER award DBI 1148823 (AEC), and National Institutes of Health grant U54 HG005032 (SLS). KPS and PAC were supported in part by US National Institutes of Health Genomics Based Drug Discovery–Target ID Project grant RL1HG004671, which is administratively linked to the US National Institutes of Health grants RL1CA133834, RL1GM084437, and UL1RR024924. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Stegmaier K, Wong JS, Ross KN, Chow KT, Peck D et al. (2007) Signature-based small molecule screening identifies cytosine arabinoside as an EWS/FLI modulator in Ewing sarcoma. PLoS Med 4: e122. doi: 10.1371/journal.pmed.0040122. PubMed: 17425403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Lamb J, Crawford ED, Peck D, Modell JW, Blat IC et al. (2006) The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease. Science 313: 1929-1935. doi: 10.1126/science.1132939. PubMed: 17008526. [DOI] [PubMed] [Google Scholar]
- 3. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545-15550. doi: 10.1073/pnas.0506580102. PubMed: 16199517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Adams CL, Kutsyy V, Coleman DA, Cong G, Crompton AM et al. (2006) Compound classification using image-based cellular phenotypes. Methods Enzymol 414: 440-468. doi: 10.1016/S0076-6879(06)14024-0. PubMed: 17110206. [DOI] [PubMed] [Google Scholar]
- 5. Tanaka M, Bateman R, Rauh D, Vaisberg E, Ramachandani S et al. (2005) An unbiased cell morphology-based screen for new, biologically active small molecules. PLoS Biol 3: e128. doi: 10.1371/journal.pbio.0030128. PubMed: 15799708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH et al. (2006) CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol 7: R100. doi: 10.1186/gb-2006-7-10-r100. PubMed: 17076895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jones TR, Carpenter AE, Lamprecht MR, Moffat J, Silver SJ et al. (2009) Scoring diverse cellular morphologies in image-based screens with iterative feedback and machine learning. Proc Natl Acad Sci U S A 106: 1826-1831. doi: 10.1073/pnas.0808843106. PubMed: 19188593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Feng Y, Mitchison TJ, Bender A, Young DW, Tallarico JA (2009) Multi-parameter phenotypic profiling: using cellular effects to characterize small-molecule compounds. Nat Rev Drug Discov 8: 567-578. doi: 10.1038/nrd2876. PubMed: 19568283. [DOI] [PubMed] [Google Scholar]
- 9. Futamura Y, Kawatani M, Kazami S, Tanaka K, Muroi M et al. (2012) Morphobase, an encyclopedic cell morphology database, and its use for drug target identification. Chem Biol 19: 1620-1630. PubMed: 23261605. [DOI] [PubMed] [Google Scholar]
- 10. Ljosa V, Caie PD, Ter Horst R, Sokolnicki KL, Jenkins EL et al. (. (2013)) Comparison of methods for image-based profiling of cellular morphological responses to small-molecule treatment. Nature Methods, 18: 1321–9. PubMed: 24045582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Loo L-H, Wu LF, Altschuler SJ (2007) Image-based multivariate profiling of drug responses from single cells. Nat Methods 4: 445-453. PubMed: 17401369. [DOI] [PubMed] [Google Scholar]
- 12. Perlman ZE, Slack MD, Feng Y, Mitchison TJ, Wu LF et al. (2004) Multidimensional drug profiling by automated microscopy. Science 306: 1194-1198. doi: 10.1126/science.1100709. PubMed: 15539606. [DOI] [PubMed] [Google Scholar]
- 13. Young DW, Bender A, Hoyt J, McWhinnie E, Chirn G-W et al. (2008) Integrating high-content screening and ligand-target prediction to identify mechanism of action. Nat Chem Biol 4: 59-68. doi: 10.1038/nchembio.2007.53. PubMed: 18066055. [DOI] [PubMed] [Google Scholar]
- 14. Roberts JR, Allison DC, Donehower RC, Rowinsky EK (1990) Development of Polyploidization in Taxol-resistant Human Leukemia Cells in Vitro. Cancer Res 50: 710-716. PubMed: 1967550. [PubMed] [Google Scholar]
- 15. Yvon AM, Wadsworth P, Jordan MA (1999) Taxol suppresses dynamics of individual microtubules in living human tumor cells. Mol Biol Cell 10: 947-959. doi: 10.1091/mbc.10.4.947. PubMed: 10198049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS (2011) DrugBank 3.0: a comprehensive resource for 'omics' research on drugs. Nucleic Acids Res 39: D1035-1041 [DOI] [PMC free article] [PubMed]
- 17. Marceau F, Bawolak M-T, Lodge R, Bouthillier J, Gagné-Henley A et al. (2012) Cation trapping by cellular acidic compartments: beyond the concept of lysosomotropic drugs. Toxicol Appl Pharmacol 259: 1-12. doi: 10.1016/j.taap.2011.12.004. PubMed: 22198553. [DOI] [PubMed] [Google Scholar]
- 18. Badr CE, Wurdinger T, Nilsson J, Niers JM, Whalen M et al. (2011) Lanatoside C sensitizes glioblastoma cells to tumor necrosis factor-related apoptosis-inducing ligand and induces an alternative cell death pathway. Neuro-Oncology 13: 1213-1224. doi: 10.1093/neuonc/nor067. PubMed: 21757445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Riganti C, Campia I, Kopecka J, Gazzano E, Doublier S et al. (2011) Pleiotropic Effects of Cardioactive Glycosides. Curr Med Chem 18: 872-885. doi: 10.2174/092986711794927685. PubMed: 21182478. [DOI] [PubMed] [Google Scholar]
- 20. Ljosa V, Carpenter A (2009) Introduction to the Quantitative Analysis of Two-Dimensional Fluorescence Microscopy Images for Cell-Based. Screening - PLOS Comput Biol 5: e1000603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Li F, Yin Z, Jin G, Zhao H, Wong S (2013) Bioimage Informatics for Systems Pharmacology chapter 17. PLOS Comput Biol 9: e1003043 [DOI] [PMC free article] [PubMed]
- 22. Seiler KP, Ga George, Happ MP, Bodycombe NE, Carrinski Ha et al. (2008) ChemBank: a small-molecule screening and cheminformatics resource database. Nucleic Acids Res 36: D351-D359. PubMed: 17947324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Ma Harris, Clark J, Ireland a, Lomax J, Ashburner M, et al. (2004) The Gene Ontology (GO: ) database and informatics resource. Nucleic acids research 32: D258-261 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Petrone PM, Simms B, Nigsch F, Lounkine E, Kutchukian P et al. (2012) Rethinking molecular similarity: comparing compounds on the basis of biological activity. ACS Chem Biol 7: 1399-1409. doi: 10.1021/cb3001028. PubMed: 22594495. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.