Abstract
Universal phenotyping techniques that can discriminate among various states of biological systems have great potential. We applied 557 fluorescent library compounds to NCI's 60 human cancer cell-lines (NCI-60) to generate a systematic fluorescence phenotypic profiling data. By the kinetic fluorescence intensity analysis, we successfully discriminated the organ origin of all the 60 cell-lines.
Introduction
A principal challenge for functional genomics is to identify genotypes that are associated with a specific phenotype. Recent advances in gene expression profiling and next-generation sequencing (NGS) technology have driven significant improvements in high-throughput genotyping. [1], [2] However, unlike the well established genotyping platforms, there is no standard quantitative methods for phenotyping yet. [3] Phenotypes can be defined at many different levels; e.g., biochemical, or physiological characteristics which can be measured at a cellular level, or behaviour or clinical history at an organism level. [4] Each phenotypic measurement has been used for further study on a case-by-case basis. For instance, behaviour and brain imaging pattern have frequently been used for phenomic character in neuroscience, [5]–[8] and CD (cluster of designation)-marker based biochemical phenotypes are extensively used to discriminate between cell types in various fields. [9], [10] However, there are not yet universal phenomic parameter that can be applied to diverse screening formats. [11]
We envisioned that a fluorescent probe library would have great potential to identify a set of universal phenomic parameters due to their uniform phenotypic readout (fluorescence signal), easy dose controls, high sensitivity to micro-environments, and structural diversity at the molecular level. Our group has recently developed fluorescent libraries and reported a series of imaging probes in glucagon secreting cells, [12] differentiated myotube cells, [13] and pluripotent stem cells. [14] Even in case the intracellular targets are not clearly defined, the phenotypic signature could be used for a variety of applications. [15] More importantly, depending on the properties of the fluorescent probe, high content information could be extracted from a single fluorescence image. [16], [17] Accordingly, fluorescent phenotype profiling using synthetic probes could reveal subtle differences in biochemical characteristics. In this paper, we report the first fluorescence intensity-based cellular phenomic profiling study using a diversity-oriented fluorescence library (DOFL) and NCI-60 cancer cell lines.
To demonstrate the phenomic scale of fluorescence image-based screening methodology, 60 human cancer cell lines (NCI-60) of the National Cancer Institute's Developmental Therapeutics Program were chosen. These cells were selected based on the following the two reasons. (i) it is the largest standard collection of broad cancer cell types. The NCI-60 set contains cancers of 9 different origins, including leukemia, melanomas, renal, breast, colon, lung, CNS, prostate, and ovarian carcinomas. (ii) rich biochemical background information about the collection is available to public. The characteristics of the NCI-60 set have been profiled at the genomic, [18], [19] proteomic, [20] and drug effect level, [21]–[23] and those data can be easily applied to further “omics” studies. While various profiling approaches have been widely explored to characterize the NCI-60 cell-lines, a systematic fluorescent probe-based profiling has not been pursued yet.
Results and Discussion
Fluorescent probes and fluorescence phenotyping
Combination of rosamine (RS) and BODIPY (BD) fluorophore libraries (240 rosamine [24] and 317 BODIPY [12] compounds, 557 compounds in total) were used to generate fluorescence phenotyping data. As previously reported, these compounds showed good cell permeability, and broad absorbance (λ abs = 480∼616 nm) and fluorescence emission (λ em = 530∼656 nm) ranges with a broad structural diversity. To maximize the information from the cell based assays, we utilized a whole cell fluorescence image-based screening platform, which would provide integrated cellular characteristics, so called high-contents screening. The large-scale profiling was performed using an automated fluorescence microscope system (ImageXpress Micro, Molecular Devices, Inc.). NCI-60 cells were plated in thin-bottom 384-well plates at 70% confluency, and 557 fluorescent compounds were added to different wells at two final concentrations of 250 nM and 500 nM. The fluorescence images were obtained both in the FITC and the TRITC channels to cover the broad range of spectra of the probes. Fluorescent images were taken from two independent sites of each well for 3 time points (1 hour, 24 hours and 48 hours), and all the experiments were duplicated. All together, 1,604,160 images (557 probes×60 cancer cells×2 concentrations×2 channels×2 sites×3 time points×2 duplicates) were collected. A representative fluorescence image data of BD-46 is presented in Figure 1 and the details of the plate design is described in Figure S1. From these fluorescence images, many different kinds of features, such as morphology, texture, or intensity could be extracted and used as a phenotypic signature. [25] Notably, the staining intensities and the pattern of localizations varied in individual cell-lines, even though all cells were stained with an identical amount of BD-46.
Quantitative fluorescence intensity profiling of NCI-60 cell lines
Among various possible parameters, we focused on the fluorescence intensity of the cells, since the image processing method was straightforward and the processed data was less dependent to the analysis algorithm than others. To further suppress the artifact from the batch-to-batch variations (this is especially important in this study due to the long data acquisition time over 2 years), we decided to use fluorescence intensity fold change over the time (intensity kinetic profile), rather than the absolute value of the intensity. Firstly, we measured the mean values of the fluorescence intensity of the cells, and the intensity pattern for different cell-lines were analyzed by their intensity kinetic profile (fold change of the intensity values at 48 h over 1 h; the data processing protocols are described in the experimental methods). Then, we analyzed the overall fluorescence intensity pattern change of the NCI-60 set of cell-lines by heatmap analysis and hierarchical clustering (Fig. 2). In general, lung, colon, and ovarian cancer cells were well classified by the simple clustering. On the other hand, from a perspective of chemical structure, fluorescent compounds that shares same classes of xanthone derivative structure (e.g. RS-A, RS-E, and RS-K series) exhibited similar response pattern in the cell lines for same cancer origin (Fig. S2).
Interestingly, a remarkable signature was observed for the RS-K series of compounds. These compounds initially showed low fluorescence intensity, but after 24 h, their fluorescence was dramatically increased only in specific set of cell-lines (HCT-116, HT29, COLO205, HCC-2998, EKVX, HOP-62, NCI-H460, NCI-H322M, and NCI-H23). Moreover, fluorescence response profiles were highly correlated with the probe structure and cancer cell type (Fig. S2). For example, a series of RS-K compounds (K1, K3, K4, K14, K15, K16, and K17) exhibited fluorescence turn-on effects in lung and colon cancer cells (Fig. S3), and RS-E compounds (E1, E3, E7, and E29) showed turn-on responses in lung cancer cells (Fig. S4). Although the fluorescence response of RS-K and RS-E series increased for those specific cancer cells, each compounds showed subtle but distinct signature. In particular, RS-K series compounds showed turn-on effects towards broad ranges of lung cancer cell, but they differs response pattern depending on the 9-phenyl ring structure of rosamine (Fig. S3). It is also noteworthy to point out some of these probes distinguish specific cell lines from identical cancer origin. RS-E1, RS-E3 and RS-E7 compounds selectively turned on only in ACHN cells among renal cancer cells (Fig. S5). Moreover, they exhibited specific response to SW60 cells, but not for the rest of 6 colon cancer cells.
Another cell line specific profile was observed with the RS-C3 compound. RS-C3 induced a strong fluorescence turn-on effect only in KM12 cells, a human colon cancer cell line, among 60 cell lines (Fig. 3). Compared with the average fluorescent intensities in other NCI-60 cells, RS-C3 exhibited an exceptional 5.64-fold higher intensity in the KM12 cells. To explore the functional characteristic of the KM122 cells that induce selective fluorescence phenotype, we analyzed mRNA expression data of the NCI-60 cell lines. Based on the mRNA expression data of GSE5846, differentially expressed genes (DEGs) that up-regulated in KM12 cells (Log2(Fold) >1.5) were selected, and their gene annotation information were analyzed. KEGG pathway and gene ontology analyses identified five pathways that exhibit significant difference (P value <0.001) between KM12 cells and all the other NCI60 cells, such as cell communication, hematopoietic cell lineage, urea cycle and metabolism of amino groups and p53 signaling pathway (Table S3, S4). With these combinations of DEGs expression, it is possible to discriminate KM12 cells out of other NCI60 cell lines, and up-regulated profile of these genes are unique functional signature of KM12 cells. Since fluorescence phenotype of RS-C3 shows collective information of such DEGs profile within a single fluorescence image as intensity changes, this probe has potential for sensing similar functional signature of other cell lines. While it was not clear at the moment what kinds of endogenous biomolecule are directly interacting with this probe, it could be utilized to visually identify a colon cancer cell line (KM12) among 60 cancer cell lines after 24 h pre-incubation.
Discrimination of cancer cell origin
For quantitative pattern analysis, the intensity kinetic profiles were further examined using linear discriminant analysis (LDA) to classify the cancer cell types. LDA is a commonly used classification method especially for high dimensional data for reduction of dimensionality. By applying LDA to kinetic profiles, it was possible to evaluate fold change responses of each chemical probe for optimum cancer cell identification, and generate score function to predict the identity of cells even each probe does not show specific response to single cell line. The minimum fluorescent probe set that could discriminate 9 cancer cell types was chosen by a forward step-wise variable selection algorithm, and 37 differentially responding fluorescent probes were selected (Fig. S6). Surprisingly, all 60 cancer cells were successfully clustered on an LDA score plot (Fig. 4), and 98% were correctly assigned to each of the original groups using jackknifed cross-validation (Table S1). LDA scores are arbitrary unit values representing maximum identity prediction, and our result showed the first and second highest LDA scores were enough to discriminate 60 cell lines in terms of cancer origin (Table S2). It is worth noting that classification using a combination of both BD and RS probes was much more successful as determined by cross-validation than any combination of either RS or BD probes alone (Table 1). This result implies that the chemical diversity of the fluorescent probes used to generate the intensity profile significantly affected the results of the cancer type classification. Another interesting aspect of selected 37 probes set is that none of these probes are selective to single cell or cancer types. They showed unique response patterns for multiple cell-lines, and the target cell was not always limited within identical cancer types. We believe such selective responses originated by specific biochemical signature or integrated intact environments of target cell types.
Table 1. LDA results for different fluorescnt probe sets.
BD Library | RS Library | BD & RS Library | |
Selected probe number | 37/317 | 22/240 | 37/557 |
% Correct classification in Jackknifed cross-validation | 90% | 80% | 98% |
Detailed Jackknifed classification matrix results are shown in Table S1.
Conclusion
In short, we report the first fluorescence intensity-based phenotype profiling of 60 human cancer cell-lines (NCI-60) using synthetic fluorescent probes. Structural diversity of the fluorescent probes seems to be a critical factor for the generation of distinct phenotypes, and our results showed that 60 cancer cell-lines are all successfully discriminated in terms of 9 cancer cell origins. The most emphasis in cancer research has been focused on pathogenesis and metastasis, and metastasis research has been stagnated because of a lack of reliable tracking technique that could visualize a specific origin of cancer cells. Fluorescence compounds that probe specific origin of cancer cell could not only provide a new window for metastasis research, but also be utilized for cancer diagnosis and progression monitoring. Although this study has limitations in that we performed identification of cancer cells origin using only in vitro cancer cell lines, our results provided a clue that combinations of fluorescent probes could distinguish complex and subtle differences of cancer cells. Based on the response pattern, the better identification result was achieved when more fluorophore scaffolds were used, and the fluorescence intensity profiles were highly correlated with the chemical structure of the fluorescent probes and cancer types. These findings demonstrate the practical utility of the diversity-oriented fluorescence library approach in universal cell phenotyping.
Methods
Diversity-oriented fluorescence library preparation
The rosamine and BODIPY library compounds were prepared according to previously reported protocols, and the compound structures and characterization data has been reported. [12], [24], [26] All compounds were stored at −20°C in microtilter plates in solid form, and stock solutions were made by dissolving the compounds in DMSO prior to screening.
NCI-60 cancer cell culture
The NCI-60 cell-lines were obtained from the National Cancer Institute. All NCI-60 cell-lines were cultured as described in the distributor's manual. Briefly, cells were cultured in a high-glucose Dulbecco's Modified Eagle's Medium (DMEM) supplemented with 10% FBS and 1% antibiotics.
Quantitative fluorescence intensity analysis
Mean values of the fluorescence intensities were calculated using Matlab 7.9.0 (R2009b). In our dataset, 4 images represented each experimental condition (2 duplicate×2 images/well). We combined the 4 images using the cat function in Matlab to optimize IO processing. To minimize the background effect, only the fluorescence intensity values for area that were positive for Otsu's threshold were used to calculate the average value. The graythresh and im2bw functions in the Matlab image processing module were used to calculate Otsu's threshold and the cell area selection. All quantitative image processing batch jobs were computed using a 16-node PC cluster for 3 days.
Discriminant analysis
Linear discriminant analysis (LDA) was performed using fold change values of the fluorescence intensities between 48 hour and 1 hour incubation points. Probe selection was processed by forward-stepwise variable selection algorithum in SYSTAT (version 13).
NCI-60 gene expression and pathway analysis
The mRNA expression patterns for the NCI-60 set of cell lines were downloaded from the NCBI gene expression omnibus (GEO) database. The GSE5846 data were analyzed using GenePlex v3.0 DEG finding and Pathway analysis modules (ISTECH, Inc.).
Supporting Information
Acknowledgments
We would like to thank Korea Institute of Science and Technology for the quantitative image analysis using PC cluster.
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This work was supported by an intramural funding from Agency for Science, Technology and Research (A*STAR), Singapore Biomedical Research Council and A*STAR SSCC Grant (SSCC10/024). This work was also supported by Converging Research Center Program through the Ministry of Education, Science and Technology (2010K001410). JSL is a recipient of T. J. Park Science Fellowship. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Metzker ML. Sequencing technologies - the next generation. Nature reviews Genetics. 2010;11:31–46. doi: 10.1038/nrg2626. [DOI] [PubMed] [Google Scholar]
- 2.Eid J, Fehr A, Gray J, Luong K, Lyle J, et al. Real-time DNA sequencing from single polymerase molecules. Science. 2009;323:133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
- 3.Houle D, Govindaraju DR, Omholt S. Phenomics: the next challenge. Nature reviews Genetics. 2010;11:855–866. doi: 10.1038/nrg2897. [DOI] [PubMed] [Google Scholar]
- 4.Zbuk KM, Eng C. Cancer phenomics: RET and PTEN as illustrative models. Nature reviews Cancer. 2007;7:35–45. doi: 10.1038/nrc2037. [DOI] [PubMed] [Google Scholar]
- 5.Ayhan Y, Abazyan B, Nomura J, Kim R, Ladenheim B, et al. Differential effects of prenatal and postnatal expressions of mutant human DISC1 on neurobehavioral phenotypes in transgenic mice: evidence for neurodevelopmental origin of major psychiatric disorders. Molecular psychiatry. 2011;16:293–306. doi: 10.1038/mp.2009.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shen L, Kim S, Risacher SL, Nho K, Swaminathan S, et al. Whole genome association study of brain-wide imaging phenotypes for identifying quantitative trait loci in MCI and AD: A study of the ADNI cohort. NeuroImage. 2010;53:1051–1063. doi: 10.1016/j.neuroimage.2010.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Vounou M, Nichols TE, Montana G. Discovering genetic associations with high-dimensional neuroimaging phenotypes: A sparse reduced-rank regression approach. NeuroImage. 2010;53:1147–1159. doi: 10.1016/j.neuroimage.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gerlai R. Phenomics: fiction or the future? Trends in neurosciences. 2002;25:506–509. doi: 10.1016/s0166-2236(02)02250-6. [DOI] [PubMed] [Google Scholar]
- 9.Fatrai S, van Gosliga D, Han L, Daenen SM, Vellenga E, et al. KRAS(G12V) enhances proliferation and initiates myelomonocytic differentiation in human stem/progenitor cells via intrinsic and extrinsic pathways. The Journal of biological chemistry. 2011;286:6061–6070. doi: 10.1074/jbc.M110.201848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.MacKenzie DA, Schartner J, Lin J, Timmel A, Jennens-Clough M, et al. GRAIL is up-regulated in CD4+ CD25+ T regulatory cells and is sufficient for conversion of T cells to a regulatory phenotype. The Journal of biological chemistry. 2007;282:9696–9702. doi: 10.1074/jbc.M604192200. [DOI] [PubMed] [Google Scholar]
- 11.Freimer N, Sabatti C. The human phenome project. Nature genetics. 2003;34:15–21. doi: 10.1038/ng0503-15. [DOI] [PubMed] [Google Scholar]
- 12.Lee JS, Kang NY, Kim YK, Samanta A, Feng S, et al. Synthesis of a BODIPY library and its application to the development of live cell glucagon imaging probe. Journal of the American Chemical Society. 2009;131:10077–10082. doi: 10.1021/ja9011657. [DOI] [PubMed] [Google Scholar]
- 13.Wagner BK, Carrinski HA, Ahn YH, Kim YK, Gilbert TJ, et al. Small-molecule fluorophores to detect cell-state switching in the context of high-throughput screening. Journal of the American Chemical Society. 2008;130:4208–4209. doi: 10.1021/ja077656d. [DOI] [PubMed] [Google Scholar]
- 14.Im CN, Kang NY, Ha HH, Bi X, Lee JJ, et al. A fluorescent rosamine compound selectively stains pluripotent stem cells. Angewandte Chemie. 2010;49:7497–7500. doi: 10.1002/anie.201002463. [DOI] [PubMed] [Google Scholar]
- 15.Ljosa V, Carpenter AE. High-throughput screens for fluorescent dye discovery. Trends in biotechnology. 2008;26:527–530. doi: 10.1016/j.tibtech.2008.06.008. [DOI] [PubMed] [Google Scholar]
- 16.Korn K, Krausz E. Cell-based high-content screening of small-molecule libraries. Current opinion in chemical biology. 2007;11:503–510. doi: 10.1016/j.cbpa.2007.08.030. [DOI] [PubMed] [Google Scholar]
- 17.Belloc F, Dumain P, Boisseau MR, Jalloustre C, Reiffers J, et al. A flow cytometric method using Hoechst 33342 and propidium iodide for simultaneous cell cycle analysis and apoptosis determination in unfixed cells. Cytometry. 1994;17:59–65. doi: 10.1002/cyto.990170108. [DOI] [PubMed] [Google Scholar]
- 18.Ross DT, Scherf U, Eisen MB, Perou CM, Rees C, et al. Systematic variation in gene expression patterns in human cancer cell lines. Nature genetics. 2000;24:227–235. doi: 10.1038/73432. [DOI] [PubMed] [Google Scholar]
- 19.Gmeiner WH, Reinhold WC, Pommier Y. Genome-wide mRNA and microRNA profiling of the NCI 60 cell-line screen and comparison of FdUMP[10] with fluorouracil, floxuridine, and topoisomerase 1 poisons. Molecular cancer therapeutics. 2010;9:3105–3114. doi: 10.1158/1535-7163.MCT-10-0674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Nishizuka S, Charboneau L, Young L, Major S, Reinhold WC, et al. Proteomic profiling of the NCI-60 cancer cell lines using new high-density reverse-phase lysate microarrays. Proceedings of the National Academy of Sciences of the United States of America. 2003;100:14229–14234. doi: 10.1073/pnas.2331323100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Weinstein JN, Kohn KW, Grever MR, Viswanadhan VN, Rubinstein LV, et al. Neural computing in cancer drug development: predicting mechanism of action. Science. 1992;258:447–451. doi: 10.1126/science.1411538. [DOI] [PubMed] [Google Scholar]
- 22.Paull KD, Shoemaker RH, Hodes L, Monks A, Scudiero DA, et al. Display and analysis of patterns of differential activity of drugs against human tumor cell lines: development of mean graph and COMPARE algorithm. Journal of the National Cancer Institute. 1989;81:1088–1092. doi: 10.1093/jnci/81.14.1088. [DOI] [PubMed] [Google Scholar]
- 23.Lee JS, Chang YT. Understanding the activity of small molecules in the genomics context. Angewandte Chemie. 2007;46:3616–3618. doi: 10.1002/anie.200604972. [DOI] [PubMed] [Google Scholar]
- 24.Ahn YH, Lee JS, Chang YT. Combinatorial rosamine library and application to in vivo glutathione probe. Journal of the American Chemical Society. 2007;129:4510–4511. doi: 10.1021/ja068230m. [DOI] [PubMed] [Google Scholar]
- 25.Walter T, Shattuck DW, Baldock R, Bastin ME, Carpenter AE, et al. Visualization of image data from cells to organisms. Nature methods. 2010;7:S26–41. doi: 10.1038/nmeth.1431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lee JS, Kim HK, Feng S, Vendrell M, Chang YT. Accelerating fluorescent sensor discovery: unbiased screening of a diversity-oriented BODIPY library. Chemical communications. 2011;47:2339–2341. doi: 10.1039/c0cc04495d. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.