Machine learning and image-based profiling in drug discovery

Christian Scheeder; Florian Heigwer; Michael Boutros

doi:10.1016/j.coisb.2018.05.004

. 2018 Aug;10:43–52. doi: 10.1016/j.coisb.2018.05.004

Machine learning and image-based profiling in drug discovery

Christian Scheeder ¹, Florian Heigwer ¹, Michael Boutros ^1,^∗

PMCID: PMC6109111 PMID: 30159406

Abstract

The increase in imaging throughput, new analytical frameworks and high-performance computational resources open new avenues for data-rich phenotypic profiling of small molecules in drug discovery. Image-based profiling assays assessing single-cell phenotypes have been used to explore mechanisms of action, target efficacy and toxicity of small molecules. Technological advances to generate large data sets together with new machine learning approaches for the analysis of high-dimensional profiling data create opportunities to improve many steps in drug discovery. In this review, we will discuss how recent studies applied machine learning approaches in functional profiling workflows with a focus on chemical genetics. While their utility in image-based screening and profiling is predictably evident, examples of novel insights beyond the status quo based on the applications of machine learning approaches are just beginning to emerge. To enable discoveries, future studies also need to develop methodologies that lower the entry barriers to high-throughput profiling experiments by streamlining image-based profiling assays and providing applications for advanced learning technologies such as easy to deploy deep neural networks.

Keywords: Imaging, Image analysis, Machine learning, Drug discovery, High-throughput screening, High-content analysis

Introduction

Using complex phenotypes to predict functions of genes has been a powerful approach in genetics and clustering mutations by their organismal phenotypes has yielded important insights into functions of many genes and pathways [1]. Visual phenotypes in genetic screens were usually manually scored and phenotypes clustered by similarity. These approaches yielded fundamental insights into the components and architecture of many conserved signaling pathways that have later been found to be recurrently mutated in many diseases 2, 3, 4.

More recently, automated microscopy-based phenotyping has become a powerful method to infer functions and functional relationships of genes, to investigate cellular or organismal structure, behavior and disease mechanisms 5, 6, 7. Automated imaging-based methods comprise a broad range of qualitative and quantitative strategies to measure phenotypes in various systems ranging from unicellular organisms to cell lines and whole animals 3, 8, ∗∗9, 10, 11, 12, however, challenges are often similar, from standardization of imaging conditions to an automated approach to analyze very large data sets.

In drug discovery, automated imaging was used in a number of different applications in pre-clinical development and has been shown to allow scalable and systematic phenotypic profiling of small molecules, establishing itself as a complementary method to target-based in-vitro screening 7, 13, 14. In this review, we will summarize the current state of the art with a focus on recent publications that highlight novel analysis methodologies for image-based screening. While most methods are applicable in any perturbation-based screens, this review in particular highlights applications in chemical genetics and drug discovery.

Image-based profiling in drug discovery

For image-based phenotyping of perturbations, one can distinguish two approaches to experimental design that fundamentally differ with regards to probes as well as image analysis 10, ∗15, 16.

The first approach includes screening applications (often called high-content or phenotypic screening) that are focused on pre-defined, specific phenotypes with the aim to identify drugs or drug targets that modulate it (Figure 1A). Such image-based phenotypic screens have, for example, been successfully used to identify targets and compounds that modulate phenotypes like the subcellular localization of specific proteins 17, 18.

Typical workflow of image-based small molecule experiments. A typical high-throughput imaging experiment starts with the seeding of adherent cells in suitable microtiter plates (e.g. 384-well plates) followed by an incubation time of several hours to let the cells attach. In a second step, compounds are added using robotic liquid handling stations. In simple experiments cells are perturbed with one compound per well before the assay is stopped by fixation and staining of cells. Finally, plates are imaged using automated microscopes (wide-field or confocal, one or more fields on view, [23]). This generic workflow describes the basic steps of high-throughput imaging experiments which can further be distinguished in two approaches. (A) Screening experiments are designed to identify small molecules that modulate a pre-defined phenotype. (B) Profiling experiments follow an unbiased approach to profile cells upon perturbations by extracting hundreds of phenotypic measurements. In both approaches the application of more complex experimental designs and cell culture models such as 3-D cell culture will require specific microscopes, additional processing steps and, thus, reduce throughput.

The second application of high-throughput imaging in drug discovery is the more global profiling of perturbations. This approach profiles cells upon exposure to genetic, pathogenic or chemical perturbations and is complementary to techniques like transcriptional profiling (Figure 1B, 19, 20, 21). Subcellular structures are stained with multiplexed fluorescent dyes and fluorescently labeled antibodies to visualize or ‘paint’ cells and subcellular structures. Automated image acquisition and analysis are subsequently used to profile phenotypes of cells in an unbiased manner (Figure 2, Box 1, 22, ∗23). Computer vision can extract multivariate feature vectors of cell morphology such as cell size, shape, texture and staining intensity without further human intervention (Figure 2A). All large-scale studies reported so far employed segmentation approaches to accurately define cellular outlines prior to feature extraction. The derived profiles of single cells or cell populations are then scored to find relationships within data sets comprising up to tens of thousands of perturbations (Figure 2B, 24, ∗∗25, 26).

Typical analysis strategy and machine learning applications in image-based small molecule profiling experiments. (A) Images acquired in a high-throughput profiling experiment are analyzed using automated, highly parallelized analysis pipelines that employ software such as CellProfiler, R/EBImage, Icy or ImageJ. In a first step, the quality of the images is controlled. For this purpose, machine learning classifiers can be trained to recognize and remove images with artifacts. Segmentation-based or segmentation-free approaches are then used to extract quantitative image features that represent the cellular phenotypes (see Box 1 for more details). (B) The extracted phenotypic features may be used “as is” or further processed to derive meta-feature vectors representing each perturbation. Treatment-level profiles are then further used to classify compounds with shared MOA, toxicity or to assess the efficacy of candidate molecules.

Box 1. Processes in image analysis workflows and machine learning applications.

Segmentation

High-throughput small molecule profiling is typically based on the staining of subcellular structures and fluorescence microscopy. A selective staining of abundant subcellular structures such as DNA or actin and the subsequent imaging in distinct channels allows the identification of single cells as objects [23]. An accurate segmentation of nuclei and cell bodies can be achieved by intensity-based thresholding and region growing approaches 81, 82, 83. For more complex cell shapes or multicellular objects such as organoids, more sophisticated models are required to segment objects of interest [84]. A number of computational applications for segmentation-free analysis in image-based profiling have been developed 85, 86. Recently, deep learning approaches based on segmentation-free strategies, in some cases in combination with single-cell segmentation, have been developed for image-based small molecule profiling (see main text for further details).

Feature extraction

Following segmentation, computer vision is used to extract morphology, intensity and texture features of single cells 81, 82. Morphological features describing cellular size and shape are calculated based on the cell outlines. Intensity and texture features are calculated on pixel values of raw or pre-processed images within those outlines. These features are defined based on human knowledge. Hundreds to thousands of features are extracted to achieve an unbiased character of the studies. In contrast, segmentation-free approaches provide a hypothesis-free approach for feature extraction independent of segmentation outlines. Recently, several approaches that use deep neural networks for segmentation-free feature extraction were developed 72, 73, 74, 75. While the features extracted by deep neural networks cannot be readily interpreted by humans, results from pilot studies indicate that those features are complementary to those extracted by segmentation-based strategies [73].

Profile generation

Segmentation-based image analysis results in collections of single cell feature vectors for each condition (e.g. per small molecule treatment). Various strategies have been explored to process those single cell feature vectors into profiles for downstream analyses. A common method relies on the aggregation of single cell features by summarizing the mean and standard deviation over all cells of a perturbation and other methods using machine learning approaches to generate profiles are reported [60]. Furthermore, features are often highly correlated and deliver very similar information content. This inflates data complexity and impairs downstream analysis. Strategies to either select features carrying redundant information or to reduce data dimensionality by transformations are thus commonly applied before proceeding with downstream analyses [59].

Classification

The aim of many profiling experiments is a classification of compounds. Based on a set of reference compounds and their profiles, classification strategies are used to infer putative MOA or toxicity relationships of formerly unseen compounds. This process is a bona fide machine learning task. Approaches include hierarchical clustering of compounds into groups that share high profile similarity, training support vector machines on profiles for MOA prediction or correlation network analysis ∗∗59, ∗60, 61. More recently, advanced classifiers such as random forests or deep neural networks were employed in order to achieve higher accuracy in predicting MOAs 71, 74, 75. The latter however with an increased complexity for analysis and demand more in-depth knowledge of neural network-based machine learning.

Alt-text: Box 1

High-throughput imaging has been successfully used for small molecule profiling in human cancer cells to group small molecules with similar activity and mechanism of action (MOA) [27]. The concept of clustering small molecules based on the similarity of phenotypic profiles was further exploited to correlate phenotypic responses with chemical structure similarity (Figure 3A and B, ∗28, 29). The basic principle of grouping small molecules based on similarity of measured phenotypic profiles has further been successfully applied in a variety of drug discovery applications. For example, annotated libraries of pharmacologically active small molecules could be profiled to hypothesize off-targets and to suggest drug-target pairs with possibilities for repurposing ∗∗25, 30. Chemical genetics approaches also allow a systematic survey for drug repurposing by linking the effect of small molecules with the absence or presence of loss- or gain-of-function alleles ∗∗25, 31. Further strategies include the association of small molecules to a new target by comparing its induced phenotype to that of a reference molecule and the unbiased testing of small molecules for candidates that reverse a disease-associated cellular phenotype to the wild-type phenotype ∗∗25, 30, 32. Other applications include MOA profiling of natural product libraries and the enrichment of large compound libraries for effective and diverse subsets 33, 34. Studies also demonstrated the application of image-based profiling for toxicity profiling, for example, by using model systems such as 3-D liver cell culture or induced stem cell-derived cardiomyocytes to assess small molecule toxicity 35, 36. Additional applications include the unbiased identification of toxic small molecules in large natural product libraries 33, 37, 38. A further optimization of image-based profiling assays and analysis concepts made the technology accessible to more complex drug discovery studies as demonstrated by a recent study in which 50,000 compounds were profiled in pluripotent stem cells to complement a targeted antibody readout [39].

Example images and analyses for image-based small molecule profiling. (A) Young et al. could show that substantial correlation between phenotypic feature vectors and chemical similarities based on compound chemical structure exists. This implied that phenotypic similarities could be harnessed to find small molecules with similar chemical structure (adapted from Ref. [28]). (B) Breinig et al. built on this observation and tested if image-based phenotypic profiles also reveal cell line specific responses to certain compound perturbations. They found, for example, that the presence or absence of the CTNNB1 mutant allele causes a differential cellular reaction to Etoposide treatment (adapted from Ref. [25]). (C) 3D-cyst culture was applied by Booij et al. to map the activity and function of small molecule compounds and antibodies by image-based profiling. Using this analysis, they found that PI3K inhibitor treatment was not able to suppress forskolin induced cyst swelling (adapted from Ref. [68]). (D) Schematic illustration of the neural network architecture (left) that was used by Kraus et al. to classify small molecules by mechanism of action from full-size microscopy images of cancer cells. Kraus et al. combined deep convolutional networks with multiple instance learning (MIL) to classify small molecules based on image-level labels. In a second step the architecture of the convolutional MIL model allowed tracing back and segmenting single cells in the full-size images ([71], figure kindly provided by O. Kraus). (E) Fuchs et al. employed a classifier to group cells into phenotypic classes. By classifying single cells phenotypic class upon RNAi perturbation they could group genes based on phenotypic similarity and functionally related groups (adapted from Ref. [16]).

Image-based phenotypic profiling has also been recently applied to characterize small molecule function in patient-derived primary cells or 3-D cell culture models. A recent study by Booij et al. used image-based profiling to quantify the phenotypic effects of kinase inhibitors in a 3-D cell culture model of Polycystic kidney disease to facilitate novel hypotheses for small molecule treatments (Figure 3C, [40]). Another study deployed image-based profiling in primary chronic lymphatic leukemia (CLL) cells to identify small molecules effective in overcoming resistance against the BCL2 inhibitor Venetoclax [41].

Chemical genetics and image-based profiling

Chemical genetics approaches have been pioneered in model organisms to discover drug MOA as well as gene functions 42, 43, 44, 45, 46. In such screens, libraries of loss-of-function or gain-of-function mutant strains were screened against chemical libraries to measure genotype-dependent changes in fitness. Comparisons of phenotypic responses of mutant strains across chemical perturbations allowed to map genes with shared function and, by assessing the response across a panel of mutants, the bioactivity and cellular targets of small molecules [31].

In model organisms like yeast or bacteria, sophisticated experimental and analysis frameworks have been established for chemical-genetic studies which might provide further strategies for profiling in human cells 31, 47. As an example, a recent study in Saccharomyces cerevisiae used an algorithm to define a set of query mutant strains from genetic-interaction data. Several chemical libraries were then screened against the selection of deletion strains with the aim to functionally annotate small molecules across relevant biological processes without screening the full deletion strain library [48].

This principle has been adopted to pharmacogenomics studies in human cancer cell lines by integrating drug response and the genetic background of cell lines with the aim to identify cancer-specific vulnerabilities 49, 50, 51. Several large-scale pharmacogenomics studies relied on univariate phenotypic readouts based on cell viability assays that can be applied to a large panel of cell lines, however, not all univariate cell variability measurements are equivalent 49, 50, 52.

Microscopy-based profiling experiments are typically carried out in microtiter plates, which enables a rapid scaling of experiments for a large number of perturbations, i.e. to profile chemical libraries in a large number of genetic perturbations. As such, image-based chemical genetics offers a great potential to systematically characterize small molecules in human cell-based systems and to probe chemical-genetic interactions in complex and monogenic human diseases 7, 53. To systematically map chemical-genetic interactions by image-based profiling, smaller panels of human cancer cell lines have been used (Figure 3B, ∗∗25, 54, 55. The unbiased measurement of chemical genetic interactions across a wide phenotypic space can identify chemical-genetic interactions, small molecule off-target effects and drug repurposing opportunities. The establishment of image-based assays is often more labor intensive and not every cell line is suitable for high-throughput image-based profiling experiments [23].

Machine learning strategies for image-based profiling

High-throughput microscopy generates large collections of phenotypic data. The size, complexity and multiparametric nature of such data sets make automated analysis strategies necessary to identify cellular phenotypes that differ from wild type phenotypes and discover novel relationships between phenotypes and the perturbagens which induce them. Simple statistical inference methods such as mean values are often insufficient to comprehensively describe the complexity of data sets [56]. As a consequence, many profiling studies use supervised or unsupervised machine learning strategies to capture and evaluate phenotypic changes of single cells or populations of cells (Box 2, Figure 3D, 57, 58, ∗∗59).

Box 2. Basic principles of machine learning.

Machine learning strategies can generally be separated in unsupervised and supervised approaches.

Unsupervised machine learning approaches interpret and learn an abstract representation of the intrinsic structure of data sets and can be applied without a priori definition of labels (phenotypes) to be classified. A commonly applied unsupervised machine learning strategy is clustering of data points into patterns to ultimately derive biologically meaningful information ∗27, ∗28, ∗60. Commonly applied unsupervised clustering algorithms include hierarchical or k-means clustering.

By contrast, supervised machine learning strategies are typically applied to classification problems and thus rely on pre-defined classes. Annotated training sets with data points representing those classes are used to train a classifier 16, 65. This way, the classifier learns how to distinguish between the classes based on the measured variables (also called features). Linear and non-linear supervised machine learning classifiers exist, and the choice of the classifiers depends on the classification problem. In many cases, a comparison of the classifier performance is used to identify the algorithm that is best suited for the problem. The training of classifiers requires representative training data sets of sufficient size and quality. To avoid issues such as overfitting and assess classification accuracy statistics, the resulting classifier needs to be evaluated carefully using test data (not used for training).

Recently, deep learning has been exploited as a supervised machine learning technology for biological classification problems. Deep learning uses artificial neural networks, which consist of multiple layers of interconnected linear or non-linear transformations that are applied to the data with the goal to solve a classification problem. The networks are trained over multiple epochs, each time comparing the predicted and true class labels to optimize network parameters with automated algorithms [87]. The actual network architecture relies on complex mathematical concepts. Strategies such as transfer learning or generic deep neural networks provide possibilities for non-experts to apply deep learning without full knowledge of the mathematical details ∗∗9, 70, 73.

The high dimensionality and complexity of image data sets typically make analysis strategies with multiple steps and combinations of machine learning algorithms necessary. Here, deep learning strategies provide great potential to automate analysis at many stages from segmentation, feature extraction to classification to reduce manual tuning of analysis pipelines 71, 74.

Alt-text: Box 2

Clustering image-based profiles to identify small molecules with shared MAO based on the ‘guilt-by-association’ rule relies on unsupervised machine learning strategies ∗∗25, ∗27, ∗28, 34. In the simplest case, morphological profiles are generated by aggregating all single-cell measurements per treatment condition and several statistical methods have been applied to generate morphological profiles. When used to classify a set of compounds by their MOA, a comparison of profiling methods showed only minimal differences between them [60].

For large data sets, hierarchical (unsupervised) clustering algorithms are commonly used to cluster small molecule profiles based on their profile similarity. Often, matrices of all pairwise similarities between small molecule profiles are used to derive a distance measure (i.e. 1 – similarity) rather than absolute distance measures such as the Euclidian distance [61]. Furthermore, phenotypic similarity matrices correlate in some cases with other similarities of small molecules such as chemical similarity ∗∗25, ∗28.

Hierarchal clustering and the corresponding visualization as heatmaps are relatively easy to implement and require only few computational resources, even for large data sets. Furthermore, the concept of similarity-based clustering is frequently applied in other fields such as gene expression profiling or genetic interaction studies making those analyses and visualizations accessible to a broad audience 62, 63. Depending on the size and characteristics of the data set, alternative cluster algorithms and visualizations as for example two-dimensional maps might be favorable to identify relevant groups of cells or perturbations 16, 30, 64.

In contrast to unsupervised approaches, supervised machine learning algorithms are commonly applied when cells or populations of cells need to be classified in discrete phenotypic classes. Supervised machine learning has been successfully used within image-based genetic screening experiments to classify single cells into pre-defined, biologically meaningful classes based on their phenotypic profiles (Figure 3D, 16, 64, 65). Supervised classifiers are trained to learn the relevant phenotypes from training data to distinguish different phenotypes. This approach has been applied by using tool compounds, genetic perturbation or extracellular stimuli to create reference profiles which were further used to train classifiers 32, ∗39, 41, 66, 67, 68. In some cases, supervised machine learning strategies were used to calculate profiles of meta features which were then applied to various downstream analysis strategies [60]. As an example, Loo et al. developed a strategy using support vector machines to separate treated from untreated cells in the high-dimensional feature space. In subsequent steps, profiles were calculated using classification accuracy and the orientation of support-vector hyperplanes. The derived meta-profiles incorporate multiphasic responses over dose ranges and retain human interpretable profiles for analyses [69].

Supervised machine learning requires curated training data, e.g. manually labeled cells or defined experimental measurements. The size and quality of the training data is crucial for the performance of the corresponding classifier and manual annotation of samples may be biased by subjective decisions. As for many published studies that used supervised machine learning, training data sets need to be reviewed and possibly re-generated if only small experimental parameters change 57, 58. Thus, the choice of the machine learning algorithm and its proper implementation are key factors to avoid issues such as overfitting. Whilst such challenges might put unsupervised and simple statistical inference methods in favor for large-scale profiling experiments, supervised machine learning is being used with success to classify complex phenotypes 10, 16, 65.

A recent study that used high-throughput microscopy and automated image analysis with deep convolutional neural networks classified proteins by their subcellular localization in S. cerevisiae and thereby demonstrated the power of supervised machine learning for image-based phenotyping (Figure 3E, [9]). The deep neural network that was used not only outperformed a previously implemented ensemble of support vector machines at a complex automated image analysis task, but could also be adapted to new, divergent data sets using transfer learning.

Transfer learning aims to adopt a machine learning classifier to a new problem (i.e. new data sets, generated with e.g. different experimental parameters) by fine-tuning the classifier with training data from unseen data sets. Examples show that better classification performance can be achieved by combining deep- and transfer learning when compared to the setup of a classifiers trained from scratch ∗∗9, 70. This saves computational resources as the fine-tuning is computationally less expensive than training a deep neural network from scratch and high classification accuracy might be achieved with smaller training data sets.

Recently, a number of studies reported the application of supervised machine learning approaches with deep neural networks to classify small molecules by their MOA using annotated image data sets from the Broad Bioimage Benchmark Collection 70, 71, 72, 73, 74, 75. Various strategies with applications of deep neural networks at the level of feature extractions and/or classifications were reported, in some cases with previous traditional single cell segmentation. As an example, one study used labeled full resolution images to train a deep neural network which gave slightly better treatment level classification results compared to previously reported predictions using segmentation and factor analysis [71]. Notably, a relatively low number of images (25 images per treatment) were used to train the network and no previous segmentation and labeling of single cells was required.

Nevertheless, labeled training data sets imply the a priori knowledge of phenotypes, which can contradict the unbiased strategy of image-based profiling. Two recent studies propose to use generic deep neural networks that were pre-trained on millions of ‘consumer’ images for image-based profiling tasks 73, 75. The approaches are based on the assumption that generic neural networks learned general properties of natural images and are thus capable of extracting biologically meaningful information without additional training. Both studies report better results compared to traditional feature extraction when predicting small molecule MOA and provide a proof-of-concept for the applicability of generic deep neural networks for image-based small molecule profiling. As noted by the authors, additional studies with larger data sets across conditions to sample broader biological and technical space will be required for further validation.

Another recently explored application of supervised learning in image-based profiling, particularly deep neural networks, is a novelty detection framework to identify unexpected phenotypes [76]. Label-free profiling and the prediction of targeted drug screening assays are also future approaches exploiting image-based profiling data 77, 78, 79.

Conclusions

Image-based profiling studies demonstrated the capability to improve the pre-clinical development of small molecules at almost any step of the pipeline from target identification over mechanism of action prediction to toxicity profiling. Increasing the throughput and extending more complex analysis methods of image based phenotypic screens and profiling approaches will help to increase the methodological portfolio of cellular screens to support the drug development process. Community efforts to create annotated datasets that can be shared across laboratories will be required to test and optimize the potential of strategies such as transfer learning to improve discovery science ∗∗59, 80.

Furthermore, large-scale chemical-genetics approaches inspired from successful studies in model organisms might harbor great potential to characterize drugs and drug-gene interactions in a systematic manner. Particularly, image-based profiling approaches in pre-selected informer panels of human cell lines might be a scalable and versatile tool to deprioritize compounds harboring adverse effects, asses compound efficacy and to generate hypotheses for drug synergism and repurposing.

Funding

This work was in part supported by an ERC Advanced Grant (SYNGENE) of the European Commission.

Conflicts of interest

None.

Acknowledgements

We kindly thank Oren Z. Kraus for Figure 3E and Benedikt Rauscher and Niklas Rindtorff for critical comments on the manuscript. We further thank the Boutros lab for helpful discussions.

This review comes from a themed issue on Pharmacology and drug discovery

Edited by Mikhail Savitski and Athanasios Typas

References

1.Nüsslein-Volhard C., Wieschaus E. Mutations affecting segment number and polarity in Drosophila. Nature. 1980;287:795–801. doi: 10.1038/287795a0. [DOI] [PubMed] [Google Scholar]
2.Sepp K.J., Hong P., Lizarraga S.B., Liu J.S., Mejia L.A., Walsh C.A., Perrimon N. Identification of neural outgrowth genes using genome-wide RNAi. PLoS Genet. 2008;4:e1000111. doi: 10.1371/journal.pgen.1000111. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Kiger A.A., Baum B., Jones S., Jones M.R., Coulson A., Echeverri C., Perrimon N. A functional genomic analysis of cell morphology using RNA interference. J Biol. 2003;2:27. doi: 10.1186/1475-4924-2-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Kamath R.S., Ahringer J. Genome-wide RNAi screening in Caenorhabditis elegans. Methods. 2003;30:313–321. doi: 10.1016/s1046-2023(03)00050-1. [DOI] [PubMed] [Google Scholar]
5.Boutros M., Heigwer F., Laufer C. Microscopy-based high-content screening. Cell. 2015;163:1314–1325. doi: 10.1016/j.cell.2015.11.007. [DOI] [PubMed] [Google Scholar]
6.Mattiazzi Usaj M., Styles E.B., Verster A.J., Friesen H., Boone C., Andrews B.J. High-content screening for quantitative cell biology. Trends Cell Biol. 2016;26:598–611. doi: 10.1016/j.tcb.2016.03.008. [DOI] [PubMed] [Google Scholar]
7.Pegoraro G., Misteli T. High-throughput imaging for the discovery of cellular mechanisms of disease. Trends Genet. 2017;33:604–615. doi: 10.1016/j.tig.2017.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Peters J.M., Colavin A., Shi H., Czarny T.L., Larson M.H., Wong S., Hawkins J.S., Lu C.H.S., Koo B.M., Marta E. A comprehensive, CRISPR-based functional analysis of essential genes in bacteria. Cell. 2016;165:1493–1506. doi: 10.1016/j.cell.2016.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kraus O.Z., Grys B.T., Ba J., Chong Y., Frey B.J., Boone C., Andrews B.J. Automated analysis of high-content microscopy data with deep learning. Mol Syst Biol. 2017;13:924. doi: 10.15252/msb.20177551. [DOI] [PMC free article] [PubMed] [Google Scholar]; A study that used deep learning to classify images of yeast cells according to the subcellular localization of fluorescently labeled proteins. This enabled an automated analysis of large-scale image data sets and further explored transfer learning as a framework to facilitate the adaption of deep learning to diverse biological data sets.
10.Neumann B., Held M., Liebel U., Erfle H., Rogers P., Pepperkok R., Ellenberg J. High-throughput RNAi screening by time-lapse imaging of live human cells. Nat Methods. 2006;3:385–390. doi: 10.1038/nmeth876. [DOI] [PubMed] [Google Scholar]
11.Huisken J., Swoger J., Del Bene F., Wittbrodt J., Stelzer E.H.K. Optical sectioning deep inside live embryos by selective plane illumination microscopy. Science. 2004;305:1007–1009. doi: 10.1126/science.1100035. [DOI] [PubMed] [Google Scholar]
12.Le X., Pugach E.K., Hettmer S., Storer N.Y., Liu J., Wills A.A., DiBiase A., Chen E.Y., Ignatius M.S., Poss K.D. A novel chemical screening strategy in zebrafish identifies common pathways in embryogenesis and rhabdomyosarcoma development. Development. 2013;140:2354–2364. doi: 10.1242/dev.088427. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Bickle M. The beautiful cell: high-content screening in drug discovery. Anal Bioanal Chem. 2010;398:219–226. doi: 10.1007/s00216-010-3788-3. [DOI] [PubMed] [Google Scholar]
14.Swinney D.C., Anthony J. How were new medicines discovered? Nat Rev Drug Discov. 2011;10:507–519. doi: 10.1038/nrd3480. [DOI] [PubMed] [Google Scholar]
Caicedo J.C., Singh S., Carpenter A.E. Applications in image-based profiling of perturbations. Curr Opin Biotechnol. 2016;39:134–142. doi: 10.1016/j.copbio.2016.04.003. [DOI] [PubMed] [Google Scholar]; This reference provides a concise review on the various applications of image-based profiling including drug discovery applications.
16.Fuchs F., Pau G., Kranz D., Sklyar O., Budjan C., Steinbrink S., Horn T., Pedal A., Huber W., Boutros M. Clustering phenotype populations by genome-wide RNAi and multiparametric imaging. Mol Syst Biol. 2010;6:370. doi: 10.1038/msb.2010.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Zanella F., Lorens J.B., Link W. High content screening: seeing is believing. Trends Biotechnol. 2010;28:237–245. doi: 10.1016/j.tibtech.2010.02.005. [DOI] [PubMed] [Google Scholar]
18.Moffat J.G., Vincent F., Lee J.A., Eder J., Prunotto M. Opportunities and challenges in phenotypic drug discovery: an industry perspective. Nat Rev Drug Discov. 2017;16:531–543. doi: 10.1038/nrd.2017.111. [DOI] [PubMed] [Google Scholar]
19.Lamb J., Crawford E.D., Peck D., Modell J.W., Blat I.C., Wrobel M.J., Lerner J., Brunet J.P., Subramanian A., Ross K.N. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science (80-) 2006;313:1929–1935. doi: 10.1126/science.1132939. [DOI] [PubMed] [Google Scholar]
20.Schirle M., Jenkins J.L. Identifying compound efficacy targets in phenotypic drug discovery. Drug Discov Today. 2016;21:82–89. doi: 10.1016/j.drudis.2015.08.001. [DOI] [PubMed] [Google Scholar]
21.Schenone M., Dančík V., Wagner B.K., Clemons P.A. Target identification and mechanism of action in chemical biology and drug discovery. Nat Chem Biol. 2013;9:232–240. doi: 10.1038/nchembio.1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Singh S., Carpenter A.E., Genovesio A. Increasing the Content of High-Content Screening. J Biomol Screen. 2014;19:640–650. doi: 10.1177/1087057114528537. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bray M.-A., Singh S., Han H., Davis C.T., Borgeson B., Hartland C., Kost-Alimova M., Gustafsdottir S.M., Gibson C.C., Carpenter A.E. Cell painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nat Protoc. 2016;11:1757–1774. doi: 10.1038/nprot.2016.105. [DOI] [PMC free article] [PubMed] [Google Scholar]; A methods paper providing comprehensive experimental guidelines for the CellPainting assay, an image-based profiling assay established by the Carpenter lab.
24.Fischer B., Sandmann T., Horn T., Billmann M., Chaudhary V., Huber W., Boutros M. A map of directional genetic interactions in a metazoan cell. Elife. 2015;4 doi: 10.7554/eLife.05464. [DOI] [PMC free article] [PubMed] [Google Scholar]
Breinig M., Klein F.A., Huber W., Boutros M. A chemical-genetic interaction map of small molecules using high-throughput imaging in cancer cells. Mol Syst Biol. 2015;11:846. doi: 10.15252/msb.20156400. [DOI] [PMC free article] [PubMed] [Google Scholar]; A library of 1280 pharmacologically active small molecules was profiled in a panel of 12 isogenic human cancer cell lines to derive a multiparametric chemical-genetic interaction map to predict small molecule off-targets and to hypothesize drug repurposing possibilities. The imaging data can be browsed at http://dedomena.embl.de/PGPC/.
26.Bray M.-A., Gustafsdottir S.M., Rohban M.H., Singh S., Ljosa V., Sokolnicki K.L., Bittker J.A., Bodycombe N.E., Dančík V., Hasaka T.P. A dataset of images and morphological profiles of 30 000 small-molecule treatments using the cell painting assay. Gigascience. 2017;6:1–5. doi: 10.1093/gigascience/giw014. [DOI] [PMC free article] [PubMed] [Google Scholar]
Perlman Z.E., Slack M.D., Feng Y., Mitchison T.J., Wu L.F., Altschuler S.J. Multidimensional drug profiling by automated microscopy. Science. 2004;306:1194–1198. doi: 10.1126/science.1100709. [DOI] [PubMed] [Google Scholar]; This reference reports the first large-scale small molecule profiling study in human cells to connect drug mechanisms of action with image-based profiling.
Young D.W., Bender A., Hoyt J., McWhinnie E., Chirn G.-W., Tao C.Y., Tallarico J.A., Labow M., Jenkins J.L., Mitchison T.J. Integrating high-content screening and ligand-target prediction to identify mechanism of action. Nat Chem Biol. 2008;4:59–68. doi: 10.1038/nchembio.2007.53. [DOI] [PubMed] [Google Scholar]; The first that showed that similarity between phenotypic profiles can be correlated with chemical structure similarity. In some cases, small changes in chemical structure resulted in drastic phenotypic changes.
29.Gustafsdottir S.M., Ljosa V., Sokolnicki K.L., Wilson J.A., Walpita D., Kemp M.M., Seiler K.P., Carrel H.A., Golu T.R. Multiplex cytological profiling assay to measure diverse cellular states. PLoS One. 2013;8 doi: 10.1371/journal.pone.0080999. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Reisen F., Sauty de Chalon A., Pfeifer M., Zhang X., Gabriel D., Selzer P. Linking phenotypes and modes of action through high-content screen fingerprints. Assay Drug Dev Technol. 2015;13:415–427. doi: 10.1089/adt.2015.656. [DOI] [PubMed] [Google Scholar]
31.Cacace E., Kritikos G., Typas A. Chemical genetics in drug discovery. Curr Opin Syst Biol. 2017;4:35–42. doi: 10.1016/j.coisb.2017.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Gibson C.C., Zhu W., Davis C.T., Bowman-Kirigin J.A., Chan A.C., Ling J., Walker A.E., Goitre L., Monache S.D., Retta S.F. Strategy for identifying repurposed drugs for the treatment of cerebral cavernous malformation. Circulation. 2015;131:289–299. doi: 10.1161/CIRCULATIONAHA.114.010403. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Schulze C.J., Bray W.M., Woerhmann M.H., Stuart J., Lokey R.S., Linington R.G. “Function-First” lead discovery: mode of action profiling of natural product libraries using image-based screening. Chem Biol. 2013;20:285–295. doi: 10.1016/j.chembiol.2012.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Wawer M.J., Li K., Gustafsdottir S.M., Ljosa V., Bodycombe N.E., Marton M.A., Sokolnicki K.L., Bray M.-A., Kemp M.M., Winchester E. Toward performance-diverse small-molecule libraries for cell-based phenotypic screening using multiplexed high-dimensional profiling. Proc Natl Acad Sci USA. 2014;111:10911–10916. doi: 10.1073/pnas.1410933111. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Grimm F.A., Iwata Y., Sirenko O., Bittner M., Rusyn I. High-content assay multiplexing for toxicity screening in induced pluripotent stem cell-derived cardiomyocytes and hepatocytes. Assay Drug Dev Technol. 2015;13:529–546. doi: 10.1089/adt.2015.659. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Sirenko O., Hancock M.K., Hesley J., Hong D., Cohen A., Gentry J., Carlson C.B., Mann D.A. Phenotypic characterization of toxic compound effects on liver spheroids derived from iPSC using confocal imaging and three-dimensional image analysis. Assay Drug Dev Technol. 2016;14:381–394. doi: 10.1089/adt.2016.729. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Woehrmann M.H., Bray W.M., Durbin J.K., Nisam S.C., Michael A.K., Glassey E., Stuart J.M., Lokey R.S. Large-scale cytological profiling for functional analysis of bioactive compounds. Mol Biosyst. 2013;9:2604. doi: 10.1039/c3mb70245f. [DOI] [PubMed] [Google Scholar]
38.Ochoa J.L., Bray W.M., Lokey R.S., Linington R.G. Phenotype-guided natural products discovery using cytological profiling. J Nat Prod. 2015;78:2242–2248. doi: 10.1021/acs.jnatprod.5b00455. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaufmann M., Schuffenhauer A., Fruh I., Klein J., Thiemeyer A., Rigo P., Gomez-Mancilla B., Heidinger-Millot V., Bouwmeester T., Schopfer U. High-throughput screening using iPSC-derived neuronal progenitors to identify compounds counteracting epigenetic gene silencing in fragile X syndrome. J Biomol Screen. 2015;20:1101–1111. doi: 10.1177/1087057115588287. [DOI] [PubMed] [Google Scholar]; By combining image-based assays with iPSC and 3-D cell culture systems those two references demonstrate the potential of image-based small molecule profiling to be applied in disease relevant model systems.
40.Booij T.H., Bange H., Leonhard W.N., Yan K., Fokkelman M., Kunnen S.J., Dauwerse J.G., Qin Y., van de Water B., van Westen G.J.P. High-throughput phenotypic screening of kinase inhibitors to identify drug targets for polycystic kidney disease. SLAS Discov Adv life Sci RD. 2017;22:974–984. doi: 10.1177/2472555217716056. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Oppermann S., Ylanko J., Shi Y., Hariharan S., Oakes C.C., Brauer P.M., Zúñiga-Pflücker J.C., Leber B., Spaner D.E., Andrews D.W. High-content screening identifies kinase inhibitors that overcome venetoclax resistance in activated CLL cells. Blood. 2016;128:934–947. doi: 10.1182/blood-2015-12-687814. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Giaever G., Shoemaker D.D., Jones T.W., Liang H., Winzeler E.A., Astromoff A., Davis R.W. Genomic profiling of drug sensitivities via induced haploinsufficiency. Nat Genet. 1999;21:278–283. doi: 10.1038/6791. [DOI] [PubMed] [Google Scholar]
43.Parsons A.B., Lopez A., Givoni I.E., Williams D.E., Gray C.A., Porter J., Chua G., Sopko R., Brost R.L., Ho C.H. Exploring the mode-of-action of bioactive compounds by chemical-genetic profiling in yeast. Cell. 2006;126:611–625. doi: 10.1016/j.cell.2006.06.040. [DOI] [PubMed] [Google Scholar]
44.Hillenmeyer M.E., Fung E., Wildenhain J., Pierce S.E., Hoon S., Lee W., Proctor M., St. Onge R.P., Tyers M., Koller D. The chemical genomic portrait of yeast: Uncovering a phenotype for all genes. Science (80-) 2008;320:362–365. doi: 10.1126/science.1150021. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Nichols R.J., Sen S., Choo Y.J., Beltrao P., Zietek M., Chaba R., Lee S., Kazmierczak K.M., Lee K.J., Wong A. Phenotypic landscape of a bacterial cell. Cell. 2011;144:143–156. doi: 10.1016/j.cell.2010.11.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Lee A.Y., St. Onge R.P., Proctor M.J., Wallace I.M., Nile A.H., Spagnuolo P.A., Jitkova Y., Gronda M., Wu Y., Kim M.K. Mapping the cellular response to small molecules using chemogenomic fitness signatures. Science (80-) 2014;344 doi: 10.1126/science.1250217. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Nijman S.M.B. Functional genomics to uncover drug mechanism of action. Nat Chem Biol. 2015;11:942–948. doi: 10.1038/nchembio.1963. [DOI] [PubMed] [Google Scholar]
Piotrowski J.S., Li S.C., Deshpande R., Simpkins S.W., Nelson J., Yashiroda Y., Barber J.M., Safizadeh H., Wilson E., Okada H. Functional annotation of chemical libraries across diverse biological processes. Nat Chem Biol. 2017;13:982–993. doi: 10.1038/nchembio.2436. [DOI] [PMC free article] [PubMed] [Google Scholar]; A large-scale chemical-genetic study in yeast that used an algorithm (COMPRESS-GI) to predict a diagnostic set of gene-deletion mutant strains based on genome-wide genetic interaction data. This provides a scalable-approach to functionally annotate large chemical libraries which might also be applicable to chemical-genetic studies in human cells.
49.Barretina J., Caponigro G., Stransky N., Venkatesan K., Margolin A.A., Kim S., Wilson C.J., Lehár J., Kryukov G.V., Sonkin D. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483:603–607. doi: 10.1038/nature11003. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Iorio F., Knijnenburg T.A., Vis D.J., Bignell G.R., Menden M.P., Schubert M., Aben N., Gonçalves E., Barthorpe S., Lightfoot H. A landscape of pharmacogenomic interactions in cancer. Cell. 2016;166:740–754. doi: 10.1016/j.cell.2016.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Seashore-Ludlow B., Rees M.G., Cheah J.H., Coko M., Price E.V., Coletti M.E., Jones V., Bodycombe N.E., Soule C.K., Gould J. Harnessing connectivity in a large-scale small-molecule sensitivity dataset. Cancer Discov. 2015;5:1210–1223. doi: 10.1158/2159-8290.CD-15-0235. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Gilbert D.F., Erdmann G., Zhang X., Fritzsche A., Demir K., Jaedicke A., Muehlenberg K., Wanker E.E., Boutros M. A novel multiplex cell viability assay for high-throughput RNAi screening. PLoS One. 2011;6:e28338. doi: 10.1371/journal.pone.0028338. [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Moffat J.G., Rudolph J., Bailey D. Phenotypic screening in cancer drug discovery — past, present and future. Nat Rev Drug Discov. 2014;13:588–602. doi: 10.1038/nrd4366. [DOI] [PubMed] [Google Scholar]
54.Caie P.D., Walls R.E., Ingleston-Orme A., Daya S., Houslay T., Eagle R., Roberts M.E., Carragher N.O. High-content phenotypic profiling of drug response signatures across distinct cancer cells. Mol Cancer Ther. 2010;9:1913–1926. doi: 10.1158/1535-7163.MCT-09-1148. [DOI] [PubMed] [Google Scholar]
55.Warchal S.J., Dawson J.C., Carragher N.O. Development of the theta comparative cell scoring method to quantify diverse phenotypic responses between distinct cell types. Assay Drug Dev Technol. 2016;14:395–406. doi: 10.1089/adt.2016.730. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Horvath P., Wild T., Kutay U., Csucs G. Machine learning improves the precision and robustness of high-content screens. J Biomol Screen. 2011;16:1059–1067. doi: 10.1177/1087057111414878. [DOI] [PubMed] [Google Scholar]
57.Grys B.T., Lo D.S., Sahin N., Kraus O.Z., Morris Q., Boone C., Andrews B.J. Machine learning and computer vision approaches for phenotypic profiling. J Cell Biol. 2017;216:65–71. doi: 10.1083/jcb.201610026. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Sommer C., Gerlich D.W. Machine learning in cell biology – teaching computers to recognize phenotypes. J Cell Sci. 2013;126:5529–5539. doi: 10.1242/jcs.123604. [DOI] [PubMed] [Google Scholar]
Caicedo J.C., Cooper S., Heigwer F., Warchal S., Qiu P., Molnar C., Vasilevich A.S., Barry J.D., Bansal H.S., Kraus O. Data-analysis strategies for image-based cell profiling. Nat Methods. 2017;14:849–863. doi: 10.1038/nmeth.4397. [DOI] [PMC free article] [PubMed] [Google Scholar]; A community paper that was published following a Hackathon at the Broad Institute in 2016 with contributions from 20 laboratories assembled best practice guidelines for the analysis of image-based profiling data sets.
Ljosa V., Caie P.D., ter Horst R., Sokolnicki K.L., Jenkins E.L., Daya S., Roberts M.E., Jones T.R., Singh S., Genovesio A. Comparison of methods for image-based profiling of cellular morphological responses to small-molecule treatment. J Biomol Screen. 2013;18:1321–1329. doi: 10.1177/1087057113503553. [DOI] [PMC free article] [PubMed] [Google Scholar]; In this study various computational methods to derive morphological profiles were benchmarked by classifying small molecules by their mechanism of action.
61.Reisen F., Zhang X., Gabriel D., Selzer P. Benchmarking of multivariate similarity measures for high-content screening fingerprints in phenotypic drug discovery. J Biomol Screen. 2013;18:1284–1297. doi: 10.1177/1087057113501390. [DOI] [PubMed] [Google Scholar]
62.Eisen M.B., Spellman P.T., Brown P.O., Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998;95:14863–14868. doi: 10.1073/pnas.95.25.14863. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Schuldiner M., Collins S.R., Thompson N.J., Denic V., Bhamidipati A., Punna T., Ihmels J., Andrews B., Boone C., Greenblatt J.F. Exploration of the Function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell. 2005;123:507–519. doi: 10.1016/j.cell.2005.08.031. [DOI] [PubMed] [Google Scholar]
64.de Groot R., Lüthi J., Lindsay H., Holtackers R., Pelkmans L. Large-scale image-based profiling of single-cell phenotypes in arrayed CRISPR-Cas9 gene perturbation screens. Mol Syst Biol. 2018;14:e8064. doi: 10.15252/msb.20178064. [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Bakal C., Aach J., Church G., Perrimon N. Quantitative morphological signatures define local signaling networks regulating cell morphology. Science (80-) 2007;316:1753–1756. doi: 10.1126/science.1140324. [DOI] [PubMed] [Google Scholar]
66.Sutherland J.J., Low J., Blosser W., Dowless M., Engler T.A., Stancato L.F. A robust high-content imaging approach for probing the mechanism of action and phenotypic outcomes of cell-cycle modulators. Mol Cancer Ther. 2011;10:242–254. doi: 10.1158/1535-7163.MCT-10-0720. [DOI] [PubMed] [Google Scholar]
67.Brandl M.B., Pasquier E., Li F., Beck D., Zhang S., Zhao H., Kavallaris M., Wong S.T.C. Computational analysis of image-based drug profiling predicts synergistic drug combinations: Applications in triple-negative breast cancer. Mol Oncol. 2014;8:1548–1560. doi: 10.1016/j.molonc.2014.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Booij T.H., Klop M.J.D., Yan K., Szántai-Kis C., Szokol B., Orfi L., van de Water B., Keri G., Price L.S. Development of a 3D tissue culture–based high-content screening platform that uses phenotypic profiling to discrimainate selective inhibitors of receptor tyrosine kinases. J Biomol Screen. 2016;21:912–922. doi: 10.1177/1087057116657269. [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Loo L.H., Wu L.F., Altschuler S.J. Image-based multivariate profiling of drug responses from single cells. Nat Methods. 2007;4:445–453. doi: 10.1038/nmeth1032. [DOI] [PubMed] [Google Scholar]
70.Kandaswamy C., Silva L.M., Alexandre L.A., Santos J.M. High-content analysis of breast cancer using single-cell deep transfer learning. J Biomol Screen. 2016;21:252–259. doi: 10.1177/1087057115623451. [DOI] [PubMed] [Google Scholar]
71.Kraus O.Z., Ba J.L., Frey B.J. Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics. 2016;32:i52–i59. doi: 10.1093/bioinformatics/btw252. [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Dürr O., Sick B. Single-cell phenotype classification using deep convolutional neural networks. J Biomol Screen. 2016;21:998–1003. doi: 10.1177/1087057116631284. [DOI] [PubMed] [Google Scholar]
73.Pawlowski N., Caicedo J.C., Singh S., Carpenter A.E., Storkey A. Automating morphological profiling with generic deep convolutional networks. bioRxiv. 2016 [Google Scholar]
74.Godinez W.J., Hossain I., Lazic S.E., Davies J.W., Zhang X. A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics. 2017;33:2010–2019. doi: 10.1093/bioinformatics/btx069. [DOI] [PubMed] [Google Scholar]
75.Ando D.M., McLean C., Berndl M. Improving phenotypic measurements in high-content imaging screens. bioRxiv. 2017 [Google Scholar]
76.Sommer C., Hoefler R., Samwer M., Gerlich D.W. A deep learning and novelty detection framework for rapid phenotyping in high-content screening. Mol Biol Cell. 2017;28:3428–3436. doi: 10.1091/mbc.E17-05-0333. [DOI] [PMC free article] [PubMed] [Google Scholar]
77.Ounkomol C., Fernandes D.A., Seshamani S., Maleckar M.M., Collman F., Johnson G.R. Three dimensional cross-modal image inference: label-free methods for subcellular structure prediction. bioRxiv. 2017 [Google Scholar]
78.O'Duibhir E., Paris J., Lawson H., Sepulveda C., Shenton D.D., Carragher N.O., Kranc K.R. Machine learning enables live label-free phenotypic screening in three dimensions. Assay Drug Dev Technol. 2018;16:51–63. doi: 10.1089/adt.2017.819. [DOI] [PubMed] [Google Scholar]
79.Simm J., Klambauer G., Arany A., Steijaert M., Wegner J.K., Gustin E., Chupakhin V., Chong Y.T., Vialard J., Buijnsters P. Repurposing high-throughput image assays enables biological activity prediction for drug discovery. Cell Chem Biol. 2018;0 doi: 10.1016/j.chembiol.2018.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
80.Costello J.C., Heiser L.M., Georgii E., Gönen M., Menden M.P., Wang N.J., Bansal M., Ammad-Ud-Din M., Hintsanen P., Khan S.A. A community effort to assess and improve drug sensitivity prediction algorithms. Nat Biotechnol. 2014;32:1202–1212. doi: 10.1038/nbt.2877. [DOI] [PMC free article] [PubMed] [Google Scholar]
81.Pau G., Fuchs F., Sklyar O., Boutros M., Huber W. EBImage-an R package for image processing with applications to cellular phenotypes. Bioinformatics. 2010;26:979–981. doi: 10.1093/bioinformatics/btq046. [DOI] [PMC free article] [PubMed] [Google Scholar]
82.Carpenter A.E., Jones T.R., Lamprecht M.R., Clarke C., Kang I.H., Friman O., Guertin D.A., Chang J.H., Lindquist R.A., Moffat J. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7:R100. doi: 10.1186/gb-2006-7-10-r100. [DOI] [PMC free article] [PubMed] [Google Scholar]
83.Jones T.R., Carpenter A., Golland P. Springer; Berlin, Heidelberg: 2005. Voronoi-based segmentation of cells on image manifolds; pp. 535–543. [Google Scholar]
84.Robinson S., Guyon L., Nevalainen J., Toriseva M., Åkerfelt M., Nees M. Segmentation of image data from complex organotypic 3D models of cancer tissues with markov random fields. PLoS One. 2015;10:e0143798. doi: 10.1371/journal.pone.0143798. [DOI] [PMC free article] [PubMed] [Google Scholar]
85.Uhlmann V., Singh S., Carpenter A.E. CP-CHARM: segmentation-free image classification made accessible. BMC Bioinformatics. 2016;17 doi: 10.1186/s12859-016-0895-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
86.Rajaram S., Pavie B., Wu L.F., Altschuler S.J. PhenoRipper: software for rapidly profiling microscopy images. Nat Methods. 2012;9:635–637. doi: 10.1038/nmeth.2097. [DOI] [PMC free article] [PubMed] [Google Scholar]
87.Angermueller C., Pärnamaa T., Parts L., Stegle O. Deep learning for computational biology. Mol Syst Biol. 2016;12:878. doi: 10.15252/msb.20156651. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib1] 1.Nüsslein-Volhard C., Wieschaus E. Mutations affecting segment number and polarity in Drosophila. Nature. 1980;287:795–801. doi: 10.1038/287795a0. [DOI] [PubMed] [Google Scholar]

[bib2] 2.Sepp K.J., Hong P., Lizarraga S.B., Liu J.S., Mejia L.A., Walsh C.A., Perrimon N. Identification of neural outgrowth genes using genome-wide RNAi. PLoS Genet. 2008;4:e1000111. doi: 10.1371/journal.pgen.1000111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] 3.Kiger A.A., Baum B., Jones S., Jones M.R., Coulson A., Echeverri C., Perrimon N. A functional genomic analysis of cell morphology using RNA interference. J Biol. 2003;2:27. doi: 10.1186/1475-4924-2-27. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] 4.Kamath R.S., Ahringer J. Genome-wide RNAi screening in Caenorhabditis elegans. Methods. 2003;30:313–321. doi: 10.1016/s1046-2023(03)00050-1. [DOI] [PubMed] [Google Scholar]

[bib5] 5.Boutros M., Heigwer F., Laufer C. Microscopy-based high-content screening. Cell. 2015;163:1314–1325. doi: 10.1016/j.cell.2015.11.007. [DOI] [PubMed] [Google Scholar]

[bib6] 6.Mattiazzi Usaj M., Styles E.B., Verster A.J., Friesen H., Boone C., Andrews B.J. High-content screening for quantitative cell biology. Trends Cell Biol. 2016;26:598–611. doi: 10.1016/j.tcb.2016.03.008. [DOI] [PubMed] [Google Scholar]

[bib7] 7.Pegoraro G., Misteli T. High-throughput imaging for the discovery of cellular mechanisms of disease. Trends Genet. 2017;33:604–615. doi: 10.1016/j.tig.2017.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8.Peters J.M., Colavin A., Shi H., Czarny T.L., Larson M.H., Wong S., Hawkins J.S., Lu C.H.S., Koo B.M., Marta E. A comprehensive, CRISPR-based functional analysis of essential genes in bacteria. Cell. 2016;165:1493–1506. doi: 10.1016/j.cell.2016.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Kraus O.Z., Grys B.T., Ba J., Chong Y., Frey B.J., Boone C., Andrews B.J. Automated analysis of high-content microscopy data with deep learning. Mol Syst Biol. 2017;13:924. doi: 10.15252/msb.20177551. [DOI] [PMC free article] [PubMed] [Google Scholar]; A study that used deep learning to classify images of yeast cells according to the subcellular localization of fluorescently labeled proteins. This enabled an automated analysis of large-scale image data sets and further explored transfer learning as a framework to facilitate the adaption of deep learning to diverse biological data sets.

[bib10] 10.Neumann B., Held M., Liebel U., Erfle H., Rogers P., Pepperkok R., Ellenberg J. High-throughput RNAi screening by time-lapse imaging of live human cells. Nat Methods. 2006;3:385–390. doi: 10.1038/nmeth876. [DOI] [PubMed] [Google Scholar]

[bib11] 11.Huisken J., Swoger J., Del Bene F., Wittbrodt J., Stelzer E.H.K. Optical sectioning deep inside live embryos by selective plane illumination microscopy. Science. 2004;305:1007–1009. doi: 10.1126/science.1100035. [DOI] [PubMed] [Google Scholar]

[bib12] 12.Le X., Pugach E.K., Hettmer S., Storer N.Y., Liu J., Wills A.A., DiBiase A., Chen E.Y., Ignatius M.S., Poss K.D. A novel chemical screening strategy in zebrafish identifies common pathways in embryogenesis and rhabdomyosarcoma development. Development. 2013;140:2354–2364. doi: 10.1242/dev.088427. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Bickle M. The beautiful cell: high-content screening in drug discovery. Anal Bioanal Chem. 2010;398:219–226. doi: 10.1007/s00216-010-3788-3. [DOI] [PubMed] [Google Scholar]

[bib14] 14.Swinney D.C., Anthony J. How were new medicines discovered? Nat Rev Drug Discov. 2011;10:507–519. doi: 10.1038/nrd3480. [DOI] [PubMed] [Google Scholar]

[bib15] Caicedo J.C., Singh S., Carpenter A.E. Applications in image-based profiling of perturbations. Curr Opin Biotechnol. 2016;39:134–142. doi: 10.1016/j.copbio.2016.04.003. [DOI] [PubMed] [Google Scholar]; This reference provides a concise review on the various applications of image-based profiling including drug discovery applications.

[bib16] 16.Fuchs F., Pau G., Kranz D., Sklyar O., Budjan C., Steinbrink S., Horn T., Pedal A., Huber W., Boutros M. Clustering phenotype populations by genome-wide RNAi and multiparametric imaging. Mol Syst Biol. 2010;6:370. doi: 10.1038/msb.2010.25. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] 17.Zanella F., Lorens J.B., Link W. High content screening: seeing is believing. Trends Biotechnol. 2010;28:237–245. doi: 10.1016/j.tibtech.2010.02.005. [DOI] [PubMed] [Google Scholar]

[bib18] 18.Moffat J.G., Vincent F., Lee J.A., Eder J., Prunotto M. Opportunities and challenges in phenotypic drug discovery: an industry perspective. Nat Rev Drug Discov. 2017;16:531–543. doi: 10.1038/nrd.2017.111. [DOI] [PubMed] [Google Scholar]

[bib19] 19.Lamb J., Crawford E.D., Peck D., Modell J.W., Blat I.C., Wrobel M.J., Lerner J., Brunet J.P., Subramanian A., Ross K.N. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science (80-) 2006;313:1929–1935. doi: 10.1126/science.1132939. [DOI] [PubMed] [Google Scholar]

[bib20] 20.Schirle M., Jenkins J.L. Identifying compound efficacy targets in phenotypic drug discovery. Drug Discov Today. 2016;21:82–89. doi: 10.1016/j.drudis.2015.08.001. [DOI] [PubMed] [Google Scholar]

[bib21] 21.Schenone M., Dančík V., Wagner B.K., Clemons P.A. Target identification and mechanism of action in chemical biology and drug discovery. Nat Chem Biol. 2013;9:232–240. doi: 10.1038/nchembio.1199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] 22.Singh S., Carpenter A.E., Genovesio A. Increasing the Content of High-Content Screening. J Biomol Screen. 2014;19:640–650. doi: 10.1177/1087057114528537. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Bray M.-A., Singh S., Han H., Davis C.T., Borgeson B., Hartland C., Kost-Alimova M., Gustafsdottir S.M., Gibson C.C., Carpenter A.E. Cell painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nat Protoc. 2016;11:1757–1774. doi: 10.1038/nprot.2016.105. [DOI] [PMC free article] [PubMed] [Google Scholar]; A methods paper providing comprehensive experimental guidelines for the CellPainting assay, an image-based profiling assay established by the Carpenter lab.

[bib24] 24.Fischer B., Sandmann T., Horn T., Billmann M., Chaudhary V., Huber W., Boutros M. A map of directional genetic interactions in a metazoan cell. Elife. 2015;4 doi: 10.7554/eLife.05464. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Breinig M., Klein F.A., Huber W., Boutros M. A chemical-genetic interaction map of small molecules using high-throughput imaging in cancer cells. Mol Syst Biol. 2015;11:846. doi: 10.15252/msb.20156400. [DOI] [PMC free article] [PubMed] [Google Scholar]; A library of 1280 pharmacologically active small molecules was profiled in a panel of 12 isogenic human cancer cell lines to derive a multiparametric chemical-genetic interaction map to predict small molecule off-targets and to hypothesize drug repurposing possibilities. The imaging data can be browsed at http://dedomena.embl.de/PGPC/.

[bib26] 26.Bray M.-A., Gustafsdottir S.M., Rohban M.H., Singh S., Ljosa V., Sokolnicki K.L., Bittker J.A., Bodycombe N.E., Dančík V., Hasaka T.P. A dataset of images and morphological profiles of 30 000 small-molecule treatments using the cell painting assay. Gigascience. 2017;6:1–5. doi: 10.1093/gigascience/giw014. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Perlman Z.E., Slack M.D., Feng Y., Mitchison T.J., Wu L.F., Altschuler S.J. Multidimensional drug profiling by automated microscopy. Science. 2004;306:1194–1198. doi: 10.1126/science.1100709. [DOI] [PubMed] [Google Scholar]; This reference reports the first large-scale small molecule profiling study in human cells to connect drug mechanisms of action with image-based profiling.

[bib28] Young D.W., Bender A., Hoyt J., McWhinnie E., Chirn G.-W., Tao C.Y., Tallarico J.A., Labow M., Jenkins J.L., Mitchison T.J. Integrating high-content screening and ligand-target prediction to identify mechanism of action. Nat Chem Biol. 2008;4:59–68. doi: 10.1038/nchembio.2007.53. [DOI] [PubMed] [Google Scholar]; The first that showed that similarity between phenotypic profiles can be correlated with chemical structure similarity. In some cases, small changes in chemical structure resulted in drastic phenotypic changes.

[bib29] 29.Gustafsdottir S.M., Ljosa V., Sokolnicki K.L., Wilson J.A., Walpita D., Kemp M.M., Seiler K.P., Carrel H.A., Golu T.R. Multiplex cytological profiling assay to measure diverse cellular states. PLoS One. 2013;8 doi: 10.1371/journal.pone.0080999. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] 30.Reisen F., Sauty de Chalon A., Pfeifer M., Zhang X., Gabriel D., Selzer P. Linking phenotypes and modes of action through high-content screen fingerprints. Assay Drug Dev Technol. 2015;13:415–427. doi: 10.1089/adt.2015.656. [DOI] [PubMed] [Google Scholar]

[bib31] 31.Cacace E., Kritikos G., Typas A. Chemical genetics in drug discovery. Curr Opin Syst Biol. 2017;4:35–42. doi: 10.1016/j.coisb.2017.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] 32.Gibson C.C., Zhu W., Davis C.T., Bowman-Kirigin J.A., Chan A.C., Ling J., Walker A.E., Goitre L., Monache S.D., Retta S.F. Strategy for identifying repurposed drugs for the treatment of cerebral cavernous malformation. Circulation. 2015;131:289–299. doi: 10.1161/CIRCULATIONAHA.114.010403. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] 33.Schulze C.J., Bray W.M., Woerhmann M.H., Stuart J., Lokey R.S., Linington R.G. “Function-First” lead discovery: mode of action profiling of natural product libraries using image-based screening. Chem Biol. 2013;20:285–295. doi: 10.1016/j.chembiol.2012.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] 34.Wawer M.J., Li K., Gustafsdottir S.M., Ljosa V., Bodycombe N.E., Marton M.A., Sokolnicki K.L., Bray M.-A., Kemp M.M., Winchester E. Toward performance-diverse small-molecule libraries for cell-based phenotypic screening using multiplexed high-dimensional profiling. Proc Natl Acad Sci USA. 2014;111:10911–10916. doi: 10.1073/pnas.1410933111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] 35.Grimm F.A., Iwata Y., Sirenko O., Bittner M., Rusyn I. High-content assay multiplexing for toxicity screening in induced pluripotent stem cell-derived cardiomyocytes and hepatocytes. Assay Drug Dev Technol. 2015;13:529–546. doi: 10.1089/adt.2015.659. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] 36.Sirenko O., Hancock M.K., Hesley J., Hong D., Cohen A., Gentry J., Carlson C.B., Mann D.A. Phenotypic characterization of toxic compound effects on liver spheroids derived from iPSC using confocal imaging and three-dimensional image analysis. Assay Drug Dev Technol. 2016;14:381–394. doi: 10.1089/adt.2016.729. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] 37.Woehrmann M.H., Bray W.M., Durbin J.K., Nisam S.C., Michael A.K., Glassey E., Stuart J.M., Lokey R.S. Large-scale cytological profiling for functional analysis of bioactive compounds. Mol Biosyst. 2013;9:2604. doi: 10.1039/c3mb70245f. [DOI] [PubMed] [Google Scholar]

[bib38] 38.Ochoa J.L., Bray W.M., Lokey R.S., Linington R.G. Phenotype-guided natural products discovery using cytological profiling. J Nat Prod. 2015;78:2242–2248. doi: 10.1021/acs.jnatprod.5b00455. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Kaufmann M., Schuffenhauer A., Fruh I., Klein J., Thiemeyer A., Rigo P., Gomez-Mancilla B., Heidinger-Millot V., Bouwmeester T., Schopfer U. High-throughput screening using iPSC-derived neuronal progenitors to identify compounds counteracting epigenetic gene silencing in fragile X syndrome. J Biomol Screen. 2015;20:1101–1111. doi: 10.1177/1087057115588287. [DOI] [PubMed] [Google Scholar]; By combining image-based assays with iPSC and 3-D cell culture systems those two references demonstrate the potential of image-based small molecule profiling to be applied in disease relevant model systems.

[bib40] 40.Booij T.H., Bange H., Leonhard W.N., Yan K., Fokkelman M., Kunnen S.J., Dauwerse J.G., Qin Y., van de Water B., van Westen G.J.P. High-throughput phenotypic screening of kinase inhibitors to identify drug targets for polycystic kidney disease. SLAS Discov Adv life Sci RD. 2017;22:974–984. doi: 10.1177/2472555217716056. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] 41.Oppermann S., Ylanko J., Shi Y., Hariharan S., Oakes C.C., Brauer P.M., Zúñiga-Pflücker J.C., Leber B., Spaner D.E., Andrews D.W. High-content screening identifies kinase inhibitors that overcome venetoclax resistance in activated CLL cells. Blood. 2016;128:934–947. doi: 10.1182/blood-2015-12-687814. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] 42.Giaever G., Shoemaker D.D., Jones T.W., Liang H., Winzeler E.A., Astromoff A., Davis R.W. Genomic profiling of drug sensitivities via induced haploinsufficiency. Nat Genet. 1999;21:278–283. doi: 10.1038/6791. [DOI] [PubMed] [Google Scholar]

[bib43] 43.Parsons A.B., Lopez A., Givoni I.E., Williams D.E., Gray C.A., Porter J., Chua G., Sopko R., Brost R.L., Ho C.H. Exploring the mode-of-action of bioactive compounds by chemical-genetic profiling in yeast. Cell. 2006;126:611–625. doi: 10.1016/j.cell.2006.06.040. [DOI] [PubMed] [Google Scholar]

[bib44] 44.Hillenmeyer M.E., Fung E., Wildenhain J., Pierce S.E., Hoon S., Lee W., Proctor M., St. Onge R.P., Tyers M., Koller D. The chemical genomic portrait of yeast: Uncovering a phenotype for all genes. Science (80-) 2008;320:362–365. doi: 10.1126/science.1150021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] 45.Nichols R.J., Sen S., Choo Y.J., Beltrao P., Zietek M., Chaba R., Lee S., Kazmierczak K.M., Lee K.J., Wong A. Phenotypic landscape of a bacterial cell. Cell. 2011;144:143–156. doi: 10.1016/j.cell.2010.11.052. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] 46.Lee A.Y., St. Onge R.P., Proctor M.J., Wallace I.M., Nile A.H., Spagnuolo P.A., Jitkova Y., Gronda M., Wu Y., Kim M.K. Mapping the cellular response to small molecules using chemogenomic fitness signatures. Science (80-) 2014;344 doi: 10.1126/science.1250217. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] 47.Nijman S.M.B. Functional genomics to uncover drug mechanism of action. Nat Chem Biol. 2015;11:942–948. doi: 10.1038/nchembio.1963. [DOI] [PubMed] [Google Scholar]

[bib48] Piotrowski J.S., Li S.C., Deshpande R., Simpkins S.W., Nelson J., Yashiroda Y., Barber J.M., Safizadeh H., Wilson E., Okada H. Functional annotation of chemical libraries across diverse biological processes. Nat Chem Biol. 2017;13:982–993. doi: 10.1038/nchembio.2436. [DOI] [PMC free article] [PubMed] [Google Scholar]; A large-scale chemical-genetic study in yeast that used an algorithm (COMPRESS-GI) to predict a diagnostic set of gene-deletion mutant strains based on genome-wide genetic interaction data. This provides a scalable-approach to functionally annotate large chemical libraries which might also be applicable to chemical-genetic studies in human cells.

[bib49] 49.Barretina J., Caponigro G., Stransky N., Venkatesan K., Margolin A.A., Kim S., Wilson C.J., Lehár J., Kryukov G.V., Sonkin D. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483:603–607. doi: 10.1038/nature11003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] 50.Iorio F., Knijnenburg T.A., Vis D.J., Bignell G.R., Menden M.P., Schubert M., Aben N., Gonçalves E., Barthorpe S., Lightfoot H. A landscape of pharmacogenomic interactions in cancer. Cell. 2016;166:740–754. doi: 10.1016/j.cell.2016.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] 51.Seashore-Ludlow B., Rees M.G., Cheah J.H., Coko M., Price E.V., Coletti M.E., Jones V., Bodycombe N.E., Soule C.K., Gould J. Harnessing connectivity in a large-scale small-molecule sensitivity dataset. Cancer Discov. 2015;5:1210–1223. doi: 10.1158/2159-8290.CD-15-0235. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] 52.Gilbert D.F., Erdmann G., Zhang X., Fritzsche A., Demir K., Jaedicke A., Muehlenberg K., Wanker E.E., Boutros M. A novel multiplex cell viability assay for high-throughput RNAi screening. PLoS One. 2011;6:e28338. doi: 10.1371/journal.pone.0028338. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib53] 53.Moffat J.G., Rudolph J., Bailey D. Phenotypic screening in cancer drug discovery — past, present and future. Nat Rev Drug Discov. 2014;13:588–602. doi: 10.1038/nrd4366. [DOI] [PubMed] [Google Scholar]

[bib54] 54.Caie P.D., Walls R.E., Ingleston-Orme A., Daya S., Houslay T., Eagle R., Roberts M.E., Carragher N.O. High-content phenotypic profiling of drug response signatures across distinct cancer cells. Mol Cancer Ther. 2010;9:1913–1926. doi: 10.1158/1535-7163.MCT-09-1148. [DOI] [PubMed] [Google Scholar]

[bib55] 55.Warchal S.J., Dawson J.C., Carragher N.O. Development of the theta comparative cell scoring method to quantify diverse phenotypic responses between distinct cell types. Assay Drug Dev Technol. 2016;14:395–406. doi: 10.1089/adt.2016.730. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] 56.Horvath P., Wild T., Kutay U., Csucs G. Machine learning improves the precision and robustness of high-content screens. J Biomol Screen. 2011;16:1059–1067. doi: 10.1177/1087057111414878. [DOI] [PubMed] [Google Scholar]

[bib57] 57.Grys B.T., Lo D.S., Sahin N., Kraus O.Z., Morris Q., Boone C., Andrews B.J. Machine learning and computer vision approaches for phenotypic profiling. J Cell Biol. 2017;216:65–71. doi: 10.1083/jcb.201610026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib58] 58.Sommer C., Gerlich D.W. Machine learning in cell biology – teaching computers to recognize phenotypes. J Cell Sci. 2013;126:5529–5539. doi: 10.1242/jcs.123604. [DOI] [PubMed] [Google Scholar]

[bib59] Caicedo J.C., Cooper S., Heigwer F., Warchal S., Qiu P., Molnar C., Vasilevich A.S., Barry J.D., Bansal H.S., Kraus O. Data-analysis strategies for image-based cell profiling. Nat Methods. 2017;14:849–863. doi: 10.1038/nmeth.4397. [DOI] [PMC free article] [PubMed] [Google Scholar]; A community paper that was published following a Hackathon at the Broad Institute in 2016 with contributions from 20 laboratories assembled best practice guidelines for the analysis of image-based profiling data sets.

[bib60] Ljosa V., Caie P.D., ter Horst R., Sokolnicki K.L., Jenkins E.L., Daya S., Roberts M.E., Jones T.R., Singh S., Genovesio A. Comparison of methods for image-based profiling of cellular morphological responses to small-molecule treatment. J Biomol Screen. 2013;18:1321–1329. doi: 10.1177/1087057113503553. [DOI] [PMC free article] [PubMed] [Google Scholar]; In this study various computational methods to derive morphological profiles were benchmarked by classifying small molecules by their mechanism of action.

[bib61] 61.Reisen F., Zhang X., Gabriel D., Selzer P. Benchmarking of multivariate similarity measures for high-content screening fingerprints in phenotypic drug discovery. J Biomol Screen. 2013;18:1284–1297. doi: 10.1177/1087057113501390. [DOI] [PubMed] [Google Scholar]

[bib62] 62.Eisen M.B., Spellman P.T., Brown P.O., Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998;95:14863–14868. doi: 10.1073/pnas.95.25.14863. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] 63.Schuldiner M., Collins S.R., Thompson N.J., Denic V., Bhamidipati A., Punna T., Ihmels J., Andrews B., Boone C., Greenblatt J.F. Exploration of the Function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell. 2005;123:507–519. doi: 10.1016/j.cell.2005.08.031. [DOI] [PubMed] [Google Scholar]

[bib64] 64.de Groot R., Lüthi J., Lindsay H., Holtackers R., Pelkmans L. Large-scale image-based profiling of single-cell phenotypes in arrayed CRISPR-Cas9 gene perturbation screens. Mol Syst Biol. 2018;14:e8064. doi: 10.15252/msb.20178064. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib65] 65.Bakal C., Aach J., Church G., Perrimon N. Quantitative morphological signatures define local signaling networks regulating cell morphology. Science (80-) 2007;316:1753–1756. doi: 10.1126/science.1140324. [DOI] [PubMed] [Google Scholar]

[bib66] 66.Sutherland J.J., Low J., Blosser W., Dowless M., Engler T.A., Stancato L.F. A robust high-content imaging approach for probing the mechanism of action and phenotypic outcomes of cell-cycle modulators. Mol Cancer Ther. 2011;10:242–254. doi: 10.1158/1535-7163.MCT-10-0720. [DOI] [PubMed] [Google Scholar]

[bib67] 67.Brandl M.B., Pasquier E., Li F., Beck D., Zhang S., Zhao H., Kavallaris M., Wong S.T.C. Computational analysis of image-based drug profiling predicts synergistic drug combinations: Applications in triple-negative breast cancer. Mol Oncol. 2014;8:1548–1560. doi: 10.1016/j.molonc.2014.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib68] 68.Booij T.H., Klop M.J.D., Yan K., Szántai-Kis C., Szokol B., Orfi L., van de Water B., Keri G., Price L.S. Development of a 3D tissue culture–based high-content screening platform that uses phenotypic profiling to discrimainate selective inhibitors of receptor tyrosine kinases. J Biomol Screen. 2016;21:912–922. doi: 10.1177/1087057116657269. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib69] 69.Loo L.H., Wu L.F., Altschuler S.J. Image-based multivariate profiling of drug responses from single cells. Nat Methods. 2007;4:445–453. doi: 10.1038/nmeth1032. [DOI] [PubMed] [Google Scholar]

[bib70] 70.Kandaswamy C., Silva L.M., Alexandre L.A., Santos J.M. High-content analysis of breast cancer using single-cell deep transfer learning. J Biomol Screen. 2016;21:252–259. doi: 10.1177/1087057115623451. [DOI] [PubMed] [Google Scholar]

[bib71] 71.Kraus O.Z., Ba J.L., Frey B.J. Classifying and segmenting microscopy images with deep multiple instance learning. Bioinformatics. 2016;32:i52–i59. doi: 10.1093/bioinformatics/btw252. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib72] 72.Dürr O., Sick B. Single-cell phenotype classification using deep convolutional neural networks. J Biomol Screen. 2016;21:998–1003. doi: 10.1177/1087057116631284. [DOI] [PubMed] [Google Scholar]

[bib73] 73.Pawlowski N., Caicedo J.C., Singh S., Carpenter A.E., Storkey A. Automating morphological profiling with generic deep convolutional networks. bioRxiv. 2016 [Google Scholar]

[bib74] 74.Godinez W.J., Hossain I., Lazic S.E., Davies J.W., Zhang X. A multi-scale convolutional neural network for phenotyping high-content cellular images. Bioinformatics. 2017;33:2010–2019. doi: 10.1093/bioinformatics/btx069. [DOI] [PubMed] [Google Scholar]

[bib75] 75.Ando D.M., McLean C., Berndl M. Improving phenotypic measurements in high-content imaging screens. bioRxiv. 2017 [Google Scholar]

[bib76] 76.Sommer C., Hoefler R., Samwer M., Gerlich D.W. A deep learning and novelty detection framework for rapid phenotyping in high-content screening. Mol Biol Cell. 2017;28:3428–3436. doi: 10.1091/mbc.E17-05-0333. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib77] 77.Ounkomol C., Fernandes D.A., Seshamani S., Maleckar M.M., Collman F., Johnson G.R. Three dimensional cross-modal image inference: label-free methods for subcellular structure prediction. bioRxiv. 2017 [Google Scholar]

[bib78] 78.O'Duibhir E., Paris J., Lawson H., Sepulveda C., Shenton D.D., Carragher N.O., Kranc K.R. Machine learning enables live label-free phenotypic screening in three dimensions. Assay Drug Dev Technol. 2018;16:51–63. doi: 10.1089/adt.2017.819. [DOI] [PubMed] [Google Scholar]

[bib79] 79.Simm J., Klambauer G., Arany A., Steijaert M., Wegner J.K., Gustin E., Chupakhin V., Chong Y.T., Vialard J., Buijnsters P. Repurposing high-throughput image assays enables biological activity prediction for drug discovery. Cell Chem Biol. 2018;0 doi: 10.1016/j.chembiol.2018.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib80] 80.Costello J.C., Heiser L.M., Georgii E., Gönen M., Menden M.P., Wang N.J., Bansal M., Ammad-Ud-Din M., Hintsanen P., Khan S.A. A community effort to assess and improve drug sensitivity prediction algorithms. Nat Biotechnol. 2014;32:1202–1212. doi: 10.1038/nbt.2877. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib81] 81.Pau G., Fuchs F., Sklyar O., Boutros M., Huber W. EBImage-an R package for image processing with applications to cellular phenotypes. Bioinformatics. 2010;26:979–981. doi: 10.1093/bioinformatics/btq046. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib82] 82.Carpenter A.E., Jones T.R., Lamprecht M.R., Clarke C., Kang I.H., Friman O., Guertin D.A., Chang J.H., Lindquist R.A., Moffat J. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7:R100. doi: 10.1186/gb-2006-7-10-r100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib83] 83.Jones T.R., Carpenter A., Golland P. Springer; Berlin, Heidelberg: 2005. Voronoi-based segmentation of cells on image manifolds; pp. 535–543. [Google Scholar]

[bib84] 84.Robinson S., Guyon L., Nevalainen J., Toriseva M., Åkerfelt M., Nees M. Segmentation of image data from complex organotypic 3D models of cancer tissues with markov random fields. PLoS One. 2015;10:e0143798. doi: 10.1371/journal.pone.0143798. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib85] 85.Uhlmann V., Singh S., Carpenter A.E. CP-CHARM: segmentation-free image classification made accessible. BMC Bioinformatics. 2016;17 doi: 10.1186/s12859-016-0895-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib86] 86.Rajaram S., Pavie B., Wu L.F., Altschuler S.J. PhenoRipper: software for rapidly profiling microscopy images. Nat Methods. 2012;9:635–637. doi: 10.1038/nmeth.2097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib87] 87.Angermueller C., Pärnamaa T., Parts L., Stegle O. Deep learning for computational biology. Mol Syst Biol. 2016;12:878. doi: 10.15252/msb.20156651. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Machine learning and image-based profiling in drug discovery

Christian Scheeder

Florian Heigwer

Michael Boutros

Abstract

Introduction

Image-based profiling in drug discovery

Figure 1.

Figure 2.

Box 1. Processes in image analysis workflows and machine learning applications.

Figure 3.

Chemical genetics and image-based profiling

Machine learning strategies for image-based profiling

Box 2. Basic principles of machine learning.

Conclusions

Funding

Conflicts of interest

Acknowledgements

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Machine learning and image-based profiling in drug discovery

Christian Scheeder

Florian Heigwer

Michael Boutros

Abstract

Introduction

Image-based profiling in drug discovery

Figure 1.

Figure 2.

Box 1. Processes in image analysis workflows and machine learning applications.

Figure 3.

Chemical genetics and image-based profiling

Machine learning strategies for image-based profiling

Box 2. Basic principles of machine learning.

Conclusions

Funding

Conflicts of interest

Acknowledgements

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases