Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Apr 21.
Published in final edited form as: Immunity. 2015 Apr 21;42(4):591–592. doi: 10.1016/j.immuni.2015.04.006

Thinking Outside the Gate: Single-Cell Assessments in Multiple Dimensions

Pia Kvistborg 1,19, Cécile Gouttefangeas 2,19, Nima Aghaeepour 3, Angelica Cazaly 4, Pratip K Chattopadhyay 5, Cliburn Chan 6, Judith Eckl 7, Greg Finak 8, Sine Reker Hadrup 9, Holden T Maecker 10, Dominik Maurer 11, Tim Mosmann 12, Peng Qiu 13, Richard H Scheuermann 14,15, Marij JP Welters 16, Guido Ferrari 17, Ryan R Brinkman 3,20,*, Cedrik M Britten 18,20,21,*
PMCID: PMC4824634  NIHMSID: NIHMS752758  PMID: 25902473

Over the last two decades, our ability to interrogate the immune system on a single-cell level has increased dramatically (Chattopadhyay and Roederer, 2012; Bendall et al., 2011), allowing an opportunity to better understand the immunological mechanisms underlying disease. Complex flow cytometry (FCM) data are now surpassing our ability to fully analyze and interpret all information via current standard approaches, such as 2D dot plots and Boolean gates. Indeed, the number of potential cell subpopulations increases exponentially with the number of parameters assessed, making it difficult to decipher all possible combinations included in the raw data (e.g., 512 potential subsets with nine markers) via the traditional approaches (Bendall and Nolan, 2012). This could limit the translation of technical advances into new diagnostics or therapies. Newly developed bioinformatics tools that have the potential to bridge this gap are now available. The aim of this letter is to foster the implementation and adoption of these novel computational methodologies for unbiased analysis of complex cytometry data.

In recent years, a host of new data-analysis tools have emerged, creating workflows for processing and analyzing complex FCM datasets; however, these have gone mostly unnoticed by immunologists. Table S1 provides an overview of many of the currently available tools and their specific applications. They can be assigned to specific categories arranged in a “FCM data-analysis workflow” from compensated data as input to biologically interpretable results as output. The vast majority of the listed tools for FCM data processing, analysis, and visualization are made available by the bioinformaticians at no cost and include open source code and unrestrictive software licensing, opening up these computational approaches to broad use by the research community. Many of the tools have been developed to address similar analysis objectives via quite different approaches. They might provide optimal results for different datasets, such that there is no “right” or “best” tool, and using several algorithms in combination might yield even better results and exceed the possibilities offered by manual analysis. Comprehensive comparative studies by the Flow Cytometry: Critical Assessment of Population Identification Methods (FlowCAP) project have shown that many of these tools have reached a level of maturity that matches, or even surpasses, the results produced by human experts (Aghaeepour et al., 2013).

The development of computational approaches addresses many needs associated with high-dimensional datasets. However, for the immunology community, three main challenges have surfaced, and tackling them will facilitate a paradigm shift in the analysis of FCM data. First, despite the focused efforts by bioinformaticians to develop novel tools for analyzing FCM data, only a minority of immunologists are aware of the advantages offered to the field. These tools need to be presented in immunology forums rather than limited to bioinformatics journals and conferences. Second, even though the vast majority of the computational tools are open source and thus freely available, most do not have user-friendly interfaces, limiting their use to investigators with programming expertise. Third, as a consequence of the first two challenges, there is a lack of general understanding of how these novel tools work. This has two opposing effects. In some cases, skepticism increases because of a feeling that direct control of analysis has been lost and that results are unverifiable. In other cases, overconfidence occurs, and no real effort is made to validate results. This can lead to the reporting of “significant” cell populations that actually arise from experimental artifact (and quality-control issues).

These are challenges that must be addressed, both in terms of generalizable strategies and within each individual experiment, with the goals of broader adoption and more accurate results.

Inter-disciplinary collaborations between immunologists and bioinformaticians might help address these points, as demonstrated by pioneering collaborations that identified immunological correlates of HIV protection in high-dimensional FCM data (Aghaeepour et al., 2012). Such collaborations could also be implemented within institutions and research groups by convening bioinformaticians and immunologists or by team members trained in both immunology and bioinformatics. Hence, inter-disciplinary collaboration should be encouraged as soon as a study is conceived and should continue through the entire study (from wet bench experimentation to final data analysis to publication).

Another, possibly game-changing solution is to develop user-friendly web or computer interfaces that would allow immunologists with little bioinformatics background to rationally combine the available tools and run datasets through different workflows to achieve optimal results. Work on making this model a reality is ongoing. The FLOCK algorithm (Qian et al., 2010) has been implemented in the Immunology Database and Analysis Portal (https://immport.niaid.nih.gov), which supports management of FCM data, cell-population identification, cross-sample comparison, and result visualization with a simple user interface. Also, a comprehensive suite of tools for processing and analyzing FCM data has been implemented within the GenePattern infrastructure (Spidlen et al., 2013). Finally, the OpenCyto framework provides open-source tools for analyzing FCM data within an extensible and flexible interface to simplify the construction of re-usable FCM workflows while facilitating comparative analysis against manually gated results in order to enhance user confidence (Finak et al., 2014).

Progress in cytometry technology generates complex datasets for which exhaustive analysis by existing practices is difficult. Solutions for deciphering multi-dimensional FCM and mass cytometry datasets exist but have not yet reached most immunologists. Here, we describe a list of available computational tools with the aim of enhancing awareness, access, and acceptance and discuss models to bridge the existing gap between immunology and bioinformatics. We predict that interdisciplinary efforts to address the current data-analysis bottlenecks will rapidly enhance our knowledge of the immune system, guide immunotherapy development, accelerate biomarker discovery, and ultimately benefit patients.

Supplementary Material

Supplement

Footnotes

SUPPLEMENTAL INFORMATION

Supplemental Information includes Supplemental Acknowledgments and one table and can be found with this article online at http://dx.doi.org/10.1016/j.immuni.2015.04.006.

References

  1. Aghaeepour N, Jalali A, O’Neill K, Chattopadhyay PK, Roederer M, Hoos HH, Brinkman RR. Cytometry A. 2012;81:1022–1030. doi: 10.1002/cyto.a.22209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Aghaeepour N, Finak G, Hoos H, Mosmann TR, Brinkman R, Gottardo R, Scheuermann RH, FlowCAP Consortium; DREAM Consortium Nat Methods. 2013;10:228–238. [Google Scholar]
  3. Bendall SC, Nolan GP. Nat Biotechnol. 2012;30:639–647. doi: 10.1038/nbt.2283. [DOI] [PubMed] [Google Scholar]
  4. Bendall SC, Simonds EF, Qiu P, Amir AD, Krutzik PO, Finck R, Bruggner RV, Melamed R, Trejo A, Ornatsky OI, et al. Science. 2011;332:687–696. doi: 10.1126/science.1198704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chattopadhyay PK, Roederer M. Methods. 2012;57:251–258. doi: 10.1016/j.ymeth.2012.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Finak G, McDavid A, Chattopadhyay P, Dominguez M, De Rosa S, Roederer M, Gottardo R. Biostatistics. 2014;15:87–101. doi: 10.1093/biostatistics/kxt024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Qian Y, Wei C, Eun-Hyung Lee F, Campbell J, Halliley J, Lee JA, Cai J, Kong YM, Sadat E, Thomson E, et al. Cytometry B Clin Cytom. 2010;78(1):S69–S82. doi: 10.1002/cyto.b.20554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Spidlen J, Barsky A, Breuer K, Carr P, Nazaire MD, Hill BA, Qian Y, Liefeld T, Reich M, Mesirov JP, et al. Source Code Biol Med. 2013;8:14. doi: 10.1186/1751-0473-8-14. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement

RESOURCES