Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Aug 1.
Published in final edited form as: Curr Opin Biotechnol. 2019 Apr 10;58:129–136. doi: 10.1016/j.copbio.2019.03.001

Beyond bulk: A review of single cell transcriptomics methodologies and applications

Ashwinikumar Kulkarni 1, Ashley G Anderson 1, Devin P Merullo 1, Genevieve Konopka 1,*
PMCID: PMC6710112  NIHMSID: NIHMS1523469  PMID: 30978643

Abstract

Single cell RNA sequencing (scRNA-seq) is a promising approach to study the transcriptomes of individual cells in the brain and the central nervous system (CNS). This technology acts as a bridge between neuroscience, computational biology and systems biology, enabling an unbiased and novel understanding of the cellular composition of the brain and CNS. Gene expression at the single cell resolution is often noisy, sparse, and high-dimensional, creating challenges for computational analysis of such data. In this review, we overview fundamental sample preparation and data analysis processes of scRNA-seq and provide a comparative perspective for analyzing and visualizing these data

Graphical abstract

graphic file with name nihms-1523469-f0001.jpg

Introduction

High-throughput single-cell transcriptomics has provided unprecedented insights into cellular diversity of tissues across diverse organisms. Transcriptomic studies using bulk tissue assume and represent all the cells as a homogeneous material, thereby ignoring the stochasticity of gene expression [13]. Single-cell transcriptomics, however, is able to address questions related to the stochastic nature of gene expression.

In this review, we will focus on a number of technical aspects of single cell transcriptomic profiling, such as various methods of single cell isolation and library preparation, and the incorporation of spatial information with single-cell gene expression profiling. We will also highlight commonalities and differences between single cell and single nuclei RNA-seq. Overall, we will highlight some of the outstanding questions in the field and provide some useful resources for the design and implementation of single cell or single nuclei RNA-seq studies.

Single Cell Isolation and Library Preparation

Methods to isolate single cells for RNA sequencing vary in the number of cells they isolate (high-throughput or low-throughput) and how they select cells (biased or unbiased) [4, 5]. Droplet-based technologies are currently commonly used as a high-throughput, unbiased solution [6]. The three most widely employed platforms are 10X Genomics Chromium, DropSeq, and inDrop [710]. Each of these approaches uses microfluidics to tag individual cells with single beads containing a unique barcode. Each mRNA transcript is also linked with a unique molecular identifier (UMI). This approach results in a matrix containing the absolute number of counts for each transcript in each cell. Each platform has its strength and weaknesses: 10X Genomics Chromium is the most sensitive for detecting the greatest number of transcripts, although it can be cost-prohibitive for large numbers of samples; DropSeq generally is more affordable, but with the tradeoff of slight sensitivity losses by counting fewer transcripts; inDrop may be ideal to detect genes expressed at low levels because of its customizable parameters [7].

Two distinct approaches for generating single-cell sequencing libraries are full-length and tag-based [1113]. Full-length methods provide reads that are derived from across the entire length of genes, improving overall sensitivity. However, full-length library preparations tend to show a bias for longer genes because counts are often missed for shorter genes [14]. Tag-based methods incorporate UMIs, allowing for the identification and quantification of individual transcripts. These tags are added to either the 3’ or 5’ end of transcripts and therefore introduce a read bias to the end that receives the tag, although there is no length bias [14]. While full-length methods achieve more comprehensive read coverage, tag-based methods have become dominant for quantification purposes, especially for larger numbers of cells [1113]. Figure 1 summarizes a generalized single cell transcriptomic experiment.

Figure 1: Schematic diagram of a single cell transcriptomic experiment.

Figure 1:

(A) Single cell gene expression profiling can be carried out from any tissue from humans or model systems. (B) The human brain, as an example, is composed of diverse types of cells, each of which contain gene expression patterns that can be resolved into clusters of cells of similar cell types. (C) A generalized scRNA-seq pipeline from source tissue to sequencing.

Single Cell vs. Single Nuclei

CNS cell types can be morphologically complex, making the isolation of intact cells challenging [1518]. Thus, unbiased neuronal profiling requires the preparation of RNA-seq libraries from freshly-harvested, viable, and intact neurons. Most of the single cell isolation or preparation methods applied to brain tissue make use of harsh enzymatic dissociation steps which can harm neurons and impair neuronal profiling [17, 19, 20]. Several studies have indicated that some cell types are under- or over-represented in the final datasets due to artificial filtering from dissociation steps [21, 22].

Similar issues arise for other tissues in the body where some cell types are more susceptible to the tissue dissociation process, such as the heart [23]. Additionally, isolating intact cells from post-mortem frozen tissue is challenging because cell membranes are easily damaged from mechanical and physical stress during the freeze-thaw process [23, 24]. The isolation of nuclei offers an advantage over cells because nuclei are more resistant to the mechanical and physical stress during freeze-thaw. As an alternative approach to scRNA-seq, single nuclei RNA sequencing (snRNA-seq) has been successfully implemented to study cellular composition in post-mortem human brain [16, 18, 19, 25], mouse hippocampus [26], and other tissues. Several studies have compared the matched single-cells versus single-nuclei, e.g. in mouse 3T3 cells [19] and in mouse somatosensory cortex [17, 20]. snRNA-seq data have a high degree of similarity with whole-cell transcriptomes in terms of the number of detected genes and resolved cell types [17, 20], snRNA-seq datasets showed a higher proportion of intronic reads than scRNA-seq, the detection of more shorter-length genes [17, 20]. An important aspect of snRNA-seq is ensuring that reads that map both to either exons or introns are included as counting intronic reads is required for high resolution cell type identification from snRNA-seq data [20, 27].

Some methods also attempt to impact less damage to cells during dissociation. These approaches are particularly relevant to the expression of immediate early genes, for which transcription changes quickly in response to stimuli. For example, Act-seq minimizes cellular disturbances during isolation and preparation by using the application of actinomyocin D during dissociation to inhibit rapid transcriptional changes [28].

Performance evaluation of a single cell RNA-seq experiment

The technical performance of scRNA-seq data should be evaluated for each experiment, including sensitivity, accuracy, and precision [27, 29, 30]. Sensitivity in terms of scRNA-seq refers to the lowest number of transcripts that can be reproducibly detected. This measurement is highly dependent on the depth of sequencing. It is crucial to understand this aspect of the dataset, as it directly estimates the proportion of RNA being measured in a single cell. Accuracy refers to the degree of agreement between the expression measurements and the actual transcript abundance [27, 29]. Accuracy can be affected by the factors specific to the protocol being used, such as exponential PCR amplification or sequencing bias [29]. Precision regarding scRNA-seq is considered inversely proportional to the technical noise in the RNA-seq measurements. Technical noise is often considered as a coefficient of variation across replicate measurements. Due to the nature of scRNA-seq and snRNA-seq technologies, such that the data generated is generally sparse and may contain many zeros, precision currently remains to be the largest limitation for current scRNA-seq and snRNA-seq protocols [27, 29].

Data Preprocessing

Raw sequencing data, usually in the form of binary base calls (BCL files), are first de-multiplexed using oligonucleotide index sequences (such as chromium i7 index sets used during 10X Genomics library preparation) to generate FASTQ files. FASTQ files then undergo quality control (QC) to identify and remove low quality bases and sequencing adapters from the reads. Following filtering, reads are aligned to a reference genome. Uniquely mapped reads are then assigned to genes using reference annotation followed by counting the number of reads per gene per cell. This count matrix is then used to distinguish real cells from doublets and empty beads. The filtered count matrix ultimately enables downstream processing such as normalization, clustering, cell-type identification, and pseudo-time analysis.

Considering the limitations of using Cell Ranger pipelines, an alternative strategy that is more customizable is to use a combination of tools such as FastQC [FastQC, Babraham Bioinformatics, URL: https://www.bioinformatics.babraham.ac.uk/projects/fastqc], UMI tools [31], STAR [32], featureCounts [33], HTSeq [34], Samtools [35], and zUMIs [36], etc. There are several customized pipelines [Analysis of single cell RNA-seq data, URL: http://hemberg-lab.github.io/scRNA.seq.course; Custom pipeline to preprocess the raw sequencing data for a single cell/nuclei transcriptomic profiling, URL: https://github.com/konopkalab/Pipeline-SingleCellRNASeq] available publicly (Fig. 2) which take advantage of the above-mentioned tools to pre-process the data.

Figure 2: General workflow of scRNA-seq data analysis.

Figure 2:

(A) Raw BCL files containing reads are quality-filtered and de-multiplexed. These reads contain information for the cell barcode, a molecular barcode or UMI, and the cDNA sequence. (B) Using UMI tools [31], one can distinguish real cells from potential background noise. UMI tools estimates these cell barcodes from the data using the knee method. This whitelist of cell barcodes is then used to extract cDNA reads corresponding to the estimated real cells. (C) The next step is to align the reads to the reference genome, for example using STAR [32] and (D) assign reads to the reference annotation gene model using a tool such as featureCounts [33]. (E) UMI tools can further collapse the read counts corresponding to each gene in each cell creating a raw digital expression count matrix. (F) Using the raw count table, clustering pipelines such as Seurat [37, 38] separate the cells into different clusters based on the cell types and (G) facilitate differential gene expression analysis between any given clusters. One can additionally use tools like SCDE [78, 79] which are designed specifically for single cell transcriptional data to calculate differential expression. (H) Additional analyses could include a pseudo-time trajectory of cells using tools such as Velocyto [52] and Monocle [80]. etc.

Clustering and visualization of cellular clusters

Interpreting single-cell transcriptomic data requires a few additional steps after the raw count matrix is ready, which are discussed in publicly-available user tutorials for R-packages and python-based tools [10, 3740]. It remains a fundamentally unclear how to ideally cluster and visualize heterogeneous cellular population into groups of so-called homogeneous “cell types”.

Dimensionality reduction is used in scRNA-seq data processing to allow for downstream analyses such as clustering, visualization and further interpretation. Common algorithms for this include principle component analysis (PCA) [41], multi-dimensional scaling (MDS) [42], Sammon mapping [43], t-distributed stochastic neighbor embedding (t-SNE) [4447], and Isomap [48]. Linear dimensionality reduction methods such as PCA [41] often cannot represent high-dimensional data into low-dimensional data. Among many non-linear methods, t-SNE [4447] is a very popular choice for cluster visualization. Though most of the single cell RNA-seq studies published to date rely on t-SNE [4447] for visualization, several limitations have now become apparent, for example when compared to a more recent method of visualization [4951]. Uniform manifold approximation and projection (UMAP) is a novel algorithm for non-linear dimensionality reduction based on Riemannian geometry and algebraic topology [49]. UMAP offers promising advantages over t-SNE for visualization of single cell RNA-seq data [51]. Furthermore, UMAP also retains and projects the information related to multi-branched trajectories to facilitate pseudo-time analysis [40, 51, 52]. In our unpublished dataset from mouse striatal neurons, UMAP not only facilitated annotating the cells more accurately but also helped visualize the clusters in accordance with their developmental trajectories (Fig. 3).

Figure 3: Comparison of non-linear dimensionality reduction techniques.

Figure 3:

Using the same dataset of 62,778 mouse striatal neurons (unpublished dataset), plotting the cells using t-SNE embeddings (A) does not spatially capture the trajectory of distinct cell types like (B) UMAP. Arrow shows the progression of neural progenitor clusters to fully mature spiny projection neuron clusters that is visible in UMAP, however, no such trajectory can be visually discerned using t-SNE.

Incorporating spatial information

A major drawback to scRNA-seq methods is the loss of spatial information [5356]. The first attempts to identify the spatial locations of genes identified via scRNA-seq that were co-expressed in clusters involved visually labeling individual gene markers through fluorescence in situ hybridization (FISH). In single-molecule fish (smFISH), multiple fluorescent probes are used to characterize distinct cell groups based on the resulting combinations [57]. However, this technique is limited by the number of colors that can be visualized at once. Sequential fluorescence in situ hybridization (seqFISH) resolves this problem by sequentially labeling tissue for different RNA markers [5860]. Fluorescence is washed out after each round of labeling, thereby binarily labelling cell populations for the presence or absence of a particular gene. A unique barcode is then assigned to each cell population. Such combinatorial barcodes generated by sequential labeling enables cell populations to be identified with much greater efficiency and robustness than smFISH. The development of multiplexed error-robust FISH (MERFISH) allows even more RNA species to be labeled with a reduced error rate [6164].

Several other methods to observe spatial gene expression also exist [65]. Transcriptomic scRNA-seq data can be computationally integrated with spatial seqFISH information [66]. Expansion microscopy, which physically increases the space between molecules while preserving spatial integrity, has been used in conjunction with the fluorescent labelling of RNA molecules to visualize gene expression [67, 68]. osmFISH is a recently developed method for labelling gene expression without the use of barcodes, and a chief advantage of it is that the detection of highly-expressed genes does not obscure the detection of lowly-expressed genes [69]. Together, these efforts to preserve the spatial locations of specific cell-types will offer invaluable insight into the organization and functioning of diverse tissues.

Application of single cell transcriptomics in neuroscience

Single-cell transcriptomics has uncovered novel cell-types and elucidated neuronal and glial diversity within various brain regions and across species [7073]. This technology has also enabled the molecular examination of “cell-states” as well as cell-types [71, 74]. Novel methodologies to implement single-cell transcriptomics on both fixed and frozen postmortem human brain tissue using nuclei are providing insight into human brain evolution, development, and function [19]. The application of single-cell transcriptomics within experimental systems modelling human brain development or disease states, such as human brain organoids [75] or animal models [76], can ultimately help illuminate conserved and unique aspects of the human brain development and function.

Using scRNA-seq to study neurological disorders has been a major technical advancement given the enormous diversity of cell-types within the brain. Single-cell transcriptomics has helped to characterize Rett syndrome, an X-linked neurodevelopmental disorder, where random X-chromosome inactivation leads to a mixture of mutant and normal cells within the same patient [76]. A new methodology that identifies single-cell single-nucleotide polymorphism sequencing (SNP-seq) could reliably detect both mutant and normal cells in postmortem human brain tissue [76]. This single-cell transcriptomic tool could be broadly applied to other X-linked neurodevelopmental disorders, such as Fragile-X syndrome or X-linked intellectual disability.

Studies using scRNA-seq in control vs knockout studies in mice have analyzed changes in cellular composition, differentially expressed genes, and pseudo-time or pseudo-differentiation tools at cellular resolution with deletion or manipulation of a given gene [77] (Fig. 4). Given the immense cellular heterogeneity of any given brain region, single-cell transcriptomics allows for the identification of cell-types particularly vulnerable in disease states and for addressing the question of why these cell-types are particularly vulnerable. Another benefit of using single-cell transcriptomic within these comparative approaches is the ability to examine both cell-autonomous and non-cell-autonomous changes. For example, in studies using models to delete a specific gene in distinct cell-types, downstream analyses could examine how disruption of one cell-type alters the function or number of other cell-types.

Figure 4: Example schematic of a comparative single-cell transcriptomic study.

Figure 4:

(A-B) Control vs. experimental cells can be clustered together and visualized using a dimensionality reduction tool. (C-D) Gene expression differences within distinct cell-types between control and experimental cells can be uncovered and cell-type composition within the tissue can be compared across samples. (E) Other scRNA-seq tools, such as pseudo-time analyses, can be examined between control and experimental systems.

Conclusions and Future Challenges

Currently, annotating clustered cells using a handful of known marker genes or using enrichment for other published dataset is a common approach, but this poses a real issue to identify novel cell types hidden in the scRNA-seq datasets. Machine learning approaches combined with literature-based knowledge can potentially resolve this problem. Many collaborative approaches across the globe such as the Human Cell Atlas [https://www.humancellatlas.org] are creating an enormous catalogue to account for every single type of cell in human body. As these data are rapidly increasing, the present challenge is a careful curation, analysis, and annotation of these datasets. Efforts are also needed to overcome the limitations due to the sparse and noisy nature of the dataset, low starting material, biases introduced due to amplification and sequencing, technical variations, and cost. Nevertheless, single cell transcriptomics technology has great potential to contribute to the fields of spatial transcriptomics and personalized medicine.

Highlights.

  • Technologies such as single-cell RNA sequencing have rapidly progressed in recent years.

  • Various methods exist for preparing libraries for sequencing, each with its own limitations and advantages.

  • There are several analytical approaches for analyzing the large amount of data generated by single-cell sequencing experiments.

  • The ability of single-cell RNA sequencing studies to address issues of brain tissue heterogeneity has great potential to transform understanding of how distinct cell populations comprise brain organization and underlie normal and disease-relevant behaviors.

Acknowledgments

Funding

G.K. is a Jon Heighten Scholar in Autism Research at UT Southwestern. This work was supported by the NIH (T32MH076690 to A.A.; DC014702, DC016340, MH102603, MH103517, and NS106447 to G.K.), the James S. McDonnell Foundation 21st Century Science Initiative in Understanding Human Cognition (Scholar Award 220020467), the Simons Foundation (SFARI awards 401220 and 573689), and the Chan Zuckerberg Initiative, an advised fund of Silicon Valley Community Foundation (HCA-A-1704-10747) to G.K.

Footnotes

Conflict of interest statement

Nothing declared.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Raj A and van Oudenaarden A, Nature, nurture, or chance: stochastic gene expression and its consequences. Cell, 2008. 135(2): p. 216–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Munsky B, Neuert G, and van Oudenaarden A, Using Gene Expression Noise to Understand Gene Regulation. Science, 2012. 336: p. 183–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Stegle O, Teichmann SA, and Marioni JC, Computational and analytical challenges in singlecell transcriptomics. Nature Reviews Genetics, 2015. 16: p. 133. [DOI] [PubMed] [Google Scholar]
  • 4.Cristinelli S and Ciuffi A, The use of single-cell RNA-Seq to understand virus–host interactions. Current Opinion in Virology, 2018. 29: p. 39–50. [DOI] [PubMed] [Google Scholar]
  • 5.Gross A, et al. , Technologies for single-cell isolation. International Journal of Molecular Sciences, 2015. 16(8). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Prakadan SM, Shalek AK, and Weitz DA, Scaling by shrinking: Empowering single-cell ‘omics’ with microfluidic devices. Nature Reviews Genetics, 2017. 18: p. 345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zhang X, et al. , Comparative analysis of droplet-based ultra-high-throughput single-cell RNAseq systems. Molecular Cell, 2018.•• This study provides a comparative analysis of results achieved by three different microfluidics platforms (10X Genomics Chromium, DropSeq, and inDrop), highlighting the strengths and limitations of each approach.
  • 8.Zheng GXY, et al. , Massively parallel digital transcriptional profiling of single cells. Nature Communications, 2017. 8: p. 1–12.• The 10X Genomics Chromium platform and Cell Ranger pipeline are used to characterize single cells within the immune system. The authors provide an overview of this method and the utility of its applications.
  • 9.Klein AM, et al. , Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell, 2015. 161(5): p. 1187–1201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Macosko EZ, et al. , Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets. Cell, 2015. 161(5): p. 1202–1214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hedlund E and Deng Q, Single-cell RNA sequencing: Technical advancements and biological applications. Molecular Aspects of Medicine, 2018. 59: p. 36–46. [DOI] [PubMed] [Google Scholar]
  • 12.See P, et al. , A single-cell sequencing guide for immunologists. Frontiers in Immunology, 2018. 9: p. 2425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ziegenhain C, et al. , Comparative analysis of single-cell RNA sequencing methods. Molecular Cell, 2017. 65(4): p. 631–643.e4. [DOI] [PubMed] [Google Scholar]
  • 14.Phipson B, Zappia L, and Oshlack A, Gene length and detection bias in single cell RNA sequencing protocols. F1000Research, 2017. 6: p. 595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Grindberg RV, et al. , RNA-sequencing from single nuclei. Proc Natl Acad Sci U S A, 2013. 110(49): p. 19802–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Krishnaswami SR, et al. , Using single nuclei for RNA-seq to capture the transcriptome of postmortem neurons. Nat Protoc, 2016. 11(3): p. 499–524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lake BB, et al. , A comparative strategy for single-nucleus and single-cell transcriptomes confirms accuracy in predicted cell-type expression from nuclear RNA. Sci Rep, 2017. 7(1): p. 6031.•• Authors provide a very insightful comparison for using single-nuclei RNA-seq over single-cell to study neuronal complexity. This is especially very useful in situations when access to fresh surgical tissue material is limited.
  • 18.Lake BB, et al. , Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain. Science, 2016. 352(6293): p. 1586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Habib N, et al. , Massively parallel single-nucleus RNA-seq with DroNc-seq. Nat Methods, 2017. 14(10): p. 955–958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bakken TE, et al. , Equivalent high-resolution identification of neuronal cell types with single-nucleus and single-cell RNA-sequencing. bioRxiv, 2018.
  • 21.Darmanis S, et al. , A survey of human brain transcriptome diversity at the single cell level. Proc Natl Acad Sci U S A, 2015. 112(23): p. 7285–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tasic B, et al. , Shared and distinct transcriptomic cell types across neocortical areas. Nature, 2018. 563(7729): p. 72–78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Farrant J, Mechanism of Cell Damage During Freezing and Thawing and its Prevention. Nature, 1965. 205: p. 1284. [Google Scholar]
  • 24.McGann LE, Yang H, and Walterson M, Manifestations of cell damage after freezing and thawing. Cryobiology, 1988. 25(3): p. 178–185. [DOI] [PubMed] [Google Scholar]
  • 25.Lake BB, et al. , Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain. Nat Biotechnol, 2018. 36(1): p. 70–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Habib N, et al. , Div-Seq: Single-nucleus RNA-Seq reveals dynamics of rare adult newborn neurons. Science, 2016. 353(6302): p. 925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ziegenhain C, et al. , Comparative Analysis of Single-Cell RNA Sequencing Methods. Mol Cell, 2017. 65(4): p. 631–643 e4. [DOI] [PubMed] [Google Scholar]
  • 28.Wu YE, et al. , Detecting activated cell populations using single-cell RNA-seq. Neuron, 2017. 96(2): p. 313–329.e6. [DOI] [PubMed] [Google Scholar]
  • 29.Wu AR, et al. , Single-Cell Transcriptional Analysis. Annu Rev Anal Chem (Palo Alto Calif), 2017. 10(1): p. 439–462.• This review focuses on almost every aspect of a single-cell RNA-seq experiment. The most interesting fact discussed is regarding the performance evaluation of a single-cell RNA-seq experiment.
  • 30.Saliba A-E, et al. , Single-cell RNA-seq: advances and future challenges. Nucleic Acids Research, 2014. 42(14): p. 8845–8860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Smith TS, Heger A, and Sudbery I, UMI-tools: Modelling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Research, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Dobin A, et al. , STAR: ultrafast universal RNA-seq aligner. Bioinformatics, 2013. 29(1): p. 15–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Liao Y, Smyth GK, and Shi W, featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics, 2014. 30(7): p. 923–930. [DOI] [PubMed] [Google Scholar]
  • 34.Anders S, Pyl PT, and Huber W, HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics, 2015. 31(2): p. 166–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Li H, et al. , The Sequence Alignment/Map format and SAMtools. Bioinformatics, 2009. 25(16): p. 2078–2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Parekh S, et al. , zUMIs - A fast and flexible pipeline to process RNA sequencing data with UMIs. GigaScience, 2018. 7(6): p. giy059–giy059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Butler A, et al. , Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nature Biotechnology, 2018. 36: p. 411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Satija R, et al. , Spatial reconstruction of single-cell gene expression data. Nature Biotechnology, 2015. 33: p. 495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Qiu X, et al. , Single-cell mRNA quantification and differential analysis with Census. Nature Methods, 2017. 14: p. 309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wolf FA, Angerer P, and Theis FJ, SCANPY: large-scale single-cell gene expression data analysis. Genome Biology, 2018. 19(1): p. 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Jolliffe IT, Principal Component Analysis and Factor Analysis, in Springer Series in Statistics. 1986, Springer: New York, NY. [Google Scholar]
  • 42.Cox MAACF, Multidimensional Scaling, in Handbook of Data Visualization. 2008, Springer: Berlin, Heidelberg. [Google Scholar]
  • 43.Sammon JW, A Nonlinear Mapping for Data Structure Analysis. IEEE Transactions on Computers, 1969. C-18(5): p. 401–409. [Google Scholar]
  • 44.van der Maaten L, Accelerating t-SNE using Tree-Based Algorithms. Journal of Machine Learning Research, 2014. 15. [Google Scholar]
  • 45.van der Maaten L and Hinton G, Visualizing non-metric similarities in multiple maps. Machine Learning, 2012. 87(1): p. 33–55. [Google Scholar]
  • 46.van der Maaten L, Learning a Parametric Embedding by Preserving Local Structure. 2009, PMLR. p. 384–391.
  • 47.van der Maaten LH, Geoffrey, Visualizing Data using t-SNE. Journal of Machine Learning Research, 2008. 9. [Google Scholar]
  • 48.Tenenbaum JB, Silva V.d., and Langford JC, A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science, 2000. 290(5500): p. 2319. [DOI] [PubMed] [Google Scholar]
  • 49.McInnes L.a.H., John and Saul, Nathaniel and Grossberger, Lukas, UMAP: Uniform Manifold Approximation and Projection. The Journal of Open Source Software, 2018. 3(29): p. 861. [Google Scholar]
  • 50.Ding J, Condon A, and Shah SP, Interpretable dimensionality reduction of single cell transcriptome data with deep generative models. Nature Communications, 2018. 9(1): p. 2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Becht E, et al. , Evaluation of UMAP as an alternative to t-SNE for single-cell data. bioRxiv, 2018: p. 298430.• Authors have identified UMAP as a better alternative to t-SNE for visualizing single-cell transcriptomic data in order to identify clusters. Use of UMAP is also crucial to study developmental trajectories using single-cell data.
  • 52.La Manno G, et al. , RNA velocity of single cells. Nature, 2018. 560(7719): p. 494–498.•• Studying developmental trajectory is a very unique feature of single cell transcriptomic data. Authors have very systematically made use of the data to extract trajectory information.
  • 53.Lein E, Borm LE, and Linnarsson S, The promise of spatial transcriptomics for neuroscience in the era of molecular cell typing. Science, 2017. 358(6359): p. 64–69.• The authors provide an in-depth review of the history of, and current progress in, attempts to unify spatial information and transcriptomic data.
  • 54.Crosetto N, Bienko M, and van Oudenaarden A, Spatially resolved transcriptomics and beyond. Nature Reviews Genetics, 2015. 16: p. 57. [DOI] [PubMed] [Google Scholar]
  • 55.Zeng H and Sanes JR, Neuronal cell-type classification: challenges, opportunities and the path forward. Nature Reviews Neuroscience, 2017. 18: p. 530. [DOI] [PubMed] [Google Scholar]
  • 56.Yuan G-C, et al. , Challenges and emerging directions in single-cell analysis. Genome Biology, 2017. 18(1): p. 84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Raj A, et al. , Imaging individual mRNA molecules using multiple singly labeled probes. Nature Methods, 2008. 5: p. 877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Shah S, et al. , In situ transcription profiling of single cells reveals spatial organization of cells in the mouse hippocampus. Neuron, 2016. 92(2): p. 342–357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Lubeck E, et al. , Single-cell in situ RNA profiling by sequential hybridization. Nature Methods, 2014. 11: p. 360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lubeck E and Cai L, Single-cell systems biology by super-resolution imaging and combinatorial labeling. Nature Methods, 2012. 9: p. 743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Moffitt JR, et al. , Molecular, spatial, and functional single-cell profiling of the hypothalamic preoptic region. Science, 2018. 362(6416).•• This study uses MERFISH to create a spatially-informed transcriptomic map of single cells in the mouse hypothalamic preoptic region. Furthermore, distinct cell populations are associated with specific social behaviors.
  • 62.Chen KH, et al. , Spatially resolved, highly multiplexed RNA profiling in single cells. Science, 2015. 348(6233). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Moffitt JR and Zhuang X, RNA imaging with multiplexed error-robust fluorescence in situ hybridization (MERFISH), in Methods in Enzymology, Filonov GS and Jaffrey SR, Editors. 2016, Academic Press; p. 1–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Moffitt JR, et al. , High-throughput single-cell gene-expression profiling with multiplexed error-robust fluorescence in situ hybridization. Proceedings of the National Academy of Sciences of the United States of America, 2016. 113(39): p. 11046–11051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Wang X, et al. , Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science, 2018: p. eaat5691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Zhu Q, et al. , Identification of spatially associated subpopulations by combining scRNAseq and sequential fluorescence in situ hybridization data. Nature Biotechnology, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Karagiannis ED and Boyden ES, Expansion microscopy: Development and neuroscience applications. Current Opinion in Neurobiology, 2018. 50: p. 56–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Alon S, Huynh GH, and Boyden ES, Expansion microscopy: Enabling single cell analysis in intact biological systems. The FEBS Journal, 2018. 0(0). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Codeluppi S, et al. , Spatial organization of the somatosensory cortex revealed by osmFISH. Nature Methods, 2018. 15(11): p. 932–935. [DOI] [PubMed] [Google Scholar]
  • 70.Nowakowski TJ, et al. , Spatiotemporal gene expression trajectories reveal developmental hierarchies of the human cortex. Science, 2017. 358(6368): p. 1318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Saunders A, et al. , Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain. Cell, 2018. 174(4): p. 1015–1030.e16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Zeisel A, et al. , Molecular Architecture of the Mouse Nervous System. Cell, 2018. 174(4): p. 999–1014.e22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Zhong S, et al. , A single-cell RNA-seq survey of the developmental landscape of the human prefrontal cortex. Nature, 2018. 555: p. 524. [DOI] [PubMed] [Google Scholar]
  • 74.Hrvatin S, et al. , Single-cell analysis of experience-dependent transcriptomic states in the mouse visual cortex. Nature Neuroscience, 2018. 21(1): p. 120–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Quadrato G, et al. , Cell diversity and network dynamics in photosensitive human brain organoids. Nature, 2017. 545: p. 48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Renthal W, et al. , Characterization of human mosaic Rett syndrome brain tissue by singlenucleus RNA sequencing. Nature Neuroscience, 2018. 21(12): p. 1670–1679.•• These authors develop a new application of single-cell transcriptomics (SNP-seq) for overcoming challenges for characterizing X-linked neurodevelopmental disorders.
  • 77.Zywitza V, et al. , Single-Cell Transcriptomics Characterizes Cell Types in the Subventricular Zone and Uncovers Molecular Defects Impairing Adult Neurogenesis. Cell Reports, 2018. 25(9): p. 2457–2469.e8.•• One of the first studies to use scRNA-seq in comparative analysis; in this case, LRP2 knockout vs. littermate control cells were compared to better understand the cell-types affected by LRP2 ablation and adult neurogenesis.
  • 78.Kharchenko PV, Silberstein L, and Scadden DT, Bayesian approach to single-cell differential expression analysis. Nature Methods, 2014. 11: p. 740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Fan J, et al. , Characterizing transcriptional heterogeneity through pathway and gene set overdispersion analysis. Nature Methods, 2016. 13: p. 241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Trapnell C, et al. , The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nature Biotechnology, 2014. 32: p. 381. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES