Skip to main content
eLife logoLink to eLife
. 2021 Jul 22;10:e65445. doi: 10.7554/eLife.65445

Molecular characterization of projection neuron subtypes in the mouse olfactory bulb

Sara Zeppilli 1,2, Tobias Ackels 3,4,, Robin Attey 1,, Nell Klimpert 1, Kimberly D Ritola 5, Stefan Boeing 6,7, Anton Crombach 8,9, Andreas T Schaefer 3,4,, Alexander Fleischmann 1,2,
Editors: Naoshige Uchida10, Catherine Dulac11
PMCID: PMC8352594  PMID: 34292150

Abstract

Projection neurons (PNs) in the mammalian olfactory bulb (OB) receive input from the nose and project to diverse cortical and subcortical areas. Morphological and physiological studies have highlighted functional heterogeneity, yet no molecular markers have been described that delineate PN subtypes. Here, we used viral injections into olfactory cortex and fluorescent nucleus sorting to enrich PNs for high-throughput single nucleus and bulk RNA deep sequencing. Transcriptome analysis and RNA in situ hybridization identified distinct mitral and tufted cell populations with characteristic transcription factor network topology, cell adhesion, and excitability-related gene expression. Finally, we describe a new computational approach for integrating bulk and snRNA-seq data and provide evidence that different mitral cell populations preferentially project to different target regions. Together, we have identified potential molecular and gene regulatory mechanisms underlying PN diversity and provide new molecular entry points into studying the diverse functional roles of mitral and tufted cell subtypes.

Research organism: Mouse

Introduction

The mammalian olfactory system is unique among sensory systems in that it bypasses the thalamus: olfactory receptor neurons (ORNs) in the nose project to the olfactory bulb (OB), a forebrain structure containing – in the mouse – approximately 500,000 neurons per hemisphere (Parrish-Aungst et al., 2007). There, they synapse onto various interneurons and projection neurons. The latter directly project to a variety of cortical structures, including the anterior olfactory nucleus, piriform cortex, cortical amygdala, and the lateral entorhinal cortex (Ghosh et al., 2011; Haberly and Price, 1977; Miyamichi et al., 2011; Sosulski et al., 2011). This places OB projection neurons at a pivotal position to distribute processed olfactory information broadly across the brain.

Each ORN in the mouse expresses only one of approximately 1000 olfactory receptor genes (Buck and Axel, 1991; Yoshihito, 2012; Zhang and Firestein, 2002). ORNs expressing the same receptor project axons onto defined spherical structures, glomeruli (Mori and Sakano, 2011), containing a variety of neuropil including the apical dendrites of 10–50 projection neurons (Bartel et al., 2015; Schwarz et al., 2018). Historically, OB projection neurons have been divided into mitral and tufted cells (MCs, TCs), largely based on their soma location and dendritic and axonal projection pattern (Figure 1—figure supplement 1Haberly and Price, 1977; Imamura et al., 2020; Mori et al., 1983; Orona et al., 1984): MC somata are located predominantly in a thin layer with their dendrites covering the deeper part of the OB external plexiform layer. Their axons project to a wide range of structures including posterior piriform cortex. TC axons, on the other hand, are restricted to more anterior forebrain structures and their cell bodies are distributed across the external plexiform layer, with dendrites largely restricted to superficial layers. Within the TC population, subdivisions have been made into deep, middle, superficial, and external TCs, largely based on soma position. MCs on the other hand are often morphologically described as a largely homogeneous population. However, branching patterns of lateral dendrites as well as soma size and apical dendrite length might allow for further subdivision (Mouradian and Scott, 1988; Orona et al., 1984; Schwarz et al., 2018). Moreover, projection patterns might differ based on soma position along the dorsomedial–ventrolateral axis of the OB (Inokuchi et al., 2017; Chen et al., 2021).

In parallel to this morphological diversity, numerous studies have described physiological heterogeneity both as a result of differential inputs from granule cells onto TCs and MCs (Christie et al., 2001; Ezeh et al., 1993; Geramita et al., 2016; Phillips et al., 2012) as well as intrinsic excitability and glomerular wiring (Burton and Urban, 2014; Gire et al., 2019). Consequently, TCs respond more readily, with higher peak firing rates, and to lower odor concentration in vivo (Griff et al., 2008; Kikuta et al., 2013; Nagayama et al., 2014), and earlier in the respiration cycle compared to MCs (Ackels et al., 2020; Fukunaga et al., 2012; Igarashi et al., 2012; Jordan et al., 2018; Phillips et al., 2012).

Within the TC and MC populations, biophysical heterogeneity has been more difficult to tie to specific cell types or subtypes. MCs show diversity in biophysical properties that is thought to aid efficient encoding of stimulus-specific information and is, at least in part, experience dependent (Angelo et al., 2012; Burton et al., 2012; Padmanabhan and Urban, 2010; Tripathy et al., 2013). Both in vivo and in vitro recordings suggest that a subset of MCs show regular firing, while others show ‘stuttering’ behavior characterized by irregular action potential clusters (Angelo et al., 2012; Balu et al., 2004; Bathellier et al., 2008; Buonviso et al., 2003; Carey and Wachowiak, 2011; Desmaisons et al., 1999; Fadool et al., 2011; Friedman and Strowbridge, 2000; Margrie and Schaefer, 2003; Padmanabhan and Urban, 2010; Schaefer et al., 2006). While TCs are heterogeneous, with, for example, external TCs displaying prominent rhythmic bursting, driving the glomerular circuitry into long-lasting depolarizations in vitro (De Saint Jan et al., 2009; Gire and Schoppa, 2009; Gire et al., 2019; Najac et al., 2011), a systematic assessment of biophysical variety is lacking so far. Moreover, differential centrifugal input from cortical and subcortical structures might further amplify this overall heterogeneity both between MCs and TCs as well as potentially within those different classes (Boyd et al., 2012; Kapoor et al., 2016; Markopoulos et al., 2012; Niedworok et al., 2012; Otazu et al., 2015).

Thus, anatomical projection patterns, in vivo odor responses, and intrinsic properties are known to show substantial variability across different projection neurons. Systematic investigation of different projection neurons, however, has been hampered by a scarcity of specific molecular tools. Interneuron diversity, on the other hand, in general has received considerable attention with numerous studies including in the OB (Parrish-Aungst et al., 2007; Tavakoli et al., 2018), aiming to provide a systematic assessment of morphology, physiology, chemotype and the basis for genetic targeting of distinct types of interneurons. For OB projection neurons, however, only little information about chemotypes (Kiyokage et al., 2010) is available at this point: Cdhr1 and Tbx21 Haddad et al., 2013; Nagai et al., 2005 have been shown to be selectively expressed in a subset of OB projection neurons. CCK distinguishes a subset of TCs (superficial TCs, [Liu and Shipley, 1994; Seroogy et al., 1985; Short and Wachowiak, 2019; Sun et al., 2020]). Vasopressin-expressing cells might constitute a further subset of superficial TCs (Lukas et al., 2019), and recently, the Lbhd2 gene has been used to obtain more specific genetic access to MCs (Koldaeva et al., 2021). Heterogeneous expression of both the GABAa receptor and voltage-gated potassium channel subunits have been observed (Padmanabhan and Urban, 2010; Panzanelli et al., 2005), albeit not linked to specific cell types. Expression of axon guidance molecules such as Nrp2 might further allow subdivision of projection neurons across the OB (Inokuchi et al., 2017).

Hence, while some molecular markers can be used to define specific subsets of projection neurons, this description is far from complete. A comprehensive molecular definition of projection neuron types would help to classify and collate existing biophysical, morphological, and physiological data and delineate the distinct output streams of the OB. Moreover, it would provide a platform upon which further focused experimental approaches could be tied.

Single-cell or single-nucleus RNA sequencing has been used effectively to map out cell types across a variety of brain areas (Macosko et al., 2015; Zeisel et al., 2018), including inhibitory interneurons in the mouse OB (Tepe et al., 2018). As M/TCs constitute only ~10% of all OB neurons, we decided to enrich for projection neurons for single-nucleus (sn)RNA-seq. We then combined snRNA-seq with bulk RNA deep seq as well as additional snRNA-seq for OB neurons projecting to different cortical areas, thereby allowing us to disentangle different projection neuron classes by target area. We found that indeed both MCs and TCs fall into several, separable types, defined by expression of both common and overlapping gene regulatory networks. This work will therefore provide a molecular entry point into disentangling the diversity of OB projection neurons and defining the functional roles of different MC/TC types.

Results

Single-nucleus RNA sequencing of olfactory bulb projection neurons distinguishes mitral and tufted cell types

To characterize the molecular diversity of OB projection neurons, we used viral targeting and fluorescence-activated nuclei sorting (FANS) to enrich for OB projection neurons and characterized their transcriptomes using single-nucleus RNA sequencing (snRNA-seq) (Figure 1A, Figure 1—figure supplement 2A,B).

Figure 1. Comprehensive molecular profiling of olfactory bulb projection neurons.

(A) Schematic representation of experimental design. Top: after injection of rAAVretro-CAG-H2B-GFP into PCx and AON, single nuclei were dissociated from three mice (single nuclei (sn) R1,2,3: replicates 1,2,3) and sorted using fluorescence-activated nuclei sorting (FANS). The population of nuclei is selected based on GFP and DRAQ5 (far-red fluorescent DNA dye). See Figure 1—figure supplement 2 for detailed FANS plots. Sorted nuclei were sequenced using 10x single-nucleus RNA-seq. Middle and bottom: after injection of rAAVretro-CAG-H2B-GFP into PCx (middle) or AON (bottom), single nuclei were dissociated from three mice for each injection site and sorted using FANS (as described above and Figure 1—figure supplement 2). RNA extracted from sorted nuclei was prepared and sequenced using bulk RNA deep sequencing. PCx: Piriform Cortex; AON: Anterior Olfactory Nucleus; R: replicate; GRN: Gene Regulatory Network. (B) Representative coronal sections and high-magnification images showing GFP expression (in green) in the main olfactory bulb after injection of rAAVretro-CAG-H2B-GFP into PCx and AON (top), PCx only (middle), and AON only (bottom). Injection of the virus into PCx and AON resulted in GFP-expressing nuclei located in the mitral cell (empty arrowheads), external plexiform, glomerular (white arrowheads), and granule cell layers; injection into PCx resulted in GFP-expressing nuclei located in the mitral cell layer (empty arrowheads); injection into AON resulted in GFP-expressing nuclei located in the external plexiform and glomerular layers (white arrowheads) and granule cell layers. GL: glomerular layer; EPL: external plexiform layer; MCL: mitral cell layer; GCL: granule cell layer. Neurotrace counterstain in blue. Scale bars, 100 μm and 50 μm (high magnification).

Figure 1.

Figure 1—figure supplement 1. Schematic representation of olfactory bulb cell types and their cortical projection targets.

Figure 1—figure supplement 1.

(A) Schematic representation of cell types and their distribution within the olfactory bulb (IN: interneuron, TC: tufted cell, MC: mitral cell, GL: glomerular layer, EPL: external plexiform layer, MCL: mitral cell layer, GCL: granule cell layer). Tufted and mitral cells project their axons to downstream cortical regions. (B) Schematic representation of the distinct axonal projection targets for mitral and tufted cells in the olfactory cortex (AON: anterior olfactory nucleus, PCx: piriform cortex). Tufted cells project primarily to AON and to the anterior part of PCx, whereas mitral cell axons target AON, the anterior and posterior portions of PCx.
Figure 1—figure supplement 2. Enrichment of GFP-expressing nuclei using fluorescence-activated nuclei sorting (FANS).

Figure 1—figure supplement 2.

(A) Representative FANS data of GFP-expressing nuclei after injection of rAAVretro-CAG-H2B-GFP into AON and PCx to label OB projection neurons. From left to right: gating strategy for enrichment of GFP-expressing nuclei. Nuclei are identified based on size and granularity in the FSC (forward scatter) versus SSC (side scatter) plot. Next, the population of nuclei restricted to Quadrant two is selected that is both GFP-expressing and DRAQ5-positive (far-red fluorescent DNA dye). To exclude doublets, the population of nuclei around the diagonal in the FSC-A (forward scatter area) versus FSC-H (forward scatter height) plot is selected. Lastly, GFP-expressing nuclei filtered through the preceding steps are plotted against the SSC-A (side scatter area). (B) FANS results for control specimen for AON and PCx (olfactory bulb contralateral to the injection hemisphere from the same animal) showing the absence of GFP-expressing nuclei (same gating strategy as shown in (A)). (C) Representative FANS data after injection of rAAVretro-CAG-H2B-GFP into PCx to enrich for mitral cell nuclei. (D) FANS results for control specimen for PCx injections showing the absence of GFP-expressing nuclei (same gating strategy as shown in (C)). (E) Same as in (C) but after injection of rAAVretro-CAG-H2B-GFP into AON to enrich for tufted cell nuclei. (F) FANS results for control specimen for AON (olfactory bulb contralateral to the injection hemisphere from the same animal) showing the absence of GFP-expressing nuclei (same gating strategy as shown in (E)).

First, we injected a retrogradely transported Adeno-Associated Virus expressing nuclear GFP (rAAV-retro-CAG-H2B-GFP; Tervo et al., 2016) into multiple sites along the antero-posterior axis of the olfactory cortex, specifically into the anterior olfactory nucleus (AON) and piriform cortex (PCx) (Figure 1A). Histological analysis revealed that virus injections resulted in GFP expression in a heterogeneous population of OB cells labelling cells in the mitral cell, external plexiform, glomerular, and granule layers (Figure 1B). Sparse GFP expression in putative periglomerular and granule cells may have resulted from viral infection of migrating immature neurons from the rostral migratory stream or from diffusion of the virus from the injection site.

We dissected the olfactory bulbs of three injected mice, generated three independent replicates of single-nucleus suspensions, enriched for GFP expression using FANS (Figure 1—figure supplement 2), and performed snRNA-seq using 10× Genomics technology (Figure 1A). We performed a detailed quality check of the individual replicates, then combined nuclei for downstream analyses (Figure 2—figure supplement 1). We analyzed a total of 31,703 nuclei, grouped in 22 clusters that were annotated post hoc based on the expression of established marker genes for excitatory and inhibitory neurons and glial cell populations (Figure 2A–C). We initially used the combinatorial expression patterns of glutamatergic and previously characterized M/T cell markers (Vglut1, Vglut2, Vglut3, Tbx21, Cdhr1, Thy1) to identify putative OB projection neurons, comprising 23.66% (n = 7504) of all nuclei.

Figure 2. Single-nucleus RNA sequencing distinguishes distinct cell types and molecular signatures of OB projection neurons.

(A) UMAP representation of gene expression profiles of 31,703 single nuclei combined from all replicates (R1, R2, R3) of mice injected into both AON and PCx, grouped into 22 clusters color-coded by cell type membership (GC: granule cell, PG: periglomerular cell, OPC: oligodendrocyte precursor cell, Micro: microglia, Astro: astrocyte, Oligo: oligodendrocyte, ET: external tufted cell, M: mitral cell, T: tufted cell, Imm-like: Immature-like cell). See Figure 2—figure supplement 1 for detailed quality check of each replicate. (B) Matrix plot showing the z-scored expression levels of the top 50 differentially expressed (DE) genes for each cell population ordered by hierarchical relationships between distinct clusters. Each column represents the average expression level of a gene in a given cluster, color-coded by the UMAP cluster membership (from A). The dendrogram depicts the hierarchical relationships and is computed from the PCA representation of the data using Pearson correlation as distance measure and link by complete linkage. (C) UMAP representations of known marker genes for main cell populations (Syt1: neurons; Gad2: GABAergic neurons; Slc17a7: glutamatergic neurons; Gfap, Tmem119, Pdgfra, Mobp: glial cells; Th, Calb1: periglomerular neurons; Cdhr1, Tbx21, Slc17a6: mitral/tufted cells). Nuclei are color-coded by the raw expression level of each transcript. (D) UMAP representation of subclustering from the initial clusters M1, M2/M3, T1, ET1, and ET2 (cluster names from A), selected for the expression of known excitatory and mitral/tufted cell markers (shown in C), resulting in 7504 putative projection neurons grouped into nine distinct types. (E) Same matrix plot as described in (B) showing the z-scored expression levels of the top 50 DE genes for each projection neuron type ordered by hierarchical relationships and color-coded by the UMAP subcluster membership (from D).

Figure 2.

Figure 2—figure supplement 1. Quality check of individual replicates of sn-RNA seq (sn-R1/R2/R3 dataset) shows the reliability of the data and the replicability of each cell type.

Figure 2—figure supplement 1.

(A) Depiction of nuclei for each replicate (in red R1, in green R2, in blue R3) embedded in the UMAP space showing that replicates are very similar to each other and can be combined for downstream analyses. Replicates were merged using Canonical Correlation Analysis (CCA). (B) Bar chart showing the fraction of each cluster when R1, 2 and 3 are combined, color-coded by the replicate membership, indicating that each cell type is consistently represented in each replicate. (C) Scatter plots of pairwise comparisons of log10 gene expression levels averaged over nuclei. In the three panels, replicates R1, R2, and R3 are compared. Pearson correlations are shown in the top left corner of each plot.

Next, we further subclustered the selected profiles, resulting in a total of nine molecularly distinct subpopulations. To assign preliminary labels to each of these cell types, we used marker genes previously employed in functional or single cell RNA-seq studies (Nagayama et al., 2014; Tepe et al., 2018). We also used available RNA in situ hybridization data from the Allen Institute for Brain Science to corroborate our preliminary assignments (Figure 3—figure supplements 1 and 2). Our analysis revealed eight molecularly distinct clusters of putative projection neurons and one cluster of putative periglomerular cells (Figure 2D,E). Among projection neurons, we identified three clusters of MCs (M1, M2, and M3) and five clusters of middle and external TCs (T1, ET1, ET2, ET3, and ET4).

smFISH validates mitral and tufted cell types

To identify genes selectively expressed in OB projection neurons, we used the R-package glmGamPoi (Ahlmann-Eltze and Huber, 2021). We combined the average expression levels of the top differentially expressed genes for each cell type and found it to be highly specific for each cluster (Figure 3A,C). We then selected a few specific marker genes (Figure 3B,D) to validate projection neuron type identity by combining single-molecule fluorescent in situ hybridization (smFISH) with GFP staining upon rAAV-retro-CAG-H2B-GFP injection into the olfactory cortex.

Figure 3. Histological validation of molecularly distinct mitral and tufted cell types.

(A) Combined average (avg) raw expression level of the top n differentially expressed (DE) genes for each mitral cell type (M1 n=14, M2 n=11, M3 n=10), overlaid onto the subclustering UMAP space (shown in Figure 2D). DE genes were selected if their log fold change was greater than 4 (see Materials and methods for details). M1-specific genes: Kcng1, Lhx5, Sertm1, Gabra2, Doc2b, Cntn6, Olfr1259, Nrp2, C1ql1, Ebf1, Baiap3, Adgrl2, Dsc2, Chrna5; M2-specific genes: Piezo2, Vgll2, Zfp114, Nts, Ros1, Samsn1, Grid2, Smpx, Itga4, Itga9, Sema6d; M3-specific genes: Cadps2, Calca, Fst, Ets1, Ednra, Cdkn1c, Mustn1, Smoc2, Cnr1, Ccno. (B) Violin plots showing maximum raw expression value of selected mitral cell type-specific DE genes across mitral and tufted cell clusters for further validation with smFISH. (C) Combined average (avg) raw expression level of the top n DE genes for each tufted cell type (T1 n=9, ET1 n=7, ET2 n=9, ET4 n=6), overlaid onto the subclustering UMAP space (shown in Figure 2D). T1-specific genes: Barhl2, Sgcg, Vdr, Olfr111, Olfr110, Cacna1g, Fam84b, Kcna10, Tspan10; ET1-specific genes: Coch, Wnt5b, Rorb, Chst9, Tpbgl, Clcf1, Rxfp1; ET2-specific genes: Lhx1, Ebf3, Trp73, Edn1, Ebf2, Nr2f2, Uncx, Psrc1, Dsp; ET4-specific genes: Ly6g6e, Foxo1, Siah3, Galnt12, Itga8, Ets2, Grik4. (D) Violin plots showing maximum raw expression value of selected tufted cell type-specific DE genes across mitral and tufted cell clusters for further validation with smFISH. (E, F) Schematic representations of the smFISH images for validating projection neuron type-specific marker genes upon rAAVretro-CAG-H1B-GFP injection into PCx and AON. The schemes depict the laminar location visualized in the histological images from a coronal section of the ipsilateral hemisphere to the injection site. EPL: external plexiform layer; MCL: mitral cell layer; GL: glomerular layer. (G–I) smFISH showing combinatorial expression of mitral cell type-specific marker genes for M1, M2 and M3 cells in the mitral cell layer. High magnifications (top right) show co-labeling of viral GFP with the in situ mRNA probe. (G and G’). The M3 markers Cadps2 and Calca are co-expressed in subpopulations of cells in the mitral cell layer, indicated by the yellow/magenta arrowheads. (H and H’) The M3 marker Cadps2 and M1 marker Kcng1 are expressed in distinct subpopulations of cells in the mitral cell layer, indicated by the magenta and yellow arrowheads respectively. (I and I’) The M3 marker Cadps2 and M2 marker Vgll2 are mutually exclusive in subpopulations of cells in the mitral cell layer, indicated by the yellow and magenta arrowheads respectively. For additional histological analysis and quantification of co-expression see Figure 3—figure supplement 1. (J–L) smFISH images showing combinatorial expression patterns of tufted cell type-specific marker genes for validating T1, ET1, ET2 and ET4 clusters as distinct projection neuron types in the external plexiform and glomerular layers. High magnifications (top right) show co-labeling of viral GFP with the in situ mRNA probe. As described for the mitral cell types, yellow or magenta arrowheads show non-overlapping patterns (K, K’: T1–ET1 and L, L’: ET1–ET4), and yellow/magenta arrowheads show co-expression patterns (J, J’: T1–T1). For additional histological analysis and quantification of co-expression see Figure 3—figure supplement 1. DAPI counterstain in blue. Scale bars, 50 μm and 10 μm (high magnifications).

Figure 3.

Figure 3—figure supplement 1. Histological analysis of DE genes for distinct mitral cell types.

Figure 3—figure supplement 1.

(A) Schematic representation of the smFISH images for validating selected mitral cell type-specific marker genes upon rAAVretro-CAG-H2B-GFP injection into PCx and AON. The scheme depicts the laminar location visualized in the histological images from a coronal section of the ipsilateral hemisphere to the injection site. MCL: mitral cell layer. (B–D) smFISH showing co-expression of M1-specific marker genes with viral GFP. Labeled nuclei are indicated by the magenta/green arrowheads. (E) smFISH showing co-expression of two M1-specific marker genes. Labeled nuclei are indicated by the yellow/magenta arrowheads. (J) Same as (E) but for M2-specific marker gene. (N) Same as (E) but for M3-specific marker gene. (F, G) Same as (B–D) but for M2-specific marker gene. (H, I) In situ hybridization images from the Allen Brain Atlas showing additional M2-specific DE genes. (K - M) Same as (B–D) but for M3-specific marker gene. (O–Q) smFISH images showing mutually exclusive expression of mitral cell type-specific marker genes for M1, M2, and M3. Yellow and magenta arrowheads show mutually exclusive expression of M1 versus M2 (O–P), and M1 versus M3 (Q) marker genes. DAPI counterstain in blue. Scale bars, 50 μm. (R) Combinatorial quantification of mitral cell type-specific smFISH probes. Each dot is color-coded by the mitral cell type membership between cells of the same clusters or represented in grey for cells across different clusters. Each dot represents the fraction of co-expression between two cluster-specific probes.
Figure 3—figure supplement 2. Histological analysis of DE genes for distinct tufted cell types.

Figure 3—figure supplement 2.

(A) Schematic of the smFISH images for validating selected tufted cell type-specific marker genes upon rAAVretro-CAG-H2B-GFP injection into PCx and AON. The scheme depicts the laminar location visualized in the histological images from a coronal section of the ipsilateral hemisphere to the injection site. GL=glomerular layer; EPL=external plexiform layer. (B) smFISH showing co-expression of two T1-specific marker genes. Labeled nuclei are indicated by the yellow/magenta arrowheads. High magnification (left) shows clear co-labeling of the two mRNA probes Barhl2 and Olfr110. (C) In situ hybridization images from the Allen Brain Atlas showing one additional T1-specific DE gene. (D) Same as (B) but for ET1-specific marker gene. (E) In situ hybridization images from the Allen Brain Atlas showing additional ET1-specific DE genes. (F) Same as (B) but for ET2-specific marker gene. (G) In situ hybridization images from the Allen Brain Atlas showing additional ET2-specific DE genes. (H) Same as (B) but for ET4-specific marker gene. (I–J–K) smFISH images showing mutually exclusive expression of tufted cell type-specific marker genes for T1, ET1, ET2, and ET4. Yellow and magenta arrowheads show mutually exclusive expression of T1 versus ET4 (I), ET2 versusET4 (J), and ET1 versus ET4 (K) marker genes. DAPI counterstain in blue. Scale bars, 50 μm. (L) Combinatorial quantification of tufted cell type-specific smFISH probes. Each dot is color-coded by tufted cell type membership between cells of the same clusters or represented in grey for cells across different clusters. Each dot represents the fraction of the co-expression between the two cluster-specific probes visualized next to the dot.
Figure 3—figure supplement 3. Excitability-related, cell adhesion-related, and pan-OB projection neuron DE genes.

Figure 3—figure supplement 3.

(A) Violin plots showing maximum raw expression values of genes encoding for channels as key candidates for controlling the differential excitability of different MC and TC types. Kcng1 gene is known to encode for a voltage-gated potassium channel that forms heterotetrameric channels with the ubiquitously expressed delayed rectifying Kv2.1 potassium channel (Kcnb1). (B) Violin plots showing maximum raw expression values of genes encoding for cell adhesion and axon guidance molecules known to control the formation and maintenance of neuronal connectivity that were differentially expressed in MC and TC types. (C) Violin plots showing maximum raw expression values of pan-external tufted cell marker genes (Gabbr2, Shisa6, Pip5k1b), pan-tufted cell marker genes (Zbtb20, Astn2), and pan-mitral cell marker genes (Fam19a2, Tnfaip8).

We first investigated MC type-specific gene expression. Differential expression (DE) analysis identified the voltage-gated potassium channel Kcng1, the transcriptional regulator LIM homeobox 5 (Lhx5) and the serine-rich transmembrane domain 1 (Sertm1) as putative M1-specific marker genes. Two-color smFISH revealed extensive co-localization of Kcng1 and Lhx5 transcripts within the same subpopulation of cells in the MC layer (Figure 3—figure supplement 1E,R). Furthermore, Kcng1, Lhx5, and Sertm1 expression was consistently observed in neurons expressing GFP (Figure 3H, Figure 3—figure supplement 1B–D). Next, DE analysis identified the mechanosensory ion channel Piezo2, the transcription cofactor vestigial like family member 2 (Vgll2) and the zinc finger protein 114 (Zfp114) as putative M2-specific markers. Two-color smFISH revealed extensive co-localization of Piezo2 and Vgll2 transcripts within the same subpopulation of cells in the mitral cell layer (Figure 3—figure supplement 1J,R), and co-localization of M2-specific marker genes with GFP (Figure 3I, Figure 3—figure supplement 1F,G). Finally, we identified the calcium-dependent secretion activator 2 (Cadps2), calcitonin (Calca), and follistatin (Fst) as putative M3-specific markers. smFISH revealed selective and extensive co-localization of Cadps2 with Calca or with Fst transcripts within the same subpopulation of cells in the MC layer (Figure 3G,G’, Figure 3—figure supplement 1N,R). Furthermore, Cadps2, Calca and Fst expression was consistently observed in neurons expressing GFP (Figure 3G,H, Figure 3—figure supplement 1K–M). Importantly, two-color smFISH revealed that type-specific M1, M2, and M3 markers were expressed in largely non-overlapping populations of MCs: M1-specific Kcng1 and Lhx5 transcripts rarely co-localized with M2-specific Vgll2 and Piezo2 transcripts (Figure 3—figure supplement 1O,P,R); M1-specific Kcng1 transcripts rarely co-localized with M3-specific Cadps2 and Calca transcripts (Figure 3H,H’, Figure 3—figure supplement 1Q,R); M2-specific Vgll2 transcripts rarely co-localized with M3-specific Cadps2 transcripts (Figure 3I,I’, Figure 3—figure supplement 1R). Together, the selective co-localization of type-specific genes in non-overlapping populations in the mitral cell layer validates these three types as accurate and meaningful groupings of MCs, and their co-localization with GFP validates their identity as projection neurons.

DE analysis for TC type-specific genes identified the transcription factor BarH-like homeobox 2 (Barhl2), the gamma-sarcoglycan Sgcg, the vitamin D receptor (Vdr), and the olfactory receptors Olfr110 and Olfr111 as putative T1 markers. Two-color smFISH revealed extensive co-localization of Barhl2 and Sgcg or Olfr110/Olfr111 transcripts within the same subpopulation of cells in the external plexiform layer, indicative of middle tufted cells (Figure 3J,J’, Figure 3—figure supplement 2B,L). Furthermore, Barhl2 and Sgcg expression was observed in neurons expressing GFP (Figure 3J). The coagulation factor C homolog (Coch) and the Wnt family member 5b (Wnt5b) were identified as putative ET1 markers. smFISH confirmed the expression of Coch and Wnt5b in a subpopulation of cells in the external plexiform and glomerular layers, although co-expression could not be recovered as consistently as for other markers (Figure 3K, Figure 3—figure supplement 2D,L). Moreover, Coch expression was observed in neurons expressing GFP (Figure 3K). The LIM homeobox 1 (Lhx1) and the early B-cell factor 3 (Ebf3) were identified as putative ET2 markers. Two-color smFISH revealed consistent Lhx1 and Ebf3 co-expression in a sparse subpopulation of cells at the boundary between the external plexiform and glomerular layers (Figure 3—figure supplement 2F,L), indicative of external TCs. Finally, we identified the lymphocyte antigen 6 family member 6GE (Ly6g6e) and the transcription factor Forkhead box O1 (Foxo1) as putative ET4 markers. smFISH revealed selective co-expression of Ly6g6e and Foxo1 in a subpopulation of cells in the glomerular layer (Figure 3—figure supplement 2H,L), and co-localization of Ly6g6e with GFP validates their identity as external TCs (Figure 3L). Importantly, two-color smFISH revealed that type-specific tufted cell markers were expressed in largely non-overlapping populations of cells: ET1-specific Coch transcripts did not co-localize with T1-specific Barhl2 transcript or with ET4-specific Ly6g6e transcripts (Figure 3K,K’ ,L ,L’, Figure 3—figure supplement 2L). Furthermore, ET4-specific Foxo1 and Ly6g6e transcripts did not co-localize with T1-specific Barhl2 transcript (Figure 3—figure supplement 2I,L), with ET2-specific Ebf3 transcript (Figure 3—figure supplement 2J,L) or with ET1-specific Wnt5b transcript (Figure 3—figure supplement 2K,L). Overall, the selective co-localization of type-specific genes, their location within the olfactory bulb, their non-overlapping nature, and their co-localization with GFP validate these five types of middle and external TCs as accurate and meaningful classifications.

Inferring gene regulatory networks for projection neurons

The differential gene expression patterns revealed by transcriptome analysis are determined by the concerted action of transcription factors (TFs). We therefore set out to characterize cell types by their TF activity. To do so, we predicted regulatory relationships between TFs and target genes (that can be also TFs), taking into account independent genomic information about TF binding sites around the target gene-specific promoter. These gene regulatory networks are more robust against technical artifacts than the expression of individual genes, providing a complementary set of axes along which to cluster MCs and TCs. Ultimately, gene regulatory network analysis can yield more detail for classifying cell types and for understanding the molecular mechanisms that underlie their transcriptional differences.

To infer the regulatory networks of each type of projection neuron, we used the Single-Cell Regulatory Network Inference and Clustering pipeline (SCENIC, Aibar et al., 2017, Van de Sande et al., 2020). In brief, SCENIC consists of co-expression analysis, followed by TF binding motif enrichment analysis, and finally evaluation of the activity of regulons, that is a TF and its predicted target genes (Figure 4A). The result is a list of regulons and a matrix of all the single cells with their regulon activity scores (RAS, essentially an area-under-the-curve metric, see Materials and methods for details).

Figure 4. Mitral and tufted cell-specific regulons combine into modules.

(A) Schematic representation of the network analysis pipeline, including the required input, the SCENIC protocol, and the output in the form of regulon modules and a regulatory network (in Figure 5). TSS: transcription start site; TF: transcription factor. (B) Hierarchical clustering of mitral and tufted cell-specific regulons using the Connection Specificity Index (CSI) as a distance measure results in seven modules (Ward linkage). The connection specificity index compares Pearson correlation coefficients (PCC) between regulons. If we take regulons A and B, their CSI is the number of PCC of A with other regulons and of B with other regulons that are lower than the PCC between A and B. This means that the CSI contextualizes a correlation between two regulons given how these regulons correlate to the rest. Prominent cross-module interactions are observed for mod1–2, mod3–4, and mod6–7. Module four showed interactions with all other modules. (C) Table listing the mitral and tufted cell-specific modules, their regulons, the number of unique target genes in each module, and cluster-specific marker genes found also as target genes in a given module. Marker genes were established by transcriptome analysis and shown in Figure 3B and D. (D) Regulon activity (columns) in mitral and tufted cell types (rows) defines subtypes within each cluster. Within each cell type, rows were ordered by hierarchical clustering (correlation distance, complete linkage). Columns clustered as in (B). (E) Module activity per cell subtype. Module activity is calculated as the average activity of its regulons for a given cell subtype. Rows were ordered by hierarchical clustering (correlation distance, complete linkage). Each cell subtype may be defined by a combination of active and inactive modules. For example, M2b and M2c are defined by relatively high activity in modules 1 and 2. (F) Quantification of changes in module activity through trajectory analysis. A single trajectory that traverses the cell types in the order ET4, ET1, T1, M2, M3, M1, ET2, is computed using PAGA (Wolf et al., 2019). Module activity along the trajectory is the average over a sliding window of 100 nuclei. Trajectory progression is depicted as a greyscale gradient from white to black. Along the same trajectory is also computed the regulon activity and expression levels of the corresponding TFs (Figure 4—figure supplement 3). (G) Module activity mapped on the projection neuron UMAP space (Figure 2D, rightmost UMAP for convenience). Color range as in (E).

Figure 4.

Figure 4—figure supplement 1. Comparing transcriptomic and gene regulatory network-defined mitral and tufted cell types.

Figure 4—figure supplement 1.

(A) Transcriptome-defined mitral and tufted cell types in transcriptome-based UMAP space (Figure 2D) shown here for comparison with (B). PG cluster was not used in the pySCENIC gene regulatory network (GRN) analysis and therefore was removed from the plot. (B) GRN-based mitral and tufted cell types mapped onto the transcriptome-defined UMAP space. The numbers shown on the right (1–6) are the pySCENIC clusters. (C) Quantification of the comparison between transcriptomic and GRN-based mitral and tufted cell types. The Adjusted Rand Index (ARI) = 0.326 indicates differences between the two types of clustering analysis. (D) Transcriptome-defined mitral and tufted cell types in GRN-based UMAP space. Clusters are restricted to specific, yet overlapping, regions of the GRN-based UMAP. (E) Same as (D), but only tufted cell types are visualized. (F) Same as (D), but only mitral cell types are visualized.
Figure 4—figure supplement 2. Mitral and tufted cell type-specific marker genes found in regulons.

Figure 4—figure supplement 2.

(A) Mitral and tufted cell type-specific marker genes identified as regulons in pySCENIC analysis. Columns from left to right: cluster membership; marker gene; marker gene as regulon with its max confidence cut-off. Regulons with a confidence cut-off ≥0.3 are shown. (B) Presence of mitral and tufted cell type-specific marker genes as target genes in pySCENIC analysis. Columns from left to right: cluster membership; marker gene; presence in regulon(s) at the corresponding highest confidence cut-offs. Regulons with a confidence cut-off ≥ 0.5 are shown.
Figure 4—figure supplement 3. PAGA-based trajectory analysis of mitral and tufted cell types.

Figure 4—figure supplement 3.

(A) Recomputed transcriptome-based UMAP with the PG cluster removed from the data. (B) The PAGA graph derived from the clustering in (A). (C) Average regulon activity along the trajectory ET4, ET1, T1, M2, M3, M1, ET2. Regulons are grouped by the module membership as indicated by the vertical color-coded bars. The trajectory is averaged with a sliding window of size 100 nuclei. (D) Average transcription factor expression levels along the trajectory ET4, ET1, T1, M2, M3, M1, ET2 as in (C). Each row corresponds to the TF of the regulon as shown in (C). Log expression levels are normalized to the interval [0, 1]. The trajectory is averaged with a sliding window of size 100 nuclei.

Clustering on regulon activity corroborates molecular groupings of mitral and tufted cell types and allows further subdivision

We applied SCENIC to MCs and TCs (6484 cells), computing 86 regulons with a range of 8–551 target genes (median = 23.5). This greatly reduces the dimensionality of the data from >30,000 genes to 86 regulons, defining a new low-dimensional space in which to analyze relationships between nuclei. Using the regulon activity matrix, we computed a UMAP space and performed a Louvain clustering on the putative projection neurons (Traag et al., 2019; Wolf et al., 2018). We compared this gene regulatory network (GRN)-based clustering to the transcriptome-based clustering and observed that neighbor relationships are conserved between neurons (Figure 4—figure supplement 1). Indeed, several clusters in transcriptome space are clearly identifiable as clusters in GRN-space, and vice versa (e.g. ET2, ET4, M1+M3). However, we also noted significant differences between other clusters (ET1, T1, and M2). This disagreement suggests that network analysis provides orthogonal information relative to transcriptome analysis.

Combinations of regulon modules characterize mitral and tufted cell subtypes

TFs activity is thought to be organized into coordinated network modules that determine cellular phenotypes (Alexander et al., 2009; Irons and Monk, 2007; Suo et al., 2018). To characterize how TFs are organized into such modules in OB projection neurons, we searched for combinatorial patterns of regulon activity. We used the Connection Specificity Index (CSI) to this end, which is an association index known to be suited for the detection of modules (Fuxman Bass et al., 2013; Suo et al., 2018). By computing the CSI on the basis of pairwise comparisons of regulon activity patterns across cells, we found that the 86 regulons grouped into seven modules (mod1–7) (Figure 4B,C).

Interestingly, regulon and module activity were not uniform within cell types. Rather, regulon activity suggested the existence of distinct subtypes within mitral and tufted cell types (Figure 4D). We used hierarchical clustering to further subdivide cell types, finding four subtypes of M1, three subtypes each of M2, T1, and ET2, and two subtypes of ET1 and ET4. To further investigate these subtypes, we used the modules to describe how the combined regulatory logic of distinct TFs contributes to the diversity of MC and TC subtypes. We asked if combinations of modules could uniquely describe the subtypes. To do so, we calculated the average module activity score per cell subtype. Next, we performed a hierarchical clustering on the subtypes (Figure 4E). We found that, when grouped by module activity, subtypes do not strictly group by type; rather, subtypes of different cell types tend to share similar module activity (Figure 4E). This is a consequence of the differences between transcriptome- and GRN-based analysis: transcriptome clusters (cell types) are subdivided along the boundaries of GRN-based clusters. As we observed the grouping of subtypes along three neighboring clusters – ET1, T1, and M2 – we set out to better understand how module activity changes along these cell types. To this end, we used PAGA (PArtition-based Graph Abstraction) for trajectory analysis along the sequence of ET4, ET1, T1, M2, M3, M1, ET2 (Figure 4F, and for the corresponding UMAPs, see Figure 4G). We observed clear module activity gradients between ET4-ET1-T1, where modules 1, 3, 4, and 7 are high at the ET4 end of the cluster and low at T1. Module two forms an antagonistic gradient across ET1, high at T1 and low at ET4. Similarly, activity of module five changes in a gradient fashion from T1 to M2. Importantly, trajectory analysis not only detects gradients, but also discrete changes between cell types. For instance, the activity of module six shows a step-wise dynamic: high activity at ET2, medium at M1, and low for the other cell types. And module two has no activity in M3, while it is clearly present in its neighboring cell types M1 and M2.

Regulon-based transcription factor networks reveal overlapping features of cell type identity

While TFs regulate a large number of target genes, central to cell identity are the interactions between them: TFs can regulate their own expression as well as the expression of other TFs, generating a TF network thought to be a core determinant of cell type identity (Arendt et al., 2016; Becskei et al., 2001; Thieffry, 2007). We thus looked at TF–TF interactions to visualize the TF network topology that defines MCs and TCs. We specifically asked whether MC and TC classes share common TF network features, or whether, as suggested by the analysis of genome-wide transcriptome and regulon analysis (Figures 24), MC and TC subtypes are defined by specific yet overlapping TF network features.

As a regulon is defined by a TF and a set of target genes, we constructed a (directed) network of TFs by taking from each regulon’s target genes only the TFs that have regulons themselves (Figure 5A). Overall, we found one large set of interconnected TFs, one small component of two TFs, and 18 isolated TFs. Fifty-five of the 86 TFs (64%) show possible self-activating regulation, and several others form mutually activating pairs (e.g. Lhx1 and Uncx). The network is dominated by three hub genes, two of which may self-activate: Pbx3 (activates 17 TFs), Bclaf1 (activates 11 TFs), and Taf1 (activates 9 TFs). Their central position in this network suggests important roles in regulating transcription broadly across different cell types (e.g. Taf1, Bclaf1) or selectively in cell type-specific networks (e.g. Pbx3, Figure 5—figure supplement 1).

Figure 5. Mitral and tufted cell-specific transcription factor network derived from regulons.

(A) Overview of mitral and tufted cell-specific transcription factor (TF) network, with node size scaled by the number of target genes and nodes colored with different shades of gray based on the outdegree (number of outgoing edges). Thick borders and edges denote cycles of 2 or three regulons. The three main hubs are: Pbx3 (outdegree 17, target genes 545), Bmyc (outdegree 11, target genes 523) and Taf1 (outdegree 9, target genes 551). (B) Same network as shown in A specific for mitral cell types (M1, M2, M3) with standardized regulon activity for the top 10 most specific regulons mapped onto the corresponding TF nodes (compare Figure 4D,E). Regulon specificity scores are shown in Figure 5—figure supplement 1A, with expression of the corresponding TFs visualized on UMAPs in Figure 5—figure supplement 1B. Mitral and tufted cell type-specific marker genes are visualized in the network in Figure 5—figure supplement 2. (C) Same network as shown in (A) specific for tufted cell types (T1, ET1, ET2, ET4) with standardized regulon activity for the top 10 most specific regulons. We omitted ET3 as it only has a few nuclei. As for mitral cell types, see also Figure 5—figure supplements 1 and 2.

Figure 5.

Figure 5—figure supplement 1. Top five mitral and tufted cell type-specific regulons.

Figure 5—figure supplement 1.

(A) Regulon specificity score for each cell type, with the top five highest scoring regulons shown in red. Specificity scores are based on Jensen-Shannon Divergence, a metric for comparing probability distributions (Suo et al., 2018). pySCENIC provides standard functionality to compute the scores. (B) UMAP representations of the top five mitral and tufted cell type-specific regulons (transcription factors) organized in rows by the cluster membership. Z-scored expression levels are used to highlight relative differences between cell types. Four TFs, namely Prrx1, Snai2, Bhlhe22, and Tcf7l2, are only very sparsely expressed (confirmed by the raw expression levels, data not shown). Their cell-type specificity has to be interpreted with caution. (C) UMAP representations of the top three network hub genes (Pbx3, Bclaf1, and Taf1). Pbx3 is predominantly expressed in mitral cell type M1 and tufted cell type ET2. Bclaf1 and Taf1 are more broadly expressed across all M/T cell types. Nuclei are color-coded by the raw expression level of transcripts.
Figure 5—figure supplement 2. Mitral and tufted cell type-specific marker genes visualized as target genes in the gene regulatory network.

Figure 5—figure supplement 2.

For each mitral and tufted cell type, marker genes are integrated in the gene regulatory network (network shown in Figure 5), with nodes and interactions colored by the highest confidence cut-off that would keep the marker gene in the regulon dataset. Results based on a confidence cut-off ≥0.3 are included, consistent with Figure 4—figure supplement 2.

As anticipated from our module analysis, we find features that are shared across certain types of MCs and TCs rather than MC- or TC-specific network features (Figure 5B,C). For instance, we find that M1 and ET2 are characterized by the hub Pbx3. Yet both cell types are distinguishable by other TFs: M1 uses Egr1 and Dlx5 with Pbx3, whilst ET2 has a 2-cycle involving Lhx1 and Uncx, which were both identified as marker genes specific for this cell type (Figure 4—figure supplement 2, Figure 5—figure supplement 2). Between T1, ET1, and ET4, we observe the gradients discussed above recapitulated in a transition between three groups of closely linked TFs. T1 is characterized by the close-knit group of Bcl11a, Cux1, Cux2, Hlf, and Rcor1, which are all members of module 2. Next, ET1 also employs these TFs, with the exception of Hlf and with the addition of Hmgb3 and Ppargc1a (modules 3 and 4, respectively). Interestingly, the estrogen receptors Esrra and Esrrg are in the top 15 of specific regulons for ET1, creating a target-gene-rich three-cycle with Ppargc1a: the second group of TFs. For ET4, Hmgb3 and Ppargc1a remain characteristic (but not Esrra and Esrrg), with Hmgb3 closely linked to the (third) group of Junb, Fos, and Atf4 (also members of modules 3 and 4).

Taken together, the analysis of TF regulatory networks suggests that individual MC and TC types share key TF network features, which might point towards common physiology or connectivity features. The differentially active TF network hubs and loops provide starting points for future investigation of the functional differences between the MC and TC types described here (Figure 5—figure supplement 2). Thus, the modules and the network serve as complementary approaches for studying cell type identity, with modules suited to classifying cells into types and subtypes and network analysis suited to investigating their functional differences.

Simulating single-nucleus gene expression from bulk RNA deep sequencing

TCs preferentially target anterior olfactory regions, including the Anterior Olfactory Nucleus (AON), while MCs target anterior and posterior olfactory areas (Imamura et al., 2020). Therefore, we asked whether the genetic diversity within MC types could provide information about their projection targets. To investigate this question, we again injected rAAV-retro-CAG-H2B-GFP into the olfactory cortex, albeit now either into the AON or the posterior piriform cortex (pPCx) (Figure 1A). For each injection site and in three biological replicates, we then enriched for GFP expression using FANS (Figure 1—figure supplement 2C–F) and prepared RNA for bulk RNA deep sequencing (Figure 1A).

Bulk RNA sequencing represents molecular information from a variety of different cell types. Given that a substantial fraction of isolated nuclei in our experiments was comprised of granule and periglomerular cells, in addition to projection neurons, we devised a novel computational approach to recapitulate the constituent cell types from bulk RNA sequencing data by simulating single-nucleus expression profiles. Previous methods simulated the transcriptome of a single cell based on the overall distribution of gene expression levels in the bulk RNA sequencing data, producing many nuclei that were similar to each other and to the original bulk expression profile (Konstantinides et al., 2018; Avila Cobos et al., 2018; Zhu et al., 2016). This worked well for clean bulk RNA-seq datasets with only one cell type, but for our mixed datasets, the simulated nuclei resembled an unrealistic average of the constituent cell types. Therefore, to capture the diversity contained within our bulk RNA-seq datasets, we used regulons as the unit of analysis to create simulated nuclei with more biologically realistic transcriptomes (Figure 6A,B).

Figure 6. Regulon-based simulations from bulk RNA deep sequencing data suggest that mitral cell types have distinct projection targets.

Figure 6.

(A) Schematic representation of strategy to integrate bulk RNA-seq and single nucleus RNA-seq data. (A1) Simulations use regulons inferred from sn-R1/R2/R3 data. A regulon consists of a transcription factor (TF) and all target genes (A, B, etc) that are activated by that transcription factor. (A2) When simulating nuclei from bulk RNA-seq data, the probability of selecting a given regulon is determined by the abundance of its transcription factor in the bulk RNA-seq dataset. (A3) Nuclei are simulated from each bulk dataset (AON-sim and PCx-sim) through random sampling of regulons with replacement. This method maintains broad differences between datasets while accounting for heterogeneity within each dataset. (B) AON-sim, PCx-sim and sn-R1/R2/R3 nuclei projected into a shared UMAP representation. Step 1: Blue indicates AON-sim nuclei and purple indicates PCx-sim nuclei. Step 2: Darker color indicates sn-R1/R2/R3 projection neurons. Step 3: Blue indicates AON-projecting simulated projection neurons (AON-simPN) and purple indicates pPCx-projecting simulated projection neurons (PCx-simPN). (C) Linear Discriminant Analysis (LDA) classifiers were trained on AON-simPN and PCx-simPN, then used to predict the projection target of sn-R1/R2/R3 mitral cells to investigate projection targets of sn-R1/R2/R3 derived types. Each row represents one mitral cell. Each column represents one of 1000 LDA classifiers. Blue indicates that the mitral cell was classified as AON-simPN, and purple indicates that the mitral cell was classified as PCx-simPN. Within each mitral cell type, cells are sorted vertically by their predicted projection target. (D) UMAP representation color-coded by predicted projection target. Cells in blue were predicted to be AON-simPN by all 1000 classifiers. Cells in purple were predicted to be PCx-simPN by all 1000 classifiers.

We first compared simulated and sn-R1/R2/R3 nuclei by using principal component analysis to project both sets of nuclei into a shared low-dimensional space. We used these principal components as input to a UMAP projection to visually inspect relationships between simulated and sn-R1/R2/R3 nuclei (Figure 6B, step 1). Consistent with histology and sn-R1/R2/R3 analyses (Figures 1 and 2), simulations from both AON-injection and pPCx-injection bulk RNA-seq datasets (AON-sim and PCx-sim, respectively) contained cell types other than projection neurons, and this contamination was more pronounced in the AON-sim dataset. The dispersion of simulated nuclei throughout this space indicated that simulations successfully recapitulated the diversity of cell types in the bulk RNA-seq datasets, as each cell class from the combined sn-R1/R2/R3 data had some simulated nuclei in its vicinity (Figure 6B, step 1). To account for this contamination and filter for only putative simulated projection neurons, we trained linear discriminant analysis (LDA) classifiers to predict whether sn-R1/R2/R3 nuclei were projection neurons based on the 30 top principal components that defined the shared low-dimensional space. These classifiers accurately and consistently classified projection neurons, with a mean Jaccard index (a measure of similarity between predicted and true labels) of 99.0% and a standard deviation of 0.07% over 1000 classifiers. We then applied this classifier to the AON-sim and PCx-sim nuclei, designating those simulations predicted to be projection neurons by all 1000 classifications as putative simulated AON- and PCx-projecting neurons (AON-simPN and PCx-simPN, respectively) (Figure 6B, step 3).

To directly compare AON-simPN and PCx-simPN to MCs characterized through sn-R1/R2/R3, we next used principal component analysis to define a shared low-dimensional space for MCs, AON-simPN and PCx-simPN only. To investigate potential differences in projection target between MC types, we trained 1000 LDA classifiers to predict the projection target of simulated projection neurons based on the 30 top principal components that defined this shared low-dimensional space (mean Jaccard index: 94.3%; standard deviation: 6.1%). We then used these classifiers to predict the projection targets of sn-R1/R2/R3 MCs. Interestingly, we consistently found different predicted targets for the molecularly defined MC types. All M2 and M3 MCs were classified as PCx-simPN by 100% of classifiers, suggesting that these mitral types preferentially project to posterior targets (Figure 6C). In contrast, 73.5% of M1 MCs were classified as AON-simPN by at least 80% of classifiers, suggesting that M1 MCs preferentially project to anterior targets (Figure 6C). These findings suggest that the molecular subcategorization of MCs may delineate differences in connectivity. The types M1, M2, and M3 were defined by gene expression and regulon activity, but these results suggest that they also describe projection target specificity. This correspondence between molecular identity and projection target is further demonstrated by the segregation of MCs by predicted projection target in a UMAP space defined by gene expression (Figure 6D).

Targeted snRNA-seq validates predictions of preferential connectivity for molecularly defined mitral cell types

To validate our prediction that M2 MCs preferentially project to posterior targets and M1 MCs preferentially project to anterior targets, we performed a small volume rAAV-retro-CAG-H2B-GFP injection into the pPCx. While small volumes will likely result in a smaller number of labeled projection neurons overall, such focal injection will enhance specificity. We dissected the olfactory bulbs, generated single nuclei, enriched for GFP expression using FANS, and performed snRNA-seq using 10x Genomics technology (sn-PCx dataset) (Figure 7A). To assign cell type identity, we first identified projection neurons by transforming the expression matrix into the space of 30 principal components defined on the original, larger sn-R1/R2/R3 RNA-seq dataset. One thousand LDA classifiers, each trained and validated on the original sn-R1/R2/R3 dataset (mean jaccard index = 0.95), designated 57 out of 62 cells from the sn-PCx dataset as projection neurons (Figure 7B).

Figure 7. Targeted snRNA-seq experiment validates simulation-based predictions of molecularly defined-mitral cell types with distinct projection targets.

Figure 7.

(A) Schematic representation of experimental design. Top: after injection of rAAVretro-CAG-H2B-GFP into PCx, single nuclei were dissociated from one mouse and sorted using Fluorescence-activated Nuclei Sorting (FANS). The population of nuclei is selected based on GFP and DRAQ5 (far-red fluorescent DNA dye). Sorted nuclei were sequenced using 10x single-nucleus RNA-seq and integrated with the main sn-R1/R2/R3 dataset. (B) Linear discriminant analysis (LDA) classifiers were trained on the main sn-R1/R2/R3 dataset, then used to predict the projection neuron identity of sn-PCx cells. Each row represents one cell in the new, targeted sn-PCx data. Each column represents one of 1000 LDA classifiers trained to discriminate projection neurons from other cell types. The color indicates the prediction made by each classifier (dark: projection neuron, light: other). (C) LDA classifiers were trained on the main dataset sn-R1/R2/R3, then used to predict the cell type identity of sn-PCx projection neurons. Each row represents one sn-PCx projection neuron. Each column represents one of 1000 LDA classifiers trained to predict cell type identity based on regulon expression. Color indicates the predicted cell type identity (orange: M1, red: M2, yellow: M3). (D) UMAP showing sn-R1/R2/R3 mitral cells and sn-PCx projection neurons integrated in the same low-dimensional space. UMAP was computed based on regulon expression. sn-PCx projection neurons are in purple, while sn-R1/R2/R3 mitral cells are in color according to their cell type identity (orange: M1, red: M2, yellow: M3) on the top UMAP, while in grey on the bottom (same) UMAP.

Next, we transformed the expression matrix of these projection neurons into a low-dimensional space defined by the expression of the 86 regulons from our SCENIC analysis on the original projection neurons sn-R1/R2/R3 dataset. These regulons create a low-dimensional space in which to compare this new, targeted sn-PCx dataset to our original dataset. To quantify this comparison, we used LDA classifiers, trained on the original mitral cells, to predict the cell type of these new projection neurons based on regulon activity scores. These classifiers were then applied to the regulon activity scores of the new projection neurons (Figure 7C).

Consistent with our prediction based on simulated transcriptomes, the vast majority (44 of 57) PCx-projecting mitral cells were classified as M2 cells by at least 80% of classifiers. In contrast, only 6 of 57 PCx-projecting mitral cells were classified as M1 cells (analysis of M3 projections was precluded by their low prevalence). Were M2 cells present in the sn-PCx population at the same rate as in the general mitral cell population, observing this many or more M2 cells would be unlikely (binomial test, p=0.001, see Materials and methods for details). Similarly, the scarcity of M1 neurons would be unlikely if M1 cells were present in the sn-PCx mitral population at the same rate as the general mitral population (binomial test, p=5.57e-05).

Together, these findings validate the predictions made by simulating single-nucleus regulon-based transcriptomes from bulk RNA deep sequencing and suggest that the molecular subcategorization of MCs delineates differences in connectivity (Figure 7C,D).

Discussion

Morphological differences between OB mitral and tufted cells have been described since the time of Cajal (The croonian lecture, 1894). Electrophysiological and functional imaging experiments in vivo and in vitro, developmental studies, as well as anatomical reconstructions from light and electron microscopy studies have further highlighted the heterogeneity of OB projection neurons (Christie et al., 2001; Ezeh et al., 1993; Fukunaga et al., 2012; Geramita et al., 2016; Haberly and Price, 1977; Kawasawa et al., 2016; Mori et al., 1983; Phillips et al., 2012). We here provide the first detailed molecular profiling of mouse OB projection neurons and delineate types and subtypes of mitral and tufted cells together with their key gene regulatory networks and connectivity patterns.

We have performed single-nucleus and bulk RNA deep sequencing to characterize the molecular diversity of mouse OB projection neurons. We identified, based on transcriptome and RNA in situ analysis, three distinct types of MCs and five distinct types of TCs. We then used comprehensive gene regulatory network analysis to reveal potential molecular determinants of cell type-specific functional properties. Finally, we describe a novel computational approach for integrating single nucleus and bulk RNA sequencing data, and we provide evidence that different MC types preferentially project to anterior versus posterior regions of the olfactory cortex. Our analyses provide a comprehensive resource for investigating olfactory circuit function and evolution.

The molecular diversity of olfactory bulb projection neurons

Given that the vast majority of OB neurons are interneurons, notably granule cells and juxtaglomerular neurons, we devised a retrograde viral targeting strategy to substantially enrich for OB projection neurons. This allowed us to analyze the transcriptomes of over 7500 putative projection neurons that could in turn be grouped into eight molecularly distinct mitral and tufted cell types. We validated neuronal identity using smFISH with multiple type-specific marker genes, and we determined the localization of identified neuronal types within the mitral cell, external plexiform, and glomerular layers of the main olfactory bulb. Finally, we used retrograde viral tracing combined with marker genes to confirm that the molecularly distinct neuronal types we describe indeed project to the olfactory cortex. Based on our analysis, we define three molecularly distinct types of MCs and five distinct types of TCs.

The neuronal cell types we have characterized here likely represent the major categories of OB projection neurons. More extensive sampling might reveal additional rare cell types, and more fine-grained clustering could further subdivide subtypes of neurons. However, our samples contained 7504 putative projection neurons compared to current estimates of 10,000–30,000 projection neurons overall per OB (Nagayama et al., 2014; Richard et al., 2010). Furthermore, we sorted nuclei rather than whole cells, which is thought to more accurately reflect relative cell-type abundance (Habib et al., 2017; Lake et al., 2017). Altogether we are therefore confident that our analysis captures the key biologically relevant types of projection neurons. Independent from the number of molecularly distinct neuronal cell types, the gene expression profiles we have described here provide critical new tools for refining projection neuron cell type identities by aligning a cell’s molecular features with its functional properties. Previous experiments have highlighted the heterogeneity of odor responses of MCs and TCs (Balu et al., 2004; Bathellier et al., 2008; Carey and Wachowiak, 2011; Desmaisons et al., 1999; Friedman and Strowbridge, 2000; Schaefer et al., 2006). We propose that this functional diversity can be explained, at least in part, by the molecular diversity of OB projection neurons, a model that can now be tested experimentally.

Specificity of mitral cell projections

A critical feature of neuronal cell type identity is their projection target specificity. Earlier studies have shown that TCs project to anterior regions of the olfactory cortex only, while MCs project to both anterior and posterior olfactory cortex (Imamura et al., 2020; Nagayama et al., 2010; Scott et al., 1980). Furthermore, and in contrast to the organization of neuronal projections to sensory cortexes for vision, hearing, or touch, projections from the OB to the piriform cortex appear to lack apparent topographical organization (Ghosh et al., 2011; Miyamichi et al., 2011; Sosulski et al., 2011, see also Chen et al., 2021).

We have analyzed, using bulk RNA deep sequencing, gene expression profiles of cells projecting to anterior versus posterior olfactory cortex (AON versus PCx). Using simulations based on gene regulatory network analysis, we have then mapped the bulk RNA sequencing data onto MC types defined by single-nucleus RNA sequencing. Interestingly, our analysis suggested that cells of the M1 MC type preferentially target the AON, while M2 and M3 cells preferentially target the pPCx. A targeted snRNA-seq experiment of posterior-projecting MCs supported this prediction. Our findings that MC types preferentially target anterior or posterior cortical areas is consistent with recent observations using MAP-seq (Chen et al., 2021). Our results are consistent with the model that gene expression and connectivity provide overlapping yet complementary axes for cell-type classification (Kim et al., 2020).

Molecular mechanisms underlying the functional diversity of olfactory bulb projection neurons

Gene regulatory network analysis can reveal the transcriptional programs that determine the functional properties of neuronal subtypes. Here, we describe cell-type-specific modules of gene regulation, defined by the interactions of transcription factors and their target genes. One intriguing result of this analysis is that cell subtypes do not fall into clearly delineated MC and TC classes. For example, module activity of M2b and M2c MC subtypes is more similar to that of T1c TCs than other M1 and M2 MC subtypes. We obtained similar results from analyzing the TF-TF network that is thought to be closely linked to maintenance of cell identity. This demonstrated that while some MC subtypes indeed share key TFs with each other, regulon activity and network features in MC subtypes and TC subtypes are highly overlapping. For example, the most prominent hub, Pbx3, is predominantly expressed in M1 and ET2 types. More generally, projection neurons are characterized by a variety of TFs, often closely linked through feedback cycles, that are used by both MC and TC types in a combinatorial manner. The prominent hub and cycle-related genes we have identified here may act as candidate master regulators of neuronal function, which can be targeted for experimental validation. Together, these results suggest that subtypes of MCs and TCs may share important functional properties, possibly blurring at the transcriptional level the traditional division into tufted and mitral cells as the two major classes of OB projection neurons. Moreover, the gradients of module activity that we observed over the MC and TC subtypes theoretically provide a mechanism for generating multiple distinct cellular phenotypes, similar to how morphogenetic gradients allow for spatial patterning and cell differentiation (Wolpert, 1969). Through non-linear regulatory interactions, gradual differences at the transcriptomic level can be translated into selective expression of functional genes.

For example, we found that the Kcng1 gene was selectively expressed in M1 MCs. The Kcng1 gene encodes for a voltage-gated potassium channel, which forms heterotetrameric channels with the ubiquitously expressed delayed rectifying Kv2.1 potassium channel (indeed also expressed in the M1 cluster) and modifies the kinetics of channel activation and deactivation (Kramer et al., 1998). Other voltage-gated potassium channels exhibiting prominent differential expression levels in MC and TC subtypes include Kcnd3, Kcng1, Kcnh5, Kcnq3, Kcnj2 and 6, and Hcn1 (for details see accompanying website link in Materials and methods). These channels represent intriguing candidates for controlling the differential excitability of different MC and TC types.

We also found that a large number of cell adhesion and axon guidance genes known to control the formation and maintenance of neuronal connectivity were differentially expressed in OB projection neuron types. Examples include members of the cadherin superfamily of cell adhesion glycoproteins (Cdh6, 7, 8 c, 9, 13, and 20) and components of the Semaphorin/Neuropilin/Plexin complexes, including Nrp1, Nrp2, Plexna3, Sema3e, and Sema5b. Semaphorin/Neuropilin/Plexin complexes are known to play critical roles in the development and maintenance of neuronal connections, including in OB MCs (Inokuchi et al., 2017; Saha et al., 2007). Heterogeneity in these cell adhesion and guidance genes might inform subdivisions in projection neurons across the OB, in particular along the dorsomedial–ventrolateral axis.

Our data set will be an important resource for the emerging study of the evolution of olfactory neural circuits across species. Adaptation to distinct olfactory environments, and the critical role of olfaction in survival and reproduction, has shaped the evolution of the repertoire of odorant receptors and olfactory sensory neurons (Bargmann, 2006; Yoshihito, 2012). However, little is known about how evolving sensory inputs from the nose are accommodated at the level of the OB and its connections to the olfactory cortex. The detailed molecular description of mouse OB projection neurons presented here provides an entry point toward understanding the evolution of olfactory sensory processing across species.

Materials and methods

Key resources table.

Reagent type
(species) or resource
Designation Source or reference Identifiers Additional information
Strain, strain background (M. musculus) C57Bl/6 Jackson Laboratory Stock #: 000664
RRID:IMSR_JAX:000664
Genetic reagent (Adeno-associated virus) rAAVretro-CAG-H2B-GFP Tervo et al., 2016 PMID:27720486
Commercial assay or kit Nuclei PURE Prep Sigma cat#: NUC201-1KT
Commercial assay or kit Arcturus PicoPure RNA Isolation Kit ThermoFisher cat#: KIT0204
Other DRAQ5 ThermoFisher cat#: 65-0880-92
Software, algorithm Cell Ranger version 3.0 10x Genomics RRID:SCR_017344
Software, algorithm Seurat v3.6 Butler et al., 2018 RRID:SCR_007322
Software, algorithm Scanpy v1.7.1 Wolf et al., 2018 RRID:SCR_018139
Software, algorithm glmGamPoi R-package Ahlmann-Eltze and Huber, 2021 PMID:33295604
Software, algorithm SCENIC Aibar et al., 2017 RRID:SCR_017247
Software, algorithm Cytoscape Shannon et al., 2003 RRID:SCR_003032
Software, algorithm ImageJ version 2.1.0. Schindelin et al., 2012 RRID:SCR_003070

Experimental model and subject details

Male and female C57Bl/6 mice (6–8 weeks old) were used in this study and obtained by in-house breeding. All animal protocols were approved by the Ethics Committee of the board of the Francis Crick Institute and the United Kingdom Home Office under the Animals (Scientific Procedures) Act 1986 (project license number PA2F6DA12), as well as Brown University’s Institutional Animal Care and Use Committee (protocol number: 21-03-0004) followed by the guidelines provided by the National Institutes of Health.

Stereotaxic injections and histology

Mice were anaesthetized using isoflurane and prepared for aseptic surgery in a stereotaxic frame (David Kopf Instruments). A retrogradely transported Adeno Associated Virus (rAAV-retro-CAG-H2B-GFP, Tervo et al., 2016) was injected stereotaxically into multiple sites of piriform cortex (PCx) and anterior olfactory nucleus (AON). The following coordinates, based on the Paxinos and Franklin Mouse Brain Atlas, were used: Coordinates (AP/ML/DV) in mm for PCx injections: (1) −0.63/−4.05/−4.10, (2) −0.8/−4.00/−4.10. For AON injections: (1) 2.8/1.25/2.26 and 2.6, (2) 2.68/1.25/2.3 and 2.75, and (3) 2.34/0.7/3.5. Using a micromanipulator, a pulled glass micropipette was slowly lowered into the brain and left in place for 30 s before the virus was dispensed from the micropipette using a Nanoject injector (Drummond Scientific) at a rate of 46 nl/min (0.3 μl for PCx and 0.2 μl for AON per injection site). The micropipette was left in place for an additional 5 min before being slowly withdrawn to minimize diffusion along the injection tract. Craniotomies were covered with silicone sealant (WPI), and the skin was sutured. Mice were provided with 5 mg/kg Carprofen in their drinking water for 2 days following surgery.

Histology was used to validate viral targeting of olfactory bulb projection neurons. Mice were deeply anaesthetized with 2.5% of 250 mg/kg Avertin and transcardially perfused with 10 ml of ice-cold phosphate-buffered saline (PBS) followed by 10 ml of 4% paraformaldehyde (PFA). Brains were dissected and post-fixed for 5 hr in 4% PFA at 4°C. Coronal sections (100 µm thick) were prepared using a vibrating-blade Leica VT100S Vibratome. Sections were rinsed in PBS and incubated in PBS/0.1% Triton X-100 and Neurotrace counterstain (1:1000, ThermoFisher) at 4°C overnight, then mounted on SuperFrost Premium microscope slides (Fisher, cat# 12-544-7) in Fluorescent Vectashield Mounting Medium (Vector). Images were acquired at 20× using a Nikon A1R-HD confocal microscope. While this injection strategy cannot fully exclude labeling of axons of passage (e.g. axons passing through AON), these are unlikely to constitute a significant sub-population of labeled neurons as, e.g., we identified exclusively PCx projecting cell types as well.

Single-nuclei isolation, FANS, and RNA extraction

To isolate GFP-labeled nuclei, 10 individual biological replicates were used. For bulk RNA deep sequencing: three replicates of AON-injected mice and three replicates of PCx-injected mice (all females, AON1 = 7 w, AON2 = 6w, AON3 = 8w, PCx1 = 7w, PCx2 = 6w, PCx3 = 7w); for single-nucleus RNA sequencing (sn-R1/R2/R3 dataset): three replicates of AON- and PCx-injected mice (all males, 7 w); for validating the simulation results using single-nucleus RNA sequencing (sn-PCx dataset): 1 PCx-injected mouse (female, 11 w). Mice were deeply anaesthetized with an overdose of ketamine/xylazine and transcardially perfused with ice-cold PBS. Both hemispheres of the olfactory bulb were dissected, and the hemisphere ipsilateral to the injection site was carefully separated from the contralateral hemisphere. Both hemispheres were minced separately and placed into two different tubes. To dissociate single nuclei, Nuclei PURE Prep was used according to the manufacturer instructions (Sigma, cat# NUC201-1KT) with some modifications. The minced tissue was gently homogenized in 2.75 ml Nuclei PURE Lysis Buffer and 27.5 μl 10% Triton X-100 using an ice-cold dounce and pestle, and filtered two times through a 40 μm cell strainer on ice. After centrifuging at 500 rpm for 5 min at 4°C, the supernatant was aspirated and gently resuspended in 500 μl of cold buffer (1× of cold Hanks’ Balanced Salt Solution HBSS, 1% nuclease-free BSA, 22.5 μl of RNasin Plus [Promega N2611] and 1/2000 DRAQ5). The remaining PBS-perfused brain of the PCx-injected mouse (for sn-PCx dataset) was post-fixed 5 hr in 4% PFA at 4°C and used for histology validation of the injection points.

Fluorescence-activated nuclei sorting of single nuclei was performed using a BD FACSAria III Cell Sorter with a 70 μm nozzle at a sheath pressure of 70 psi. Precision mode (yield mask set to 16, purity mask set to 16 and phase mask set to 0) was used for stringent sorting. For single-nucleus RNA sequencing, GFP+ nuclei were sorted into a 1.5 ml centrifuge tube. For bulk RNA deep sequencing, GFP+ nuclei were sorted into 100 μl Trizol and 1.43 μl of RNA carrier, and total RNA was extracted using the Arcturus PicoPure RNA Isolation Kit (ThermoFisher, cat# KIT0204).

Single-nucleus RNA sequencing

The sn-R1/R2/R3 cDNA(s) were prepared using the 10× kit version Chromium Single Cell 3’ v3. Libraries were prepared using the Next Single Cell/Low Input RNA Library Prep Kit (New England Biolabs). The quality and quantity of the final libraries were assessed with the TapeStation D5000 Assay (Agilent Technologies) before sequencing with an Illumina HiSeq 4000 platform. cDNA concentrations were measured as: 14.4 (R1), 23.3 (R2), 7.9 (R3) ng/μl (n = 3 animals).

The sn-PCx cDNA and library were prepared using the 10x kit version Chromium Single Cell 3ʹ v3.1. The quality and quantity of the final library was assessed with the TapeStation D1000 Assay (Agilent Technologies) before sequencing with an Illumina NextSeq500. The cDNA concentration was measured as 1.09 ng/μl.

Bulk RNA deep sequencing

Libraries were prepared using the Next Single Cell/Low Input RNA Library Prep Kit (New England Biolabs). The quality and quantity of the final libraries was assessed with the TapeStation D1000 Assay (Agilent Technologies) before sequencing with an Illumina HiSeq 4000 platform. RNA concentrations were measured as: AON injections (n = three animals), 1.495, 1.682, 1.881 ng/μl and RNA integrity numbers (RIN) 8.3, 8.7, 9.0; PCx injections (n = 3 animals), 0.257, 0.165, 0.133 ng/μl; RIN = 8.0, 10.0, 7.8 for each replicate, respectively.

Single-nucleus RNA sequencing analysis

Raw sequencing datasets were processed using Cell Ranger version 3.0 (10x Genomics). Count tables were loaded into R (version 3.6, https://www.r-project.org) and further processed using Seurat v3.6 (Butler et al., 2018) and Scanpy v1.7.1 (Wolf et al., 2018).

For all four datasets (three biological replicates sn-R1/R2/R3 and sn-PCx), we removed all nuclei with fewer than 1000 distinct genes detected or with more than 5% of unique molecular identifiers stemming from mitochondrial genes. After quality control, for the large sn-RNAseq dataset we merged the three replicates using standard functionality of Seurat: we apply NormalizeData and FindVariableFeatures on the individual replicates, then apply Canonical Correlation Analysis (CCA) (Stuart et al., 2019). We retained a total of 31,703 nuclei (median of 2300 genes per nucleus; for each replicate median genes per nucleus: R1 = 2,266; R2 = 2419; R3 = 2,322). For the sn-PCx dataset, we retained a total of 74 nuclei (median genes per nucleus: 6,092). We describe in detail its downstream analysis in the Materials and methods section ‘Validation of regulon-based simulations'. For the sn-R1/R2/R3 dataset, principal component analysis (PCA) was then performed on highly variable genes and the first 30 principal components were selected as input for clustering and UMAP, based on manual inspection of a principal component variance plot (PC elbow plot). Clustering was performed using the default method (Louvain) from the Seurat package, with the resolution parameter of the FindClusters function set to 0.3.

Subclustering of projection neurons was carried out by selecting clusters M1, M2/M3, T1, ET1 and ET2 from the initial single-nuclei analysis based on the combinatorial expression patterns of glutamatergic and previously characterized mitral/tufted cell markers (Tbx21, Pcdh21, Thy1, Vglut1, Vglut2, and Vglut3). For subclustered nuclei, we again performed the above steps of highly variable genes selection, principal component analysis, and clustering, this time with the Louvain resolution parameter set to 0.5.

Differential gene expression analysis on single-nuclei data was performed using the glmGamPoi R-package (Ahlmann-Eltze and Huber, 2021). Gene set enrichment analysis (GSEA) on the resulting log-fold changes was performed as described in Subramanian et al., 2005.

Network inference

Gene regulatory networks were inferred using the pySCENIC pipeline (Single-Cell rEgulatory Network InferenCe, Aibar et al., 2017; Van de Sande et al., 2020) and visualized using Jupyter notebooks and Cytoscape (Shannon et al., 2003). We used pySCENIC on the projection neuron subcluster data after removal of nuclei assigned the PG cell type. pySCENIC is a three-step approach: (1) predict TF-target gene pairs using Arboreto; (2) filter TF-target gene associations for false positives using TF binding site enrichment in a window of 5 kb around a target’s transcription start site (TSS) and group TFs with their target genes into so-called regulons; (3) calculate the activity of regulons in each cell in terms of the area under the recovery curve (AUC). Step one depends on a stochastic search algorithm and is therefore performed n = 100 times. Only TFs that are found >80 times and with TF-target gene interactions that occur >80 times are considered. To avoid technical issues in the analysis, regulons with fewer than eight target genes are removed from the final list. Subsequent analysis in Step three involves a stochastic downsampling to speed up computation; hence, we verified that the chosen sample size was sufficient for accurate AUC approximations. We calculated n = 25 AUC matrices and confirmed that they contained few zeros and the variance of each matrix entry (i.e. approximated regulon activity in a given cell) was low.

Quantification along cell type trajectories

Gradual and sudden changes between cell types are studied using pseudotime trajectory analysis. We used the partition-based graph abstraction (PAGA) functionality offered by the Python package Scanpy (Wolf et al., 2019). Given a set of clusters, PAGA computes a graph as a condensed representation of how the clusters are distributed in transcriptomic/PCA space. It then uses a diffusion map to smoothly travel along the paths in the graph (Figure 4—figure supplement 3). We focused on a single path, namely ET4, ET1, T1, M2, M3, M1, ET2 (ignoring ET3, as it has few neurons). We then computed trajectories of module and regulon activity levels, and TF expression levels using a running average with window size of 100 nuclei to smoothen the signal.

Bulk RNA deep sequencing analysis

The ‘Trim Galore!’ utility version 0.4.2 was used to remove sequencing adaptors and to quality trim individual reads with the q-parameter set to 20. Sequencing reads were then aligned to the mouse genome and transcriptome (Ensembl GRCm38 release-89) using RSEM version 1.3.0 (Li and Dewey, 2011) in conjunction with the STAR aligner version 2.5.2 (Dobin et al., 2013). Sequencing quality of individual samples was assessed using FASTQC version 0.11.5 and RNA-SeQC version 1.1.8 (DeLuca et al., 2012). Differential gene expression was determined using the R-bioconductor package DESeq2 version 1.24.0 (Love et al., 2014). GSEA was conducted as described in Subramanian et al., 2005.

Integration of single-nucleus and bulk RNA deep sequencing data using regulon-based simulations

Nuclei were simulated from bulk RNA deep sequencing data using a weighted random sampling of regulons with replacement. A regulon’s relative weight corresponded to the prevalence of its transcription factor in the given bulk RNA-seq sample. Each time a regulon was selected, the counts for its transcription factor and all its target genes increased by one. This weighted random sampling of regulons maintains key information from the starting bulk RNA-seq data (in the form of weights) while also maintaining key co-expression patterns of single nuclei (in the form of regulons), creating a population of biologically-realistic simulated transcriptomes that reflect the starting bulk RNA-seq data. The number of regulons expressed in each simulated nucleus was randomly selected from a list of how many unique transcription factors each nucleus from snRNA-seq expressed (normalized expression > 2). Simulated nuclei were treated as raw count matrices and integrated with snRNA-seq nuclei using the sctransform package in R (Hafemeister and Satija, 2019). Briefly, this integration algorithm normalizes counts as the Pearson residuals from a regularized negative binomial regression model with sequencing depth as a covariate. The main result is the removal of batch effects related to sequencing depth, which facilitates pooling of our snRNA-seq replicates and makes the results of the simulation pipeline invariant to the number of regulons chosen for each nucleus, keeping results driven by proportions in the starting bulk RNA-seq data rather than absolute numbers.

To filter simulated nuclei, we trained 1000 LDA classifiers with the python package scikit-learn (Pedregosa et al., 2011). For each classifier, snRNA-seq nuclei were split into test and train datasets, with 75% of nuclei used for training and the other 25% used for testing. This train-test split was unique to each classifier. Each classifier was trained to predict whether a nucleus was a projection neuron (i.e. whether it was selected for subclustering in the initial transcriptome-based Seurat analysis) based on values for the 30 top principal components from the sctransform integration. Principal component analysis projects all data points into the same space and reduces the number of features, enabling better classification performance. Each classifier was applied to the remaining snRNA-seq nuclei for testing, and accuracy was evaluated using the Jaccard index calculated by scikit-learn (Pedregosa et al., 2011). The classifiers were then applied to the simulated nuclei. Simulated nuclei predicted to be projection neurons by all 1000 classifiers were designated as putative simulated projection neurons and selected for further analysis. Similarly, these putative simulated projection neurons were integrated with snRNA-seq mitral cells using sctransform. One thousand LDA classifiers were trained to classify simulated nuclei as AON-projecting or PCx-projecting based on values for the 30 top principal components from the sctransform integration. Each classifier was trained on 75% of the simulated projection neurons and tested on the other 25%, with accuracy evaluated using the Jaccard index. Each classifier was then applied to snRNA-seq mitral cells.

Validation of regulon-based simulations

To assign the cell type identity in the sn-PCx dataset, we first identified projection neurons by transforming the expression matrix into the space of 30 principal components defined on the larger sn-R1/R2/R3 dataset. One thousand LDA classifiers were trained to classify cells as projection neurons or not based on the values for these 30 principal components. Each classifier was trained on 75% of the original sn-R1/R2/R3 dataset and tested on the other 25%. sn-PCx cells were considered projection neurons if they were predicted to be such by all 1000 classifiers.

Sn-PCx projection neurons were then transformed into a low-dimensional space defined by their expression of the 86 regulons from our SCENIC analysis on the original sn-R1/R2/R3 projection neurons. These regulons create a low-dimensional space in which to compare this new, targeted sn-PCx dataset to our original, larger sn-R1/R2/R3 dataset. To quantify this comparison, we used LDA classifiers, trained on the original mitral cells, to predict the cell type of the new projection neurons. Each of 1000 LDA classifiers was trained on 75% of the original data and tested on the other 25% to predict cell type identity based on regulon activity scores. These classifiers were then applied to the regulon activity scores of the new projection neurons. Statistics were performed on the results using binomial distributions. To calculate a p-value for the abundance of M2 cells in the sn-PCx dataset, we used the R command pbinom(44,57,1241/2114, lower.tail=F) to quantify the probability of observing that many or more M2 cells in a population of that size were the probability of sampling M2 as opposed to a different MC equal to that of sampling M2 from our sn-R1/R2/R3 dataset. With similar logic, to calculate a p-value for the scarcity of M1 cells in the sn-PCx data, we used the R command pbinom(6,57,711/2114) to calculate the probability of observing that many or fewer M1 cells.

smFISH in tracing experiments and quantification

Experiments were performed according to the manufacturer’s instructions, using the RNAscope Fluorescent Multiplex kit (Advanced Cell Diagnostics [ACD]) for fresh frozen tissue. Briefly, a total of six mice were injected into the AON and PCx with the rAAVretro-CAG-H2B-GFP. After 15 days post-injection, mice were deeply anaesthetized with 2.5% of 250 mg/kg Avertin and transcardially perfused with 10 ml of ice-cold phosphate-buffered saline (PBS). The brains were dissected out from the skull, immediately embedded in Tissue Plus O.C.T. compound (Fisher Healthcare) and snap-frozen in a bath of 2-methylbutane on dry ice. Brains were cryo-sectioned coronally at 20 µm thickness, mounted on Fisherbrand Superfrost Plus microscope slides (Fisher Scientific), and stored at −80 °C until use. In situ probes against the following mouse genes were ordered from ACD and multiplexed in the same permutations across sections: Foxo1 (#485761-C2 and 485761), Kcng1 (#514181-C2), Lxh1 (#488581), Sertm1 (#505401-C2), Ebf3(#576871-C3), Sgcg (#488051-C3), Cadps2 (#529361-C3 and 529361), Coch (#530911-C3), Ly6g6e (#506391-C2), Wnt5b (#405051), Fst (#454331), Barhl2 (#492331-C2), Vdr (524511-C3), Gfp (#409011, #409011-C2 and #409011-C3), Piezo2 (#500501), Olfr110/111 (#590641), Calca (#578771), Lhx5 (#885621-C3), and Vgll2 (#885631-C2). Following smFISH, high-resolution images of a single z-plane were obtained using a 60× oil immersion objective on an Olympus FV3000 confocal microscope and a 40× oil immersion objective on a Nikon A1R-HD confocal microscope.

For each image, we first counted the number of cells positive for the less abundant probe (probe A), on average 60 cells for each probe. We then counted amongst probe A-positive cells how many cells also expressed the second probe (probe B). We report overlap as the percentage of probe A-positive cells also expressing probe B. For mitral cell markers we only considered cells along the mitral cell and internal plexiform layers. For tufted cell markers, we only considered cells in the external plexiform and glomerular layers. We considered a cell positive for a given smFISH probe if more than five dots/cell were present, and if the dots were located within the nucleus and perinuclear compartment of the cell, visualized by DAPI counterstain.

Acknowledgements

We thank the Crick Advanced Sequencing Facility, especially Robert Goldstone and Amelia Edwards for their excellent support. We thank Debipriya Das and Ana Agua-Doce from the Crick Flow Cytometry Facility for technical assistance, and the Crick animal facility for animal care. We thank the members of the Crick Digital Development Team, particularly Amy Strange, Luke Nightingale, Jude Pinnock, and Marc Pollitt for excellent technical support. We thank the Harvard Neurobiology Imaging Facility for consultation and RNAscope services. This facility is supported in part by the Neural Imaging Center as part of an NINDS P30 Core Center grant #NS072030. We also thank Zach Herbert and Maura Berkeley from the Molecular Biology Core Facilities at the Dana-Farber Cancer Institute for sequencing services. We thank Kevin Carlson of the Flow Cytometry and Sorting Facility at Brown University for his technical assistance. This facility is supported in part by the NCRR equipment grant 1S10RR021051. We thank Keeley Baker, Gilad Barnea, Bob Datta, Kevin Franks, and Paul Greer for critical comments on the manuscript. Work in the ATS lab was supported by the Francis Crick Institute, which receives its core funding from Cancer Research UK (FC001153), the UK Medical Research Council (FC001153), and the Wellcome Trust (FC001153); a Wellcome Trust Investigator grant to ATS (110174/Z/15/Z), and a DFG postdoctoral fellowship to TA. Work is the AF lab was supported by grants from the NIH (1U19NS112953-01, 1R01DC017437-03) and the Robert J and Nancy D Carney Institute for Brain Science. Carney Institute computational resources used in this work were supported by the NIH Office of the Director grant S10OD025181.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Andreas T Schaefer, Email: andreas.schaefer@crick.ac.uk.

Alexander Fleischmann, Email: alexander_fleischmann@brown.edu.

Naoshige Uchida, Harvard University, United States.

Catherine Dulac, Harvard University, United States.

Funding Information

This paper was supported by the following grants:

  • Cancer Research UK FC001153 to Andreas T Schaefer.

  • Wellcome Trust FC001153 to Andreas T Schaefer.

  • Deutsche Forschungsgemeinschaft AC 304/1-1 to Tobias Ackels.

  • National Institutes of Health 1R01DC017437-03 to Alexander Fleischmann.

  • National Institutes of Health 1U19NS112953-01 to Alexander Fleischmann.

  • National Institutes of Health S10OD025181 to Alexander Fleischmann.

  • Medical Research Council FC001153 to Andreas T Schaefer.

  • Wellcome Trust 110174/Z/15/Z to Andreas T Schaefer.

Additional information

Competing interests

Reviewing editor, eLife.

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Formal analysis, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Software, Formal analysis, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Investigation, Visualization, Writing - original draft, Writing - review and editing.

Writing - review and editing, Essential reagent.

Software, Formal analysis, Visualization, Methodology, Writing - review and editing.

Conceptualization, Software, Formal analysis, Supervision, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Investigation, Writing - original draft, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Investigation, Writing - original draft, Writing - review and editing.

Ethics

Animal experimentation: All animal protocols were performed in strict accordance with the recommendations approved by the Ethics Committee of the board of the Francis Crick Institute and the United Kingdom Home Office under the Animals (Scientific Procedures) Act 1986 (project license number PA2F6DA12), as well as Brown University's Institutional Animal Care and Use Committee (protocol number: 21-03-0004) followed by the guidelines provided by the National Institutes of Health.

Additional files

Transparent reporting form

Data availability

Raw single nucleus RNA sequencing data (large sn-R1/R2/R3 and targeted sn-PCx datasets) have been deposited in Gene Expression Omnibus (GEO) under the accession numbers GSE162654 and GSM5363097, respectively. Bulk RNA deep sequencing data has been deposited in GEO under the accession number GSE162655. The R and Python analysis scripts developed for this paper are available at the GitLab links https://gitlab.com/fleischmann-lab/papers/molecular-characterization-of-projection-neuron-subtypes-in-the-mouse-olfactory-bulb (copy archived at https://archive.softwareheritage.org/swh:1:rev:4bbee4380a84756ea0e6532f5603b2d99c8e62fa) and https://gitlab.inria.fr/acrombac/projection-neurons-mouse-olfactory-bulb (copy archived at https://archive.softwareheritage.org/swh:1:rev:b9d253e7885124d810b52a0cd511bf8032b5efa5). Extensive computational tools for additional in-depth exploration of our data sets are available through our website: https://biologic.crick.ac.uk/OB_projection_neurons.

The following datasets were generated:

Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSE162654

Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. RNA deep sequencing data. NCBI Gene Expression Omnibus. GSE162655

Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2021. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSM5363097

References

  1. Ackels T, Jordan R, Schaefer AT, Fukunaga I. Respiration-Locking of olfactory receptor and projection neurons in the mouse olfactory bulb and its modulation by brain state. Frontiers in Cellular Neuroscience. 2020;14:220. doi: 10.3389/fncel.2020.00220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ahlmann-Eltze C, Huber W. glmGamPoi: fitting Gamma-Poisson generalized linear models on single cell count data. Bioinformatics. 2021;36:5701–5702. doi: 10.1093/bioinformatics/btaa1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Aibar S, González-Blas CB, Moerman T, Huynh-Thu VA, Imrichova H, Hulselmans G, Rambow F, Marine JC, Geurts P, Aerts J, van den Oord J, Atak ZK, Wouters J, Aerts S. SCENIC: single-cell regulatory network inference and clustering. Nature Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Alexander RP, Kim PM, Emonet T, Gerstein MB. Understanding modularity in molecular networks requires dynamics. Science Signaling. 2009;2:pe44. doi: 10.1126/scisignal.281pe44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Angelo K, Rancz EA, Pimentel D, Hundahl C, Hannibal J, Fleischmann A, Pichler B, Margrie TW. A biophysical signature of network affiliation and sensory processing in mitral cells. Nature. 2012;488:375–378. doi: 10.1038/nature11291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Arendt D, Musser JM, Baker CVH, Bergman A, Cepko C, Erwin DH, Pavlicev M, Schlosser G, Widder S, Laubichler MD, Wagner GP. The origin and evolution of cell types. Nature Reviews Genetics. 2016;17:744–757. doi: 10.1038/nrg.2016.127. [DOI] [PubMed] [Google Scholar]
  7. Avila Cobos F, Vandesompele J, Mestdagh P, De Preter K. Computational deconvolution of transcriptomics data from mixed cell populations. Bioinformatics. 2018;34:1969–1979. doi: 10.1093/bioinformatics/bty019. [DOI] [PubMed] [Google Scholar]
  8. Balu R, Larimer P, Strowbridge BW. Phasic stimuli evoke precisely timed spikes in intermittently discharging mitral cells. Journal of Neurophysiology. 2004;92:743–753. doi: 10.1152/jn.00016.2004. [DOI] [PubMed] [Google Scholar]
  9. Bargmann CI. Comparative chemosensation from receptors to ecology. Nature. 2006;444:295–301. doi: 10.1038/nature05402. [DOI] [PubMed] [Google Scholar]
  10. Bartel DL, Rela L, Hsieh L, Greer CA. Dendrodendritic synapses in the mouse olfactory bulb external plexiform layer. Journal of Comparative Neurology. 2015;523:1145–1161. doi: 10.1002/cne.23714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bathellier B, Buhl DL, Accolla R, Carleton A. Dynamic ensemble odor coding in the mammalian olfactory bulb: sensory information at different timescales. Neuron. 2008;57:586–598. doi: 10.1016/j.neuron.2008.02.011. [DOI] [PubMed] [Google Scholar]
  12. Becskei A, Séraphin B, Serrano L. Positive feedback in eukaryotic gene networks: cell differentiation by graded to binary response conversion. The EMBO Journal. 2001;20:2528–2535. doi: 10.1093/emboj/20.10.2528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Boyd AM, Sturgill JF, Poo C, Isaacson JS. Cortical feedback control of olfactory bulb circuits. Neuron. 2012;76:1161–1174. doi: 10.1016/j.neuron.2012.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Buck L, Axel R. A novel multigene family may encode odorant receptors: a molecular basis for odor recognition. Cell. 1991;65:175–187. doi: 10.1016/0092-8674(91)90418-X. [DOI] [PubMed] [Google Scholar]
  15. Buonviso N, Amat C, Litaudon P, Roux S, Royet J-P, Farget V, Sicard G. Rhythm sequence through the olfactory bulb layers during the time window of a respiratory cycle. European Journal of Neuroscience. 2003;17:1811–1819. doi: 10.1046/j.1460-9568.2003.02619.x. [DOI] [PubMed] [Google Scholar]
  16. Burton SD, Ermentrout GB, Urban NN. Intrinsic heterogeneity in oscillatory dynamics limits correlation-induced neural synchronization. Journal of Neurophysiology. 2012;108:2115–2133. doi: 10.1152/jn.00362.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Burton SD, Urban NN. Greater excitability and firing irregularity of tufted cells underlies distinct afferent-evoked activity of olfactory bulb mitral and tufted cells. The Journal of Physiology. 2014;592:2097–2118. doi: 10.1113/jphysiol.2013.269886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nature Biotechnology. 2018;36:411–420. doi: 10.1038/nbt.4096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Carey RM, Wachowiak M. Effect of sniffing on the temporal structure of mitral/Tufted cell output from the olfactory bulb. Journal of Neuroscience. 2011;31:10615–10626. doi: 10.1523/JNEUROSCI.1805-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Chen Y, Chen X, Bserdem B, Zhan H, Li Y, Davis MB, Kebschull JM, Zador AMK, Albeanu DF. Wiring logic of the early rodent olfactory system revealed by high-throughput sequencing of single neuron projections. bioRxiv. 2021 doi: 10.1101/2021.05.12.443929. [DOI]
  21. Christie JM, Schoppa NE, Westbrook GL. Tufted cell dendrodendritic inhibition in the olfactory bulb is dependent on NMDA receptor activity. Journal of Neurophysiology. 2001;85:169–173. doi: 10.1152/jn.2001.85.1.169. [DOI] [PubMed] [Google Scholar]
  22. De Saint Jan D, Hirnet D, Westbrook GL, Charpak S. External tufted cells drive the output of olfactory bulb glomeruli. Journal of Neuroscience. 2009;29:2043–2052. doi: 10.1523/JNEUROSCI.5317-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. DeLuca DS, Levin JZ, Sivachenko A, Fennell T, Nazaire M-D, Williams C, Reich M, Winckler W, Getz G. RNA-SeQC: rna-seq metrics for quality control and process optimization. Bioinformatics. 2012;28:1530–1532. doi: 10.1093/bioinformatics/bts196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Desmaisons D, Vincent J-D, Lledo P-M. Control of action potential timing by intrinsic subthreshold oscillations in olfactory bulb output neurons. The Journal of Neuroscience. 1999;19:10727–10737. doi: 10.1523/JNEUROSCI.19-24-10727.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Ezeh PI, Wellis DP, Scott JW. Organization of inhibition in the rat olfactory bulb external plexiform layer. Journal of Neurophysiology. 1993;70:263–274. doi: 10.1152/jn.1993.70.1.263. [DOI] [PubMed] [Google Scholar]
  27. Fadool DA, Tucker K, Pedarzani P. Mitral cells of the olfactory bulb perform metabolic sensing and are disrupted by obesity at the level of the Kv1.3 ion channel. PLOS ONE. 2011;6:e24921. doi: 10.1371/journal.pone.0024921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Friedman D, Strowbridge BW. Functional role of NMDA autoreceptors in olfactory mitral cells. Journal of Neurophysiology. 2000;84:39–50. doi: 10.1152/jn.2000.84.1.39. [DOI] [PubMed] [Google Scholar]
  29. Fukunaga I, Berning M, Kollo M, Schmaltz A, Schaefer AT. Two distinct channels of olfactory bulb output. Neuron. 2012;75:320–329. doi: 10.1016/j.neuron.2012.05.017. [DOI] [PubMed] [Google Scholar]
  30. Fuxman Bass JI, Diallo A, Nelson J, Soto JM, Myers CL, Walhout AJ. Using networks to measure similarity between genes: association index selection. Nature Methods. 2013;10:1169–1176. doi: 10.1038/nmeth.2728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Geramita MA, Burton SD, Urban NN. Distinct lateral inhibitory circuits drive parallel processing of sensory information in the mammalian olfactory bulb. eLife. 2016;5:e16039. doi: 10.7554/eLife.16039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ghosh S, Larson SD, Hefzi H, Marnoy Z, Cutforth T, Dokka K, Baldwin KK. Sensory maps in the olfactory cortex defined by long-range viral tracing of single neurons. Nature. 2011;472:217–220. doi: 10.1038/nature09945. [DOI] [PubMed] [Google Scholar]
  33. Gire DH, Zak JD, Bourne JN, Goodson NB, Schoppa NE. Balancing extrasynaptic excitation and synaptic inhibition within olfactory bulb glomeruli. Eneuro. 2019;6:ENEURO.0247-19.2019. doi: 10.1523/ENEURO.0247-19.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Gire DH, Schoppa NE. Control of on/Off glomerular signaling by a local GABAergic microcircuit in the olfactory bulb. Journal of Neuroscience. 2009;29:13454–13464. doi: 10.1523/JNEUROSCI.2368-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Griff ER, Mafhouz M, Chaput MA. Comparison of identified mitral and tufted cells in freely breathing rats: ii. Odor-evoked responses. Chemical Senses. 2008;33:793–802. doi: 10.1093/chemse/bjn040. [DOI] [PubMed] [Google Scholar]
  36. Haberly LB, Price JL. The axonal projection patterns of the mitral and tufted cells of the olfactory bulb in the rat. Brain Research. 1977;129:152–157. doi: 10.1016/0006-8993(77)90978-7. [DOI] [PubMed] [Google Scholar]
  37. Habib N, Avraham-Davidi I, Basu A, Burks T, Shekhar K, Hofree M, Choudhury SR, Aguet F, Gelfand E, Ardlie K, Weitz DA, Rozenblatt-Rosen O, Zhang F, Regev A. Massively parallel single-nucleus RNA-seq with DroNc-seq. Nature Methods. 2017;14:955–958. doi: 10.1038/nmeth.4407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Haddad R, Lanjuin A, Madisen L, Zeng H, Murthy VN, Uchida N. Olfactory cortical neurons read out a relative time code in the olfactory bulb. Nature Neuroscience. 2013;16:949–957. doi: 10.1038/nn.3407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biology. 2019;20:296. doi: 10.1186/s13059-019-1874-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Igarashi KM, Ieki N, An M, Yamaguchi Y, Nagayama S, Kobayakawa K, Kobayakawa R, Tanifuji M, Sakano H, Chen WR, Mori K. Parallel mitral and tufted cell pathways route distinct odor information to different targets in the olfactory cortex. Journal of Neuroscience. 2012;32:7970–7985. doi: 10.1523/JNEUROSCI.0154-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Imamura F, Ito A, LaFever BJ. Subpopulations of projection neurons in the olfactory bulb. Frontiers in Neural Circuits. 2020;14:561822. doi: 10.3389/fncir.2020.561822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Inokuchi K, Imamura F, Takeuchi H, Kim R, Okuno H, Nishizumi H, Bito H, Kikusui T, Sakano H. Nrp2 is sufficient to instruct circuit formation of mitral-cells to mediate odour-induced attractive social responses. Nature Communications. 2017;8:15977. doi: 10.1038/ncomms15977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Irons DJ, Monk NA. Identifying dynamical modules from genetic regulatory systems: applications to the segment polarity network. BMC Bioinformatics. 2007;8:413. doi: 10.1186/1471-2105-8-413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Jordan R, Fukunaga I, Kollo M, Schaefer AT. Active sampling state dynamically enhances olfactory bulb odor representation. Neuron. 2018;98:1214–1228. doi: 10.1016/j.neuron.2018.05.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kapoor V, Provost AC, Agarwal P, Murthy VN. Activation of raphe nuclei triggers rapid and distinct effects on parallel olfactory bulb output channels. Nature Neuroscience. 2016;19:271–282. doi: 10.1038/nn.4219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kawasawa YI, Salzberg AC, Li M, Šestan N, Greer CA, Imamura F. RNA-seq analysis of developing olfactory bulb projection neurons. Molecular and Cellular Neuroscience. 2016;74:78–86. doi: 10.1016/j.mcn.2016.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kikuta S, Fletcher ML, Homma R, Yamasoba T, Nagayama S. Odorant response properties of individual neurons in an olfactory glomerular module. Neuron. 2013;77:1122–1135. doi: 10.1016/j.neuron.2013.01.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Kim EJ, Zhang Z, Huang L, Ito-Cole T, Jacobs MW, Juavinett AL, Senturk G, Hu M, Ku M, Ecker JR, Callaway EM. Extraction of distinct neuronal cell types from within a genetically continuous population. Neuron. 2020;107:274–282. doi: 10.1016/j.neuron.2020.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kiyokage E, Pan YZ, Shao Z, Kobayashi K, Szabo G, Yanagawa Y, Obata K, Okano H, Toida K, Puche AC, Shipley MT. Molecular identity of periglomerular and short axon cells. Journal of Neuroscience. 2010;30:1185–1196. doi: 10.1523/JNEUROSCI.3497-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Koldaeva A, Zhang C, Huang Y-P, Reinert J, Mizuno S, Sugiyama F, Takahashi S, Soliman T, Matsunami H, Fukunaga I. Generation and characterisation of a cell-type specific, inducible Cre-driver line to study olfactory processing. The Journal of Neuroscience. 2021;20:2021. doi: 10.1523/JNEUROSCI.3076-20.2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Konstantinides N, Kapuralin K, Fadil C, Barboza L, Satija R, Desplan C. Phenotypic convergence: distinct transcription factors regulate common terminal features. Cell. 2018;174:622–635. doi: 10.1016/j.cell.2018.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Kramer JW, Post MA, Brown AM, Kirsch GE. Modulation of potassium channel gating by coexpression of Kv2.1 with regulatory Kv5.1 or Kv6.1 alpha-subunits. American Journal of Physiology-Cell Physiology. 1998;274:C1501–C1510. doi: 10.1152/ajpcell.1998.274.6.C1501. [DOI] [PubMed] [Google Scholar]
  53. Lake BB, Codeluppi S, Yung YC, Gao D, Chun J, Kharchenko PV, Linnarsson S, Zhang K. A comparative strategy for single-nucleus and single-cell transcriptomes confirms accuracy in predicted cell-type expression from nuclear RNA. Scientific Reports. 2017;7:6031. doi: 10.1038/s41598-017-04426-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Liu WL, Shipley MT. Intrabulbar associational system in the rat olfactory bulb comprises cholecystokinin-containing tufted cells that synapse onto the dendrites of GABAergic granule cells. The Journal of Comparative Neurology. 1994;346:541–558. doi: 10.1002/cne.903460407. [DOI] [PubMed] [Google Scholar]
  56. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lukas M, Suyama H, Egger V. Vasopressin cells in the rodent olfactory bulb resemble Non-Bursting superficial tufted cells and are primarily inhibited upon olfactory nerve stimulation. Eneuro. 2019;6:ENEURO.0431-18.2019. doi: 10.1523/ENEURO.0431-18.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, Tirosh I, Bialas AR, Kamitaki N, Martersteck EM, Trombetta JJ, Weitz DA, Sanes JR, Shalek AK, Regev A, McCarroll SA. Highly parallel Genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015;161:1202–1214. doi: 10.1016/j.cell.2015.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Margrie TW, Schaefer AT. Theta oscillation coupled spike latencies yield computational vigour in a mammalian sensory system. The Journal of Physiology. 2003;546:363–374. doi: 10.1113/jphysiol.2002.031245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Markopoulos F, Rokni D, Gire DH, Murthy VN. Functional properties of cortical feedback projections to the olfactory bulb. Neuron. 2012;76:1175–1188. doi: 10.1016/j.neuron.2012.10.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Miyamichi K, Amat F, Moussavi F, Wang C, Wickersham I, Wall NR, Taniguchi H, Tasic B, Huang ZJ, He Z, Callaway EM, Horowitz MA, Luo L. Cortical representations of olfactory input by trans-synaptic tracing. Nature. 2011;472:191–196. doi: 10.1038/nature09714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Mori K, Kishi K, Ojima H. Distribution of dendrites of mitral, displaced mitral, tufted, and granule cells in the rabbit olfactory bulb. The Journal of Comparative Neurology. 1983;219:339–355. doi: 10.1002/cne.902190308. [DOI] [PubMed] [Google Scholar]
  63. Mori K, Sakano H. How is the olfactory map formed and interpreted in the mammalian brain? Annual Review of Neuroscience. 2011;34:467–499. doi: 10.1146/annurev-neuro-112210-112917. [DOI] [PubMed] [Google Scholar]
  64. Mouradian LE, Scott JW. Cytochrome oxidase staining marks dendritic zones of the rat olfactory bulb external plexiform layer. The Journal of Comparative Neurology. 1988;271:507–518. doi: 10.1002/cne.902710404. [DOI] [PubMed] [Google Scholar]
  65. Nagai Y, Sano H, Yokoi M. Transgenic expression of cre recombinase in mitral/tufted cells of the olfactory bulb. Genesis. 2005;43:12–16. doi: 10.1002/gene.20146. [DOI] [PubMed] [Google Scholar]
  66. Nagayama S, Enerva A, Fletcher ML, Masurkar AV, Igarashi KM, Mori K, Chen WR. Differential axonal projection of mitral and tufted cells in the mouse main olfactory system. Frontiers in Neural Circuits. 2010;4:120. doi: 10.3389/fncir.2010.00120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Nagayama S, Homma R, Imamura F. Neuronal organization of olfactory bulb circuits. Frontiers in Neural Circuits. 2014;8:98. doi: 10.3389/fncir.2014.00098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Najac M, De Saint Jan D, Reguero L, Grandes P, Charpak S. Monosynaptic and polysynaptic Feed-Forward inputs to mitral cells from olfactory sensory neurons. Journal of Neuroscience. 2011;31:8722–8729. doi: 10.1523/JNEUROSCI.0527-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Niedworok CJ, Schwarz I, Ledderose J, Giese G, Conzelmann KK, Schwarz MK. Charting monosynaptic connectivity maps by two-color light-sheet fluorescence microscopy. Cell Reports. 2012;2:1375–1386. doi: 10.1016/j.celrep.2012.10.008. [DOI] [PubMed] [Google Scholar]
  70. Orona E, Rainer EC, Scott JW. Dendritic and axonal organization of mitral and tufted cells in the rat olfactory bulb. The Journal of Comparative Neurology. 1984;226:346–356. doi: 10.1002/cne.902260305. [DOI] [PubMed] [Google Scholar]
  71. Otazu GH, Chae H, Davis MB, Albeanu DF. Cortical feedback decorrelates olfactory bulb output in awake mice. Neuron. 2015;86:1461–1477. doi: 10.1016/j.neuron.2015.05.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Padmanabhan K, Urban NN. Intrinsic biophysical diversity decorrelates neuronal firing while increasing information content. Nature Neuroscience. 2010;13:1276–1282. doi: 10.1038/nn.2630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Panzanelli P, Perazzini AZ, Fritschy JM, Sassoè-Pognetto M. Heterogeneity of gamma-aminobutyric acid type A receptors in mitral and tufted cells of the rat main olfactory bulb. Journal of Comparative Neurology. 2005;484:121–131. doi: 10.1002/cne.20440. [DOI] [PubMed] [Google Scholar]
  74. Parrish-Aungst S, Shipley MT, Erdelyi F, Szabo G, Puche AC. Quantitative analysis of neuronal diversity in the mouse olfactory bulb. The Journal of Comparative Neurology. 2007;501:825–836. doi: 10.1002/cne.21205. [DOI] [PubMed] [Google Scholar]
  75. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V. Scikit-learn: machine learning in Python. Journal of Machine Learning Research : JMLR. 2011;12:2825–2830. [Google Scholar]
  76. Phillips ME, Sachdev RNS, Willhite DC, Shepherd GM. Respiration drives network activity and modulates synaptic and circuit processing of lateral inhibition in the olfactory bulb. Journal of Neuroscience. 2012;32:85–98. doi: 10.1523/JNEUROSCI.4278-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Richard MB, Taylor SR, Greer CA. Age-induced disruption of selective olfactory bulb synaptic circuits. PNAS. 2010;107:15613–15618. doi: 10.1073/pnas.1007931107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Saha B, Hari P, Huilgol D, Tole S. Dual role for LIM-Homeodomain gene Lhx2 in the formation of the lateral olfactory tract. Journal of Neuroscience. 2007;27:2290–2297. doi: 10.1523/JNEUROSCI.5571-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Schaefer AT, Angelo K, Spors H, Margrie TW. Neuronal oscillations enhance stimulus discrimination by ensuring action potential precision. PLOS Biology. 2006;4:e163. doi: 10.1371/journal.pbio.0040163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Schwarz D, Kollo M, Bosch C, Feinauer C, Whiteley I, Margrie TW, Cutforth T, Schaefer AT. Architecture of a mammalian glomerular domain revealed by novel volume electroporation using nanoengineered microelectrodes. Nature Communications. 2018;9:183. doi: 10.1038/s41467-017-02560-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Scott JW, McBride RL, Schneider SP. The organization of projections from the olfactory bulb to the piriform cortex and olfactory tubercle in the rat. The Journal of Comparative Neurology. 1980;194:519–534. doi: 10.1002/cne.901940304. [DOI] [PubMed] [Google Scholar]
  83. Seroogy KB, Brecha N, Gall C. Distribution of cholecystokinin-like immunoreactivity in the rat main olfactory bulb. The Journal of Comparative Neurology. 1985;239:373–383. doi: 10.1002/cne.902390403. [DOI] [PubMed] [Google Scholar]
  84. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Short SM, Wachowiak M. Temporal dynamics of Inhalation-Linked activity across defined subpopulations of mouse olfactory bulb neurons imaged In Vivo. Eneuro. 2019;6:ENEURO.0189-19.2019. doi: 10.1523/ENEURO.0189-19.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Sosulski DL, Bloom ML, Cutforth T, Axel R, Datta SR. Distinct representations of olfactory information in different cortical centres. Nature. 2011;472:213–216. doi: 10.1038/nature09868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, Hao Y, Stoeckius M, Smibert P, Satija R. Comprehensive integration of Single-Cell data. Cell. 2019;177:1888–1902. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. PNAS. 2005;102:15545–15550. doi: 10.1073/pnas.0506580102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Sun X, Liu X, Starr ER, Liu S. CCKergic tufted cells differentially drive two anatomically segregated inhibitory circuits in the mouse olfactory bulb. The Journal of Neuroscience. 2020;40:6189–6206. doi: 10.1523/JNEUROSCI.0769-20.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Suo S, Zhu Q, Saadatpour A, Fei L, Guo G, Yuan GC. Revealing the critical regulators of cell identity in the mouse cell atlas. Cell Reports. 2018;25:1436–1445. doi: 10.1016/j.celrep.2018.10.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Tavakoli A, Schmaltz A, Schwarz D, Margrie TW, Schaefer AT, Kollo M. Quantitative association of anatomical and functional classes of olfactory bulb neurons. The Journal of Neuroscience. 2018;38:7204–7220. doi: 10.1523/JNEUROSCI.0303-18.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Tepe B, Hill MC, Pekarek BT, Hunt PJ, Martin TJ, Martin JF, Arenkiel BR. Single-Cell RNA-Seq of mouse olfactory bulb reveals cellular heterogeneity and Activity-Dependent molecular census of Adult-Born neurons. Cell Reports. 2018;25:2689–2703. doi: 10.1016/j.celrep.2018.11.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Tervo DG, Hwang BY, Viswanathan S, Gaj T, Lavzin M, Ritola KD, Lindo S, Michael S, Kuleshova E, Ojala D, Huang CC, Gerfen CR, Schiller J, Dudman JT, Hantman AW, Looger LL, Schaffer DV, Karpova AY. A designer AAV variant permits efficient retrograde access to projection neurons. Neuron. 2016;92:372–382. doi: 10.1016/j.neuron.2016.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. The croonian lecture The croonian lecture—La fine structure des centres nerveux. Proceedings of the Royal Society of London. 1894;55:444–468. doi: 10.1098/rspl.1894.0063. [DOI] [Google Scholar]
  95. Thieffry D. Dynamical roles of biological regulatory circuits. Briefings in Bioinformatics. 2007;8:220–225. doi: 10.1093/bib/bbm028. [DOI] [PubMed] [Google Scholar]
  96. Traag VA, Waltman L, van Eck NJ. From louvain to Leiden: guaranteeing well-connected communities. Scientific Reports. 2019;9:5233. doi: 10.1038/s41598-019-41695-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Tripathy SJ, Padmanabhan K, Gerkin RC, Urban NN. Intermediate intrinsic diversity enhances neural population coding. PNAS. 2013;110:8248–8253. doi: 10.1073/pnas.1221214110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Van de Sande B, Flerin C, Davie K, De Waegeneer M, Hulselmans G, Aibar S, Seurinck R, Saelens W, Cannoodt R, Rouchon Q, Verbeiren T, De Maeyer D, Reumers J, Saeys Y, Aerts S. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nature Protocols. 2020;15:2247–2276. doi: 10.1038/s41596-020-0336-2. [DOI] [PubMed] [Google Scholar]
  99. Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biology. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Wolf FA, Hamey FK, Plass M, Solana J, Dahlin JS, Göttgens B, Rajewsky N, Simon L, Theis FJ. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biology. 2019;20:59. doi: 10.1186/s13059-019-1663-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Wolpert L. Positional information and the spatial pattern of cellular differentiation. Journal of Theoretical Biology. 1969;25:1–47. doi: 10.1016/S0022-5193(69)80016-0. [DOI] [PubMed] [Google Scholar]
  102. Yoshihito N. Olfactory receptor multigene family in vertebrates: from the viewpoint of evolutionary genomics. Current Genomics. 2012;13:103–114. doi: 10.2174/138920212799860706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Zeisel A, Hochgerner H, Lönnerberg P, Johnsson A, Memic F, van der Zwan J, Häring M, Braun E, Borm LE, La Manno G, Codeluppi S, Furlan A, Lee K, Skene N, Harris KD, Hjerling-Leffler J, Arenas E, Ernfors P, Marklund U, Linnarsson S. Molecular architecture of the mouse nervous system. Cell. 2018;174:999–1014. doi: 10.1016/j.cell.2018.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Zhang X, Firestein S. The olfactory receptor gene superfamily of the mouse. Nature Neuroscience. 2002;5:124–133. doi: 10.1038/nn800. [DOI] [PubMed] [Google Scholar]
  105. Zhu Y, Wang N, Miller DJ, Wang Y. Convex analysis of mixtures for separating Non-negative Well-grounded sources. Scientific Reports. 2016;6:1–13. doi: 10.1038/srep38350. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Naoshige Uchida1

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The authors performed single cell RNA sequencing in combination with retrograde labeling from the anterior olfactory nucleus and the anterior piriform cortex, to reveal several distinct cell types of projection neurons in the olfactory bulb. The authors further characterized gene regulatory networks and the relationship between gene expressions and their axonal projections. This study provides foundational information and resource regarding the diversity of projection neurons in the olfactory bulb.

Decision letter after peer review:

Thank you for submitting your article "Molecular characterization of projection neuron subtypes in the mouse olfactory bulb" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Catherine Dulac as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

We would like to draw your attention to changes in our policy on revisions we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, when editors judge that a submitted work as a whole belongs in eLife but that some conclusions require a modest amount of additional new data, as they do with your paper, we are asking that the manuscript be revised to either limit claims to those supported by data in hand, or to explicitly state that the relevant conclusions require additional supporting data.

Our expectation is that the authors will eventually carry out the additional experiments and report on how they affect the relevant conclusions either in a preprint on bioRxiv or medRxiv, or if appropriate, as a Research Advance in eLife, either of which would be linked to the original paper.

Essential revisions:

Summary:

This study by Zeppilli et al. addresses the question whether the projection neurons in the olfactory bulb (OB) are different in their molecular identity. This is one of the major open questions in the olfaction field. The authors enriched the projection neuron population through retrograde tracing from the anterior olfactory nucleus (AON) and piriform cortex (PCX), and performed single nucleus RNA-Seq. They identified 3 mitral cell clusters and 5 tuft cell clusters. They further leveraged regulon analysis (Single-Cell Regulartory Network Inference and Clustering, SCENIC) and found more subtypes based on the transcription factor activities. They developed a simulation method and used bulk RNA-seq data traced from AON and PCX separately to infer that projection neurons of different molecular identity project differently to the targets (however, see Essential revision #1 below).

For the most part our understanding of these projection neurons from a molecular perspective rounds to nearly zero, and their diversity is likely critical for region-specific functions of the higher olfactory brain. The data will be a useful resource to the community, and we commend the authors on releasing the data along with interactive tools for exploring it. In addition, this paper appears to be a useful proof-of-concept for the use of single cell data to deconvolute bulk sequencing data.

Overall, all the reviewers thought that the data are presented clearly and the results are important. However, the reviewers raised several important concerns to which we hope the authors can address with additional experiments and analyses. Most importantly, the reviewers were concerned that the projection specificity predicted with simulated cells (Figure 6) is not validated with real data although projection patterns are one of the major subjects of the study. We hope that the authors can validate the results with additional experiments.

1) The AAVretro labeling strategy is supposed to enrich sequencing of OB projection neurons, which account for only a few percent of OB neurons. However, for unknown reason, only <24% of single nuclei are from projection neurons (mitral and tufted cells). The rest are all kinds including non-neuronal cells. While this does not affect the single-nucleus RNA-Seq analysis, as the authors can identify projection neurons based on known markers and only focus on those 24% cells, they also used the same AAVretro method to perform bulk RNA-Seq, which presumably has the same contamination issues. And in bulk RNA-Seq, RNAs from the other three-quarter cells cannot be distinguished from RNAs from projection neurons. No matter how sophisticated analysis they are doing, I just don't see how the authors could conclude projection patterns from two different retrograde AAV experiments given the above contamination. To make matters worse, the contamination is most likely due to AAV spread, and therefore the AON injection would have more contaminations of OB cells than PCX contaminations so these two bulk RNA-Seq samples contain (unknown) different amount of OB cells. To validate their conclusions of Figure 6, the authors need to either perform classical retrograde labeling together with RNAscope detection of cell-type-specific marker, or better yet to perform AAVretro-based single-nucleus RNA-Seq using separate AAVretro injection (perhaps a smaller volume) into AON and PCX (not sure why they injected two sites in the same animal for their main experiment). Alternatively, the authors could potentially use cell-type specific genetic labeling approaches using novel markers identified in the profiling and mapping target fields. As projection patterns are a framing question for this study, we would like to see at least one experimental verification on this.

2) The computational approach for sorting the contributions of projection neuron populations to the bulk sequencing data is novel and interesting, but not entirely convincing or well-validated. In addition to the issues discussed above, the extent to which simulated nuclei capture the diversity of the original single nucleus RNA-Seq data remains ambiguous since the simulated and actual data seem to form separate clusters in the shared UMAP space. It would be useful to see a simpler complimentary analysis of the bulk RNA-Seq data, perhaps via approaches that evaluate how much of the difference between the AON-projecting and PCX-projecting could be explained by over/under representation of certain clusters in each dataset.

3) In comparing the SCENIC data and analysis shown herein with the Seurat based analysis, there seem to be differences both at the gene level and at the cell type level. Can the authors more extensively characterize similarities and differences in these orthogonal modes of analysis? What explains the observed differences (for example, several of the marker genes shown in Figure 3 are not identified in regulons or TF networks in Figures 4 and 5)?

4) Given that SCENIC is a bioinformatic method, all of the conclusions about regulatory relationships are inferred; the authors should be careful not to assert mechanistic causality from these sorts of analyses (e.g., "continuous activity gradient of TFs is transformed in a non-linear manner into distinct transcriptome differences between mitral and tufted cell types"), which will ultimately require additional experiment. Some tempering of language is likely called for here.

5) A previous single cell RNA-Seq study (Tepe et al., 2018, Cell Report) from OB has identified 3 projection neuron types; those authors' approach resulted in significant labeling of non-projection neurons, about 78% of the data. Since the current method also does not strictly label projection neurons, the authors identified projection neurons from the data using known molecular markers. Therefore, the current method only offered moderate advantage over the previous study. How are the projection neuron cell types correlate with the previously identified 3 types? We note that the quality of the single-cell transcriptomes appears to be superior than the Tepe et al. based on genes detected per cell. Incidentally, it will be useful to plot parameters of snRNA-seq as supplement figure panels rather than bury them in Methods.

6) What is the rationale of using Seurat 0.3 resolution rather than higher or lower? Did the authors try higher or lower resolutions? What would happen to cell-type-specific marker expression? To the regulon analysis? It will be more reassuring, if variations of cluster resolution does not change the major conclusions.

7) It is unclear how the transcription factors identified in the regulon analysis specify these cell types. Their expression patterns do not seem to be cell type specific. Some of them are generic transcription factors and regulate functions in other cells. Examples are Sox10, Sp1, Fos, Jun, Egr1. The authors should examine the hub transcription factors using in situ or at least plot their expressions in their own dataset to make sure they do express in the projection neurons. In addition, the previously identified transcription factors, such as Tbr1, Tbr2 and Tbx21 genes are not in the analysis. The authors should discuss the relationship of the transcription factors identified here and the previously known projection neuron transcription factors.

8) The sub clustering of projection neurons from single nucleus RNA-Seq data is elegant and well-validated with convincing fluorescent in situ hybridization (FISH) data. However, some sort of quantification of overall labeling and overlap between different FISH probes would be useful. Related, it would be broadly beneficial to the field to provide such quantification, particularly for choosing cell-type specific markers in the future.

9) Although compelling, the authors may be somewhat stretching the extent to which the gene regulatory network (GRN) clustering recapitulates the transcriptomic clustering. To strengthen this, it may be useful to include some sort of quality measure for GRN clustering to evaluate if class changes are truly meaningful. This might be achieved by clustering based only on the GRN analysis which is not constrained by the originally identified cell classes. This seems important given that the authors use the rationale that there is heterogeneity of active GRNs within transcriptomically identified clusters to justify the sub clustering, and point out that gradual transitions may exist in active GRNs between clusters. This seems to be a fundamental distinction – whether the GRNs functionally define cell types or whether they are a separate axis along which to evaluate cell identity.

10) The methods and description of the simulated nuclei approach are somewhat under-described. There are multiple manipulations to the bulk sequencing gene expression (expanding the bulk sequencing data into many simulated nuclei by resampling the counts for regulon/target genes, transforming the simulated counts into Pearson residuals before performing PCA and multiple rounds of classification), but the rationale for and the effects of each of these transformations are relatively unclear. Comparisons to existing algorithms that can deconvolve bulk RNA sequencing data (e.g. Frishberg 2019 Nat Methods, Newman 2019 Nat Biotechnology), would also be useful.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for submitting your article "Molecular characterization of projection neuron subtypes in the mouse olfactory bulb" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Catherine Dulac as the Senior Editor. The reviewers have opted to remain anonymous.

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential Revisions:

All the reviewers commend the authors' thorough and thoughtful revisions. There are, however, some points that we would like you to address before publication of this work. You can find these points in the individual reviewers' comments. We expect that modifications of text will be sufficient to address these points. We hope that you can address these points within two weeks.

Reviewer #1 (Recommendations for the authors):

The authors have addressed many critiques of the reviewers quite thoroughly. Furthermore, they have performed new single-nucleus RNA-sequencing of mitral cells based on their projections into specific sites. The data provide some support of their previous predictions that specific mitral types have preference for specific target sites.

While I applaud the authors' effort to perform additional single nucleus RNAseq of mitral cells based on their projection to either PCx or AON, the numbers of recovered cells are quite small-they did not report any data on AON-projecting ones because the cell number is too small. Of the 57 cells from the sn-PCx data as projection neurons 44 were classified as M2 cells while 6 were classified as M1 cells. Thus, the heading "Targeted snRNA-seq validates predictions of selective connectivity for molecularly defined mitral cell types" is an overstatement: either the prediction is not completely accurate, or the selectivity is not absolute, or both. This heading and similar statements throughout the text need to be toned down.

Reviewer #2 (Recommendations for the authors):

This a thoughtful response to all previous reviewers with extensive new data analyses, figure modifications, and clarifications to the text. This is a vastly improved manuscript, and a timely contribution to the field. I commend the authors on a nice revision and fully endorse publication at eLife.

Reviewer #3 (Recommendations for the authors):

I commend the authors for performing a thorough revision of the paper that addressed our main concerns, especially for the validation data shown in Figure 7 and for the additional details regarding both clustering and the imputation of single nucleus transcriptomes. This paper will serve as an important catalog of molecular diversity in the bulb. I have one experimental request and two minor comments. The experimental request was made in our initial review but was not addressed – unless I missed it (and I may indeed have done so), the authors did not validate the expression of their three key hub genes in MT cells (Taf1, Bclaf1, Pbx3) by either in situ or by showing a UMAP plot describing their expression in the relevant cell types – this is really critical for believing the analysis shown in Figure 5 and should be provided. The two minor comments are 1. maybe consider not using the word "direct" in the abstract to describe OSN inputs onto projection neurons, given the complexity and 2. Line 1276 should define "PCC" – it took me a second to understand what that was on like 1277.

eLife. 2021 Jul 22;10:e65445. doi: 10.7554/eLife.65445.sa2

Author response


Essential revisions:

Summary:

This study by Zeppilli et al. addresses the question whether the projection neurons in the olfactory bulb (OB) are different in their molecular identity. This is one of the major open questions in the olfaction field. The authors enriched the projection neuron population through retrograde tracing from the anterior olfactory nucleus (AON) and piriform cortex (PCX), and performed single nucleus RNA-Seq. They identified 3 mitral cell clusters and 5 tuft cell clusters. They further leveraged regulon analysis (Single-Cell Regulartory Network Inference and Clustering, SCENIC) and found more subtypes based on the transcription factor activities. They developed a simulation method and used bulk RNA-seq data traced from AON and PCX separately to infer that projection neurons of different molecular identity project differently to the targets (however, see Essential revision #1 below).

For the most part our understanding of these projection neurons from a molecular perspective rounds to nearly zero, and their diversity is likely critical for region-specific functions of the higher olfactory brain. The data will be a useful resource to the community, and we commend the authors on releasing the data along with interactive tools for exploring it. In addition, this paper appears to be a useful proof-of-concept for the use of single cell data to deconvolute bulk sequencing data.

Overall, all the reviewers thought that the data are presented clearly and the results are important. However, the reviewers raised several important concerns to which we hope the authors can address with additional experiments and analyses. Most importantly, the reviewers were concerned that the projection specificity predicted with simulated cells (Figure 6) is not validated with real data although projection patterns are one of the major subjects of the study. We hope that the authors can validate the results with additional experiments.

Thank you very much for the support and the important and helpful suggestions, criticism and comments. As a consequence, we have re-done both the Seurat and pySCENIC analyses and regenerated the relevant parts of manuscript Figures 2-6. This also allowed us to iron out a few minor inconsistencies in the analysis, which have altered some results in a minor way but have not changed any of the conclusions. Regarding the small changes in results, these are essentially restricted to the pySCENIC analysis (Figures 4 and 5). One should think of, for example, some regulons no longer meeting the 0.8 confidence cutoff, whilst others now do. In other words, some regulons will be found missing and some newcomers are seen.

1) The AAVretro labeling strategy is supposed to enrich sequencing of OB projection neurons, which account for only a few percent of OB neurons. However, for unknown reason, only <24% of single nuclei are from projection neurons (mitral and tufted cells). The rest are all kinds including non-neuronal cells. While this does not affect the single-nucleus RNA-Seq analysis, as the authors can identify projection neurons based on known markers and only focus on those 24% cells, they also used the same AAVretro method to perform bulk RNA-Seq, which presumably has the same contamination issues. And in bulk RNA-Seq, RNAs from the other three-quarter cells cannot be distinguished from RNAs from projection neurons. No matter how sophisticated analysis they are doing, I just don't see how the authors could conclude projection patterns from two different retrograde AAV experiments given the above contamination. To make matters worse, the contamination is most likely due to AAV spread, and therefore the AON injection would have more contaminations of OB cells than PCX contaminations so these two bulk RNA-Seq samples contain (unknown) different amount of OB cells. To validate their conclusions of Figure 6, the authors need to either perform classical retrograde labeling together with RNAscope detection of cell-type-specific marker, or better yet to perform AAVretro-based single-nucleus RNA-Seq using separate AAVretro injection (perhaps a smaller volume) into AON and PCX (not sure why they injected two sites in the same animal for their main experiment). Alternatively, the authors could potentially use cell-type specific genetic labeling approaches using novel markers identified in the profiling and mapping target fields. As projection patterns are a framing question for this study, we would like to see at least one experimental verification on this.

Thank you for this important suggestion. As suggested by the reviewers, we have now performed small volume AAV-retro injections, separately into the AON and the pPCx (Author response image 1). Analysis of these new experiments validate our predictions, based on regulon-based simulations, that mitral cell subtypes project to distinct target areas.

Author response image 1. Schematic representation of experimental design and Coronal section.

Author response image 1.

(A) Schematic representation of experimental design. Top: after injection of rAAVretro-CAGH2B-GFP into PCx, single nuclei were dissociated from 1 mouse and sorted using Fluorescence-activated Nuclei Sorting (FANS). The population of nuclei is selected based on GFP and DRAQ5 (far-red fluorescent DNA dye). Sorted nuclei were sequenced using 10X single-nucleus RNA-seq and integrated with the main dataset sn-R1/R2/R3. Bottom: Same as the top figure but injection was performed into AON. (B) Coronal section showing GFP expression in the injection points for PCx (top) and AON (bottom) after injection of rAAVretro-CAG-H2B-GFP. Neurotrace counterstain in blue. Scale 50μm. PCx: Piriform Cortex; AON: Anterior Olfactory Nucleus.

In detail, we injected a retrogradely transported Adeno-Associated Virus expressing nuclear GFP (rAAV-retro-CAG-H2B-GFP (Tervo et al., 2016)) into the AON (n=1 animal) or in the pPCx (n=1 animal). Injection coordinates were the same as for the main experiment, we only reduced the volume of each injection site and eliminated the most lateral injection point of AON coordinates to avoid as much LOT contamination as possible. We dissected the olfactory bulbs of the two AON- and pPCx-injected mice, the bulbs were dissociated into single nuclei and enriched for GFP expression using FANS (ipsilateral bulb for enrichment, contralateral bulb as control). We then performed snRNA-seq using 10X Genomics technology. The remaining part of the brain was postfixed in 4 % PFA at 4 °C and used for histological validation of the injection points.

We have now included this new data as Figure 7 and describe the results in detail in the new revised Results section (page 17, line 424-454). We have also made these new datasets available through GEO under the accession number GSE176350.

2) The computational approach for sorting the contributions of projection neuron populations to the bulk sequencing data is novel and interesting, but not entirely convincing or well-validated. In addition to the issues discussed above, the extent to which simulated nuclei capture the diversity of the original single nucleus RNA-Seq data remains ambiguous since the simulated and actual data seem to form separate clusters in the shared UMAP space. It would be useful to see a simpler complimentary analysis of the bulk RNA-Seq data, perhaps via approaches that evaluate how much of the difference between the AON-projecting and PCX-projecting could be explained by over/under representation of certain clusters in each dataset.

Thank you very much for this comment. As discussed above, we now have independently validated our computational approach with new, targeted snRNA-seq experiments. The distribution of cell types observed in this new data corroborates the predictions made according to the simulation pipeline. We have considered alternate computational approaches (see also our reply to point 10 below), however, we are not convinced that these would resolve the ambiguities mentioned above, considering that most other methods (such as Zhu et al., Sci Rep 6:38350, Cobos et al. Bioinformatics, 34:1969) tend to take less information into account. We are, however, convinced that the new experimental data provides a significantly stronger validation.

To assign the cell type identity of new snRNA-seq cells, we first identified projection neurons by transforming the expression matrix into the space of 30 principal components defined on the original, larger snRNA-seq dataset. 1000 Linear discriminant analysis (LDA) classifiers (Author response image 2), each trained on 75% of the original snRNA-seq dataset and validated on the other 25% (mean jaccard index = 0.96) designated 57 out of 62 cells from the sn-PCx dataset as projection neurons and 14 out of 118 cells from the sn-AON dataset (consistent with the sn-R1/R2/R3 data presented originally that contained “contaminations” from non-projection neurons as well). Given the paucity of sn-AON projection neurons, we chose to focus on only sn-PCx neurons for further analysis.

Author response image 2. Each row represents one cell in the new, targeted snRNA-seq data.

Author response image 2.

Each column represents one of 1000 linear discriminant analysis (LDA) classifiers trained to discriminate projection neurons from other cell types (analogously to the analysis in the original manuscript Figure 7B). The color indicates the prediction made by each classifier (dark: projection neuron, light: rest).

To assign cell type to these projection neurons, these projection neurons were then transformed into a low-dimensional space defined by their expression of the 86 regulons from our SCENIC analysis on the original sn-R1/R2/R3 projection neurons. These regulons create a lowdimensional space in which to compare this new, targeted sn-PCx dataset to our original, larger sn-R1/R2/R3 dataset. To quantify this comparison, we used LDA classifiers, trained on the original mitral cells, to predict the cell type of these new projection neurons. Each of 1000 LDA classifiers was trained on 75% of the original data and validated on the other 25% to predict cell type identity based on regulon activity scores. These classifiers were then applied to the regulon activity scores of the new projection neurons (Author response image 3). Out of 57 sn-PCx mitral cells, only 6 were classified as M1 by at least 80% of classifiers. This scarcity of M1 neurons would be very unlikely if M1 cells were present in the sn-PCx mitral population at the same rate as the general mitral population observed in our original sn-R1/R2/R3 sample, as modeled with a binomial distribution (p=5.57e-05). Therefore, the new data suggest that M1 mitral cells preferentially project to the AON, as predicted by the analyses using simulated transcriptomes. Additionally, 44 of 57 sn-PCx mitral cells were classified as M2 cells by at least 80% of classifiers. Were M2 cells present in the sn-PCx population at the same rate as in the general mitral population, observing this many or more M2 cells would be unlikely, as modeled with a binomial distribution (p=0.001). Therefore, the new data suggest that M2 mitral cells preferentially project to the PCx, as predicted by the analyses using simulated transcriptomes.

Author response image 3. Linear discriminant analysis (LDA) classifiers were trained on the original, larger snRNA-seq dataset, then used to predict the cell type identity of projection neurons in a new, targeted snRNA-seq dataset.

Author response image 3.

Each row represents one projection neuron in the new, targeted sn-PCx data. Each column represents one of 1000 linear discriminant analysis (LDA) classifiers trained to predict cell type identity based on regulon expression. Color indicates the predicted cell type identity (orange: M1, red: M2, yellow: M3).

We now include this new data, analysis and description as Figure 7 in the revised manuscript. Please also see response to point (10) below.

3) In comparing the SCENIC data and analysis shown herein with the Seurat based analysis, there seem to be differences both at the gene level and at the cell type level. Can the authors more extensively characterize similarities and differences in these orthogonal modes of analysis? What explains the observed differences (for example, several of the marker genes shown in Figure 3 are not identified in regulons or TF networks in Figures 4 and 5)?

As suggested by the reviewers, we have performed a more comprehensive comparison between the SCENIC and Seurat-based analyses.

First, we revised the mapping of Seurat’s clusters on SCENIC’s and vice versa . We now compare in a straightforward manner Louvain clustering in Seurat’s PCAspace against Louvain clustering in SCENIC’s regulon activity space. In summary, even if the two analysis modes are orthogonal, neurons considered similar according to one method remain close to each other (i.e., are considered similar) also in the other method.

The comparison resulted in a new Figure 4 —figure supplement 1 in the revised manuscript. One can appreciate that the mapping of regulon-based clusters on Seurat’s UMAP shows both consistent clusters (ET4), grouped clusters (M1 and M3) and gradient-like overlapping clusters (mostly ET1, T1, M2).

While overall comparable, the mapping between Seurat and SCENIC clusters also reveals some differences (Author response image 4; Adjusted Rand Index (ARI) = 0.326, in the revised manuscript as the new Figure 4 —figure supplement 1C). When we do the reverse exercise of mapping Seurat’s clusters onto the UMAP computed for pySCENIC, we find that each of Seurat’s clusters occupies a distinct region of the UMAP. This means clustering results have remained consistent, even if the Louvain algorithm in SCENIC-space partitions the data differently (in the revised manuscript as the new Figure 4 —figure supplement 1D-F).

Author response image 4. Seurat marker genes identified as regulons in pySCENIC.

Author response image 4.

Columns are cell type, marker gene, and the marker gene as regulon with its max confidence cutoff. Any regulon with a confidence cutoff <0.3 is ignored.

Second, we now studied how marker genes identified in the Seurat analysis manifest in the SCENIC analysis (see also Comment 7). We have taken marker genes from manuscript Figure 3 and checked at which confidence level they were present as regulons and/or as target genes. The tables below provide a simplified overview, we have incorporated a regulon module-oriented version in Figure 4 —figure supplement 2 of the revised manuscript. In addition, we now provide a supplementary figure to Figure 5A (Figure 5 —figure supplement 1) to show what the network looks like with marker genes added and highlighted by confidence level and highlight the new Figure 5—figure supplement 2 describing differentially active TF networks on p. 14 (l. 356).

Author response image 5. Presence of marker genes identified through Seurat as target genes in regulons of SCENIC.

Author response image 5.

Columns from left to right are: cell type; marker gene; presence in regulon(s) with corresponding confidence cutoffs. Any regulon with a confidence cutoff <0.4 is ignored.

The third and final aspect of our comparison between Seurat and SCENIC refers to the regulon gradients that we loosely identified in the original version of the manuscript (original Figure 4F).

These can now be interpreted as the mismatch in the two analysis approaches. We refer to Comment 32 for a more in-depth treatment of this topic.

The full comparison of the two orthogonal methods, including the new analysis, is part of the new Figure 4, Figure 4 —figure supplements 1 and 3 and discussed on page 11 of the revised manuscript.

4) Given that SCENIC is a bioinformatic method, all of the conclusions about regulatory relationships are inferred; the authors should be careful not to assert mechanistic causality from these sorts of analyses (e.g., "continuous activity gradient of TFs is transformed in a non-linear manner into distinct transcriptome differences between mitral and tufted cell types"), which will ultimately require additional experiment. Some tempering of language is likely called for here.

The reviewers raise an important point here and we are glad to be given the opportunity to avoid such confusion. Specifically, they refer to a phrase in the Results section that is better placed in the Discussion section. Indeed, we now discuss gradients on p. 22, l. 550, where we offer a potential explanation of how a gradient of module activity (average activity of group of TFs and their target genes) may result in distinct phenotypic outcomes, namely distinct mitral and tufted cell types. We did not intend to assert mechanistic causality from our bioinformatic analysis, but to offer a hypothesis that can direct future experiments.

In addition, we kindly refer to comment 32 for an in-depth revision of the quantification of activity gradients.

5) A previous single cell RNA-Seq study (Tepe et al., 2018, Cell Report) from OB has identified 3 projection neuron types; those authors' approach resulted in significant labeling of non-projection neurons, about 78% of the data. Since the current method also does not strictly label projection neurons, the authors identified projection neurons from the data using known molecular markers. Therefore, the current method only offered moderate advantage over the previous study. How are the projection neuron cell types correlate with the previously identified 3 types? We note that the quality of the single-cell transcriptomes appears to be superior than the Tepe et al. based on genes detected per cell. Incidentally, it will be useful to plot parameters of snRNA-seq as supplement figure panels rather than bury them in Methods.

Tepe et al. identified 3 clusters of putative mitral and tufted cell projection neurons, named M/TC1, 2, and 3. We have performed an integration between the two data sets and highlighted in colors our sub clustering group. While difficult to directly compare single cell and single nucleus RNAseq datasets, it is apparent from the integration and QC plots that we were able to enrich significantly for projection neurons and therefore able to identify different subtypes with a finer-grained resolution (Author response image 6 and Author response image 7).

Author response image 6. Integration of nuclei from R1/R2/R3 snRNA-seq with cells from Tepe et al.

Author response image 6.

2018 dataset.(A,B) Integration of WT replicates from the Tepe et al. study with the integrated R1/R2/R3 snRNAseq dataset of the present study. Count data for samples WT1 and WT2 were extracted from file GSE121891_OB_6_runs.raw.dge.csv.gz. All WT1 and WT2 cells listed in the GSE121891_OB_metaData_seurat.csv.gz metadata file were used for the integration with the QCfiltered nuclei from the R1/R2/R3 snRNAseq dataset. Data integration between the two full datasets and label transfer of only projection neuron clusters (Figure 2D in the manuscript) to both full datasets were performed using the Seurat 3 R-package. MC/TC clusters are color-coded as indicated.(C,D) Quantification of cells/nuclei labelled as MC/TC clusters in the two datasets.

Author response image 7. On the left is shown the number of genes per cell in the Tepe et al.

Author response image 7.

, 2018 dataset while on the right the equivalent QC parameter (gene/nucleus) in our R1/R2/R3 snRNAseq dataset.

6) What is the rationale of using Seurat 0.3 resolution rather than higher or lower? Did the authors try higher or lower resolutions? What would happen to cell-type-specific marker expression? To the regulon analysis? It will be more reassuring, if variations of cluster resolution does not change the major conclusions.

Author response image 8. UMAPs for different numbers of neighbors (nb, rows) and different Louvain resolutions (res, columns).

Author response image 8.

Neighbors are taken at 25, 50 and 100; resolutions at 0.2, 0.3, 0.5, and 0.7. We recomputed the PCA space of the projection neuron subcluster and consequently the displayed clustering is slightly different from the one in the original manuscript (See also comment 38). ‘nc’ stands for number of detected clusters, ‘ari’ is the adjusted rand index in comparison to the clustering of the original manuscript.

Thanks for pointing out this source of confusion. We have now repeated the clustering for a variety of different parameters (different Seurat resolutions as well as number of neighbors; Author response image 8). Our original clustering parameters were tuned to get as close as possible to biologically-relevant cell types. We initially looked at cross-correlation matrices for distinct resolution parameters, that were computed on the top highly differentially expressed genes between the clusters. We then looked for specific/unique marker genes for each cluster to make sure that at least a few marker genes could differentiate one cluster from another. Finally, we selected the specific marker genes for in situ validation that corroborated our first-level clustering. We are confident that here we are describing the main classification of mitral and tufted cells, in line with the projection-neuronconnectivity axis as well. Our main groups could be further split into subtypes based on additional biological axes taken into consideration, nonetheless our classification will serve as a general roadmap to better investigate functional differences between main subclasses projecting to different areas of the brain.

Finally, cell types as defined by Seurat clustering influence regulon analysis in the sense that we decided to remove the PG cluster for the pySCENIC analysis (i.e., we wanted to only study M/T neurons). As the PG cluster is very stable across neighborhood and resolution changes (see central cluster in UMAPs of Author response image 8) this influence is “constant” and does not affect the major conclusions of our study. We now highlight the resolution parameters on p11 as well as in the methods section (p 27) of the revised manuscript. We also now clarify our use of the term “type” and “subtype” in the revised manuscript in Figure 4.

7) It is unclear how the transcription factors identified in the regulon analysis specify these cell types. Their expression patterns do not seem to be cell type specific. Some of them are generic transcription factors and regulate functions in other cells. Examples are Sox10, Sp1, Fos, Jun, Egr1. The authors should examine the hub transcription factors using in situ or at least plot their expressions in their own dataset to make sure they do express in the projection neurons. In addition, the previously identified transcription factors, such as Tbr1, Tbr2 and Tbx21 genes are not in the analysis. The authors should discuss the relationship of the transcription factors identified here and the previously known projection neuron transcription factors.

We apologize for not making the rationale behind the regulon analysis clearer in the original manuscript. Regulon analysis does not solely produce a list of cell-type specific transcription factors. It will find both general and specific factors. Indeed, precisely to distinguish between general and specific factors in the context of M/T cell types, we highlighted in Figure 5 cell-type specific regulons. To make our analysis more transparent, we now add a supplementary figure to Figure 5 (Figure 5 figure supplement 1A in the revised manuscript) that shows which regulons are found to be specific for each cell type.

Author response image 9. UMAPs of the reviewer’s requested TFs: Sox10, Sp1, Fos, Jun, Egr1.

Author response image 9.

All TFs, with the exception of Sox10, are expressed in the projection neurons. We note that Sox10 is sparsely expressed and was of minor importance in both the original manuscript and the current, revised one. Raw expression levels, clipped at 8, are shown.

Finally, in addition to the marker genes identified in the Seurat-style analysis, we have made a list of well-known developmental genes from the literature and analyzed their presence in SCENIC regulons, either as transcription factor or as target gene. The results are summarized in Author response image 10. below. All well-known TFs appear in our analysis, though not always at the sufficient confidence level of 0.8; some of them are identified as having regulatory roles, others are simply recognized as target genes. We now comment specifically on confidence levels in the corresponding figure legends (e.g. Figure 4 —figure supplement 2, Figure 5—figure supplement 2) of the revised manuscript.

Author response image 10. Presence of genes known for their role during OB development in regulons of SCENIC.

Author response image 10.

In the left panel, columns are: developmental gene; presence as target gene in given regulon with max confidence cutoffs. Any regulon with a confidence cutoff <0.5 is ignored. In the right panel: developmental gene; found as regulon at max confidence cutoff. Any regulon found with a cutoff <0.4 is ignored.

8) The sub clustering of projection neurons from single nucleus RNA-Seq data is elegant and well-validated with convincing fluorescent in situ hybridization (FISH) data. However, some sort of quantification of overall labeling and overlap between different FISH probes would be useful. Related, it would be broadly beneficial to the field to provide such quantification, particularly for choosing cell-type specific markers in the future.

Thank you for this suggestion. We have now quantified the overlap of the expression of marker genes detected by smFISH in mitral and tufted cell types. We report the results in the revised Figure 3 —figure supplements 1R and 2L. For each image, we first counted the number of cells positive for the less abundant probe (probe A), on average 60 cells for each probe. We then counted amongst probe A-positive cells how many cells also expressed the second probe (probe B). We report overlap as the percentage of probe Apositive cells also expressing probe B. For mitral cell markers we only considered cells along the mitral cell and internal plexiform layers. For tufted cell markers we only considered cells in the external plexiform and glomerular layers. We considered a cell positive for a given smFISH probe if more than 5 dots/cell were present, and if the dots were located within the nucleus and perinuclear compartment of the cell, visualized by DAPI counterstain.

9) Although compelling, the authors may be somewhat stretching the extent to which the gene regulatory network (GRN) clustering recapitulates the transcriptomic clustering. To strengthen this, it may be useful to include some sort of quality measure for GRN clustering to evaluate if class changes are truly meaningful. This might be achieved by clustering based only on the GRN analysis which is not constrained by the originally identified cell classes. This seems important given that the authors use the rationale that there is heterogeneity of active GRNs within transcriptomically identified clusters to justify the sub clustering, and point out that gradual transitions may exist in active GRNs between clusters. This seems to be a fundamental distinction – whether the GRNs functionally define cell types or whether they are a separate axis along which to evaluate cell identity.

Thank you for pointing out this source of confusion. Triggered by this suggestion, we have substantially revised the comparison between transcriptomic and GRN clustering and we refer to comment 3 for the detailed answer. Indeed we have followed the advice of the reviewer and performed Louvain clustering in both transcriptomic and GRN space, and then compared these clusters. Summarizing, results are consistent: cells that are close to each other (i.e. similar) in one space remain similar in the other. Moreover, the mismatches between transcriptomic and GRN clusters occur where we had previously identified heterogeneity and gradual transitions (see Comment 32 for details on gradients).

Regarding the last statement, whether GRNs define cell types or are a separate axis along which one may evaluate cell identity, a mixed picture emerges. Some of our cell types are rather welldefined in both transcriptomic and GRN space, e.g. ET4 and ET2. Contrastingly, while the cell types ET1, T1, and M2 are distinct in transcriptomic space, in GRN space they form a single heterogeneous collective with gradual transitions (see Comment 3, Figure 3.1). On top of that, one may even argue that in GRN space ET1 splits into two regions, one captured by ET4 and the other by T1.

If we speculate on how a GRN is translated into distinct phenotypic outcomes in the context of M/T projection neurons, we might propose (i) that ET1 is a cell type originating from two distinct GRNs, and that (ii) T1 and M2 translate gradients into distinct cell types akin to Wolpert’s French flag (Wolpert 1969, J. Theor. Biol.).

We now discuss this new analysis and its implications on p. 11 l. 274-281 and p. 12 /13 of the revised manuscript, relating to the Figure 4—figure supplement 1.

10) The methods and description of the simulated nuclei approach are somewhat under-described. There are multiple manipulations to the bulk sequencing gene expression (expanding the bulk sequencing data into many simulated nuclei by resampling the counts for regulon/target genes, transforming the simulated counts into Pearson residuals before performing PCA and multiple rounds of classification), but the rationale for and the effects of each of these transformations are relatively unclear. Comparisons to existing algorithms that can deconvolve bulk RNA sequencing data (e.g. Frishberg 2019 Nat Methods, Newman 2019 Nat Biotechnology), would also be useful.

The simulated nuclei approach described here is similar to existing deconvolution methods in its combined use of bulk RNA-seq and scRNA-seq ( e.g. Frishberg Nat Meth 2019). However, our method is unique in its deployment of outside biological information about transcription factor binding sites and gene regulatory networks to harness the relationships between genes to aid the deconvolution. The Methods section has been updated (p. 29-30 in the revised manuscript) with more detailed explanation, relevant portions of which we present below:

“Integration of single nucleus and bulk RNA deep sequencing data using regulon-based simulations

Nuclei were simulated from bulk RNA deep sequencing data using a weighted random sampling of regulons with replacement. A regulon’s relative weight corresponded to the prevalence of its transcription factor in the given bulk RNA-seq sample. Each time a regulon was selected, the counts for its transcription factor and all its target genes increased by one. […]Simulated nuclei predicted to be projection neurons by all 1000 classifiers were designated as putative simulated projection neurons and selected for further analysis. Similarly, these putative simulated projection neurons were integrated with snRNA-seq mitral cells using sctransform. 1000 LDA classifiers were trained to classify simulated nuclei as AON-projecting or PCx-projecting based on values for the 30 top principal components from the sctransform integration. Each classifier was trained on 75% of the simulated projection neurons and tested on the other 25%, with accuracy evaluated using the Jaccard index. Each classifier was then applied to snRNA-seq mitral cells.”

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Essential Revisions:

All the reviewers commend the authors' thorough and thoughtful revisions. There are, however, some points that we would like you to address before publication of this work. You can find these points in the individual reviewers' comments. We expect that modifications of text will be sufficient to address these points. We hope that you can address these points within two weeks.

Reviewer #1 (Recommendations for the authors):

The authors have addressed many critiques of the reviewers quite thoroughly. Furthermore, they have performed new single-nucleus RNA-sequencing of mitral cells based on their projections into specific sites. The data provide some support of their previous predictions that specific mitral types have preference for specific target sites.

Thank you very much for the support and encouragement.

While I applaud the authors' effort to perform additional single nucleus RNAseq of mitral cells based on their projection to either PCx or AON, the numbers of recovered cells are quite small-they did not report any data on AON-projecting ones because the cell number is too small. Of the 57 cells from the sn-PCx data as projection neurons 44 were classified as M2 cells while 6 were classified as M1 cells. Thus, the heading "Targeted snRNA-seq validates predictions of selective connectivity for molecularly defined mitral cell types" is an overstatement: either the prediction is not completely accurate, or the selectivity is not absolute, or both. This heading and similar statements throughout the text need to be toned down.

Thank you for this suggestion. We have now toned down the corresponding heading on p17 of the revised manuscript (to “preferential connectivity”) as well as abstract, and p19 and p21 of the discussion to address this point.

Reviewer #2 (Recommendations for the authors):

This a thoughtful response to all previous reviewers with extensive new data analyses, figure modifications, and clarifications to the text. This is a vastly improved manuscript, and a timely contribution to the field. I commend the authors on a nice revision and fully endorse publication at eLife.

Thank you for your support and encouragement.

Reviewer #3 (Recommendations for the authors):

I commend the authors for performing a thorough revision of the paper that addressed our main concerns, especially for the validation data shown in Figure 7 and for the additional details regarding both clustering and the imputation of single nucleus transcriptomes. This paper will serve as an important catalog of molecular diversity in the bulb. I have one experimental request.

Thank you for your suggestions and support.

The experimental request was made in our initial review but was not addressed – unless I missed it (and I may indeed have done so), the authors did not validate the expression of their three key hub genes in MT cells (Taf1, Bclaf1, Pbx3) by either in situ or by showing a UMAP plot describing their expression in the relevant cell types – this is really critical for believing the analysis shown in Figure 5 and should be provided.

Apologies for not more clearly addressing this point in the revision. We now provide UMAP plots as suggested in the revised Figure 5 —figure supplement 1 and describe these on p14 and discuss them on p21.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSE162654
    2. Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. RNA deep sequencing data. NCBI Gene Expression Omnibus. GSE162655
    3. Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2021. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSM5363097 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Transparent reporting form

    Data Availability Statement

    Raw single nucleus RNA sequencing data (large sn-R1/R2/R3 and targeted sn-PCx datasets) have been deposited in Gene Expression Omnibus (GEO) under the accession numbers GSE162654 and GSM5363097, respectively. Bulk RNA deep sequencing data has been deposited in GEO under the accession number GSE162655. The R and Python analysis scripts developed for this paper are available at the GitLab links https://gitlab.com/fleischmann-lab/papers/molecular-characterization-of-projection-neuron-subtypes-in-the-mouse-olfactory-bulb (copy archived at https://archive.softwareheritage.org/swh:1:rev:4bbee4380a84756ea0e6532f5603b2d99c8e62fa) and https://gitlab.inria.fr/acrombac/projection-neurons-mouse-olfactory-bulb (copy archived at https://archive.softwareheritage.org/swh:1:rev:b9d253e7885124d810b52a0cd511bf8032b5efa5). Extensive computational tools for additional in-depth exploration of our data sets are available through our website: https://biologic.crick.ac.uk/OB_projection_neurons.

    The following datasets were generated:

    Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSE162654

    Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2020. RNA deep sequencing data. NCBI Gene Expression Omnibus. GSE162655

    Zeppilli S, Ackels T, Attey R, Klimpert N, Ritola KD, Boeing S, Crombach A, Schaefer AT, Fleischmann A. 2021. single nucleus RNA sequencing data. NCBI Gene Expression Omnibus. GSM5363097


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES