Summary
Animal bodies are composed of cell types with unique expression programs that implement their distinct locations, shapes, structures, and functions. Based on these properties, cell types assemble into specific tissues and organs. To systematically explore the link between cell-type-specific gene expression and morphology, we registered an expression atlas to a whole-body electron microscopy volume of the nereid Platynereis dumerilii. Automated segmentation of cells and nuclei identifies major cell classes and establishes a link between gene activation, chromatin topography, and nuclear size. Clustering of segmented cells according to gene expression reveals spatially coherent tissues. In the brain, genetically defined groups of neurons match ganglionic nuclei with coherent projections. Besides interneurons, we uncover sensory-neurosecretory cells in the nereid mushroom bodies, which thus qualify as sensory organs. They furthermore resemble the vertebrate telencephalon by molecular anatomy. We provide an integrated browser as a Fiji plugin for remote exploration of all available multimodal datasets.
Keywords: volume electron microscopy, image registration, automatic segmentation, gene expression atlas, Platynereis dumerilii, cell types, multimodal data integration, machine learning, mushroom bodies, telencephalon
Graphical abstract
Highlights
-
•
A cellular atlas integrates gene expression and ultrastructure for an entire annelid
-
•
Morphometry of all segmented cells, nuclei, and chromatin categorizes cell classes
-
•
Molecular anatomy and projectome of head ganglionic nuclei and mushroom bodies
-
•
An open-source browser for multimodal big image data exploration and analysis
A framework for integrating cellular-resolution gene expression and cell morphological information at full-organism scale is provided for the marine annelid Platynereis dumerilii
Introduction
Cells are the basic units of life. In multicellular organisms, distinct genes are expressed in different cells, producing individual traits that define cell types (Arendt et al., 2016). Deciphering how genotype is decoded into cellular phenotype is thus critical to understand the structure and function of an entire body. Toward this goal, we need to establish the link between expression profiles and cellular morphologies. This requires techniques that permit the integration of genetic and phenotypic information for all cells of the body. On one hand, volume electron microscopy (EM) produces 3D ultrastructural data for cells and tissues with unprecedented coherency and detail (see Titze and Genoud, 2016 for review). On the other, spatial single-cell omics techniques have revolutionized expression profiling (Lein et al., 2017).
In established model species, the use of transgenic lines permits the combined molecular, morphological, and functional interrogation of cell types and enables multimodal databases and atlases (Clements et al., 2020; Milyaev et al., 2012; Sunkin et al., 2013; Altun et al., 2002). This includes the use of genetic tools to link connectomics and transcriptomics for the full central nervous system of the fly (Bates et al., 2019). This strategy is, however, very costly and hard to scale to the full organism. More recently, single-cell sequencing has enabled cell-type classification for any organism but at the cost of losing anatomical information. The objective of this study is to provide a framework for generating atlases that integrate transcriptional and morphological information for all cells in an entire animal body, using techniques that are not species specific. To this end, we introduce a pipeline integrating gene expression and cellular ultrastructure for the 6-days post-fertilization (dpf) young worm of the marine annelid Platynereis dumerilii. At this stage, Platynereis already exhibits a rich and differentiated set of cell types, which is comparable to that of many bilaterians, including vertebrates. However, because each cell type comprises a few cells only, the overall number of cells remains small. This goes in concert with a considerable stereotypy of Platynereis development and differentiation: the developmental lineage is invariant (Vopalensky et al., 2019), and differentiated larvae and young worms resemble each other down to the cellular detail (Asadulina et al., 2012; Randel et al., 2015; Tomer et al., 2010; Vergara et al., 2017).
We first acquired a serial block-face scanning EM (SBEM) volume and introduced a semi-automated multi-scale approach for cellular segmentation of the whole animal. This allowed us to quantitatively characterize the morphology of all body cells. This revealed classes of cells, such as neurons, muscle, or epithelial cells, that not only differ in shape or cytoplasmic features but also in chromatin characteristics. We then expanded an existing gene expression atlas for the 6-dpf stage (Vergara et al., 2017) and registered it to the segmented EM volume. For the first time, this enabled us to assign gene expression information to the segmented cells of an entire body and to cluster these according to expression profile. We demonstrate that such genetically defined clusters respect morphological boundaries and correspond to distinct ganglionic nuclei with coherent axonal projections. Our multimodal data reveal a fundamental duality in the Platynereis brain of segmentally iterated parts, such as sensory appendages, on one hand, and unique parts, such as the sensory-neurosecretory dorsal brain, on the other. Focusing on the mushroom bodies (MBs)—the invertebrate associative centers—we detect a proliferative center generating neurons that molecularly resemble vertebrate telencephalic interneurons. We provide the open-source platform “PlatyBrowser” to integrate, explore, and analyze multimodal data at the level of cell types, tissues, and organs. We expect the tools presented here to be transferable to EM volumes of other animals with some degree of developmental stereotypy, enabling the comparison of multimodal cell type catalogs across organisms.
Results
A whole-body SBEM dataset
An EM image stack of a complete 6-dpf young Platynereis worm was collected by SBEM at a pixel size (x/y) of 10 nm and 25 nm section thickness (z), resulting in 11,416 planar images made of >200,000 tiles for a total size of 2.5 TB. This dataset enabled detailed analyses of overall anatomy and ultrastructural detail throughout the body (Figures 1 and S1). For example, attachment complexes are identified in epithelial (Figure 1B) and ciliated support cells (Figure S1A). Made of intermediate filaments, these complexes connect to the underlying muscle layer and withstand the mechanical forces exerted at this interface. In the rhabdomeric photoreceptors of the eye, characteristic microvilli and sub-microvillar cisternae can be detected (Figure 1C). Individual myosin filaments are visible in Platynereis muscles, which lack the t-tubule system present in vertebrates and instead have an extended sarcoplasmic reticulum (Rosenbluth, 1968; Figure 1D).
The paired larval nephridia (Hasse et al., 2010) are found adjacent to the longitudinal muscles between the 2nd and 3rd segment (Figures 1E, 2D, and S1B). Each composed of 7 cells, they connect the coelomic cavity to the outside. All cells form a lumen, which is closed off via auto-cell junctions (Figure 1E) and into which they protrude motile cilia (with 9 + 2 microtubules; Figure 1E, inset). The terminal cell of each nephridium forms the nephridiopore, which is surrounded by microvilli. We also identify multiple transport vesicles (Bartolomaeus and Quast, 2005) and sites of endocytosis from the coelomic cavity (Figure S1B, inset). Similar ultrastructural analysis can be performed on other cells, illustrating the resolution power for the full 6-dpf Platynereis.
Segmentation of nuclei and cells
We aimed to segment every cell, nucleus, and selected ultrastructure from the EM volume. Nuclei were segmented with the Mutex Watershed algorithm (Wolf et al., 2018). For cell segmentation, we used the Lifted Multicut framework (Pape et al., 2019) that uses boundary predictions from a convolutional neural network, complemented by top-down constraints derived from the nucleus segmentation and additional tissue segmentations provided at lower scale. We corrected 154 merged cells using a semi-automatic approach and proofread the detailed morphology for selected cells with Paintera (Hanslovsky et al., 2020). The segmentation contains 11,402 cells with nuclei (Figure 2A). Segmentations were validated against 8 manually annotated slices (4 transversal and 4 horizontal) distributed throughout the dataset. Here, we found a 99.0% agreement with the automatic nuclear segmentation and a 90.3% agreement with the cellular segmentation (Figure S1G). Figure 2 exemplifies segmentation quality for epidermal cells (Figure 2B), muscles (Figure 2C), and nephridia (Figure 2D). Nuclei sizes ranged from 33.6 to 147.5 μm3 and cell sizes from 59.8 to 1,224.6 μm3. Note that automated segmentation did not cover neurites.
To assess the quality of cellular segmentation, we focused on the larval nephridia (Figure 2D). Previous transmission EM analysis revealed six cilia constantly present in the nephridial lumen (Hasse et al., 2010) but could not attribute them to individual cells. As proof of principle for comprehensive ultrastructure segmentation, we also segmented the nephridial cilia and showed that each of the 7 cells per side contributes several cilia to the continuous central bundle. The bundle contains 85 and 78 cilia on the left and right side, respectively. We observed a similar distribution of cilia per cell for both sides and found that, for any luminal cross-section, cilia mostly stem from one cell (Figure S1H).
The strong contrast in subnuclear structure allowed the semantic segmentation of chromatin using the nuclear segmentation as a mask (Figure S1I). Nuclei showed a meandering pattern of light and dark subregions, corresponding to classical euchromatin and heterochromatin plus nucleolus.
Morphological clustering of segmented cells
The whole-body, cellular-scale segmentation enabled quantitative morphological comparison of all cells. For this, we defined and measured 140 descriptors for cells and nuclei, including chromatin distribution (see Table S1; Figure S2C). We then clustered all cells using a graph-based approach (Figure 3A; see STAR Methods). Mapping clusters to the EM showed that they largely corresponded to neurons, muscles, and epithelial, digestive, or ciliated cells (Figure 3B). We also found a conspicuous group of cells with dark (highly scattering) nuclei and cytoplasm, sparsely distributed in the head. To validate the identity of the morphological clusters, we manually classified 978 cells (Figure S2A; see STAR Methods). These cell classes occupy distinct regions in the UMAP, demonstrating that they can be identified by the morphological descriptors. The exception is the broad category of secretory cells that likely comprise subclasses with a wide range of morphologies (Figure 3C).
We then focused on morphological features characterizing the manually identified cell classes (Figures 3D and S2B). For example, ciliated and digestive cells have larger nuclei than other cells, which goes in concert with a larger heterochromatin surface area (including nucleolus; Figure 3D). In contrast, the heterochromatin volume is similar to that of other cells, in line with our observation that, in Platynereis nuclei, the electron-scattering “heterochromatin” represents the actual DNA (with constant volume; Figures 3D and 3E). These relationships also apply to all nuclei (Figure 3F). The larger heterochromatin surface should reflect an increased exposure (i.e., unpacking) of the DNA, which we speculated might be indicative of a higher number of activated genes. Supporting this, active gene bodies (Miron et al., 2020) are found on the heterochromatin surface in Platynereis nuclei (Figure 3E). This indicates that, in the 6-dpf young worm, nucleus size is a proxy for the extent of gene activation in a cell type. For the ciliated and digestive cells, the increased number of active genes might relate to the production of cilia and microvilli.
We next explored the power of morphometry with regard to bilateral cell pairs. In the bilaterally symmetrical Platynereis 6-dpf young worm, there are hundreds of cell types comprising bilateral homologs (Vergara et al., 2017). For each cell, we ranked all other cells by distance in morphology space and identified potential bilateral partner cells by their mirror-image position (see STAR Methods; Figures 3G and S2D). Based on all morphology features combined, 14% of cells found a potential bilateral partner within the 5 nearest neighbors (20% within 10 and 28% within 20 nearest neighbors; Figure 3H). This is remarkable, given that there are more than 10,000 possible partners and that neurons, which constitute the majority of the cells, are morphologically very similar (Figure 3A). Although all features combined performed the best, followed by nuclear features (shape, intensity, texture, and chromatin distribution), we noted that the chromatin features alone (intensity and texture) still found bilateral pairs with surprising efficiency (e.g., 7% of cells finding a partner within the 5 nearest neighbors, 12% within 10, and 18% within 20). Because bilateral pairs of cells in the 6-dpf Platynereis are also identified via identical expression profiles (Vergara et al., 2017), these results suggest some correlation between chromatin morphology and gene expression profile.
Registration of EM volume and ProSPr gene expression atlas
The high stereotypy of Platynereis development (Vopalensky et al., 2019) allows for the generation of whole-body gene expression atlases with cellular resolution. This is achieved by whole-mount in situ hybridization, DNA-staining-based image registration to a common reference, and profiling by signal probability mapping (ProSPr) (Achim et al., 2018; Asadulina et al., 2012; Vergara et al., 2017). We expanded the 6-dpf ProSPr atlas for an improved representation of expression profiles throughout the body (Figures S3A and S3B; see STAR Methods), achieving a coverage of >7 genes in half of the cells. We next set out to integrate the SBEM dataset with the ProSPr atlas. Stepwise image registration (see STAR Methods) yielded a registration accuracy below one cell diameter (<4.7 μm; Figures 4A and S3E), allowing the overlay of gene expression and ultrastructural data.
The mapping of tissue markers confirmed the registration accuracy. For example, muscle markers label myocytes, with myosin heavy chain (mhc) in longitudinal and other muscles (Figures 4B and 4C) and the paraxis and twist transcription factors in oblique muscles (Figure 4B) and stomodaeal muscles (Figure 4D), respectively. We also find neural transcripts, such as the glutamate transporter glt1 (Figures 4B, 4E, and 4H) and the nicotinic acetylcholine receptor (nAchR), accurately overlapping the neuropil, indicating that these transcripts are transported into neurites. Epithelial genes, such as the cadherin pcdh15 (Figure 4B), the metabotropic glutamate receptor grm7, and the serine protease neurotrypsin ntrps, overlap with the flat epithelial cells lining the body (Achim et al., 2018) and the cathepsin-like L-protease cal2 with oral gland cells (Figure 4F) that resemble salivary glands in other annelids (Walz et al., 1988). Other expression patterns appear to obey tissue boundaries within an organ, such as the calcium-binding calmodulin in the stomodeum (Figure 4D) and the neuronal transcription factors dachshund (dash) and barH1 in nervous tissue (Figure 4H). Identifiable structures overlap between the two datasets (Figure 4G). Finally, we used the adult eyes for registration validation, with their complex cellular morphology comprising rhabdomeric photoreceptor cells and pigment cells (Figure 1C; Rhode, 1992), and revealed distinct patterns for the basic-helix-loop-helix (bHLH) transcription factor mitf demarcating pigment and photoreceptor and the bHLH factor olig demarcating photoreceptor cells alone (Figure 4E).
Correlation of gene expression with morphologically defined tissues
The unique combination of cellularly resolved gene expression and ultrastructure allowed us to explore the interplay between differential gene expression and morphology across all body cells. To do so, an expression value was calculated for every gene in every segmented cell as the fraction of the cell’s volume overlapping the registered gene expression volume (referred to as “overlap assignment”). We then used a graph-based clustering approach (with visualization by UMAP) that resulted in 15 clusters (Figure 5A). To understand how these clusters relate to the animal’s anatomy, we mapped body parts (head, cryptic segment, ventral nerve cord, pygidium, lateral ectoderm, foregut, and midgut) onto the UMAP and found a strict correlation with gross anatomy, with six genetic clusters spanning the head and two the ventral nerve cord (Figure 5B). Mapping these clusters onto the EM volume revealed that segmented cells that belong to the same genetic cluster occupy spatially coherent territories in the body, with a clear correspondence of genetic and tissue boundaries (Figure 5C). In the trunk, these transcriptionally defined domains correspond to ventral nerve cord cells, peripheral ganglia, musculature, and gland cells.
For a more detailed analysis of how gene expression relates to tissue boundaries, we focused on the animal’s head (Figure 5D)—the region with the highest gene density in the expression atlas (Figure S3A) and a more refined subdivision in the UMAP (Figure 5A). The 6-dpf brain is largely differentiated (Chartier et al., 2018) with discernible anatomical subunits called ganglionic nuclei (GN) (Engelhardt et al., 1982) that we manually segmented from the EM volume (Figure 5D, upper row). We then noted that the 6 head clusters closely matched the anatomically defined GN (Figure 5D). To quantify this, we calculated a specificity score for each genetic cluster and gene, for every ganglion (Figures 5E and S4A; see STAR Methods). Remarkably, for almost all GN, specificity values for individual genes were considerably lower than those for gene clusters. Notably, genes with relatively high specificity values often encoded transcription factors, such as dach, isl, or ap2 (Figure 5E), whose expression often occurred in coherent and overlapping domains. Highly specifically expressed transcription factors were found in subsets of one GN, as observed for arx, hox7, nk2.2, and sim1 (Figure 5E). These findings indicate that the Platynereis 6-dpf head is subdivided into transcriptional domains that are defined by the combined expression of several transcription factors largely corresponding to GN. Notably, the two clusters c6 and c9 together corresponded to a single GN, with the c6 cells mapping strictly anteriorly and the c9 cells strictly posteriorly. We thus subdivided that unit into a dorso-anterior and a dorso-posterior GN for subsequent analysis.
Expression profile and neuropil projections of head ganglia
The correspondence of transcriptional domains and GN in the head suggested distinct functions and projections of these units. In Platynereis, neuronal cell bodies surround a central neuropil, where they project and form synapses (Engelhardt et al., 1982; Verasztó et al., 2020). We pseudo-randomly selected 384 neurons in the head, manually traced their arborizations (see STAR Methods), and observed unique and distinct neurite projections and innervation fields for the neurons of each GN (Figures 6A–6C). This allowed us to define sensory and central GN of the 6-dpf brain based on gene expression, location, neuronal morphology, and projection information.
The palpal, antennal, and cirral GN (PG, AG, and CG), associated with the prominent sensory appendages of the head, contained many bipolar sensory neurons (89%, 88%, and 83%). The sensory endings of these cells enter the respective appendage, and their axons form prominent tracts that project locally in the central neuropil, with few axons crossing the midline in a single commissure (Figure 6C). We found many genes specifically expressed in two or all three sensory GN (in line with AG and CG belonging to one single cluster—c0), including the transcription factors brn3, prox, and asci. This indicates that these GN and their appendages are serial homologs (see Discussion). Other transcription factors were expressed in one of the three sensory ganglia only (e.g., hox1 in CG, pax258 in AG, and pax6 in PG), reflecting their divergent fates. In addition, the sensory ganglia jointly expressed the receptors pkd1 and pkd2, the Trp channels trpV4 and trpV5, and the ionotropic glutamate receptor iGluR (Figure S4), consistent with neurons in all three ganglia responding to glutamate (Chartier et al., 2018) and indicating that they also detect mechanical stimuli (Bezares-Calderón et al., 2018). Next, we inspected the circumpalpal GN (CpG) and found that they represented epithelial sheaths surrounding the PG with 26% bipolar sensory neurons. In line with epidermal properties, we found expression of markers, such as neurotrypsin (ntrps) and metabotropic glutamate receptor7 (grm7) (Achim et al., 2018).
We next identified the dorso-anterior and ventromedial GN (DAG and VMG), enriched in interneurons and with fewer bipolar sensory neurons (24% and 0%, respectively). DAG encompasses the medial tip of the brain with a dorsal barH1+ and a ventral nkx2.1+, dbx1+, lmx1+ part, both of which contain GABAergic neurons (Figure S4). The paired VMG encompass dbx1+, lmx1+, gdpd1+ GABAergic interneurons ventrally adjacent to the DAG (Figures S4A and S4B). Both VMG and DAG neurons project to the ventral-most portion of the neuropil, with VMG neurons sharing their projection target area with CpG neurons (Figure 6C).
The large dorsoposterior GN (DPG) forms a dorsal cap to the remainder of the brain. 50% of the cells are bipolar sensory and innervate the neurosecretory plexus (nsp), a distinct neuropil region formed by projections from peptidergic sensory-neurosecretory cells (Tessmar-Raible et al., 2007) filled with dense core vesicles and forming very few synapses (Williams et al., 2017). This unique brain part has been previously referred to as apical nervous system (Achim et al., 2018; Tosches and Arendt, 2013; Williams et al., 2017) and has been homologized to neuroendocrine pars intercerebralis of insects and associated central complex neuropil (He et al., 2019). Consistent with this, we detect a unique transcription factor signature (vat1-like, maf, otp, and rx; Achim et al., 2018) and neurosecretory markers, such as the prohormone convertase phc2, the serotonin-synthesizing enzyme tryptophan hydroxylase trpH, and the peptide receptor anpra (Figure S4). Laterally abutting the DPG, we found the adult eyes (AEs) specifically expressing the bHLH transcription factors olig and mitf (see above) and a photoreceptor marker, the neuropeptide fvri (Jékely et al., 2008). Our tracings confirmed previous findings for 3-day-old worms showing that the AE projections enter a commissure ventral to nsp (Williams et al., 2017; Figure 6C). Only cells from one other GN project to the nsp, those from the mushroom bodies (MBs).
Cellular architecture and molecular anatomy of the MBs
The tracings for the MBs revealed three distinct axon bundles projecting to the neuropil (Figures 6D and 6E). Two of these bundles are easily traced by immunostaining through development and correspond to the dorsal and ventral peduncles of the adult MBs (Figure S5A; Dürichen, 2016; Tomer et al., 2010). At 6 dpf, these dorsal and ventral peduncles are relatively close to each other and oriented parallel to the anterior-posterior axis of the animal. A third prominent peduncle of unknown fate is oriented laterally and perpendicular to the dorsal and ventral peduncles (Figures 6D and 6E).
Unexpectedly, we found that 17% of traced MB cells represented bipolar sensory neurons, which indicated that the 6-dpf Platynereis MBs have sensory properties. We noted that the dendrites of some of these neurons coalesce in tight bundles, so as to form joint sensory structures (asterisk in Figures 6D and 6E). We densely reconstructed the distal endings of the sensory bundles and found terminal cilia that do not penetrate the cuticle (Figure 6F), as characteristic for uniciliate non-penetrative sensory neurons (Purschke, 2005). We also observed that, among the MB cells, the bipolar sensory neurons were the only ones projecting to the nsp (Figure 6D), very similar to the sensory-neurosecretory cells of the apical nervous system. Intriguingly, these nsp-projecting bipolar sensory cells turned out to be the dark cells (Figures 3C, 3D, and S2A–S2D; see above). We accordingly examined all traced dark cells of the 6-dpf brain and found that they were mostly located on the DPG and projected specifically to the nsp (Figure S5F). This established a strong link between MBs and DPG, which both contain dark bipolar sensory-neurosecretory neurons projecting to the nsp.
Next, we investigated the molecular anatomy of the MB neurons (Figures 6G and S5B–S5D). We found eight genetically defined territories and 14 gene modules differentially expressed in these territories (see STAR Methods). Consistent with what we had observed for the entire brain, the genetically defined MB sub-territories corresponded to spatially and morphologically coherent groups of cells in the context of MBs anatomy. Several groups of neurons jointly fed into the ventral, dorsal, or lateral peduncles (Figures 6H and S5C–S5E) and thus constituted proximal or distal portions of the MB calyces. Each of these groups expressed distinct sets of transcription factors (e.g., arx and ptf1 in dCvP, hb9 in pCvP, barh1 in pCdP, and dlx and maf in the dClP) and marker genes for neurosecretion (e.g., phc2 and anpra in dClP), suggesting affinities of this part of the MBs to the apical nervous system (Figures 6G and S5B). Neurons of each territory also showed distinct projection areas into the central neuropil (Figures 6H and S5E).
We found that dCvP, together with two additional territories (MB-PR and MB-M), were enriched in cells labeled by our EdU proliferation assays (Figure S5B) that are devoid of long neuronal processes (see STAR Methods). The proliferating cells within these regions were located peripherally, close to the surface. These territories are identified as MB progenitor regions by their unique expression of two soxb2 paralogs and the specific co-expression of the transcription factors nkx2.1, dlx, arx, and lhx6 (Figure S4C). In addition, MB-PR and MB-M express many genes shared exclusively with other selected territories (e.g., gene modules 2, 3, 5, 7, 11, and 14), consistent with the notion that they contain diverse MB progenitors.
Assigning virtual cells to segmented cells
At single-cell resolution, assignment of gene expression by overlap may result in asymmetric profiles (Figures 7A and S6B), due to biological variability of the specimen, deformations during sample preparation, and registration error (Figure S6E). To overcome this, we first identified spatially coherent units of homogeneous gene expression in the ProSPr atlas that we named virtual cells (VCs) (Figure S6A; see STAR Methods). In total, we found 12,393 VCs, a number very similar to that of EM-segmented cells (11,402). We then assigned VCs to single segmented cells on each side according to minimal expression difference (Figures 7B and S6C; see STAR Methods). To benchmark our method, we compared the accuracy of overlap versus VC assignment by testing symmetric cell pairs (Figures S6H and S6I) for identical gene expression profiles (Vergara et al., 2017). VC assignment resulted in a mean discrepancy of 2.0 genes per cell pair, performing ∼30% better than overlap assignment with 2.8 genes mean discrepancy (Figure S6G). Specifically, VC assignment performed better in ∼70% of all pairs. Reflecting the atlas coverage (Figures S3A and S3B), VC gene density was not uniform throughout the animal body (Figure 7C). We observed that the assignment quality is higher for cells with better expression coverage (Figure S6F) and thus expect that adding more genes to the atlas will further improve gene assignment at cellular resolution.
PlatyBrowser: Multimodal image data exploration for Platynereis dumerilii
The registered ProSPr and SBEM datasets form a valuable resource containing rich biological information. However, the size of this resource (currently 231 image sources adding up to 2.02 TB lossless compressed image data) poses a challenge to its effective interrogation for scientific discovery. We therefore developed MoBIE: an open-source platform for multimodal big image data exploration and sharing. MoBIE consists of an object store backend for cloud-based image data hosting (Bogovic et al., 2020) and a Fiji (Schindelin et al., 2012) plugin for interactive image and segmentation data browsing. We use MoBIE to deploy the “PlatyBrowser” (Figures 7D–7G). Thanks to lazy loading (Pietzsch et al., 2015) from a public object store, the PlatyBrowser provides interactive access to the complete TB-sized Platynereis dataset to everyone with a standard computer and an internet connection (see STAR Methods).
Discussion
Seeking multimodality
A comprehensive understanding of cell types and their role in animal development, evolution, and physiology requires the integration of distinct modalities (Bates et al., 2019; Hobert et al., 2016; Nicovich et al., 2019). In particular, correlating gene expression with cellular and subcellular morphology is crucial to understand the principles that guide the decoding of expression information into cellular phenotypes.
One direction has been to combine expression profiling with cellular imaging. Several recent studies have implemented this for restricted sets of cells. For example, retrograde labeling of neurons prior to sequencing allows integration of partial projection information with transcriptomics (Kim et al., 2019; Phillips et al., 2019; Tasic et al., 2018). Photoconversion of dyes in selected cells can be used to correlate single-cell sequencing with live imaging techniques (Lee et al., 2019; Pfeffer and Beltramo, 2017). Other approaches based on probe hybridization or local sequencing provide expression information in situ, recovering the position of the cells interrogated (reviewed in Lein et al., 2017). Such spatial transcriptomics techniques can be correlated and complemented with single-cell transcriptomics (Kim et al., 2019; Phillips et al., 2019; Qian et al., 2020), and recent developments using viral barcoding have also enabled the integration of long-range projection information in a brain-wide manner (Chen et al., 2019). However, these approaches only retrieve multimodal information for small subsets of cells and/or their scalability is not straight-forward beyond a set of tissue slices (Qian et al., 2020).
The other possible direction is to complement EM with expression profiling. So far, such attempts have focused on cell-type populations (Davis et al., 2020) or have relied entirely on prior knowledge about the cells of interest. For example, this link is possible for recognizable neurons, such as mechanoreceptors, nocireceptors, and MB neurons in the Drosophila larvae (Ohyama et al., 2015; Saumweber et al., 2018) or peptidergic and ciliomotor neurons and mechanoreceptors in the Platynereis larvae (Bezares-Calderón et al., 2018; Verasztó et al., 2017; Williams et al., 2017). This approach can ultimately generate a catalog of functional, molecular, and morphological data for a multitude of cells in an organism, as is the case for the C. elegans nervous system with only 302 neurons (Hobert et al., 2016). More recently, proposals for the multimodal description of the Drosophila central nervous system, with 5.000 times more neurons than C. elegans, have been put forward (Bates et al., 2019). The rationale behind these approaches relies on anatomical information gathered through the use of cell-type-specific genetic markers and/or GAL4 lines. This restricts the use of cell-type integrative approaches, making it hard to scale to the organism level and hard to translate to species in which genetic manipulation is labor intensive. We now push these efforts to a new level by providing the first multimodal atlas combining EM and expression data for an entire animal.
A fully segmented volume EM dataset of an entire animal
Ultrastructural analysis of entire animals so far involved the manual collection of serial sections (Hall and Altun, 2007; Randel et al., 2014) and targeted manual or semi-automated reconstructions for sparse morphological and phenotypic characterization (Anderson et al., 2011; Bumbarger et al., 2006, 2013; Cardona et al., 2012; Kasthuri et al., 2015; Wanner and Friedrich, 2020; Wanner et al., 2016a; White et al., 1986). More recently, machine-learning-based reconstruction pipelines enabled the segmentation of entire neuronal circuits at synaptic resolution (Heinrich et al., 2018; Januszewski et al., 2018; Scheffer et al., 2020). Adapting SBEM approaches to Platynereis, we have imaged a full individual at high resolution (10 × 10 × 25 nm3), enabling analyses that span the length scales between the anatomy of entire organs and cellular ultrastructure. Extending existing pipelines (Beier et al., 2017; Pape et al., 2019), we also present the first automated segmentation of individual cellular somata, nuclei, and chromatin for a complete animal imaged in EM. Our method segments the whole multiple terabyte dataset in less than 3 days using at most 600 CPUs on a computer cluster and can be directly applied to similar datasets.
Taking advantage of 140 descriptors for 11,402 segmented cells, we provide a morphometry-based clustering that subdivides the body into cell classes, including neurons and the “dark” sensory-neurosecretory cells. This factors in not only cellular shapes and cytoplasmic texture but also nuclear shapes and chromatin features that reflect the regulatory state of the cells. In this context, we uncover a link between the nuclear size, heterochromatin volume, and regulatory state. This follows the notion that, upon gene activation, the heterochromatin “unfolds”, increasing its surface area so that active transcription takes place at the interface of the two chromatin phases. This morphometric representation of cell-type-specific regulatory states also underlies the power of our resource in detecting bilateral cell pairs (Vergara et al., 2017), which is highly efficient using chromatin morphometric data only.
Toward a genetic definition of tissue
A tissue is commonly defined as an ensemble of similar cells and their extracellular matrix with a specific function. Classically, identification of tissues relies on similarity in phenotype of the partaking cells as also manifest in our body-wide morphological clustering. Beyond that, our resource uniquely enables the genetic clustering of all segmented cells in the Platynereis body. Recent whole-body, single-cell transcriptomics have been the first to provide genetic clustering of all body cells (Achim et al., 2018; Packer et al., 2019; Sebé-Pedrós et al., 2018a, 2018b). In the absence of spatial mapping at late differentiation stages, however, these studies have not yet resolved to what extent, and at what hierarchical level, the genetically defined clusters represent coherent tissues. Our genetic clustering based on 201 differentially expressed genes subdivides the Platynereis body into 15 clusters that demarcate distinct tissues and body parts. In the central nervous system, these clusters represent spatially coherent groups of neurons, whose boundaries match those of hand-segmented ganglionic nuclei, which can thus be defined as genetically similar ensembles of cells, termed “transcriptional domains” (Achim et al., 2018). Our data further show that these domains are best defined by the combined expression of several regulatory genes rather than a single gene. This includes homeodomain factors, such as Pax6, that have been implicated in tissue specification (Rungger-Brändle et al., 2010; Zhang et al., 2019). Our neuronal tracing further reveals that the genetically defined ganglionic nuclei comprise neurons with similar projection patterns. In line with this, previous work identified transcription factors that control tissue integrity via the establishment of an “adhesive code”, as shown for Pax6, that controls axonal connectivity (Jones et al., 2002).
Our unique combination of cellular-resolution gene expression, anatomical segmentation, and neurite tracing yields new insights into the overall organization of the body. For example, we provide evidence that the Platynereis head is composed of both segmentally iterated and unique components. Solving a long-standing question about the relatedness and significance of annelid head appendages (e.g., Weigert et al., 2014), our data reveal that Platynereis antennae, palpae, and peristomial cirri express transcription factors and Trp channels that are shared with segmental trunk appendages. This indicates that the sensory head appendages and associated ganglia are serial duplicates in a segmental “mechanosensory girdle” (Verasztó et al., 2020). Within this girdle, antennae and palpae may represent separate appendages of the same anteriormost segment, as they project to related areas of the anterior neuropil. In a similar manner, the dbx+, lmx1+, gad+ neurons in the ventral brain form part of a segmentally iterated series of GABAergic neurons extending to the nerve cord (see bookmark for Figure S4C in the PlatyBrowser). In contrast, the molecular and morphological characteristics of the Platynereis dorsal brain do not reoccur in other body regions. For example, the sensory neurons of the DPG are unique in their combined transcription factor identity and in their projection to the neurosecretory plexus (Achim et al., 2018; Tessmar-Raible et al., 2007; Williams et al., 2017).
Annelid MBs and the evolution of associative centers
The link between cellular gene expression, tracings, and subcellular morphology improves our understanding of the Platynereis MBs. Given their overall morphological similarity to the insect MBs, these are commonly considered associative centers of the annelid brain, composed of densely packed interneurons in the calyces that project through the MB stalks (Heuer et al., 2010). In the past decade, a renewed discussion has emerged whether and at what level insect and annelid MBs represent homologous structures and whether there is any evolutionary relatedness to the vertebrate telencephalic cortex (Strausfeld, 2010; Tomer et al., 2010; Wolff and Strausfeld, 2016). Regarding this, our data provide new insights in two exciting directions.
First, we learn that the MB calyces not only comprise unipolar interneurons but also bipolar sensory neurons that project into the neurosecretory plexus, thus qualifying as sensory-neurosecretory cells, and express marker genes for the apical nervous system. Both ciliary morphology and GPCR expression indicate that the Platynereis bipolar sensory-neurosecretory cells act as chemosensors. This is intriguing with regard to the evolution of associative centers. Although both insect MBs and vertebrate telencephalic cortex do not contain olfactory sensory neurons themselves, they receive prominent olfactory input and have particularly elaborate control over olfaction-driven behaviors, suggesting that the distinction of odors may have played a pivotal role in the evolution of associative learning. With our new data, the hypothesis emerges that the evolution of associative centers/MBs may have built on a chemosensory organ that gradually acquired associative properties.
Second, the specific combined expression of the transcription factors nkx2.1, dlx, lhx6, and arx in the proliferative region of the Platynereis MBs—with arx being exclusively expressed in this region—is remarkable when compared to vertebrate telencephalic development. Vertebrate orthologs of these transcription factors are implicated in the specification of the telencephalic interneurons emerging and emigrating from the ganglionic eminences (Marsh et al., 2016) and continue to demarcate striatal and cortical interneurons in the adult mouse telencephalon (Zeisel et al., 2018). In particular, the same full combination is present in the striatal cholinergic interneurons that co-transmit glutamate (Kljakic et al., 2017; Zeisel et al., 2018). This is intriguing, given the broad co-occurrence of vesicular acetylcholine and glutamate transporters in the Platynereis MBs. Again, our data shed new light on the evolution of associative centers, in that co-transmitting interneurons may represent one of their oldest neuron types, possibly antedating the split of protostome and deuterostome lineages.
Conclusions and outlook
Our work represents a first step toward a comprehensive interrogation of the relationship between gene expression and subcellular morphology. Our resource will further improve with the addition of new genes and with the mapping of single-cell transcriptomics data into the ProSPr atlas (Achim et al., 2018) and thus onto the segmented EM cells. For this, we can capitalize on the assignment of VCs to segmented cells as an intermediate for combining expression space and morphospace. Next, the segmentation of additional ultrastructural features (e.g., mitochondria and Golgi apparatus) will expand opportunities for genotype-phenotype correlations.
We also envisage that enhanced resolution of new EM datasets registered onto our resource will allow the automated reconstruction of neuronal circuits and mapping of synapses at a larger scale. Notably, a completely traced serial section EM dataset already exists for 3-dpf Platynereis (Verasztó et al., 2020; Williams and Jékely, 2019), with few cells already manually linked to expression data (Williams et al., 2017). With our new resource, this can now be extended to integrate connectomics and transcriptomics for an entire animal and for different developmental stages. Additionally, the multimodal capacity of the PlatyBrowser will allow the incorporation of physiological data for identified cell types using tools such as calcium imaging (Chartier et al., 2018) and CRISPR-Cas9 (Bezares-Calderón et al., 2018). Importantly, the experimental techniques used for data collection (electron microscopy, in situ hybridization, and single-cell RNA sequencing) can be readily applied to non-canonical laboratory species. The comprehensive integration of whole-body connectomics and transcriptomics will open the door to the in toto comparison of cell types and neural circuits within and across organisms at the genetic and ultrastructure level, bringing us closer toward the understanding of the physiology, development, and evolution of living systems.
Limitations of the study
Our success capitalizes on two main features of the Platynereis three-segmented young worm that may not apply to other model species. The small size of 6-dpf Platynereis allows acquiring an entire individual in one SBEM dataset (Titze et al., 2018), which is not yet possible for an equally differentiated stage of an insect or a vertebrate. Advances in EM will allow acquiring larger samples and expanding the range of accessible species. Also, the high stereotypy of Platynereis development is key to multimodal image registration at cellular resolution (Vergara et al., 2017) but may not apply to animals with more regulative development and random or mosaic cell arrangements (as in cnidarians or vertebrates). However, considerable degrees of stereotypy are present in other systems, such as nematodes (Schafer, 2016), Drosophila (Jenett et al., 2012), Aplysia (Katz and Quinlan, 2019), the chordate Ciona (Satoh, 1999), and early neurons of zebrafish (Metcalfe and Westerfield, 1990). Even in cases where cellular-resolution alignment cannot be obtained, we expect that multimodal registration will be possible at the level of genetically homogeneous cell types or tissues.
STAR★Methods
Key resources table
Resource availability
Lead contact
Further information and requests for resources should be directed and will be fulfilled by the lead contact, Detlev Arendt (arendt@embl.de).
Materials availability
Plasmids to generate in situ hybridization probes are available upon requests, no MTA required.
Experimental model and subject details
Platynereis dumerilii larvae were obtained from an established culture at EMBL Heidelberg. Animals were kept and raised in natural seawater, at a constant temperature of 18°C, and under a 16 - 8 hours light - dark cycle. The study used animals at 6 days post fertilization.
Method details
Sample fixation, preparation, and imaging
6 dpf Platynereis dumerilii were anaesthetised using 7% MgCl2 in seawater (1:1 ratio). Platynereis worms were fixed in a solution of 2% formaldehyde and 2.5% glutaraldehyde in 0.1 M sodium cacodylate buffer for 4 days at 4°C.
Fixed samples were prepared for SBEM following an adapted form of the NCMIR protocol (Deerinck et al., 2010) aided by microwave application (Pelco Biowave). Samples were postfixed with 2% osmium tetroxide in 1.5% potassium ferrocyanide (14 min of 2 min on/off cycles, 150 W, with vacuum) followed by rinsing with H2O. The rinse protocol, used throughout processing, involved one initial exchange of H2O on the bench and twice aided by the microwave (40 s, 80 W). The samples were then incubated in 1% aqueous solution of thiocarbohydrazide (14 min of 2 min on/off cycles, 150 W, with vacuum with the cold spot set to 40°C), followed by rinsing with H2O. This was followed by a second step of osmium tetroxide, this time in a 2% aqueous solution (14 min of 2 min on/off cycles, 150 W, with vacuum) and another rinse step. Samples were then incubated in 1% uranyl acetate (aqueous) overnight at 4°C. The following day the samples were rinsed and the final step of staining was performed. Samples were transferred to Walton’s lead aspartate solution for 14 min in the microwave (2 min on/off cycles, 150 W, vacuum, 50°C). Samples were again rinsed with H2O and then dehydrated with increasing concentrations of ethanol (20%, 50%, 70%, 90%, 3 × 100%). Samples were then infiltrated with durcupan resin through increasing percentages of resin with ethanol (25%, 50%, 75%, 3 × 100%). Samples were first prepared using the minimal resin method (Schieber et al., 2017) and then embedded in silver epoxy resin (Wanner et al., 2016a).
Samples were mounted onto aluminum pins with 2-part silver epoxy. Images were acquired with a ZEISS Merlin SEM at 1.8 keV landing energy, 270 pA beam current, and 0.8 μs dwell time. ∼2.5 TB comprising 11,416 slices with > 200,000 image tiles were acquired at 10 × 10 nm2 pixel size and 25 nm cutting thickness. During the seven-week-long acquisition we used the open-source acquisition software SBEMimage (https://github.com/SBEMimage/SBEMimage) (Titze et al., 2018). For image registration, translational offsets between neighboring image tiles were calculated using a custom optimized normalized cross-correlation procedure (Wanner et al., 2016a). Subsequently, pairwise relative offsets between neighboring tiles were used to optimize the tile positions in a global total least square displacement sense. Before in-plane stitching, the histograms of neighboring image tiles were matched in order to adjust and homogenize the contrast and brightness. A subset of about 10% of the sections showed nonlinear distortion artifacts due to sample charging. For these sections, a non-distorted neighboring section was manually chosen as a reference for distortion correction using the ImageJ plugin bunwarpJ (Arganda-Carreras et al., 2006).
Intensities of the fully stitched z-slices were then matched to remove intensity and contrast jumps in the z axis. This was done by calculating the 5% (L) and 95% (U) quantiles of intensity for each slice within the Platynereis (excluding the resin and silver embedding) from a downsampled version of the raw data (pixel size of 0.32 × 0.32 × 0.025 μm3). The 95% quantile U was matched between slices to adjust for shifts in absolute intensity, while the quantile range U - L was matched between slices to adjust for shifts in contrast. For each slice, these adjustments can be written as a linear transformation of the intensities: xc = ((U-L)ref/(U-L)) (x–U) + Uref where xc are the corrected intensities of the slice, x are the raw intensities of the slice and ‘ref’ refers to the reference which was taken as the median of all slices. Using the median intensity of the resin around the Platynereis as a reference (as this should be fairly constant throughout the dataset), we noticed that this technique performed less well at the start and end of the dataset, where only the tips of the Platynereis remain. To correct for this, we chose a minimum z cutoff of slice 800 and a maximum z cutoff of slice 9800 – values beyond these cutoffs were corrected by the median of the correction factors of the 100 slices next to each cutoff (i.e., the calculated values are extrapolated for the very tips of the dataset).
Scripts for the intensity correction in z can be found on github: https://github.com/mobie/platybrowser-datasets/tree/master/misc/intensity_correction
Immunofluorescence labeling and 3D-SIM
H3K36me3 immunolabelling was performed on 4% PFA-fixed 6 dpf Platynereis specimens using a 1:200 concentration of the Abcam antibody ab194677. Super-resolution 3D structured illumination microscopy (SIM) of H3K36me3 immunolabelled and 4’, 6-diamidino-2-phenylindole (DAPI) stained specimens was performed on a DeltaVision OMX V3 Blaze system (GE Healthcare) equipped with sCMOS cameras (PCO), and 405, 488 and 593nm lasers, using a 60x NA 1.42 PlanApo oil immersion objective lens (Olympus). Specimens were mounted on a microscope slide with Vectashield and then covered with a No.1.5H (170 μm ± 5 μm tol.) coverslip (Marienfeld Superior). To adapt for spherical aberration when imaging in extended depth (10-20 μm) immersion oil with a refractive index (RI) of 1.518 was used. Raw data was acquired with a z-distance of 125 nm and with 15 raw images per plane (5 phases, 3 angles) and reconstructed with SoftWoRx 6.2 (GE Healthcare) using channel-specifically measured optical transfer functions (OTFs) generated from 100 nm diameter FluoSphere beads (ThermoFisher) recorded with 1.514 RI oil, respectively, and Wiener filter setting 0.0040 (Demmerle et al., 2017). Color channel alignment was performed using Chromagnon image registration software (Matsuda et al., 2018) using biological 3D calibration slides of simultaneously multicolor detected EdU pulse labeled mammalian cells. 3D-SIM raw and reconstructed data quality was assessed with SIMcheck (Ball et al., 2015). The full-scale 32-bit reconstructed data was thresholded for each channel to the stack modal gray value (representing the center of the background intensity level) and converted to 16-bit composite tif-stacks using the ‘threshold & 16-bit conversion’ utility of SIMcheck.
Segmentation Methods
In total, we provide segmentations of all cells, all nuclei, the cuticle and selected tissues and body parts as well as nuclear chromatin. For simpler segmentation tasks (tissues and regions of the animal: coelomic cavity, glands, gut, secretory cells and yolk) where the region boundary is very pronounced we use the carving workflow of ilastik (Berg et al., 2019) on downsampled data (80 × 80 × 100 nm3). Muscles and neuropil tissue were segmented at the same resolution, using a CNN for semantic segmentation. Similarly, to segment nuclear chromatin we use ilastik pixel classification, limiting it to the pre-segmented nuclei regions. For ilastik training the nuclei of 50 cells with diverse nuclear morphology were interactively annotated as “heterochromatin + nucleolus” or “euchromatin” classes, on data downsampled to 20 × 20 × 25 nm3 voxel size.
For the more complex tasks of cell and nuclei segmentation as well as for the segmentation of the cilia and cuticle, we extended the state-of-the-art EM segmentation methods originally developed for neural tissue blocks. In essence, the pipeline consists of a membrane detection step performed by a 3D U-net and a graph agglomeration step performed either by the Lifted Multicut or the Mutex Watershed algorithms.
In detail, we start from the segmentation of the nuclei. The groundtruth annotations for CNN training were provided by ariadne.ai (12 blocks of 400 × 400 × 120 pixels each) and additionally curated. A 3D U-Net (Çiçek et al., 2016) was trained to predict short- and long-range pixel affinities as described in Lee et al. (2017) and to predict for each pixel whether it belongs to a nucleus. The predictions were processed by the Mutex Watershed algorithm (Wolf et al., 2018), blockwise in blocks of 512 × 512 × 64 pixels. We chose Mutex Watershed over the more common superpixel-based Multicut agglomeration as we observed Multicut frequently merges individual nuclei which touch across a very small portion of their boundary (short-circuiting of the multicut constraints). The individual block segmentations were then stitched together using Multicut-based agglomeration with edge weights derived from pixel affinites as described in Pape et al. (2017). All computations were done on raw data downscaled to the resolution of 80 × 80 × 100 nm3.
Cilia and cuticle segmentations were performed using the same method as for the nuclei. The cilia segmentation was performed at full resolution (10 × 10 × 25 nm3 voxel size) but only applied to the segmented nephridia cells, using 3 blocks of annotated training data consisting of a total of 171 megavoxels. The cuticle segmentation was performed for data downscaled to 40 × 40 × 50 nm3 using 5 training blocks consisting of a total of 495 megavoxels.
Cell segmentation was also started from membrane detection. The groundtruth annotations were provided by ariadne.ai, consisting of 8 blocks of 628 × 628 × 130 pixels that were additionally curated and extended by one additional block of size 1280 × 1280 × 120 pixels to include more biological variability. A 3D U-net was trained to predict short- and long-range pixel affinities. In addition, we insert the edges of tissue and region segmentations (see above) into the affinity predictions, in order to avoid missing boundary signal due to the very different appearance of some region/tissue boundaries. These predictions were then used to break the volume into superpixels by the blockwise distance transform-based watershed algorithm (Beier et al., 2017). The superpixels were used to construct a region adjacency graph and to solve the segmentation problem as a graph partitioning with Lifted Multicut (Horňáková et al., 2017). Unlike (Beier et al., 2017) and other connectomics pipelines (Funke et al., 2019; Januszewski et al., 2018; Lee et al., 2017), we additionally exploit the nuclei segmentation to enforce separation of cells containing different nuclei. To that end we introduce lifted edges between superpixels which belong to the segmented nuclei, attractive for the superpixels of the same nucleus and repulsive for the superpixels of different nuclei. Lifted edges are introduced up to a graph distance of 4 and the attractive / repulsive edge weight is set to the maximum / minimum of the local edge weights. This approach adapts the common boundary based approach used for neuron segmentation to the larger variety of appearance found in the cellular segmentation task for the complete animal. The overall lifted multicut problem was solved by the hierarchical solver introduced in Pape et al. (2019). Since the nuclei repulsion is only included up to a certain graph distance, there are still objects in the resulting cell segmentation that contain more than one nucleus segment. We find these in post-processing and separate them individually by running a graph watershed seeded from the nodes mapped to the nuclei. Cell segmentation was performed on the raw data downscaled to a voxel size of 20 × 20 × 25 nm3; the runtime for the whole volume measured 10 hours on 6 GPUs for the neural network prediction and 20 hours on a CPU computer cluster for the agglomeration part. Several ganglia in the animal’s head were identified by manually selecting the corresponding segmented cells.
We leverage prior information from nuclei and tissue segmentation to mitigate segmentation errors arising from ruptured cellular membranes and diverse appearance of cell boundaries. While these issues could be mitigated by providing additional training data, this process is very laborious for 3D segmentation. Instead, we prefered to rely on the nuclei segmentation - a much simpler problem which our algorithm solves to 99.0% accuracy - and tissue segmentation, which can be achieved at lower resolution. Our use of nuclei as prior knowledge assumes that every cell should contain only one nucleus, a constraint that is expected to be true for all cells (including muscles) at this developmental stage.
The final proof-reading was performed in a semi-automated manner. First, we compute a morphology-based score for all cells that roughly matches the likelihood of a cell being a false merge. Then, we iterate through the top 1000 cells based on their rank by this score and correct all cells that contain a false merge by running graph watershed from user-generated seeds. This resolving step had to be applied to 154 falsely merged cells. In addition, we use Paintera (Hanslovsky et al., 2020) to perform some more fine grained proof-reading. Note that the segmentation currently provided in the PlatyBrowser has not been proof-read down to the pixel level of every cell. We corrected the errors we found by the approach above and additionally polished the segmentation of the regions used in the analysis presented here: nephridia, adult eyes, symmetric cells used for the gene assignment validation. For reference, finalising the segmentation of the nephridia (14 cells with far-reaching cilia) from the fully automatic pipeline results took approximately 2 hours with Paintera.
In addition, semantic segmentations based on a CNN for the muscles and the neuropil were provided by ariadne.ai.
We provide the weights for the networks used to segment cell membranes, cilia, cuticle and nuclei on zenodo https://zenodo.org/record/3675288 as well as the corresponding training data https://zenodo.org/record/3675220. The ilastik project and training data for the chromatin segmentation is also available at https://zenodo.org/record/3676534. The ilastik projects for carving the animal outline and regions/tissue are available at https://zenodo.org/record/3678793.
The scripts to run the segmentation methods are available at https://github.com/mobie/platybrowser-datasets/tree/master/segmentation.
Segmentation Validation
The validation of the segmentation was based on manual annotations for cell centers and nuclei on 8 slices (4 transversal, 4 horizontal) from 8 domain experts, each slice annotated at least twice without access to the automatic segmentation results. The missing detections in the expert annotations were then additionally corrected by comparison with the automatic segmentation. We provide the validation data at https://zenodo.org/record/3690727.
In total we observe, for the cell segmentation task, 6.30% false split errors and 3.23% false merge errors, based on 4806 annotations. For the nucleus segmentation task, we found 0.49% false positive detections; 0.55% false negative detections based on 2888 annotations. Figure S1G gives an overview of the expert annotations and 2 examples of false merge and false split errors each.
Groundtruth annotation
The groundtruth annotations for training the automated segmentation pipelines were generated and curated by human expert annotators in the open-source 3D annotation software KNOSSOS (https://knossos.app/, https://github.com/knossos-project/knossos).
Neuron tracing
The morphology of each neuron was traced using the open-source tracing software PyKNOSSOS (https://github.com/adwanner/PyKNOSSOS) (Wanner et al., 2016b). Starting from seed points in somata, human expert annotators manually reconstructed the skeleton of the neurons by following their neurites within the 3d EM volume. Neurite branch points were manually identified and tagged. In an iterative procedure, the different neurites originating from a given branch point were reconstructed. In total 384 neurons have been reconstructed manually with a total length of 36.94 mm, which took a total of 304.51 hours of manual tracing. All reconstructed neurons are available in the PlatyBrowser and the skeleton files can be downloaded from https://zenodo.org/record/4660134.
Expansion of the 6dpf ProSPr atlas
To obtain comprehensive expression information for all cells in the body at 6 dpf, we introduced new genes into the 6dpf ProSPr resource focusing on areas with poor expression coverage in previous versions of the atlas, such as the digestive system. For example, we selected 37 genes that we found highly enriched in a foregut cDNA library (see section below) and that proved to be differentially expressed in the foregut. We also introduced additional genes with preferential nervous system expression, with special focus on neural differentiation genes. As a result, the 6 dpf ProSPr gene expression atlas (referred to as ProSPr atlas) now contains information for 201 genes (including 78 transcription factors, 56 neural effector genes, and 53 other differentiation genes) and 4 EdU proliferation stainings (see below).
Foregut cDNA library
Around 40 maturing animals (juveniles) were isolated from the tubes and starved for three days. The epidermis around the foregut was removed to reveal the muscular pharynx and the anterior part of the organ. The tissue was immediately fixed in liquid nitrogen. Tissue was then ground in Trizol and a Trizol/Chloroform extraction was performed. RNA was purified with the RNeasy kit (QIAGEN columns). Illumina library was prepared by the EMBL GeneCore facility. cDNA for individual gene cloning was generated with SuperScript III kit (Invitrogen).
In situ hybridization and registration
Animal breeding, animal fixations, and whole-mount in situ hybridizations were performed as described previously (Tessmar-Raible et al., 2005). Light microscopy sample imaging and gene expression maps were generated following the ProSPr protocol as described in Vergara et al. (2017). Animal body parts (head, cryptic segment, ventral nerve cord, foregut, lateral ectoderm and pygidium) were manually segmented in TrackEM (Cardona et al., 2012) using the ProSPr DAPI reference.
EdU proliferation assays and tracing analysis
We incorporated EdU (5-ethynyl-2′-deoxyuridine) at 200 mM directly in sea water during specific developmental periods (42 hours to 48 hours; 3 days to 4 days; 4 days to 5 days; 5 days to 6 days) in order to label the nuclei of those cells undergoing mitosis within a certain period. Fluorescent labeling was done using the Click-iT™ EdU kit following the manufacturer’s instructions (Invitrogen).
We categorized EdU positive cells from the period 4 to 6 days old as late dividing and should represent immature neurons and/or progenitor cells. We hypothesize that this should be reflected in a lack of long neuronal processes. To test this, we randomly selected 10 EdU positive cells and 10 EdU negative cells and traced their morphology. Only 1 out of 10 of the EdU positive cells showed a projection to the central neuropil, contrasting with 9 out of 10 of the EdU negative cells, confirming that these cells are still developing and constitute proliferation domains. This correlation adds additional support to the accuracy of the registration between the ProSPr atlas and the electron-microscopy dataset.
Registration of ProSPr to EM
As the typical variation between individuals in the position of individual cells is less than one cell diameter (< 4.7 μm) (Vergara et al., 2017), we reasoned that registration should be possible at close to cellular resolution. To avoid compromising the appearance of ultrastructural morphology, we computed a transformation of the lower resolution ProSPr atlas onto the higher resolution SBEM volume. We used the software package elastix (Klein et al., 2010; Shamonin et al., 2014) to compute a multi-step registration of the average DAPI signal (representing nuclei of 153 specimens from the ProSPr atlas) onto the binary mask of the segmented nuclei of the EM individual (Figure S3C). Historically, elastix is mainly used by the medical image analysis community. To increase the usability of elastix for the biological community we developed the Fiji (Schindelin et al., 2012) plugin ElastixWrapper (Tischer, 2019), which we used for the registration. As an input for the registration we prepared three image files at an isotropic voxel spacing of 0.55 μm: DAPI: a greyscale image of the average DAPI signal of ProSPr (Vergara et al., 2017); EM-Nuclei: a binary image containing the segmented EM nuclei; and EM-Mask: a binary mask covering the image region containing information relevant for the registration optimization (Figure S3). To prepare the EM-Nuclei image we removed 42 nuclei in the gut region from the nuclei segmentation binary mask as these nuclei do not locate to stereotypical positions and were not reflected in the ProSPr DAPI signal, and therefore they did not properly guide the registration. Next, we downsampled the binary mask of the remaining 11,456 segmented nuclei to the 0.55μm voxel size of the ProSPr data; note that due to the bi-linear downsampling algorithm the voxel values in the EM-Nuclei image are not strictly binary, but show some shades of gray (Figure S3B). To prepare the EM-Mask image we isotropically dilated the EM-Nuclei image by 12 pixels, corresponding to 6.6 μm. The EM-Mask serves to restrict the optimization algorithm of elastix to relevant regions of the image. The usage of such a mask in elastix is optional, though we found it critical for our image data because it contains a lot of empty space due to the oblique orientation of the EM image within the 3D voxel space. Using these images we sequentially ran elastix with transformation models of increasing deformability: Similarity: a rigid transformation, allowing for rotation, translation, and a uniform scaling factor; BSpline100: a locally deformable BSpline transformation (Rueckert et al., 1999) with a 3-D voxel grid spacing of 100 pixels (55 μm); BSpline30: BSpline transformation with a grid spacing of 30 pixels (16.5 μm); BSpline10: BSpline transformation with a grid spacing of 10 pixels (5.5 μm). The registrations build on each other, e.g., the BSpline100 transformation takes the Similarity transformation as a starting point. Importantly, since the two datasets are initially very misaligned (Figures S3A and S3B) using the automated registration algorithms of elastix directly did not work for us. The reason is that elastix uses local search algorithms for improving the transformation parameters and thus typically needs a decent starting condition. To produce such a starting condition, we used the Fiji plugin TransformJ Rotate (https://imagej.net/libs/imagescience) to manually determine angles that roughly align the two datasets in 3-D. We then converted these angles into an elastix transformation file (Manual-Rotation). Using the Manual-Rotation transformation as a starting point, we could then use elastix to sequentially improve the registration. After each step in the registration sequence we visually compared the transformed DAPI image with the EM-Nuclei image to qualitatively assess whether the registration improved (e.g., using overlays as shown in Figure 4A in the main text). To further assess registration quality, we manually identified 43 corresponding landmarks between the EM and ProSPr datasets, covering all relevant regions of the specimen (Figure S3D). These landmarks were selected based on unequivocal positions in both datasets using the nuclei, muscle and neuropil signals. Using these landmark pairs we could measure the registration quality at each step by transforming the manually assigned landmark coordinates in EM space into ProSPr space and then comparing these computed coordinates with the manually assigned coordinates. Measuring the pairwise distances of the 43 coordinate pairs for the different registration steps we obtained the following results (mean, median): Manual-Rotation: 63.57 μm, 58.79 μm; Similarity: 11.90 μm, 10.09 μm; BSpline100: 5.29 μm, 4.48 μm; BSpline30: 4.23 μm, 3.53 μm; BSpline10: 3.47 μm, 2.99 μm. The fact that the mean and median are similar shows that there are no severe outliers and thus the registration quality is consistent across all parts of the animal (Figures S3D and S3E). In addition, the pairwise distances decrease with each registration, showing that the elastix algorithm indeed converged to a better registration. We did not attempt to improve the registration beyond the current measured accuracy of ∼3 μm, because at 6 dpf the typical variation in the position of individual cells in platynereis is ∼4.7 μm (Vergara et al., 2017).
Scripts and transformation files used for the registration can be found on github: https://github.com/mobie/platybrowser-datasets/blob/master/registration/0.6.3
Morphology clustering
Various morphological, intensity and texture features were calculated from the cell, nucleus and chromatin segmentations (full description of features in Table S1). Cells (and their associated nuclei and chromatin) were filtered to remove those most likely to be affected by segmentation errors – cells were removed that had no assigned nucleus, were in certain regions (yolk, neuropil, cuticle, cavities), or outside a reasonable size range. This resulted in 11348 remaining cells. All features were calculated on data downsampled to 80 × 80 × 100 nm3 resolution.
For downstream analysis, any cells (and associated nuclei and chromatin) that were within the region for extrapolated intensity correction (see Sample fixation, preparation and imaging methods section) were also discarded (leaving a total of 10344 cells). This was to avoid any possible artifacts in features that rely on the raw intensity data.
This resulted in a table of 140 features by 10344 cells. All features were standardized by centring to a mean of 0 and scaling to unit variance. A K-nearest neighbor graph was then constructed (k = 10) using Euclidean distance between the feature vectors. Community detection (using the Louvain method (Blondel et al., 2008)) was performed with a resolution parameter of 1.2, resulting in 11 clusters. Clusters were visualized on a UMAP (McInnes et al., 2018) (Uniform Manifold Approximation and Projection for Dimension Reduction) using the following parameters: n_neighbors = 10 and min_dist = 0.1.
All the morphological clustering analysis was performed in Python with scikit-image (van der Walt et al., 2014), vigra (http://ukoethe.github.io/vigra/), scipy (Virtanen et al., 2020), mahotas (Coelho, 2012), scikit-learn (Pedregosa et al., 2012), networkx (Hagberg et al., 2008), python-louvain (https://github.com/taynaud/python-louvain), umap-learn (McInnes et al., 2018), pandas (McKinney, 2010), numpy (van der Walt et al., 2011) and snakemake (Köster and Rahmann, 2012). All code is freely available in the github repository (the clustering code as a snakemake workflow). The snakemake workflow also contains code for various other analyses e.g., clustering of different subsets of morphological features, plotting of Platynereis regions on the UMAP, plotting gene expression on the UMAP, or construction of various heatmaps.
The script for calculation of morphological features is available here: https://github.com/mobie/platybrowser-datasets/blob/master/mmpb/extension/attributes/morphology_impl.py and the Snakemake workflow for clustering analysis is available here: https://github.com/mobie/platybrowser-datasets/tree/master/analysis/morphology_clustering. The final clustering is available here: https://github.com/mobie/platybrowser-datasets/blob/master/data/1.0.1/tables/sbem-6dpf-1-whole-segmented-cells/morphology_clusters.csv and the final UMAP here: https://github.com/mobie/platybrowser-datasets/blob/master/data/1.0.1/tables/sbem-6dpf-1-whole-segmented-cells/morphology_umap.csv.
Cell type annotation
In order to be able to investigate morphological and genetic differences between cell types, segmented cells were annotated as follows. First, seven morphological distinct cell categories were defined: neurons, dark cells (cells with very dark, uniform nuclei and cytoplasm), epithelial, muscle, digestive midgut cells, secretory (a broad category of any cells with prominent endoplasmic reticulum, Golgi and secretory vesicles / inclusions) and ciliated cells (multiciliated cells on the surface) (Figure S2A). Second, all the 11402 segmented cells were split into mini-blocks of consecutive 10 cells. In order to ensure uniform sampling across the animal, these mini-blocks were randomly shuffled and merged together into bigger blocks, consisting of 100 cells each. Random 10 of the resulting 115 blocks were manually annotated using the annotation functionality of the MoBIE viewer. The annotators could assign one of the abovementioned cell classes or the labels ‘unsure’ or ‘error’. As a result, 65% of the cells were annotated as neurons, 16% as epithelial cells, 11% as muscle cells, 3.5% as dark cells, 2.38% as midgut cells, 1.80% as secretory cells and 0.35% as ciliated cells. Afterward, more manual annotation was done specifically targeting the underrepresented cell classes: ciliated, secretory, midgut and dark cells. This resulted in 571 neuron, 141 epithelial cells, 53 muscle cells, 41 dark cells, 25 ciliated cells, 51 secretory cells and 55 midgut cells. The annotation results are available here: https://github.com/mobie/platybrowser-datasets/blob/master/data/1.0.1/tables/sbem-6dpf-1-whole-segmented-cells/default.csv (column ‘cell_type’).
Bilateral pair analysis
To assess how different subsets of morphological features (Table S1) perform on finding bilateral pairs of cells in the dataset, we established a set of xyz criteria for being bilateral based on the position of the cell’s nuclei. As the Platynereis is bent in the EM image volume, we cannot directly use the xyz position of the nuclei to determine if they are bilateral or not. Instead, we first calculated a midline surface for the Platynereis by fitting a second order polynomial of two variables to a set of manually chosen points that lie on the midline. The absolute distance from this surface could then be calculated for all nuclei. While this accounts for the distance from the midline, it doesn’t assess if nuclei have a similar position along the anterior-posterior (AP) or dorsal-ventral (DV) axis of the Platynereis. To account for this, we transformed the xyz position of all nuclei from the EM space back to the original ProSPr space via the program elastix (Klein et al., 2010; Shamonin et al., 2014) (same transformation parameters as calculated for the registration – see Registration of ProSPr to EM methods section). As the AP and DV axes are nicely aligned to y and z in the ProSPr space, we can use these coordinates as estimates for AP and DV position. The full criteria are then: similar absolute distance from midline; on opposite sides of the midline; similar y position in ProSPr space; similar z position in ProSPr space.
To assess a reasonable range to consider ‘similar’, these statistics were calculated for a set of manually curated bilateral pairs of cells (202 pairs total). The acceptable difference was then set to the mean over these pairs plus two standard deviations.
Given these criteria to assess if cells are bilateral, we then wished to calculate the distance in morphology space that must be traveled from one cell to find its potential bilateral partner. This was calculated first by taking a certain subset of the morphology criteria (calculated as in the Morphology clustering methods section) and standardizing them – by centring each feature’s mean to 0 and scaling to unit variance. For each cell, a ranking was then formed (based on Euclidean distance in morphology space) of every other cell in the dataset - from its very closest neighbor, to its most distant neighbor. Then, using the xyz criteria above, the closest neighbor was found in this ranking that was a potential bilateral partner.
The subsets used in Figure 3H were: ‘all’ (all morphology features), ‘cell’ (only cell features), ‘nucleus’ (all nucleus and chromatin features), and ‘chromatin’ (only intensity and texture features of chromatin).
To compare to random assignment of bilateral pairs, the ranking of cells in morphology space was randomly shuffled, and the same analysis performed. This was repeated 100 times, and the mean of all trials taken for the final randomized results shown in Figure 3H.
Scripts for this analysis are available here: https://github.com/mobie/platybrowser-datasets/tree/master/analysis/bilateral_pairs , and the midline calculation is available here: https://github.com/mobie/platybrowser-datasets/tree/master/analysis/midline. The table of manually curated pairs is available here: https://github.com/mobie/platybrowser-datasets/blob/master/data/1.0.1/tables/sbem-6dpf-1-whole-segmented-cells/symmetric_cells.csv. The midline fit and final graphs were calculated in R, making use of the tidyverse (Wickham et al., 2017) and rgl (https://cran.r-project.org/web/packages/rgl/index.html) packages. All other analysis was in Python with pandas (McKinney, 2010), numpy (van der Walt et al., 2011), scikit-learn (Pedregosa et al., 2012) and scipy (Virtanen et al., 2020).
Gene expression clustering
Overlap assignment was used to assign a vector of gene overlap values to each segmented cell. In brief, the fraction of the volume of each segmented cell overlapping with the registered volume of every gene was calculated. This resulted in a vector of length 201 (equal to the number of genes) with values ranging from 0 (no overlap) to 1 (complete overlap).
Segmented cells were then filtered to remove those most likely to be affected by segmentation errors – cells were removed when they had no assigned nucleus, when they were in certain regions (yolk, neuropil, cuticle, cavities), or when they were outside a reasonable size range. In addition, any cells that expressed no genes were discarded. This resulted in 11,366 remaining cells.
A K-nearest neighbor graph was then constructed (k = 20) using Euclidean distance between the gene overlap vectors. Community detection (using the Louvain method (Blondel et al., 2008)) was performed with a resolution parameter of 1.2, resulting in 15 clusters.
Clusters were visualized on a UMAP (McInnes et al., 2018) (Uniform Manifold Approximation and Projection for Dimension Reduction) using the following parameters: n_neighbors = 20 and min_dist = 0.1. All the gene expression clustering analysis was performed in Python with scikit-learn (Pedregosa et al., 2012), networkx (Hagberg et al., 2008), python-louvain (https://github.com/taynaud/python-louvain), umap-learn (McInnes et al., 2018), pandas (McKinney, 2010), numpy (van der Walt et al., 2011) and snakemake (Köster and Rahmann, 2012).
This section used version 1.0.0 of the data. The code is freely available as a snakemake workflow: https://github.com/mobie/platybrowser-datasets/tree/master/analysis/gene_clustering. The snakemake workflow also contains code for various other analyses e.g., clustering of different kinds of gene expression (binarised by a threshold, assignment by virtual cells etc), plotting of Platynereis regions on the UMAP, plotting morphology statistics on the UMAP, or construction of various heatmaps. The final clustering is available here: https://github.com/mobie/platybrowser-datasets/blob/master/data/0.6.5/tables/sbem-6dpf-1-whole-segmented-cells/gene_clusters.csv and the final UMAP here: https://github.com/mobie/platybrowser-datasets/blob/master/data/1.0.0/tables/sbem-6dpf-1-whole-segmented-cells/gene_umap.csv.
Ganglia and gene specificity analysis
The detail in the EM volume and cell segmentation allowed us to classify and group the cells in the animal’s head in a complete and unbiased manner by using ultrastructural landmarks such as connective tissue and basal membranes (as compared to previous efforts recognizing brain anatomy mostly based on immunohistochemistry (Chartier et al., 2018; Tomer et al., 2010)).
Segmented head ganglia were displayed on the same UMAP as the gene expression data (see Gene expression clustering methods).
A specificity score was calculated for every gene cluster and individual gene for every ganglia. This specificity score is a combination of two measures: first, the fraction of expression confined to a given ganglion (A) and second, the fraction of that ganglion covered (B). In analogy to the F1 score, specificity is calculated as: 2AB / (A + B)
The calculation for individual genes relies on overlap assignment to segmented cells (see Gene expression clustering methods). A threshold of 0.5 was used to label a cell as expressing a particular gene.
The specificity calculations are part of the same workflow as described in the Gene expression clustering section: https://github.com/mobie/platybrowser-datasets/tree/master/analysis/gene_clustering , specific script here: https://github.com/mobie/platybrowser-datasets/blob/master/analysis/gene_clustering/scripts/ganglia_specificity.py , and also used version 1.0.0 of the data.
The MB ganglia genetic heatmap was generated by the following method. Cells are grouped into clusters (genetic territories) obtained via louvain community detection (as described in section ‘Morphology clustering’). Genes were sorted hierarchically and grouped into modules respecting the hierarchy of the tree (gene modules). For each module and genetic territory, we calculated the average expression overlap value, which we use to derive a specificity score as described above.
Generation of virtual cells
The generation of Virtual Cells (VCs) was done as in Vergara et al. (2017) with some modifications (see also main text). Analysis of ProSPr data and generation of VCs was done using the software R Bioconductor (Gentleman et al., 2004). The atlas was binned by a factor of 3 to generate SuperVoxels (SVs) - cubes of 27 isotropic voxels of 0.55 μm side (each SV represents a cube of 1.65 μm edge length and 4.5 μm3 volume). On average, and given the size of a nucleus in the EM dataset (67.32 ± 15.19 μm3), each cell is minimally covered at least by 12 SVs. Note that we used the nucleus size to estimate cell coverage as the big majority of cells are neurons (small cytoplasm), and supervoxels along cell boundaries will result in noisy signals. Next, we generated a matrix with the SV spatial location in xyz and the proportion of pixels within that SV positive for each gene. SVs with very low correlation with their neighbors were removed from the analysis as they represent noise. Due to memory constraints, the SVs were subdivided into different anatomical regions (e.g., head, ventral nerve cord, foregut, etc) for further analysis.
Next, SVs were grouped into Virtual Cells (VCs). This is done by a recursive process of hierarchical clustering to find groups of SVs based on their similarity in expression profile (Figure S6A). This process does not take into account the location of the SVs, but it is constrained by the final size of the group, allowing groups to be formed so that they represent between 1 and 6 cells, always optimizing for expression correlation. For hierarchical clustering we used the function ‘hclust’ with the method ‘complete’. For distance calculation we used the method ‘jaccard’ from the ‘vegan’ package. For computing SVs correlation we used the basic R function ‘cor’ with the following parameters: ‘ use = “complete.obs,” method = “kendall” ’. We found the majority of clusters comprising spatially confined groups of SVs with bilateral symmetry, which is a good indication of the high quality of the atlas.
In contrast to Vergara et al. (2017), VCs were automatically curated based on size and spatial distribution of their constituent SVs. In more detail, first all the VCs that contain less than 16 supervoxels were filtered out, since below this size they consistently showed low spatial correlation (probably representing cell boundaries). Next, VCs were split into spatially connected components (CCs), and the ones below size 5 supervoxels in size were filtered out (cell size values below 5 SVs do not properly represent a cell). All CCs comprised of 8 or more supervoxels were kept without further checks, and the ones in the range of [5,8) were checked for a symmetric partner - if there was another CC of the same VC symmetrically located on the other side of the animal, the CC was considered plausible and retained. The partner CC was considered symmetric if located within the radius of 4 supervoxels (one cell diameter) from the mirrored coordinates of the center of the given CC. After this curation step the VCs are assembled together again (so they are retained as bilateral cell types) and their expression profile is calculated.
The complete automated pipeline for generating Virtual Cells from the ProSPr gene expression maps can be found on GitHub: https://github.com/mobie/prospr-vc-generation.
Virtual cell assignment
In order to assign the gene expression patterns to the segmented cells the Virtual Cells (VCs) volume was registered onto the EM one. To compensate for biological variability and registration error for each cell we considered all the VCs found in the radius of 5 μm from the cell boundaries. The expression pattern of the cell calculated by gene overlap (Figures 7A and S6B) was compared to the expression pattern of every VC found. Finally, the segmented cell was assigned to the genetically closest VC in the vicinity (Figures 7B and S6C). Considering the fact that the available 201 genes were mostly targeting specific tissues, the animal can and does have areas without or with minimal gene coverage. That is why in case for a given cell either no VCs were found around, or an absence of any expression was genetically closer than any of the VC’s found, the cell was allowed not to be assigned any gene expression or, more precisely, to be assigned zero expression for all the genes.
Notably, although the spatial location of the assigned virtual cells (Figure S6D) is consistent with their spatial location in ProSPr space (Figure S6A), only in 39% of cases the segmented cell was actually assigned the spatially closest Virtual Cell. This shows that biological variability combined with the registration error limits the precision of cellular assignment based on expression overlap only.
In order to assess the assignment accuracy we used morphologically identifiable symmetric cell pairs that are supposed to have identical gene expression profiles. In total, 208 unambiguous symmetric pairs of segmented cells were marked in the EM volume, covering most tissues (Figures S6H and S6I). For each cell pair, we calculated the difference between genetic profiles assigned to the cells in the pair. This difference was compared for overlap versus virtual cell assignment (Figure S6G).
To compare this type of assignment to purely spatial assignment we also assigned the Virtual Cells by overlap: each segmented cell was assigned either the VC that overlapped its volume the most, or no expression in case no VC overlap was present. Such assignment gave a worse result in the symmetric cells test: the mean gene distance between symmetric cells was 3.62 (compared to 2.0 distance for assignment by genetic distance described above).
This analysis used version 1.0.0 of the data. Only segmented cells that contained nuclei were used for the analysis.
MoBIE and the PlatyBrowser
To make available the Platynereis atlas to the community we developed MoBIE (https://mobie.github.io). MoBIE is a free and open-source platform for multi-modal big image data exploration and sharing. It consists of an S3 object store (https://en.wikipedia.org/wiki/Amazon_S3) backend for cloud based image data sharing (Bogovic et al., 2020), GitHub based storage of tabular data and project metadata, and an easy to install Fiji (Schindelin et al., 2012) viewer for integrated browsing of the whole dataset.
The PlatyBrowser uses the MoBIE framework for the interactive browsing of our Platynereis atlas data. To this end, the MoBIE viewer fetches data from the https://github.com/mobie/platybrowser-datasets/ GitHub repository (see above section on Data Provision). To ensure consistent data access we implemented a git-based versioning system to accommodate further improvements of the registration or segmentations, as well as contributions of entirely new imaging modalities from the community. The repository contains different versions of the data; the most recent version at time of publication is 1.0.1 and the reported results are based on this version unless noted otherwise.
The MoBIE Fiji viewer is written in Java and thereby executable on all major platforms (Linux, Mac, Windows). The code is publicly available in a GitHub repository (https://github.com/mobie/mobie-viewer-fiji), which also contains the latest installation and usage instructions (https://github.com/mobie/mobie-viewer-fiji#installation). The graphical user interface of the MoBIE Fiji viewer is implemented using the Java Swing library (https://en.wikipedia.org/wiki/Swing_(Java)). The image data handling and visualization is based on imglib2 (Pietzsch et al., 2012), a java library for processing large N-dimensional image data. For image data visualization, the MoBIE viewer uses BigDataViewer (BDV) (Pietzsch et al., 2015). BDV enables arbitrary plane slicing of volumetric image data. This viewing modality is ideal for the exploration of our SBEM data, which is too dense for volume rendering. In addition, BDV supports simultaneous display of image data of different resolutions, thereby enabling the overlay of ProSPr (0.55 × 0.55 × 0.55 μm3) and SBEM (10 × 10 × 25 nm3) images (e.g., Figures 4B–4H).
Thanks to lazy-loading from a chunked pyramidal image file format (Pietzsch et al., 2015), everyone with an internet connection is able to browse the TB-sized atlas from a standard computer. The PlatyBrowser is thus an accessible and extendable resource for the investigation of Platynereis dumerilii ultra-structural morphology and gene expression.
In order to efficiently explore complex datasets, we designed a user interface for choosing from hundreds of image sources of different modalities as well as for adjusting their display settings (Figures 7D and 7G). The registration of the ProSPr atlas to the SBEM dataset permits to compute a (ProSPr based) gene expression profile at each location in the SBEM space. To enable interactive exploration of this information the PlatyBrowser provides the possibility to query gene expression profiles at each location, presented as both a ranked list and gene expression table (Figure 7G).
To explore segmentations, MoBIE provides adjustable lookup tables (Glasbey et al., 2007; Hanslovsky et al., 2020) to overlay each segment with a distinct color onto the image data. Much of this code was conceptualised and developed during a “Fiji hackathon” in Ostrava (Czech Republic, January 2019), and we are very grateful to Tobias Pietzsch, who, during this event, provided invaluable support with the software design. We use this functionality to visualize nuclei, tissue, and cell segmentations on top of the image data (Figures 7E and 7G). This makes it possible to visually observe whether distinct tissue regions correlate with specific ultrastructural morphology. Often, not only ultrastructural morphology but also the 3D shape of anatomical regions is indicative of biological function. We therefore added the possibility to render segmented cells in an interactive 3D viewer (Schmid et al., 2010) (Figure 7G).
The nuclei, cell, and tissue segmentations allow for the extraction of features such as cellular morphological descriptors or gene expression profiles (Figures 3 and 4). To interactively explore these measurements, MoBIE displays a table for each segmentation, where table rows correspond to segmented objects and table columns to feature values. Tables can be sorted according to any feature value and objects of special interest can be selected (highlighted) both in the viewer and in the table. Importantly, the coloring of objects can be configured to reflect any feature value, making it possible to overlay object-based measurements on the image data (see Figures 3B, 3G, 5C, and 5D).
In addition, we implemented a functionality to manually assign object annotations by making the tables editable. For example, we have used this feature to assign the cells in the head to ganglia. We also implemented a bookmark function that not only stores positions and views, but can also update the whole browser state, including segmentations, tables, and objects rendered in the 3D viewer. Bookmarks were used to provide interactive versions for many figure panels presented in this manuscript (see figure legends in the main text).
Quantification and statistical analysis
Where appropriate, the quantitative analyses are described in the relevant sections of the Method details.
Additional resources
In addition to the methods described above, we have generated documentation for the MoBIE Fiji plugin, which can be found at https://mobie.github.io.
Acknowledgments
We thank Josep Moscardo and EMBL IT Services for cloud infrastructure and Stephan Saalfeld, Igor Pisarev, and Tobias Pietzsch for software design. We thank Gemma Estrada Girona for help with Figures 4 and 7 and the graphical abstract and for the design of the MoBIE logo. We thank Julia Tran and Remmer Janssen for curating muscle and neuropil segmentation and supervising groundtruth annotation. We thank Giulia Mizzon for annotations and segmentation validation, David Puga for helping reconstruct the midline and neuronal tracing, and Julieta Acevedo for H3K36me3 stains. Thanks to Sian Culley for the name MoBIE. H.M.V. was supported by an EMBO LTF, K.N.M. by Horizon 2020 “EvoCELL” project 766053, and D.A. and P.Y.B. by the Advanced grant “NeuralCellTypeEvo” 788921 of the European Commission.
Author contributions
Conceptualization, A.K., C.T., D.A., H.M.V., and Y.S.; methodology, A.A.W., A.K., C.G., C.P., C.T., H.M.V., K.I.M., and V.Z.; software, A.A.W., B.T., C.P., C.T., H.M.V., K.I.M., and V.Z.; validation, C.P., C.T., D.A., H.M.V., K.I.M., P.M., P.Y.B., R.M.T., V.Z., and Y.S.; formal analysis, C.P., H.M.V., K.I.M., O.S., and V.Z.; investigation, B.T., C.G., E.L.S., H.M.V., K.N.M., L.S., O.S., P.M., P.Y.B., and W.D.; resources, A.K., C.P., C.T., H.M.V., K.I.M., and V.Z.; data curation, C.P., K.I.M., and V.Z.; writing – original draft, A.K., C.P., C.T., D.A., H.M.V., K.I.M., R.M.T., V.Z., and Y.S.; writing – review and editing, A.A.W., A.K., B.T., C.G., C.P., C.T., D.A., H.M.V., K.I.M., R.M.T., R.W.F., V.Z., L.S., and Y.S.; visualization, C.P., C.T., H.M.V., K.I.M., R.M.T., V.Z., and Y.S.; supervision, A.K., C.P., C.T., D.A., H.M.V., and Y.S.; project administration, D.A. and Y.S.; funding acquisition, A.K., D.A., R.W.F., and Y.S.
Declaration of interests
A.A.W. is the founder and owner of ariadne.ai ag.
Published: August 10, 2021
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.cell.2021.07.017.
Contributor Information
Yannick Schwab, Email: yannick.schwab@embl.de.
Rainer W. Friedrich, Email: rainer.friedrich@fmi.ch.
Anna Kreshuk, Email: anna.kreshuk@embl.de.
Christian Tischer, Email: christian.tischer@embl.de.
Detlev Arendt, Email: arendt@embl.de.
Supporting citations
The following references appear in the supplemental information: Haralick et al. (1973).
Supplemental information
Data and code availability
All data and code generated during this study are publicly available.
The Electron microscopy data and EM based segmentation are archived and can be downloaded from EMPIAR (https://www.ebi.ac.uk/pdbe/emdb/empiar/) under the accession id 10365. Gene expression maps, prospr segmentations, and registration files are archived and can be downloaded from BioStudies (https://www.ebi.ac.uk/biostudies/) under the accession id S-BIAD14.
In addition to EMPIAR and BioStudies we host all image data on an aws-s3 (http://aws.amazon.com/s3) compatible object store at EMBL Heidelberg using the open source implementation of MinIO (https://min.io). The data are stored in the n5 data format (Bogovic et al., 2020), which allows chunking and compression for efficient data access. It can be accessed on demand through the n5 s3 API (https://github.com/saalfeldlab/n5-aws-s3).
The tabular data derived from images and segmentations are hosted on github: https://github.com/mobie/platybrowser-datasets/tree/master/data/1.0.1/tables. The repository https://github.com/mobie/platybrowser-datasets also contains the image metadata in the BigDataViewer xml file format, which serves as an entry point to start the PlatyBrowser for the data hosted on the s3 object store. In order to keep track of changes in the derived data, for example due to segmentation corrections, we use a versioning scheme inspired by https://semver.org/. Between each version only the data that has actually changed is updated, whereas unchanged data are referred to by links to older versions.
The software package used for EM acquisition SBEMImage is available at https://github.com/SBEMimage/SBEMimage and described in the publication (Titze et al., 2018). The software for generating the virtual cells from ProSPr is available at https://github.com/mobie/prospr-vc-generation and under the https://doi.org/10.5281/zenodo.4899523. The software for EM segmentation and morphological as well as genetic analysis of segmented cells is available at https://github.com/mobie/platybrowser-datasets and under the https://doi.org/10.5281/zenodo.4899527. The MoBIE Fiji viewer plugin is available at https://github.com/mobie/mobie-viewer-fiji and under the https://doi.org/10.5281/zenodo.2602754.
References
- Achim K., Eling N., Vergara H.M., Bertucci P.Y., Musser J., Vopalensky P., Brunet T., Collier P., Benes V., Marioni J.C., Arendt D. Whole-body single-cell sequencing reveals transcriptional domains in the annelid larval body. Mol. Biol. Evol. 2018;35:1047–1062. doi: 10.1093/molbev/msx336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altun Z.F., Herndon L.A., Wolkow C.A., Crocker C., Lints R., Hall D.H. 2002. WormAtlas.http://www.wormatlas.org [Google Scholar]
- Anderson J.R., Jones B.W., Watt C.B., Shaw M.V., Yang J.-H., Demill D., Lauritzen J.S., Lin Y., Rapp K.D., Mastronarde D. Exploring the retinal connectome. Mol. Vis. 2011;17:355–379. [PMC free article] [PubMed] [Google Scholar]
- Arendt D., Musser J.M., Baker C.V.H., Bergman A., Cepko C., Erwin D.H., Pavlicev M., Schlosser G., Widder S., Laubichler M.D., Wagner G.P. The origin and evolution of cell types. Nat. Rev. Genet. 2016;17:744–757. doi: 10.1038/nrg.2016.127. [DOI] [PubMed] [Google Scholar]
- Arganda-Carreras I., Sorzano C.O.S., Marabini R., Carazo J.M., Ortiz-de-Solorzano C., Kybic J. In: Beichel R.R., Sonka M., editors. Springer Berlin Heidelberg; 2006. Consistent and elastic registration of histological sections using vector-spline regularization; pp. 85–95. (Computer Vision Approaches to Medical Image Analysis). [Google Scholar]
- Asadulina A., Panzera A., Verasztó C., Liebig C., Jékely G. Whole-body gene expression pattern registration in Platynereis larvae. Evodevo. 2012;3:27. doi: 10.1186/2041-9139-3-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ball G., Demmerle J., Kaufmann R., Davis I., Dobbie I.M., Schermelleh L. SIMcheck: a toolbox for successful super-resolution structured illumination microscopy. Sci. Rep. 2015;5:15915. doi: 10.1038/srep15915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartolomaeus T., Quast B. In: Bartolomaeus T., Purschke G., editors. Springer; 2005. Structure and development of nephridia in Annelida and related taxa; pp. 139–165. (Morphology, Molecules, Evolution and Phylogeny in Polychaeta and Related Taxa. Developments in Hydrobiology). [Google Scholar]
- Bates A.S., Janssens J., Jefferis G.S., Aerts S. Neuronal cell types in the fly: single-cell anatomy meets single-cell genomics. Curr. Opin. Neurobiol. 2019;56:125–134. doi: 10.1016/j.conb.2018.12.012. [DOI] [PubMed] [Google Scholar]
- Beier T., Pape C., Rahaman N., Prange T., Berg S., Bock D.D., Cardona A., Knott G.W., Plaza S.M., Scheffer L.K. Multicut brings automated neurite segmentation closer to human performance. Nat. Methods. 2017;14:101–102. doi: 10.1038/nmeth.4151. [DOI] [PubMed] [Google Scholar]
- Berg S., Kutra D., Kroeger T., Straehle C.N., Kausler B.X., Haubold C., Schiegg M., Ales J., Beier T., Rudy M. ilastik: interactive machine learning for (bio)image analysis. Nat. Methods. 2019;16:1226–1232. doi: 10.1038/s41592-019-0582-9. [DOI] [PubMed] [Google Scholar]
- Bezares-Calderón L.A., Berger J., Jasek S., Verasztó C., Mendes S., Gühmann M., Almeda R., Shahidi R., Jékely G. Neural circuitry of a polycystin-mediated hydrodynamic startle response for predator avoidance. eLife. 2018;7:e36262. doi: 10.7554/eLife.36262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blondel V.D., Guillaume J.-L., Lambiotte R., Lefebvre E. Fast unfolding of communities in large networks. J. Stat. Mech. 2008;2008:P10008. [Google Scholar]
- Bogovic J.A., Pisarev I., Hanslovsky P., Thistlethwaite N., Saalfeld S. 2020. N5, a scalable Java API for hierarchies of chunked n-dimensional tensors and structured meta-data. https://github.com/saalfeldlab/n5. [Google Scholar]
- Bumbarger D.J., Crum J., Ellisman M.H., Baldwin J.G. Three-dimensional reconstruction of the nose epidermal cells in the microbial feeding nematode, Acrobeles complexus (Nematoda: Rhabditida) J. Morphol. 2006;267:1257–1272. doi: 10.1002/jmor.10456. [DOI] [PubMed] [Google Scholar]
- Bumbarger D.J., Riebesell M., Rödelsperger C., Sommer R.J. System-wide rewiring underlies behavioral differences in predatory and bacterial-feeding nematodes. Cell. 2013;152:109–119. doi: 10.1016/j.cell.2012.12.013. [DOI] [PubMed] [Google Scholar]
- Cardona A., Saalfeld S., Schindelin J., Arganda-Carreras I., Preibisch S., Longair M., Tomancak P., Hartenstein V., Douglas R.J. TrakEM2 software for neural circuit reconstruction. PLoS ONE. 2012;7:e38011. doi: 10.1371/journal.pone.0038011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chartier T.F., Deschamps J., Dürichen W., Jékely G., Arendt D. Whole-head recording of chemosensory activity in the marine annelid Platynereis dumerilii. Open Biol. 2018;8:180139. doi: 10.1098/rsob.180139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen X., Sun Y.-C., Zhan H., Kebschull J.M., Fischer S., Matho K., Huang Z.J., Gillis J., Zador A.M. High-throughput mapping of long-range neuronal projection using in situ sequencing. Cell. 2019;179:772–786.e19. doi: 10.1016/j.cell.2019.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Çiçek Ö., Abdulkadir A., Lienkamp S.S., Brox T., Ronneberger O. In: Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016. Ourselin S., Joskowicz L., Sabuncu M.R., Unal G., Wells W., editors. Springer International Publishing; 2016. 3D U-Net: learning dense volumetric segmentation from sparse annotation; pp. 424–432. [Google Scholar]
- Clements J., Goina C., Kazimiers A., Otsuna H., Svirskas R., Rokicki K. Janelia Research Campus; 2020. NeuronBridge Codebase. [Google Scholar]
- Coelho L.P. Mahotas: open source software for scriptable computer vision. arXiv. 2012 https://arxiv.org/abs/1211.4907 arXiv:1211.4907v2. [Google Scholar]
- Davis F.P., Nern A., Picard S., Reiser M.B., Rubin G.M., Eddy S.R., Henry G.L. A genetic, genomic, and computational resource for exploring neural circuit function. eLife. 2020;9:e50901. doi: 10.7554/eLife.50901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deerinck T.J., Bushong E.A., Thor A., Ellisman M.H. 2010. NCMIR methods for 3D EM: a new protocol for preparation of biological specimens for serial blockface scanning electron microscopy.https://ncmir.ucsd.edu/sbem-protocol [Google Scholar]
- Demmerle J., Innocent C., North A.J., Ball G., Müller M., Miron E., Matsuda A., Dobbie I.M., Markaki Y., Schermelleh L. Strategic and practical guidelines for successful structured illumination microscopy. Nat. Protoc. 2017;12:988–1010. doi: 10.1038/nprot.2017.019. [DOI] [PubMed] [Google Scholar]
- Dürichen W. University of Heidelberg; 2016. Neuroanatomy and function of mushroom bodies in 6- and 12- days old Platynereis dumerilii larvae. Master thesis. [Google Scholar]
- Engelhardt R.P., Dhainaut-Courtois N., Tramu G. Immunohistochemical demonstration of a CCK-like peptide in the nervous system of a marine annelid worm, Nereis diversicolor O.F. Müller. Cell Tissue Res. 1982;227:401–411. doi: 10.1007/BF00210894. [DOI] [PubMed] [Google Scholar]
- Funke J., Tschopp F., Grisaitis W., Sheridan A., Singh C., Saalfeld S., Turaga S.C. Large scale image segmentation with structured loss based deep learning for connectome reconstruction. IEEE Trans. Pattern Anal. Mach. Intell. 2019;41:1669–1680. doi: 10.1109/TPAMI.2018.2835450. [DOI] [PubMed] [Google Scholar]
- Gentleman R.C., Carey V.J., Bates D.M., Bolstad B., Dettling M., Dudoit S., Ellis B., Gautier L., Ge Y., Gentry J. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5:R80. doi: 10.1186/gb-2004-5-10-r80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glasbey C., van der Heijden G., Toh V.F.K., Gray A. Colour displays for categorical images. Color Res. Appl. 2007;32:304–309. [Google Scholar]
- Hagberg A.A., Schult D.A., Swart P.J. In: Exploring Network Structure, Dynamics, and Function using NetworkX. Varoquaux G., Vaught T., Millman J., editors. USDOE; 2008. pp. 11–15. [Google Scholar]
- Hall D.H., Altun Z.F. Cold Spring Harbor Laboratory; 2007. C. elegans Atlas. [Google Scholar]
- Hanslovsky, Pisarev P., Leite I., Saalfeld, S. V., Funke J., Günther tpietzsch, Bogovic U., Pape J., Schmidt, U. C. 2020. saalfeldlab/paintera: Paintera 0.23.0.https://zenodo.org/record/3674037#.YPnI2UApBPY [Google Scholar]
- Haralick R.M., Shanmugam K., Dinstein I. Textural features for image classification. IEEE Trans. Syst. Man Cybern. 1973;3:610–621. [Google Scholar]
- Hasse C., Rebscher N., Reiher W., Sobjinski K., Moerschel E., Beck L., Tessmar-Raible K., Arendt D., Hassel M. Three consecutive generations of nephridia occur during development of Platynereis dumerilii (Annelida, Polychaeta) Dev. Dyn. 2010;239:1967–1976. doi: 10.1002/dvdy.22331. [DOI] [PubMed] [Google Scholar]
- He B., Buescher M., Farnworth M.S., Strobl F., Stelzer E.H., Koniszewski N.D., Muehlen D., Bucher G. An ancestral apical brain region contributes to the central complex under the control of foxQ2 in the beetle Tribolium. eLife. 2019;8:e49065. doi: 10.7554/eLife.49065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinrich L., Funke J., Pape C., Nunez-Iglesias J., Saalfeld S. In: Frangi A.F., Schnabel J.A., Davatzikos C., Alberola-López C., Fichtinger G., editors. Springer International; 2018. Synaptic cleft segmentation in non-isotropic volume electron microscopy of the complete Drosophila brain; pp. 317–325. (Medical Image Computing and Computer Assisted Intervention – MICCAI 2018: 21st International Conference, Granada, Spain, September 16-20, 2018, Proceedings, Part II). [Google Scholar]
- Heuer C.M., Müller C.H., Todt C., Loesel R. Comparative neuroanatomy suggests repeated reduction of neuroarchitectural complexity in Annelida. Front. Zool. 2010;7:13. doi: 10.1186/1742-9994-7-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hobert O., Glenwinkel L., White J. Revisiting neuronal cell type classification in Caenorhabditis elegans. Curr. Biol. 2016;26:R1197–R1203. doi: 10.1016/j.cub.2016.10.027. [DOI] [PubMed] [Google Scholar]
- Horňáková A., Lange J.-H., Andres B. Proceedings of the 34th International Conference on Machine Learning - Volume 70. JMLR.org; 2017. Analysis and optimization of graph decompositions by lifted multicuts; pp. 1539–1548. [Google Scholar]
- Januszewski M., Kornfeld J., Li P.H., Pope A., Blakely T., Lindsey L., Maitin-Shepard J., Tyka M., Denk W., Jain V. High-precision automated reconstruction of neurons with flood-filling networks. Nat. Methods. 2018;15:605–610. doi: 10.1038/s41592-018-0049-4. [DOI] [PubMed] [Google Scholar]
- Jékely G., Colombelli J., Hausen H., Guy K., Stelzer E., Nédélec F., Arendt D. Mechanism of phototaxis in marine zooplankton. Nature. 2008;456:395–399. doi: 10.1038/nature07590. [DOI] [PubMed] [Google Scholar]
- Jenett A., Rubin G.M., Ngo T.-T.B., Shepherd D., Murphy C., Dionne H., Pfeiffer B.D., Cavallaro A., Hall D., Jeter J. A GAL4-driver line resource for Drosophila neurobiology. Cell Rep. 2012;2:991–1001. doi: 10.1016/j.celrep.2012.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones L., López-Bendito G., Gruss P., Stoykova A., Molnár Z. Pax6 is required for the normal development of the forebrain axonal connections. Development. 2002;129:5041–5052. doi: 10.1242/dev.129.21.5041. [DOI] [PubMed] [Google Scholar]
- Kasthuri N., Hayworth K.J., Berger D.R., Schalek R.L., Conchello J.A., Knowles-Barley S., Lee D., Vázquez-Reina A., Kaynig V., Jones T.R. Saturated reconstruction of a volume of neocortex. Cell. 2015;162:648–661. doi: 10.1016/j.cell.2015.06.054. [DOI] [PubMed] [Google Scholar]
- Katz P.S., Quinlan P.D. The importance of identified neurons in gastropod molluscs to neuroscience. Curr. Opin. Neurobiol. 2019;56:1–7. doi: 10.1016/j.conb.2018.10.009. [DOI] [PubMed] [Google Scholar]
- Kim D.-W., Yao Z., Graybuck L.T., Kim T.K., Nguyen T.N., Smith K.A., Fong O., Yi L., Koulena N., Pierson N. Multimodal analysis of cell types in a hypothalamic node controlling social behavior. Cell. 2019;179:713–728.e17. doi: 10.1016/j.cell.2019.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klein S., Staring M., Murphy K., Viergever M.A., Pluim J.P.W. elastix: a toolbox for intensity-based medical image registration. IEEE Trans. Med. Imaging. 2010;29:196–205. doi: 10.1109/TMI.2009.2035616. [DOI] [PubMed] [Google Scholar]
- Kljakic O., Janickova H., Prado V.F., Prado M.A.M. Cholinergic/glutamatergic co-transmission in striatal cholinergic interneurons: new mechanisms regulating striatal computation. J. Neurochem. 2017;142(Suppl 2):90–102. doi: 10.1111/jnc.14003. [DOI] [PubMed] [Google Scholar]
- Köster J., Rahmann S. Snakemake--a scalable bioinformatics workflow engine. Bioinformatics. 2012;28:2520–2522. doi: 10.1093/bioinformatics/bts480. [DOI] [PubMed] [Google Scholar]
- Lee K., Zung J., Li P., Jain V., Seung H.S. Superhuman accuracy on the SNEMI3D connectomics challenge. arXiv. 2017 https://arxiv.org/abs/1706.00120 arXiv:1706.00120v1. [Google Scholar]
- Lee D., Kume M., Holy T.E. A molecular logic of sensory coding revealed by optical tagging of physiologically-defined neuronal types. BioRxiv. 2019 doi: 10.1126/science.aax8055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lein E., Borm L.E., Linnarsson S. The promise of spatial transcriptomics for neuroscience in the era of molecular cell typing. Science. 2017;358:64–69. doi: 10.1126/science.aan6827. [DOI] [PubMed] [Google Scholar]
- Marsh E.D., Nasrallah M.P., Walsh C., Murray K.A., Nicole Sunnen C., McCoy A., Golden J.A. Developmental interneuron subtype deficits after targeted loss of Arx. BMC Neurosci. 2016;17:35. doi: 10.1186/s12868-016-0265-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsuda A., Schermelleh L., Hirano Y., Haraguchi T., Hiraoka Y. Accurate and fiducial-marker-free correction for three-dimensional chromatic shift in biological fluorescence microscopy. Sci. Rep. 2018;8:7583. doi: 10.1038/s41598-018-25922-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McInnes L., Healy J., Melville J. UMAP: uniform manifold approximation and projection for dimension reduction. arXiv. 2018 https://arxiv.org/abs/1802.03426 arXiv:1802.03426v3. [Google Scholar]
- McKinney W. In: van der Walt S., Millman J., editors. SciPy; 2010. Data structures for statistical computing in Python; pp. 51–56. (Proceedings of the 9th Python in Science Conference). [Google Scholar]
- Metcalfe W.K., Westerfield M. In: Raymond P.A., Easter S.S., Innocenti G.M., editors. Springer US; 1990. Primary motoneurons of the zebrafish; pp. 41–47. (Systems Approaches to Developmental Neurobiology). [Google Scholar]
- Milyaev N., Osumi-Sutherland D., Reeve S., Burton N., Baldock R.A., Armstrong J.D. The Virtual Fly Brain browser and query interface. Bioinformatics. 2012;28:411–415. doi: 10.1093/bioinformatics/btr677. [DOI] [PubMed] [Google Scholar]
- Miron E., Oldenkamp R., Brown J.M., Pinto D.M.S., Xu C.S., Faria A.R., Shaban H.A., Rhodes J.D.P., Innocent C., de Ornellas S. Chromatin arranges in chains of mesoscale domains with nanoscale functional topography independent of cohesin. Sci. Adv. 2020;6:eaba8811. doi: 10.1126/sciadv.aba8811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nicovich P.R., Taormina M.J., Baker C.A., Nguyen T.N., Thomsen E.R., Garren E., Long B., Gorham M., Miller J., Hage T. Multimodal cell type correspondence by intersectional mFISH in intact tissues. BioRxiv. 2019 [Google Scholar]
- Ohyama T., Schneider-Mizell C.M., Fetter R.D., Aleman J.V., Franconville R., Rivera-Alba M., Mensh B.D., Branson K.M., Simpson J.H., Truman J.W. A multilevel multimodal circuit enhances action selection in Drosophila. Nature. 2015;520:633–639. doi: 10.1038/nature14297. [DOI] [PubMed] [Google Scholar]
- Packer J.S., Zhu Q., Huynh C., Sivaramakrishnan P., Preston E., Dueck H., Stefanik D., Tan K., Trapnell C., Kim J. A lineage-resolved molecular atlas of C. elegans embryogenesis at single-cell resolution. Science. 2019;365:eaax1971. doi: 10.1126/science.aax1971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pape C., Beier T., Li P., Jain V., Bock D.D., Kreshuk A. 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) IEEE; 2017. Solving large multicut problems for connectomics via domain decomposition; pp. 1–10. [Google Scholar]
- Pape C., Matskevych A., Wolny A., Hennies J., Mizzon G., Louveaux M., Musser J., Maizel A., Arendt D., Kreshuk A. Leveraging domain knowledge to improve microscopy image segmentation with lifted multicuts. Front. Comput. Sci. 2019;1:6. [Google Scholar]
- Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Müller A., Nothman J., Louppe G. Scikit-learn: machine learning in Python. arXiv. 2012 https://arxiv.org/abs/1201.0490 arXiv:1201.0490v4. [Google Scholar]
- Pfeffer C.K., Beltramo R. Correlating anatomy and function with gene expression in individual neurons by combining in vivo labeling, patch clamp, and single cell RNA-seq. Front. Cell. Neurosci. 2017;11:376. doi: 10.3389/fncel.2017.00376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phillips J.W., Schulmann A., Hara E., Winnubst J., Liu C., Valakh V., Wang L., Shields B.C., Korff W., Chandrashekar J. A repeated molecular architecture across thalamic pathways. Nat. Neurosci. 2019;22:1925–1935. doi: 10.1038/s41593-019-0483-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pietzsch T., Preibisch S., Tomancák P., Saalfeld S. ImgLib2--generic image processing in Java. Bioinformatics. 2012;28:3009–3011. doi: 10.1093/bioinformatics/bts543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pietzsch T., Saalfeld S., Preibisch S., Tomancak P. BigDataViewer: visualization and processing for large image data sets. Nat. Methods. 2015;12:481–483. doi: 10.1038/nmeth.3392. [DOI] [PubMed] [Google Scholar]
- Purschke G. In: Bartolomaeus T., Purschke G., editors. Springer-Verlag; 2005. Sense organs in polychaetes (Annelida) pp. 53–78. (Morphology, Molecules, Evolution and Phylogeny in Polychaeta and Related Taxa). [Google Scholar]
- Qian X., Harris K.D., Hauling T., Nicoloutsopoulos D., Muñoz-Manchado A.B., Skene N., Hjerling-Leffler J., Nilsson M. Probabilistic cell typing enables fine mapping of closely related cell types in situ. Nat. Methods. 2020;17:101–106. doi: 10.1038/s41592-019-0631-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Randel N., Asadulina A., Bezares-Calderón L.A., Verasztó C., Williams E.A., Conzelmann M., Shahidi R., Jékely G. Neuronal connectome of a sensory-motor circuit for visual navigation. eLife. 2014;3:e02730. doi: 10.7554/eLife.02730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Randel N., Shahidi R., Verasztó C., Bezares-Calderón L.A., Schmidt S., Jékely G. Inter-individual stereotypy of the Platynereis larval visual connectome. eLife. 2015;4:e08069. doi: 10.7554/eLife.08069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rhode B. Development and differentiation of the eye in Platynereis dumerilii (Annelida, Polychaeta) J. Morphol. 1992;212:71–85. doi: 10.1002/jmor.1052120108. [DOI] [PubMed] [Google Scholar]
- Rosenbluth J. OBLIQUELY STRIATED MUSCLE : IV. Sarcoplasmic Reticulum, Contractile Apparatus, and Endomysium of the Body Muscle of a Polychaete, Glycera, in Relation to Its Speed. J Cell Biol. 1968;36:245–259. [PMC free article] [PubMed] [Google Scholar]
- Rueckert D., Sonoda L.I., Hayes C., Hill D.L., Leach M.O., Hawkes D.J. Nonrigid registration using free-form deformations: application to breast MR images. IEEE Trans. Med. Imaging. 1999;18:712–721. doi: 10.1109/42.796284. [DOI] [PubMed] [Google Scholar]
- Rungger-Brändle E., Ripperger J.A., Steiner K., Conti A., Stieger A., Soltanieh S., Rungger D. Retinal patterning by Pax6-dependent cell adhesion molecules. Dev. Neurobiol. 2010;70:764–780. doi: 10.1002/dneu.20816. [DOI] [PubMed] [Google Scholar]
- Satoh N. In: Moody S.A., editor. Elsevier; 1999. Cell fate determination in the ascidian embryo; pp. 59–74. (Cell Lineage and Fate Determination). [Google Scholar]
- Saumweber T., Rohwedder A., Schleyer M., Eichler K., Chen Y.-C., Aso Y., Cardona A., Eschbach C., Kobler O., Voigt A. Functional architecture of reward learning in mushroom body extrinsic neurons of larval Drosophila. Nat. Commun. 2018;9:1104. doi: 10.1038/s41467-018-03130-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schafer W. Nematode nervous systems. Curr. Biol. 2016;26:R955–R959. doi: 10.1016/j.cub.2016.07.044. [DOI] [PubMed] [Google Scholar]
- Scheffer L.K., Xu C.S., Januszewski M., Lu Z., Takemura S.-Y., Hayworth K.J., Huang G.B., Shinomiya K., Maitlin-Shepard J., Berg S. A connectome and analysis of the adult Drosophila central brain. eLife. 2020;9:e57443. doi: 10.7554/eLife.57443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schieber N.L., Machado P., Markert S.M., Stigloher C., Schwab Y., Steyer A.M. Minimal resin embedding of multicellular specimens for targeted FIB-SEM imaging. Methods Cell Biol. 2017;140:69–83. doi: 10.1016/bs.mcb.2017.03.005. [DOI] [PubMed] [Google Scholar]
- Schindelin J., Arganda-Carreras I., Frise E., Kaynig V., Longair M., Pietzsch T., Preibisch S., Rueden C., Saalfeld S., Schmid B. Fiji: an open-source platform for biological-image analysis. Nat. Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmid B., Schindelin J., Cardona A., Longair M., Heisenberg M. A high-level 3D visualization API for Java and ImageJ. BMC Bioinformatics. 2010;11:274. doi: 10.1186/1471-2105-11-274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sebé-Pedrós A., Saudemont B., Chomsky E., Plessier F., Mailhé M.-P., Renno J., Loe-Mie Y., Lifshitz A., Mukamel Z., Schmutz S. Cnidarian cell type diversity and regulation revealed by whole-organism single-cell RNA-seq. Cell. 2018;173:1520–1534.e20. doi: 10.1016/j.cell.2018.05.019. [DOI] [PubMed] [Google Scholar]
- Sebé-Pedrós A., Chomsky E., Pang K., Lara-Astiaso D., Gaiti F., Mukamel Z., Amit I., Hejnol A., Degnan B.M., Tanay A. Early metazoan cell type diversity and the evolution of multicellular gene regulation. Nat. Ecol. Evol. 2018;2:1176–1188. doi: 10.1038/s41559-018-0575-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shamonin D.P., Bron E.E., Lelieveldt B.P.F., Smits M., Klein S., Staring M., Alzheimer’s Disease Neuroimaging Initiative Fast parallel image registration on CPU and GPU for diagnostic classification of Alzheimer’s disease. Front. Neuroinform. 2014;7:50. doi: 10.3389/fninf.2013.00050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strausfeld N.J. Brain homology: Dohrn of a new era? Brain Behav. Evol. 2010;76:165–167. doi: 10.1159/000322150. [DOI] [PubMed] [Google Scholar]
- Sunkin S.M., Ng L., Lau C., Dolbeare T., Gilbert T.L., Thompson C.L., Hawrylycz M., Dang C. Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system. Nucleic Acids Res. 2013;41:D996–D1008. doi: 10.1093/nar/gks1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tasic B., Yao Z., Graybuck L.T., Smith K.A., Nguyen T.N., Bertagnolli D., Goldy J., Garren E., Economo M.N., Viswanathan S. Shared and distinct transcriptomic cell types across neocortical areas. Nature. 2018;563:72–78. doi: 10.1038/s41586-018-0654-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tessmar-Raible K., Steinmetz P.R.H., Snyman H., Hassel M., Arendt D. Fluorescent two-color whole mount in situ hybridization in Platynereis dumerilii (Polychaeta, Annelida), an emerging marine molecular model for evolution and development. Biotechniques. 2005;39:460–462. doi: 10.2144/000112023. 464. [DOI] [PubMed] [Google Scholar]
- Tessmar-Raible K., Raible F., Christodoulou F., Guy K., Rembold M., Hausen H., Arendt D. Conserved sensory-neurosecretory cell types in annelid and fish forebrain: insights into hypothalamus evolution. Cell. 2007;129:1389–1400. doi: 10.1016/j.cell.2007.04.041. [DOI] [PubMed] [Google Scholar]
- Tischer C. ElastixWrapper: Fiji plugin for 3D image registration with elastix. Zenodo. 2019 [Google Scholar]
- Titze B., Genoud C. Volume scanning electron microscopy for imaging biological ultrastructure. Biol. Cell. 2016;108:307–323. doi: 10.1111/boc.201600024. [DOI] [PubMed] [Google Scholar]
- Titze B., Genoud C., Friedrich R.W. SBEMimage: versatile acquisition control software for serial block-face electron microscopy. Front. Neural Circuits. 2018;12:54. doi: 10.3389/fncir.2018.00054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tomer R., Denes A.S., Tessmar-Raible K., Arendt D. Profiling by image registration reveals common origin of annelid mushroom bodies and vertebrate pallium. Cell. 2010;142:800–809. doi: 10.1016/j.cell.2010.07.043. [DOI] [PubMed] [Google Scholar]
- Tosches M.A., Arendt D. The bilaterian forebrain: an evolutionary chimaera. Curr. Opin. Neurobiol. 2013;23:1080–1089. doi: 10.1016/j.conb.2013.09.005. [DOI] [PubMed] [Google Scholar]
- van der Walt S., Colbert S.C., Varoquaux G. The NumPy Array: a structure for efficient numerical computation. Comput. Sci. Eng. 2011;13:22–30. [Google Scholar]
- van der Walt S., Schönberger J.L., Nunez-Iglesias J., Boulogne F., Warner J.D., Yager N., Gouillart E., Yu T., scikit-image contributors scikit-image: image processing in Python. PeerJ. 2014;2:e453. doi: 10.7717/peerj.453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verasztó C., Ueda N., Bezares-Calderón L.A., Panzera A., Williams E.A., Shahidi R., Jékely G. Ciliomotor circuitry underlying whole-body coordination of ciliary activity in the Platynereis larva. eLife. 2017;6:e26000. doi: 10.7554/eLife.26000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verasztó C., Jasek S., Gühmann M., Shahidi R., Ueda N., Beard J.D., Mendes S., Heinz K., Bezares-Calderón L.A., Williams E.A. Whole-animal connectome and cell-type complement of the three-segmented Platynereis dumerilii larva. BioRxiv. 2020 [Google Scholar]
- Vergara H.M., Bertucci P.Y., Hantz P., Tosches M.A., Achim K., Vopalensky P., Arendt D. Whole-organism cellular gene-expression atlas reveals conserved cell types in the ventral nerve cord of Platynereis dumerilii. Proc. Natl. Acad. Sci. USA. 2017;114:5878–5885. doi: 10.1073/pnas.1610602114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Virtanen P., Gommers R., Oliphant T.E., Haberland M., Reddy T., Cournapeau D., Burovski E., Peterson P., Weckesser W., Bright J., SciPy 1.0 Contributors SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vopalensky P., Tosches M.A., Achim K., Handberg-Thorsager M., Arendt D. From spiral cleavage to bilateral symmetry: the developmental cell lineage of the annelid brain. BMC Biol. 2019;17:81. doi: 10.1186/s12915-019-0705-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walz B., Schäffner K.H., Sawyer R.T. Ultrastructure of the anterior salivary gland cells of the giant leech, Haementeria ghilianii (Annelida, Hirudinea) J. Morphol. 1988;196:321–332. doi: 10.1002/jmor.1051960305. [DOI] [PubMed] [Google Scholar]
- Wanner A.A., Friedrich R.W. Whitening of odor representations by the wiring diagram of the olfactory bulb. Nat. Neurosci. 2020;23:433–442. doi: 10.1038/s41593-019-0576-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wanner A.A., Genoud C., Masudi T., Siksou L., Friedrich R.W. Dense EM-based reconstruction of the interglomerular projectome in the zebrafish olfactory bulb. Nat. Neurosci. 2016;19:816–825. doi: 10.1038/nn.4290. [DOI] [PubMed] [Google Scholar]
- Wanner A.A., Genoud C., Friedrich R.W. 3-dimensional electron microscopic imaging of the zebrafish olfactory bulb and dense reconstruction of neurons. Sci. Data. 2016;3:160100. doi: 10.1038/sdata.2016.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weigert A., Helm C., Meyer M., Nickel B., Arendt D., Hausdorf B., Santos S.R., Halanych K.M., Purschke G., Bleidorn C., Struck T.H. Illuminating the base of the annelid tree using transcriptomics. Mol. Biol. Evol. 2014;31:1391–1401. doi: 10.1093/molbev/msu080. [DOI] [PubMed] [Google Scholar]
- White J.G., Southgate E., Thomson J.N., Brenner S. The structure of the nervous system of the nematode Caenorhabditis elegans. Philos. Trans. R. Soc. Lond. B Biol. Sci. 1986;314:1–340. doi: 10.1098/rstb.1986.0056. [DOI] [PubMed] [Google Scholar]
- Wickham H., Averick M., Bryan J., Chang W., McGowan L., François R., Grolemund G., Hayes A., Henry L., Hester J. JOSS; 2017. Welcome to the Tidyverse. [Google Scholar]
- Williams E.A., Jékely G. Neuronal cell types in the annelid Platynereis dumerilii. Curr Opin Neurobiol. 2019;56:106–116. doi: 10.1016/j.conb.2018.12.008. [DOI] [PubMed] [Google Scholar]
- Williams E.A., Verasztó C., Jasek S., Conzelmann M., Shahidi R., Bauknecht P., Mirabeau O., Jékely G. Synaptic and peptidergic connectome of a neurosecretory center in the annelid brain. eLife. 2017;6:e26349. doi: 10.7554/eLife.26349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wolf S., Pape C., Bailoni A., Rahaman N., Kreshuk A., Köthe U., Hamprecht F.A. In: Ferrari V., Hebert M., Sminchisescu C., Weiss Y., editors. Springer International; 2018. The Mutex Watershed: efficient, parameter-free image partitioning; pp. 571–587. (Computer Vision – ECCV 2018: 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part IV). [Google Scholar]
- Wolff G.H., Strausfeld N.J. Genealogical correspondence of a forebrain centre implies an executive brain in the protostome-deuterostome bilaterian ancestor. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2016;371:20150055. doi: 10.1098/rstb.2015.0055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeisel A., Hochgerner H., Lönnerberg P., Johnsson A., Memic F., van der Zwan J., Häring M., Braun E., Borm L.E., La Manno G. Molecular architecture of the mouse nervous system. Cell. 2018;174:999–1014.e22. doi: 10.1016/j.cell.2018.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang K., Wang M., Zhao Y., Wang W. Taiji: system-level identification of key transcription factors reveals transcriptional waves in mouse embryonic development. Sci. Adv. 2019;5:eaav3262. doi: 10.1126/sciadv.aav3262. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data and code generated during this study are publicly available.
The Electron microscopy data and EM based segmentation are archived and can be downloaded from EMPIAR (https://www.ebi.ac.uk/pdbe/emdb/empiar/) under the accession id 10365. Gene expression maps, prospr segmentations, and registration files are archived and can be downloaded from BioStudies (https://www.ebi.ac.uk/biostudies/) under the accession id S-BIAD14.
In addition to EMPIAR and BioStudies we host all image data on an aws-s3 (http://aws.amazon.com/s3) compatible object store at EMBL Heidelberg using the open source implementation of MinIO (https://min.io). The data are stored in the n5 data format (Bogovic et al., 2020), which allows chunking and compression for efficient data access. It can be accessed on demand through the n5 s3 API (https://github.com/saalfeldlab/n5-aws-s3).
The tabular data derived from images and segmentations are hosted on github: https://github.com/mobie/platybrowser-datasets/tree/master/data/1.0.1/tables. The repository https://github.com/mobie/platybrowser-datasets also contains the image metadata in the BigDataViewer xml file format, which serves as an entry point to start the PlatyBrowser for the data hosted on the s3 object store. In order to keep track of changes in the derived data, for example due to segmentation corrections, we use a versioning scheme inspired by https://semver.org/. Between each version only the data that has actually changed is updated, whereas unchanged data are referred to by links to older versions.
The software package used for EM acquisition SBEMImage is available at https://github.com/SBEMimage/SBEMimage and described in the publication (Titze et al., 2018). The software for generating the virtual cells from ProSPr is available at https://github.com/mobie/prospr-vc-generation and under the https://doi.org/10.5281/zenodo.4899523. The software for EM segmentation and morphological as well as genetic analysis of segmented cells is available at https://github.com/mobie/platybrowser-datasets and under the https://doi.org/10.5281/zenodo.4899527. The MoBIE Fiji viewer plugin is available at https://github.com/mobie/mobie-viewer-fiji and under the https://doi.org/10.5281/zenodo.2602754.