Abstract
Animal nervous system organization is crucial for all body functions and its disruption can manifest in severe cognitive and behavioral impairment1. This organization relies on features across scales, from nanoscale localization of synapses, through multiplicities of neuronal morphologies and their contribution to circuit organization, to stereotyped connections between different regions of the brain2. The sheer complexity of this organ means that, to date, we have yet to reconstruct and model the structure of a complete nervous system that is integrated across all these scales. Here, we present a complete structure-function model of the nematode C. elegans main neuropil, the nerve ring, which we derive by integrating the volumetric reconstruction from two animals with corresponding3 synaptic and gap junctional connectomes. Whereas previously the nerve ring was considered a densely packed tract of neural processes, we uncover internal organization and show how local neighborhoods spatially constrain and support the synaptic connectome. We find that the C. elegans connectome is not invariant, but that a precisely wired core circuit is embedded in a background of variable connectivity, and propose a corresponding reference connectome for the core circuit. Based on this reference, we propose a modular network architecture of the C. elegans brain that supports sensory computation and integration, sensorimotor convergence and brain-wide coordination. These findings point to scalable and robust features of brain organization that are likely universal across phyla.
A primary goal of systems neuroscience is to understand how the brain’s structure and function combine to generate behavior. Since the discovery of neurons and their connections through synapses and gap junctions, a major effort has focused on characterizing these units and the micro- and macro-circuits that they comprise, culminating in a growing body of high-resolution nanoconnectomic data across species3–12. Naturally, data, however rich, cannot on their own provide explanatory power to address the computation within circuits or to determine how these circuits communicate and coordinate information flow to generate behavior. Indeed, constructing a comprehensive brain map will require a meaningful strategy for integrating structure and function across scales. Achieving this feat in even a small animal can provide a useful model for postulating principles of brain organization across scales2.
The free-living nematode C. elegans has a small, compact nervous system3,5,7,13 while exhibiting a range of complex, individualized behaviors, making it an ideal model system for studies of whole brain organization2. All 302 C. elegans neurons have been anatomically characterized based on serial sectioned electron micrographs (EM)5 to produce a whole animal connectome3,5,13. This animal’s invariant cell-lineage14 and anatomy5 might suggest that its connectome too is invariant15. Unfortunately, the small sample size of available reconstructions has precluded a reliable estimate of reproducibility and variability of the synaptic connectome. Furthermore, while the synaptic wiring has been exhaustively characterized3,5,13,16,17, the spatial proximity of neurons is only partially determined18,19. Thus, it remains to be determined whether lessons about whole brain organization in C. elegans can inform questions and approaches for other systems.
We provide two complete volumetric reconstructions of the C. elegans nerve ring from legacy EMs5, from one adult and one larval stage 4 (L4) animal (Methods, Supplementary Table 1, Supplementary Videos 1–3, Supplementary Information 1). The two EM series (with roughly 300 sections in the L4 and 400 in the adult) span approximately the same 36 μm long volume, starting in the anterior and ending in the ventral ganglia (Fig. 1a). Our reconstructions provide the first contactome: a complete, nanoresolution dataset of all neuronal membrane contacts in the nerve rings of these two animals. We define two neurons as immediate neighbors if the membranes along their neural processes are physically adjacent in at least one EM section18. To characterize synaptic pathways within a spatial context, we integrated our volumetric reconstructions with our recent rescoring of synapses on the same L4 and adult animals3 (for validation and comparison with other datasets5,20, see Methods).
Conserved and variable circuits overlap
Consistent with White et al.5, our volumetric reconstructions show that neural processes are bilaterally (left/right) conserved (Supplementary Results, Supplementary Videos 4–7). We hypothesized that the bilateral symmetry of C. elegans processes extends to the nanoscale to support a homology of membrane contacts and synapses between cells. Homologous processes exhibit statistically high overlaps in the size and composition of their immediate neighborhood (Extended Data Fig. 1a–c) and in membrane contact locations along their processes (Methods, Extended Data Fig. 1d–f, Supplementary Information 2). In contrast, the smallest 35% of membrane contacts (< 0.4 μm2) are not reproducible (Extended Data Fig. 2a), account for only 2% of total membrane contact area between all neurons (Extended Data Fig. 2b,c) and contain predominantly nonreproducible synaptic contacts (Extended Data Fig. 2e). As such, we exclude them from our analysis. We conclude that the reproducibility of neuronal processes and their immediate neighborhoods supports a stereotyped pattern of cell-cell membrane contacts.
The availability of two reconstructions, combined with the bilateral homology of the nerve ring, naturally lends itself to establishing a reference dataset that is more likely conserved across animals, providing a basis to address mechanistic questions about precision and variability of the connectome at nanoscale resolution. We defined the adjacency graph, , of membrane contacts across 4 datasets (adult left, adult right, L4 left and L4 right), where labels the number of datasets in which a membrane contact occurs (Supplementary Information 3). The reference dataset, i.e. the most reproducible membrane contacts, comprises ~40% of all membrane contacts (Extended Data Fig. 2g) and exhibits above average membrane contact area (Extended Data Fig. 2h). Adjacency graphs of chemical synapse, , and gap junction, , contacts are similarly defined (Supplementary Information 3). We define , and contacts as reference datasets and hypothesize that the set of membrane contacts is representative of the conserved membrane contacts across individuals in C. elegans and is more likely to support a conserved synaptic connectome.
To examine this hypothesis, we exploit the combined spatial and synaptic information across datasets over the entire neuropil. We assume that stereotyped wiring patterns require precision to find target neurons and specificity to avoid off-target neurons, and formulate statistical models of membrane and synaptic contacts to capture their relative propensity to occur in 1, 2, 3 or all 4 of the datasets (Methods). We find that a minimal model with three parameters suffices (Methods); these are the fraction of target contacts, , the precision, , for target contacts, and the frequency to avoid off-target contacts or specificity, . Despite their parsimony, these models yield good fits for the distribution of membrane, synaptic and gap junctional contacts across the 4 datasets (Methods, Fig. 2a). The high reproducibility of membrane contacts across datasets ( count) is consistent with our model prediction that less than half of membrane contacts are actively targeted (, Fig. 2a) with high precision . The significant variability across datasets is accounted for by a non-negligible basal membrane contact rate ( ~25-30%). Therefore, high precision combined with basal connectivity are required to account for the reproducibility and variability of membrane contacts across datasets (Fig. 2a; Supplementary Results, Extended Data Fig. 3).
How useful is the reference in predicting conserved membrane contacts? Our model predicts that ~99% of the contacts and 68% of the contacts together constitute the vast majority (≳98%) of the core neuronal membrane adjacency matrix of the C. elegans nerve ring (Methods). Furthermore, above average membrane contacts (>1.77μm2) comprise more than 80% of contacts (Extended Data Fig. 2h) and are more reproducible (with higher precision, , and larger fraction, , Extended Data Fig. 3a–b). We conclude that the dataset offers an excellent candidate set of conserved membrane contacts. While highly reproducible, core membrane contacts are not easily distinguished from variable ones. Our model predicts that ~50% of membrane contacts are variable across animals. Using model-generated surrogate datasets (Methods), we estimate that 20 datasets (from 10 animals, with 2 datasets per bilateral reconstruction) would suffice to identify all core membrane contacts in the C. elegans nerve ring (Fig. 2b).
To model synaptic and gap-junctional precision, we re-fit the model to and (Methods). To control for synaptic variability due to differences in process placement, we restricted our analysis to contacts (for a more general treatment, see Extended Data Fig. 4a–c). Even among reproducible membrane contacts, our model predicts that high precision (p > 0.90) combined with basal connectivity () are required to account for the reproducibility and variability of synaptic and gap junctional contacts across datasets (Fig. 2a; Supplementary Results, Extended Data Fig. 4d–e). For the bilateral worm, a synaptic precision of 93% implies a ~99% probability of a core synaptic contact occurring at least once per animal (on the left, right or both sides), and ≳97% chance to occur in at least 3 of 4 datasets (across 2 animals). Conversely, we predict that ~98% of and are good representatives of the core circuit (as well as >60% of and ), lending further confidence to the usefulness of the reference connectome. However, the placement of the most reproducible synapses along the process is not restricted to reproducible membrane contact sites (Extended Data Fig. 1g–h). Thus, location along the process cannot be used to distinguish core from variable synapses. Taken together, these results demonstrate that each dataset can be divided into a common, precisely targeted core circuit and a variable component, and that, given additional connectomes, it should be possible to distinguish between them (Extended Data Fig. 4d,e).
We next asked what principles of spatial organization support the reproducible, highly specified neuronal placement in the nerve ring. To address this question, we noted that the observed variability of membrane contacts suggests that no one animal is representative of the population at large and even core contacts likely vary across individuals (Extended Data Fig. 4h,i). Hypothesizing that conserved membrane contacts form the basis of the neuropil organization, we estimated the expected variability in our reference contacts across a population of animals by computing the variability in contact areas across the four datasets at our disposal. We used the reference membrane contact distributions and their associated membrane contact area variability across the datasets to generate stochastic population models of core membrane contacts from the L4 and adult bilateral datasets and the reference dataset (Methods, Extended Data Fig. 5). To group together neurites with high spatial affinity, we used a multi-level graph clustering algorithm21 on each individual in our population model (Methods). We find that 5 subgroups of neurons consistently emerge from the data whose processes are spatially ordered along the anterior-posterior axis of the nerve ring (Fig. 1a, Supplementary Information 4). We label these clusters anterior, lateral, sublateral, avoidance and taxis (Supplementary Results). Regionalization of processes in the nerve ring into the anterior circuit (associated with mechanosensation), the posterior, amphid neural circuit (associated with chemosensation and navigation) and lateral and sublateral neurons (associated primarily with head motor control) has previously been highlighted5. Our quantitative analysis is consistent with the above description but our focus on the core nanostructure reveals finer organization of the nerve ring that may not be apparent from the raw volumetric data (Methods, Extended Data Fig. 6).
We asked whether the cluster organization of the nerve ring is indicative of modularization of synaptic pathways3,13,18,19,22. We find that most neurons have strong membrane and synaptic contacts within a single cluster, whereas others physically and synaptically contact neurons across multiple clusters (Fig. 3a-b, Extended Data Fig. 7). However, synaptically sparse lateral neurons and a number of neurons that closely link across the lateral and sublateral neighborhoods suggest that lateral and sublateral clusters may be merged for purposes of information processing analysis.
Neurons that synapse across clusters are often characterized by processes that change neighborhood along their trajectories (Fig. 3c-e, Extended Data Fig. 8k). We identified 33 cell classes whose processes synapse across different regions of the nerve ring (Methods). These cell classes use two principal strategies: synapse compartmentalization (19/33 cell classes, Supplementary Information 4, Fig. 3e) and flattened protrusions (23/33 cell classes, Supplementary Information 4). We hypothesize that a subset of neurons synaptically link different neighborhoods of the nerve ring to support brain-wide coordinated activity23. Consistent with our hypothesis, these specialized spatial features and the synapses they support are largely conserved across our 4 datasets. In summary, we find that the nerve ring obeys a consistent set of spatial organization principles across scales, including a macroscopic modular neighborhood organization which supports the mesoscopic organization along neurites, microscopic precision of membrane contacts and nanoscopic morphological features, that together support conserved synaptic wiring.
A C. elegans brain map
We integrate the knowledge gained to map the architecture of the C. elegans brain: The high-level spatial organization (Fig. 1a) – the ‘macro-connectome’2 – suggests modular circuits, with distinct functional roles. Neuronal organization within and across spatial regions, comprising predominantly local and some cross-cutting neurons (Fig. 3a-c) that exhibit micro- and nanoscale structures (Fig. 3e-h), allows us to map the coordination across the nerve ring. Our reference connectome allows us to focus on reliable, likely conserved connectivity (Fig. 2). Finally, classification of neurons as sensory, interneuron and motoneuron allows us to trace sensorimotor pathways within and across these modules. By combining these features in the data, we set out to construct a brain map of the C. elegans nerve ring.
We posit a parsimonious 3-layer architecture with parallel information processing modules and assign every neuron of the nerve ring into a layer roughly corresponding to the 5 neuron clusters (Methods). To achieve overall feed-forward pathways, sensory neurons all occupy the first layer whereas spatially cross-cutting neurons dominate layer 3 (Methods, Fig. 4). Connectomic features, identified from network analysis of the C. elegans connectome (such as highly connected ‘hub’ neurons, high assortativity hubs known as ‘rich-club neurons’24,25, network motifs13,26 and the small-world organization13 as well as new features such as fan-in and fan-out motifs13 (characterized by higher in- or out-degrees, respectively, Extended Data Fig. 9a) can now be interpreted within the context of modular, brain-wide computation and information flow (Supplementary Results, Extended Data Fig. 9b–j). In particular, the feed-forward loop motif, previously identified in the C. elegans connectome3,13,26, reappears in our map as the skeleton of the layered synaptic pathways within each module (>50% of all contacts; Fig. 4a, Extended Data Fig. 10 shows additional contacts). This system-wide feed-forward connectivity is reminiscent of the layered connectivity of pyramidal neurons in the mammalian cortex and its biologically inspired analogue – Residual Networks (ResNets)27. Such architectures have been conjectured to enhance the resilience of synaptic pathways and to support flexibility and plasticity27.
Examination of the C. elegans brain map (Fig. 4b) reveals a number of features. Layer 1 separates the modules (with a few notable and functional exceptions, Extended Data Fig. 10). The intra-module, intra-layer connectivity indicates that sensory neurons likely perform limited sensory computation in addition to sensory encoding of environmental cues, and allows the identification of sensory hub (high-degree) neurons (Supplementary Results). Layer 2 largely maintains the modular synaptic information flow. Convergence of sensory neurons onto this sparser layer reveals a fan-in architecture, supporting modular sensory integration (Supplementary Results, Extended Data Fig. 9c–d). Layer 3 contrasts with the above. Inputs are received from all three layers: Synapses from layers 1 and 2 comprise the core of each module, whereas layer-3 synapses interlink and couple the modules, forming a recurrent, highly distributed circuit, consistent with the dominance of spatially complex neurons in this layer and suggestive of brain-wide coordination roles (Extended Data Fig. 9e–j). Outputs from the nerve ring control the pharynx, head and neck muscles and the motor circuit of the ventral nerve cord (VNC). The taxis and avoidance modules support distinct information pathways (Fig. 4b) despite responding to overlapping sensory cues and both synapsing onto the VNC command interneurons. In contrast, the sublaterals highlight cross-connectivity within the nerve ring, with all but two neuron classes occupying layer 3. Pharyngeal output is mediated by layer-2 anterior neurons, indicating that the pharyngeal control is independent of the distributed layer-3 circuit. In contrast, head and neck muscles are controlled by layer-3 anterior, lateral and sublateral neurons and the VNC is controlled by all layer-3 modules, revealing the convergence of sensory pathways and associated modular subcircuits into a small number of highly coordinated motor programs.
Discussion
The C. elegans connectome has been available for over 30 years, and yet the delineation of functions within its main neuropil is still incomplete. By characterizing the spatial embedding of its connectome, we sought insight into the structures that could support a hierarchical, modular and nested architecture in the C. elegans brain. Previous analyses of the C. elegans connectome identified a common feed-forward loop motif among triplets of neurons3,26. Our brain map recasts this local motif as an architectural motif, reminiscent of layered cortical architectures28 and their artificial analogue, Residual Networks27. Such a ‘connectionist’ description of a biological brain provides a promising methodology for identifying parallel and distributed circuits.
While there are no physical boundaries within the nerve ring, our analysis points to spatial clustering of neural processes into five neighborhoods. The parallel pathways in our brain map largely fall into this modular neighborhood organization, linking spatial and functional organization. The spatial organization may also reflect developmental roles of nerve ring pioneers29 and constraints on synaptic and neuromuscular connectivity for motor coordination and control functions. Within the Residual-Network template are intra-layer local circuits, whose neurons by-and-large lack structural or functional compartmentalization. Thus, consistent with the neuron doctrine, within local subcircuits, neurons represent the basic unit of computation. However, the modular architecture converges within the final layer to achieve brain-wide coordination of behavior. In this distributed circuit, the nanoconnectome rules: specialized subcellular structures give rise to compartmentalized dynamics and interlink distant regions of the C. elegans brain. Similar subcellular structures performing analogous functions, found in thalamic local interneurons30, reveal a richness of subcellular computation. Thus, brain-wide coordination may be achieved by designated processes that interface between or thread across multiple subcircuits to underpin sensory convergence and sensorimotor transformations. the C. elegans brain map and its nested architecture might suggest a much closer analogy between the C. elegans neuropil and the coordination between the nano- and macro-connectomes of other invertebrates and even vertebrates31.
The concept of a reference connectome was key to our brain map and the modeling framework we used to establish this reference can easily be extended to accommodate future connectomes. In vertebrates, nanoscale organization underpinning individual synapses is variable, supporting individual wiring, plasticity and adaptability. In C. elegans, the proportion of conserved synapses was unknown. We found that the connectome consists of a core, conserved circuit that is embedded in a significant variable background. While pinning down the extent of the variable circuit is challenging due to the technical limitations of synaptic scoring and will therefore require multiple further connectomes, it is noteworthy that conserved synapses, like most variable ones, are constrained by the same contactome. Thus, if the core circuit represents the baseline functionality of the animal, the variable component could support redundancy, individuality32 and plasticity6.
The large number of cell classes, so densely packed in the nerve ring, presents a challenge to physically achieving stereotyped connectivity. Our finding of finely orchestrated organization across scales imposes spatial constraints on neurite and synaptic placement, thus restricting each neuron’s connectivity problem to a local neighborhood. This scalable solution is robust across a large population and naturally generalizes to much larger nervous systems. Viewed differently, the spatial organization reduces the required capacity for cell-cell molecular recognition machinery, while increasing the complexity of mechanisms producing the cell’s morphology and relative positioning in the tissue. But how is the neighborhood organization developmentally orchestrated? Previous models of neuropil development have proposed that pioneer neurites guide follower neurons33. While such models could be generalized to identify the pioneers of each neihborhood34, the highly reproducible pattern of membrane contacts indicates a more elaborate developmental mechanism. In complementary models, some guidance molecules would coordinate the relative neighborhood placement and others – the placement of neurites33. Identifying key guidance molecules in early nerve ring formation may help to address such predictions29,33. Whatever the developmental mechanisms may be, the brain map of C. elegans requires that these mechanisms too are nested and coordinated across scales to guide and support the modular, scalable and flexible neural architecture that produces the mind and behavior of the nematode C. elegans.
Methods
Anatomical and neuron-class nomenclature
The anatomy of the C. elegans nerve ring, associated ganglia and the delineation of 6 nerve bundles entering the nerve ring, was described in detail by Ware et al.35. Early observations, e.g. the distinction between papillary and amphid sensory specializations and their postulated mechano- and chemo-sensory roles have been validated since. Individual cell classes were identified and named by White et al.5. Each neuron name consists of either two or three uppercase letters indicating class and in some cases a number indicating the neuron number within one class (e.g. IL1, IL2). Bilaterally symmetric neurons (cell pairs) have a three letter/number class name followed by L (left) or R (right). Radially symmetrical neurons (with either 4 or 6 members) have a three-letter name followed by D (dorsal), or V (ventral), L (left) or R (right) (e.g. SIADL, SIADR, SIAVL, SIAVR and RMDL, RMDR, RMDDL, RMDDR, RMDVL, RMDVR). Unless otherwise noted, we use the term class synonymously with bilateral cell pair for radially symmetric cell classes (e.g. SIAV and SIAD are treated as separate classes). Additionally, 17 nerve ring neurons constitute the only members of their class (ALA, ALM, ALN, AQR, AVL, AVM, DVA, DVC, PVT, PVR, RID, RIH, RIR, RIS, RMED, RMEV and SABD). A small number of VNC motoneurons also enter the nerve ring. These VNC motoneurons names consists of two uppercase letters indicating muscle innervations (V: ventral, D: dorsal) and class (A-C) and a number indicating the neuron within one class (counted from anterior to posterior). Neurons are designated as sensory neurons, interneurons or motoneurons following their primary descriptions in WormAtlas36 (excluding proprioception from the sensory designation). We note, however, that in C. elegans, these designations are not exclusive. Our use of the term neighborhood to describe processes that run closely together in the nerve ring follows White et al.5,18. We use the stronger term immediate neighborhood to designate neural processes that make physical contact.
Electron micrograph (EM) preparation
The two legacy electron micrographs (EM) series used in this study were constructed in the MRC Laboratory of Molecular Biology (Cambridge, UK) during the 1970s. Both series are of hermaphrodite worms of the wild-type N2 (Bristol) strain. Worms were fixed in 1% osmium tetroxide in 0.1 M sodium phosphate, pH 7.5 for 1 h at 20°C before embedding, sectioning and post-staining5. This method was previously determined to best bring out cell membranes and synaptic structures at the expense of features within the cytoplasm. The EM series are transverse to the longitudinal axis of the worm; estimated section thickness is 70–90 nm, judged by silver color37. The original 55 cm × 60 cm montaged prints covering the nerve ring commissure and 30 cm × 40 cm covering the posterior lobe of the nerve ring have since been digitized, archived in the Hall Laboratory and available at www.wormimage.org.
The two series reconstructed for this study include the synaptically dense nerve ring neuropil and ventral ganglia regions of the anterior nervous system. One series is from a larval stage 4 (L4) worm and the other series is from an adult (estimated three days from adulthood3), referred to as JSH and N2U, respectively. The JSH series extends from just anterior of the nerve ring to the excretory pore. The N2U series is substantially longer, extending from just anterior of the nerve ring to the vulva. We only considered the section of the N2U series that physically corresponds to the JSH series. This resulted in 302 sections in the N2U series compared to 410 sections in the JSH series. In N2U, starting at the nerve ring posterior lobe, only every other EM section was imaged (N2U EM sections 183–302). Additionally, it is speculated that the JSH images may have slightly smaller section thickness. To correct for this when making comparisons between the L4 and the adult, data from this region in N2U was scaled by a factor of 2.
EM segmentation
EMs were manually segmented using TrakEM2 software38. The software provides GUI tools to facilitate the segmentation of cells across an EM stack. Within the EM series, we segmented all neuronal cell bodies and processes that extend into the nerve ring (Supplementary Videos 1 and 2). Cell bodies were then removed from our membrane contact analysis, because their large sizes skew the cell contact distribution. We also segmented the portion of the pharynx in the nerve ring, which serves as both a visual reference and spatial reference for the cylindrical coordinates. We did not segment dendrites of sensory neurons, because dendrites have very few synapses and therefore were not of interest for our analysis. We also did not reconstruct the sublateral cells SABVL and SABVR whose anterior processes leave the ventral nerve cord via the amphid commissure5. Measurements of the membrane contact between neurons were taken directly from the TrakEM2 XML data. We estimated each pixel to be ~5 nm2, based on size measurements of cell bodies which are estimated to be 2–3 μm wide. In all, we segmented 181 and 185 cells that innervate the L4 and adult nerve rings, respectively (the “complete dataset”).
Extracting adjacency data
We developed custom software (parsetrakem2, https://github.com/cabrittin/parsetrakem2) to quantify the pairwise membrane contacts between TrakEM2 segmented processes. In each EM, TrakEM2 stores each segmented cell as a set of boundary points. For each segmented cell, , our software defines a search radius that is proportional to the diameter of the segmented cell . Any immediately neighboring segmented cell, , that has a boundary point within the search radius is checked for adjacency to . We define adjacency for the pair (, ) as the number of boundary points of that are less than 10 pixels (~50 nm) from the boundary points of . We found that a radius of 10 pixels was sufficiently large to ensure that adjacencies were not missed. Any cell pairs erroneously identified as adjacent could easily be screened out in downstream analysis based on membrane contact area.
To check the accuracy of the algorithm, for two EM sections, we compared the contacts scored by our software to those obtained from manual scoring of membrane contacts (Supplementary Table 2). For manual scoring of membrane contacts, we used the connector feature in TrakEM2 to generate a connectivity graph of adjacent cells. An EM section with cells has possible cell pairings that were then classified as either adjacent (if the cells touch) or non-adjacent (if the cells do not touch). We assume the manually scored contacts to be the ‘ground truth’, which we use to define true positives , false positives , true negatives and false negatives in our automated classification. Sensitivity, defined as , measures how likely two physically touching cells are classified by our software as adjacent. Specificity, defined as , measures how likely two separate cells are classified by our software as non-adjacent. Aggregating results from the two manually scored EM sections (JSH001 and JSH040), the sensitivity and specificity of our classification algorithm are 0.974 and 1.00, respectively (Supplementary Table 2). In other words, the algorithm will miss ~2.5% of adjacent cell pairs within an EM section and a negligible number (<0.05%) of separate cell pairs will be incorrectly classified as adjacent. We assessed the missed adjacent cells in our test set and found that these adjacencies were small (tens of nanometers) and resulted primarily from poor segmentation (the manual cell segmentation did not extend completely to the cell membrane). Furthermore, all 11 cell pairs incorrectly classified as non-adjacent in the two test EM sections were correctly classified as adjacent in subsequent EM sections. As adjacent cell pairs missed in one EM section are likely to be correctly classified as adjacent in subsequent EM sections, and because most of our analyses aggregate adjacencies across EM sections, any missed adjacencies within a single EM section is likely to have negligible impact on our results.
As an additional test, we compared the adjacent cells extracted by our algorithm to the adjacent cells previously reported for a small subset of neurons based on a sparse analysis of physical adjacency in the L418. White et al. determined the neighborhoods of cells AIAR, AIBR and AQR in the L4 (JSH) EM series. Our algorithm was able to find all but 1 adjacent cell pair in the White et al.18 neighborhoods (Supplementary Information 5). Closer inspection revealed that the cell pair does not make physical contact and was thus mis-scored by White et al. as adjacent. Furthermore, we found an additional 69 adjacent cell pairs not included in the White et al. neighborhoods (Supplementary Information 5). Therefore, our volumetric dataset is more extensive than those previously reported.
EM annotation for synaptic connectivity
We used our previously published connectivity data for chemical synapses and gap junctions and refer the reader to Cook et al.3 for details on how synapses were annotated. Briefly, we used custom software39 to aid manual annotation of chemical synapses and gap junctions. For chemical synapses, presynaptic cells are identified by the presence of a presynaptic density while postsynaptic cells are identified as the cells directly apposed to the presynaptic density. Most synapses are polyadic – multiple postsynaptic partners are assigned to a single presynaptic cell. Gap junctions are recognized as a straightened or slightly curving region of apposed membranes with increased staining and a uniform small gap. For the purpose of the current study, we restrict the synaptic and gap junctional dataset to those in our volumetric reconstruction (i.e. those scored in EMs that were segmented for this study). In all, the numbers of synapses and gap junctions scored is larger than in the original connectome5, with a notable increase in synapses that were scored in only 1 EM section. Within our reference dataset (see Generating reference graphs below), Cook et al. (2019)3 scored an additional 489 synaptic contacts to the White et al. (1986)5 connectome, of which 249 (49%) synaptic contacts only occur in 1 EM section. To control for the possibility of false positives in this annotation, more restricted datasets were constructed for validation (see Validation against test datasets, below).
Generating reference graphs
In order to control for variations in connectivity, we found it useful to map the data to a novel data structure, which we call a reference graph. Reference graphs classify contacts (defined as the aggregate pairwise connections over all EM sections within a dataset) based on their degree of reproducibility across datasets. We took advantage of the bilateral symmetry of the worm to effectively double our sample size. We therefore generated 4 datasets (adult left/right and L4 left/right) from the two reconstructed nerve rings. For a sample size of , simply averaging across datasets is not a useful way to build a reference model of the data. Instead, we segregate the contacts into 4 separate categories based on their reproducibility. To this end, we removed from our analysis a number of neurons that exhibit appreciable differences in synaptic connectivity or process morphology laterally (PLN, PVN, HSN), between the L4 and adult (HSN, PVR, SABD), or those that make minimal membrane contact in the nerve ring (in VB, VC and VD classes), leaving 173 cells in 93 cell classes (the “restricted dataset”, Supplementary Information 3). The restricted dataset excludes HSNR, PLNL, PLNR, PVNL, PVR, SABD, VB01 and VD01 neurons – in both L4 and adult – and HSNL, PVNR, VB02 and VC01 – in the adult.
We generate reference graphs as follows. We first threshold membrane contacts by eliminating the smallest 35% of contacts in each of the adult and L4 datasets (Extended Data Fig. 2). From these, we then generate 4 datasets of membrane contacts: adult left, adult right, L4 left and L4 right. Each dataset was converted to a graph, where vertices are neurons and edges denote membrane contacts between a pair of adjacent neurons. The reference graphs , , and represent the set of membrane contacts found in and all 4 datasets (see explicit calculation of reproducibility degree, , below). Reference graphs for chemical synapses and gap junctions were generated similarly, but with slightly different edge thresholding. Whereas for , we thresholded based on the magnitude of membrane contact, for and we only included edges that correspond to contacts (or from or where explicitly mentioned). By only including edges in , we effectively eliminate differences in synaptic connectivity due to differences in process placement. Each edge in the membrane reference graph has an associated normalized mean contact area (across the 4 datasets). To control for slight differences in cell sizes between the larva and adult series, we normalize all membrane contact areas within each of the 4 datasets by the sum of all membrane contacts within that dataset. The normalized membrane contact area between neurons (, ) in is then the mean normalized contact area across the datasets in which the contact is present.
For bilateral cell classes, let indices, e.g. and , each denote some side of an animal (left or right) and let , , etc. denote the respective contralateral side. For a contact made between cell in class to in class in animal 1, is defined as the number of contacts among where the superscript 2 labels the other animal. For intra-class connections, is the number of contacts among , and for single cell classes, e.g. DVA connecting to some class Y (or vice versa), is counted among (or vice versa).
Population spatial models
The observed variability in membrane contacts, both bilateral and across the two animals, indicates that it is unlikely that any one animal is representative of the population at large. We generated a population model of all membrane contacts, by stochastically perturbing the area associated with each membrane contact, such that the overall distribution of mean membrane contact areas is preserved and that the variability in membrane contact areas across datasets is also preserved. To establish the baseline variability across the 4 datasets, we considered the log-normalized distribution of membrane contact areas (Extended Data Fig. 6a). For each contact in , we computed the normalized mean membrane contact area (see Generating reference graphs) and the standard deviation of membrane contact areas across the 4 datasets. We observed no correlation between the normalized mean membrane contact area and standard deviation (Extended Data Fig. 5b), indicating that the variability in membrane contact areas does not depend strongly on membrane contact area (similar to immediate neighborhood sizes in Extended Data Fig. 1a). Therefore, we estimated the variability in the membrane contact area by the mean variability among membrane contacts (Extended Data Fig. 5c).
To perturb each dataset, we applied multiplicative white noise to each membrane contact area, which we derived from the distribution of membrane contact areas, as follows. A log-transformed (un-skewed) and standardized (mean 0 and variance 1) membrane contact area is computed from membrane contact area by
(1) |
where and denote the geometric mean and standard deviations of the membrane contact areas (i.e. the arithmetic mean taken in the log domain), across the 4 datasets, per cell pair. Rearranging terms gives
(2) |
To perturb membrane contact areas , we add white noise in the log domain, i.e.,
(3) |
Hence, we scale each membrane contact by , where is determined by the membrane contact area distribution of the dataset and the distribution is drawn randomly from a normal distribution with mean 0 and standard deviation .
The standard deviation, , of the distribution sets the amplitude of the perturbation. We determined the appropriate noise amplitude by comparing the distributions of perturbed and empirical datasets. We found that a noise amplitude of – roughly half of the mean standard deviation of membrane contact areas (Extended Data Fig. 5c) – yields perturbed membrane contact area distributions (Extended Data Fig. 5d–f) that are qualitatively similar to the empirical dataset (Extended Data Fig. 5a–c). Moreover, the perturbed membrane contact areas scale linearly with membrane contact area (Extended Data Fig. 5g) and variability as a fraction of membrane contact area is uniform (Extended Data Fig. 5h).
Perturbed populations are denoted , and . For , we perturb contacts conserved across the 4 datasets (L4 left, L4 right, adult left, adult right). For and , we perturb bilaterally conserved contacts in the L4 and adult, respectively. Each population consists of 1000 perturbed datasets.
Spatial modularity analysis
To identify groups of neurites with high spatial affinity in the nerve ring, we performed a graph modularity analysis of the membrane contact areas. Since spatial adjacencies between neurons consist of both conserved and variable membrane contacts, we applied our clustering analysis to , and population models (unless otherwise stated). For clustering purposes, we reduced contralateral left/right homologue vertices to a single vertex class. For example, vertices ASHL and ASHR were reduced to the single vertex, ASH. The algorithm was then applied to each individual in the population.
The multilevel community detection algorithm yields a number of clusters of neuron classes whose neurites exhibit high spatial affinity. Topological clustering methods such as modularity optimization21,40 are well suited for characterizing the organization of a complex system from pairwise undirected linked relationships40,41, as is the case for characterizing spatial organization from membrane contacts between neural processes. In particular, algorithms of this class are appropriate when the organization sought is static40,41. Other, random-walk based algorithms34,41 assume or impose a flow on the network and are often ill-suited for characterizing spatial (i.e. static) organization, as they can introduce bias in the clustering or miss static features in the organization of the system41. We applied the Louvain method, a multilevel community detection algorithm using the igraph software package42. This topological clustering algorithm is a bottom-up heuristic method based on modularity optimization21. Initially, every vertex is placed in a separate community. Vertices are then iteratively moved between communities in a way that maximizes the vertex’s local contribution to the overall modularity score (the ratio of the number of intra- to inter-community edges). When no vertex movement increases the modularity score, communities are shrunk to a single vertex and the process is repeated.
Cluster assignment and validation
The graph clustering algorithm (see above) was applied to each individual in each population model. For each population, we generated a cluster frequency matrix that counts the number of times each pair of neurons is clustered together. We then sorted the rows and columns of the frequency matrix so as to minimize the variance along the main diagonal of the matrix (Extended Data Fig. 5i). Sorting was achieved using a hierarchical matrix clustering algorithm21. The resulting dendrogram assigns neurons to a cluster. We obtained a set of 5 largely overlapping clusters for each of the , and population models (Extended Data Fig. 5i). Cell classes whose cluster assignment agreed across the 3 population models were assigned to the consensus cluster. Seven neuron pairs (ADE, ALN, AVA, RID, RIR, RMD and URX) were classified differently across the different population models, and were designated ‘unclassified’ accordingly. To evaluate the robustness of the clusters to empirical variability between the L4 and adult series, we compared clusters obtained from population models of , and (see Population spatial model, Fig. 1b,c, Extended Data Fig. 5i and 6b).
We performed four sets of validation experiments using our population models to confirm the robustness of our neuron clusters. (1) As discussed above, we compared cluster assignments across , and (Fig. 1b,c, Extended Data Fig. 5i). (2) We generated cluster assignments for populations that were perturbed with different noise amplitudes (; see Population spatial models, Extended Data Fig. 5j). (3) We generated a new population from membrane contacts in a more restricted volume recently used in Moyle et al. (2020)34 which consists of the anterior ~60% of our nerve ring neuropil volume (Extended Data Fig. 5k). (4) We generated a new population that also includes the smallest 35% membrane contact areas (Extended Data Fig. 5l; recall smallest contacts were removed from our analysis, Extended Data Fig. 2). All of our validation experiments resulted in largely similar cluster assignments (Supplementary Information 4). A handful of neuron classes were assigned to different clusters in different population models, but the gross structure of the 5 main neuron clusters, as defined by the cluster assignments of a large majority of the neurons, was consistent across the populations. Thus, our cluster assignments are robust across model population datasets, the L4 and adult, a wide range of noise amplitudes, well above the observed inter-animal variability, and different spatial domains.
Next, we validated our population model by repeating the clustering analysis on the unperturbed reference dataset as well as on the unperturbed adult and L4 bilateral datasets (Extended Data Fig. 5k, and Extended Data Fig. 6b). We find small differences between these datasets, but those are not robust to small perturbation in our population models (, i.e. below our estimated level of expected biological variability in core contacts). We also validate our core assumption that the conserved structure of the nerve ring requires analysis of the reproducible membrane contacts by comparing clusters from unperturbed datasets. We find that whereas the reproducible membrane contact datasets consistently give rise to a small number of clusters, with largely similar composition, membrane contacts fail to reproduce these results (Extended Data Fig. 6a), suggesting that variable membrane contacts may be masking the core, conserved spatial organization of the nerve ring neuropil.
Mesoscale analysis of synaptic connectivity
Given that the organization of the neuropil is modular, with most neurons spatially clustering within local neighborhoods and others spatially interconnecting different neighborhoods, we wanted to determine if synapses form local subcircuits, or to what extent synaptic circuits also span different neighborhoods of the nerve ring. To assess the spatial organization of synaptic circuits, we considered the distribution of conserved synaptic contacts (Fig. 3a). Using the dataset, we calculated the mean ( cells) and standard deviation ( cells) of immediate neighborhood sizes (Anatomical and neuron-class nomenclature). We order the cells as in Fig. 1b so as to maximize the amount of physical contact along the diagonal of the matrix.
We define 5 zones based on the size distribution of immediate neighborhoods. Each zone refers to regions between two diagonals above and below the main diagonal of the matrix. The inner zone (labeled 0) consists of neighborhoods of size for each cell around the main diagonal. The next zone (1) extends from the edges of the inner zone to diagonals further away from the main diagonal, and zones 2–3 similarly extend by . The outermost zone (4) extends from the previous zone (3) to encompass cells in the remainder of the matrix. (Formally, zones are defined by their inner and outer diagonals, with the inner diagonal defined by from the main diagonal for zones 1 to 4 and the outer diagonals defined by from the main diagonal for zones 0 to 3.) We counted the number of contacts in each Zone (Fig. 3b). Finally, we counted contacts between pre- and postsynaptic neurons that have been assigned to the same cluster (Fig. 3c). For this purpose, synapses between two unclassified neurons are not considered intra-cluster.
Contact localization analysis
While some membrane contacts appear to be reproducible (our reference dataset), contacts are aggregate measures (along the entire process). To assess the reproducibility in the location of individual instances of membrane adjacencies along a neurite, we assigned each EM in each process a discrete coordinate, , from the anterior to the posterior of the process. This allows us to compare relative locations of a contact across the four datasets (L4 left/right, adult left/right). Different discretizations of (0.7 μm, 1.4 μm, 3.6 μm) define different resolution for the reproducibility of contacts along the process. For each contact, we define the spatial reproducibility count as the number of datasets where the contact was observed at a given position, . We further define the maximum spatial reproducibility count, , as the highest reproducibility count across all locations, , per cell pair (i.e. given an contact exists between two immediate neighbors, the highest reproducibility count of instances of membrane adjacencies between the two cells). To assess synaptic localization, we similarly measured the spatial reproducibility counts (and their maxima) for all contacts. See also Supplementary Results.
Synapse compartmentalization and subcellular structures
Identification of synaptic compartmentalization and subcellular structures was performed by visually inspecting the volumetric reconstruction of the processes of 173 neurons (the restricted dataset) in both the adult and L4 datasets (346 cells in total). To visualize synapses, we imported synapse locations3 (http://wormwiring.org) into the reconstructed TrakEM2 datasets. To facilitate visual identification, we colored synapses based on whether the cell is presynaptic or postsynaptic and whether the synapses occurs between cells of the same cluster or not. For each cell, we required synapse compartmentalization and/or subcellular structures to be bilaterally conserved in both the L4 and the adult (Supplementary Information 4). The one exception are the RMF cells, where there is clear branching in the L4 (both left and right) that is not observed in the adult. However, because we are limited to 2 samples, we cannot determine if these are developmental, individual differences or reconstruction error5.
We identified two types of synaptic compartmentalization: compartmentalization of synaptic inputs and outputs and compartmentalization of synapses with different clusters. To identify compartmentalization of inputs (outputs), we required neural segments to have ≥3 synaptic inputs (outputs) that are spatially distinct from segments with synaptic outputs (inputs) or neural segments with mixed synaptic inputs and outputs (Extended Data Fig. 9).
We identified flattened protrusions by looking for points along the neural processes with increased surface area. We further identified flattened protrusions with mixed synaptic inputs and outputs, which we interpret to be local points with diverse synaptic polarity. In some instances (Supplementary Information 4), these flattened protrusions appear to extend to branches or spine-like structures (Extended Data Fig. 8). Note that synaptic compartmentalization and flattened protrusions are not mutually exclusive. We observed 9 cells that exhibit co-localized synaptic compartmentalization and flattened protrusions. In these instances, the flattened protrusions appear to be used to compartmentalize reproducible () synapses (Extended Data Fig. 8).
Mapping neighborhood changes of neurites
We observe that some neuron processes extend into multiple neighborhoods (Fig. 3 and Extended Data Fig. 8). We manually mapped neighborhood changes along process trajectories for selected L4 left neurons (as representatives of their cell class). Starting at the proximal end of the process (closest to cell body), we followed the process trajectory through the EM stack. At each EM, we visually noted the cluster assignments of the neighboring neurites and assigned the neighborhood of that segment of the neurite accordingly. If the neighboring neurites comprised two or more clusters then we labeled the local neighborhood as ‘mixed’. The sequence of local neighborhood segments along the neurite was then scaled by the total length of the neurite so that all positions along the neurite range between 0 and 1. In the case of AVA and RIM, which have protrusions that branch out from the main process trajectory, we scaled the protrusion length by the same factor as the main neurite trajectory.
Brain map construction
We posited a 3-layer architecture as the minimal number of layers needed to capture the organizing principles of the connectome. Classifications of neurons as sensory neurons, interneurons or motoneurons followed WormAtlas36. All sensory neurons were assigned to the first layer. SDQ, BDU and ALN have been postulated to have sensory functions44,45 but were classified as interneurons as they are not ciliated and physiological evidence for sensory function is lacking. Reclassifying them as sensory neurons would not alter the high-level connectivity of the brain map. With the exception of AIY and AIA, all neurons that make at least one inter-cluster contact were placed in layer 3, with the remainder of neurons assigned to layer 2. Placing AIY and AIA in layer 2 is consistent with functional and ablation studies suggesting that these cells are first-layer amphid interneurons22,46. Furthermore, AIY and AIA each only make 1 inter-cluster synaptic contact (Extended Data Fig. 10). AIY synapses onto the multi-compartment cell RIA which traverses multiple neighborhoods (Extended Data Fig. 8a). AIA synapses onto RIF whose neurite is at the interface of taxis and avoidance cells in the nerve ring posterior lobe. We confirmed that our map is robust to small changes in which neurons with relatively few inter-cluster synaptic contacts between layers are shifted to layer 2. However, the configuration adopted here optimizes the feed-forward directionality of the synaptic circuit (from the sensory layer to layer 3).
Our information processing modules roughly correspond to the 5 spatially identified clusters. The sublateral and lateral clusters were merged into a single module. With one exception (CEPD), cell classes in the same cluster are placed within the same module. Because CEPD neurons follow the same looping neurite trajectories as other papillary sensory neurons, CEPD cells, which are assigned to the sublateral cluster, are more sensibly placed in the anterior module. Unclassified cells are difficult to cluster because they exhibit high spatial affinity with cells from different clusters. To place the 7 unclassified cell classes on the brain map, we relied on the relative placement of their process trajectories among the clusters. We identified representative cells from each cluster to serve as fiducial points for process placement (Anterior: RIH, Lateral: AVK and RIV, Sublateral: SIAD, Avoidance: AVB, Taxis: ASJ). Each unclassified cell was then added to the module of the representative cells whose neurite most closely aligned with the neurite of the unclassified cell.
Statistical connectivity models
We asked whether stochastic processes could account for the reproducibility and variability of contacts across the 4 datasets. For parsimony, we treat all potential contacts, or graph edges, as identical and allow for all-to-all connectivity. The empirical contact distributions ( for membrane contacts, for synapses and for gap junctions, Fig. 2a) are all bimodal. Therefore, within the above assumptions, a single stochastic process (for making, or equivalently suppressing) contacts cannot account for these distributions.
We therefore constructed a minimal 3-parameter model combining two stochastic processes – precision and specificity. Precise targeting of contacts and active avoidance of others both require us to distinguish between the set of candidate target contacts and the remainder (non-targets). Accordingly, we define a fraction of target contacts , the probability to form a target contact (precision, ) and the probability to avoid an off-target contact (specificity, ).
This model suffices to define the distribution 0, 1, 2, 3 and 4 contacts. For , the probability of is given by:
(4) |
where the parameters , and may take on different values for different instances of . In the absence of empirical data for estimating the physically accessible subset of contacts, we distribution 0, 1, 2, 3 and 4 contactsto , or in general, for datasets, using Bayes’ theorem:
(5) |
Model fits
We used a greedy search of the entire parameter space (with 1% resolution) to find the 3 parameter values (, and ) that minimize the -norm between the predicted and empirical distributions. Due to the symmetry of the equations, the model has two solutions that are equivalent up to relabeling of the nodes (and given by , , ) such that target and non-target populations are swapped both in size and in the probability of contacts. We choose the solution in which the target fraction, , corresponds to the solution with , such that precisely targeted contacts are synonymous with higher reproducibility across datasets.
A further equivalent reparametrization exists that replaces a specificity mechanism (acting only on non-target edges) with a uniform basal connectivity (that applies to both target and non-target edges). This variant of the model provides an alternative interpretation, in which the three parameters are the target fraction, , precision, , and basal activity level, . The solution can be obtained with the reparametrization: , , . Imposing the condition eliminates one of the two solutions for all our model fits.
Empirical data for fits and bias control
For membrane, synaptic and gap junction contacts, we found no evidence of higher reproducibility of edges between the left sides of the L4 and adult datasets, or between the right sides of the L4 and adult, as compared to L4 left and adult right, or vice versa (Supplementary Table 3). However, for membrane contacts, development leads to an overall increase of edges between the L4 and adult (Supplementary Table 1). We considered all neuron pairs in our complete dataset (3203 edges with membrane contact areas ≥35 percentile, Extended Data Fig. 4b) as well as the restricted dataset (see above, 2955 edges). Models of the complete and restricted datasets yielded quantitatively similar results (Extended Data Fig. 3e).
Our restricted dataset consists of 173 neurons. In the absence of spatial constraints, all-to-all connectivity would, in principle, allow for up to 173×172/2 = 14,878 edges. Conversely, using the model fit, the sum could provide an estimate for the size of the pool of physically accessible membrane contacts in the nerve ring. The above estimate (≈ 3500 edges for the restricted set of contacts) is about 23% of the all-to-all number. This model estimate points to the strong role that spatial constraints play in the actual circuit.
The space of possible synaptic and gap junction contact is restricted by the existence of a physical membrane contact. Unless otherwise noted, all fits were performed on and that were restricted to edges from the set of membrane contacts. To control for possible bias due to the subselection of contacts, validation plots were generated by considering contacts and scaling the counts for , where and ‘’ denotes synaptic contacts occurring on the domain of membrane contact (scaling was performed in the same way for gap junction contacts, ; Extended Data Fig. 4a–c). For chemical synapses, we find good agreement with when scaled by and when scaled by either or . Rescaling systematically underestimated across all datasets, possibly due to a subset of small synapses not accounted for by the model. Indeed, consistent with Hall and Russell6, we find that both synapses and gap junctions are significantly smaller (Extended Data Fig. 4f,g) and occur at smaller membrane contacts (Extended Data Fig. 2i, see also Validation against test datasets, below).
To estimate the fraction of target edges (for ), we used Eq. (5). For example, the probability of observing target membrane contacts is given by , whereas the probability of finding 4 variable membrane contacts, , is negligible. Thus, the estimated fraction of membrane contacts in the core circuit is estimated as , whereas, the estimated fraction of contacts in the core circuit is only . Finally, we estimate that in two animals (4 datasets) one would expect of core edges to occur in at least 3 datasets (corresponding to ~99% of core membrane contacts and ~97% of core synaptic edges). Additionally, we separately fit the model to intra-cluster and inter-cluster edges. For each set of membrane contacts, , we separated the contacts that occur between neurons with the same cluster identity (intra-cluster) and contacts between neurons with different cluster identities (inter-cluster). We then separately fit the model to the sets of intra- and inter-cluster edges, corresponding to membrane contacts, synapses and gap junctions (Extended Data Fig. 3c–d).
Simulation and generation of surrogate data
To construct each surrogate dataset, , we set the size of the dataset, (e.g. 2955 for membrane contacts) and created an ordered list of edges. We generated a binary target list (the first round elements in the list, and a binary non-target list ; among target edges, contact, i.e. 1, occurs with probability and among off-target edges, contact occurs with probability . We then aggregate the counts across surrogate datasets, , where corresponds to the number of datasets in which edge forms a contact. The list of then forms a surrogate dataset for the reproducibility of contacts, e.g. .
Validation against test datasets
As additional connectomes are generated and technologies change, we expect slight differences in scoring of different datasets generated from different EM sets3,6,7,17,18,39. These could arise from slightly different demarcation of the volume being scored, different EM sectioning (or sections scored) and different scoring criteria. In the absence of functional (molecular of physiological) data, it is difficult to avoid some false positives (scored synapses that are not fully developed and functional) and false negatives (missed synapses). Often, smaller synapses fare harder to score accurately. Furthermore, most C. elegans synapses are polyadic and present particular challenges, especially when one of the targets occurs with a considerably smaller membrane contact area. Methods and validation of synaptic scoring for the dataset used here have been described by Cook et al.3. Here, we address complementary aspects, relating to reproducibility of scores and implications for our model of core and variable circuits (Extended Data Fig. 3e–i).
Cook et al.3 (the dataset used here) scored a greater number of small synapses than White et al.5 (Extended Data Fig. 4f). Furthermore, while this paper was under submission, additional connectomes have been reported for eight hermaphrodite C. elegans nerve rings, including two adults20. We therefore validated our main results on synaptic reproducibility against the connectomes of White et al.5 and the two adults in Witvliet et al.20 (hereafter, ‘test datasets’, denoted with the subscript test). As the volumetric reconstruction and hence membrane contact analysis is only available for our study, we used the edges identified here as a common basis for comparison and validation.
Size dependence of synaptic reproducibility has previously been noted3,6,7,39. Consistently with these earlier results, Extended Data Fig. 4f shows that synapses, and less so synapses, have a considerably higher fraction of edges associated with higher EM section counts: 87% of and 37% of edges are observed in ≥ 5 EM sections, as compared to 13% and 21% in and , respectively. That said, a comparison with the White et al. test dataset5 shows that the additionally scored synaptic edges are evenly distributed across (Extended Data Fig. 4f). To check whether different scoring criteria leading to different counts of small synapses affect our conclusions, we re-fit our model to a more restricted synaptic dataset in which all 1-EM section synapses were excluded. While this substantially suppresses counts (hence affecting the relative core and variable fractions), its effect on our model precision and specificity is minor (Extended Data Fig. 3f). The scoring of polyadic synapses is also potentially challenging, if synapses are formed with only a subset of co-localized postsynaptic neighbors. To check whether excessive scoring of polyadic synapses might affect our results, we constructed a synaptic dataset in which for every polyadic pre-synaptic site, we excluded any postsynaptic partner that is in . Re-fitting our model to this restricted synaptic dataset, we again find similar precision and specificity.
Next, we reasoned that to be reliable, our statistical model should be robust across datasets. To validate this, we re-fit our model to the two test datasets (Extended Data Fig. 3h–i). Both test datasets show a qualitatively similar bimodal distribution of synaptic reproducibility () that is well fitted to our 3-parameter model. Model fit parameters varied only slightly from our results (Fig. 2): a synaptic edge precision of 92-96% and a specificity of 68-74%. For each synaptic edge scored by Cook et al.3, we then counted the number of edges scored in the test dataset. All but 1 of our edges and 93% of our edges were scored at least once by Witvliet et al.20 (Extended Data Fig. 4h), suggesting that some small synapses are in fact highly reproducible. While slight differences in our model fits preclude automatic merging of the datasets (or models), their similarity implies that it should be possible to quantitatively validate the two extremes, namely non-reproducible and entirely reproducible edge counts, as those are almost certain to come from the variable and core circuits, respectively.
To validate the scoring of postulated variable synapses, we use our model parameters and Eq. (4) to estimate what number of synaptic edges in our dataset would be statistically expected to be absent from two independent animals,
This expression yields an estimated number of ~330 synaptic edges. Empirically, we find that 374 of our synapses were not scored in the Witvliet et al. test dataset20 (Extended Data Fig. 3i; difference not statistically significant under binomial counting statistics). This result adds confidence to the scoring of variable and in particular, small synapses in our dataset. To examine the consistency of postulated conserved synapses, we estimated the number of synaptic edges scored by Cook et al.3 that would also be expected to be found in two independent animals (i.e. in a new set of ). Of our 450 synaptic edges, we expect a test dataset to include as (also equivalent to . Empirically, Witvliet et al.20 score 389 of our contacts, consistent with our model predictions.
Statistics and reproducibility
Membrane contact datasets are derived from the EM reconstructions of the nerve rings from 2 animals at different developmental stages. Each animal in our restricted dataset consists of 80 pairs of bilateral homologous cells. Extended Data Fig. 1 and 2 established that bilateral homologous cells are sufficiently similar. Accordingly, for the purposes of generating reference graphs and for our core-variable and population models, we assume the two sides of each animal may be treated as independent, yielding 4 independent datasets (L4 left, L4 right, adult left and adult right) each consisting of 93 cells classes. As further measures of reproducibility, we validated our core-variable synaptic and gap junction contact models against data scored by different experts on the same EM series5 and on different EM datasets20 (in both cases, limited to our contacts). Our models yielded qualitatively similar results for the different scorings and datasets (Extended Data Fig. 3h,i). Spatial population model data were drawn from distributions that matched the empirical distributions of membrane contact areas across the 4 datasets.
Extended Data
Supplementary Material
Acknowledgements
We thank Jonathan Hodgkin and John White for their help in donating archival TEM material from LMB/MRC to the Hall lab for curation. Thomas Ilett, Felix Salfelder and Samuel Braunstein provided useful discussion. We thank Mei Zhen for making their synaptic and gap junction data available (https://nemanode.org/). This work was supported by NIH grant NIMH F32MH115438 (SJC), NIMH R01MH11289 (SWE), the G. Harold and Leila Y. Mathers Charitable Foundation (SWE), NIH OD 010943 (DHH) and EPSRC EP/J004057/1 (NC). CB was supported by the Leeds International Research Scholarship.
Footnotes
Competing Interests
The authors declare no competing interests.
Code Availability
Software packages parsetrakem2 (extracting adjacency data) and elegansbrainmap (analysis and visualization software) are available at https://github.com/cabrittin/parsetrakem2 and https://github.com/cabrittin/elegansbrainmap respectively.
Data Availability
The volumetric datasets generated during the current study, associated connectivity databases and associated analysis are available at https://doi.org/10.5281/zenodo.4383277 and http://wormwiring.org/. The raw data for volumetric reconstructions for Figs. 1 and 3, Extended Data Fig. 8 and all Supplementary Videos is available at https://doi.org/10.5281/zenodo.4383277. Extracted adjacency data is available in Supplementary Information 1. The reference datasets are available in Supplementary Information 3. The Cytoscape files use to generate the brain map (Fig. 4 and Extended Data Fig. 10) and network motifs (Extended Data Fig. 10) are available at https://doi.org/10.5281/zenodo.4383277. See associated Source Data for the data used to generate plots. The collection of C. elegans nervous system electron micrographs are also available at https://www.wormatlas.org/ and https://wormimage.org.
References
- [1].Hahamy A, Behrmann M. & Malach R. The idiosyncratic brain: Distortion of spontaneous connectivity patterns in autism spectrum disorder. Nat. Neurosci 18, 302–309 (2015). [DOI] [PubMed] [Google Scholar]
- [2].Swanson LW & Lichtman JW From Cajal to connectome and beyond. Ann. Rev. Neurosci 39, 197–216 (2016). [DOI] [PubMed] [Google Scholar]
- [3].Cook SJ et al. Whole-animal connectomes of both Caenorhabditis elegans sexes. Nature 571, 63–71 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Ryan K, Lu Z. & Meinertzhagen IA The CNS connectome of a tadpole larva of Ciona intestinalis (L.) highlights sidedness in the brain of a chordate sibling. eLife 5, e16962 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].White JG, Southgate E, Thomson JN & Brenner S. The structure of the nervous system of the nematode Caenorhabditis elegans. Phil. Trans. R. Soc. Lond. B 314, 1–340 (1986). [DOI] [PubMed] [Google Scholar]
- [6].Hall DH & Russell RL The posterior nervous system of the nematode Caenorhabditis elegans: serial reconstruction of identified neurons and complete pattern of synaptic interactions. J. Neurosci 11, 1–22 (1991). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Jarrell TA et al. The connectome of a decision-making neural network. Science 337, 437–444 (2012). [DOI] [PubMed] [Google Scholar]
- [8].Bumbarger DJ, Riebesell M, Rodelsperger C. & Sommer RJ System-wide rewiring underlies¨ behavioral differences in predatory and bacterial-feeding nematodes. Cell 152, 109–119 (2013). [DOI] [PubMed] [Google Scholar]
- [9].Ohyama T. et al. A multilevel multimodal circuit enhances action selection in Drosophila. Nature 520, 633–639 (2015). [DOI] [PubMed] [Google Scholar]
- [10].Zheng Z. et al. A complete electron microscopy volume of the brain of adult Drosophila melanogaster. Cell 174, 730–743 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Kasthuri N. et al. Saturated reconstruction of a volume of neocortex. Cell 162, 648–61 (2015). [DOI] [PubMed] [Google Scholar]
- [12].Motta A. et al. Dense connectomic reconstruction in layer 4 of the somatosensory cortex. Science 366, eaay3134 (2019). [DOI] [PubMed] [Google Scholar]
- [13].Varshney LR, Chen BL, Paniagua E, Hall DH & Chklovskii DB Structural properties of the Caenorhabditis elegans neuronal network. PLoS Comput. Biol 7, 21 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Sulston JE, Schierenberg E, White JG & Thomson JN The embryonic cell lineage of the nematode Caenorhabditis elegans. Dev. Biol 100, 64–119 (1983). [DOI] [PubMed] [Google Scholar]
- [15].Barabási DL & Barabási A-L A genetic model of the connectome. Neuron 105, 435–445 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Albertson DG & Thomson JN The Pharynx of Caenorhabditis elegans. Phil. Trans. R. Soc. Lond. B 275, 299–325 (1976). [DOI] [PubMed] [Google Scholar]
- [17].Cook SJ et al. The connectome of the Caenorhabditis elegans pharynx. J. Comp. Neurol 528, 2767–2784 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].White J, Southgate E, Thomson J. & Brenner S. Factors that determine connectivity in the nervous system of Caenorhabditis elegans. Cold Spring Harb. Symp. Quant. Biol 48, 633–640 (1983). [DOI] [PubMed] [Google Scholar]
- [19].Durbin RM Studies on the development and organisation of the nervous system of Caenorhabditis elegans. Ph.D. thesis, University of Cambridge (1987). [Google Scholar]
- [20].Witvliet D. et al. Connectomes across development reveal principles of brain maturation in C. elegans. Preprint at https://www.biorxiv.org/content/10.1101/2020.04.30.066209v1 (2020). [DOI] [PMC free article] [PubMed]
- [21].Blondel VD, Guillaume J-L, Lambiotte R. & Lefebvre E. Fast unfolding of communities in large networks. J. Stat. Mech 2008, P10008 (2008). [Google Scholar]
- [22].Gray JM, Hill JJ & Bargmann CI A circuit for navigation in Caenorhabditis elegans. Proc. Natl. Acad. Sci. USA 102, 3184–91 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Kato S. et al. Global brain dynamics embed the motor command sequence of Caenorhabditis elegans. Cell 163, 656–669 (2015). [DOI] [PubMed] [Google Scholar]
- [24].Towlson EK, Vertes PE, Ahnert SE, Schafer WR & Bullmore ET The rich club of the C. elegans neuronal connectome. J. Neurosci 33, 6380–6387 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Cohen N. & Denham JE Whole animal modeling: piecing together nematode locomotion. Curr. Opin. Syst. Biol 13, 150–160 (2019). [Google Scholar]
- [26].Milo R. et al. Network motifs: simple building blocks of complex networks. Science 298, 824–827 (2002). [DOI] [PubMed] [Google Scholar]
- [27].He K, Zhang X, Ren S. & Sun J. Deep residual learning for image recognition. Proc. IEEE Comp. Soc. CVPR, 770–778 (2016). [Google Scholar]
- [28].Thomson AM Neocortical layer 6, a review. Front. Neuroanat 4 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Rapti G, Li C, Shan A, Lu Y. & Shaham S. Glia initiate brain assembly through noncanonical Chimaerin–Furin axon guidance in C. elegans. Nat. Neurosci 20, 1350–1360 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Morgan JL & Lichtman JW An individual interneuron participates in many kinds of inhibition and innervates much of the mouse visual thalamus. Neuron 106, 468–481 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].Chen X. et al. Brain-wide organization of neuronal activity and convergent sensorimotor transformations in larval zebrafish. Neuron 100, 876–890 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Stern S, Kirst C. & Bargmann CI Neuromodulatory control of long-term behavioral patterns and individuality across development. Cell 171, 1649–1662 (2017). [DOI] [PubMed] [Google Scholar]
- [33].Wang L. & Marquardt T. What axons tell each other: Axon-axon signaling in nerve and circuit assembly. Curr. Opin. Neurobiol 23, 974–982 (2013). [DOI] [PubMed] [Google Scholar]
- [34].Moyle MW et al. Structural and developmental principles of neuropil assembly in C. elegans Preprint at https://www.biorxiv.org/content/10.1101/2020.03.15.992222v1 (2020). [DOI] [PMC free article] [PubMed]
- [35].Ware RW, Clark D, Crossland K. & Russell RL The nerve ring of the nematode Caenorhabditis elegans: Sensory input and motor output. J. Comp. Neurol 162, 71–110 (1975). [Google Scholar]
- [36].Altun Z. & Hall D. WormAtlas (2012). URL http://www.wormatlas.org.
- [37].Peachey LD Thin sections. I. A study of section thickness and physical distortion produced during microtomy. J. Biophys. Biochem. Cyt 4, 233–242 (1958). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Cardona A. et al. TrakEM2 Software for Neural Circuit Reconstruction. PLoS ONE 7, e38011 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Xu M. et al. Computer assisted assembly of connectomes from electron micrographs: Application to Caenorhabditis elegans. PLoS ONE 8, e54050 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Newman ME & Girvan M. Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004). [DOI] [PubMed] [Google Scholar]
- [41].Rosvall M. & Bergstrom CT Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. USA 105, 1118–1123 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Csardi GC & Nepusz T. The igraph software package for complex network research. InterJournal Complex Sys. 1695 (2006). [Google Scholar]
- [43].Virtanen P. et al. SciPy 1.0 Contributors. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 17, 261–272 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Chang AJ, Chronis N, Karow DS, Marletta MA & Bargmann CI A distributed chemosensory circuit for oxygen preference in C. elegans. PLoS Biol. 4, e274 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Zimmer M. et al. Neurons detect increases and decreases in oxygen levels using distinct guanylate cyclases. Neuron 61, 865–879 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Tomioka M. et al. The Insulin/PI 3-Kinase pathway regulates salt chemotaxis learning in Caenorhabditis elegans. Neuron 51, 613–625 (2006). [DOI] [PubMed] [Google Scholar]
- [47].Hendricks M, Ha H, Maffey N. & Zhang Y. Compartmentalized calcium dynamics in a C. elegans interneuron encode head movement. Nature 487, 99–103 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Perkins LA, Hedgecock EM, Thomson JN & Culotti JG Mutant sensory cilia in the nematode Caenorhabditis elegans. Dev. Biol 117, 456–487 (1986). [DOI] [PubMed] [Google Scholar]
- [49].Sawin ER, Ranganathan R. & Horvitz HR C. elegans locomotory rate is modulated by the environment through a dopaminergic pathway and by experience through a serotonergic pathway. Neuron 26, 619–631 (2000). [DOI] [PubMed] [Google Scholar]
- [50].Kang L, Gao J, Schafer WR, Xie Z. & Xu XZ C. elegans TRP family protein TRP-4 is apore-forming subunit of a native mechanotransduction channel. Neuron 67, 381–391 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Chalfie M. & Sulston J. Developmental genetics of the mechanosensory neurons of Caenorhabditis elegans. Dev. Biol 82, 358–370 (1981). [DOI] [PubMed] [Google Scholar]
- [52].Suzuki H. et al. In vivo imaging of C. elegans mechanosensory neurons demonstrates a specific role for the MEC-4 channel in the process of gentle touch sensation. Neuron 39, 1005–1017 (2003). [DOI] [PubMed] [Google Scholar]
- [53].Chalfie M. et al. The neural circuit for touch sensitivity in Caenorhabditis elegans. J. Neurosci 5, 956–964 (1985). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Li C. et al. The FMRFamide-related neuropeptide FLP-20 is required in the mechanosensory neurons during memory for massed training in C. elegans. Learn. Mem 20, 103–108 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [55].Hukema RK, Rademakers S, Dekkers MPJ, Burghoorn J. & Jansen G. Antagonistic sensory cues generate gustatory plasticity in Caenorhabditis elegans. EMBO J. 25, 312–322 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The volumetric datasets generated during the current study, associated connectivity databases and associated analysis are available at https://doi.org/10.5281/zenodo.4383277 and http://wormwiring.org/. The raw data for volumetric reconstructions for Figs. 1 and 3, Extended Data Fig. 8 and all Supplementary Videos is available at https://doi.org/10.5281/zenodo.4383277. Extracted adjacency data is available in Supplementary Information 1. The reference datasets are available in Supplementary Information 3. The Cytoscape files use to generate the brain map (Fig. 4 and Extended Data Fig. 10) and network motifs (Extended Data Fig. 10) are available at https://doi.org/10.5281/zenodo.4383277. See associated Source Data for the data used to generate plots. The collection of C. elegans nervous system electron micrographs are also available at https://www.wormatlas.org/ and https://wormimage.org.