Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2011 Feb 3;7(2):e1001066. doi: 10.1371/journal.pcbi.1001066

Structural Properties of the Caenorhabditis elegans Neuronal Network

Lav R Varshney 1, Beth L Chen 2, Eric Paniagua 3, David H Hall 4, Dmitri B Chklovskii 5,*
Editor: Olaf Sporns6
PMCID: PMC3033362  PMID: 21304930

Abstract

Despite recent interest in reconstructing neuronal networks, complete wiring diagrams on the level of individual synapses remain scarce and the insights into function they can provide remain unclear. Even for Caenorhabditis elegans, whose neuronal network is relatively small and stereotypical from animal to animal, published wiring diagrams are neither accurate nor complete and self-consistent. Using materials from White et al. and new electron micrographs we assemble whole, self-consistent gap junction and chemical synapse networks of hermaphrodite C. elegans. We propose a method to visualize the wiring diagram, which reflects network signal flow. We calculate statistical and topological properties of the network, such as degree distributions, synaptic multiplicities, and small-world properties, that help in understanding network signal propagation. We identify neurons that may play central roles in information processing, and network motifs that could serve as functional modules of the network. We explore propagation of neuronal activity in response to sensory or artificial stimulation using linear systems theory and find several activity patterns that could serve as substrates of previously described behaviors. Finally, we analyze the interaction between the gap junction and the chemical synapse networks. Since several statistical properties of the C. elegans network, such as multiplicity and motif distributions are similar to those found in mammalian neocortex, they likely point to general principles of neuronal networks. The wiring diagram reported here can help in understanding the mechanistic basis of behavior by generating predictions about future experiments involving genetic perturbations, laser ablations, or monitoring propagation of neuronal activity in response to stimulation.

Author Summary

Connectomics, the generation and analysis of neuronal connectivity data, stands to revolutionize neurobiology just as genomics has revolutionized molecular biology. Indeed, since neuronal networks are the physical substrates upon which neural functions are carried out, their structural properties are intertwined with the organization and logic of function. In this paper, we report a near-complete wiring diagram of the nematode Caenorhabditis elegans and present several analyses of its properties, finding many nonrandom features. We give novel visualizations and compute network statistics to enhance understanding of the reported data. We also use principled systems-theoretic methods to generate hypotheses on how biological function may arise from the reported neuronal network structure. The wiring diagram reported here can further be used to generate predictions about signal propagation in future perturbation, ablation, or artificial stimulation experiments.

Introduction

Determining and examining base sequences in genomes [1], [2] has revolutionized molecular biology. Similarly, decoding and analyzing connectivity patterns among neurons in nervous systems, the aim of the emerging field of connectomics [3][6], may make a major impact on neurobiology. Knowledge of connectivity wiring diagrams alone may not be sufficient to understand the function of nervous systems, but it is likely necessary. Yet because of the scarcity of reconstructed connectomes, their significance remains uncertain.

The neuronal network of the nematode Caenorhabditis elegans is a logical model system for advancing the connectomics program. It is sufficiently small that it can be reconstructed and analyzed as a whole. The Inline graphic neurons in the hermaphrodite worm are identifiable and consistent across individuals [7]. Moreover the connections between neurons, consisting of chemical synapses and gap junctions, are stereotypical from animal to animal with more than Inline graphic reproducibility [7][10].

Despite a century of investigation [11], [12], knowledge of nematode neuronal networks is incomplete. The basic structure of the C. elegans nervous system had been reconstructed using electron micrographs [7], but a major gap in the connectivity of ventral cord neurons remained. Previous attempts to assemble the whole wiring diagram made unjustified assumptions that several reconstructed neurons were representative of others [13]. Much previous work analyzed the properties of the neuronal network (see e.g. [14][20] and references therein and thereto) based on these incomplete or inconsistent wiring diagrams [7], [13].

In this paper, we advance the experimental phase of the connectomics program [6], [21] by reporting a near-complete wiring diagram of C. elegans based on original data from White et al. [7] but also including new serial section electron microscopy reconstructions and updates. Although this new wiring diagram has not been published definitively before now, it has already been freely shared with the community through the WormAtlas [22] and has also been used in previous studies such as [23]. See Methods section for further details on the wiring diagram and on freely obtaining it in electronic form.

We advance the theoretical phase of connectomics [24], [25], by characterizing signal propagation through the reported neuronal network and its relation to behavior. We compute for the first time, local properties that may play a computational purpose, such as the distribution of multiplicity and the number of terminals, as well as global network properties associated with the speed of signal propagation. Unlike the conventional “hypothesis-driven” mode of biological research, our work is primarily “hypothesis-generating” in the tradition of systems biology.

Our results should help investigate the function of the C. elegans neuronal network in several ways. A full wiring diagram, especially when conveniently visualized using a method proposed here, helps in designing maximally informative optical ablation [26] or genetic inactivation [27] experiments. Our eigenspectrum analysis characterizes the dynamics of neuronal activity in the network, which should help predict and interpret the results of experiments using sensory and artificial stimulation and imaging of neuronal activity.

Organization of the Results section reflects the duality of contribution and follows the tradition laid down by genome sequencing [1], [2]. We start by describing and visualizing the wiring diagram. Next, we analyze the non-directional gap junction network and the directional chemical synapse network separately. We perform these analyses separately because understanding the parts before the whole provides didactic benefits and because this delays making assumptions about the relative weight of gap junctions and chemical synapses. Finally, we analyze the combined network of gap junctions and chemical synapses.

Results

Reconstruction

An updated wiring diagram

The C. elegans nervous system contains Inline graphic neurons and is divided into the pharyngeal nervous system containing Inline graphic neurons and the somatic nervous system containing Inline graphic neurons. We updated the wiring diagram (see Methods) of the larger somatic nervous system. Since neurons CANL/R and VC06 do not make synapses with other neurons, we restrict our attention to the remaining Inline graphic somatic neurons. The wiring diagram consists of Inline graphic chemical synapses, Inline graphic gap junctions, and Inline graphic neuromuscular junctions.

The new version of the wiring diagram incorporates original data from White et al. [7], Hall and Russell [10], updates based upon later work [8], [Hobert O and Hall DH, unpublished], as well as new reconstructions; see Methods for details. In total, over Inline graphic synaptic contacts, including chemical synapses, gap junctions, and neuromuscular junctions were either added or updated from the previous version of the C. elegans wiring diagram.

The current wiring diagram is considered self-consistent under the following criteria:

  1. A record of Neuron Inline graphic sending a chemical synapse to Neuron Inline graphic must be paired with a record of Neuron Inline graphic receiving a chemical synapse from Neuron Inline graphic.

  2. A record of gap junction between Neuron Inline graphic and Neuron Inline graphic must be paired with a separate record of gap junction between Neuron Inline graphic and Neuron Inline graphic.

Although the updated wiring diagram represents a significant advance, it is only about Inline graphic complete because of missing data and technical difficulties. Due to sparse sampling along lengths of the sublateral, canal-associated lateral, and midbody dorsal cords, about Inline graphic of the total chemical synapses are missing, as concluded from antibody staining for synapses [Duerr JS, Hall DH, and Rand JB, unpublished]. Many gap junctions are likely missing due to the difficulty in identifying them in electron micrographs using conventional fixation and imaging methods. Hopefully, application of high-pressure freezing techniques and electron tomography will help identify missing gap junctions [28]. Finally, it should be noted that this reconstruction combined partial imaging of three worms, with images for the posterior midbody being from the male N2Y.

The basic qualitative properties of the updated C. elegans nervous system remain as reported previously [7][9]. Neurons are divided into Inline graphic classes, based on morphology, dendritic specialization, and connectivity. Based on neuronal structural and functional properties, the classes can be divided into three categories: sensory neurons, interneurons, and motor neurons. Neurons known to respond to specific environmental conditions, either anatomically, by sensory ending location, or functionally, are classified as sensory neurons. They constitute about a third of neuron classes. Motor neurons are recognized by the presence of neuromuscular junctions. Interneurons are the remainder of the neuron classes and constitute about half of all classes. A few of the neurons could have dual classification, such as sensory/motor neurons. Some interneurons are much more important for developmental function than for function in the final neuronal network [28].

The majority of sensory neuron and interneuron categories contain pairs of bilaterally symmetric neurons. Motor neurons along the body are organized in repeating groups whereas motor neurons in the head have four- or six-fold symmetry. A large fraction of neurons send long processes to the nerve ring in the circumpharyngeal region to make synapses with other neurons [7].

The neurons in C. elegans are structurally simple: most neurons have one or two unbranched processes and form en passant synapses along them. Dendrites are recognized by being strictly “postsynaptic” or by containing a specialized sensory apparatus, such as amphid and phasmid sensory neurons. Interneurons lack clear dendritic specialization. It is interesting to note that a given worm neuron has connections with only about Inline graphic of neurons with which it has physical contact [7], [8], a similar number to the connectivity fraction in other nervous systems [29], [30].

Wiring diagram as adjacency matrices

In the remainder of the paper, we describe and analyze the connectivity of gap junction and chemical synapse networks of C. elegans neurons. Gap junctions are channels that provide electrical coupling between neurons, whereas chemical synapses use neurotransmitters to link neurons. The network of gap junctions and the network of chemical synapses are initially treated separately, with each represented by its own adjacency matrix, Figure 1. In an adjacency matrix Inline graphic, the element in the Inline graphicth row and Inline graphicth column, Inline graphic, represents the total number of synaptic contacts from the Inline graphicth neuron to the Inline graphicth. If neurons are unconnected, the corresponding element of the adjacency matrix is zero. An adjacency matrix may be used due to self-consistency in the gathered data.

Figure 1. Adjacency matrices for the gap junction network (blue circles) and the chemical synapse network (red points) with neurons grouped by category (sensory neurons, interneurons, motor neurons).

Figure 1

Within each category, neurons are in anteroposterior order. Among chemical synapse connections, small points indicate less than Inline graphic synaptic contacts, whereas large points indicate Inline graphic or more synaptic contacts. All gap junction connections are depicted in the same way, irrespective of number of gap junction contacts.

Although gap junctions may have directionality, i.e. conduct current in only one direction, this has not been demonstrated in C. elegans. Even if directionality existed, such information cannot be extracted from electron micrographs. Thus we treat the gap junction network as an undirected network with a symmetric adjacency matrix, as depicted in Figure 1. Weights in both Inline graphic and Inline graphic represent the total number of gap junctions between neurons Inline graphic and Inline graphic.

Since chemical synapses possess clear directionality that can be extracted from electron micrographs, we represent the chemical network as a directed network with an asymmetric adjacency matrix, Figure 1. The elements of the adjacency matrix take nonnegative values, which reflect the number of synaptic contacts between corresponding neurons. Contacts are given equal weight, regardless of the apparent size of the synaptic apposition. We use nonnegative values for most of the paper because we cannot determine whether a synapse is excitatory, inhibitory, or modulatory from electron micrographs of C. elegans. For the linear systems analysis, we do however make a rough guess of the signs of synapses based on neurotransmitter gene expression data.

Electron micrographs for C. elegans have a further limitation that causes some synaptic ambiguity. When a presynaptic terminal makes contact with two adjacent processes of different neurons (send_joint in Durbin's notation [8]), it is not known which of these processes acts as a postsynaptic terminal; both might be involved. We count all polyadic synaptic connections. Polyadic connections are briefly revisited in the Discussion.

Visualization

Although statistical measures that we investigate later in this paper provide significant insights, they are no substitute to exploring detailed connectivity in the neuronal network. As the number of connections between neurons is large even for relatively simple networks, such analysis requires a convenient way to visualize the wiring diagram. Previously, various fragments of the wiring diagram were drawn to illustrate specific pathways [8], [31], [32]. Here, we propose a method to visualize the whole wiring diagram in a way that reflects signal flow through the network as well as the closeness of neurons in the network, Figure 2. To this end, we use spectral network drawing techniques because they have certain optimality properties [33] and aesthetic appeal. Next, we give an intuitive description of our visualization method; mathematical details can be found in Text S1.

Figure 2. The C. elegans wiring diagram is a network of identifiable, labeled neurons connected by chemical and electrical synapses.

Figure 2

Red, sensory neurons; blue, interneurons; green, motorneurons. (a). Signal flow view shows neurons arranged so that the direction of signal flow is mostly downward. (b). Affinity view shows structure in the horizontal plane reflecting weighted non-directional adjacency of neurons in the network.

The vertical axis in Figure 2(a), represents the position of neurons in the signal flow hierarchy [34], [35] of the chemical synapse network with sensory neurons at the top and motor neurons at the bottom, with interneurons in between. We want the vertical coordinate of pre- and post-synaptic neurons to differ by one, however due to “frustration” this is not always possible. Frustration happens when distances measured along network connections cannot be made to correspond to the hierarchy distances: there are two different hierarchical paths that require a particular neuron to appear in two different places. We look for the layout that has smallest deviation from this condition and find a closed form solution [34], [36].

The distance along the vertical coordinate corresponds roughly to the number of synapses from sensory to motor neurons—the signal flow depth of the network. Depending on the specific neurons considered, the distance from a sensory neuron to a motor neuron is Inline graphicInline graphic. At the same time, the minimum number of chemical synapses crossed from sensory to motor neuron averaged over all such pairs is Inline graphic, see also [8].

Neuronal position on the horizontal plane, Figure 2(b), represents the connectivity closeness of neurons in the combined chemical and electrical synapse network. Neuronal coordinates are given by the second and third eigenmodes of the symmetrized network's graph Laplacian (see below). In this representation, pairs of synaptically coupled neurons with larger number of connections in parallel tend to be positioned closer in space.

Thus, Figure 2 represents not the physical placement of neurons in the worm but signal flow and closeness in the network. Such visualization reveals that motorneurons and some interneurons segregate into two lobes along the first horizontal axis: the right lobe contains motorneurons in the ventral cord and the left lobe consists of neck neurons. The bi-lobe structure suggests partial autonomy of motorneurons in the ventral cord and neck. Interneurons that could coordinate the function of the two lobes can be easily identified by their central location.

Gap Junction Network

For quantitative characterization, we first consider the gap junction network.

Basic structure and connectivity

The gap junction network that we analyze consists of Inline graphic neurons and Inline graphic gap junction connections, consisting of one or more junctions. The network is not fully connected, but is divided into a giant component containing Inline graphic neurons, two smaller components of Inline graphic and Inline graphic neurons, and Inline graphic isolated neurons with no gap junctions (Table 1 in Text S4). The giant component has Inline graphic connections. Using connectivity data from [13], Majewska and Yuste had previously pointed out that most neurons in C. elegans belong to the giant component [37]. Our results agree roughly with [37], although our dataset excludes non-neuronal cells and places certain neurons in different connected components.

To evaluate the significance of the number of neurons in the giant component, we compare it with those expected in random networks. We start with the Erdös-Rényi random network because its construction requires a single parameter, the probability of a connection between two neurons. An Erdös-Rényi random network with Inline graphic neurons and connection probability Inline graphic (thus with Inline graphic expected connections) would be expected to have Inline graphic neurons in the giant component. The true gap junction giant component is much smaller; the probability of finding such a small giant component in a random network is on the order of Inline graphic (see Methods).

A better comparison, however, can be made to random networks with degree distributions that match the degree distribution of the gap junction network [38]. Here, the degree of a neuron is the number of neurons with which it makes a gap junction. The giant component in a degree-matched random network would be expected to be Inline graphic neurons (see Methods), about the same size as the measured giant component.

We may explore the utility of representing the wiring diagram as a three-layer network by grouping neurons by category (sensory neurons, interneurons, motor neurons). As shown in Tables 2A and 2B in Text S4, each category has many recurrent connections within and between categories (with the exception of connections between sensory and motor neurons). In particular, Table 2B in Text S4 indicates that motor neurons send back to interneurons roughly the same number of connections as they recurrently sent back to motor neurons. These observations suggest that when considering only gap junctions, a three-layer network abstraction may not be particularly useful.

Distributions of degree, multiplicity and the number of terminals

In this section, we analyze statistical properties of individual neurons and synaptic connections. To characterize the ability of individual neurons to propagate or collect signals, we compute the degree Inline graphic of neuron Inline graphic, which is the number of neurons that are coupled to Inline graphic by at least one gap junction. The mean degree is Inline graphic, however this value is not representative as the degree varies in a wide range, from Inline graphic to Inline graphic. Thus, it is important to look at the degree distribution, which has been used to characterize and classify other networks previously [39][42].

To visualize the discrete degree distribution, Inline graphic, we use the survival function:

graphic file with name pcbi.1001066.e058.jpg (1)

which is the complement of the cumulative distribution function, Figure 3(a). The advantages of looking at the survival function rather than the degree distribution directly are that histogram binning is not required and that noise in the tail is reduced [43]. The survival function is also later applied to visualize other statistics. Various commonly encountered distributions and their corresponding survival functions are given in Text S2.

Figure 3. Survival functions for the distributions of degree, multiplicity, and number of synaptic terminals in the gap junction network.

Figure 3

Neurons or connections with exceptionally high statistics are labeled. The tails of the distributions can be fit by a power law with the exponent Inline graphic for the degrees (a); Inline graphic for the multiplicity distribution (b); Inline graphic for the number of synaptic terminals (c). The exponents for the power law fits of the corresponding survival functions are obtained by subtracting one.

We perform a fitting procedure for the tail of the gap junction degree distribution [42] (see Methods). We find that the tail (Inline graphic) can be fit by the power law with exponent Inline graphic), Figure 3(a), but not by the exponential decay (Inline graphic-valueInline graphic). This result is consistent with the view that the gap junction network is scale-free [40].

To characterize the direct impact that one neuron can have on another, we quantify the strength of connections by the multiplicity, Inline graphic, between neurons Inline graphic and Inline graphic, which is the number of synaptic contacts (here gap junctions) connecting Inline graphic to Inline graphic. The degree treats synaptic connections as binary, whereas the multiplicity, also called edge weight, quantifies the number of contacts. The multiplicity distribution for the gap junction network is shown in Figure 3(b). We find that the multiplicity distribution for Inline graphic obeys a power law with exponent Inline graphic). Although the exponential decay fit to the tail passes the Inline graphic-value test, the log-likelihood is significantly lower than for the power law.

Finally, the sum of the multiplicities of all gap junction connections of a given neuron is called the number of terminals, or the nodal strength. The tail of the distribution of the number of synaptic terminals, Figure 3(c), is adequately fit by a power law with exponent Inline graphic).

Identifying neurons that play a central or special role in the transmission or processing of information may also prove useful [44][48]. To rank neurons according to their roles, we introduce several centrality indices. Perhaps the simplest centrality index is degree centrality Inline graphic. Degree centrality is simply the degree of a neuron, Inline graphic, and is motivated by the idea that a neuron with connections to many other neurons has a more important or more central role in the network than a neuron connected to only a few other neurons. Neurons that have unusually high degree centrality include AVAL/R and AVBL/R. The same neurons lie in the tail of the distribution of the number of synaptic terminals, Figure 3(c), suggesting strong electrical coupling to the network. These neuron pairs are command interneurons responsible for coordinating backward and forward locomotion, respectively [22], [32], [49]. The high degree centralities of RIBL/R suggest a similarly central function for those neurons, though they each only have Inline graphic gap junction terminals, in the middle of the distribution of number of terminals, suggesting weaker electrical coupling to the network.

Small world properties

Having described statistical properties of individual neurons and connections, such as the degree and multiplicity distributions, we now investigate properties that may describe the efficiency of signal transmission across the gap junction network. Traditionally [14], this analysis does not consider multiplicity of gap junctions but treats them as binary. We analyze signal propagation when including multiplicities in the next subsection.

The geodesic distance, Inline graphic, between two neurons in the network is the length of the shortest network path between them. The network path is measured by the number of connections that are crossed rather than by physical distance. The average geodesic distance over all pairs of neurons is the characteristic path length [14]:

graphic file with name pcbi.1001066.e079.jpg (2)

where Inline graphic is the number of neurons. This global measure describes how readily or rapidly a signal can travel from one neuron to another since it is simply the average distance between all neurons. Clearly, the measure Inline graphic requires the network to be connected (otherwise Inline graphic diverges), so we restrict attention to the giant component.

A signal originating in one neuron in the giant component must cross Inline graphic gap junction connections on average to reach another neuron of the giant component. For an Erdös-Rényi random network with Inline graphic neurons and Inline graphic connections we computed the characteristic path length to be Inline graphic (Monte Carlo with Inline graphic). When the actual degree distribution of the gap junction network is taken into account, a random network from that ensemble would be expected to have characteristic path length Inline graphic (see Methods). The distribution of geodesic distances Inline graphic in the giant component is shown in Figure 1(a) in Text S4.

A second measure for signal propagation is the clustering coefficient Inline graphic, which measures the density of connections among an average neuron's neighbors. It is defined as [14]:

graphic file with name pcbi.1001066.e091.jpg (3)

where Inline graphic is the number of connections between neighbors of Inline graphic, Inline graphic is the number of neighbors of Inline graphic, and Inline graphic measures the density of connections in the neighborhood of neuron Inline graphic (we set Inline graphic when Inline graphic). We find the clustering coefficient Inline graphic. We computed the clustering coefficient for an Erdös-Rényi random network with Inline graphic neurons and Inline graphic connections to be Inline graphic (Monte Carlo with Inline graphic). For a degree-matched random network, we computed the clustering coefficient Inline graphic (Monte Carlo with Inline graphic). Thus, the giant component of the gap junction network is strongly clustered relative to random networks, both Erdös-Rényi and degree-matched.

Small world networks have much higher clustering coefficient relative to random networks without sacrificing the mean path length. For the giant component of the gap junction network, the corresponding ratios are Inline graphic and Inline graphic, indicating that the network is small world. Quantitatively, small-world-ness of a network may be defined relative to a degree-matched Erdös-Rényi random network as follows [50]:

graphic file with name pcbi.1001066.e109.jpg

In the case of the giant component of the gap junction network, small-world-ness Inline graphic.

Next we consider how quickly individual neurons reach all other neurons in the network. The normalized closeness of a neuron Inline graphic is the average geodesic distance Inline graphic across all neurons Inline graphic that are reachable from Inline graphic [45]:

graphic file with name pcbi.1001066.e115.jpg (4)

The normalized closeness centrality, which takes higher values for more central neurons, is defined as the inverse, Inline graphic.

Restricting to the gap junction giant component, the six most central neurons are AVAL, AVBR, RIGL, AVBL, RIBL, and AVKL. In addition to command interneuron classes AVA and AVB, these include RIBL and RIGL, both ring interneurons, and AVKL, an interneuron in the ventral ganglion of the head. The set of closeness central neurons mostly overlaps with the set of degree central neurons. The correlation between the two centrality measures does not extend to peripheral neurons, as the Spearman rank correlation coefficient [51] between degree centrality Inline graphic and closeness centrality Inline graphic for the entire giant component is only Inline graphic.

Spectral properties

Global network properties discussed in the previous section characterize signal transmission while ignoring connection weights. As weights affect the effectiveness of signal transmission and vary among connections, we now analyze the weighted network by using linear systems theory. Although neuronal dynamics can be nonlinear, spectral properties nevertheless provide important insights into function. For example, the initial success of the Google search engine is largely attributed to linear spectral analysis of the World Wide Web [52].

We characterize the dynamics of the gap junction network by the following system of linear differential equations, which follow from charge conservation [53], [54].

graphic file with name pcbi.1001066.e120.jpg (5)

where Inline graphic is the membrane potential of neuron Inline graphic, Inline graphic is the membrane capacitance of neuron Inline graphic, Inline graphic is the conductance of gap junctions between neurons Inline graphic and Inline graphic, and Inline graphic is the membrane conductance of neuron Inline graphic. Assuming that each neuron has the same capacitance Inline graphic and each gap junction has the same conductance Inline graphic, i.e. Inline graphic, we can rewrite this equation in terms of the time constant Inline graphic as:

graphic file with name pcbi.1001066.e134.jpg (6)

Assuming that gap junction conductance is greater than the membrane conductance, we temporarily neglect the last term and rewrite this equation in matrix form:

graphic file with name pcbi.1001066.e135.jpg (7)

where Inline graphic is the Laplacian matrix of the weighted network, Inline graphic contains the number of neuron gap junctions on the diagonal and zeros elsewhere, and Inline graphic is a column vector of the membrane potentials.

The system of coupled linear differential equations (6) can be solved by performing a coordinate transformation to the Laplacian eigenmodes. Since the Laplacian eigenmodes are decoupled and evolve independently in time, performing an eigendecomposition of initial conditions leads to a full description of the system dynamics. The survival function of the Laplacian eigenspectrum is shown in Figure 4(a).

Figure 4. Linear systems analysis of the giant component of the gap junction network.

Figure 4

(a). Survival function of the eigenvalue spectrum (blue). The algebraic connectivity, Inline graphic, is Inline graphic and the spectral radius, Inline graphic, is Inline graphic. A time scale associated with the decay constant is also given. (b). Scatterplot showing the Inline graphic norm and decay constant of the eigenmodes of the Laplacian. The fastest modes from Figure 3 in Text S4 are marked in red. The sparsest and slowest modes, most amenable to biological analysis, are located in the lower-left corner of the diagram. (c). Eigenmode of Laplacian corresponding to Inline graphic (marked green in panel (b)). (d). Eigenmode of Laplacian corresponding to Inline graphic (marked cyan in panel (b)).

What insight can be gained from inspection of the Laplacian eigenmodes? The gap junction network is equivalent to a network of resistors, where each gap junction acts as a resistor. The eigenmodes give intuition about experiments where a charge is distributed among neurons of the network and the spreading charge among the neurons is monitored in time. If the charge is distributed among neurons according to an eigenmode, the relative shape of the distribution does not change in time. The charge magnitude decays with a time constant specified by the eigenvalue. The smallest eigenvalue of the Laplacian is always zero, corresponding to the infinite relaxation time. In the corresponding eigenmode each neuron is charged equally.

If the charge is distributed according to eigenmodes corresponding to small eigenvalues, the decay is rather slow. Thus, these eigenmodes correspond to long-lived excitation. The existence of slowly decaying modes often indicates that the network contains weakly coupled subnetworks, in which neurons are strongly coupled among themselves. The corresponding charge distribution usually has negative values on one subnetwork and positive values on the other subnetwork. Because of the relatively slow equilibration of charge between the subnetworks, such an eigenmode decays slowly.

As an example of slow equilibration implying a subnetwork that is strongly internally coupled, one might speculate that the eigenmode associated with Inline graphic (Figure 4(c)) on the ‘black’ side reflects a coupling of chemosensory neurons in the tail (PHBL/R) along with interneurons (AVHL/R, AVFL/R) and motor neurons (VC01–05) involved in egg-laying behavior. At the level of gap junctions, these neurons are weakly coupled with chemosensory neurons in the head (ADFR, ASIL/R, AWAL/R) and related interneurons (AIAL/R) on the ‘red’ side.

Another interesting example is the eigenmode associated with Inline graphic (Figure 4(d)). Neurons on the ‘red’ side overlap significantly with those identified previously in a hub-and-spoke circuit mediating pheromone attraction, oxygen sensing, and social behavior [55]. Such overlap is consistent with the view [55] that this network of neurons solves a consensus problem [56].

These two examples demonstrate that spectral analysis can uncover circuits that have been described using experimental studies. The probability of a known functional circuit appearing in an eigenmode by chance is small (see Methods). It would be interesting to see whether other eigenmodes have a biological interpretation and therefore generate predictions for future experiments.

To prioritize further analysis of eigenmodes for biological significance, it may be advantageous to focus on the slow and sparse modes, where few neurons exhibit significant activity. We can quantify sparseness of normalized eigenmodes by the sum of absolute values of the eigenmode components, also known as the Inline graphic norm or Manhattan distance. Sparser eigenmodes have smaller Inline graphic norms [57]. Figure 4(b) is a scatterplot of eigenmodes showing both their decay constant and their Inline graphic norms.

The full set of eigenmodes of the connected component is shown in Figure 2 in Text S4. The eigenmodes corresponding to large eigenvalues decay fast, suggesting that corresponding neurons have the same membrane potential on relevant time scales and act effectively as a single unit. Many such eigenmodes peak (with opposite signs) for left-right neuronal pairs (Figure 3 in Text S4), often known to be functionally identical, which therefore act as a single unit.

To understand timescales, one might wonder what the absolute values of decay constants for various eigenmodes are. Current knowledge of electrical parameters for C. elegans neurons allows us to estimate the decay times only approximately. Assuming neuron capacitance of Inline graphicpF [58] and gap junction conductance of Inline graphicpS, we find a time constant Inline graphicms. This implies that the slowest non-trivial mode corresponding to the second lowest eigenvalue, Inline graphic has decay time of about Inline graphicms, Figure 4(a). This eigenvalue, Inline graphic, is known as the algebraic connectivity of a network [59].

What is the effect of the dropped term corresponding to the membrane current in (6)? As this term would correspond to adding a scaled identity matrix to the Laplacian, the spectrum should uniformly shift to higher values by the corresponding amount. Thus, even the eigenmode corresponding to the zero eigenvalue would now have a finite decay time. Assuming the membrane conductance of about Inline graphicpS [58], we find Inline graphicms decay time. This leads to a Inline graphic increase in the values of Inline graphic. Now, the slowest non-trivial mode corresponds to a decay time of about Inline graphicms.

In addition to highlighting groups of neurons that could be functionally related, spectral analysis allows us to predict, under linear approximation, the outcome of experiments that study the spread of an arbitrarily generated excitation in the neuronal network. Such excitation can be generated in sensory neurons by presenting a sensory stimulus [60] or in any neuron by expressing a light-gated ion channel, such as channelrhodopsin, in that cell and stimulating optically [26], [61], [62]. The spread of activity can be monitored electrophysiologically or using calcium-sensitive indicators.

To predict the spread of activity, we may decompose the excitation pattern into the eigenmodes and, by taking advantage of eigenmode independence, express temporal evolution as a superposition of the independently decaying eigenmodes. The initial redistribution of charge would correspond to the fast eigenmodes, whereas the long-term evolution of charge distribution would be described by the slow eigenmodes. Text S3 further discusses eigendecomposition and the interpretation of eigenmodes.

Understanding propagation of neuronal activity in response to stimulation (either for the complete network or for ablation studies) may also be carried out directly in the time domain by stepping through the dynamics in (6) or more electrophysiologically realistic nonlinear dynamics. Predictions of experimental results would then be determined by stimulating and measuring exactly as in the experiment itself.

Motifs

Several of the quantitative properties computed thus far measure global network structure or individual neuron properties. Now we analyze the frequency of various connectivity subnetworks among small local groups of neurons. Overrepresentation in the subnetwork distribution often displays building blocks of the network such as computational units [17], [63]. Since the gap junction network is undirected, there are four kinds of subnetworks that can appear over three neurons; this distribution is shown in Figure 5(a). As a null-hypothesis we use random network ensembles that preserve the degree distribution. We find that fully connected triplets are overrepresented.

Figure 5. Subnetwork distributions for the gap junction network.

Figure 5

Overrepresented subnetworks are boxed, with the Inline graphic-value from the step-down min-P-based algorithm for multiple-hypothesis correction [64], [77] (Inline graphic) shown inside. (a). The ratio of the Inline graphic-subnetwork distribution and for the mean of a degree-preserving ensemble of random networks (Inline graphic). The counts for the particular random networks that appeared in the ensemble are also shown. (b). The ratio of the Inline graphic-subnetwork distribution and for the mean of a degree and triangle-preserving ensemble of random networks (Inline graphic). The counts for the particular random networks that appeared in the ensemble are also shown.

Four neurons can be wired into 11 kinds of subnetworks; this distribution is shown in Figure 5(b). In the case of quadruplets, the null-hypothesis preserves the degree for each neuron and the number of triangles. A numerical rewiring procedure is used to generate samples from these random network ensembles [38], [64], since no analytical expression for expected subnetwork counts is extant [65]. We find that a “fan” (motif #7) and a “diamond” (motif #10) are overrepresented.

Note that neurons participating in motifs also make connections with neurons outside of the motif, which are traditionally not drawn in putative functional circuits [8], [60]. Such putative functional circuit diagrams may even omit connections within the motif [8], [60], which we do not allow.

Chemical Synapse Network

Now we consider the chemical synapse network. Recall that due to structural differences between presynaptic and postsynaptic ends of a chemical synapse, electron micrographs can be used to determine the directionality of connections. Hence the adjacency matrix is not symmetric as it was for the gap junction network.

Basic structure and connectivity

The network that we analyze consists of Inline graphic neurons and Inline graphic directed connections implemented by one or morhemical synapses. The adjacency matrix of the network shown in Figure 1 is suggestive of a three-layer architecture. The distribution of connections between categories, Table 3 in Text S4, reveals that each chemical subnetwork is characterized by a high number of recurrent connections, just as for the gap junction. However, the majority of connections with other subnetworks is consistent with feedforward information processing (sensory to interneuron and interneuron to motorneurons). Therefore, a three-layer network abstraction may be more valuable for chemical synapses than for gap junctions.

There are two different definitions of connectivity for directed networks. A weakly connected component is a maximal group of neurons which are mutually reachable by possibly violating the connection directions, whereas a strongly connected component is a maximal group of neurons that are mutually reachable without violating the connection directions. The whole chemical synapse network is weakly connected and can be divided into a giant strongly connected component with Inline graphic neurons, a smaller strongly connected component of Inline graphic neurons, and Inline graphic neurons that are not strongly connected (Table 4 in Text S4).

The random directed network corresponding to the chemical network is fully weakly connected, even when the degree distribution is taken into account (see Methods). A strongly connected giant component as small as in the chemical network is not likely in a random network (see [66]). Thus, the chemical network is more segregated than would be expected for a random network.

Distributions of degree, multiplicity and the number of terminals

Since chemical synapses form a directed network, neuron connectivity is characterized by in-degrees (the number of incoming connections) and out-degrees (the number of outgoing connections) rather than simply degrees. The joint distribution of in-degrees and out-degrees is shown in Figure 6(a). As can be seen by the distribution clustering around the diagonal line, the in-degrees and out-degrees are correlated; the Pearson correlation coefficient is Inline graphic (Inline graphic-value Inline graphic), very similar to the Pearson correlation coefficient of an email network, Inline graphic, though the email network was much larger (Inline graphic) [67].

Figure 6. Degree distribution (a) and survival functions for the distributions of in-/out-degree, multiplicity, and in-/out-number of synaptic terminals in the chemical synapse network (b)–(f).

Figure 6

Neurons or connections with unusually high statistics are labeled. The tails of the distributions can be fit by a power law with exponents Inline graphic for in-degree (b); Inline graphic for out-degree (c); and Inline graphic for out-number (f). The exponents for the survival function fits can be obtained by subtracting one. The survival function of the multiplicity distribution for Inline graphic can be fit by a stretched exponential of the form Inline graphic where Inline graphic and Inline graphic (d). No satisfactory fit was found for the distribution of in-numbers (e).

The survival functions associated with the marginal distributions of in-degrees and out-degrees are shown in Figures 6(b) and 6(c) respectively. The mean number of incoming and outgoing connections is Inline graphic each. We attempt to fit these distributions. The tails of the two distributions can be satisfactorily fit by power laws with exponents Inline graphic and Inline graphic respectively. Exponential fit is ruled out (Inline graphic-valueInline graphic) for the in-degree but not for the out-degree distribution. In the latter case, the log-likelihood is insignificantly lower for the exponential decay than for the power law.

Multiplicity of connection, Inline graphic, is the number of synapses in parallel from neuron Inline graphic to neuron Inline graphic. The corresponding survival function (including unconnected pairs) is shown in Figure 6(d). The mean number of synapses per connection (excluding unconnected pairs) is Inline graphic. The tail of the distribution can be fitted by an exponential, but not by a power law (Inline graphic-valueInline graphic). In addition, the whole distribution (Inline graphic) can be fit by a stretched exponential (or Weibull) distribution, Inline graphic with the scale parameter Inline graphic and the shape parameter Inline graphic. A stretched exponential applied to the whole distribution has the same number of fitting parameters as an exponential decay fit to the tail starting with an adjustable Inline graphic. Log-likelihood comparison of the tail exponential and the stretched exponential favors the latter.

As for the gap junction network, we can also study the distribution of number of synaptic terminals on a neuron. This involves adding the multiplicities of the connections, rather than just counting the number of pre- or post-synaptic partners. The joint histogram (not shown) exhibits similar correlation as for the degree distribution, with Pearson correlation coefficient Inline graphic (Inline graphic-value Inline graphic).

Figures 6(e) and 56(f) show the marginal survival functions for the number of post-synaptic terminals (in-number) and the number of pre-synaptic terminals (out-number). The mean number of pre- and post-synaptic terminals is Inline graphic each. We were unable to find a satisfactory simple fit to the in-number distribution, Figure 6(e). The tail of the out-number distribution could be fit by a power law with exponent Inline graphic, but not by an exponential, Figure 6(f).

As for the gap junction network, we can identify central neurons (cf. [47], [68]) for the chemical network. The degree centrality in a directed network may be defined with respect to the in-degree or the out-degree. Interestingly, neuron AVAL has the largest in-degree and AVAR has the second largest in-degree, whereas AVAR has the largest out-degree and AVAL has the second largest out-degree, Figure 6(a).

Small world properties

In the strongly connected component, we can define the directed geodesic distance as the shortest path between two neurons that respects the direction of the connections. The distribution of the directed geodesic distance, Figure 1(b) in Text S4, is characterized by the mean path length, Inline graphic computed over all pairs of neurons in the strongly connected component. For a random network degree-matched to the chemical network, one would expect Inline graphic (Monte Carlo with Inline graphic) and the ratio Inline graphic.

Although there are several definitions of clustering for directed graphs in the literature [69], we use the clustering of the out-connected neighbors since it captures signal flow emanating from a given neuron:

graphic file with name pcbi.1001066.e210.jpg (8)

where Inline graphic is the number of connections between out-neighbors of neuron Inline graphic, Inline graphic is the number of out-neighbors of Inline graphic, and Inline graphic measures the density of connections in the neighborhood of neuron Inline graphic. For the chemical network, the clustering coefficient is Inline graphic. For a degree-matched random network we computed the clustering coefficient Inline graphic (Monte Carlo with Inline graphic) and the ratio Inline graphic. Therefore, the chemical strongly connected component is a small-world network with Inline graphic.

For directed networks, measures of in-closeness and out-closeness may be defined using the average directed geodesic distance. In particular, the normalized in-closeness is the average geodesic distance from all other neurons to a given neuron:

graphic file with name pcbi.1001066.e222.jpg (9)

and the out-closeness is the average geodesic distance from a given neuron to all other neurons:

graphic file with name pcbi.1001066.e223.jpg (10)

where Inline graphic is the number of neurons. Normalized centralities are the inverses: Inline graphic and Inline graphic. The motivation behind these indices is similar to that in the gap junction case. In-closeness central neurons can be easily reached from all other neurons in the network. Out-closeness central neurons can easily reach all other neurons in the network. Normalized in-closeness centrality Inline graphic and normalized out-closeness centrality Inline graphic are weakly anti-correlated, with Pearson correlation coefficient Inline graphic (Inline graphic-value Inline graphic).

For the giant component of the chemical network, the most in-closeness central neurons include AVAL, AVAR, AVBR, AVEL, AVER, and AVBL. All are command interneurons involved in the locomotory circuit; these neurons are also central in the gap junction network. The in-closeness centrality of command interneurons may indicate that in the C. elegans nervous system, signals can propagate efficiently from various sources towards these neurons and that they are in a good position to integrate it.

The most out-closeness central neurons include DVA, ADEL, ADER, PVPR, AVJL, HSNR, PVCL, and BDUR. Only PVCL is a command interneuron involved in locomotion. The neuron DVA is an interneuron that performs mechanosensory integration; ADEL/R are sensory dopaminergic neurons in the head; and the other central neurons are interneurons in several parts of the worm. The out-closeness centrality of these neurons may indicate that signals can propagate efficiently from these neurons to the rest of the network and that they are in a good position for broadcast.

Spectral properties

Although chemical synapses are likely to introduce more nonlinearities than gap junctions, linear systems analysis can provide interesting insights, especially in the absence of other tools. Such an approach has additional merit in C. elegans, where neurons do not fire classical action potentials [58] and have chemical synapses that likely release neurotransmitters tonically [54]. To justify such analysis, a system of linear equations may be derived by approximating sigmoidal synaptic transmission functions with a Taylor series expansion around an equilibrium point [54].

A major source of uncertainty in linear systems analysis of the chemical network is the unknown sign of connections, i.e. excitatory or inhibitory, due to the difficulty in performing electrophysiology experiments. We use a rough approximation that GABAergic neurons make inhibitory synapses, whereas glutamergic and cholinergic neurons form excitatory synapses [70], but see [60]. The following Inline graphic neurons express GABA [71]: DVB, AVL, RIS, DD01–DD06, VD01–VD13, and the four RME neurons.

Similarly to the gap junction network, we write the system of linear differential equations for the chemical synapse network [53], [54]:

graphic file with name pcbi.1001066.e233.jpg (11)

where Inline graphic is the membrane potential of neuron Inline graphic measured relative to the equilibrium, Inline graphic is the membrane capacitance of neuron Inline graphic, Inline graphic is the conductance in neuron Inline graphic contributed by a chemical synapse in response to voltage Inline graphic measured relative to the equilibrium and Inline graphic is the membrane conductance of neuron Inline graphic. Assuming that each neuron has the same capacitance Inline graphic and each chemical synapse contact has the same conductance Inline graphic, i.e. Inline graphic, we can rewrite this equation in terms of the time constant Inline graphic as:

graphic file with name pcbi.1001066.e247.jpg (12)

To avoid redundancy we defer analyzing this system of differential equations to the next section, where we consider the combined system including both gap junctions and chemical synapses.

Motifs

We also find subnetwork distributions for the chemical synapse network. Since the network is directed, there are many more possible subnetworks. In particular there are Inline graphic possible subnetworks on two neurons and Inline graphic possible subnetworks on three neurons. We identify overrepresented subnetworks by comparing to random networks, generated with a rewiring procedure [38], [64]. Such random network ensembles preserve in-degree and out-degree in the case of doublets and, additionally, the numbers of bidirectional and unidirectional connections for each neuron in the case of triplets.

Figures 7(a) and 7(b) show the subnetwork distributions on two and three neurons, respectively. We find that the C. elegans network contains similar overrepresented subnetworks as found by analyzing incomplete data [17], [64]. For example, there is greater reciprocity in the chemical network than would be expected in a random network. Similarly, triplets with connections (of any direction) between each pair of neurons (seven rightmost triplets in Figure 7(b)) collectively occur with much greater frequency than would be expected for a random network.

Figure 7. Subnetwork distributions for the chemical synapse network.

Figure 7

Overrepresented subnetworks are boxed, with the Inline graphic-value from the step-down min-P-based algorithm for multiple-hypothesis correction [64], [77] (Inline graphic) shown inside. (a). The ratio of the Inline graphic-subnetwork distribution and the mean of a random network ensemble (Inline graphic). Realizations of the random network ensemble are also shown. (b). The ratio of the Inline graphic-subnetwork distribution and the mean of a random network ensemble (Inline graphic). Realizations of the random network ensemble are also shown.

Overrepresentation of reciprocal [8, Ch. 7] and triangle [7] motifs were previously noted. Such overrepresentation would arise naturally if proximity was a limiting factor for connectivity, however there is no evidence for this limitation. Thus, motifs may have a functional role.

Full Network

Having considered the gap junction network and the chemical synapse network separately, we also examine the two networks collectively. To study the two networks, one may either look at a single network that takes the union of the connections of the two networks or one may look at the interaction between the two networks.

Single combined network

First we look at a combined network, which is produced by simply adding the adjacency matrices of the gap junction and chemical networks together, while ignoring connection weights. Thus we implicitly treat gap junction connections as double-sided directed connections. This new network consists of Inline graphic neurons and Inline graphic directed connections. It has one large strongly connected component of Inline graphic neurons and Inline graphic strongly isolated neurons. The five isolated neurons are IL2DL/R, PLNR, DD06, and PVDR; this set is simply the intersection of the isolated neurons in the gap junction and chemical networks and does not seem to have any commonalities among members. Of course, it follows that since the chemical network is a single weakly connected component, this combined network is also a single weakly connected component.

Naturally, the combined network is more compact than the individual networks. The mean path length Inline graphic, the geodesic distance distribution (Figure 1(c) in Text S4) becomes narrower. For a random network degree-matched to the combined network, one would expect Inline graphic (Monte Carlo with Inline graphic) and the ratio Inline graphic. The clustering coefficient for the combined network is Inline graphic. The clustering coefficient for a degree-matched random network Inline graphic (Monte Carlo with Inline graphic) and the ratio Inline graphic. Therefore, the combined network is small-world, just like individual networks, with Inline graphic.

Turning to closeness centrality, the most in-close central neurons are AVAL/R, AVBR/L, and AVEL/R, as would be expected from the individual networks. The most out-close central neurons are DVA, ADEL, AVAR, AVBL, and AVAL, which include the top out-close neurons for both individual networks.

We can also calculate the degree distribution of this combined network. The Pearson correlation coefficient between the in-degree and out-degree is Inline graphic (Inline graphic-value Inline graphic); it is not surprising that the coefficient is so large considering that the gap junctions introduce an in- and out-connection simultaneously. Similar to the chemical synapse network, the tails of both the in-degree and the out-degree survival functions (Figures 4(a) and 4(b) in Text S4) can be fit with power laws. The tail of the out-degree could also be fit by an exponential decay, albeit with lower likelihood.

The neurons with the greatest degree centrality are AVAL and AVAR. As for the chemical synapse network, neuron AVAL has the largest in-degree and AVAR has the second largest in-degree, whereas AVAR has the largest out-degree and AVAL has the second largest out-degree (Figures 4(a) and 4(b) in Text S4). The next two neurons are AVBL/R in both in-degree and out-degree senses.

As for the chemical synapse network, the tail of the out-number distribution was fit by a power law and the tail of the in-number distribution could not be fit satisfactorily. The tail of the out-number distribution could also be fit by an exponential, albeit with lower likelihood. The multiplicity can be fit satisfactorily by a stretched exponential.

Spectral properties

In this section we apply linear systems analysis to the combined network of chemical synapses and gap junctions taking into account multiplicities of individual connections. Due to our ignorance about the relative conductance of a single gap junction and of a single chemical synapse, we assume that they are equal. By combining equations (6) and (12) we arrive at:

graphic file with name pcbi.1001066.e272.jpg (13)

where Inline graphic is negative if neuron Inline graphic is GABAergic and positive otherwise.

We proceed to find a spectral decomposition for the combined network. To avoid trivial eigenmodes, we restrict our attention to the strongly connected component of the combined network containing Inline graphic neurons. As before, we ignore the Inline graphic term and only study the matrix Inline graphic. Since Inline graphic is not symmetric, eigenvalues and eigenmodes may be complex-valued, occurring in complex conjugate pairs. Eigenvalues are plotted in the complex plane in Figure 8(a).

Figure 8. Linear systems analysis for the strong giant component of the combined network.

Figure 8

(a). Eigenvalues plotted in the complex plane. (b). The eigenmode associated with eigenvalue Inline graphic (marked cyan in panel (c)). (c). Scatterplot showing the sparseness and decay constant of the eigenmodes. (d). Sparse and slow eigenmodes of the combined network (marked red in panel (c)). The real parts of the eigenmodes corresponding to Inline graphic, and Inline graphic are shown. The eigenmodes are labeled with neurons that take value above a fixed absolute value threshold. Neurons with negative values are in red, whereas neurons with positive values are in black.

What is the meaning of complex eigenvalues? The imaginary part of an eigenvalue is the frequency at which the associated eigenmode oscillates. The real part of an eigenvalue determines the amplitude of the oscillation as it varies with time. Eigenmodes that have an eigenvalue with a negative real part decay with time, whereas eigenmodes that have an eigenvalue with a positive real part grow with time. When examining the temporal evolution of the eigenmodes whose eigenvalues are shown in Figure 8(a), one should keep in mind that the ignored Inline graphic term would shift the real part of the eigenvalues towards more negative values.

As for the gap junction network alone, we can look for eigenmodes that may have functional significance. For example, the sixth eigenmode of the combined network, Figure 8(b), includes neurons that are involved in sinusoidal body movement. As before, one may focus on sparse and slow eigenmodes for ease of investigation. The distribution of Inline graphic norm and real part of eigenvalues is shown in Figure 8(c), and twelve of the sparsest and slowest eigenmodes of the combined network are plotted in Figure 8(d).

Having the eigenspectrum of the combined network allows one to calculate the response of the network to various perturbations. By decomposing sensory stimulation among the eigenmodes and following the evolution of each eigenmode, one could predict the worm's response to the stimulation. A similar calculation could be done for artificial stimulation of the neuronal network, induced for example, using channelrhodopsin [26], [61], [62].

As noted for the gap junction alone, the network may also be studied in the time domain directly by stepping through the dynamics in (13) or more electrophysiologically realistic nonlinear dynamics.

Interaction between networks

We have measured the structural properties of the combined network formed by adding together the adjacency matrices of the gap junction and chemical synapse network, however it is unclear how they interact. The two networks could be independent, or their connections could overlap more or less often than by chance.

To investigate how the two networks overlap, we look at local structure. Figure 9 shows the likelihood ratios of chemical synapse connections being absent, being unidirectional, and being bidirectional given the presence or absence of a gap junction between the same pair of neurons (see Methods). As can be seen, chemical synapses are more likely to be absent when there is no gap junction than when there is one. Unidirectional, and especially bidirectional, chemical synapses are more likely when there is a gap junction between given neurons. In this sense, the two networks are correlated, however it should be noted that when there is a gap junction, about Inline graphic of the time there is no chemical synapse.

Figure 9. Likelihood ratio for the possible chemical network doublets (horizontal axis) given the absence/presence of a gap junction between the two neurons (as indicated by the green marks).

Figure 9

Durbin had found that chemical and gap junction networks are essentially independent when imposing physical adjacency restrictions [8, Ch.∼7], but as noted above, there is no evidence that proximity is a limiting factor for connectivity in C. elegans. Thus, there may be a functional role for correlation/anticorrelation of the joint presence of gap junction and chemical connections.

Why might the presence of connections in two networks either be correlated or anticorrelated? One possibility is that correlated connections simultaneously perform different functions [72] whereas anticorrelation yields connections between distinct kinds of neuronal pairs [73], [74], [Tavoularis CR and Wicker SB, unpublished].

What are the different functions performed by chemical synapses and gap junctions that could lead to correlation? One possibility is that the two different functions are sign-inverting and non-inverting coupling. Gap junctions are non-inverting: higher potential in a neuron raises the potential in other gap-junction-coupled neurons. Chemical synapses, on the other hand, may be either excitatory (non-inverting) or inhibitory (inverting). When the likelihood computations are repeated considering only neuron pairs where the presynaptic neuron is known to be GABAergic [71], there is not much change, see Figure 9. This suggests that the primary purpose of overlapping inhibitory chemical synapses is not to counter excitatory gap junctions. Some other reason, such as differing temporal properties or robustness from redundancy, is needed to explain correlation. This result, however, is only suggestive since the neurotransmitters and their action on postsynaptic receptors in many neurons have not been determined.

Another measure of the interaction between the two networks is the correlation between the degree sequences. The Pearson correlation coefficient between the gap junction degree and the chemical network in-degree is greater than the Pearson correlation coefficient between the chemical network in-degree and out-degree. The Pearson correlation coefficient between the gap junction degree and the chemical network out-degree is less than the Pearson correlation coefficient between the chemical network in-degree and out-degree. This is shown in Table 1 where comparisons to Pearson correlation coefficients between randomly permuted degree sequences (see Methods) are also shown. Large Pearson correlation coefficients imply that neurons are ordered in similar ways according to degree centrality.

Table 1. Degree sequence correlation Coefficients.
gap/in gap/out in/out email [67]
correlation coefficient Inline graphic (Inline graphic-value) Inline graphic Inline graphic Inline graphic Inline graphic
avg. rand. perm. Inline graphic Inline graphic Inline graphic Inline graphic

The two networks seem to primarily reinforce each other with correlated structure rather than augment each other with anticorrelated connections.

Robustness Analysis

Although the reported wiring diagram corrects errors in previous work and is considered self-consistent, one might wonder how remaining ambiguities and errors in the wiring diagram might affect the quantitative results presented. Furthermore there are connectivity pattern differences among individual worms; these individual variations may have similar effects on the analysis as errors and ambiguities.

For network properties that are defined locally, such as degree, multiplicity, and subnetwork distributions, clearly small errors in the measured wiring diagram lead to small errors in the calculated properties. For global properties such as characteristic path length and eigenmodes, things are less clear.

To study the robustness of global network properties to errors in the wiring diagram, we recalculate these properties in the wiring diagrams with simulated errors. We simulate errors by removing randomly chosen synaptic contacts with a certain probability and assigning them to a randomly chosen pair of neurons. Then, we calculate the global network properties on the ensemble of edited wiring diagrams. The variation of the properties in the ensemble gives us an idea of robustness.

First, we explore the robustness of the small world properties and the giant component calculations. We edit wiring diagrams by moving each gap junction contact with Inline graphic probability and chemical synapse contact with Inline graphic probability. Tables 5 and 6 in Text S4 show the global properties for Inline graphic random networks obtained by editing the experimentally measured network. These tables suggest that our quantitative results are reasonably robust to ambiguities and errors in the wiring diagram.

Properties for the neuronal network from prior work in [13] are also shown for comparison. The number of synaptic contacts that must be moved to achieve this network (editing distance) roughly corresponds to that with Inline graphic probability.

Second, we characterize robustness for the linear systems analysis. Because of greater sensitivity of the eigenvalues to errors, we edit wiring diagrams by moving each gap junction contact with Inline graphic probability and a chemical synapse contact with Inline graphic probability. The spectra for Inline graphic randomly edited networks along with the spectrum for the measured network (Figure 8(a)) are shown in Figure 10. Although the locations of eigenvalues shift in the complex plane, many of them move less than the nearest neighbor distance and remain isolated.

Figure 10. The spectrum of the giant component of the combined network matrix Inline graphic (red), ε-disks around the spectrum (light blue), spectra of Inline graphic randomly edited networks (blue), and the ε-pseudospectrum (orange).

Figure 10

The value Inline graphic is used (the average spectral norm of the Inline graphic editing matrices was Inline graphic). The spectrum of the giant component of the combined network matrix Inline graphic under an alternate quantitation of send_joint synapses is also shown (green).

In addition to considering the effect of typical random edits, we can characterize the effect of worst-case errors on the eigenvalues using the Inline graphic-pseudospectrum [75], which gives the eigenvalue loci Inline graphic for all perturbations by matrices of norm Inline graphic (Figure 10). For the gap junction, Inline graphic is simply the set of disks of radius Inline graphic around the eigenvalues, but for the chemical and combined networks, Inline graphic and Inline graphic are larger. In the worst case scenario, most eigenmodes become mixed up.

Electron micrographs of chemical synapses have a further ambiguity when more than one postsynaptic partner receives input at a release site. We treated such polyadic (send_joint) synapses no differently than other synapses, but one might alternatively determine multiplicity by counting such synapses at Inline graphic strength. This alternate quantitation clearly does not change statistics that ignore multiplicity; the change in the spectrum is depicted in Figure 10.

Small deviations from equality when weighting gap junctions and chemical synapses to form the combined network yield similar spectral changes as the alternate quantitation of chemical synapses displayed in Figure 10.

Discussion

We have presented a corrected and more comprehensive version of the neuronal wiring diagram of hermaphrodite C. elegans using materials from White et al. [7] and new electron micrographs. Despite the significant additions, this wiring diagram is still incomplete due to methodological limitations discussed in the An Updated Wiring Diagram section. Yet, our work represents the most comprehensive mapping of the neuronal wiring diagram to date. The sensitivity of our analysis to methodological limitations (and to network structure variation among individual organisms) is discussed in the Robustness Analysis section.

We proposed a convenient way to visualize the neuronal wiring diagram. The corrected wiring diagram and its visualization should help in planning experiments, such as neuron ablation.

Next, we performed several statistical analyses of the corrected wiring, which should help with inferring function from structure.

By using several different centrality indices, we found central neurons, which may play a special role in information processing. In particular, command interneurons responsible for worm locomotion have high degree centrality in both chemical and gap junction networks. Interestingly, command interneurons are also central according to in-closeness, implying that they are in a good position to integrate signals. However, most command interneurons do not have highest out-closeness, meaning that other out-closeness central neurons, such as DVA, ADEL/R, PVPR, etc., are in a good position to deliver signals to the rest of the network.

Linear systems analysis yielded a principled methodology to hypothesize functional circuits and to predict the outcome of both sensory and artificial stimulation experiments. We have identified several modes that map onto previously identified behaviors.

Networks with similar statistical structural properties may share functional properties thus providing insight into the function of the C. elegans nervous system. To enable comparison of the C. elegans network with other natural and technological networks [76], we computed several structural properties of the neuronal network. In particular, the gap junction network, the chemical synapse network, and the combined neuronal network may all be classified as small world networks because they simultaneously have small average path lengths and large clustering coefficients [14].

The tails of the degree and terminal number distributions for the gap, chemical and combined networks (with the exception of the in-numbers) follow a power law consistent with the network being scale-free in the sense of Barabási and Albert [40]. The tails of some distributions can also be fit by an exponential decay, consistent with a previous report [15]. However, we found that exponential fits for the tails have (sometimes insignificantly) lower log-likelihoods than power laws, making the exponential decay a less likely alternative. For whole distributions, neither distribution passes the Inline graphic-value test; if one is forced to choose, the exponential decay may be a less poor alternative.

Several statistical properties of the C. elegans network are similar to those of the mammalian cortex. In particular, the whole distribution of C. elegans chemical synapse multiplicity is well-fit by a stretched exponential (or Weibull) distribution (Figure 6(d)). Taking multiplicity as a proxy of synaptic connection strength, this is reminiscent of the synaptic strength distribution in mammalian cortex, which was measured electrophysiologically, [30], [77]. The definition of stretched exponential distribution is slightly different [30], but has the same tail behavior. The stretch factor is Inline graphic, close to that in the cortical network.

In addition, we found that motif frequencies in the chemical synapse network are similar to those in the mammalian cortex [77]. Both reciprocally connected neuron pairs and triplets with a connection between every pair of neurons (regardless of direction) are over-represented. The similarity of the connection strength and the motif distributions may reflect similar constraints in the two networks. Since proximity is unlikely to be the limiting factor, we suggest that these constraints may reflect functionality. We found that the chemical synapse and the gap junction networks are correlated, which may provide insight into their relative roles.

To conclude the paper, let us note that our scientific development was not hypothesis-driven, but rather exploratory. Yet we hope that the reported statistics will help in formulating a theory that explains how function arises from structure.

Materials and Methods

Data Acquisition

This section describes the methods used to determine neuronal connectivity; see [78] for further details.

We started assembling the wiring diagram by consolidating existing data from both published and unpublished sources. Using J. G. White et al.'s The Mind of a Worm (MOW) [7] as the starting point, we extracted wiring data from diagrams, figures, tables, and text (for example, see [7, Appendix A, pp. 118–122] on neuron AVAL/R). The connectivity of each neuron, its synaptic partner, and synaptic type (chemical, gap junction, neuromuscular) was manually entered into an electronic database. In the ventral cord, determining this level of synaptic specification was complicated by the fact that connections were recorded by neuron class. For example, bilateral neurons PVCL and PVCR were simply listed as PVC. We assigned proper connections to the appropriate left/right neuron by referring to White and coworker's original laboratory notebooks and original electron micrographs. In some cases, the number of synapses for a given neuron class in MOW differed from the sum of connections for the bilateral pairs in the notebooks and/or electron micrographs. The synaptic value of these neurons was determined by taking the value in MOW and dividing it between the left/right neurons proportionally to the values in the notebooks and/or electron micrographs.

Next we incorporated R. M. Durbin's data for the anterior portion of the worm, reconstructed from animal N2U [8]. For neurons that projected beyond the nerve ring, only the anterior connections needed update. Since data from MOW did not specify the location of synapses, integration proved difficult. For these neurons, we obtained positional information by cross-referencing Durbin's data against original electron micrographs and his handwritten annotations in White's laboratory notebooks. Only synapses located in regions addressed by Durbin were included. Connections in the middle and tail regions of the worm were mostly unaffected by these updates.

Studies based on green fluorescent protein (GFP) reporters mostly confirm the electron micrograph reconstructions described in MOW. A few differences between GFP-stained neurons and White's work have been observed [Hobert O and Hall DH, unpublished]. Notably, the anterior processes of DVB and PVT could have been mistakenly switched in MOW [7]. Based on these findings, we reversed the connections for neurons DVB and PVT anterior to the vulva.

Most published works have focused on the neck and tail regions of C. elegans where most neuron cell bodies reside. Reconstructions of neurons in the mid-body of the worm, on the other hand, are scant and incomplete. From a combination of published works [7], [8], [10], [79], we found that wiring data for Inline graphic neurons had large gaps or were missing entirely. Sixty-one of these were motor neurons in the ventral cord. Two were excretory neurons (CANL/R) that do not appear to make any synapses. The remaining neuron, RID, is the only process in the dorsal cord that extends over the length of the animal.

At the C. elegans archive (Albert Einstein College of Medicine), we uncovered a large number of reconstruction records in White et al.'s laboratory notebooks. These notebooks identified neurons by different color code labels depending on the animal, the location of the neurite (ventral or dorsal), and magnification of the electron micrograph. To ascertain the identity of the neurons, we relied on a combination of color code tables and comparisons of common anatomical structures between electron micrograph prints. In the end, we identified notes for full reconstructions of Inline graphic of the aforementioned neurons. Partial connectivity data for the remaining Inline graphic were also available where Inline graphic neurons have partial/missing dorsal side connections and Inline graphic neurons have partial ventral side connections. We checked the connections of all (both published and unpublished) neurons in the ventral cord against electron micrographs used by White and coworkers. Over Inline graphic updates were made to the original notes and published reconstructions. Many of these updates were additions of previously missed neuromuscular junctions between ventral cord motor neurons and body wall muscles.

We found that a large section on the dorsal side of the worm, from just anterior to the vulva to the pre-anal ganglion, was never electron micrographed at high power magnification. This dearth of imagery was why so many neurons were missing dorsal side reconstructions. Using original thin sections for the N2U worm prepared by White et al., we produced new high power electron micrographs of this dorsal region. Due to the condition of the sections, only one of every Inline graphicInline graphic sections was imaged. These new electron micrographs extended nearly Inline graphic on the dorsal side. New dorsal side data for 3 neurons (DA5, DB4, DD3) were obtained from these electron micrographs. Resource constraints prevented us from covering the entire dorsal gap.

From our compilation of wiring data, including new reconstructions of ventral cord motor neurons, we applied self-consistency criteria to isolate neurons with mismatched reciprocal records. The discrepancies were reconciled by checking against electron micrographs and the laboratory notebooks of White et al. Connections in the posterior region of the animal were also cross-referenced with reconstructions published in [10]. Reconciliation involved Inline graphic synapses for Inline graphic neurons (Inline graphic chemical “sends,” Inline graphic chemical “receives,” and Inline graphic electrical junctions).

Giant Component for Random Networks

For a random network with Inline graphic neurons and probability Inline graphic of a connection being present, if the constant Inline graphic, then the size of the giant component is asymptotically normal with mean Inline graphic and variance Inline graphic [80, p. 120]. These quantities are given by

graphic file with name pcbi.1001066.e337.jpg (14)

where

graphic file with name pcbi.1001066.e338.jpg (15)

and Inline graphic is the Lambert Inline graphic-function. If we take Inline graphic to be Inline graphic and Inline graphic to be Inline graphic, then Inline graphic. Using the asymptotic approximation, the size of the giant component is distributed approximately normally with mean Inline graphic and variance Inline graphic. Thus the probability of having a giant component of size Inline graphic, which is over Inline graphic standard deviations from the mean, is about Inline graphic. If a precise evaluation of this infinitesimal value is desired, large deviations techniques, rather than the asymptotic approximation may be more valid [81].

To apply this method to the weakly connected component of a directed network, we are interested in the undirected network formed by adding a connection between two neurons if there is a connection in either direction. For a random directed network with probability Inline graphic of presence of a directed connection, the probability of a connection existing in either direction is Inline graphic. Taking Inline graphic to be Inline graphic, Inline graphic is Inline graphic. Then for an undirected random network with Inline graphic and the specified Inline graphic, Inline graphic is Inline graphic. Then the size of the giant component is distributed approximately normally with mean Inline graphic and variance Inline graphic. Thus the probability of the giant weakly connected component containing all the neurons in such a random network is overwhelming. Again, large deviations techniques should be used to get a precise evaluation of the probability of being on the order of Inline graphic standard deviations away from the mean.

Giant Component for Random Networks with Given Degree Distribution

Consider the ensemble of random networks with a given degree distribution [82]. For the gap junction network, the generating function corresponding to the measured degree distribution is

graphic file with name pcbi.1001066.e364.jpg

with derivative

graphic file with name pcbi.1001066.e365.jpg

Therefore Inline graphic. The generating function Inline graphic is then

graphic file with name pcbi.1001066.e368.jpg

As shown in [82], the expected fraction of the network taken up by the giant component, Inline graphic, is Inline graphic, where Inline graphic is the smallest non-negative solution to Inline graphic. In our case, we find Inline graphic, and so Inline graphic. That is to say, one would expect the giant component to consist of Inline graphic neurons.

Using the computed Inline graphic and Inline graphic, we can find the average component size excluding the giant component, which turns out to be Inline graphic.

For the symmetrized chemical network, the generating function corresponding to the measured degree distribution is

graphic file with name pcbi.1001066.e379.jpg

with derivative

graphic file with name pcbi.1001066.e380.jpg

Therefore Inline graphic. The generating function Inline graphic is then

graphic file with name pcbi.1001066.e383.jpg

The expected fraction of the network taken up by the giant component, Inline graphic, is Inline graphic, where Inline graphic is the smallest non-negative solution to Inline graphic. Here Inline graphic is found to be Inline graphic, and so Inline graphic. That is to say, one would expect the giant component to consist of Inline graphic neurons.

Path Length for Random Networks with Given Degree Distribution

Continuing from the previous subsection, we find the derivative of the generating function Inline graphic for the gap junction network to be

graphic file with name pcbi.1001066.e393.jpg

Thus Inline graphic. Letting Inline graphic and Inline graphic, it is shown in [82, (53)], that the expected path length is

graphic file with name pcbi.1001066.e397.jpg (16)

Fitting Tails of Distributions

To find functional forms of the tails of various distributions, we follow the procedure outlined in [42]. For the candidate functional forms—say, the power law Inline graphic and the exponential decay Inline graphic—we perform the following steps. First, we find the optimal parameter of the fit by maximizing the log-likelihood and the optimal starting point of the fit by minimizing the Kolmogorov-Smirnov statistic. Second, we evaluate the goodness of fit by calculating the Inline graphic-value that the observed data was generated by the optimized distribution using Inline graphic as a criterion for plausibility. Finally, if several distributions pass the Inline graphic-value test we compare their log-likelihoods to find the most probable one.

Circuits in Eigenmodes

Let us bound the probability of finding an eigenmode that comprises a random set of neurons. Let Inline graphic be the number of neurons in the network being analyzed. Let Inline graphic be the number of neurons that appear strongly in the Inline graphicth eigenmode and let Inline graphic. Furthermore let Inline graphic be the number of neurons in the random set, which one might endeavor to investigate as a putative functional circuit derived from an eigenmode.

Now go through each eigenmode and add to a list all possible unordered Inline graphic-tuples of strong neurons. Even if all of these are unique, the size of the list is upper-bounded by Inline graphic which itself is upper-bounded by Inline graphic.

Additionally, we can compute the number of all unordered Inline graphic-tuples of neurons. This number is Inline graphic.

Thus, if a random set of neurons was selected from all possible sets of neurons, the probability Inline graphic that there would be an eigenmode containing all of them is upper-bounded as

graphic file with name pcbi.1001066.e414.jpg

Suppose we are interested in putative functional circuits of size Inline graphic in the giant component of the gap junction network, which has Inline graphic and from Figure 2 in Text S4 take Inline graphic. Then even the loosest upper-bound yields

graphic file with name pcbi.1001066.e418.jpg

and so finding a random set of neurons in an eigenmode is unlikely.

Suppose we know Inline graphic functional circuits of size Inline graphic through molecular biology and want to know the probability of at least one of them appearing in the eigenmodes by chance. By the union bound (Boole's inequality), this probability is less than Inline graphic. If we take Inline graphic and Inline graphic, the probability of a known functional circuit appearing in the eigenmodes by chance is less than Inline graphic for the giant component of the gap junction network.

Gap Junction–Chemical Synapse Likelihoods

The likelihood ratios shown in Figure 9 are the following quantities, empirically estimated from either all neuron pairs or pairs with a GABAergic presynaptic neuron. The first is

graphic file with name pcbi.1001066.e425.jpg

The second is

graphic file with name pcbi.1001066.e426.jpg

and the third is

graphic file with name pcbi.1001066.e427.jpg

Degree Correlation Coefficients

Table 1 shows the Pearson correlation coefficients between neuron degree sequences. The average Pearson correlation coefficients of randomly permuted degree sequences from Inline graphic trials are also shown for comparison. The standard deviation is also shown since the distributions of the three randomized correlation coefficients were all nearly symmetric about zero.

Inline graphic-Pseudospectrum Computation

We used the MATLAB package EigTool [83] to compute pseudospectra.

MATLAB Code and Data

Note that MATLAB code for computing several network properties is available at http://mit.edu/lrv/www/elegans/. This collection of software may be used not only to reproduce most of the figures in this paper, but also for future connectomics analyses.

The collected data is available from the WormAtlas [22] as well as from the same website as the MATLAB code.

Supporting Information

Text S1

Algorithm for directed network drawing.

(0.09 MB PDF)

Text S2

Algebraic form of survival functions.

(0.09 MB PDF)

Text S3

Eigendecomposition.

(0.16 MB PDF)

Text S4

Supporting figures and tables.

(0.08 MB PDF)

Acknowledgments

We thank John White and Jonathan Hodgkin for the generous donation of the MRC/LMB archival documents and experimental materials to Hall's laboratory at AECOM, without which this study would not have been possible. We also thank Markus Reigl for providing some of the software used in this study. We thank Sanjoy K. Mitter, Scott Emmons, Leon Avery, Mark Goldman, Cori Bargmann, Alexander Teplyaev, Shawn Lockery and Gonzalo de Polavieja for helpful discussions and for commenting on the manuscript.

Footnotes

The authors have declared that no competing interests exist.

LRV was supported by the NSF Graduate Research Fellowship and NSF Grants 0325774, 0836720, and 0729069. BLC was the recipient of an Arnold and Mabel Beckman Graduate Student Fellowship of the Watson School of Biological Sciences. DBC was supported by National Institute of Mental Health Grant 69838, the Swartz Foundation, and a Klingenstein Foundation Award. The Center for C. elegans Anatomy is supported by National Institutes of Health Grant RR 12596 (to DHH). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Text S1

Algorithm for directed network drawing.

(0.09 MB PDF)

Text S2

Algebraic form of survival functions.

(0.09 MB PDF)

Text S3

Eigendecomposition.

(0.16 MB PDF)

Text S4

Supporting figures and tables.

(0.08 MB PDF)


Articles from PLoS Computational Biology are provided here courtesy of PLOS

RESOURCES