The variability of membrane contacts (Fig. 2, Extended Data Fig. 2) suggest that no single animal is representative of the population. We estimated the variability among membrane contact areas. a, The log-normalized empirical distribution of membrane contact areas (mean centered at 0, STD: standard deviation, red line: normal distribution with empirical mean and standard deviation, membrane contacts). We estimated the variability across the four datasets (L4 left, L4 right, adult left and adult right). For each conserved contact, we computed the mean and standard deviation of the membrane contact area across the four datasets (see Methods). b, Plot of the standard deviation versus mean contact area across the datasets, where each point is one contact. Similar to Extended Data Fig. 1a, we find no dependence of the variability on membrane contact area. Therefore, we estimate membrane contact area variability by the mean variability among all membrane contact areas. c, The distribution of standard deviations of membrane contact area for all contacts. Red dashed line: mean standard deviation. d-i, A stochastic spatial population model matches the above distributions by randomly perturbing membrane contact areas in the four datasets with multiplicative white noise with standard deviation () of 0.23 (Methods). d-f, Spatial population data perturbs the membrane contact areas while maintaining contact area and variability distributions that are similar to the empirical contact area distributions. g, Perturbed contact areas scale linearly with the empirical contact areas. h, The spread of perturbed contact areas (log of the perturbed contact area as a fraction of the empirical contact area) is mostly uniform across membrane contact areas. i-l, Neurite clusters obtained from a population of 1,000 perturbed individuals and 1,000 and perturbed individuals (perturbing left/right conserved contacts in the L4 and adult datasets). For each perturbed individual in each population we used a multi-level graph clustering algorithm to identify spatial clusters. Across each population, we computed the frequency that cell pairs cluster together, represented as an cluster frequency matrix (). A hierarchical clustering algorithm is used to sort the rows and columns of the cluster frequency matrix in order to minimize variation along the diagonal. Hence, cells pairs that frequently cluster together are sorted together on the cluster frequency matrix (Methods). Five largely overlapping subgroups of neurons emerge across different perturbations (see main text). i, Consensus subgroups are robust across datasets. and clusters visualized using row and column colors of the population cluster assignments (dashed box). j, The consensus subgroups are robust across different noise amplitudes. Clustering applied to populations generated by perturbations to using white noise with standard deviations 0 (empirical data), 0.12, 0.45 and 0.9. k,l, The consensus subgroups are robust across different spatial domains. k, Clustering applied to populations generated from the more spatially restricted subset of the neuropil considered by Moyle et al. (2020)34, which excluded the posterior lobe of the neuropil. l, Clustering applied to populations generated by perturbations to all reproducible membrane contact areas after restoring the smallest 35% contact areas to each of the L4, adult and datasets (Extended Data Fig. 2). For all cluster frequency matrices: Matrix element (, ) corresponds to the frequency that cells and cluster together across the 1000 perturbed individuals. Row and column orders minimize variance along the diagonal (Methods). Cell cluster assignments (color) follow the perturbed dataset (Figure 1b reproduced in dashed box). Top: dendrogram of the hierarchical clustering.