Abstract
The emergence of differences in the arrangement of cells is the first step towards the establishment of many organs. Understanding this process is limited by the lack of systematic characterization of epithelial organisation. Here we apply network theory at the scale of individual cells to uncover patterns in cell-to-cell contacts that govern epithelial organisation. We provide an objective characterization of epithelia using network representation where cells are nodes and cell contacts are links. The features of individual cells, together with attributes of the cellular network produce a defining signature that distinguishes epithelia from different organs, species, developmental stages and genetic conditions. The approach permits characterization, quantification and classification of normal and perturbed epithelia and establishes a framework for understanding molecular mechanisms that underpin the architecture of complex tissues.
Introduction
Investigating the interactions between components as a network provides a common platform to uncover signatures of complex systems1-5. While this approach has been recently exploited to investigate biological systems of different scales, ranging from interactions between molecules to interactions between species, application of network theory at the level of individual cells has been rather limited. In this study, we present a network based approach to understand general principles in the organisation of cells in an organism (i.e., epithelia). Early in animal development, cells in an epithelia begin to divide and alter their position, shape and size in stereotypical ways6-15. Despite being a highly dynamic process, this results in ordered, robust structure that ultimately leads to the formation of mature organs with cellular organisation suited to their specialised functions. Though we have an understanding of the contribution of genetic mechanisms (e.g., external signals and the associated gene regulatory pathways6, 14) and cellular mechanics (e.g., intrinsic patterns arising due to the rate of cell division8, 9, 11, 15 or cell re-arrangement due to anisotropy of cortical forces within individual cells6, 16-20) to the development of epithelial architecture in various model systems, we lack the means to objectively characterise and quantify the similarities and differences in the organisation of epithelia.
Previous studies have offered insights into epithelial organisation by focusing primarily on geometric characteristics of individual cells such as the cell area and the number of contacts8, 9, 18, 21 and have led to the formulation of empirical relationships such as Aboav-Weaire’s law and Lewis’ law22, 23. This has largely emphasized the similarity between different epithelia. However, a more comprehensive view of epithelial organisation can be achieved if one considers the higher order organisation of cells such as the patterns in the network of interactions between cells that typically characterises an epithelium. The ability to do this would provide a way to describe objectively an epithelium, facilitate the investigation of fundamental questions about its organisation and dynamics, and establish an objective basis for comparative studies of epithelia from different sources. Importantly, the network characteristics of epithelial organisation (as opposed to geometric features) are not readily assessed by eye. This implies that higher-order organisation may not be accounted for in our current understanding of epithelial architecture.
In this work we present an approach we term GNEO (Geometric and Network representation of Epithelial Organisation) which by combining network and geometric measures of epithelial organisation, addresses these issues. We show that our approach is able to capture a defining signature that distinguishes epithelia from different organs, species, developmental stages and genetic conditions. In this way, GNEO permits characterization, quantification and classification of normal and perturbed epithelia in an objective manner.
Results
The GNEO method for characterising epithelial organisation
In order to capture information about the spatial organisation of cells and the global features of an epithelium, we generated network representations of confocal images of epithelia based on the cell-cell contacts. This allows principles from graph theory and complex networks2, 3, 5, 24 to be used to investigate short- and long-range patterns in epithelial organisation. In such a network, the centre of each cell is treated as a node and two nodes are linked if the two cells are neighbors (i.e., physically contact each other) in the epithelium (Methods; Fig. 1a and Supplementary Fig. S1). For each image, we generated a ‘feature vector’ consisting of eight features (Box 1): the means and standard deviations of the cell area, degree (i.e., number of neighbors), clustering coefficient (the amount of interconnectedness among a cell’s immediate neighbors) and average degree of neighbors (the average number of neighbors of a cell’s neighbor). Thus, the mean values of the features in the feature vector represent information about the cell shape (area and degree) and the pattern of cell-to-cell contacts (degree, clustering coefficient and average degree of neighbors). While the degree is informative of the short-range pattern of contacts (the immediate neighbors of cell), the clustering coefficient and average degree of the neighbors represents the cell’s context and surrounding, thus reflecting higher order organization. In turn, the standard deviation values are indicative of the cell-to-cell variability (i.e., heterogeneity) of a feature in an epithelium.
Figure 1. GNEO approach and epithelial comparisons performed.
(a) Geometric and Network representation of Epithelial Organisation (GNEO) approach to characterise epithelial organisation. (1) Images from the confocal microscope are processed to get a light background with dark cell contours. (2) The processed image is the source for defining individual cells, as well as for determining the number of neighbours for every cell. (3) This information is used to produce an epithelial network where each cell is represented as a node and two nodes are connected if the two cells are neighbours in the epithelium. (4) A region of interest (ROI; shown in a green box) is selected for further analysis and the cells that border the ROI are excluded. (5) The average and standard deviation of area and three network features over all cells in an epithelium are calculated. (6) This information is represented as a feature vector, which is an 8-dimensional vector that characterises each epithelium. Each of the four features considered in this work is abbreviated by the symbols shown in this figure. (b) Schematic representation of the comparisons of epithelia from different sources performed in this study. Images of representative epithelial samples (2-pixel wide cell contour) from Drosophila (different tones of green labels, fly) and chicken (different tones of brown labels) are shown. The reference prepupal wing pouch epithelium is shown within a gray box. The text in gray denotes the relationship between epithelia from the different sources that were compared. space: spatially separated epithelia from the same organism; time: temporally separated epithelia from different stages of development; type: different type of epithelia (e.g., squamous and columnar); species: epithelia from different organisms (vertebrate (chick) and invertebrate (Drosophila)) and mutation: mutant epithelia.
For each type of epithelium, we collected a set of images from several different individuals and extracted the feature vectors in each case (Supplementary Table S1). This allows us to compare epithelia, for example from different developmental stages, tissues and species (Fig. 1b and Supplementary Fig. S2-S3), and the natural variation (i.e., individual-to-individual variability) in epithelial organisation. To compare different epithelia, we used multivariate statistical methods to identify the contribution of the different feature in the feature vector that best separate different epithelial types. We took advantage of an unsupervised and a supervised method, namely Principal Component Analysis (PCA) and Discriminant Analysis (DA)4, 24, 25 (Box 1 and Methods). Both these methods provide information about the relative contribution of the features that distinguish different epithelia, termed ‘feature weights’ (Fig. 2a and Supplementary Table S2). The statistical significance of the separation of the different epithelia was assessed using the MANOVA test (Methods).
Figure 2. Discriminant Analysis of the different epithelia.
(a) Colour coded matrix representation of the weights of the features contributing to the observed projection in the Discriminant Analysis of the different comparisons. A higher value (darker color) represents a relatively higher contribution of the feature to the separation. (b-g) Discriminant Analysis graphs of the comparisons of epithelia from different sources. (b) dWP-cNT. (c) cNT-cEE. (d) dWP-cNT-cEE. (e) dWP-dNP. (f) dWP-dWL. (g) dWP-dNP-cNT-cEE.
Comparison of epithelia from different organisms
To validate the approach, we first used the feature vector to compare visually distinct epithelia. For this we took advantage of the neural tube (samples cNT1 to cNT12) and embryonic ectoderm (cEE1 to cEE14) from chicken embryos, and the Drosophila wing imaginal disc from the prepupal stage (dWP1 to dWP16). Discriminant Analysis and PCA revealed that the epithelia from the two different organisms could be clustered into two distinct groups (Fig. 2b; cNT and dWP; DA; p=9.17×10−18; Supplementary Fig. S4; cEE and dWP; DA; p=2.41×10−21, Supplementary Fig. S5) demonstrating the efficacy of the method. The greater importance of the average degree of the neighbours (N) and the standard deviation of the degree (D) in the dWP-cNT separation (Fig. 2a,b and Supplementary Table S1) suggests that these two network features capture a certain defining signature that is independent of the cell area (which is comparable for these epithelial types).
Comparison of different epithelial types
To verify whether we could discriminate between different types of epithelia from the same organism, we compared the columnar neuroepithelium (cNT) of the chick to the squamous embryonic ectoderm (cEE). PCA and Discriminant Analysis of the feature vectors of these two epithelia revealed that they form two distinct groups (Fig. 2c; DA; p=4.49×10−16; Supplementary Fig. S6; PCA; p=4.27×10−10). In this case, the cEE dataset was more spread out, reflecting the greater heterogeneity among these samples. However, each sample was clearly separated from cNT epithelia. In addition, the method separated the Drosophila wing pouch (dWP), the chick neuroepithelium (cNT) and embryonic ectoderm (cEE) epithelia into three groups (Fig. 2d), demonstrating that it is sensitive to different types of epithelial organization. Multiple features contributed to the separation of these epithelia, suggesting that it was the combination of features that allowed the discrimination (Fig. 2a). This underscores the importance of global structure of the network. In particular, we found that the standard deviation of the non-geometric (i.e., network) features were more important in separating the different epithelial types, suggesting that, in this case, GNEO is able to capture patterns in epithelial organisation which are not visually apparent and are independent of cell area.
Comparison of epithelia from different developmental stages
We next compared more closely related epithelia. Discriminant Analysis on the feature vectors of the wildtype (WT) epithelia of the wing pouch (which develops into the adult wing blade; dWP1 to dWP16) and the notum (which develops into the adult thorax; dNP1 to dNP12) of the wing imaginal disc from the prepupal stage showed that these samples were only partially separable. This indicates a similarity in the overall organisation of both epithelia at this stage during development (Fig. 2e; DA; p=0.001), consistent with these regions comprising different areas of the same epithelial sheet. Moreover, a comparison of the prepupal wing pouch (dWP1 to dWP16) with the third instar larva wing pouch (dWL1 to dWL15) produced a discriminant graph with two groups, which also overlapped but were better separated (Fig. 2f; DA; p=3.45×10−5). Together these data suggest that the global organization of the wing epithelia change gradually during development and across an epithelial sheet. The GNEO approach provides a way to assess the relatedness of different epithelia through approaches that calculate distances between data points using standard approaches such as hierarchical clustering. Consistent with our observation, a comparison of four epithelial types (Discriminant Analysis and PCA of the Drosophila dWP and dNP and the chick cEE and cNT) showed that dWP and dNP forms an overlapping cluster whereas the cEE and cNT form separate clusters (Fig. 2g). These observations confirm that the epithelial organisation of the prepupal notum and wing are similar whereas the neural tube and embryonic ectoderm are not.
Comparison of epithelia from different tissues
While the wing pouch and the notum are closely related, we tested if our approach can separate distantly related epithelia in Drosophila, by comparing the third instar larval wing disc epithelia (dWL) with the eye epithelia (dEL1 to dEL5). Unlike the wing disc epithelium, the third instar eye disc contains a gradient of apical constriction induced by myosin II activation26, 27, leading to apparently different cell shapes and sizes across the epithelial sheet. Discriminant Analysis of the feature vectors clearly demonstrated that the eye and wing disc epithelia are significantly different and that most of the features, except the average clustering coefficient, contribute to this separation (Supplementary Fig. S8; DA; p=1.11×10−5).
Natural variation in epithelial organisation
The observation that the network features of the feature vectors were important in discriminating different types of epithelial tissues raised the possibility that local cell packing (resulting from cell shape and distribution) gives rise to the characteristic long-range patterns of an epithelium. In this view, the global organization is a property of the epithelium that emerges from the collective behaviour of the cells. Consistent with this idea, the individual-to-individual variation (measured as coefficient of variation (C.V.), see Table 1 and Methods) of mean values of the network features between the different individuals is several fold smaller than that of cell area (Table 1). In addition, the individual-to-individual variation (C.V.) of the mean network features was much smaller than the C.V. of the cell-to-cell variability (i.e., the standard deviation of the features in the feature vector) across individuals. These observations suggest that there is a reproducible long-range epithelial structure, which is, to a large extent, independent of variations in cell size.
Table 1.
Coefficient of Variation x100 |
average | standard deviation | ||||||
---|---|---|---|---|---|---|---|---|
A | D | C | N | A | D | C | N | |
![]() |
36.21 | 0.55 | 0.42 | 0.64 | 37.59 | 6.49 | 7.28 | 4.90 |
![]() |
40.25 | 0.47 | 0.86 | 0.96 | 34.75 | 4.19 | 6.86 | 7.01 |
![]() |
27.83 | 0.34 | 0.50 | 0.41 | 33.62 | 4.15 | 5.20 | 3.60 |
![]() |
30.81 | 1.94 | 2.84 | 2.33 | 51.69 | 11.66 | 15.75 | 10.78 |
![]() |
18.59 | 0.64 | 2.77 | 1.53 | 24.30 | 8.93 | 12.70 | 10.74 |
![]() |
32.57 | 3.11 | 8.99 | 6.20 | 31.42 | 16.41 | 14.69 | 17.88 |
Area (A) Degree of a node (D) Clustering coefficient (C) Average degree of neighbours (N)
Comparison of wild-type and genetically perturbed epithelia
What regulates this reproducible long-range organization of epithelia? Several factors have been implicated in controlling the behaviour of individual cells and consequently epithelial architecture6, 8, 9, 11, 15-18. However, how global epithelial structure is determined by, for example the effect of the cytoskeleton of the cells within the epithelium is not understood. Therefore, we applied GNEO to quantify objectively the effect in the wing disc of removing myosin II heavy chain using RNAi28, 29, a genetic manipulation that robustly and uniformly disrupts the cytoskeletal organization and epithelial architecture (Methods). Both PCA and DA were clearly able to separate wildtype (WT) discs from those in which myosin II had been reduced (Fig. 3a and Supplementary Fig. S9a). Interestingly, while all WT wings formed one distinct tight cluster, the mutant wings were more broadly spread over the graph (Fig. 2a; PCA; p=4.27×10−10). This is consistent with a visual inspection of the data, which showed that reducing the levels of myosin II by RNAi knockdown disrupted epithelial organisation to different extents in different wing discs. In order to provide an objective score for the severity of the mutant phenotype, we calculated the Euclidean distance between each mutant wing and the center of mass of the WT wings (Supplementary Table S3). The coefficient of variation of the distances was ~26%, which most likely reflects the variability of the RNAi efficiency among the individuals. Individual mutant samples were between 15 and 35 times further from the center of mass than the average of the distances of the dWP samples (Supplementary Table S3).
Figure 3. PCA graphs of the wildtype and mutant epithelia.
(a) dWP-dMWP. (b) dWP-dNP-dMWP. (c) dWP-dMWP-cEE. (d) dWP-dMWP-cEE-cNT. dWP: Wing prepupa, cNT: chicken Neural Tube, cEE: chicken Embryonic Ectoderm, dMWP: mutant Wing prepupa.
We then investigated how the inclusion of other epithelia during the comparison affected our analysis of myosin II reduction. Addition of the prepupal notum (dNP) showed that both wildtype epithelia cluster together whereas the mutant wing pouch epithelia were still scattered (Fig. 3b; PCA; p=1.57×10−13 and Supplementary Fig. S9b; DA; p=1.89×10−18). Thus GNEO can objectively recognise that the Drosophila WT samples are similar to each other but distinct from the mutant ones. A PCA that includes the chick embryonic ectoderm and the neural tube, either together or alone, revealed that each group forms distinct clusters (PCA: Fig. 3c; p=2.98×10−31; Fig. 3d; p=3.40×10−22 and DA: Supplementary Fig. S9c; p=2.03×10−44; Supplementary Fig. S9d; p=1.36×10−34). This suggests that the Drosophila mutant epithelia are clearly distinct from the WT Drosophila and chick epithelia. Discriminant Analysis indicated that the area, the degree, the degree of neighbours and the standard deviation of the degree are the most important features (Fig. 2a) for obtaining this separation (Supplementary Fig. S9a-d). This is consistent with an effect of myosin II on regulating cell shape26, 27, 30, 31, whose levels when altered affects cell area, the number of neighbours and the “regularity” of the pattern of cell contacts. The increased variability of the network features when myosin II is knocked down demonstrates an important biological conclusion of this work: long-range constancy of epithelial packing is regulated by cytoskeletal organisation within individual cells. This is also reflected by the higher variation between discs from different individuals in the myosin II knock-down compared to WT discs (Table 1). Together, these data suggest that the long range organization of an epithelium is determined, at least in part, by the cytoskeleton of the cells comprising the tissue affecting interactions across the field of cells.
Discussion
The network-based approach, GNEO, introduced here, captures epithelial organisation by accounting for patterns of cell-contacts which cannot be quantified either by visual inspection or by using geometric features that describe individual cells alone. In particular, GNEO can objectively quantify differences between epithelia from different tissues and organisms, even when the size and shape of the cells comprising these epithelial appear visually indistinguishable. First, we show that epithelia from different organs and species have distinct, reproducible and quantifiable differences in their structure. Second, a surprising result is that differences in cell area play a relatively minor role in distinguishing wing disc and neural tube epithelia – two epithelia that come from different species which produce very different tissues. This indicates that there is unexpected consistency in both the average size and range of sizes of cells across epithelia from different species. Third, we provide evidence that the non-geometric features of the epithelia are most informative in distinguishing them. This shows that the topological organization of the epithelium differs strongly between tissues and between species. Together with the finding that the structure of the same epithelia from different individuals is highly reproducible, our analysis indicates an unexpected level of genetic control over the long range organization of epithelia. To the best of our knowledge this has not been previously reported.
Lewis’s and Aboav-Weaire’s laws23 define relationships between area, degree and neighbour degree, and place the emphasis on the universal connection between these features. By contrast our work examines how measurements of these features across a population of cells can be used to build a quantitative and objective description of an epithelium. The experiment in which we disrupt myosin II provides an indication of the mechanism by which population level features of an epithelium emerge from the collective behaviour of individual cells. Moreover, we provide evidence that GNEO can operate as a reliable classifier to differentiate mutant and WT epithelia, and to quantify precisely the severity of mutant phenotypes.
While there are other ways of constructing networks from images of epithelia, this representation of cell-to-cell contacts offers a simple and readily applicable method to analyze epithelia objectively. Representing epithelia as feature vectors opens up the possibility of applying artificial intelligence (e.g., pattern recognition) algorithms to classify them in an objective manner and can be extended to include more sophisticated network features. Since the method is automatable, adaptations of this approach can be used in high-throughput experiments aimed at identifying pathways and quantifying the effects of mutations in functional genomics screens. Of particular value, GNEO allows characterisation of subtle phenotypes undetectable by visual inspection. This approach can also be adapted to other biological samples such as nerve-cell connections, muscle cells attachments, and tumours.
Methods
Genetic strains and confocal imaging of the epithelia
Flies were grown by employing standard culture techniques. The following lines were used: Arm-GFP (WT), C765-Gal4 (http://flybase.bio.indiana.edu) and U-zip-RNAi (Vienna Drosophila RNAi Center collection). Imaginal discs from the prepupa and third instar larvae were dissected in PBS and fixed with 4% paraformaldehyde in PBS for 35 min. The samples were washed six times for 10 min with PBT (PBS, 0.3% triton) and 3 times for 5 min with PBS. Imaginal discs were mounted using Fluoromount-G (Southern Biotech). Images were taken with a BioRad Radiance 2100 laser scanning confocal microscope. All the images were captured using 63× immersions objective with 3 times zoom and exported as a 1024 × 1024 pixel TIFF file. The area of 1 pixel is 3.78×10−3 μm2. The regions of the imaginal discs that appear in the images were selected with the following criteria: For the prepupal and larval wing disc, images from dorsal compartments (leaving out the D/V boundary region) of the wing pouch region were obtained. The notum images were taken from the anterior part of the disc (Supplementary Fig. S2). For the eye disc, the images were taken locating the morphogenetic furrow at the side with an additional margin of three to five rows of cells (to include the first and second rows of clusters of photoreceptors). For the chicken images, Hamburger and Hamilton (HH) 32 stage 10 and 17 chick embryos (Supplementary Fig. S3), were fixed for 1 hour in 4% paraformaldehyde. HH st. 17 embryos were subsequently transferred to methanol then rehydrated. Immunostaining was performed with a ZO1 antibdoy (Zymed labs) and embryos were flat mounted. Images of neural tube epithelium, at intermediate dorsoventral positions, were obtained at the level of somite 5 of HH st. 17 embryos. Embryonic ectoderm was imaged adjacent to the most recently formed somite of HH st. 10 embryos. Imaging conditions were as for imaginal discs, image orientation: anterior, left; dorsal, up.
Image processing and generation of the epithelial network
The acquired images were converted into their respective 2-pixel BMP files. First, the confocal images were imported using Adobe Photoshop CS2. Colours were inverted to obtain a dark signal over a light background. Epithelial cells were identified from the processed images by using a semi-automated framework. The images first had their illumination corrected through polynomial fitting followed by thresholding, and were then manually checked and edited to remove artefactual connections between cells. The cells and their boundaries were defined using a foreground tracking program4. Adjacent cells were identified by transforming their images into an epithelial network where the centroid of each cell is represented by a node and the neighborhood relations are mapped into weighted edges (Supplementary Fig. S1). The methodology involves the following steps: (a) all cells have their borders detected4, (b) for each pixel p of the border of each cell i, all border pixels belonging to other cells and falling within the circle of radius r centered at p are identified and counted (we use r=6) (c) the node corresponding to this cell is connected to other cells by edges whose weights, w, correspond to the total number of neighboring pixels found during step (b). We established empirically that if the weight is greater than 40% of the minimum equivalent radius of the areas of each adjacent pair of cells, those cells can be considered to be neighbours. The equivalent radius of a cell with area A is defined as corresponding to . We selected an area within every image in order to have all the networks presented with similar boundary conditions. The squares were drawn to obtain the maximum possible surface without including the centroids of the cells in the border of the images. The features of the cells falling within this area were calculated. The cells outside were only used in order to provide neighbors to the cells analyzed in the network.
Calculation of geometric and network features
The area was measured by counting the number of pixels inside each cell4. Each epithelial image was represented as a network where each node corresponds to one of the cells and the links between nodes reflect the spatial adjacency between the epithelial cells (Fig. 1a). Several measurements can be estimated from these networks24 in order to provide useful characterisation and respective biological interpretations. Let the graph be represented in terms of its adjacency matrix K, such that the presence of a connection between nodes i and j, with 1 ≤ i, j ≤ N , implies K(i,j) = K(j,i) = 1, with K(i,j) = K(j,i) = 0 being enforced otherwise. In this work, we employed the following topological characteristics: Degree of a node i, which corresponds to the number of edges attached to it, i.e., . Clustering coefficient of a node i, which is obtained by dividing the number of edges between the neighbours of i, represented as n(i), by the maximum possible number of connections between those nodes. This measurement can be calculated as . Therefore, the clustering coefficient varies from 0 (no interconnections between the neighbours of i) to 1 (the neighbours are fully interconnected). Average degree of the neighbours of a node, calculated as the average of the number of edges that are attached to the neighbours of node i. This measurement can be calculated as . For every feature, both the average and standard deviation values across all cells in an epithelium were estimated and were used to characterise epithelial organisation. Cell elongation was initially considered as an independent parameter but was found not to contribute further to the classification and separation. We believe that this is because variations in the elongation tend to affect the degree, clustering coefficient and average degree of neighbours, therefore becoming correlated with those measurements.
Feature vector and multivariate statistical analysis
The geometric and network analyses of epithelial images yield a large number of features or measurements (Supplementary Table S1). More precisely, a total of 8 features, corresponding to the mean and standard deviation of the area of cells, as well as the degree, clustering coefficient and average degree of a node are obtained. Thus, for every image of an epithelium, a feature vector of eight dimensions was obtained. We apply an unsupervised and supervised multivariate statistical method, namely Principal Component Analysis (PCA) and Discriminant Analysis (DA)4, 24, 25 (see Supplementary Methods for explanation). Standardisation4 of the measurements is performed in order to eliminate the effect of the magnitude of the measurements on the respective separation between categories. More specifically, the average and standard deviation for every feature across all individuals in each of the epithelial type was calculated. For every feature in a feature vector representing an individual epithelium, the calculated average value was subtracted and divided by the standard deviation. Thus, the components of the eigenvectors associated to the largest eigenvalues provide a quantification of the degree of contribution of each original measurement in maximising the dispersion of the projection (in the case of PCA) and optimising the separation between the categories (in the case of discriminant analysis). Since both these methods output a weighted linear combination of all the features in the feature vector to obtain the final projection (the new axes), the loading values associated to each feature provide a direct measure of the contribution of that particular feature towards the projection onto the smaller-dimensional space. For our analysis, the maximum between the magnitudes of the values of the components of each feature (i.e., measurement) of the first two eigenvector (i.e., the loadings associated with the first two component axes) was defined as the weight of that respective measurement.
Estimation of statistical significance
The MANOVA (multivariate analysis of variance) test, which is a reference statistical test for probing the hypotheses that two or more populations, characterized in terms of two or more dependent variables, are or not distinct, was used to assess the statistical significance of the obtained separation. The tested (null) hypothesis H0 is that the two samples come from the same population, with H1 indicating different populations. The p-values are calculated in the standard way33, 34, after PCA or DA. Both PCA and DA provide new random variables that are linear combinations of the previous ones. In the case of PCA, the new variables are completely uncorrelated. In the case of the DA, the method does take into account the known categories of the cells and therefore enhances the separation between the categories. In this case, our interest was focused on the contribution of the measurements on the separation, not on the separation itself. The p-values in this case reflect the effect of the informed cell categories and should be treated as such. The assumptions required for MANOVA were verified even though after application of PCA and DA, as the respective clusters remained largely normal and with similar variances.
Supplementary Material
Box 1. Qualitative definition of the geometric and network features and the statistical approaches used.
Acknowledgements
The authors acknowledge the MRC for funding and would like to thank A Baonza, A Wuster, A Pombo, C Chothia, E Levy, F Velazquez, G Chalancon, J Casanova, J Gsponer, J Modolell, P Cicuta, S Balaji, S Munro, S Teichmann and T Lecuit for providing helpful comments on this work. MMB acknowledges Darwin College, EMBO YIP and Schlumberger Ltd for support. LME is funded by the Marie Curie and the EMBO fellowships. LdFC is grateful to FAPESP (05/00587-5), CNPq (301303/06-1) for financial support. Part of this work was performed during a Visiting Scholarship to LDFC from St. Catharine’s College, University of Cambridge. JB is supported by the MRC (UK) and AK by a FEBS fellowship.
Footnotes
Author contributions MMB and LME designed the study with help from LdFC and MF. LME and AK obtained the fly and chicken images, respectively. LME processed the images. LdFC wrote the software, generated the epithelial network and performed all the comparisons and statistical analysis. All authors participated in the interpretation of results, discussions and the development of the project. LME and MB wrote the manuscript with input from all authors.
Supplementary Information accompanies this paper at http://www.nature.com/naturecommunications.
Competing financial interests: The authors declare no competing financial interests.
References
- 1.Barabasi AL, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet. 2004;5:101–3. doi: 10.1038/nrg1272. [DOI] [PubMed] [Google Scholar]
- 2.Albert R, Barabasi A-L. Statistical mechanics of complex networks. Reviews of Modern Physics. 2002;74:47–97. [Google Scholar]
- 3.Amaral LAN, Ottino JM. Augmenting the framework for the study of complex systems. The European Physical Journal B. 2004;38:147–162. [Google Scholar]
- 4.Costa LD, Cesar RMJ. Shape Analysis and Classification: Theory and Practice. CRC Press; 2009. 2000. [Google Scholar]
- 5.Strogatz SH. Exploring complex networks. Nature. 2001;410:268–76. doi: 10.1038/35065725. [DOI] [PubMed] [Google Scholar]
- 6.Lecuit T, Le Goff L. Orchestrating size and shape during morphogenesis. Nature. 2007;450:189–92. doi: 10.1038/nature06304. [DOI] [PubMed] [Google Scholar]
- 7.Axelrod JD. Cell shape in proliferating epithelia: a multifaceted problem. Cell. 2006;126:643–5. doi: 10.1016/j.cell.2006.07.018. [DOI] [PubMed] [Google Scholar]
- 8.Farhadifar R, Roper JC, Aigouy B, Eaton S, Julicher F. The influence of cell mechanics, cell-cell interactions, and proliferation on epithelial packing. Curr Biol. 2007;17:2095–104. doi: 10.1016/j.cub.2007.11.049. [DOI] [PubMed] [Google Scholar]
- 9.Gibson MC, Patel AB, Nagpal R, Perrimon N. The emergence of geometric order in proliferating metazoan epithelia. Nature. 2006;442:1038–41. doi: 10.1038/nature05014. [DOI] [PubMed] [Google Scholar]
- 10.Hayashi T, Carthew RW. Surface mechanics mediate pattern formation in the developing retina. Nature. 2004;431:647–52. doi: 10.1038/nature02952. [DOI] [PubMed] [Google Scholar]
- 11.Rolland-Lagan AG, Bangham JA, Coen E. Growth dynamics underlying petal shape and asymmetry. Nature. 2003;422:161–3. doi: 10.1038/nature01443. [DOI] [PubMed] [Google Scholar]
- 12.Gho M, Bellaiche Y, Schweisguth F. Revisiting the Drosophila microchaete lineage: a novel intrinsically asymmetric cell division generates a glial cell. Development. 1999;126:3573–84. doi: 10.1242/dev.126.16.3573. [DOI] [PubMed] [Google Scholar]
- 13.Classen AK, Anderson KI, Marois E, Eaton S. Hexagonal packing of Drosophila wing epithelial cells by the planar cell polarity pathway. Dev Cell. 2005;9:805–17. doi: 10.1016/j.devcel.2005.10.016. [DOI] [PubMed] [Google Scholar]
- 14.Martinez-Arias M, Stewart A. Molecular Principles of Animal Development. OUP Oxford: 2002. [Google Scholar]
- 15.Patel AB, Gibson WT, Gibson MC, Nagpal R. Modeling and inferring cleavage patterns in proliferating epithelia. PLoS Comput Biol. 2009;5:e1000412. doi: 10.1371/journal.pcbi.1000412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cavey M, Rauzi M, Lenne PF, Lecuit T. A two-tiered mechanism for stabilization and immobilization of E-cadherin. Nature. 2008;453:751–6. doi: 10.1038/nature06953. [DOI] [PubMed] [Google Scholar]
- 17.Lecuit T, Lenne PF. Cell surface mechanics and the control of cell shape, tissue patterns and morphogenesis. Nat Rev Mol Cell Biol. 2007;8:633–44. doi: 10.1038/nrm2222. [DOI] [PubMed] [Google Scholar]
- 18.Rauzi M, Verant P, Lecuit T, Lenne PF. Nature and anisotropy of cortical forces orienting Drosophila tissue morphogenesis. Nat Cell Biol. 2008;10:1401–10. doi: 10.1038/ncb1798. [DOI] [PubMed] [Google Scholar]
- 19.Martin AC, Gelbart M, Fernandez-Gonzalez R, Kaschube M, Wieschaus EF. Integration of contractile forces during tissue invagination. J Cell Biol. 2010;188:735–49. doi: 10.1083/jcb.200910099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Martin AC, Kaschube M, Wieschaus EF. Pulsed contractions of an actin-myosin network drive apical constriction. Nature. 2009;457:495–9. doi: 10.1038/nature07522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ma D, et al. Cell packing influences planar cell polarity signaling. Proc Natl Acad Sci U S A. 2008;105:18800–5. doi: 10.1073/pnas.0808868105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mombach JC, de Almeida RM, Iglesias JR. Two-cell correlations in biological tissues. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics. 1993;47:3712–3716. doi: 10.1103/physreve.47.3712. [DOI] [PubMed] [Google Scholar]
- 23.Chiu SN. Aboav-Weaire’s and Lewis’ laws—A review. Materials Characterization. 1995;34:149–165. [Google Scholar]
- 24.Costa LD, Rodrigues FA, Travieso G, Boas PRV. Characterization of complex networks: a survey of measurements. Advances in Physics. 2007;56:167–242. [Google Scholar]
- 25.Pincus Z, Theriot JA. Comparison of quantitative methods for cell-shape analysis. J Microsc. 2007;227:140–56. doi: 10.1111/j.1365-2818.2007.01799.x. [DOI] [PubMed] [Google Scholar]
- 26.Corrigall D, Walther RF, Rodriguez L, Fichelson P, Pichaud F. Hedgehog signaling is a principal inducer of Myosin-II-driven cell ingression in Drosophila epithelia. Dev Cell. 2007;13:730–42. doi: 10.1016/j.devcel.2007.09.015. [DOI] [PubMed] [Google Scholar]
- 27.Escudero LM, Bischoff M, Freeman M. Myosin II regulates complex cellular arrangement and epithelial architecture in Drosophila. Dev Cell. 2007;13:717–29. doi: 10.1016/j.devcel.2007.09.002. [DOI] [PubMed] [Google Scholar]
- 28.Brand AH, Perrimon N. Targeted gene expression as a means of altering cell fates and generating dominant phenotypes. Development. 1993;118:401–15. doi: 10.1242/dev.118.2.401. [DOI] [PubMed] [Google Scholar]
- 29.Sharp PA. RNA interference--2001. Genes Dev. 2001;15:485–90. doi: 10.1101/gad.880001. [DOI] [PubMed] [Google Scholar]
- 30.Bertet C, Sulak L, Lecuit T. Myosin-dependent junction remodelling controls planar cell intercalation and axis elongation. Nature. 2004;429:667–71. doi: 10.1038/nature02590. [DOI] [PubMed] [Google Scholar]
- 31.Pilot F, Lecuit T. Compartmentalized morphogenesis in epithelia: from cell to tissue shape. Dev Dyn. 2005;232:685–94. doi: 10.1002/dvdy.20334. [DOI] [PubMed] [Google Scholar]
- 32.Hamburger V, Hamilton HL. A series of normal stages in the development of the chick embryo. J. Morphol. 1951;88:49–92. [PubMed] [Google Scholar]
- 33.Izenman AJ. Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning. Springer; 2008. [Google Scholar]
- 34.Jolliffe IT. Principal Component Analysis. Springer; 2010. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.