eLife. 2016 May 26;5:e14334. doi: 10.7554/eLife.14334

TissueMiner: A multiscale analysis toolkit to quantify how cellular processes create tissue dynamics

Raphaël Etournay 1,2, Matthias Merkel 3,4, Marko Popović 3, Holger Brandl 1, Natalie A Dye 1, Benoît Aigouy 5, Guillaume Salbreux 3,6, Suzanne Eaton 1,*, Frank Jülicher 2,*
Editor: W James Nelson 7
PMCID: PMC4946903  PMID: 27228153

Abstract

Segmentation and tracking of cells in long-term time-lapse experiments has emerged as a powerful method to understand how tissue shape changes emerge from the complex choreography of constituent cells. However, methods to store and interrogate the large datasets produced by these experiments are not widely available. Furthermore, recently developed methods for relating tissue shape changes to cell dynamics have not yet been widely applied by biologists because of their technical complexity. We therefore developed a database format that stores cellular connectivity and geometry information of deforming epithelial tissues, and computational tools to interrogate it and perform multi-scale analysis of morphogenesis. We provide tutorials for this computational framework, called TissueMiner, and demonstrate its capabilities by comparing cell and tissue dynamics in vein and inter-vein subregions of the Drosophila pupal wing. These analyses reveal an unexpected role for convergent extension in shaping wing veins.

DOI: http://dx.doi.org/10.7554/eLife.14334.001

Research Organism: D. melanogaster

eLife digest

Cells interact, divide, rearrange and change shape to build an organ during development. Modern microscopy and computer technology can follow each individual cell of an entire organ in a living organism. However, to understand how the complex choreography of cells leads to well-shaped organs, researchers need tools to help them store and analyze the large amounts of data generated. Tools are also needed to visualize and quantify the complex cell behaviors in an easy and flexible manner.

During its development, a fruit fly’s wings become divided into distinct regions separated by tubular supports called veins. Early on in development, the vein cells are indistinguishable from their neighbors, but at late stages, vein cells become a different shape. Veins also become narrower, which is assumed to be due to the number of vein cells falling. However, the way in which cells behave to bring about these changes has not been studied in detail.

Etournay, Merkel, Popović, Brandl et al. have now developed a toolkit called TissueMiner that enables users to store large amounts of data about cells and analyze how cells collectively shape an organ. TissueMiner was then used to identify vein cells at late stages of wing development and follow them backward in time to reveal their position at early stages. This showed that veins become narrower and more elongated because the cells that make up the veins shrink more than cells in other regions.

TissueMiner was then used to show that vein cells specifically rearrange and elongate to produce thinner regions, while the number of cells increases slightly because the cells divide. These results suggest that the cell behaviors responsible for making veins elongate and narrow are likely to be different from what had previously been assumed.

TissueMiner can be used in future studies to help understand the molecular signals that influence how cells behave in veins during wing development. The toolkit could also now be used to explore the changes involved in the development of other organs in other organisms.

DOI: http://dx.doi.org/10.7554/eLife.14334.002

Introduction

Understanding how cells collectively shape a tissue is a long-standing question in developmental biology. We recently addressed this question by analyzing morphogenesis of the Drosophila pupal wing at cellular resolution (Etournay et al., 2015). To understand the cellular contributions to pupal wing shape changes, we quantified the spatial and temporal distribution of both cell state properties (e.g. cell area, shape and packing geometry), as well as dynamic cellular events like rearrangements, divisions, and extrusions. We quantitatively accounted for wing shape changes on the basis of these cellular events. By combining these analyses with mechanical and genetic perturbations, we were able to develop a multiscale physical model for wing morphogenesis and show how the interplay between epithelial stresses and cell dynamics reshapes the pupal wing.

Researchers interested in epithelial dynamics face similar challenges in processing and analyzing time-lapse movie data. Quantifying epithelial dynamics first requires image-processing steps, including cell segmentation and tracking, to digitize the time-lapse information. Recently, software tools for segmentation and tracking have become generally available (Aigouy et al., 2010; Mosaliganti et al., 2012; Sagner et al., 2012; Barbier et al., 2015; Cilla et al., 2015; Wiesmann et al., 2015; Heller et al., 2016; Aigouy et al., 2016). However, more advanced analysis is required to quantify, interpret and visualize the information derived from segmentation and tracking. Epithelial cells share a set of core behaviors, such as division, rearrangement, shape change and extrusion, which underlie a wide variety of morphogenetic events in different tissues. Methods for analyzing these core behaviors have been developed independently in several labs (Blanchard et al., 2009; Bosveld et al., 2012; Etournay et al., 2015; Guirao et al., 2015). However, these analysis tools have not yet been made available to other users in an easy-to-use and well-documented form.

Here, we propose a generic data layout and a comprehensive and well-documented computational framework called TissueMiner (see Box 1) for the analysis of epithelial dynamics in 2D. It enables biologists and physicists to quantify cell state properties and cell dynamics, their spatial patterns and their time evolution in a fast, easy and flexible way. It also facilitates the comparison of quantities within and between tissues. To make TissueMiner accessible to a novice, we provide tutorials that guide the user through its capabilities in detail and release a workflow that automatically performs most of the analysis and visualization tasks we reported previously for Drosophila pupal wings (Etournay et al., 2015). These tutorials operate using one small example dataset and 3 large wild-type datasets corresponding to the distal wing blade, which we also provide. The code for TissueMiner, along with tutorials and datasets, is publicly available (Box 1). We illustrate the utility and power of these tools by performing a more extensive analysis of pupal wing morphogenesis focused on differences in the behavior of vein and inter-vein cells.

Box 1. TissueMiner can be found on the web-based repository GitHub https://github.com/mpicbg-scicomp/tissue_miner#about along with its documentation and tutorials.

Several possibilities are offered to the user to run TissueMiner. For beginners, we highly recommend using Docker, which packages an application with its dependencies into a standardized unit for software development (https://www.docker.com/) (Nickoloff, 2015). Using the provided Docker image for TissueMiner, users can run it directly without any further setup. Additional instructions and examples are detailed in the supplementary information and on GitHub. We also provide one example biological dataset that can be used to run TissueMiner tutorials in R. In addition, we give access to 3 databases corresponding to wild-type pupal movies of the distal wing blade. These datasets are available at https://github.com/mpicbg-scicomp/tissue_miner#datasets along with the processed images. Tutorials can be found at https://github.com/mpicbg-scicomp/tissue_miner#documentation.

DOI: http://dx.doi.org/10.7554/eLife.14334.003

Wing veins are specified during larval stages, but only become morphologically distinct during prepupal and pupal morphogenesis. During pupal morphogenesis, the dorsal and ventral surfaces of the wing epithelium become apposed to each other on their basal sides, except in the regions that will give rise to veins - here the basal surfaces of dorsal and ventral cells form a lumen. Vein and inter-vein cells also differ on their apical surfaces. Vein cells have a narrower apical cross-section and form corrugations that protrude from the dorsal and ventral surfaces of the wing blade. The cell dynamics underlying vein morphogenesis have never been quantitatively examined.

Results

We analyze epithelial morphogenesis within TissueMiner in three steps (Figure 1—figure supplement 1). First, all epithelial cells of the tissue are digitized (segmented) and automatically tracked over time using the interactive TissueAnalyzer software (Aigouy et al., 2010, 2016; Sagner et al., 2012), which is included in the TissueMiner framework. This software generates segmented images, referred to as segmentation masks, that contain information about cell geometry, cell neighbor topology and cell ancestry, which are essential for the study of morphogenesis (Aigouy et al., 2010; Sagner et al., 2012; Etournay et al., 2015). Second, we use a TissueMiner automated workflow to extract this information from the images and store it in a relational database. This workflow also automatically performs most of the visualization steps we describe in this paper (Materials and methods, and Appendix 1). Third, we use TissueMiner’s library of tools for R and Python to query the database, both to visualize the data and to quantitatively compare cell properties and behaviors between different movies and subregions of the tissue.

Time-lapse datasets are rich with information, and one important set of tools that TissueMiner provides is the ability to visualize this information on the tissue. This type of visualization can reveal interesting spatial and temporal patterns of core cell behaviors and can guide subsequent analyses. It is, however, insufficient for quantitatively comparing regions within the same tissue or for comparing how the tissue behaves across replicates or conditions. Therefore, we developed tools to enable the user to define regions of interest, synchronize movies in time, and align all tissues to a common orientation. We then provide tools to easily plot average quantities in different regions or across movies. For each type of measurement, we refer to the tutorials regarding the specific visualization tools we have built (Box 1).

Preparing the dataset (TM R-User Manual sections 1.1 to 1.5)

Before conducting any analysis, the TissueMiner automated workflow reads three configuration files that contain (1) user-defined regions of interest (ROI’s), (2) time offsets for movie synchronization, and (3) the rotation angle used to align the tissue to a standard orientation (Figure 1—figure supplement 1). 

Defining regions of interest (howto Video 1)

As cellular behaviors may be spatially patterned, one should have the ability to quantify and compare cell dynamics within different ROI’s. TissueMiner provides a Fiji macro (draw_n_get_ROIcoord.ijm) to manually define a set of ROI’s directly on one given image of the movie. This program manages several lists of ROI’s, which the user can create, modify and delete. These lists help maintain the consistency of ROI labels, which is essential for subsequent analysis (Video 1).

In addition, defining ROI’s of different shapes and following them backwards and forwards in time (Figure 1A–E’) is a useful method to visualize tissue deformations (Figure 1D–D’, Video 2). These ROI’s can be defined at any frame within the movie. Thus, it is even possible to specify a region based on morphological features that only arise late in the morphogenetic process under study, which is true of wing veins for example (see Figure 1E–E’). ROI definition allows the user to define morphologically relevant regions of interest and compare the behavior of cells in the different regions.

Video 1. HOWTO: drawing ROI’s.
DOI: http://dx.doi.org/10.7554/eLife.14334.006

Video 2. Visualizing tissue deformation by using vertical stripes.
DOI: http://dx.doi.org/10.7554/eLife.14334.007

Figure 1. Regions of interest are followed in time by browsing the cell lineages.

(A) Largest population of cells (purple) that remains visible throughout the entire time-lapse. Two cell rows in contact with margin cells were discarded, as margin cells are usually not well segmented. (B) Largest blade cell population (green) that remains visible throughout the entire time-lapse. The blade region of interest (yellow line) was defined on the last frame of the time-lapse using a custom Fiji macro (https://github.com/mpicbg-scicomp/tissue_miner/blob/master/fiji_macros/). The underlying cell population was then subset using our lineage browser algorithm. (C) One can define vein and inter-vein regions of interest and apply the same algorithm as in (B). (D–D') Regularly spaced regions of interest automatically selected and followed over time to visualize tissue deformation. (E–E') Here, we make use of the lineage browser routine to trace the vein positions back to 15 hAPF, when the veins are not yet visible. Scale bar 50 microns.

DOI: http://dx.doi.org/10.7554/eLife.14334.004

Figure 1—figure supplement 1. Flow chart of TissueMiner.

Solid lines depict the three main steps to analyze epithelial morphogenesis within TissueMiner. Dashed lines indicate additional inputs to the automated workflow: red boxes represent required inputs and black boxes indicate optional inputs. Arabic numbers indicate the order in which the tools are described in the main text. Cumulative time of the movie must be listed in a text file called cumultimesec.txt and located along with the movie images. The snakemake automated workflow is described in Figure 7.

By default, TissueMiner generates two regions of interest – raw and whole_tissue – in order to select cell populations by name. The raw ROI corresponds to all segmented and tracked cells. However, cells located at the tissue margin may move in and out of the field of view of the microscope lens. TissueMiner identifies the population of cells (whole_tissue) whose entire lineage lies within the field of view throughout the movie. To identify this population, we developed a filtering tool that discards, in each movie frame, the margin cells located at the edge of the segmentation mask and one additional row of cells that contact the margin cells. The choice of discarding two rows of cells is motivated by the fact that segmentation quality drops near the margin. We iterate over all time points to ensure that we discard all cells moving in and out of the field of view (see Materials and methods). User-defined ROI’s are also subjected to this filtering.
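
The filtering idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not TissueMiner's implementation: the data layout (a dictionary of frames holding neighbor pairs, a `MARGIN` pseudo-cell id standing for the edge of the segmentation mask, and a `lineage_of` map from each cell to the root of its lineage) and all function names are hypothetical.

```python
MARGIN = 0  # hypothetical id standing for the edge of the segmentation mask


def cells_to_discard(neighbor_pairs):
    """Return margin cells plus the row of cells touching them, for one frame."""
    neighbors = {}
    for a, b in neighbor_pairs:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    # First row: cells in contact with the edge of the mask.
    first_row = neighbors.get(MARGIN, set()) - {MARGIN}
    # Second row: every cell in contact with a first-row cell.
    second_row = set()
    for cell in first_row:
        second_row |= neighbors[cell]
    return (first_row | second_row) - {MARGIN}


def whole_tissue(frames, lineage_of):
    """Keep cells whose lineage never enters the two outer rows in any frame.

    frames:     {frame_number: [(cell_a, cell_b), ...]} neighbor pairs
    lineage_of: {cell_id: id of the lineage root} (assumed layout)
    """
    bad_lineages = set()
    all_cells = set()
    for pairs in frames.values():
        for a, b in pairs:
            all_cells.update((a, b))
        for cell in cells_to_discard(pairs):
            bad_lineages.add(lineage_of[cell])
    all_cells.discard(MARGIN)
    return {c for c in all_cells if lineage_of[c] not in bad_lineages}
```

Discarding whole lineages rather than single cells means that a cell is removed in every frame as soon as it, an ancestor or a descendant touches the outer rows at any time point.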

Aligning movies in time

To temporally align movies, TissueMiner provides a configuration file in which to manually define a time correction for each movie relative to one reference movie whose time correction is set to zero. The time correction can be estimated based on the appearance of morphological landmarks, or by aligning curves of a defined state quantity in time, such as cell area or cell elongation, on the assumption that this quantity has a similar qualitative time evolution.
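
As an illustration of the curve-alignment approach, the sketch below searches for the integer frame shift that best superimposes a state quantity (for example average cell elongation) from one movie onto the reference movie. It is a hypothetical helper, not part of TissueMiner: it assumes both curves are sampled at the same regular frame interval, and the names `best_time_shift`, `max_shift` and `min_overlap` are introduced here for illustration only.

```python
def best_time_shift(reference, curve, max_shift, min_overlap=3):
    """Return the shift s (in frames) minimizing the mean squared difference
    between curve[i] and reference[i + s] over their overlapping frames."""
    best = None
    for s in range(-max_shift, max_shift + 1):
        pairs = [
            (reference[i + s], curve[i])
            for i in range(len(curve))
            if 0 <= i + s < len(reference)
        ]
        if len(pairs) < min_overlap:
            continue  # too few common frames to compare meaningfully
        cost = sum((r - c) ** 2 for r, c in pairs) / len(pairs)
        if best is None or cost < best[0]:
            best = (cost, s)
    if best is None:
        raise ValueError("curves never overlap by at least min_overlap frames")
    return best[1]
```

Multiplying the returned shift by the frame interval gives a candidate time correction, which should still be checked by eye against morphological landmarks before it is written to the configuration file.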

Aligning movie orientation (howto Video 3)

In order to compare replicates of the same dynamic biological process, all movies should have a common orientation. TissueMiner contains a Fiji macro (orient_tissue.ijm) to assist the user in finding the optimal angle through which each movie should be rotated so that all movies have a comparable orientation (see Video 3 for an example on the pupal wing). 

Video 3. HOWTO: Orienting a tissue.
DOI: http://dx.doi.org/10.7554/eLife.14334.008

Visualizing cell area, cell shape and cell packing on the entire tissue (TM R-User Manual sections 2.2 and 2.6, Py-tutorial sections 2.1 to 2.3)

An important step in analyzing tissue morphogenesis is to quantify cell state properties over time. These properties include cell area, shape anisotropy and packing geometry. In this section, we demonstrate the analysis and visualization tools of TissueMiner by comparing how these state properties evolve during wing morphogenesis in vein and inter-vein regions.

Cell area and elongation (TM R-User Manual sections 2.2 – 2.5, Py-tutorial sections 2.1 – 2.2)

Morphogenesis is often characterized by changes in cell area and elongation. In the TissueMiner workflow, these properties are calculated from the original segmentation masks and stored in the database (Materials and methods). To visualize the evolution of the cell area pattern at the scale of the whole tissue, we map the area values of each individual cell to a gradient color scale (see Figure 2A–A’, Video 4). Each cell contour is filled with a color that corresponds to its area. Figure 2A’ shows the pattern of cell areas in the wing at the end of pupal wing blade elongation. This visualization scheme reveals that cells in the proximal hinge and in wing veins have a smaller cross-sectional area (blue) at this time.

Video 4. Color-coded cell area pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.010

Figure 2. Patterned cell state properties in the developing pupal wing of Drosophila.

(A–D') Cell state patterns at 22 hr and 31 hr after puparium formation (hAPF). (A–A') Color-coded cell area. (B–B') Color-coded cell elongation. The magnitude of cell elongation corresponds to the norm of the cell elongation nematic tensor. (C–C'') Coarse-grained pattern of cell elongation nematics and (C'') cell elongation nematics represented as bars on each individual cell. The wing was divided into adjacent square-grid elements of 33x33 microns in which cell elongation nematics were averaged. (D–D') Color-coded representation of the cell neighbor number. (E) Time evolution of the average cell area in different regions of interest: wing blade (Figure 1B), veins (Figure 1E), and inter-vein regions. (F) Time evolution of the average cell elongation magnitude in the blade, veins and inter-vein regions. Scale bar: 50 microns.

DOI: http://dx.doi.org/10.7554/eLife.14334.009

Cell elongation is characterized by a nematic tensor describing the axis and magnitude of the elongation (Aigouy et al., 2010). As with cell area, we map the magnitude of cell elongation to a color scale (Figure 2B–B’, Video 5). This fine-grained quantification of cell elongation highlights striking differences between inter-vein and vein cells. Inter-vein cells are more elongated than vein cells at 22 hr after puparium formation (hAPF), but this pattern is reversed by 31 hAPF. 

Video 5. Color-coded cell elongation norm pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.011

The color scale above reveals only the magnitude of the tensor. To visualize both the magnitude and direction of cell elongation, we represent the elongation nematic as a line whose length and angle correspond to the magnitude and angle of cell elongation, respectively. Nematics can also be averaged across multiple cells in a region in order to coarse-grain the patterns and highlight the main features (Figure 2C–C’’, Video 6). For example, the coarse-grained elongation nematics shown in Figure 2C highlight the global alignment of cell elongation in the proximal-distal direction at 22 hAPF.
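
Averaging nematics requires some care because a nematic is an axis, not an arrow: angles that differ by 180° describe the same elongation. A common way to handle this, sketched below, is to average the two independent components of the traceless symmetric tensor, which depend on twice the angle. The function names are illustrative and are not taken from the TissueMiner API.

```python
import math


def nematic_components(magnitude, angle):
    """Components (Q1, Q2) of a 2D nematic with the given norm and axis angle."""
    return magnitude * math.cos(2 * angle), magnitude * math.sin(2 * angle)


def average_nematic(nematics):
    """Average a list of (magnitude, angle) pairs, with angles in radians.

    Returns the (magnitude, angle) of the averaged nematic.
    """
    n = len(nematics)
    components = [nematic_components(m, a) for m, a in nematics]
    q1 = sum(c[0] for c in components) / n
    q2 = sum(c[1] for c in components) / n
    # The factor 0.5 undoes the angle doubling used in the components.
    return math.hypot(q1, q2), 0.5 * math.atan2(q2, q1)
```

With this representation, two equally elongated cells with perpendicular axes average to a nematic of zero magnitude, whereas two cells whose angles differ by 180° average to the same nematic as either cell alone.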

Video 6. Coarse-grained cell elongation pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.012

Packing geometry (TM R-User Manual section 2.6, Py-tutorial section 2.3)

Cells in the wing become progressively more hexagonal during pupal wing morphogenesis (Classen et al., 2005). To visualize packing geometry, we map the neighbor number of each cell to a discrete color code (Figure 2D–D’, Video 7). This makes changes in packing geometry during morphogenesis immediately obvious (22 and 31 hAPF).

Video 7. Color-coded cell packing pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.013

Plotting temporal evolution of average cell properties (TM R-User manual sections 3.3 to 3.6, Py-tutorial section 3)

The visualization tools described above effectively reveal detailed spatial patterns of cell properties. To highlight how average cell properties change over time, and to facilitate comparison between movies and ROI’s, TissueMiner also provides tools to create plots of average quantities as a function of time. In Figure 2E and Figure 2F, we compare the time evolution of the average cell area and the average cell elongation, for vein and inter-vein cells, in movies of the 3 WT wings (blue, green, red) used in Etournay et al. (2015). We previously showed that average cell area in the wing blade decreases during morphogenesis, but that cell area decrease is balanced by cell divisions to maintain wing blade area. Quantifying average area values in vein and inter-vein ROI’s reveals that vein cells contract over a longer period of time than inter-vein cells, and thus have a smaller cross-sectional area at the end of morphogenesis (Figure 2E). As previously described, cells in the wing blade elongate and then relax their shapes during pupal wing morphogenesis (Etournay et al., 2015) (Figure 2F, blade part). Plotting elongation in vein and inter-vein ROI’s reveals that vein cells elongate more slowly and also relax their elongation more slowly than inter-vein cells. These differences suggest that vein and inter-vein cells have different mechanical properties.

Visualizing patterns of cell divisions (TM R-User Manual sections 2.7 – 2.9, Py-tutorial section 2.4)

Oriented tissue morphogenesis may reflect the number, orientation and spatio-temporal pattern of cell divisions. TissueMiner provides several tools to visualize these events. Overlaying color-coded generation number on a pupal wing movie reveals patterns of cell divisions as they occur (Video 8), and examining the last frame of the movie (Figure 3A) reveals the cumulative pattern of cell divisions. This analysis is largely consistent with the cell division timing inferred from classical BrdU pulse-chase experiments (Schubiger and Palka, 1987; Garcia-Bellido et al., 1994; Milan et al., 1996), but also reveals unexpected additional features. The pattern of cell divisions correlates with veins: most cells in the wing blade divide only once during pupal morphogenesis, whereas in some parts of inter-vein regions they divide twice. These include the cells lying adjacent to veins L3, L4 and L5, and the region posterior to L5. We estimate the median cell-cycle length between the first and second rounds of cell divisions to be (5.25 ± 1.50) hr.

Video 8. Color-coded cell generation pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.014

Figure 3. Visualization of cell generations and cell divisions.

(A) Color-coded pattern of cell generations. The wing cartoon on the bottom right shows the names of subregions that we analyze in panel B. Scale bar 50 microns. (B) Cell division rate in different regions of interest. To smooth fluctuations, these rates were averaged in discrete time intervals of one hour (TM R-User Manual, section 3.7). We further averaged these rates amongst the three wild-type wings. Error bars depict the standard deviation between wings. Cells divide earlier in veins L2 and L4 than in L3 and L5. Two maxima corresponding to two rounds of divisions are visible in inter-vein regions: interL2-L3, distInterL3-L4 and postL5. (C–C') A dividing cell with its unit nematic depicting the division orientation. Scale bar 10 microns. (D) Coarse-grained pattern of cell division orientation (grid size of 33x33 microns). Scale bar 50 microns.

DOI: http://dx.doi.org/10.7554/eLife.14334.015

To further investigate how cell divisions are patterned in the blade, we quantified the time evolution of cell division rates in each vein and inter-vein region (Figure 3B). This analysis reveals differences in the timing and numbers of cell divisions in these different ROI’s. Cells in veins L2 and L4 divide before those in L3 and L5. These divisions are followed by a second peak of division in the inter-vein regions distInterL3-L4, interL2-L3 and postL5 (see cartoon in Figure 3A).

To more easily visualize the spatio-temporal pattern of divisions in veins only, we use TissueMiner to assign vein cells a color corresponding to the time at which they divide: blue for 16–18 hAPF and red for 18–20 hAPF (see Video 9). This analysis reveals more detailed patterning in division timing. Cell divisions in vein regions that protrude ventrally (L2 and proximal L4) peak at the same time, and earlier than those in vein regions that protrude dorsally (L3, distal L4 and L5). Precise correlation of cell divisions with specific vein and inter-vein regions suggests that they are autonomously controlled by signaling associated with veins.

Video 9. Color-coded cell division pattern in veins and by time intervals.

DOI: http://dx.doi.org/10.7554/eLife.14334.016

To measure the orientation of cell divisions, we define a unit nematic tensor (see Materials and methods). For each cell division, the orientation of this unit nematic is defined by the line connecting the centers of mass of the two daughter cells when they first appear (see Figure 3C–C', and TM R-User Manual section 2.8). Each nematic is assigned a position on the tissue that corresponds to the center of combined mass of the two daughter cells. To visualize division orientation patterns, unit nematics can be added within different regions and averaged over different time intervals (Figure 3D, Video 10, TM R-User Manual section 2.9).
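
The construction of a division nematic can be sketched as follows. The function name and its arguments are hypothetical; the sketch assumes that the daughter centers and areas have already been read from the database, and it interprets the center of combined mass as the area-weighted mean of the two daughter centers.

```python
import math


def division_nematic(center1, area1, center2, area2):
    """Unit nematic and position for one cell division.

    center1, center2: (x, y) centers of mass of the two daughter cells
    area1, area2:     their areas, used to weight the combined center
    Returns ((Q1, Q2), (x, y)).
    """
    (x1, y1), (x2, y2) = center1, center2
    angle = math.atan2(y2 - y1, x2 - x1)
    # Angle doubling makes the result independent of daughter order.
    nematic = (math.cos(2 * angle), math.sin(2 * angle))
    total = area1 + area2
    position = (
        (area1 * x1 + area2 * x2) / total,
        (area1 * y1 + area2 * y2) / total,
    )
    return nematic, position
```

Because only the doubled angle enters, swapping the two daughters leaves the nematic unchanged, and summing the components over the divisions in a grid element gives the coarse-grained division orientation for that element.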

Video 10. Coarse-grained cell division pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.017

Visualizing cell junction dynamics (TM R-User Manual sections 2.10 – 2.12, 3.8–3.9)

Epithelial tissues can be reshaped by cell rearrangements, or T1 transitions (for review, see Walck-Shannon and Hardin, 2014). In the simplest case, a T1 transition involves two pairs of cells that exchange neighbors by disassembling one cell-cell contact and replacing it by another – bringing together two previously separated cells (Figure 4A). In reality, cell contacts may undergo multiple rounds of shrinkage and regrowth before resolving (Figure 4A'). Furthermore, some epithelia undergo the related process of rosette formation, where multiple cell junctions are disassembled before new neighbors are brought into contact. By separately quantifying the orientation with which cell contacts are gained and lost, one can reveal whether there is a net directionality to cell junction assembly and disassembly. To identify gained and lost cell contacts, we compare cell neighbor relationships between two consecutive frames. We exclude changes in neighbor relationships resulting from cell division, extrusion or a cell moving in and out of the field of view. The remaining neighbor relationship changes are used to define cell contacts that have appeared or disappeared.
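
The frame-to-frame comparison amounts to a set difference on neighbor pairs. The sketch below is a simplified illustration rather than the TissueMiner routine: `contact_changes` is a hypothetical name, and it assumes that the caller supplies `excluded_cells`, the ids of cells that divide, are born, extrude or cross the field of view between the two frames, so that any contact involving them is ignored.

```python
def contact_changes(pairs_before, pairs_after, excluded_cells=frozenset()):
    """Return (gained, lost) cell contacts between two consecutive frames.

    pairs_before, pairs_after: iterables of (cell_a, cell_b) neighbor pairs
    excluded_cells: cells whose neighbor changes should not count as T1 events
    Contacts are returned as frozensets so that (a, b) and (b, a) are equal.
    """
    excluded = frozenset(excluded_cells)
    before = {frozenset(p) for p in pairs_before}
    after = {frozenset(p) for p in pairs_after}

    def is_valid(contact):
        return not (contact & excluded)

    lost = {c for c in before - after if is_valid(c)}
    gained = {c for c in after - before if is_valid(c)}
    return gained, lost
```

For a simple T1 transition among four cells, the result contains exactly one lost contact and one gained contact; a contact that merely persists appears in neither set.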

Figure 4. Visualization and quantification of T1 transitions.

(A–A') Cartoon depicting an effective T1 transition (A) that corresponds to cell-contact loss and gain in different directions. Each contact loss or gain is assigned a unit nematic describing its orientation. (B–B') Pattern of cells losing contact (green), gaining contact (red) or both (blue). (C) Rate of neighbor change per cell and per hour in the blade, vein and inter-vein regions of interest. Rates were averaged within discrete time intervals of one hour and further averaged among the 3 WT wings (TM R-User Manual, section 3.8). Error bars depict the standard deviation amongst wings. (D) Coarse-grained pattern of neighbor exchange orientation at 17 hAPF. Cell neighbor change nematics were obtained by summing up unit nematics in each grid element of 33x33 microns and further averaged in time using a 50 min time window. Scale bar 50 microns.

DOI: http://dx.doi.org/10.7554/eLife.14334.018

Figure 4—figure supplement 1. T1 and cell elongation nematic orientation.

(A) Cell neighbor change nematics were averaged at each frame within each region of interest and are represented as bars in a circular diagram. The bar angle indicates the average T1 orientation, and its length (nematic norm) reflects how ordered cell neighbor change nematics are in a given region of interest. Their color depicts the developmental time in hours after puparium formation. (B) Cell elongation nematics were also averaged at each frame within each region of interest. The average T1 nematic orientation starts to match the average cell elongation nematic orientation from about 22 hAPF (peak of cell stretch) on, when stress-induced PD-oriented T1 transitions dominate over autonomous AP-oriented T1 transitions.

We characterize the orientation of contact gains and losses by assigning them a unit nematic tensor. For contact loss, the orientation of the nematic is defined by the axis intersecting the two cell centers. For contact gain, the orientation of the nematic is perpendicular to the axis intersecting the two cell centers (Figure 4A–A’). If there is a simple disappearance and reappearance of a single cell contact, corresponding nematics will cancel out. Therefore, the sum of contact gain and contact loss nematics over time and/or space will represent an effective T1 nematic describing net direction of contact assembly/disassembly.
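
The cancellation argument can be checked numerically. The sketch below uses the same doubled-angle representation of unit nematics as is standard for 2D traceless symmetric tensors; the function names and the event layout are illustrative assumptions, not the TissueMiner API.

```python
import math


def contact_nematic(center_a, center_b, gained):
    """Unit nematic (Q1, Q2) for one contact gain or loss.

    A lost contact is oriented along the axis joining the two cell centers;
    a gained contact is oriented perpendicular to that axis.
    """
    (ax, ay), (bx, by) = center_a, center_b
    angle = math.atan2(by - ay, bx - ax)
    if gained:
        angle += math.pi / 2
    return math.cos(2 * angle), math.sin(2 * angle)


def effective_t1(events):
    """Sum the nematics of (center_a, center_b, gained) events."""
    q1 = q2 = 0.0
    for center_a, center_b, gained in events:
        n1, n2 = contact_nematic(center_a, center_b, gained)
        q1 += n1
        q2 += n2
    return q1, q2
```

A contact between the same two cells that is lost and then regained sums to zero, whereas a complete T1 transition, in which two cells separate along one axis and two other cells meet along the perpendicular axis, gives two nematics that add up instead of cancelling.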

The rate of contact gain and loss can be visualized in different ways. Cell contact dynamics can be viewed directly on movies of tissue morphogenesis by assigning colors to cells as they gain (red) or lose (green) contacts. Those cells that simultaneously gain and lose different cell contacts are colored blue (Figure 4B–B’).

The frequency of contact gain and loss, independent of orientation, can be plotted over time. Figure 4C compares the frequency of contact assembly/disassembly in vein and inter-vein regions. In both regions, this rate begins to decrease in the second half of morphogenesis.

To visualize the pattern of orientation of T1 transitions throughout the wing, we sum contact gain and loss nematics over square grid elements, and average over a chosen time window (about 50 min in Figure 4D, Video 11, see TM R-User Manual section 2.12). 

Video 11. Coarse-grained cell rearrangement pattern.

DOI: http://dx.doi.org/10.7554/eLife.14334.020

Finally, the average orientation of effective T1 nematics in sub-regions over time can be visualized using circular diagrams, where nematics are color-coded to indicate developmental time. Figure 4—figure supplement 1A reveals that the orientation of effective T1’s is along the anterior-posterior (AP) axis early (blue) and shifts to the proximal-distal (PD) axis in the second half of morphogenesis (red). A similar approach can be used to illustrate average cell elongation nematics over time (Figure 4—figure supplement 1B).

Quantification of tissue deformation and the contribution of different cellular events (TM R-user manual section 3.10)

While it is useful to quantify the number and orientation of cellular events like elongation, rearrangement, extrusion and division, this by itself does not provide quantitative information about the amount of tissue shape change contributed by each type of event. We therefore devised a method to measure deformation caused by these cellular processes such that they sum to the measured tissue deformation.

Tissue deformation can be decomposed into isotropic and anisotropic parts that distinguish changes in area (compression/expansion) from changes in aspect ratio (pure shear, for details see also Materials and methods). The quantities describing area changes are scalar, whereas the quantities describing shear rate in a 2D-network are nematic tensors harboring two distinct components that describe the orientation and magnitude of the shear.
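
As an illustration of this decomposition, the sketch below splits a 2D velocity gradient into an isotropic part (a scalar), a pure shear part (a nematic with two components) and a rotation. It is a generic textbook decomposition under an assumed convention (`vxy` denotes the derivative of the x velocity along y) and does not reproduce TissueMiner's own routines; the function name is hypothetical.

```python
def decompose_velocity_gradient(vxx, vxy, vyx, vyy):
    """Split a 2D velocity gradient into isotropic, pure shear and rotation parts.

    Assumed convention: vxy is d(v_x)/dy and vyx is d(v_y)/dx.
    """
    isotropic = vxx + vyy            # relative rate of area change (trace)
    shear_xx = 0.5 * (vxx - vyy)     # pure shear along the x axis
    shear_xy = 0.5 * (vxy + vyx)     # pure shear along the diagonal
    rotation = 0.5 * (vyx - vxy)     # counterclockwise rotation rate
    return isotropic, (shear_xx, shear_xy), rotation

# Stretching along x, weaker contraction along y, and some diagonal shear.
isotropic, shear, rotation = decompose_velocity_gradient(0.03, 0.01, 0.01, -0.01)
print(isotropic, shear, rotation)
```

The two shear components encode both the magnitude and the orientation of the anisotropic deformation, which is why a single scalar is not sufficient.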

Tissue area changes can be calculated based on cell area change and the number of cells gained and lost by divisions and extrusions – information that is all available in the TissueMiner database (Etournay et al., 2015).
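
The bookkeeping behind this statement can be illustrated with a small example. Since tissue area equals cell number times mean cell area, the logarithmic change in tissue area is the sum of the logarithmic changes in mean cell area and in cell number. The sketch below makes the additional assumption, purely for illustration, that divisions are counted before extrusions within a frame interval; the function and argument names are hypothetical, and the exact definitions used by TissueMiner are given in Etournay et al. (2015).

```python
import math

def area_change_contributions(n0, mean_area0, n_div, n_ext, mean_area1):
    """Logarithmic tissue area change between two frames and its contributions.

    Assumption for illustration: divisions are applied before extrusions.
    """
    n_after_div = n0 + n_div
    n1 = n_after_div - n_ext
    return {
        "total": math.log((n1 * mean_area1) / (n0 * mean_area0)),
        "cell_area": math.log(mean_area1 / mean_area0),
        "division": math.log(n_after_div / n0),
        "extrusion": math.log(n1 / n_after_div),
    }

c = area_change_contributions(n0=1000, mean_area0=20.0,
                              n_div=30, n_ext=10, mean_area1=19.5)
print(c)
```

In this example the cell number increases while the mean cell area decreases, and the three contributions add up to the total change.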

To quantify the cellular contributions to anisotropic tissue deformation, TissueMiner uses the so-called Triangle Method, which is based on a triangular tiling of the junctional network (Etournay et al., 2015; Merkel et al., 2016). Triangle elongation is a proxy for cell elongation, and topological changes in the network result in redrawing of triangles (Figure 5A–C). The resulting change in average triangle elongation can be used to calculate the shear due to the topological changes (Etournay et al., 2015). In addition to contributions from divisions, cell rearrangements, extrusions and cell shape changes, the method also takes into account deformation caused by correlations between elongation and both area change and rotation.

Figure 5. Visualization and quantification of anisotropic cell and tissue deformation.

(A) Triangulation of the cell network: each triangle vertex corresponds to a cell center. (B–B') Cartoons depicting triangle pure shear and total tissue shear along the x axis. (C) Cartoons depicting shear due to T1 transition, cell division and extrusion. (D) Pattern of local tissue shear rate obtained from the triangulation method. Scale bar 50 microns. (E) shows the average rate of tissue shear (blue) in the blade, interveins and veins, and the corresponding cellular shear contributions (other colors). Shaded regions indicate the standard deviation amongst wings. (F) shows the accumulated tissue shear over time and the accumulated contributions of each type of cellular event. The tissue shear (blue) in veins is oriented along the PD axis and is higher than in inter-vein regions during most of pupal morphogenesis. It leads to an extension along the PD axis and to a narrowing along the anterior-posterior (AP) direction. By the end of the movie, accumulated tissue shear (blue) is almost twice as high in veins as in inter-vein regions. Shaded regions represent the standard deviation amongst wings.

DOI: http://dx.doi.org/10.7554/eLife.14334.021

Figure 5—figure supplement 1. Measurements of cell and tissue deformation from two computer-generated sheets of hexagonal cells.

(A–D) One dataset corresponds to hexagonal cells undergoing a constant isotropic expansion rate of 3.50 × 10⁻² per frame, and the other corresponds to hexagonal cells undergoing a constant pure shear rate of 1.75 × 10⁻² per frame. These datasets are termed iso.exp movie and shear movie respectively in graphs. There are no topological changes. To keep consistent sets of cells in time, we filtered out cells that come into contact with the image border. We then performed our measurement on these tracked regions of about 50 cells in the shear movie and about 100 cells in the iso.exp movie. (A) Relative tissue area changes (blue) and their decomposition into cell area changes (green), cell number increase by divisions (orange) and cell number decrease by extrusions (cyan). Their corresponding cumulative sums are shown in (B). (C) shows the average tissue shear (blue) and its decomposition into cellular shear contributions (other colors). Their corresponding cumulative sums are shown in (D).
Figure 5—figure supplement 2. Tissue isotropic deformation and cellular contributions in different regions.

(A) Relative rates of tissue area changes (blue) averaged over 3 WT wings for the blade, veins and interveins, and their decomposition into cell area changes (green), cell number increase by divisions (orange) and cell number decrease by extrusions (cyan). (B) Corresponding cumulative tissue area changes and their cellular contributions. Shaded regions represent the standard deviation amongst wings.
Figure 5—figure supplement 3. Comparison of patterns of cell event orientation with their corresponding quantitative patterns of shear.

(A–A') Coarse-grained patterns of cell division orientation (A) and of shear contributed by cell division (A'). The pattern shown in (A) was obtained by summing up cell division nematics in each grid element and by further averaging in time. The pattern shown in (A') was obtained by averaging the shear nematics in each grid element and by further averaging in time. (B–B') Coarse-grained patterns of neighbor-change orientation (B) and of shear contributed by neighbor changes (B'). These patterns were obtained in the same way as for cell divisions. Only the shear patterns (A' and B') obtained with the triangulation method provide a quantitative measurement of the local deformation induced by each type of cellular event. Square-grid size of 26 × 26 microns. Time averaging covers about 55 min (11 frames) in each grid element. Scale bar 50 microns.

Validation of tissue deformation measurements using computer-generated cells

To test the reliability of TissueMiner in calculating large cell and tissue deformations, we created two computer-generated movies of hexagonal cell sheets (Videos 12 and 13). In one movie, we imposed a constant isotropic expansion rate of 3.50 × 10⁻² per frame, without any anisotropic deformation. In the second movie, we imposed a constant pure shear along the x-axis with a rate of 1.75 × 10⁻² per frame, and without any isotropic expansion. The amounts of isotropic expansion and pure shear were chosen to be at least 10 times higher than what we measure between subsequent frames of pupal wing movies.

Video 12. Computer-generated hexagonal cells with an imposed shear rate.

DOI: http://dx.doi.org/10.7554/eLife.14334.025

Video 13. Computer-generated hexagonal cells with an imposed isotropic expansion rate.

DOI: http://dx.doi.org/10.7554/eLife.14334.026

We then asked whether TissueMiner could quantitatively recover the deformation rate imposed in each movie. In each dataset, TissueMiner automatically defines a 'whole_tissue' region of interest that corresponds to a consistent set of cells that are always visible (about 100 cells in the isotropic expansion movie and about 50 cells in the pure shear movie, green labels in Videos 12 and 13). All measurements are done in this ROI to avoid measuring deformation due to inward and outward cell flows. Figure 5—figure supplement 1 shows the time evolution of the measured tissue expansion rate (panel A) and tissue shear rate (panel C) that were averaged over the 'whole_tissue' ROI, and their respective cellular contributions. Panels B and D show the corresponding cumulated curves. As expected, in the isotropic expansion movie we observe a nearly constant isotropic expansion rate, which is accounted for by the cell area change contribution. We measure an average expansion rate of (3.53 ± 0.04) × 10⁻² per frame, which is consistent with the value imposed when creating the movie. The stated uncertainty is the 95% confidence interval derived from the standard error of the mean. The pure shear rate and its cellular contributions nearly vanish in this movie (Figure 5—figure supplement 1C,D).

For the pure shear movie, we measure an approximately constant horizontal component of the pure shear rate of (1.74 ± 0.02) × 10⁻² per frame, which is consistent with the value imposed when creating the movie. This pure shear rate is entirely accounted for by cell elongation change. The isotropic expansion rate and its cellular contributions nearly vanish (Figure 5—figure supplement 1A, B). Other contributions to expansion and shear rates are negligible in both movies.
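
The logic of this validation can be reproduced on a single idealized hexagon. The sketch below is not the code used to generate Videos 12 and 13: it assumes that an expansion rate r per frame multiplies the area by exp(r), that a pure shear rate s along x stretches x by exp(s) and compresses y by exp(-s), and it ignores pixelation. All function names are hypothetical.

```python
import math

def hexagon(radius=1.0):
    """Vertices of a regular hexagon centered on the origin."""
    return [(radius * math.cos(k * math.pi / 3), radius * math.sin(k * math.pi / 3))
            for k in range(6)]

def polygon_area(points):
    """Shoelace formula for a simple polygon."""
    n = len(points)
    twice_area = sum(points[i][0] * points[(i + 1) % n][1]
                     - points[(i + 1) % n][0] * points[i][1] for i in range(n))
    return abs(twice_area) / 2.0

def deform(points, expansion=0.0, shear=0.0):
    """Apply one frame of isotropic expansion and pure shear along x."""
    sx = math.exp(0.5 * expansion + shear)
    sy = math.exp(0.5 * expansion - shear)
    return [(x * sx, y * sy) for x, y in points]

def aspect_ratio(points):
    xs, ys = [p[0] for p in points], [p[1] for p in points]
    return (max(xs) - min(xs)) / (max(ys) - min(ys))

def measured_rates(before, after):
    """Expansion and pure shear rates between two consecutive frames."""
    expansion = math.log(polygon_area(after) / polygon_area(before))
    shear = 0.5 * math.log(aspect_ratio(after) / aspect_ratio(before))
    return expansion, shear

cell = hexagon()
iso_rates = measured_rates(cell, deform(cell, expansion=3.50e-2))
shear_rates = measured_rates(cell, deform(cell, shear=1.75e-2))
print(iso_rates, shear_rates)
```

In this noise-free setting the imposed rates are recovered up to rounding error; the small deviations reported above for the actual movies arise from effects that the sketch does not model.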

The pixelated nature of individual cell contours contributes to fluctuations of our measured values. However, we find that these fluctuations cancel out when the deformation is accumulated (Figure 5—figure supplement 1B and D). Thus, the current implementation of TissueMiner captures the tissue isotropic expansion and pure shear rates, as well as the corresponding cellular contributions, with good precision in these computer-generated movies.

Deformation of the pupal fly wing

Figure 5—figure supplement 2 shows the rate of relative area change and cumulative area change of vein and inter-vein regions over time, as well as the cellular contributions to these area changes. As previously noted, the area of the blade as a whole changes very little. However, sub-region analysis reveals that inter-vein expansion compensates for compression in vein regions. Vein cells not only divide less than inter-vein cells, but also decrease their area more.

Next, we use the Triangle Method to calculate pure shear rates in the time-lapse movies of developing pupal wings. To visualize the spatial pattern of pure shear rate in the wing, TissueMiner allows us to plot nematics corresponding to the local tissue shear rates (Figure 5D) and to rates of shear produced by different cellular contributions (Figure 5—figure supplement 3, and [Etournay et al., 2015]), averaged within squares of about 26 × 26 microns.

To compare the time evolution of pure shear rate between different tissue subregions we plot this rate averaged over the corresponding ROI (Figure 5E–F and [Etournay et al., 2015]). A positive sign for shear indicates an extension along the PD axis and a contraction along the AP axis, whereas a negative sign indicates an extension along the AP axis and a contraction along the PD axis.

As reported previously, the wing blade as a whole shears along its PD axis between 16 and 32 hAPF. T1 transitions and cell elongation are major contributors to total PD shear, and they display complementary behavior that evolves over time. In the first phase, cells elongate along the PD axis in response to tissue stresses generated by hinge contraction, and by actively oriented T1 transitions that occur first along the AP axis. In the second phase, cell elongation causes the orientation of T1 transitions to shift 90˚ from the AP to the PD axis (Etournay et al., 2015). These PD-oriented T1 transitions both contribute to tissue shear and relax PD cell elongation. We now compare shear and cellular contributions to shear in vein and inter-vein regions. Tissue shear peaks earlier in inter-vein regions than in veins, but veins shear more overall. Examining the cellular contributions to shear suggests that increased shear in veins reflects a different relationship between cell elongation and T1 transitions. PD-oriented T1 transitions not only produce more shear in veins, but also fail to relax PD cell elongation as much as in inter-vein regions.

Discussion

Quantitative image analysis of developing epithelia is a powerful approach to understanding morphogenesis, but the tools with which to tame and analyze these complex data have not been widely available in a standard and well-documented format. Here we provide an introduction to the capabilities of TissueMiner and tutorials for its use. TissueMiner provides a general strategy to store and analyze large data sets of interwoven objects by combining state-of-the-art tools for data mining. It allows quantification and visualization of epithelial morphogenesis at multiple scales – from individual cells to entire tissues. It provides both a generic database format and a multi-platform toolkit to interrogate and visualize data and quantify cellular contributions to large-scale epithelial deformations.

TissueMiner has been designed to be versatile and expandable. The database format we provide standardizes the organization of tracked cell data and collects all data into a single file per movie. Such a standardized data format facilitates data sharing between different sources, thereby enhancing cross-laboratory reproducibility. As the database stores positional information about cells and cell contacts, as well as cell neighbor topology, it could also be useful for parameterizing simulations of epithelial remodeling by vertex models or other physical network models. The scheme of our relational database is expandable: additional properties of cells, bonds and vertices can be appended to the database without affecting the relationships between tables. As a consequence, our current query tools to interrogate the database remain functional, even if the database is extended with new properties of cells, bonds and vertices.

TissueMiner takes advantage of the advanced graphical capabilities of R and Python to enable the visualization of patterns of deformation and cell state properties directly on the movie images or quantitatively summarized in graphs. In particular, R provides a flexible grammar with which to manipulate tables obtained from the database and to easily plot graphs (Wickham, 2009; Francois and Wickham, 2015). TissueMiner also offers multiple options for coarse-graining data in space and time through an expandable collection of scripts, which constitutes the TissueMiner library for R or Python. These two easy-to-learn programming languages give TissueMiner the flexibility to address both general questions of epithelial morphogenesis and project-specific questions, and enable automation, parallelization and customization of user-specific workflows.

The tools underlying TissueMiner were originally developed to understand the interplay of cell dynamics and epithelial tension on the developing wing of the fruit fly, where we described cellular contributions to pupal wing morphogenesis averaged throughout the entire wing blade (Etournay et al., 2015). Here, to illustrate the utility of the TissueMiner framework, we compared the behavior of vein and inter-vein regions in the developing pupal wing. Comparing cell dynamics in veins and inter-vein regions provided an unexpected explanation for the process of 'vein refinement'. Vein refinement refers to the fact that veins become narrower during pupal morphogenesis. This had been interpreted as a signaling-dependent reduction in the number of cells assuming the vein fate (Blair, 2007). Here we show instead that vein narrowing results from a convergent extension-like process that is stronger in veins than in inter-vein regions. This elongates and narrows the veins without reducing vein cell number. It will be interesting to examine how signaling pathways involved in vein refinement influence cell dynamics in veins during morphogenesis. The standardization of analysis that TissueMiner provides will facilitate these and other comparisons critical for deciphering the molecular mechanisms underlying epithelial morphogenesis.

Materials and methods

Live imaging of the pupal wing

The knock-in Ecad::GFP fly line (Huang et al., 2005) was used for live imaging of the developing pupal wing. Flies were raised and maintained at 25°C during imaging by using a temperature-controlled chamber equipped with a humidifier to prevent desiccation. Long-term time-lapse imaging was performed as previously described (Etournay et al., 2015). After the imaging session, flies were maintained in a humid environment, where they eclosed at the end of pupal development.

A relational database to store the history of cells, their lineage and their constituent bonds and vertices

The visualization and quantification of cell dynamics underlying tissue morphogenesis rely on the ability to extract information about cell geometry, cell neighbor topology and cell histories from time-lapse movies (Aigouy et al., 2010; Etournay et al., 2015). We use TissueAnalyzer to segment and track the cell network over time. This results in a series of digital images that contain this information (Figure 6—figure supplement 1). To facilitate its access and use, we developed tools in the TissueMiner framework to extract and convert this information initially stored in images into a specific database format (see details in appendix 1), which we call 'TM-DB' (schematically outlined in Figure 6A).

Figure 6. Construction of the relational database of TissueMiner.

(A) Conceptual scheme of the database. Entities (square boxes) are related to other entities by associations (rounded boxes). Each entity contains an identifier (underlined) that uniquely defines each record. The database can be implemented by converting entities into tables (see appendix 1 and Figure 6—figure supplement 2). (B) Cell lineage trees are stored in the database: upon division a mother cell identifier a gives rise to two new daughter cell identifiers b and c. {a,b,c,d,e,f,g} defines one lineage group. (C) A pixelated cell contour in the 2D cell network: green=bond pixels, red=vertex pixels, white=other cell network pixels. (D) Vectorized representation of the cell shown in (C). To preserve the topology of the cell network, directed bonds (cyan) are defined from within a given cell alpha and ordered counterclockwise along the cell contour. Each directed bond is complemented by a conjugated bond (magenta) and is linked to its next counterclockwise follower (dashed).

DOI: http://dx.doi.org/10.7554/eLife.14334.027

Figure 6—figure supplement 1. Tracked cells identified by unique colors in TissueAnalyzer.

(A) shows two consecutive frames depicting colored-tracked cells from a time-lapse movie processed with TissueAnalyzer. Each cell is assigned a color identifier that uniquely defines it in the course of the time-lapse. One pixel wide cell-cell interfaces are visible in white on the raster image.
Figure 6—figure supplement 2. Logical scheme of the relational database.

(A) The conceptual scheme shown in Figure 6A can be automatically converted to the logical scheme shown here by using software such as IntelliJ IDEA or MySQL Workbench. The rules of conversion are briefly described in appendix 1. The entities defined in the conceptual scheme are converted into tables containing one primary key (upper part of the table) that uniquely defines each record in the table, the properties of each record, and the foreign keys (arrows). Foreign keys are properties of one table pointing to the primary key of a related table (e.g. conj_dbond_id:dbond_id means that the conj_dbond_id column is a foreign key whose values must be defined in the dbond_id column of the dbonds table). As a consequence of logical constraints imposed by foreign keys, tables harbor more columns than one would expect from looking at Figure 6A. This logical scheme shows all tables and columns of the database. The scheme is implemented in physical SQLite tables that are indexed for the sake of performance (see CreateDbFromParser.R on https://github.com/mpicbg-scicomp/tissue_miner).

First, the history of each tracked cell in the movie is stored as a separate row in the cell_histories table of the TM-DB (Figure 6A). This includes the movie frames in which it first appears and disappears and why, along with its lineage relationship to other cells (see appendix 1). The reason for cell appearance and disappearance is inferred by the parser. A primary reason could be a cell division, which results in the disappearance of the mother cell and in the appearance of two daughter cells. It could be a cell extrusion that results in its disappearance. It could also be that cells move in and out of the field of view of the microscope lens, resulting in gain and loss of cells. Furthermore, we use this information to establish the lineage relationship that corresponds to each group of cells related by ancestry (Figure 6B). Each cell within the lineage group is assigned a generation number. The lineage group and generation number for each cell are listed in the cell_histories table.

We store the time points at which the movie images were recorded into a frames table that links each movie frame to its corresponding time point. For each movie frame, we need to store geometrical and topological information about cells within the cellular network. Geometrical information includes position and shape descriptors, whereas topological information indicates the arrangement of neighboring cells around each cell. We use cell histories, geometry and topology to understand how individual cells contribute to the whole tissue deformation during morphogenesis (Etournay et al., 2015).

The geometrical information is stored in three tables of the TM-DB: cells, bonds and vertices. They correspond to the 3 generic entities - cells, cell-cell contacts and intersections between cell-cell contacts, respectively illustrated in Figure 6C. These entities are commonly used in vertex model simulations (for a review, see Fletcher et al., 2014). The cells table contains cell geometrical data (center of mass, area, shape anisotropy) and the polarized distribution of proteins along the cell circumference, as represented by a polarity nematic tensor (Aigouy et al., 2010). The bonds table records bond length, and the vertices table records vertex position in each movie frame.

The directed_bonds table exclusively stores the cell neighbor topological information at each frame, i.e. how bonds are organized around each cell along with the cell neighbor relationship information. To store the cell neighbor topology in an unambiguous manner, we define for each cell a directed path of consecutive bond vectors oriented counterclockwise, which forms the oriented circumference of the cell (Figure 6D, see also [Kachalo et al., 2015]). We link each directed bond to its counterclockwise follower (left directed bond) in the same cell. To store the cell neighbor relationship, we link each directed bond to its corresponding directed bond (conjugated bond) of the neighboring cell (Figure 6D, and appendix 1).
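
How such a structure yields the ordered neighbors of a cell can be sketched with a toy network of three cells. The example below is illustrative only: it holds the directed bonds in a Python dictionary rather than in the database, the field name `left_dbond_id` is an assumption modeled on the 'left directed bond' described above, and border bonds are simply given no conjugate, which is a simplification of this sketch.

```python
# dbond_id -> (cell_id, conj_dbond_id, left_dbond_id)
# left_dbond_id is the next directed bond counterclockwise in the same cell.
dbonds = {
    1: (10, 4, 2),
    2: (10, 7, 3),
    3: (10, None, 1),  # toy simplification: border bond without conjugate
    4: (20, 1, 5),
    5: (20, None, 6),
    6: (20, None, 4),
    7: (30, 2, 8),
    8: (30, None, 9),
    9: (30, None, 7),
}

def ordered_neighbors(start_dbond_id):
    """Walk once around a cell and list its neighbors in counterclockwise order."""
    neighbors = []
    dbond_id = start_dbond_id
    while True:
        _cell_id, conj_dbond_id, left_dbond_id = dbonds[dbond_id]
        if conj_dbond_id is not None:
            # The conjugated bond belongs to the cell on the other side.
            neighbors.append(dbonds[conj_dbond_id][0])
        dbond_id = left_dbond_id
        if dbond_id == start_dbond_id:
            break
    return neighbors

print(ordered_neighbors(1))  # neighbors of cell 10
print(ordered_neighbors(4))  # neighbors of cell 20
```

Following the 'left' links recovers the oriented circumference of a cell, and following the conjugate links crosses into the adjacent cell, so both the order and the identity of neighbors are stored without ambiguity.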

The TM-DB is relational, which means that it establishes contextual relationships between items stored in one or more tables (see appendix 1). These relationships are outlined in rounded boxes in the conceptual scheme of the TM-DB (Figure 6A). Technically, each item in a table is stored in a separate row and is given a unique number as identifier. For a relationship between two tables, one of the tables contains an additional column, which refers to items in the other table by holding their identifier number. Such additional columns for the TM-DB format are shown in blue in Figure 6—figure supplement 2. When extracting information from a database using so-called queries, these columns serve as bridges connecting the information stored about related items.
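
The role of these bridging columns can be illustrated with a small in-memory SQLite database. The sketch below is not the TM-DB schema: the table names frames and cells are taken from the text, but the column names (`frame`, `time_sec`, `cell_id`, `area`) and all values are assumptions made for this example.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE frames (
    frame    INTEGER PRIMARY KEY,
    time_sec REAL
);
CREATE TABLE cells (
    frame   INTEGER REFERENCES frames(frame),  -- bridge to the frames table
    cell_id INTEGER,
    area    REAL,
    PRIMARY KEY (frame, cell_id)
);
INSERT INTO frames VALUES (0, 0), (1, 300);
INSERT INTO cells VALUES (0, 1, 10.0), (0, 2, 20.0), (1, 1, 12.0), (1, 2, 22.0);
""")

# The shared 'frame' column connects each cell record to its time point.
rows = con.execute("""
    SELECT f.time_sec, AVG(c.area)
    FROM cells AS c
    JOIN frames AS f ON c.frame = f.frame
    GROUP BY f.frame
    ORDER BY f.frame
""").fetchall()
print(rows)
con.close()
```

The query returns one row per frame, pairing the acquisition time with the mean cell area, without the time having to be duplicated in the cells table.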

In essence, this structure creates a generic relational model to represent complex cell tracking data in 2D. In practice, the data for each movie is stored in a separate SQLite database file. Since all movie files are stored using the same database structure, automated data mining and visualization are greatly facilitated. For the same reason, usage of the TissueMiner database format encourages exchange of both movie data and analysis tools.

An automated workflow compliant with high performance computing platforms

To help the user to perform complex tissue morphogenesis analysis, we developed an automated pipeline that uses the tracked data from TissueAnalyzer as an input to build the database and perform all downstream analyses described above. To do so, we use the snakemake workflow engine developed by Koster and Rahmann (Koster and Rahmann, 2012). This engine channels the different processing steps into a well-formed workflow graph. Snakemake automatically determines the execution order, provides means for error recovery and job control, and supports High Performance Computing (HPC) environments. By using snakemake we enable the user to easily run and monitor TissueMiner, while maintaining a proper decoupling of tools as independent executables.

Practically, the user defines a workflow definition file in which processing steps are defined as a set of execution rules, namely a list of scripts to be run along with required input(s) and expected output(s). Snakemake automatically builds a directed graph from which the execution order of processing steps is inferred. If only one branch of the graph needs to be run, the engine will ensure that all input data are present and will automatically run upstream steps if necessary. This engine also provides the possibility to visualize a directed acyclic execution dependency and execution state graph (DAG) for a given workflow (see Figure 7).
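
The way an execution order follows from a dependency graph can be sketched with the Python standard library. This is not how snakemake is implemented (snakemake decides what to run from the presence of input and output files), and the dependency edges below are assumptions for illustration: only the order prepare_movie, parse_tables, make_db is stated in the text, and the rule names are borrowed from Figure 7.

```python
from graphlib import TopologicalSorter

# rule -> rules whose outputs it requires (edges partly assumed, see above)
dependencies = {
    "parse_tables": {"prepare_movie"},
    "make_db": {"parse_tables"},
    "roi_tracking": {"make_db"},
    "tri_create": {"roi_tracking"},
    "shear_calculate": {"tri_create"},
    "topo_countT1": {"roi_tracking"},
}

def rules_needed(target, deps):
    """Collect the target rule and everything upstream of it."""
    needed, stack = set(), [target]
    while stack:
        rule = stack.pop()
        if rule not in needed:
            needed.add(rule)
            stack.extend(deps.get(rule, ()))
    return needed

def execution_order(target, deps, done=()):
    """Order the rules required for one target, skipping completed ones."""
    needed = rules_needed(target, deps)
    subgraph = {rule: deps.get(rule, set()) & needed for rule in needed}
    return [rule for rule in TopologicalSorter(subgraph).static_order()
            if rule not in done]

order = execution_order("shear_calculate", dependencies)
resumed = execution_order("shear_calculate", dependencies,
                          done={"prepare_movie", "parse_tables"})
print(order)
print(resumed)
```

Requesting a single branch schedules only its upstream rules, so 'topo_countT1' is not run here, and rules that are already complete are skipped when the workflow is resumed.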

Figure 7. Automated workflow using snakemake.

(A) The snakemake engine can generate a directed acyclic graph (DAG), an example of which is shown here. This graph represents both the execution dependency (grey arrows) and the execution state of the workflow (solid or dashed line). Each box corresponds to an execution rule, namely a program to be run along with required input(s) and expected output(s). This DAG can be generated at any time when running the workflow (see documentation). Solid lines indicate the rules that have not been executed yet, whereas dashed lines depict completed jobs. The first rule to be executed is called 'prepare_movie': it prepares the tracked images from TissueAnalyzer to be converted by the parser into tables of values containing all necessary entities along with their properties ('parse_tables' rule). Then the 'make_db' rule is executed for building the database. By following the grey arrows, one can navigate to the next steps of the workflow. The 'roi_tracking' rule filters out cells in contact with margin cells including user-defined regions of interest, and the 'roi_movie' rule allows us to visualize regions of interest over time. The 'deformation_movies' and 'db_elongation_movies' rules generate annotated movies to visualize the deformation of the tissue and the cell state properties (area, elongation). The 'four_way' rule detects four-way vertices and performs basic statistics on vertices. The 'tri_create' rule performs the triangulation of the network for further shear calculation and visualization ('shear_calculate' and 'shear_movies'). It also enables triangle tracking and mapping to each type of cell event ('tri_categorize'). The 'topo_countT1' rule detects neighbor changes that are not due to division or extrusion, and categorizes them into gained or lost neighbors. The 'topo_movies' rule allows one to visualize the coarse-grained rates of division and neighbor changes on the tissue. The 'topo_unbalance' rule is a quality check to verify that the number of gained neighbors is similar to the number of lost neighbors. The 'polygon_class' rule performs the cell-neighbor number count. The 'lineage_colors' rule allows us to optimize the color of each lineage group such that adjacent lineage groups always have different colors. Finally, the 'lineage_movies' rule allows one to visualize lineage groups and cell generations on the tissue. The rule 'all' checks that all upstream jobs have been completed.

DOI: http://dx.doi.org/10.7554/eLife.14334.030

One major advantage of a workflow engine such as snakemake is that it can run the workflow on various architectures - from single-core workstations to multi-core servers and clusters - without the need to modify the rules, thereby facilitating reproducible research. To simplify the TissueMiner installation procedure, we provide a pre-configured system to be loaded in the docker software available at http://docker.com. The TissueMiner docker image can be run without any setup using provided example data or custom user data, as detailed on the TissueMiner GitHub project page. More advanced users can use TissueMiner directly from the command-line with or without snakemake and can thus perform simultaneous analyses of multiple movies.

A user-friendly data-mining library to easily collect information for comparing multiple datasets

After applying our automated workflow to different movies, the results can be easily compared using a collection of command-line tools written in R and Python. These tools aggregate different experiments for plotting and performing comparative analysis. Here we describe the tools written in R. Python tools are described in the corresponding tutorial. The R tools are designed to be used in an integrated development environment such as RStudio, which provides a user-friendly environment to assist the user in writing and executing command lines. These command line tools are organized in the spirit of a grammar of data manipulation and they can be combined with the existing R tools like dplyr (Francois and Wickham, 2015) or ggplot2 (Wickham, 2009) for manipulating and visualizing data (https://mpicbg-scicomp.github.io/tissue_miner/user_manual/Learning_the_R_basics_for_TissueMiner.html).

We developed generic 'multi-query functions' (mqf) to collect specific information for individual movies. These mqf tools are organized into fine-grained and coarse-grained categories according to the type of analysis to be carried out. The fine-grained tools aggregate data at cellular level, namely individual cell properties inside regions of interest. These tools are prefixed with 'mqf_fg_'. The coarse-grained mqf tools are further separated into 'roi' and 'grid' categories to distinguish between regions moving with the tissue and static square regions tiled into a grid. They allow one to visualize and quantify average cell properties at different tissue locations and various spatial scales, and are prefixed with 'mqf_cg_roi_' and 'mqf_cg_grid_' respectively.

To compare fine-grained and coarse-grained cell properties amongst movies we developed a 'multi-db-query' tool, which streamlines the application of the mqf tools to a set of movies. To use this tool, the user should first align the movies in time, using convenient morphological or cellular landmarks. As for the Drosophila wing, we align movies such that the peaks of cell elongation coincide in the different movies. The user can then apply a chosen mqf tool to multiple movies and multiple ROI’s. All mqf tools, alone or in combination with the 'multi-db-query' tool, generate a table that contains individual or averaged measurements to be visualized on the tissue (Figure 1 A–E’, Figure 2A–D’, Figure 3A,D, Figure 4B,D, Figure 5D) or in graphs (Figure 2E–F, Figure 3B, Figure 4C, Figure 5E–F). This library of tools is described in detail in the TM R-User Manual, which also provides many examples. These tools can be easily extended to address project specific questions.

Detecting gain and loss of cell contacts

To detect cell neighbor changes, we developed a routine in R that queries the DB and establishes the cell-neighbor relationship at each frame. By comparing the list of neighboring cell identifiers for a given cell between two consecutive frames [f, f + 1], one can identify and count the changes in neighbor relationships. These can be subdivided into those caused by cell divisions, cell extrusions or the simple gain or loss of a cell contact (not due to division or extrusion). We call these half-T1’s because they resemble the gain and loss of cell contacts that occurs during a T1 transition – although they may also be generated by other events such as rosette formation. To assign a neighbor change to the half-T1 category, the corresponding cell identifiers must be present in both frames, ruling out extrusions and cells moving in and out of the field of view. To detect half-T1’s that occur simultaneously with divisions, we mask neighbor changes due to divisions by propagating the mother cell identifier (frame f) to the two daughter cells (frame f+1), which we fuse into one fake cell having the mother cell identifier. We iterate over each pair of consecutive frames and store the half-T1 events due to a gain and a loss of cell neighbors.
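
The comparison between two consecutive frames can be sketched as follows. The original routine is written in R and queries the database; the Python example below is a simplified illustration that takes neighbor lists as dictionaries, and its function and argument names are hypothetical.

```python
def neighbor_changes(neighbors_f, neighbors_f1, mother_of=None):
    """Return (gained, lost) cell contacts between frames f and f+1.

    neighbors_*: dict mapping cell_id -> set of neighboring cell_ids.
    mother_of:   dict mapping daughter cell_id (frame f+1) -> mother cell_id.
    """
    mother_of = mother_of or {}

    def relabel(cell):
        return mother_of.get(cell, cell)

    # Fuse daughters into one fake cell carrying the mother identifier.
    fused = {}
    for cell, nbrs in neighbors_f1.items():
        fused.setdefault(relabel(cell), set()).update(relabel(n) for n in nbrs)
    for cell in fused:
        fused[cell].discard(cell)  # sister cells are not neighbors of themselves

    # Only cells present in both frames can take part in a half-T1.
    present = set(neighbors_f) & set(fused)
    gained, lost = set(), set()
    for cell in present:
        for other in (fused[cell] - neighbors_f[cell]) & present:
            gained.add(frozenset((cell, other)))
        for other in (neighbors_f[cell] - fused[cell]) & present:
            lost.add(frozenset((cell, other)))
    return gained, lost

# A T1 transition: cells 1 and 2 lose contact, cells 3 and 4 gain contact.
frame_f = {1: {2, 3, 4}, 2: {1, 3, 4}, 3: {1, 2}, 4: {1, 2}}
frame_f1 = {1: {3, 4}, 2: {3, 4}, 3: {1, 2, 4}, 4: {1, 2, 3}}
gained, lost = neighbor_changes(frame_f, frame_f1)

# A division of cell 5 into cells 6 and 7 is masked and yields no half-T1.
div_gained, div_lost = neighbor_changes(
    {1: {5}, 5: {1}}, {1: {6, 7}, 6: {1, 7}, 7: {1, 6}}, mother_of={6: 5, 7: 5})
print(gained, lost, div_gained, div_lost)
```

Storing each contact as an unordered pair avoids counting the same gain or loss twice, once from each of the two cells involved.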

Cell lineages and lineage browsing to follow ROI’s forward and backward in time

We pool all lineage information (as contained in the cell_id, left_daughter_cell_id and right_daughter_cell_id columns from the cell_histories table) into a directed lineage graph (Nepusz, 2006) from which we infer a lineage group identifier and a generation number. By definition, the root of each lineage tree is considered as the F0 generation and is thus given a generation value of 0. We follow ROI’s backward and forward in time by browsing lineage graphs that were selected based on the regions drawn with the draw_n_get_ROIcoord.ijm Fiji macro. However, cells may be lost or not detected when browsing the lineages. One primary reason is that extruding cells are not detected when browsing the lineage backward in time. Cells could also be lost due to possible tracking mistakes. To improve the spatial continuity of ROI’s, we have implemented a method to reassign lost cells to a ROI when they are located within it. To identify lost cells for a frame within a given ROI, we first distinguish bonds that connect two cells within the ROI, only one cell within the ROI, or none. All corresponding cell pairs define an undirected graph on which a connected component analysis (Nepusz, 2006) allows us to identify the ROI and non-ROI regions. All cells of non-ROI regions, except for the largest one, are reassigned to become part of the ROI. By doing so, we make the assumption that the largest non-ROI component is defined by the tissue surrounding the ROI.
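The reassignment of lost cells can be sketched with a plain breadth-first connected component search. This Python sketch shows one possible reading of the procedure and does not reproduce the igraph-based R implementation: the function name and the input format (a list of cell-cell contacts and a set of ROI cell identifiers for one frame) are assumptions.

```python
from collections import deque


def reassign_lost_cells(contacts, roi_cells):
    """Return the ROI extended by every non-ROI component except the largest one."""
    roi = set(roi_cells)

    # Graph of non-ROI cells, linked only by bonds with no cell inside the ROI.
    adjacency = {}
    for a, b in contacts:
        for cell in (a, b):
            if cell not in roi:
                adjacency.setdefault(cell, set())
        if a not in roi and b not in roi:
            adjacency[a].add(b)
            adjacency[b].add(a)

    # Breadth-first search for the connected components of the non-ROI graph.
    components, seen = [], set()
    for start in adjacency:
        if start in seen:
            continue
        component, queue = set(), deque([start])
        seen.add(start)
        while queue:
            cell = queue.popleft()
            component.add(cell)
            for neighbor in adjacency[cell]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        components.append(component)

    if not components:
        return roi

    # Assumption taken from the text: the largest component is the surrounding tissue.
    largest = max(components, key=len)
    for component in components:
        if component is not largest:
            roi |= component
    return roi


# Toy example: cell 9 is enclosed by ROI cells 1-4; cells 10-12 form the surrounding tissue.
contacts = [(1, 2), (2, 3), (3, 4), (4, 1), (9, 1), (9, 2), (9, 3), (9, 4),
            (1, 10), (2, 11), (3, 12), (10, 11), (11, 12)]
new_roi = reassign_lost_cells(contacts, roi_cells={1, 2, 3, 4})
```

Here the enclosed cell 9 forms a small non-ROI component and is added to the ROI, whereas cells 10 to 12 remain outside.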

Nematic tensors to describe cell elongation and the orientation of cellular processes

When analyzing and visualizing single cell properties, we use the same cell elongation definition as in Aigouy et al. (2010). For a given Cartesian xy coordinate system, the elongation of a given cell is defined by the nematic tensor

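$$\begin{pmatrix} \epsilon_{xx} & \epsilon_{xy} \\ \epsilon_{xy} & -\epsilon_{xx} \end{pmatrix}$$

with

$$\epsilon_{xx} = \frac{1}{A_c}\int \cos(2\phi)\,dA$$
$$\epsilon_{xy} = \frac{1}{A_c}\int \sin(2\phi)\,dA.$$

Here, $A_c$ is the area of the given cell, and the integrals are carried out over all points $\mathbf{r}$ within the cell. The angle $\phi$ is the angle between the vector $\mathbf{r} - \mathbf{r}_c$ and the x axis, where $\mathbf{r}_c$ is the cell center defined as

$$\mathbf{r}_c = \frac{1}{A_c}\int \mathbf{r}\,dA.$$

Here, the integral is again carried out over all points $\mathbf{r}$ within the cell. The magnitude of the elongation is given by $\epsilon = (\epsilon_{xx}^2 + \epsilon_{xy}^2)^{1/2}$ and the elongation angle $\varphi$ is given by the following two equations

$$\cos(2\varphi) = \frac{\epsilon_{xx}}{\epsilon}$$
$$\sin(2\varphi) = \frac{\epsilon_{xy}}{\epsilon}.$$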

Note that this definition of cell elongation is different from the triangle-based definition that is also discussed in this article. However for the fruit fly wing, both cell elongation definitions yield very similar results.
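As an illustration of the single-cell elongation defined above, the following Python sketch approximates the area integrals by sums over the pixels of a cell, each pixel counting for one unit of area. The function name and the pixel-list input are assumptions made for this example; this is not the TissueAnalyzer or TissueMiner code.

```python
import math


def cell_elongation(pixels):
    """Approximate the elongation nematic of a cell given its (x, y) pixel positions.

    Returns (exx, exy, magnitude, angle), where angle is the elongation angle in radians.
    """
    n = len(pixels)  # each pixel has unit area, so the cell area is n
    cx = sum(x for x, _ in pixels) / n
    cy = sum(y for _, y in pixels) / n

    exx = exy = 0.0
    for x, y in pixels:
        dx, dy = x - cx, y - cy
        if dx == 0 and dy == 0:
            continue  # the angle is undefined exactly at the cell center
        phi = math.atan2(dy, dx)
        exx += math.cos(2 * phi)
        exy += math.sin(2 * phi)
    exx /= n
    exy /= n

    magnitude = math.hypot(exx, exy)
    angle = 0.5 * math.atan2(exy, exx)
    return exx, exy, magnitude, angle


# A 20 x 4 pixel rectangle is elongated along x; a 10 x 10 square is not elongated.
rectangle = [(x, y) for x in range(20) for y in range(4)]
square = [(x, y) for x in range(10) for y in range(10)]
rect_exx, rect_exy, rect_mag, rect_angle = cell_elongation(rectangle)
sq_exx, sq_exy, sq_mag, sq_angle = cell_elongation(square)
```

For the rectangle the elongation axis lies along x, whereas for the square both components vanish up to rounding.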

To characterize the axes of cell divisions and T1 transitions, we introduce the unit nematic tensors $\tilde{n}_{CD}$, $\tilde{n}_{T1+}$, and $\tilde{n}_{T1-}$. The orientation of a single cell division is quantified by the unit nematic $\tilde{n}_{CD}$ defined by:

$$\tilde{n}_{CD} = \begin{pmatrix} \cos(2\phi_{CD}) & \sin(2\phi_{CD}) \\ \sin(2\phi_{CD}) & -\cos(2\phi_{CD}) \end{pmatrix}.$$

Here, the angle $\phi_{CD}$ is the angle of the line connecting both daughter cell centers with respect to the x axis, measured in the counter-clockwise sense. The orientation of a half-T1 transition during which two cells lose neighborship is characterized by:

$$\tilde{n}_{T1+} = \begin{pmatrix} \cos(2\phi_{T1+}) & \sin(2\phi_{T1+}) \\ \sin(2\phi_{T1+}) & -\cos(2\phi_{T1+}) \end{pmatrix},$$

where $\phi_{T1+}$ is the angle of the line connecting the centers of the cells losing neighborship. The orientation of a half-T1 transition during which two cells gain neighborship is characterized by:

$$\tilde{n}_{T1-} = -\begin{pmatrix} \cos(2\phi_{T1-}) & \sin(2\phi_{T1-}) \\ \sin(2\phi_{T1-}) & -\cos(2\phi_{T1-}) \end{pmatrix},$$

where $\phi_{T1-}$ is the angle of the line connecting the centers of the cells that gain neighborship. The axes of the nematics $\tilde{n}_{CD}$, $\tilde{n}_{T1+}$ and $\tilde{n}_{T1-}$ roughly correspond to the axis along which the tissue extends due to the respective cell division or half-T1 transition. In particular, because of the minus sign in the definition of $\tilde{n}_{T1-}$, when the same two cells gain neighborship and lose it again along the same axis, the total effect of adding $\tilde{n}_{T1+}$ and $\tilde{n}_{T1-}$ is zero.
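The cancellation between a gain and a subsequent loss of the same contact can be checked numerically. The Python sketch below builds the unit nematics from cell center positions; the helper names and the toy coordinates are hypothetical.

```python
import math


def axis_angle(center_a, center_b):
    """Angle (counter-clockwise from the x axis) of the line joining two cell centers."""
    return math.atan2(center_b[1] - center_a[1], center_b[0] - center_a[0])


def unit_nematic(phi, sign=1.0):
    """Unit nematic tensor for axis angle phi; sign=-1.0 is used for gained contacts."""
    c, s = math.cos(2 * phi), math.sin(2 * phi)
    return [[sign * c, sign * s], [sign * s, -sign * c]]


def add(m1, m2):
    """Entry-wise sum of two 2x2 matrices."""
    return [[m1[i][j] + m2[i][j] for j in range(2)] for i in range(2)]


# Two cells whose centers lie on a diagonal axis.
phi = axis_angle((0.0, 0.0), (1.0, 1.0))
n_gain = unit_nematic(phi, sign=-1.0)  # the two cells gain neighborship
n_loss = unit_nematic(phi, sign=+1.0)  # the same two cells lose neighborship again
total = add(n_gain, n_loss)

# A nematic is unchanged when its axis is rotated by 180 degrees.
n_flipped = unit_nematic(phi + math.pi)
```

The sum `total` vanishes, and `n_flipped` equals `n_loss` up to rounding, which reflects the fact that the order of the two cell centers does not matter.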

Tissue deformation and cellular contributions to it

Here we discuss the formal definitions used to characterize tissue deformation, area change, and shear. We characterize the local rate of tissue deformation by the gradient of the velocity field v(r). We then define the overall deformation rate V of a given piece of tissue by the integral over the area At of this piece:

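$$V = \frac{1}{A_t}\int \begin{pmatrix} \frac{\partial v_x}{\partial x} & \frac{\partial v_y}{\partial x} \\ \frac{\partial v_x}{\partial y} & \frac{\partial v_y}{\partial y} \end{pmatrix} dA.$$

This 2x2 tensor can be decomposed into an isotropic part $V_{iso}$ characterizing the relative growth rate of tissue area, a symmetric, traceless part $\tilde{V}$ characterizing the anisotropic part of the deformation (pure shear rate), and an antisymmetric part $\Omega$ characterizing overall tissue rotation:

$$V = \frac{V_{iso}}{2} I + \tilde{V} + \Omega\, e.$$

Here, we have defined

$$V_{iso} = \frac{1}{A_t}\int \left(\frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y}\right) dA, \qquad \Omega = \frac{1}{2A_t}\int \left(\frac{\partial v_x}{\partial y} - \frac{\partial v_y}{\partial x}\right) dA,$$

$$I = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \qquad \tilde{V} = \frac{1}{2A_t}\int \begin{pmatrix} \frac{\partial v_x}{\partial x} - \frac{\partial v_y}{\partial y} & \frac{\partial v_y}{\partial x} + \frac{\partial v_x}{\partial y} \\ \frac{\partial v_y}{\partial x} + \frac{\partial v_x}{\partial y} & \frac{\partial v_y}{\partial y} - \frac{\partial v_x}{\partial x} \end{pmatrix} dA, \qquad \text{and} \qquad e = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$$

In recent work, we have shown that the overall shear rate $\tilde{V}$ can be exactly decomposed into a sum of cellular contributions using our Triangle Method (Merkel, 2014; Merkel et al., 2016):

$$\tilde{V} = \frac{D\tilde{Q}}{Dt} + T + C + E + D.$$

Here, the nematic tensor $\tilde{Q}$ is the average cell elongation defined based on triangles, and the nematic tensors $T$, $C$, $E$, and $D$ are the shear contributions by T1 transitions, cell divisions, cell extrusions, and correlation effects, respectively. The corotational time derivative $D\tilde{Q}/Dt$ is defined by

$$\frac{D\tilde{Q}}{Dt} = \frac{d\tilde{Q}}{dt} - 2\left(c\,\Omega + [1 - c]\frac{d\Phi}{dt}\right) e \cdot \tilde{Q}.$$

The operator $d/dt$ denotes the total derivative, $c = \tanh(2Q)/(2Q)$, and the dot denotes the tensor dot product. The quantities $Q$ and $\Phi$ denote the magnitude and angle of the average cell elongation tensor $\tilde{Q}$.

These formal definitions for $\tilde{Q}$, $D\tilde{Q}/Dt$, $T$, $C$, $E$, and $D$ refer to deformation rates in the limit of infinitesimal deformations. However, subsequent frames of any real tissue movie are separated by finite time intervals, i.e. finite deformations. There are different ways to adapt these definitions to finite deformations (Etournay et al., 2015; Merkel et al., 2016). The current implementation of TissueMiner uses the finite-deformation definitions presented in detail in Etournay et al. (2015).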
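The decomposition of the overall deformation rate into isotropic, pure shear and rotational parts can be checked on a small numerical example. The Python sketch below takes an already area-averaged 2x2 tensor as input; the function names and the numbers are made up for illustration, and the sketch covers only this decomposition, not the finite-deformation Triangle Method.

```python
def decompose(V):
    """Split a 2x2 tensor into (V_iso, shear, Omega) such that
    V = V_iso / 2 * I + shear + Omega * e, with e = [[0, -1], [1, 0]]."""
    (a, b), (c, d) = V
    v_iso = a + d                            # trace: relative area growth rate
    omega = 0.5 * (c - b)                    # coefficient of the antisymmetric part
    shear = [[0.5 * (a - d), 0.5 * (b + c)],
             [0.5 * (b + c), 0.5 * (d - a)]]  # symmetric, traceless part
    return v_iso, shear, omega


def recompose(v_iso, shear, omega):
    """Rebuild the 2x2 tensor from its three parts."""
    identity = [[1.0, 0.0], [0.0, 1.0]]
    e = [[0.0, -1.0], [1.0, 0.0]]
    return [[0.5 * v_iso * identity[i][j] + shear[i][j] + omega * e[i][j]
             for j in range(2)] for i in range(2)]


# Hypothetical averaged deformation rate tensor (arbitrary units).
V = [[0.3, -0.1], [0.5, 0.1]]
v_iso, shear, omega = decompose(V)
V_back = recompose(v_iso, shear, omega)
```

For this input, `v_iso` is 0.4, `omega` is 0.3, the shear part has components 0.1 (diagonal) and 0.2 (off-diagonal), and `V_back` reproduces `V`.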

Acknowledgements

This work was a truly collaborative effort and the authors jointly wrote the manuscript. We are grateful to Christian Dahmann, Marcus Michel and Jacques Boutet de Monvel for critical reading of the manuscript, Benoit Lombardo for his help in Fiji macroing, and Peter Steinbach for useful discussion about workflow engines. We thank Franz Gruber, Vincent Michel and Nathalie Gourreau for testing the quickstart tutorials. RE acknowledges a Marie Curie fellowship from the EU 7th Framework Programme (FP7). This work was supported by the Max Planck Gesellschaft, and by the BMBF. SE acknowledges funding from the ERC.

Appendix 1

Parsing tracked-cell images to build the TM-DB

We used TissueAnalyzer to detect cell contours (segmentation) and to track cells over time. This software generates two output masks - the tracked-cell and the cell-division masks. These masks are raster images. In both masks, cell circumferences are represented by one pixel thick white lines. In the tracked-cell mask, all pixels inside the cell circumference have the same unique color. In consecutive frames, the same cell has the same color. In the cell-division mask, each cell is colored either in black or in blue. If a cell is blue, it is a daughter cell that emerged from a division between two consecutive frames. Otherwise, a cell is black in the cell-division mask.

We wrote a custom C++ parser that converts information contained in the tracked-cell and cell-division masks into tables that can be easily transformed into the TissueMiner database. This parser first extracts topological and geometrical information about cells, bonds and vertices for each individual frame. Afterwards, it analyzes the continuity of cell existences across consecutive frames. In particular, it tries to infer reasons for appearance or disappearance of cells. Finally, based on this information, history and lineage can be established for each cell (see Materials and methods).

The parser extracts the topological information for each frame from the tracked-cell mask. It scans the entire mask image row by row. Whenever it hits a cell boundary (white pixel), it defines the cell circumference and divides it into bonds (defined as contiguous white pixels that are in contact with exactly two cells) and vertices (defined as white pixels surrounded by 3 or more pixels of different colors). The topology, namely how neighboring cells are arranged around each cell, is obtained by creating a counter-clockwise series of consecutive directed bonds. Each directed bond stems from a unique vertex and points to the next vertex along the cell circumference. We created the concept of directed bonds to unambiguously characterize the wiring between cells, vertices, and (undirected) bonds (Figure 6D). The parser stores the topology by creating the relation of each directed bond with its next counter-clockwise follower on the cell circumference and with the vertex from which it stems (Figure 6D). To store the cell-nearest-neighbor relationships, we map each cell-cell contact (bond) to the two corresponding directed bonds, where each directed bond is associated with a single cell and a single vertex. This is illustrated in Figure 6D, where the cyan directed bond points towards vertex i and lies on the side of cell α, whereas the magenta directed bond points towards vertex j and lies on the side of cell β. We say that the cyan and magenta directed bonds are 'conjugated' to each other.

The parser also extracts geometrical information for each given cell by going along the circumference of that cell. Cell area $A$ is computed as:

$$A = \frac{1}{2}\sum_i \left[p_x(i)\,p_y(i+1) - p_y(i)\,p_x(i+1)\right],$$

where the index $i$ runs over all pixels in counter-clockwise order around the cell. The vector $\mathbf{p}(i) = (p_x(i), p_y(i))$ denotes the position of pixel $i$. The cell center $\mathbf{c}$ is computed as:

$$\mathbf{c} = \frac{1}{6A}\sum_i \left[p_x(i)\,p_y(i+1) - p_y(i)\,p_x(i+1)\right]\left[\mathbf{p}(i) + \mathbf{p}(i+1)\right].$$

Cell shape anisotropy is described by the two components of the symmetric traceless tensor defined elsewhere (Aigouy et al., 2010). The cell perimeter is computed as the sum of the lengths of all bonds belonging to the cell boundary. The length of a bond is computed as the summed pixel distance going along this bond pixel by pixel. In particular, when advancing one pixel up, down, left, or right, 1 is added to the bond length. However, when advancing diagonally, $\sqrt{2}$ is added.
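The area and center formulas above are the standard polygon (shoelace) formulas applied to the ordered boundary pixels. The following Python sketch illustrates them together with the pixel-wise bond length; it is not the C++ parser, the function names are hypothetical, and the bond length helper assumes that consecutive pixels are direct or diagonal neighbors.

```python
import math


def polygon_area_and_center(points):
    """Area and center of a closed contour given as (x, y) points in counter-clockwise order."""
    n = len(points)
    twice_area = 0.0
    cx = cy = 0.0
    for i in range(n):
        x0, y0 = points[i]
        x1, y1 = points[(i + 1) % n]  # wrap around to close the contour
        cross = x0 * y1 - y0 * x1
        twice_area += cross
        cx += cross * (x0 + x1)
        cy += cross * (y0 + y1)
    area = 0.5 * twice_area
    return area, (cx / (6.0 * area), cy / (6.0 * area))


def bond_length(pixels):
    """Sum pixel-to-pixel distances along a bond: 1 for straight steps, sqrt(2) for diagonal ones."""
    length = 0.0
    for (x0, y0), (x1, y1) in zip(pixels, pixels[1:]):
        diagonal = (x0 != x1) and (y0 != y1)
        length += math.sqrt(2.0) if diagonal else 1.0
    return length


# A 4 x 4 square traversed counter-clockwise, and a short bond with one diagonal step.
area, center = polygon_area_and_center([(0, 0), (4, 0), (4, 4), (0, 4)])
length = bond_length([(0, 0), (1, 0), (2, 1), (3, 1)])
```

The square has area 16 and center (2, 2), and the bond length is 2 plus the square root of 2.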

After extracting topology and geometry for each frame, the parser infers for each cell whether it stays in the tissue, or whether it appears or disappears in going from one frame f to the next one f+1. Which of the three possibilities occurs can be directly inferred using the fact that each cell is assigned a unique color throughout all tracked-cell masks. If a cell is present in both tracked-cell masks, it is just staying within the tissue. If it is only present in frame f+1, it is appearing, which may happen for several reasons. For one, a cell may appear as a daughter cell of a division, which can be checked using the cell-division mask. Moreover, if a cell appears at the margin of the piece of tissue, it is declared as moving in via the margin. The same happens if an appearing cell is next to a cell that has already been declared as moving in via the margin. If none of these happened, the parser declares a segmentation error ('SegErrAppearance') as the reason for appearance.

If a cell is only present in frame f but not in f+1, it is disappearing, which may happen for several reasons, too. For one, the cell could be the mother cell of a division that occurs between frames f and f+1. This can be checked using the cell-division mask. Otherwise, if the cell is disappearing at the margin, the parser marks the cell as moving out via the margin. The same happens if the disappearing cell is next to a cell that has already been marked as moving out via the margin. Finally, every cell that disappears for neither of the two previous reasons is marked as undergoing an extrusion/apoptosis.
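The decision rules for appearing and disappearing cells can be summarized in a short Python sketch. It is a simplified illustration rather than the C++ parser: apart from 'SegErrAppearance', the reason labels, the function name, and the input sets (division flags, margin cells, neighbor lists) are assumptions made for this example.

```python
def classify(candidates, by_division, at_margin, neighbors, margin_label, fallback_label):
    """Assign a reason to each appearing (or disappearing) cell.

    candidates  : cells that appear (or disappear) between frames f and f+1
    by_division : candidates flagged by the cell-division mask
    at_margin   : candidates located at the tissue margin
    neighbors   : dict mapping a cell to the cells it touches
    """
    candidates = set(candidates)
    reasons = {c: "Division" for c in candidates if c in by_division}

    # Seed with margin cells, then propagate the margin label to adjacent candidates.
    frontier = [c for c in candidates if c not in reasons and c in at_margin]
    for c in frontier:
        reasons[c] = margin_label
    while frontier:
        current = frontier.pop()
        for n in neighbors.get(current, ()):
            if n in candidates and n not in reasons:
                reasons[n] = margin_label
                frontier.append(n)

    # Everything left gets the fallback reason.
    for c in candidates:
        reasons.setdefault(c, fallback_label)
    return reasons


cells_f = {1, 2, 3, 4, 5}
cells_f1 = {1, 2, 6, 7, 8, 9, 10}

appear = classify(
    candidates=cells_f1 - cells_f, by_division={6, 7}, at_margin={8},
    neighbors={8: [1, 9], 9: [8, 2], 10: [1, 2]},
    margin_label="MovesIntoMask", fallback_label="SegErrAppearance",
)
disappear = classify(
    candidates=cells_f - cells_f1, by_division={3}, at_margin={4},
    neighbors={4: [1, 2], 5: [1, 2]},
    margin_label="MovedOutOfMask", fallback_label="Apoptosis",
)
```

In this toy case cell 9 inherits the margin label from its neighbor 8, whereas cell 10 has no such neighbor and is flagged as a segmentation error.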

Implementing the TissueMiner relational database

The TissueMiner parser generates tables from which we build the TissueMiner relational database (TM-DB). To do so, we used the formalism developed in the Merise method (Tardieu et al., 2000), which includes the entity-relationship model (Peter Pin-Shan, 1976), the relational database theory (Codd, 1970, 1972) and Codd's normal forms (Codd, 1971, 1974); thus, it allows one to translate the conceptual data model into a relational database scheme.

We first establish the 'entity-relationship' scheme of the database to represent the information extracted with the parser in entities, and to establish relationships between and within entities. This conceptual approach defines the basic elements of the entity-relationship model (Peter Pin-Shan, 1976): the entity, the association, the cardinality and the identifier. Entities consist of objects (cells, bonds, vertices, frames) or concepts (cell_histories, directed_bonds) that can be uniquely identified. The association is a link that relates two entities. The identifier is an obligatory property of an entity and uniquely defines each occurrence of the entity. The cardinality reflects the minimum and maximum connections (functional dependencies) between the identifiers of two associated entities: [1,n] stands for one-to-many, [0,n] for none-to-many, [1,1] for one-to-one, and [0,1] for none-to-one. Hence, each association is assigned two cardinalities corresponding to the two possible directions of association between the two entities. For the sake of clarity, Figure 6A shows a simplified 'entity-relationship' scheme of the TM-DB without cardinalities. However, cardinalities are used in the Merise method to translate the conceptual scheme (Figure 6A) into the logical scheme shown in Figure 6—figure supplement 2A. We therefore show them in Table 1. The rules to translate a conceptual scheme into a logical one can be found in Tardieu et al. (2000). Below, we explain our conceptual scheme along with its translation into the logical scheme, which can be directly implemented using a chosen SQL language. When these rules are applied to our TM-DB, entities become physical tables in the logical scheme, and associations become table columns ('foreign keys' in blue) in related tables (Figure 6—figure supplement 2A). The foreign keys constitute a referential integrity constraint between tables.

The TM-DB consists of six entities (frames, cells, vertices, bonds, directed_bonds and cell_histories) that are linked by logical associations (Figure 6A). Their respective identifiers are underlined in the conceptual scheme (Figure 6A), and each becomes the 'primary key' placed in the table header in the logical scheme (Figure 6—figure supplement 2A). In the TM-DB, identifiers (frame, cell_id, vertex_id, bond_id, dbond_id) are numbers that we use to index the corresponding tables. Time and movie frames are contained in the frames entity. Geometrical information is contained in the cells, vertices and bonds entities. Topological information including cell neighbor relationships is represented in the directed_bonds entity. The cell ancestry is represented in the cell_histories entity.

Table 1.

Cardinalities per association.

DOI: http://dx.doi.org/10.7554/eLife.14334.031

| Entity A | Entity B | Association (A->B) | Cardinality A->B | Cardinality B->A |
| --- | --- | --- | --- | --- |
| cell_histories | cell_histories | to be daughter of | [0,1] | [0,n] |
| cells | cell_histories | to belong to | [1,1] | [1,n] |
| cells | frames | to exist in | [1,1] | [1,n] |
| directed_bonds | cells | to be part of | [1,1] | [1,n] |
| directed_bonds | directed_bonds | to be conjugated | [1,1] | [1,1] |
| directed_bonds | directed_bonds | to be next left | [1,1] | [1,1] |
| directed_bonds | frames | to exist in | [1,1] | [1,n] |
| directed_bonds | bonds | to be part of | [1,1] | [1,n] |
| directed_bonds | vertices | to stem from | [1,1] | [1,n] |
| vertices | frames | to exist in | [1,1] | [1,n] |
| bonds | frames | to exist in | [1,1] | [1,n] |

In order to relate a given cell to its lineage and intrinsic properties during the time evolution of the movie, we create specific associations within and between the cells and cell_histories entities. In the cell_histories entity, a cell is uniquely determined by a cell identifier (cell_id) that exists as long as the tracked cell does not die or divide. All cells are represented in this entity, which stores in which frame a given cell appears (first_occ) and disappears (last_occ), and why (appears_by and disappears_by). The cell ancestry is represented by the 'be daughter of' association that relates each dividing cell to its two daughters (left_daughter_cell_id and right_daughter_cell_id columns, Figure 6—figure supplement 2A). To relate a cell to the time evolution of its properties (center of mass, area, shape anisotropy, polarized protein distribution), we create an association between the cells and cell_histories entities, in which each entry is uniquely determined by the combination of cell_id and frame. As movies may be acquired at different frame rates, we also represent the real time evolution (in seconds) in the frames entity that we connect to the cells entity.

To represent the cell topology in the database, we create a directed_bonds entity along with a self-association 'be next left' that links each directed bond in each frame (dbond_id) to its next counter-clockwise follower (left_dbond_id column, Figure 6—figure supplement 2A). This stores the ordering of the directed bonds around each cell. To relate each cell with its neighbors in each frame, we define a 'be conjugated' self-association that links each directed bond to its corresponding conjugated bond (conj_dbond_id column, Figure 6—figure supplement 2A). To connect the topology to geometrical information, we first define an additional association ('be part of') that connects the cells to the directed_bonds entity. We then connect both entities to the frames entity by defining the association 'exist in', which matches the frame attribute (Figure 6—figure supplement 2A). Finally, we connect directed bonds to bonds and directed bonds to vertices by creating the associations 'be part of' and 'stem from', respectively (see vertex_id and bond_id columns, Figure 6—figure supplement 2A).
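To make the role of the two self-associations concrete, the following Python sketch builds a stripped-down directed_bonds table in an in-memory SQLite database and queries it. The column names dbond_id, conj_dbond_id, left_dbond_id, cell_id and frame are taken from the text; the reduced schema (bond_id and vertex_id are omitted), the assumption that dbond_id is unique across frames, the toy rows, and the helper functions are illustrative only and do not reproduce the actual TM-DB definition.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE directed_bonds (
    dbond_id      INTEGER PRIMARY KEY,
    frame         INTEGER NOT NULL,
    cell_id       INTEGER NOT NULL,
    conj_dbond_id INTEGER NOT NULL REFERENCES directed_bonds(dbond_id),
    left_dbond_id INTEGER NOT NULL REFERENCES directed_bonds(dbond_id)
);
""")

# Toy topology in frame 0: cell 10 touches cell 20 (dbonds 1/2) and cell 30 (dbonds 3/4).
rows = [
    (1, 0, 10, 2, 3),
    (2, 0, 20, 1, 2),
    (3, 0, 10, 4, 1),
    (4, 0, 30, 3, 4),
]
conn.executemany("INSERT INTO directed_bonds VALUES (?, ?, ?, ?, ?)", rows)


def neighbors_of(conn, cell_id, frame):
    """Neighbors of a cell: the cells owning the conjugated directed bonds."""
    query = """
        SELECT DISTINCT conj.cell_id
        FROM directed_bonds AS db
        JOIN directed_bonds AS conj ON db.conj_dbond_id = conj.dbond_id
        WHERE db.cell_id = ? AND db.frame = ?
        ORDER BY conj.cell_id
    """
    return [row[0] for row in conn.execute(query, (cell_id, frame))]


def dbonds_around(conn, start_dbond_id):
    """Follow left_dbond_id links to list the directed bonds around one cell, in order."""
    order, current = [], start_dbond_id
    while current not in order:
        order.append(current)
        current = conn.execute(
            "SELECT left_dbond_id FROM directed_bonds WHERE dbond_id = ?", (current,)
        ).fetchone()[0]
    return order


neighbors = neighbors_of(conn, cell_id=10, frame=0)
ordering = dbonds_around(conn, start_dbond_id=1)
```

The self-join on conj_dbond_id returns cells 20 and 30 as neighbors of cell 10, and walking the left_dbond_id links lists the directed bonds 1 and 3 that surround cell 10.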

The TM-DB follows the first three normal forms established by Codd (1971, 1974). The first normal form ensures that all entity properties are mono-valued and non-divisible, and that at least one of them is the identifier, which semantically determines all other properties of the entity. The second normal form adds constraints on the identifiers: if an identifier is composed of multiple properties (see cells entity), the other properties must be determined by the whole identifier and not by only part of it. The third normal form stipulates that a property is not allowed to be determined by an existing property that is not an identifier. In the conceptual scheme, these three normal forms ensure that the identifier uniquely defines each property of the entity. They also ensure that entity properties are entirely determined by the sole identifier. This helps clarify the notion of entities and their content when creating the data model. It also helps reduce redundancy in the database.

The logical scheme of the TM-DB is implemented using the SQLite management system (Jay, 2010). We chose SQLite for its ease of use: there is no need to install a dedicated server and the DB is stored in a single file that is easily shared with collaborators. The source code is accessible in a GitHub repository (see Box 1).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • European Research Council to Raphaël Etournay, Natalie A Dye, Suzanne Eaton.

  • Seventh Framework Programme to Raphaël Etournay.

  • Max-Planck-Gesellschaft to Matthias Merkel, Marko Popović, Guillaume Salbreux, Frank Jülicher.

  • Bundesministerium für Bildung und Forschung to Matthias Merkel, Marko Popović, Suzanne Eaton, Frank Jülicher.

  • Alfred P. Sloan Foundation to Matthias Merkel.

  • Gordon and Betty Moore Foundation to Matthias Merkel.

  • National Science Foundation NSF-DMR-1352184 to Matthias Merkel.

Additional information

Competing interests

FJ: Reviewing editor, eLife.

SE: Reviewing editor, eLife.

The other authors declare that no competing interests exist.

Author contributions

RE, Participated in regular group discussions to develop the ideas presented, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

MM, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

MP, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

HB, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

NAD, Extensively tested TissueMiner and suggested key improvements for TissueMiner, Participated in regular group discussions to develop the ideas presented, Drafting or revising the article, Contributed unpublished essential data or reagents.

BA, Developed key image processing and image analysis methods in TissueAnalyzer, Drafting or revising the article, Contributed unpublished essential data or reagents.

GS, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data.

SE, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

FJ, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

References

  1. Aigouy B, Farhadifar R, Staple DB, Sagner A, Röper JC, Jülicher F, Eaton S. Cell flow reorients the axis of planar polarity in the wing epithelium of Drosophila. Cell. 2010;142:773–786. doi: 10.1016/j.cell.2010.07.042.
  2. Aigouy B, Umetsu D, Eaton S. Drosophila: Methods and Protocols. 2016. Segmentation and quantitative analysis of epithelial tissues.
  3. Barbier de Reuille P, Routier-Kierzkowska AL, Kierzkowski D, Bassel GW, Schüpbach T, Tauriello G, Bajpai N, Strauss S, Weber A, Kiss A, Burian A, Hofhuis H, Sapala A, Lipowczan M, Heimlicher MB, Robinson S, Bayer EM, Basler K, Koumoutsakos P, Roeder AH, Aegerter-Wilmsen T, Nakayama N, Tsiantis M, Hay A, Kwiatkowska D, Xenarios I, Kuhlemeier C, Smith RS. MorphoGraphX: A platform for quantifying morphogenesis in 4D. eLife. 2015;4:e05864. doi: 10.7554/eLife.05864.
  4. Blair SS. Wing vein patterning in Drosophila and the analysis of intercellular signaling. Annual Review of Cell and Developmental Biology. 2007;23:293–319. doi: 10.1146/annurev.cellbio.23.090506.123606.
  5. Blanchard GB, Kabla AJ, Schultz NL, Butler LC, Sanson B, Gorfinkiel N, Mahadevan L, Adams RJ. Tissue tectonics: morphogenetic strain rates, cell shape change and intercalation. Nature Methods. 2009;6:458–464. doi: 10.1038/nmeth.1327.
  6. Bosveld F, Bonnet I, Guirao B, Tlili S, Wang Z, Petitalot A, Marchand R, Bardet PL, Marcq P, Graner F, Bellaïche Y. Mechanical control of morphogenesis by Fat/Dachsous/Four-jointed planar cell polarity pathway. Science. 2012;336:724–727. doi: 10.1126/science.1221071.
  7. Cilla R, Mechery V, Hernandez de Madrid B, Del Signore S, Dotu I, Hatini V. Segmentation and tracking of adherens junctions in 3D for the analysis of epithelial tissue morphogenesis. PLOS Computational Biology. 2015;11:e1004124. doi: 10.1371/journal.pcbi.1004124.
  8. Classen AK, Anderson KI, Marois E, Eaton S. Hexagonal packing of Drosophila wing epithelial cells by the planar cell polarity pathway. Developmental Cell. 2005;9:805–817. doi: 10.1016/j.devcel.2005.10.016.
  9. Codd EF. A relational model of data for large shared data banks. Communications of the ACM. 1970;13:377–387. doi: 10.1145/362384.362685.
  10. Codd EF. International Business Machines Corporation; 1971. Further Normalization of the Data Base Relational Model.
  11. Codd EF. International Business Machines Corporation; 1972. Relational Completeness of Data Base Sublanguages.
  12. Codd EF. International Business Machines Corporation; 1974. Recent Investigations in Relational Data Base Systems.
  13. Etournay R, Popović M, Merkel M, Nandi A, Blasse C, Aigouy B, Brandl H, Myers G, Salbreux G, Jülicher F, Eaton S. Interplay of cell dynamics and epithelial tension during morphogenesis of the Drosophila pupal wing. eLife. 2015;4:e07090. doi: 10.7554/eLife.07090.
  14. Fletcher AG, Osterfield M, Baker RE, Shvartsman SY. Vertex models of epithelial morphogenesis. Biophysical Journal. 2014;106:2291–2304. doi: 10.1016/j.bpj.2013.11.4498.
  15. Francois R, Wickham H. Dplyr: A Grammar of Data Manipulation. 2015.
  16. García-Bellido A, Cortés F, Milán M. Cell interactions in the control of size in Drosophila wings. Proceedings of the National Academy of Sciences of the United States of America. 1994;91:10222–10226. doi: 10.1073/pnas.91.21.10222.
  17. Guirao B, Rigaud SU, Bosveld F, Bailles A, López-Gay J, Ishihara S, Sugimura K, Graner F, Bellaïche Y. Unified quantitative characterization of epithelial tissue development. eLife. 2015;4:e08519. doi: 10.7554/eLife.08519.
  18. Heller D, Hoppe A, Restrepo S, Gatti L, Tournier AL, Tapon N, Basler K, Mao Y. EpiTools: An open-source image analysis toolkit for quantifying epithelial growth dynamics. Developmental Cell. 2016;36:103–119. doi: 10.1016/j.devcel.2015.12.012.
  19. Huang ZP, Zhou H, He HL, Chen CL, Liang D, Qu LH. Genome-wide analyses of two families of snoRNA genes from Drosophila melanogaster, demonstrating the extensive utilization of introns for coding of snoRNAs. RNA. 2005;11:1303–1316. doi: 10.1261/rna.2380905.
  20. Jay AK. Using SQLite. O'Reilly Media; 2010. p. 528.
  21. Kachalo S, Naveed H, Cao Y, Zhao J, Liang J. Mechanical model of geometric cell and topological algorithm for cell dynamics from single-cell to formation of monolayered tissues with pattern. PLOS ONE. 2015;10:e0126484. doi: 10.1371/journal.pone.0126484.
  22. Köster J, Rahmann S. Snakemake--a scalable bioinformatics workflow engine. Bioinformatics. 2012;28:2520–2522. doi: 10.1093/bioinformatics/bts480.
  23. Merkel M, Etournay R, Popović M, Salbreux G, Eaton S, Jülicher F. Triangles Bridge the Scales: Quantifying Cellular Contributions to Tissue Deformation. 2016.
  24. Merkel M. From cells to tissues: Remodeling and polarity reorientation in epithelial tissues. Ph.D. Thesis. Technische Universität Dresden; 2014.
  25. Milán M, Campuzano S, García-Bellido A. Cell cycling and patterned cell proliferation in the Drosophila wing during metamorphosis. Proceedings of the National Academy of Sciences of the United States of America. 1996;93:11687–11692. doi: 10.1073/pnas.93.21.11687.
  26. Mosaliganti KR, Noche RR, Xiong F, Swinburne IA, Megason SG. ACME: automated cell morphology extractor for comprehensive reconstruction of cell membranes. PLoS Computational Biology. 2012;8:e1002780. doi: 10.1371/journal.pcbi.1002780.
  27. Nepusz T, Csardi G. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695.
  28. Nickoloff J. Docker in Action. Shelter Island, NY: Manning Publications; 2015.
  29. Peter Pin-Shan C. The entity-relationship model—toward a unified view of data. ACM Transactions on Database Systems. 1976;1:9–36. doi: 10.1145/320434.320440.
  30. Sagner A, Merkel M, Aigouy B, Gaebel J, Brankatschk M, Jülicher F, Eaton S. Establishment of global patterns of planar polarity during growth of the Drosophila wing epithelium. Current Biology. 2012;22:1296–1301. doi: 10.1016/j.cub.2012.04.066.
  31. Schubiger M, Palka J. Changing spatial patterns of DNA replication in the developing wing of Drosophila. Developmental Biology. 1987;123:145–153. doi: 10.1016/0012-1606(87)90436-2.
  32. Tardieu H, Rochfeld A, Colletti R. La Méthode MERISE: Principes Et Outils. Editions d'Organisation; 2000.
  33. Walck-Shannon E, Hardin J. Cell intercalation from top to bottom. Nature Reviews Molecular Cell Biology. 2014;15:34–48. doi: 10.1038/nrm3723.
  34. Wickham H. Ggplot2: Elegant Graphics for Data Analysis. New York: Springer; 2009.
  35. Wiesmann V, Franz D, Held C, Münzenmayer C, Palmisano R, Wittenberg T. Review of free software tools for image analysis of fluorescence cell micrographs. Journal of Microscopy. 2015;257:39–53. doi: 10.1111/jmi.12184.
eLife. 2016 May 26;5:e14334. doi: 10.7554/eLife.14334.032

Decision letter

Editor: W James Nelson

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your work entitled "TissueMiner: a multiscale analysis toolkit to quantify how cellular processes create tissue dynamics" for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Naama Barkai as the Senior Editor.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

Large-scale tissue imaging is becoming an everyday tool in developmental biology, but high-throughput analysis methods are restricted to a few laboratories. Etournay and co-workers present a software platform for segmentation and tracking of 2D epithelial monolayers. The software provides a number of geometric outputs including deformation tensors and cell connectivity. As the tools within TissueMiner appear to be identical to those used by the authors previously (2015 eLife publication), the value of TissueMiner lies not in its novelty or proof-of-concept, but squarely in the likelihood of it being used by an average researcher. Both reviewers felt that the software and current tutorials, while expansive, were far from user-friendly in their current implementation, and that the tutorials assumed far more technical background than the average user would have. Nevertheless, both reviewers felt that TissueMiner, if improved as described below, would be an important advance and could be very useful to the community.

Some of the key problems encountered were:

1) Installation of the software is not user-friendly. Between the large size of the sample datasets (2+ GB each), and tedious GitHub interactions presented as a series of code fragments rather than a single script, it took one reviewer over an hour to get the sample data downloaded, and to deal with various docker virtual machine bugs. Even pulling the demo data down caused bugs due to permission errors that were not addressed in the tutorial.

2) Once docker was running, execution of the initial data processing step resulted in a dizzying array of analysis steps with no explanation of what was happening or how much time it would take.

3) Users must already have R installed, but this is not mentioned.

4) Preparing TissueMiner to run through RStudio should not require the user to copy-paste code fragments each time. A single script that does this should be included.

5) Tissue Analyzer – the preprocessing tool made by the same group and essentially required for use with Tissue Miner – works beautifully and has a series of video tutorials. This would be a good template for TissueMiner.

6) RStudio initialization script kept crashing. Apparently additional toolboxes are required, such as devtools.

Essential revisions:

In the context of the software, the reviewers specifically recommended that you:

1) Make the sample data much smaller in size (<2GB) so that the initial docker processing takes <5 min, or at least has a remaining time clock;

2) Make the software completely scripted, including all the initial importing in RStudio (no copy-pasting, just loading an RStudio file);

3) Prepare a new "First Use" R tutorial, skipping most of the R-101 training and just getting a user up and running with something simple, like plotting cell-area.

Then:

1) Conduct user testing with testers who truly have "no programming experience". Allow them to test installation unsupervised.

2) Create more streamlined, focused tutorials rather than an enormous master guide. TissueMiner tries to do everything at one time (e.g. the initial docker processing), and it is difficult to follow. Give the user the simplest data-set you can provide, and have them perform the simplest analysis (e.g. only focus on cell area, or tracking, or divisions).

3) The authors illustrate the capabilities of the software using the case of wing vein morphogenesis, but this is not a validation. To validate the software, the authors could use computer generated images of cell monolayers with known geometry, and demonstrate that the software is able to recover the different geometric parameters of each cell in the tissue.

eLife. 2016 May 26;5:e14334. doi: 10.7554/eLife.14334.033

Author response


Some of the key problems encountered were:

1) Installation of the software is not user-friendly. Between the large size of the sample datasets (2+ GB each), and tedious GitHub interactions presented as a series of code fragments rather than a single script, it took one reviewer over an hour to get the sample data downloaded, and to deal with various docker virtual machine bugs. Even pulling the demo data down caused bugs due to permission errors that were not addressed in the tutorial.

To address these problems, we have significantly simplified the installation procedure. For Ubuntu Linux, we now provide a single script install_tm.sh that fully installs TissueMiner. Instructions are found directly on the landing page of the TissueMiner GitHub repository: https://github.com/mpicbg-scicomp/tissue_miner#installation. For other systems, we provide a 2-step protocol for installing TissueMiner: first install the Docker Engine and second download the TissueMiner docker image. The installation procedure of this dockerized TissueMiner is explained in detail on the TissueMiner GitHub site: https://github.com/mpicbg-scicomp/tissue_miner#installation

The new installation procedure was successfully tested by two users with no programming experience on Ubuntu 14.04, Ubuntu 15.10, and on different versions of MacOSX (Mavericks, Yosemite and El Capitan).

We enhanced the former “R tutorial” and renamed it “TM R-User Manual”: https://mpicbg-scicomp.github.io/tissue_miner/user_manual/TM_R-UserManual.html

In addition, we now provide simplified “Quickstart Tutorials” that are designed to help new users quickly learn the procedures and capabilities of TissueMiner:

https://github.com/mpicbg-scicomp/tissue_miner#documentation

The TM R-User Manual and Quickstart Tutorials are now based on a demo dataset that is much smaller than the one originally provided (~35Mb of compressed data). Nonetheless, we still provide the large datasets for more sophisticated analysis or for those interested in pupal wing morphogenesis. These larger datasets are now compressed (<800Mb each) to facilitate download and are used in the last part of the documentation (see TM R-User Manual, section 3).

2) Once docker was running, execution of the initial data processing step resulted in a dizzying array of analysis steps with no explanation of what was happening or how much time it would take.

To improve clarity and speed, we streamlined the initial processing (removing many of the automatic video creations) and now only display messages that indicate the progression of the workflow. With these changes, the initial data processing steps in the automated workflow require less than 3min on the provided demo sample.

The new Quickstart Tutorials are designed to guide the user through the database creation as well as through specific types of analysis (cell area, elongation, etc.) by running the appropriate R scripts (a few seconds of run time for each script using the demo dataset). Their source code can be found on the GitHub repository:

https://github.com/mpicbg-scicomp/tissue_miner/tree/master/docs/quickstart/scripts

3) Users must already have R installed, but this is not mentioned.

On Ubuntu, our new installer script install_tm.sh solves this issue. On other systems, we provide a docker image of TissueMiner that doesn’t require additional R installation. Details about how to configure and use this dockerized TissueMiner for developing custom R scripts can be found in the TM R-User Manual (section 1.3.1):

https://mpicbg-scicomp.github.io/tissue_miner/user_manual/TM_R-UserManual.html

Optionally, MacOSX users may find it easier to install R, RStudio and the TissueMiner API independently of the docker image (just a matter of taste). We therefore provide an installer, install_tm_api.sh. Instructions are present in the TM R-User Manual (section 1.3.1).

4) Preparing TissueMiner to run through RStudio should not require the user to copy-paste code fragments each time. A single script that does this should be included.

We now provide a single script entitled analyze_movie.R that combines all individual quickstart tutorial scripts. This script is present on the GitHub repository: https://github.com/mpicbg-scicomp/tissue_miner/tree/master/docs/quickstart/scripts

Instructions about how to run this script can be found in the Quickstart R-Tutorials.

Additional explanations and code fragments for advanced users of TM have been moved to the TM R-User Manual.

5) Tissue Analyzer – the preprocessing tool made by the same group and essentially required for use with Tissue Miner – works beautifully and has a series of video tutorials. This would be a good template for TissueMiner.

Because TissueMiner is based on command line input, rather than graphical input as in TissueAnalyzer, we believe video tutorials are not as useful. Instead we designed the web-based tutorials: Quickstart TM-R Tutorials for Ubuntu and Quickstart docker-TM-R Tutorials for other systems. These tutorials guide even complete novices through the use of TissueMiner by providing not just instructions but commands that can be directly copy-pasted into the terminal to run the analyses:

https://github.com/mpicbg-scicomp/tissue_miner#documentation

We provide video tutorials for the ImageJ macros in TissueMiner (for drawing regions of interest and orienting the tissue).

6) RStudio initialization script kept crashing. Apparently additional toolboxes are required, such as devtools.

As mentioned in point 3), the new installation procedure resolves this problem.

Essential revisions:

In the context of the software, the reviewers specifically recommended that you:

1) Make the sample data much smaller in size (<2GB) so that the initial docker processing takes <5 min, or at least has a remaining time clock;

As explained above, we now provide a smaller sample dataset (a cropped region of the pupal wing, ~35Mb in size) to guide new users through the Quickstart Tutorials. Furthermore, the user can now select specific steps of the analysis to run. For example, it may not be necessary or desired at the beginning of the tutorial to create videos of patterned cell behaviors. For this example dataset, a simplified initial analysis workflow, including database creation, cell neighbor change detection and shear calculation, requires less than 3 min on a single core (2.7 GHz Intel Core i5; 2Gb of RAM). Subsequent analysis steps, such as video creation, can be run on demand by selecting the corresponding snakemake rules (see Figure 7 and the TM R-User Manual).

The R programming language doesn’t easily allow one to systematically provide a progress bar. Nonetheless, we estimated the processing time required for three datasets of increasing size (excluding extra video creation):

Dataset   Compressed size (Mb)   True size (Gb)   Number of cell-cell contacts   Number of cell contours   Number of cell lineages   Number of ROIs   Run time
demo      31                     0.1              ~200 000                       ~68 000                   ~1200                     3                2min48sec
WT_3      563                    2.1              ~1 450 000                     ~487 000                  ~8600                     6                17min00sec
WT_1      786                    2.5              ~1 610 000                     ~540 000                  ~9400                     6                18min03sec

The size of WT_2 dataset is similar to the size of WT_1.

Running these analyses on multicore computers would significantly reduce these times.
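As a rough reading aid for the run-time table, the figures can be converted into an approximate single-core throughput. The sketch below copies the approximate cell-contour counts and run times from the table. The function and variable names are hypothetical and are not part of TissueMiner. Because the run time also covers database creation, neighbor change detection and shear calculation, the result is only a crude summary.

```python
# Approximate figures copied from the run-time table in the author response.
# Run times are converted to seconds: 2min48sec, 17min00sec, 18min03sec.
# All names below are illustrative and not part of TissueMiner.
DATASETS = {
    "demo": {"contours": 68_000, "seconds": 2 * 60 + 48},
    "WT_3": {"contours": 487_000, "seconds": 17 * 60 + 0},
    "WT_1": {"contours": 540_000, "seconds": 18 * 60 + 3},
}


def contours_per_second(entry):
    """Crude throughput: cell contours processed per second of run time."""
    return entry["contours"] / entry["seconds"]


def throughput_table(datasets):
    """Map each dataset name to its approximate throughput."""
    return {name: contours_per_second(entry) for name, entry in datasets.items()}


if __name__ == "__main__":
    for name, rate in throughput_table(DATASETS).items():
        print(f"{name}: about {rate:.0f} cell contours per second")
```

With these numbers, all three datasets fall in the range of roughly 400 to 500 cell contours per second, so on a single core the run time grows approximately in proportion to the number of cell contours.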

2) Make the software completely scripted, including all the initial importing in RStudio (no copy-pasting, just loading an RStudio file);

The full analysis of single movies is now completely encoded in a single script, analyze_movie.R. This script is explained in the Quickstart Tutorials and can either be used as is or custom edited in Rstudio.

Comparisons between multiple movies require adaptation to the specific tissue being analyzed. Therefore, we provide a single template script, compare_multiple_movies.R, which requires the user to enter the location of the movies and/or ROIs to compare.

3) Prepare a new "First Use" R tutorial, skipping most of the R-101 training and just getting a user up and running with something simple, like plotting cell-area.

As requested, we developed “First Use” Quickstart Tutorials that quickly introduce novices to the capabilities and procedures of TissueMiner.

To help users that are interested in further developing TissueMiner in R for more sophisticated and customized analyses, we still provide some information about R, including the grammar of data manipulation and graphics, in a dedicated tutorial referenced in the TM R-User Manual: https://mpicbg-scicomp.github.io/tissue_miner/user_manual/Learning_the_R_basics_for_TissueMiner.html

Then:

1) Conduct user testing with testers who truly have "no programming experience". Allow them to test installation unsupervised.

Two testers with "no programming experience" managed to install and run the Quickstart Tutorials on both Ubuntu and MacOSX using the updated installation procedures.

We also provide two additional tutorials to help users get started with the analysis of their own data. For Ubuntu:

https://github.com/mpicbg-scicomp/tissue_miner/blob/gh-pages/quickstart_tutorial/ubuntu/tm_qs_user_data.md#first-use-of-tissueminer-with-your-own-data

For other systems:

https://github.com/mpicbg-scicomp/tissue_miner/blob/gh-pages/quickstart_tutorial/other_os/tm_qs_user_data.md#first-use-of-tissueminer-with-your-own-data

2) Create more streamlined, focused tutorials rather than an enormous master guide. TissueMiner tries to do everything at one time (e.g. the initial docker processing), and it is difficult to follow. Give the user the simplest data-set you can provide, and have them perform the simplest analysis (e.g. only focus on cell area, or tracking, or divisions).

The new Quickstart Tutorials were designed to address this point. We also created the TM R-User Manual in html format, including a table of contents and links to each type of analysis, for those with more experience with R.

3) The authors illustrate the capabilities of the software using the case of wing vein morphogenesis, but this is not a validation. To validate the software, the authors could use computer generated images of cell monolayers with known geometry, and demonstrate that the software is able to recover the different geometric parameters of each cell in the tissue.

We produced two computer-generated movies of tissues with known geometry and deformation. One dataset corresponds to hexagonal cells undergoing homogeneous isotropic expansion at an imposed rate, and the other corresponds to hexagonal cells undergoing homogeneous pure shear at an imposed rate. We segmented and tracked both movies and performed the entire TissueMiner deformation analysis. We found that TissueMiner correctly recapitulates the imposed deformations and their cellular contributions. Moreover, we found for these movies that the expected deviations due to pixelation are two orders of magnitude smaller than the deformations that we aim to measure.

We added a subsection entitled “Validation of tissue deformation measurements using computer-generated cells” – where we discuss all of these points.
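The logic of this validation, imposing a known deformation and checking that it is recovered, can be illustrated with a toy calculation. The sketch below is not TissueMiner's method, which works on segmented and tracked cell contours; it assumes idealized cell centers on a hexagonal packing, applies a homogeneous deformation, and recovers it by a least-squares fit. All function names are hypothetical, pixelation is not modeled, and the split into isotropic and pure shear parts is the linearized (small-strain) one.

```python
import math


def hexagonal_cell_centers(n_rows, n_cols, spacing=1.0):
    """Centers of hexagonally packed cells (a triangular lattice of points)."""
    return [
        (spacing * (c + 0.5 * (r % 2)), spacing * r * math.sqrt(3) / 2)
        for r in range(n_rows)
        for c in range(n_cols)
    ]


def apply_deformation(points, F):
    """Map each point x to F x for a 2x2 matrix F = ((a, b), (c, d))."""
    (a, b), (c, d) = F
    return [(a * x + b * y, c * x + d * y) for x, y in points]


def centered(points):
    """Subtract the centroid so that translations do not affect the fit."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    return [(x - mx, y - my) for x, y in points]


def fit_deformation(before, after):
    """Least-squares 2x2 matrix F minimizing sum |after - F before|^2."""
    X, Y = centered(before), centered(after)
    # B = sum x x^T (symmetric second-moment matrix of the initial positions)
    sxx = sum(x * x for x, _ in X)
    sxy = sum(x * y for x, y in X)
    syy = sum(y * y for _, y in X)
    # A = sum y x^T (cross-moment matrix), A[i][j] = sum Y_i * X_j
    A = [[sum(v[i] * u[j] for u, v in zip(X, Y)) for j in range(2)] for i in range(2)]
    det = sxx * syy - sxy * sxy
    B_inv = ((syy / det, -sxy / det), (-sxy / det, sxx / det))
    # Normal equations give F = A B^{-1}
    return tuple(
        tuple(sum(A[i][k] * B_inv[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def split_strain(F):
    """Linearized split: (isotropic part, pure shear xx, pure shear xy)."""
    isotropic = 0.5 * (F[0][0] + F[1][1]) - 1.0
    shear_xx = 0.5 * (F[0][0] - F[1][1])
    shear_xy = 0.5 * (F[0][1] + F[1][0])
    return isotropic, shear_xx, shear_xy


if __name__ == "__main__":
    cells = hexagonal_cell_centers(6, 6)
    imposed = {
        "isotropic expansion": ((1.02, 0.0), (0.0, 1.02)),
        "pure shear": ((1.05, 0.0), (0.0, 0.95)),
    }
    for label, F in imposed.items():
        recovered = fit_deformation(cells, apply_deformation(cells, F))
        print(label, split_strain(recovered))
```

In this idealized setting the fit returns the imposed deformation up to floating-point error. The deviations reported in the response arise from the pixelation of real segmented images, which this sketch deliberately leaves out.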


Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd
