Skip to main content
eLife logoLink to eLife
. 2019 Feb 26;8:e40162. doi: 10.7554/eLife.40162

Motion sensing superpixels (MOSES) is a systematic computational framework to quantify and discover cellular motion phenotypes

Felix Y Zhou 1,†,, Carlos Ruiz-Puig 1,, Richard P Owen 1, Michael J White 1, Jens Rittscher 1,2,3,, Xin Lu 1,
Editors: Hamid Mohammadi4, Didier Y Stainier5
PMCID: PMC6391079  PMID: 30803483

Abstract

Correct cell/cell interactions and motion dynamics are fundamental in tissue homeostasis, and defects in these cellular processes cause diseases. Therefore, there is strong interest in identifying factors, including drug candidates that affect cell/cell interactions and motion dynamics. However, existing quantitative tools for systematically interrogating complex motion phenotypes in timelapse datasets are limited. We present Motion Sensing Superpixels (MOSES), a computational framework that measures and characterises biological motion with a unique superpixel ‘mesh’ formulation. Using published datasets, MOSES demonstrates single-cell tracking capability and more advanced population quantification than Particle Image Velocimetry approaches. From > 190 co-culture videos, MOSES motion-mapped the interactions between human esophageal squamous epithelial and columnar cells mimicking the esophageal squamous-columnar junction, a site where Barrett’s esophagus and esophageal adenocarcinoma often arise clinically. MOSES is a powerful tool that will facilitate unbiased, systematic analysis of cellular dynamics from high-content time-lapse imaging screens with little prior knowledge and few assumptions.

Research organism: Human

Introduction

During tissue development and homeostasis in multicellular organisms, different cell types expand and migrate to form defined organ structures. For example, during wound-healing, both immune and epithelial cells are required to proliferate and coordinately migrate (Schaffer and Nanney, 1996; Clark, 2013; Leoni et al., 2015). Aberrant cellular motion can be caused by deregulation of key signalling pathways in pathological conditions, including cancer, and may be critical for disease development and progression, for example leading to invasion and metastasis. Therefore, there is strong biological and clinical need for precise, quantitative characterisation of cellular motion behaviour in a tissue-relevant context to objectively compare the effects of drugs and genetic mutations.

When two cell populations adjoin in vivo, they often form a sharp, stable interface termed a ‘boundary’, with limited intermingling (Dahmann et al., 2011). In adult humans, sharp boundaries separate different types of epithelia, such as between the squamous and columnar epithelia in the esophagus, cervix and anus. Disruption of these boundaries can lead to disease. For example, disruption of the esophageal squamous columnar epithelial boundary is a feature of Barrett’s Esophagus (BE), a condition that confers a 30–50 fold increased risk of esophageal adenocarcinoma (EAC) (Gaddam et al., 2013). Understanding how complex tissue motion dynamics relate to pathological phenotypes, and how they can be affected by intrinsic and extrinsic factors, is therefore a key challenge in biomedical research. Live cell imaging over a long time (e.g. hours/days) is required to study the underlying complex cellular motion.

Recently, there has been a rising interest in scaling up live cell imaging for unbiased high-throughput screening and analysis to identify drug targets and develop therapies that can cause or prevent abnormal cellular motion (Schmitz et al., 2010; Held et al., 2010; Pau et al., 2013). Accordingly, a diverse range of co-culture experimental assays have been developed (Goers et al., 2014), amongst which wound-healing type assays are one of the simplest and most widely used. Attempts have been made to use these assays to study the regulation of complex biological/pathological processes such as the stable boundary formation between homo- and heterotypic cell populations using live cell imaging over a period of up to 6 days. However, a major barrier is how to analyse the resulting complex biological motion phenotypes and its variation between different tested conditions, particularly for multi-cell type populations (Goers et al., 2014) in high-content screens.

In general, there are two types of cell movements; ‘single-cell’ migration in which each cell migrates independently or ‘collective’ migration where a group of cells migrates together in a coordinated fashion. Single-cell migration has been well studied due to the availability of many single-cell tracking methods that can extract rich motion features even for unbiased high-throughput screens (Padfield et al., 2011; Meijering et al., 2012; Maška et al., 2014; Schiegg et al., 2015; Nketia et al., 2017). In contrast, collective migration is much less understood due to a lack of tools to extract equally rich quantitative motion features of cell populations in a high-throughput fashion with minimal prior knowledge.

Extension of existing single-cell tracking methods to analyse collective cell motion in general is non-trivial. Single-cell tracking methods all require accurate single-cell image segmentation, which is highly challenging when cells adjoin or overlap as they do frequently in tissue and confluent cell cultures. Moreover, there is a lack of systematic techniques to associate the motion of individual cells to the global moving collective. Alternatively, existing popular methods to analyse collective motion in cell populations such as Particle Image Velocimetry (PIV) (Szabó et al., 2006; Petitjean et al., 2010) and its variants (e.g. cell image velocimetry (CIV) (Milde et al., 2012)), do not require image segmentation. However, they only extract the global motion for a single video frame in the form of a velocity field, a velocity vector for each image pixel without assignment of pixels to individual cells. This lack of continuous tracking of cellular motion limits the application of PIV methods when characterising collective migration in complex biological phenomena such as boundary formation and chemoattraction. These biological processes often occur over a highly variable time period ranging from minutes to days, and identification of involved cell groups is often desired. Furthermore, PIV-extracted motion fields fail to exploit temporal continuity to reduce noise and avoid perturbations by imaging artefacts including visual clutter, image occlusion, and autofluorescence. Most prohibitively, PIV methods output only the velocities of image pixels. There is a lack of a systematic method to build motion ‘signatures’ to automatically and unbiasedly discriminate and predict biological motion phenotypes in large datasets. To this end, previous studies have attempted to extract additional motion parameters from PIV and appearance parameters from individual video frames (Neumann et al., 2006; Zaritsky et al., 2012; Zaritsky et al., 2014; Zaritsky et al., 2015; Zaritsky et al., 2017). These include applying velocity-based clustering in migrating monolayers, and handcrafted image features to describe local image appearance using, for example, local binary patterns (Zaritsky et al., 2012). However, these approaches are sensitive to the number of identified phenotypes and data outliers, requiring relatively high-quality imaging, restricting the experimental systems they can be applied to. Importantly, all the approaches (Neumann et al., 2006; Zaritsky et al., 2012; Zaritsky et al., 2014; Zaritsky et al., 2015; Zaritsky et al., 2017) assume prior knowledge of the motion behaviour. Notably, kymographs specifically exploit motion patterns known to be symmetrical around a given spatial axis such as wound healing and cannot be used to identify novel spatial motion features that could be relevant to normal tissue development or to a defined disease setting.

A suitable computational method for studying cellular motion therefore should be able to address the joint challenge of analysing single-cell and collective motion behaviour. Additionally, it must be: (i) scalable in a medium- or high-throughput manner; (ii) sufficiently robust to handle inevitable variations in image acquisition and experimental protocol; (iii) sensitive to detect motion differences resulting from small changes in environment or stimuli with minimal replicates; (iv) automatic, requiring no manual intervention except for initial setting of parameters; and (v) unbiased to enable motion characterisation (e.g. as a motion ‘signature’) with minimal prior assumptions of motion behaviour.

To address these analytical challenges, here we developed Motion Sensing Superpixels (MOSES), a computational framework that aims to provide a flexible and general approach for biological motion extraction, characterisation and phenotyping. We empowered PIV-type methods with a mesh formulation that enables systematic measurement and unbiased extraction of rich motion features for single and collective cell motion suitable for high-throughput phenotypic screens. We use the analysis of a multi-well plate-based in vitro assay to study the complex cell population dynamics between different epithelial cell types from the esophageal squamous-columnar junction (SCJ) to demonstrate the potential of MOSES. Our analysis illustrates how MOSES can be used to effectively ‘encode’ complex dynamic patterns in the form of a motion ‘signature’, which would not be possible using standard globally extracted velocity-based measures from PIV. Finally, a side-by-side comparison with PIV analysis on published datasets illustrates the biological relevance and the advanced features of MOSES. In particular, MOSES can highlight novel motion phenotypes in high-content comparative biological video analysis.

Results

In vitro model to study the spatio-temporal dynamics of boundary formation between different cell populations

To develop MOSES, we chose to investigate in vitro the boundary formation dynamics between squamous and columnar epithelia at the esophageal squamous-columnar junction (SCJ) (Figure 1A). To recapitulate features of the in vivo boundary formation, we used three epithelial cell lines in pairwise combinations and an experimental model system with similar characteristics to wound-healing and migration assays but with additional complexity. Together the resulting videos pose a number of analytical challenges that require the development of a more advanced method beyond the current capabilities of PIV and CIV.

Figure 1. Temporary divider system to study interactions between cell populations.

(A) The squamous-columnar junction (SCJ) divides the stratified squamous epithelia of the esophagus and the columnar epithelia of the stomach. Barrett’s esophagus (BE) is characterised by squamous epithelia being replaced by columnar epithelial cells. The three cell lines derived from the indicated locations were used in the assays (EPC2, squamous esophagus epithelium, CP-A, Barrett’s esophagus and OE33, esophageal adenocarcinoma (EAC) cell line). (B) The three main epithelial interfaces that occur in BE to EAC progression. (C) Overview of the experimental procedure, described in steps 1–3. In our assay, cells were allowed to migrate and were filmed for 4–6 days after removal of the divider (step 4). (D) Cell density of red- vs green-dyed cells in the same culture, automatically counted from confocal images taken of fixed samples at 0, 1, 2, 3, and 4 days and co-plotted on the same axes. Each point is derived from a separate image. If a point lies on the identity line (black dashed), within the image, red- and green-dyed cells have the same cell density. (E,F) Top images: Snapshot at 96 h of three combinations of epithelial cell types, cultured in 0% or 5% serum as indicated. Bottom images: kymographs cut through the mid-height of the videos as marked by the dashed white line. All scale bars: 500 μm. (G) Displaced distance of the boundary following gap closure in (E,F) normalised by the image width. From left to right, n = 16, 16, 16, 17, 30, 17 videos.

Figure 1.

Figure 1—figure supplement 1. Automated cell counting with convolutional neural networks (CNN).

Figure 1—figure supplement 1.

(A) CNN training procedure. Image patches (64 × 64 pixels) are randomly subsampled from the large DAPI-stained images. The convolutional network is trained to transform a given DAPI image patch to a dot-like image such that the sum of all pixel intensities in the output dot-like image equals the number of cells in the DAPI image. During training, the ideal dot-like image is provided by manual annotation. (B) An example of a 64 × 64 pixels image patch of cells stained with DAPI (blue), with individual cells counted manually (left) or by automatic CNN counting (right). Red spots mark individual counted cells. (C) Plot of manually annotated cell counts vs automated cell counts tested on 64 × 64 image patches (n = 200). Each point is a patch. The mean absolute deviation (MAD) was 4.86 cells, a percentage error of 3.91% for an average of 124.3 cells per image. Pearson correlation coefficient, r = 0.992. Red dashed line is the ideal identity line. (D) Image segmentation of epithelial sheets coloured with red and green dyes, used for sheet-specific cell counting. Grey shaded area is the excluded image area. (E) Paired boxplots of cell density (number of cells/sheet area, plotted as x103 on y-axis) for each monolayer from fixed samples collected at different times (up to 4 days (96 h) after divider removal). In each pair, the left boxplot is for the red labelled cells and right for the green labelled cells, indicated by red or green dots, respectively. Each dot represents the value from a confocal image. Outline box colour indicates the cell line (see legend). All scale bars: 200 μm.
Figure 1—figure supplement 2. Cell migration is largely unaffected by dye colour.

Figure 1—figure supplement 2.

Mean squared displacement (MSD) curves computed from optical flow plotted on a log10-log10 axis as a function of the time interval for named same cell line combinations in 0% and 5% serum. (A–C) MSD curves coloured grey for phase-constrast non-dyed (A), red for red-dyed (B) and green for green-dyed cells (C). (D) Errorbar plots showing mean extracted exponent α from MSD = Δtα (Materials and methods) ± one standard deviation for time intervals 0Δt20 h, for the phase-contrast, red and green sheets of individual phase-contrast plus RGB channel videos. Each point is a video. Two sided t-test was applied assuming identical variances. * denotes p = < 0.05. Red and green lipophilic dye reduced the mobility of non-dyed EPC2 and OE33 cells with statistical significance. The size effect was particularly large in OE33, (bottom row marked with orange triangles) with a large drop in the MSD exponent. We do not know the cause but it may be a result of the dyes affecting each cell line differently.
Figure 1—figure supplement 3. Motion fields at gap closure between EPC2:EPC2, EPC2:CP-A and EPC2:OE33 cell line combinations.

Figure 1—figure supplement 3.

(A) Snapshot of the combined motion field of red (R) and green (G) dyed cells at the frame of gap closure (16 h, 14 h and 12 h for EPC2:EPC2, EPC2:CP-A, EPC2:OE33 respectively) coloured by direction of movement according to the colour wheel for representative videos. (B) Snapshots of the EPC2:OE33 motion field taken at different times following gap closure. Yellow triangles indicate identified motion in ‘green’ cells where the dye fluorescence is nearly lost. (C) Confocal staining of a fixed EPC2:OE33 sample at 72 h. DAPI stains the DNA; E-cadherin marks the adherens junctions; actin (β-actin) marks the actin-cytoskeleton; and K7 (Keratin 7) marks the OE33 cells of columnar epithelium origin. All scale bars: 200 μm.
Figure 1—figure supplement 4. Gap closure times and cell proliferation of cell line combinations in 0% and 5% serum.

Figure 1—figure supplement 4.

(A). Boxplots showing median and interquartile range (IQR) of gap closure in EPC2:EPC2, EPC2:CP-A and EPC2:OE33 combinations. Whiskers indicate data within 1.5 x IQR of lower and upper quartiles. Black dashed line = global mean gap closure time. (B) Root mean squared displacement (RMSD) curves of x-direction velocity components in 0% and 5% serum conditions with mean gap closure point marked (black dashed line) with ± 5 h either side (red dashed lines), the accuracy to which we could detect the gap closure frame (Figure 3—figure supplement 9). (C,D) Grouped boxplots of mean cell density (C) and cell density change (D) from automatic cell counting using convolutional neural networks. Cell line combinations are written cell line 1:cell line 2 thus EPC2:CP-A indicates cell line 1 = EPC2, cell line 2 = CP-A. Cell line 1 and cell line 2 are coloured green and yellow in the boxplots, respectively.
Figure 1—figure supplement 5. Collective sheet migration dynamics are lost in 0% serum.

Figure 1—figure supplement 5.

(A) Samples grown in 0% or 5% serum were fixed for staining at 24 h when the gap between the two sheets have just closed, and at 144 h at the end of filming. DAPI (blue) marks the cell nucleus. E-cadherin (green) marks the adherens junctions. Actin (β-actin, yellow) marks the actin-cytoskeleton. K7 (Keratin 7, red) is a specific marker of columnar cells. All scale bars: 20 μm. (B) Spatial correlation curves computed from superpixel tracks as a function of superpixel distance (Materials and methods). Black dots = computed values. Dashed black line = fitted line to black dots of the form, y=ae-x/b. Green solid line = median line computed from the black dots. Shaded region = ± 2 standard deviations of the green median line. (C) Plot of the extracted values a vs b, where a is the correlation with superpixels 1 superpixel away and b the characteristic number of superpixels away for which motion is correlated. Each video is a point, see legend for colour code. The higher points are on the plot, the greater the collective motion. Black solid line is the support vector of a linear support vector machine (SVM) trained to separate 0% and 5% serum according to the values of a and b. Dashed black lines mark the SVM margin. Serum separability, the ability to predict if a video contains 0% or 5% serum, is defined as the training SVM accuracy using the whole dataset, n = 112 videos (n = 16 each for EPC2:EPC2, EPC2:CP-A, EPC2:OE33 in 0% serum and n = 17 EPC2:EPC2, 30 EPC2:CP-A and 17 EPC2:EPC2 in 5% serum).

To model the relevant esophageal interfaces, we used three epithelial cell lines: EPC2, an immortalised squamous epithelial cell line from the normal esophagus (Harada et al., 2003); CP-A, an immortalised BE cell line with gastric columnar epithelial properties (Merlo et al., 2011); and OE33, derived from EAC (Boonstra et al., 2010). We co-cultured these lines in the following combinations: 1) EPC2:EPC2 (squamous:squamous, as a normal control); 2) EPC2:CP-A (squamous:columnar, as in Barrett’s esophagus); and 3) EPC2:OE33 (squamous:cancer, as in EAC) (Figure 1B). Two epithelial cell populations, each labelled with a different lipophilic membrane dye (Progatzky et al., 2013), were co-cultured in the same well of a 24-well plate, separated by a divider with width 500 µm. The divider was removed after 12 h and cells were allowed to migrate towards each other (Figure 1C).

We first compared the effects of the two dyes on proliferation and migration using monolayer combinations of the three cell types (Video 1). Proliferation was assessed by cell density, automatically counted from DAPI staining using convolutional neural networks (CNN; see Figure 1—figure supplement 1, Materials and methods). Migration (diffusive behaviour) was assessed by mean squared displacement (Park et al., 2015). Green and red fluorescent-labelled EPC2, CP-A, and OE33 cells proliferated at the same rate (Figure 1D, for EPC2 cells: slope = 0.976, Pearson correlation coefficient 0.978, Figure 1—figure supplement 1E), and migrated in a similar way (Figure 1—figure supplement 2). Compared to non-dyed cells, the mobility of CP-A cells were unaffected however both dyes equally reduced the mobility of EPC2 and OE33 cells. The diffusion modes of all cells were unaffected (Figure 1—figure supplement 2) and the boundary formation behaviour between dyed and non-dyed cells was identical.

Video 1. Dynamics of monolayer combinations of three oesophageal epithelial cell lines, EPC2:EPC2, CP-A:CP-A, OE33:OE33.

Download video file (6.1MB, mp4)
DOI: 10.7554/eLife.40162.008

Bar: 500 μm.

We next analysed boundary formation in the three different combinations of co-cultured epithelial cell lines (EPC2:EPC2, EPC2:CP-A, and EPC2:OE33), initially in serum-containing media (5% fetal bovine serum, FBS). In all three combinations, as expected, both populations moved as epithelial sheets (Video 2). Firstly, in the squamous combination using the same EPC2 cell line labelled with two different colours, the green- or red-labelled EPC2:EPC2 cells met and coalesced into a monolayer as expected. Secondly, in the squamous-columnar EPC2:CP-A combination, we observed boundary formation between the two populations after 72 h, following a short period of CP-A ‘pushing’ EPC2. Thirdly, in the squamous-cancer EPC2:OE33 combination, the cancer cell line OE33 expanded continuously, resulting in the disappearance of EPC2 from the field of view (Video 2, Figure 1E) as assessed by the motion field and confocal images (Figure 1—figure supplement 3). The forces that govern the behaviour of the two cell lines on contact are unknown and traction force microscopy is required to investigate the ‘retracting’ or ‘pushing’ behaviour of EPC2 or OE33 cells respectively in future studies. Nonetheless, the observed boundary formation in the EPC2:CP-A combination and disordered interactions between EPC2:OE33 cells suggest that the three chosen epithelial cell lines in the tested combinations may model certain motion dynamics of the in vivo cell-cell interactions. Interestingly, cell counting suggests the observed phenotypes were not due to differential proliferation rate between the individual cell lines in combination (Figure 1—figure supplement 4C,D).

Video 2. Sheet migration in EPC2:EPC2, EPC2:CP-A and EPC2:OE33 in 0% and 5% FBS.

Download video file (3.6MB, mp4)
DOI: 10.7554/eLife.40162.009

No boundary forms in 0% fetal bovine serum (FBS) where all combinations move similarly. A stable boundary forms between EPC2 and CP-A in 5% FBS. Film duration: 96 h. Bar: 500 μm. Videos were contrast enhanced for better visualisation.

Evidence from model systems, including Drosophila melanogaster embryonic parasegment (Monier et al., 2010) and anteroposterior and dorsoventral wing disc boundaries (Major and Irvine, 2005; Major and Irvine, 2006; Landsberg et al., 2009), suggests the importance of cell-cell interactions and collective migration for stable boundary formation between epithelial populations in vivo. Thus to create a dataset in which we predict that cell motion would be dramatically disrupted we repeated the assays in serum-free conditions. We used convolutional neural networks (Materials and methods) to determine the mean cell density and mean change in cell density from video frames over the first 48 h of cells grown in serum (5% FBS) and no serum (0% FBS) conditions. We observed that culturing of the same cell combinations in serum-free versus 5% serum medium had undetectable impact on cell density at the very confluent cell conditions investigated (Figure 1—figure supplement 4). However, as expected, serum-free condition induced large global changes in cell migration (Figure 1F, Figure 1—figure supplement 5A) with observed loss of cell contacts, collective sheet migration (Figure 1—figure supplement 5B,C) and the absence of boundary formation. We also observed reduced overall boundary displacement (Figure 1F,G) and all cell line combinations exhibited similar motion dynamics (Figure 1F,G, Video 2). These results illustrate that serum-free medium has a profound impact on cell motion dynamics and the generated video datasets in serum-free medium are ideal as an experimental condition for testing the ability of MOSES to detect motion dynamics under different experimental conditions. It is important to note that the serum-free condition is not used as a biological negative control of boundary formation but only as a computational negative control for the development of our method.

Development of MOSES to quantify cell motion dynamics

We next developed a computational workflow, MOtion SEnsing Superpixels (MOSES), to quantify cellular motion from video datasets. MOSES was formulated modularly with three components (Figure 2): 1) motion extraction; 2) construction of long-time motion tracks; and 3) capture of local dynamic context.

Figure 2. Schematic diagram of MOtion SEnsing Superpixels (MOSES).

(A) High level overview of the three configurable components that define MOSES: motion extraction, construction of long-time motion tracks, and capture of local dynamic context. Long-time here indicates tracking of superpixels for longer than one frame or timepoint. (B) Example ways to practically implement each of the three high-level MOSES component concepts described in (A) in software. (C) Using long-time tracking of superpixels MOSES bridges single-cell tracking for sparse cell culture and PIV for confluent monolayers to extract global motion patterns for both scenarios, single-cell and collective motion within one computational framework.

Figure 2.

Figure 2—figure supplement 1. Intensity-independent filtering of superpixel tracks for migrating epithelial sheets.

Figure 2—figure supplement 1.

(A) Graphic overview of the key steps in the algorithm, see Materials and methods for accompanying details. (B) Filtered tracks are robust to intensity artefacts. Top panels show snapshots from an example video at 96 h (left), 0 h (middle) and at the time of gap closure (right). Bottom left shows the ‘raw’ superpixel tracks from 0 to 96 h, which are spurious due to autofluorescence in the green channel after green cells leave the field-of-view. Bottom middle shows the assigned red and green superpixels depicted as coloured dots marking their centroid positions at 0 h for the image shown above. Bottom right shows all filtered superpixel tracks from 0 to 96 h using the algorithm in (A). The extent of the green tracks now correctly resembles the interface shape for the image above.
Figure 2—figure supplement 2. Experimental validation of MOSES optical flow superpixel tracking.

Figure 2—figure supplement 2.

(A) Schematic illustration of the spike-in experimental setup. A separate sample of cell population 1 or 2 is dyed a third colour (blue; dots on figure) and added to the respective red or green sheet (solid colours on figure). (B) Comparison of MOSES tracks of spiked-in cells and the benchmark single-cell TrackMate tracks when the blue spiked-in cells were explicitly tracked (blue boxplots, 10,000 superpixels) or indirectly inferred from the respective coloured sheet (black boxplots, 1000 superpixels). Red boxplot centre line = median. Boxes of boxplots show interquartile range (IQR). Whiskers are located 1.5 x IQR above and below quartiles. Track similarity is reported using normalised velocity cross-correlation after matching the TrackMate and MOSES tracks (M) and if the correct matchings were randomly permuted (P). Permuted track similarity values for each video are the average of 10 random permutations. (C) Track results of the best and worst inferred tracks from sheet motion according to track similarity and the respective single-cell spiked-in MOSES tracks overlaid on a video snapshot at 0 h. Magenta: TrackMate tracks. Green: MOSES tracks of green cell population. (D) Breakdown of the mean MOSES track similarity with TrackMate tracks in (B) with respect to the three tested cell lines in combination; EPC2, CP-A, OE33. Blue bars = matched tracks, black bars = permuted tracks as in (B). Error bars show ± one standard deviation.

Component 1: Motion extraction

We use dense optical flow to avoid the shortcomings of using cell segmentation to track individual cells in confluent tissue. Optical flow is similar to particle image velocimetry (PIV) but yields higher resolution motion fields with a displacement vector for each pixel at every time point. PIV typically estimates a single velocity for an image patch or ‘window’ using spatial correlation. This corresponds to extracting only the superpixel movement in Figure 2A over one frame. Optical flow is also easier to modify to account for additional physical phenomena such as out-of-plane motion and large, discontinuous movements not captured within a PIV ‘window’ (Brox et al., 2004; Brox et al., 2009).

Component 2: Construct long-time motion tracks

To capture all spatial motion dynamics in the video, we construct long-time motion tracks that continuously track the cellular motion for any amount of time using superpixels. Local neighbouring image pixels are grouped into a pre-specified (see below) number of image regions or ‘superpixels’ to enable spatial averaging of the pixel-based motion fields (Figure 2A, Step 1). At each time point, the (x,y) position of a given superpixel is updated by averaging the optical flow (Farnebäck, 2003) over the image area covered by the superpixel to obtain the displaced direction and distance (Figure 2A, Steps 2-4). Subsequently, long-time superpixel motion tracks are generated by concatenating the positions of each superpixel at all time-points (Figure 2A, Step 5). All long-time superpixel motion tracks together encode the entire spatio-temporal motion history within the video. By default, the number of superpixels used are user-specified and fixed to sufficiently subsample only the field-of-view of the initial video frame. Where a video exhibits highly dynamic movement and coverage of the full spatio-temporal motion, including the movement of new cells not initially present but which later move into the field of view, is necessary for analysis, additional superpixels can be dynamically added during tracking. Tracks produced by the latter approach are known as dense trajectories (Wang et al., 2011). In our experiments, we used a fixed number of 1000 superpixels to cover an image size of 1344 x 1024 (with this setting an average superpixel is 37 pixels x 37 pixels covering 14,000 μm2 (2x lens) or  3600 μm2 (4x lens)) to monitor epithelial sheet dynamics. For multichannel images, each channel representing different coloured cell populations (i.e. red and green in our assay) was independently tracked in this manner (Figure 2B).

Depending on the specific application, background artefacts such as stage drift or floating debris, can contribute tracks that are biologically irrelevant and need to be filtered in post-processing (Figure 2B, middle panel). In this filtering step, superpixels are assigned to cover only the dynamic motion of all ‘objects’ of interest. The ‘object’ for single-cell tracking is each individual cell, and for epithelial sheets is the entire sheet within the video frame. To assign superpixels to each sheet, we use motion information (Figure 2—figure supplement 1, Materials and methods). This approach avoids relying on image intensity features whose variation commonly leads to segmentation errors. We validated our motion extraction and long-time track construction using published single-cell tracking datasets from the cell tracking challenge (Maška et al., 2014) (Video 3), and also experimentally, by spiking one of the dyed cell populations in the in vitro setup described above with a third, sparse population of blue-dyed cells. Our method produced very similar tracks (similarity score of 0.8) to an open software tool for single-particle tracking, TrackMate (Tinevez et al., 2017) (Figure 2—figure supplement 2, Video 4).

Video 3. Single-cell motion extraction using MOSES workflow compared to manual annotations used in the cell tracking challenge dataset (Maška et al., 2014).
Download video file (6MB, mp4)
DOI: 10.7554/eLife.40162.013

Bar: 100 μm.

Video 4. Assessment of individual cell motion in confluent cell sheets using spiked-in fluorescent cell populations and MOSES.
Download video file (6.4MB, mp4)
DOI: 10.7554/eLife.40162.014

Blue (B) represents the spiked-in population of EPC2 cells. (R) and (G) refer to the red and green EPC2 epithelial sheets, respectively. Bar: 500 μm.

Component 3: Capture local dynamic context

A ‘mesh’ is constructed by ‘linking’ superpixels to capture how individual superpixels move with respect to their neighbours over time (Figure 2A, Step 6). The constructed dynamic mesh, inspired by work in human surveillance (Chang et al., 2011), naturally captures the local collective dynamics without explicit thresholding or clustering. Separate meshes are produced for each colour channel. Multiple different meshes can be defined and constructed for specific purposes (see Materials and methods). Because the mesh is a representation of interactions inferred from the local spatial relationships of superpixels, it can be used to derive robust local and global measures of motion relating to biological phenomena such as collective motion and tissue interactions (see below).

In summary, MOSES has been designed to describe the video motion in terms of the individual trajectories of moving ‘entities’ (defined by the image superpixels) and their spatiotemporal interactions (as captured by a dynamic mesh). Depending on the application, superpixels can be assigned to and biologically interpreted as part of an object, a single cell or multiple cells.

Quantitative measurement of squamous and columnar epithelial boundary formation using MOSES

To verify the enhanced ability of MOSES over standard PIV analysis to quantitatively assess relevant biological features of interest in tissue, we applied it to analyse the formation of boundary dynamics between squamous and columnar cells. In total, 125 videos (48 with 0% serum and 77 with 5% serum) were collected from four independent experiments and jointly analysed. These videos are highly heterogeneous, creating a challenging dataset for analysis (Supplementary file 1, Figure 3—figure supplement 1).

Standard velocity kymographs were able to firstly confirm our observations in Figure 1E–G of the differences in cell motion between 0% and 5% serum conditions. As shown in Figure 3A, all cell combinations grown in 0% serum exhibited a similar global motion pattern, minimal sheet-like motion and interface movement following gap closure. By contrast, in 5% serum the same three cell line combinations exhibited very different dynamics, amongst which we observed the formation of a stable equilibrium following contact only between EPC2 and CP-A cell sheets. However, subsequent computation of the mean sheet speed, as performed during routine PIV analysis, highlighted the analytical need for a computational framework to extract more descriptive motion measures. As shown in Figure 3B, aside from an indication of a global speed increase in 5% serum, speed was an insufficient metric to characterise the observed phenotypes, exhibiting high variance between replicates and failure to highlight the uniqueness of the EPC2:CP-A interaction. We thus used MOSES to construct more discriminative phenotypic measures to characterise the boundary formation of EPC2:CP-A.

Figure 3. MOSES quantifies boundary formation and epithelial sheet interaction dynamics.

(A) Projected ‘x’-direction velocity kymograph of the dense optical flow for each cell line combination at the two serum concentrations used in the assay. The speed and direction of movement are indicated by the intensity and colour respectively (blue, moving left; red, moving right). (B) Average speed of each video for each cell combination, coloured green and yellow for the first and second cell line in the named combinations respectively. (C,D) Violin plots of boundary formation index (C) and mesh stability (D) for each video (black dot) for different cell combinations in 0% and 5% serum. Dashed line is the threshold, one standard deviation above the pooled mean value of all cell line combinations in 0% serum. Red solid line in violins = mean, black solid line in violins = median. (E) Maximum (Max.) velocity cross-correlation between the two sheets, before and after gap closure, left and right violins respectively for each cell line combination. Shaded region of all violins is the probability density of the data whose width is proportional to the number of videos at this value. (F) Boxplots showing median and interquartile range (IQR) of velocity order parameter and (G) mesh order for each cell line combination coloured green and yellow for the first and second cell line in the named combination, respectively. Whiskers show data within 1.5 x IQR of upper and lower quartiles.

Figure 3.

Figure 3—figure supplement 1. Heterogeneity in motion dynamics and quality of image acquisition.

Figure 3—figure supplement 1.

A random selection of still images from videos are shown as examples of the variability in the acquired video frames. Cell combinations are labelled such that the first cell line name corresponds to the left sheet and the second cell line name the right sheet. The dye used is indicated as (R) for red and (G) for green. Left: Examples at 0 h at the start of the experiment. Middle: Examples when the red and green sheets first close the gap between them. The gap closure point varies among videos. An automatic method was used to detect this to ± 5 frames with 94% accuracy, (Figure 3—figure supplement 9). Right: at 96 h. All scale bars: 500 μm.
Figure 3—figure supplement 2. The motion saliency map and boundary formation index for analysing motion sources.

Figure 3—figure supplement 2.

(A) Illustration of the key steps in computing the motion saliency map and the boundary formation index. The term ‘values’ in the signal-to-noise ratio equation refers to the pixel intensity value of the motion saliency heat map image, which represents the degree of motion localisation at a given pixel location. Higher values indicate increased motion concentration. The terms ‘high’ and ‘low’ in the equation refer to a two-class clustering of the motion saliency image pixel values where a binary threshold is given by Otsu thresholding. All pixel values above the threshold are ‘high’ and those below the threshold are ‘low’. (B) Example video snapshots at 96 h of the three cell line combinations (EPC2:EPC2, EPC2:CP-A, EPC2:OE33) in 0% (top panel) and 5% serum (bottom panel) with corresponding motion saliency map. The computed boundary formation index is given below each image pair. All scale bars: 500 μm.
Figure 3—figure supplement 3. MOSES mesh and boundary formation index captures multiple boundary formation.

Figure 3—figure supplement 3.

The addition of high concentrations of an EGF inhibitor, Lapatnib, induces cell death in EPC2 cells, causing shrinkage at multiple sites and the effective formation of multiple boundaries. (A) Video snapshots of a representative EPC2(R):OE33(G) experiment in 5% serum with addition of 5 ng/ml Lapatnib (top row) with corresponding constructed MOSES mesh for each colour channel (middle and bottom rows). Time indicates hours after Lapatnib addition. (B) The corresponding mesh strain curve of EPC2 is irregular and the onset of cell death can be detected as early as 20 h even when the sheet appears relatively intact. (C) Snapshot of the MOSES mesh and induced strain ellipse distribution at 100 h. The formation of holes pushes local superpixels away whilst cell death sites bring neighbouring superpixels closer. Red ellipses highlight very high local concentration of superpixels. (D) Snapshot of final frame with corresponding motion saliency map and computed boundary formation index. All boundaries are detected but the boundary with the most movement, the leading edge of the OE33 (G, green) sheet is most highlighted. (E) Motion saliency map and boundary formation index for an EPC2 (R, red):CP-A (G, green) combination in 5% serum and addition of a more potent EGF inhibitor, Aeatinib (10 ng/ml). In this example, the CP-A sheet exhibits almost no movement and the multiple boundaries due to cell death are clearly highlighted. The MOSES motion saliency map thus captures the contribution from multiple motion sources.
Figure 3—figure supplement 4. The MOSES mesh stability index captures the stability of the local topology.

Figure 3—figure supplement 4.

When cells cease to move and exhibit zero velocity, the MOSES mesh stability is always stable (value of 1) but the converse is not always true. This can be shown starting from an initially regular arrangement of cells (red grid) and simulating the resultant arrangement of cells if they move with a constant average velocity towards the bottom right over time (green and blue grid snapshots). If all cells experience the same velocity (blue box, blue lines) the location relations or topology between all cells are preserved at all timepoints. Consequently, these cells have a constant 0 line as its MOSES strain curve. In contrast, stochastic movement (orange box) whereby individual cells move in random directions but collectively exhibit the same global mean direction and speed leads to an ever-changing topology. This is reflected in the increasing shape of the MOSES strain curve (orange line). The MOSES mesh stability index thus is suited for monitoring stability based on the local collectiveness even when there is no difference in the mean velocity of the collective.
Figure 3—figure supplement 5. The mesh strain vector and collective motion.

Figure 3—figure supplement 5.

(A) The mesh strain vector of a superpixel is the resultant displacement vector after summing together all individual displacement vectors of the superpixel to each connected ‘neighbour’ superpixel. (B) Comparison of the mesh strain vector with the velocity vector for two video examples in successive video frames. Top and bottom video is the MCF10-ctrl monolayer of Supp. Movie 3 and RAB5A monolayer of Supp. Movie 7 from Malinverno et al. (2017). See Video 5 of this paper for the accompanying videos to this figure.
Figure 3—figure supplement 6. Velocity cross-correlation (VCC) for measuring the motion coordination of two epithelial sheets.

Figure 3—figure supplement 6.

(A) A track is composed of sequential (x,y,t) points. (B) VCC evaluates the correlation between two given tracks at all possible time lags, yielding a curve of correlation vs time lag. (C) The maximum velocity cross-correlation corresponds to the highest value of this curve. (D) Example VCC evaluation between EPC2 (G, green cells) and CP-A (R, red cells). Each superpixel i is coloured (see colour map) according to the mean maximum VCC between it and every superpixel of the other colour that is VCCi=mean(max{VCCij}) | iColor 1, jColor 2. For ‘before closure’, we use the (x,y) tracks from the initial frame up to Tgap-5 and for ‘after closure’ from Tgap+5 to the end frame where Tgap is the gap closure frame. Superpixel tracks are plotted at their initial position for ‘before closure’ and at Tgap+5 for ‘after closure’.
Figure 3—figure supplement 7. Ranking of 5% serum videos according to boundary formation index.

Figure 3—figure supplement 7.

Final frame snapshots (96 or 144 h) ordered according to the MOSES boundary formation index in decreasing order reading from left to right, top to bottom, (n = 77). Values to 2 s.f. are given for the first video in each row and column. * demarcates the first image where a boundary does not form, according to the threshold set by our negative 0% serum controls from Figure 3 (0.69), that is all videos to the left of and above the red dashed line are predicted by MOSES to form a boundary. Coloured dots indicate the cell line combinations cultured. AGS cells are derived from gastric adenocarcinoma. EPC2:AGS is included as another columnar cell and cancer control for comparison.
Figure 3—figure supplement 8. Quantitative assessment of boundary formation and sheet-sheet interaction dynamics of all 5% serum videos.

Figure 3—figure supplement 8.

(A) Average speed of each cell line in each cell line combination in 5% serum. (B–D) Violin plots of (B) boundary formation index, (C) mesh stability index and (D) maximum (max.) velocity cross-correlation between the two sheets for all cell line combinations tested in 5% serum. Black dashed line in panels (B) and (C) is the threshold set using the 0% serum control as in Figure 3. In panel (D), for each paired violin, the left violin is before and the right violin is after gap closure. (E) Velocity cross-correlation difference before and after gap closure of (D). For all panels, each black point is a video, red solid line is the mean, black solid line is the median, shaded region is the probability density of the data whose width is proportional to the number of videos at this value, n = 77. Statistical two-tail test with Mann-Whitney U. (F,G) Boxplots of velocity order (F) and mesh order (G) for individual cell lines in each cell line combination with naming convention, cell line 1 (green):cell line 2 (yellow). AGS cells are derived from gastric adenocarcinoma. In all plots, EPC2:AGS is included as another columnar cell and cancer control for comparison.
Figure 3—figure supplement 9. Automatic determination of gap closure.

Figure 3—figure supplement 9.

(A) Epithelial sheet segmentation using the individual red and green image channels for each RGB video frame. Solid white lines mark the sheet boundaries. (B) Boundary points identified using a sweepline algorithm (Materials and methods) subsample the respective coloured sheet boundaries. Each red point is paired with the closest green point, shown with white solid lines. (C) The gap closure frame is the point of inflexion in the plot of the average pairwise distances between boundary points (normalised gap size) vs time (given as frame numbers (abbreviated as No.)). The inflexion point is automatically estimated by finding the point of intersection between the fitted linear spline approximation (green line) of the distance (red dots) and the fitted baseline + 2 times standard deviation (dashed red line). The baseline (dashed black line) is an estimation of the background noise measurement of the gap size due to image artefacts. (D) Comparison of frame number (no.) at the time of gap closure predicted by the algorithm vs the consensus of two independent human countings given by the average of their independently annotated frame number. Each blue point is a video, n = 246 videos. Note that this analysis included more videos than those reported in this paper for the purpose of increasing the rigour of the validation. These videos were derived from additional experiments, performed with the three cell lines described in combination with an additional cell line (OE21, an esophageal squamous cell carcinoma) under a range of different media conditions. Solid black line is the ideal identity line. Dashed red lines show ± 5 frames from the black ideal identity line. (E) Histogram of the error as measured by the absolute difference in frame number between automated and consensus manual annotations. (F) Histogram as in (E), showing difference between the two individual manual annotations. In (E) and (F) the accuracy score is reported treating a difference > 5 frames as a disagreement.
Figure 3—figure supplement 10. Cell infiltration, boundary shape and cell intermixing at the interface between two sheets.

Figure 3—figure supplement 10.

(A) Boxplots showing the cell infiltration of individual cell lines into the other cell line in each cell combination with naming convention, cell line 1 (green):cell line 2 (yellow) as a fraction of the total cell count 3 h after gap closure. Image segmentation was used to find the boundary. (B) Snapshot of representative videos from each cell line combination grown in 0% or 5% serum 24 h after gap closure. Scale bar: 200 μm (C) Example snapshot with found ‘boundary’ line (solid white line), its straight line equivalent (solid yellow line) and definition of boundary shape. (D) Boxplots of the mean boundary shape over time in 0% and 5% serum. Boundaries were found in each video frame using the MOSES superpixel tracks. (E) Found boundary using image segmentation (top) and thresholding of the motion saliency map (bottom). When the cell motion is coordinated, the two are the same. (F,G) Boxplots of the intermixing coefficient (see Materials and methods) based on image segmentation at 72 h, (F) or based on thresholding of the motion saliency map, (G). (H) Violin plot of (G) with width proportional to the number of videos (black points). All boxplots depict median and interquartile range (IQR) of values with whiskers defined 1.5 x IQR above and below the quartiles. Coef. is used as the abbreviation for coefficient. AGS cells are derived from gastric adenocarcinoma. In all plots, EPC2:AGS is included as another columnar cell and cancer control for comparison.

Using MOSES, we constructed a motion saliency map for visualising global motion patterns (Figure 3—figure supplement 2) and designed four different additional measures as summarised in Table 1 to quantitatively distinguish between different combinations of the three cell lines (EPC2, CP-A and OE33) used in our assay: i) boundary formation index, based on the degree that motion concentrates into a local spatial region (Figure 3—figure supplements 2 and 3); ii) mesh stability index, which measures the degree of movement between neighbouring cell groups locally within the sheet at the endpoint (Figure 3—figure supplement 4); iii) mesh order to measure the degree of collective motion of the whole sheet (Figure 3—figure supplement 5, Video 5) and iv) the maximum velocity cross-correlation, which measures the average correlation in motion between the two epithelial sheets from their movement history (Figure 3—figure supplement 6). The first three are based on the mesh; the fourth uses the individual superpixel tracks. These measures represent examples of statistics derivable using MOSES and are not limited to wound-healing type assays. For technical details, see Materials and methods. Cells grown in 0% serum were used as a computational negative control to set appropriate cut-offs for detecting boundary formation in our computational analysis (see Materials and methods for details).

Table 1. Summary of MOSES-derived measurements for discriminating boundary formation phenotype in this paper and their biological interpretation and application.

Proposed measures Biological interpretation Biological application
Motion Saliency Map ‘Heatmap’-like image that highlights spatial regions that attract or repel local cellular motion over a defined time period. Quantitative spatio-temporal readout of scratch or wound-healing assays. Highlights areas of salient motion activity spatially such as the migration of macrophages locally to inflicted wound sites.
Boundary formation index (value from 0 to 1) Quantifies the concentration of movement within one region of space. Multiple ‘hotspots’ decrease this index. Uniform distributed movement (no attraction) scores 0. Concentration of movement in a single region for example a line or point scores 1. Quantification of the spatial uniformity or ‘spread’ of motion activity. Highlights for example if all migrating cells move uniformly to close the wound in a wound-healing assay.
Mesh stability index (value from -∞ to 1) Measures the motion stability of local cell groups by measuring the change in relative spatial arrangement (topology) with respect to neighbouring cell groups at a given endpoint. Static epithelial sheets and epithelial sheets that move uniformly in a single direction preserve local cellular arrangement and scores 1. Motion that results in the local spatial rearrangement of cells such as neighbour exchange in embryos is unstable and scores < 1. Assessing the final global movement stability of a collective such as an embryo or epithelial sheet based on the movement of the cells that compose it.
Mesh order (value > 0) Measures the collectiveness of local cellular migration by the change in distance and direction of cells with respect to neighbouring cells. Hypothesizes that cells that are part of the same motion group exhibit the same ‘mesh’ force and move retaining the same local spatial arrangement. Quantitative assessment of the extent of global sheet-like movement.

Video 5. Comparison of the temporal stability of velocity and mesh strain vectors for measuring collective motion.

Download video file (38.8MB, mp4)
DOI: 10.7554/eLife.40162.032

Individual velocity vectors are represented with red arrows, individual mesh strain vectors with green arrows. The large blue and black arrow are the global mean velocity and global mean mesh strain vector respectively. The corresponding derived MOSES mesh is shown on the right of the arrow plots in black. Bar: 100 μm.

Computing the proposed indices for all videos, the boundary formation index was ranked on a continuous scale for all 5% serum videos (Figure 3—figure supplement 7) by MOSES. The boundary formation index was highest (median 0.74) for EPC2:CP-A grown in 5% serum (n = 30/77) (Figure 3C, Figure 3—figure supplement 8B), whilst EPC2:EPC2 and EPC2:OE33 in 5% serum were below the boundary formation cut-off. In 5% serum, EPC2:CP-A had the highest mesh stability index (median 0.90), compared to EPC2:EPC2 (0.76) and EPC2:OE33 (0.25) (Figure 3D, Figure 3—figure supplement 8C). Together these results illustrate that only the squamous-columnar combination (EPC2:CP-A) formed a final stable boundary.

We next measured sheet-sheet interactions using the maximum velocity cross-correlation (VCC) before and after gap closure for the three cell line combinations. Gap closure was automatically determined (Figure 3—figure supplement 9 , Materials and methods). For two initially diametrically opposed migrating sheets, a significant increase in VCC after gap closure compared to before gap closure is suggestive of increased sheet interaction. In 0% serum, no difference in velocity cross-correlation across all combinations was found before and after gap closure (Figure 3E ) - the two sheets do not move cohesively as a unit, as expected with minimal cell-cell contact. In serum, the difference for EPC2:CP-A (0.03 before and 0.20 after gap closure) was ~3-6 times larger than for EPC2:EPC2 (0.01 to 0.08) and EPC2:OE33 (0.02 to 0.05) (c.f. left and right violin plots in Figure 3E ), suggesting potential physical interaction. Interestingly, CP-A:OE33 (Barrett’s:cancer, n = 6) also exhibited a substantial increase in VCC following gap closure (0.03 to 0.17) (Figure 3—figure supplement 8D,E) but no substantial increase was observed for CP-A:CP-A (0.01 to 0.06) (Figure 3—figure supplement 8D,E), confirming this observation was not simply CP-A cell line-specific. Both the velocity order parameter and our mesh order parameter (Figure 3F,G) indicate clear overall reduction in collective motion in cells grown in 0% serum compared to 5% serum.

Finally, we used MOSES with image segmentation and convolutional neural network (CNN) cell counting to assess morphological variations of the interface among all cell line combinations to check for ‘invasive fingers’ or boundary breakdowns that often exist in normal and tumour cell-formed boundaries (Figure 3—figure supplement 10A,B). Interestingly, we found non-invading boundaries in all cell line combinations except CP-A:OE33, where we observed clear infiltration with ‘finger-like’ protrusions of CP-A into the OE33 cell sheet grown in 5% serum (Figure 3—figure supplement 10B). No such ‘finger-like’ protrusions or significant intermixing of cells (Figure 3—figure supplement 10D,F–H) were detected in EPC2:CP-A under the same conditions. Altogether, our characterisation suggests that, among all tested cell-type combinations, we detect an ‘interacting’ final stable boundary uniquely in the squamous-columnar EPC2:CP-A combination grown in 5% serum (Video 6).

Video 6. Representative videos of the sheet migration of all tested cell combinations in 5% FBS. (R) and (G) denote red and green dyed cells respectively.

Download video file (9.4MB, mp4)
DOI: 10.7554/eLife.40162.027

Film duration 144 h. Bar: 500 μm.

MOSES can measure subtle phenotype changes induced by external stimuli

The ability to assess subtle phenotype changes with a minimal number of replicates is critical for high-content screening applications. We thus tested the ability of MOSES to detect changes in EPC2:CP-A boundary formation using an external perturbation. The epidermal growth factor receptor (EGFR) is frequently mutated in EAC, sometimes overexpressed in BE and is activated by bile acid reflux, the main observed cause of BE clinically (Dixon et al., 2001; Souza, 2010). In our experiments we thus used epidermal growth factor (EGF), the ligand of EGFR, as a biologically relevant external perturbation. Increasing amounts of EGF (0 ng/ml to 20 ng/ml) were added to the culture medium to assess incremental effects on cellular motion and boundary formation in the EPC2:CP-A combination. A total of 40 videos (each 144 h, one frame per h) were collected from three independent experiments, in a 24-well plate medium-throughput screen (Supplementary file 2). With increasing EGF, the boundary position was displaced a distance farther from the initial point of contact between the two cell populations, with slightly enhanced cell speed and decreased boundary coherence (Figure 4A–D). This is quantitatively reflected in the shape of the mean normalised strain curve (Figure 4E) which measures the average distance between neighbouring cell groups (Materials and methods): at 0 ng/ml EGF, the curve linearly increases before plateauing around 72 h; as EGF concentration increases, the curve becomes more linear and the plateau is lost above 5 ng/ml.

Figure 4. EGF titration at physiological levels disrupts boundary formation.

(A) Destabilisation of the junction with EGF addition. All in 5% serum with snapshots of endpoint (144 h). Shown also is the green channel CP-A MOSES mesh. The closeness of the lines indicates impeded motion leading to a local aggregation of superpixels in the vicinity and is suggestive of a boundary. The less lattice-like the mesh, the less ordered the motion. Blue triangles mark the boundary position in the image and its corresponding inferred position in the CP-A mesh. All scale bars: 500 μm. (B) Top: maximum projected video kymograph. Bottom: x-direction velocity kymograph computed from optical flow for the representative videos in (A). (C) Grouped boxplot of the average speed for the different cell lines in the combination in 5% serum with increasing EGF concentration. (D) Mean displaced distance of the boundary normalised by image width following gap closure with increasing EGF concentration in 5% serum. Mean displaced distance of EPC2:CP-A and EPC2:OE33 cultured in 5% serum from Figure 1G are also plotted for comparison. T-test was used with * indicating p = < 0.05, ** p = < 0.01, *** p = < 0.001. Error bars are plotted for ± one standard deviation of the mean. (E) Mean normalised strain curves for EPC2:CP-A in 5% serum for each concentration of EGF. The mean curve for EPC2:OE33 videos in 5% serum without EGF in Figure 3 is shown for comparison (black curve). (F–H) Violin plots of boundary formation index (F), mesh stability index (G) and maximum velocity cross-correlation (H) for each concentration of EGF and cells in 5% serum. Red solid line = mean, Black solid line = median. Dots are individual videos, total n = 40. Shaded region is the probability density of the data whose width is proportional to the number of videos at this value. Violins of respective measures for EPC2:OE33 in 5% serum without EGF with thresholds (horizontal black line) from Figure 3 is shown for comparison. (I,J) Boxplots of velocity order (I) and mesh order (J) for individual cell lines (left) and pooled across the two cell lines in the combination (right). Values for EPC2:OE33 in 5% serum without EGF and threshold from Figure 3 are shown for comparison.

Figure 4.

Figure 4—figure supplement 1. Migration-independent cell counting to assess cell proliferation upon EGF addition to EPC2:CP-A in 5% serum.

Figure 4—figure supplement 1.

(A) Example results of CNN counting on fluorescence videos as described in Materials and methods with switching of dyes. For each panel, the top row shows video snapshots, the bottom row shows the CNN heatmap output and the numbers below it are the CNN predicted number of cells within the image patch. Exemplar patches shown are all sampled from the initial video frame. Consistent cell numbers across all sampled cell patches enables the use of the averaged cell density as an estimate of the cell density in the entire migrating epithelial sheet. Similar cell counts using either red or green dye shows cell counting is unaffected by the dye. (B) Similar examples presented as in (A) but using the matched phase contrast videos to illustrate the changing appearance of cells over time. Top panel: cell patches sampled from the initial frame (0 h). Bottom panel: cell patches sampled from the final frame (144 h). Individual cell types in co-cultures are distinguished using the matched fluorescence videos for image masking. (C) Boxplots of mean cell density (top) and corresponding mean % change in cell density (bottom) of each cell population, labelled with red and green dyes, where the average change in cell density is normalised by the average cell density across all frames up to 48 h. (D) Similar boxplots of mean % change as in (C) but with cell density computed from phase contrast videos. (E) Surface plot of the temporal evolution of the red and green channel normalised image intensity histograms for a representative video over 144 h. Given an image intensity value, the darker the colour the higher the proportion of image pixels share this image intensity (normalised frequency). (F) Plot of the change in the modal image intensity value of (E) over time with lighter colours representing later times. (G) Spatiotemporal evolution of the image intensity histogram of (E) as a heatmap with time on the y-axis, image intensity on the x-axis and the colour representing the normalised frequency. (H) Spatiotemporal evolution of the image intensity histogram of (E) shown as an overlay of histograms using the progression of dark to light colours to show time. (I) Plot of the (image intensity, normalised frequency) pair for the histogram peak estimated at each time point. Passage of time is indicated by the progression of dark to light colours. Points fit a linear regression. The proliferation gradient is the negative of the gradient of the best fit line. (J,K) Boxplots of the extracted proliferation coefficient for increasing EGF concentrations for individual cell populations where EPC2 is dyed ‘red’ (J) and ‘green’ (K). Compared to fitting the image intensity vs time in (F) which shows too discrete a transition between intensity values, fitting the normalised frequency vs image intensity was found to be more robust whilst yielding the same conclusions when all videos are of the same temporal length (144 h) as was the case here for EGF addition to EPC2:CP-A. All boxplots depict median and interquartile range (IQR) of values with whiskers defined 1.5 x IQR above and below the quartiles.
Figure 4—figure supplement 2. EGF addition to EPC2:CP-A in 0% serum does not induce boundary formation.

Figure 4—figure supplement 2.

(A) Snapshots of the video (top) and fused red EPC2 and green CP-A MOSES mesh (bottom) at 144 h with increasing EGF concentration. (B) Top: maximum projected video kymograph. Bottom: x-direction velocity kymograph computed from optical flow for the representative videos in (A). The speed and direction of movement are indicated by the intensity and colour, respectively (blue, moving left; red, moving right). (C) Average speed of each cell line in each cell line combination for increasing concentrations of EGF. (D) Mean displaced distance of the boundary following gap closure with increasing EGF concentration. Mean displaced distance of EPC2:CP-A and EPC2:OE33 in 0% serum from Figure 1G are also plotted for comparison. T-test was used with * indicating p = < 0.05, ** p = < 0.01, *** p = < 0.001. Error bars are plotted for ± one standard deviation of the mean. (E) Mean normalised strain curves for EPC2:CP-A in 0% serum with each tested EGF concentration. The mean curves for EPC2:OE33, 0% serum and EPC2:CP-A, 5% serum, EPC2:EPC2, 5% serum without EGF are also shown for comparison. (F) Evolution of the mean curves in (E) plotted onto the motion map of Figure 5. (G–I) Violin plots of (G) boundary formation index, (H) mesh stability index and (I) maximum (max.) velocity cross-correlation before and after gap closure, left and right violins respectively of each paired violin in (I). Shaded region represents the probability density of the data. Width of shaded region is proportional to the number of videos with that value. Each point is a video, black line is the median, red line is the mean. EPC2:OE33, 0% serum without EGF and thresholds in Figure 3 are shown for comparison. (J,K) Box plots of the velocity order (J) and mesh order (K) for individual cell lines (left) and pooled over the combination (right). EPC2:OE33 in 0% serum without EGF from Figure 3 is shown for comparison. (L,M) Boxplots of mean % change in cell density normalised by the average cell density (L) and mean cell density (M) of each cell for all frames up to 48 h. Each point is a video. All boxplots depict median and interquartile range (IQR) of values with whiskers defined 1.5 x IQR above and below the quartiles.

The boundary formation index decreased with increasing EGF (0.74 at 0 ng/ml to 0.46 at 20 ng/ml, comparable to EPC2:OE33 without EGF (0.46)), indicating loss of the boundary (i.e. index below the 0.69 cut-off) (Figure 4F). The mesh stability index decreased from 0.94 (stable, 0 ng/ml EGF) to 0.72 (unstable, 20 ng/ml) (Figure 4G), suggesting increased movement between neighbouring cells and the loss of interaction between the two cell populations since the maximum VCC difference before and after gap closure decreased from 0.16 (0 ng/ml EGF) to 0.04 (20 ng/ml EGF) (Figure 4H). The maximum VCC after gap closure was similar to that for EPC2:OE33 (0.02) (Figure 4H), but the mesh stability index remained higher (Figure 4G). Together these measures show that above 5 ng/ml EGF, the phenotype of EPC2:CP-A becomes similar to that between EPC2 and the EAC cell line, OE33 (Video 7). Cell counting and quantification of fluorescence decay suggest cell division is not the primary factor influencing the boundary at the cell density used in our experiments in 5% serum (Figure 4—figure supplement 1). Of note, titrating EGF did not rescue the effect of serum absence, with non-significant changes in the boundary formation index (0 ng/ml: 0.60 ± 0.07, 20 ng/ml: 0.65 ± 0.03) and maximum velocity cross-correlation, although the mesh stability index decreased, likely due to increased cell movement (Figure 4—figure supplement 2 (n = 25)).

Video 7. Motion dynamics of EPC2(R):CP-A(G) under increasing EGF addition (0–20 ng/ml).

Download video file (15.5MB, mp4)
DOI: 10.7554/eLife.40162.031

MOSES can also measure decreased collective migration in the green CP-A sheet with increasing EGF, a phenomena difficult to assess by eye. MOSES meshes are shown, red for red-dyed (R) EPC2 cells and yellow for green-dyed (G) CP-A cells, respectively. Bar: 500 μm.

Using both the velocity order parameter and mesh order index to characterise collective motion, we found little change in collective motion upon EGF addition in 0% serum (Figure 4—figure supplement 2J,K), which might explain the lack of boundary formation. However, in 5% serum plus EGF, the two measures exhibited opposite results: raising EGF concentration increased velocity order but decreased mesh order, Figure 4I,J. We note however that the mesh order, by explicitly accounting for the motion of neighbouring cells, better reflects human observation of motion in videos (Videos 57). This highlights the pitfalls of only quantifying the individual alignment of velocity vectors computed from one video frame. Cell counting for 0% serum again suggested minimal influence of cell proliferation (Figure 4—figure supplement 2L,M). In summary, this example with EGF demonstrates that MOSES enables robust continuous-scale quantification of motion phenotype following systematic perturbation.

MOSES generates motion signatures and 2D motion maps for unbiased characterisation of cellular motion phenotypes

High-content imaging screens are often explorative, with the aim of screening for unknown differences in complex cellular motions from a large number of videos in an unbiased manner (Zaritsky et al., 2017). In general it is therefore not known a priori the behaviour of the imaged cells. MOSES addresses this need for an unbiased phenotyping approach by enabling the systematic generation of unique ‘motion signatures’ for individual videos in a manner similar to the relatively automatic generation of geometric features for cell shape quantification in high-content image screens (Boutros et al., 2015; Bray et al., 2016). Below we demonstrate that unsupervised machine learning techniques requiring no manual user annotation can be applied to MOSES generated signatures to visualise all videos onto a 2D motion phenotype map. This advanced feature of MOSES enables easy visual assessment of motion phenotype and the generation of hypotheses without the need to individually interrogate each video.

The general process for the motion map generation is illustrated in Figure 5A. To position each video on a 2D map, we applied principal component analysis (PCA) to the normalised mesh strain curves of the 77 videos of all cell line combinations cultured in 5% serum conditions to learn the principal component vectors that define the x-y axis of the 2D map. The normalised mesh strain curve for each video was used here as an example 1D motion signature to summarise the entire video motion (see Materials and methods for constructing more descriptive signatures). The constructed PCA map of the 77 videos from cells cultured in 5% serum (Figure 5B) shows that this unbiased approach automatically clusters all videos corresponding to each cell line combination. Furthermore, the videos were ordered in a continuous manner, as shown by the increasingly linear shape of the mean normalised strain curve when looking left to right across the plot in Figure 5B from CP-A:CP-A to EPC2:OE33. This clustering was not achieved with root mean squared displacement (RMSD), the non-mesh version of the MOSES-normalised mesh strain curve (Figure 5—figure supplement 1). Moreover, it appears independent of the particular dimensionality reduction technique used (Figure 5—figure supplement 2), indicating that the signatures constructed using MOSES are intrinsically informative. Finally, the 1D MOSES-based motion signatures trained a machine learning classifier with no further processing to predict cell combination identity better than RMSD (Figure 5—figure supplement 2).

Figure 5. MOSES generates motion signatures to produce a 2D motion map for unbiased characterisation of cellular motion phenotypes.

In all panels, each point represents a video (see legends for colour code). The position of each video on the 2D plot is based on the normalised mesh strain curves, analysed by PCA. (A) The mapping process for a single video. (B) The 5% serum videos (n = 77) were used to set the PCA that maps a strain curve to a point in the 2D motion map. (C) The 0% serum videos (n = 48) were plotted onto the same map defined by the 5% serum videos using the learnt PCA. In (B) and (C), the mean mesh strain curves for each cell combination are shown in the insets. Light blue region marks the two standard deviations with respect to the mean curve (solid black line). (D) Same map as in (C) with points coloured according to 0% or 5% serum. (E) The normalised mean strain curves for 0–20 ng/ml EGF addition to EPC2:CP-A from Figure 4E plotted onto the same map defined by the 5% serum videos.

Figure 5.

Figure 5—figure supplement 1. Comparison of MOSES-normalised strain curves vs RMSD curves as motion signatures for motion map generation from 5% serum videos.

Figure 5—figure supplement 1.

Left: Normalised strain curves for each video organised by cell line combination with MOSES (top) and RMSD (bottom). Each video is represented by a single dashed curve coloured according to the dye colour used for the first cell line in the combination indicated on each plot. Thus for combinations such as CP-A:CP-A where the green dyed CP-A is always the left sheet, there is only one green dashed line. Solid black line is the mean curve, blue shaded region marks ± two standard deviations of the mean curve, total n = 77 videos. Middle: PCA analysis applied to MOSES (top) and RMSD (bottom) shown with the respective derived principal components plotted above the PCA motion map. Percentage of variance explained by each principal component or PC is displayed in brackets. Each point is a video (see legends for colour code). Right: Classification report with the precision, recall and f1 score after fitting a balanced linear SVM (support vector machine) machine learning classifier trained on the normalised MOSES strain or RMSD curves of all the videos, n = 77. AGS cells are derived from gastric adenocarcinoma. In all plots, EPC2:AGS is included as another columnar cell and cancer control for comparison.
Figure 5—figure supplement 2. Comparison of motion map learning using different dimensional reduction techniques with MOSES strain curves and RMSD curves. .

Figure 5—figure supplement 2.

In each panel, the same 77 serum videos were used. From left to right, PCA - principal components analysis, MDS - multidimensional scaling, TSNE - t-distributed stochastic neighbour embedding and a neural network autoencoder. See Material and methods for details. Each point represents a video, as indicated in the legend. In all plots, EPC2:AGS is included as another columnar cell and cancer control for comparison.

To demonstrate how such 2D motion phenotype maps can be used to compare videos, we next mapped the 48 videos from 0% serum cultures on the same axes as the videos from 5% serum cultures (Figure 5C,D). The videos from 0% serum mapped to a different area of the 2D plot, whilst preserving the continuous ordering of the previous videos. Therefore, without having watched the videos, it is easy to predict that the cells have markedly different motion dynamics in 0% serum compared to 5% serum. Furthermore, since the points for the 5% serum videos cover a larger area of the 2D plot than the 0% serum videos, one can predict more diversity of motion in 5% serum (Figure 5D).

The motion map can also capture subtle changes in dynamic behaviour. This is demonstrated by mapping the mean video motion for each concentration of EGF from 0 to 20 ng/ml (represented by the respective mean normalised strain curves for each concentration (one per concentration from a total n = 40 videos, Figure 4E) onto the same axis as the 5% serum videos in the absence of EGF (square points in Figure 5E). With increasing EGF, the squamous-columnar EPC2:CP-A motion dynamics become increasingly similar to squamous-cancer EPC2:OE33 above 5 ng/ml, as evidenced by the square points moving from the area of blue circular EPC2:CP-A points into the area of orange circular EPC2:OE33 points. Thus our motion map is consistent with the result using the specific derived measures (above). These results illustrate the ability to detect biological and technical variability across independent experiments and that MOSES possesses the required features for an algorithm to be used in an unbiased manner in high-content screens with minimal prior knowledge.

Comparison between MOSES and PIV

Finally, to further illustrate the full potential of MOSES and to demonstrate its application, we compared MOSES with the widely used PIV method on two published timelapse microscopy datasets of epithelial monolayers. In the original publication, Malinverno et al. (2017) used PIV to describe the induction of large-scale coordinated motility in MCF-10A RAB5A expressing cells compared to MCF-10A control cells. In the publication associated with the second dataset, Rodríguez-Franco et al. (2017) used PIV to show the detection of deformation waves that propagate away from the cell boundary between two epithelial monolayers. The MDCK cell monolayers expressed EphB2 and its ligand ephrinB1, respectively.

Reanalysing the datasets with MOSES, we found that motion fields inferred from optical flow by MOSES were similar to those from PIV, yielding both similar speed curves and velocity kymographs. However, MOSES exhibited greater sensitivity to salient motion events (indicated in Figure 6A). Compared to PIV velocity vectors, MOSES superpixel tracks are a more data-efficient (see Discussion) encoding of the spatio-temporal velocity distribution that naturally enhances and preserves the salient motion. Reconstructed velocity kymographs from the MOSES motion trajectories capture not only the pattern of the full velocity kymograph but further selectively enhanced the detection of the deformation wave signature formed at the interface between EphB2/ephrinB1 epithelial monolayers (as indicated in Figure 6A, right panel). Thus, all velocity-based statistics that can be derived from PIV, such as the velocity order parameter, are fully preserved in MOSES. Yet, MOSES offers additional advanced possibilities. Firstly, instantaneous velocity-based measures from single videos commonly derived from PIV are noisy. For example, the velocity order parameter variation for the slower MCF-10A control cells is non-smooth and highly variable between consecutive time points (Figure 6B). This leads to the misinterpretation that at certain time points, MCF-10A control cell motion is more collective than MCF-10A RAB5A expressing cells following induction. In contrast, the MOSES mesh order exploits long-time continuity and neighbourhood relations to robustly capture collective motion in a manner consistent with human observation (Video 5). Secondly, long-time MOSES tracks and superpixel mesh strain curves can unbiasedly cluster the global motion pattern into small spatio-temporal groups (Figure 6C, Video 8). This provides a systematic approach to the interrogation of motion sources (Figure 6D, Video 9). The computation of motion saliency maps from forward and backward tracked frames effectively highlight the spatial concentration of motion, and the boundary formation index attests to the efficacy of the MOSES-enabled measures across independent datasets. Finally, and uniquely, MOSES’ meshes and tracks present a systematic framework for users to define extensive sets of custom measures to comprehensively characterise complex motion phenotypes, such as the investigated boundary formation behaviour between epithelial cells. By exploiting this, we successfully constructed feature descriptors to generate interpretable motion maps for our large video dataset. Figure 6E summarises the key points of comparison between MOSES and PIV.

Figure 6. Comparison between MOSES and PIV.

Figure 6.

(A) Left: average speed curves of MCF-10A control (CTRL) and doxycycline inducible RAB5A-expressing (RAB5A) monolayer cell migration after doxycycline addition (Supp. Movie 3 of Malinverno et al., 2017) computed using PIV and MOSES (optical flow). Green triangles indicate notable events in the movie; E1 (4 h): addition of doxycycline, E2 (16 h): first bright ‘flash’ in movie followed by accelerated movement of RAB5A expressing cells, E3 (25 h): timepoint at which RAB5A cells moved fastest. Right: velocity kymographs of Videos 1 and 2 (c.f. Figures 3 and 4 respectively in Rodríguez-Franco et al. (2017)) computed from PIV and MOSES motion fields showing the presence of deformation waves (black dash-dot line) due to cell jamming following initial gap closure (green dashed line). The corresponding velocity kymographs reconstructed from a fixed number of 1000 MOSES superpixel tracks and a ‘dense’ number of superpixel tracks (starting with 1000 superpixels) is shown for comparison. The speed and direction of movement are indicated by the intensity and colour, respectively (blue, moving right; red, moving left). (B) Top and middle: velocity order parameter curves as defined in Malinverno et al. (2017) for the MCF-10A control and RAB5A expressing cell lines was computed for the same movies as (A) using PIV and MOSES. Bottom: corresponding MOSES mesh order curve. Black vertical dashed line mark the addition of doxycycline. (C) MOSES superpixel tracks computed for wound-healing assay of MCF-10A and RAB5A cells (Supp. Video 19 of Malinverno et al., 2017) were automatically clustered into distinct groupings and coloured uniquely according to their mesh strain curve using GMM with BIC model selection (Materials and methods, Video 8 of this paper). (D) (i,ii) Automatic clustered superpixels and associated motion saliency maps (backward tracking to identify ‘initial’ motion, forward tracking to identify ‘final’ motion) for different videos of MCF10A-control (CTRL) and RAB5A monolayer migration in Malinverno et al. (2017), (see also Video 9). (iii) Left to right: snapshots of initial and final frames, MOSES superpixel tracks (1000 superpixels) overlaid on final frames, associated ‘final’ motion saliency map and boundary formation index for CTRL and RAB5A cells for same videos in (C). (iv) Snapshot of final frame, overlaid MOSES superpixel tracks (1000 superpixels) and associated ‘final’ motion saliency map and boundary formation index for the boundary formation between EphB2 and ephrinB1 expressing MDCK monolayers (Rodríguez-Franco et al., 2017). (E) Summary of the comparison between MOSES and PIV. Long-time refers to the tracking of movement beyond one timepoint.

Video 8. Long-time superpixel track extraction and unbiased track clustering using the mesh strain curve of each superpixel as the feature vector for Supp. .

Download video file (37.8MB, mp4)
DOI: 10.7554/eLife.40162.037

Movie 19 of Malinverno et al. (2017). Each cluster is highlighted with a unique colour. The colours are arbitrary. There is no colour matching between individual videos. Bar: 100 μm.

Video 9. Long-time superpixel track extraction and unbiased track clustering using the mesh strain curve for Supp. Movie 3 and 7 of Malinverno et al. (2017).

Download video file (31.3MB, mp4)
DOI: 10.7554/eLife.40162.038

Each cluster is highlighted with a unique colour. The colours are arbitrary. There is no colour matching between individual videos. Bar: 100 μm.

Discussion

We have shown that MOSES combines the advantages of existing PIV and single-cell tracking methods to provide a single systematic approach for the analysis of complex motion and interaction patterns. Its operating principle builds upon two established and highly successful approaches, long-time trajectories and graphs/meshes. In computer vision, spatio-temporal ‘signatures’ constructed using long-time trajectories have proven superior to ‘signatures’ derived from PIV-type motion fields for encoding complex spatio-temporal events such as human actions in very large datasets (Wang et al., 2011). Meshes/graphs are ubiquitously regarded as one of the best approaches to capture relationships between ‘objects’ across many disciplines, from Google search to protein-protein interaction networks (Szklarczyk et al., 2015), flocking analyses (Ballerini et al., 2008; Zhou et al., 2013Shishika et al., 2014), to detection of cell jamming in biological physics (Lačević et al., 2003; Park et al., 2015). MOSES uniquely brings together these disparate uses of long-time trajectories and meshes into one general analysis framework. As a result, MOSES satisfies the four criteria (robust, sensitive, automatic, and unbiased) necessary for characterising and establishing new phenotypes from live cell imaging. The analysis of datasets that include variable quality videos and experiments with a small number of replicates demonstrates the potential of the proposed computational framework.

Importantly, MOSES is progress towards overcoming the individual limitations of single-cell tracking and PIV-type velocity methods. Single-cell tracks are notoriously problematic over long times; the track of a single cell may be lost or broken into many separate tracks. MOSES superpixel tracks avoids this and recovers the global motion patterns (c.f. motion saliency maps, derived measures and motion signatures). Side-by-side comparison of MOSES and the standard PIV method using published datasets demonstrates that MOSES not only enables all the measurements of PIV, but by further exploiting long-time tracks and neighbourhood relationships, delivers greater physical and biological insights. Complex salient spatio-temporal motion patterns and events such as boundary formation, deformation waves due to cell jamming between two cell populations and cell death can all be quantitatively captured by MOSES. Critically, the ability of MOSES to perform long-time tracking (up to 6 days demonstrated in this study) enabled spatial localisation of the cell populations involved in a particular motion phenotype.

MOSES does not require complex user settings to facilitate reproducibility in analyses because it does not aim to threshold or cluster out the moving objects or phenotypes during analysis, which would introduce intermediate processing errors. Rather its philosophy is to facilitate systematic generation of many motion-related measurements based on trajectory and mesh statistics sufficient for applying machine learning methods for data-driven object segmentation, video classification and phenotype detection in large video collections (e.g. Figure 5, motion map) with minimum prior information. The main parameter the user specifies is the number of initial superpixels, which determines the spatial resolution of analysis. No complicated fitting of complex models and no special hardware such as GPUs are required. MOSES is modular and its components can be readily adapted to suit specific applications, for example non-square superpixel shapes to better capture cells that undergo large shape changes. Analysis of 96 videos with 1344 × 1024 pixel resolution and 145 frames by tracking 1000 superpixels takes under 4 h on an unoptimized code implementation running on a single PC (3.2 GHz, 16 GB RAM). Results are stored efficiently (~1–2 MB per video) compared to ~0.1–1 GB per video, depending on the subsampling used to save the full spatio-temporal PIV/optical flow motion fields. Altogether our study illustrates the potential of MOSES as a powerful and systematic computational framework. It is particularly useful for unbiased explorative high-content screening with an aim to discover fundamental principles of cellular motion dynamics in biology and to identify factors or drugs that alter cellular motion dynamics in disease aetiology and treatment.

Materials and methods

Key resources table.

Reagent type (species)
or resource
Designation Source or
reference
Identifiers Additional
information
Cell line (H. sapiens) EPC2 https://www.med.upenn.edu/molecular/documents/EPCcellprotocol032008.pdf Prof. Hiroshi
Nakagawa (University
of Pennsylvania)
Cell line (H. sapiens) CP-A CP-A (KR-42421)
(ATCC )
ATCC:CRL-4027;
RRID:CVCL_C451
Cell line (H. sapiens) OE33 ECACC ECACC:96070808;
RRID: CVCL_0471
Cell line (H. sapiens) AGS AGS (ATCC CRL-1739) ATCC:CRL-1739;
RRID:CVCL_0139
Chemical
compound, drug
KSFM Gibco/Thermo Fisher Cat#:17005042
Chemical
compound, drug
RPMI 1640 medium Gibco/Thermo Fisher Cat#:21875–034
Chemical
compound, drug
Human recombinant EGF Gibco/Thermo Fisher Cat#:PHG0313
Chemical
compound, drug
Celltracker Orange (CMRA) Life Technologies/
Thermo Fisher
Cat#:C34551
Chemical
compound, drug
Celltracker Green (CMFDA) Life Technologies/
Thermo Fisher
Cat#:C7025
Chemical
compound, drug
Celltracker DeepRed Life Technologies/
Thermo Fisher
Cat#:C34565
Chemical
compound, drug
Image-iTTM FX
Signal Enhancer
Thermo Fisher Cat#:I36933
Chemical
compound, drug
Antibody diluent,
background reducing
Agilent Dako Cat#:S3022
Chemical
compound, drug
Antibody dilutent Agilent Dako Cat#:S0809
Chemical
compound, drug
Fluoromount-G SouthernBiotech Cat#:0100–01
Chemical
compound, drug
DAPI (1 mg/ml) Thermo Fisher Cat#:62248 1:1000 dilution
Antibody Goat polyclonal
anti-mouse Alexa
Fluor 488
Thermo Fisher Cat#:A-11001;
RRID:AB_2534069
1:400 dilution
Antibody Phalloidin Alexa
Fluor 647
Thermo Fisher Cat#:A22287;
RRID:AB_2620155
1:400 dilution
Antibody Mouse monoclonal
anti E-cadherin
Becton,
Dickinson U.K Ltd.
Cat#:610181;
RRID:AB_397580
1:400 dilution
Other 25 culture-inserts
2-Well for
self-insertion
Ibidi Cat#:80209
Software Fiji ImageJ https://imagej.net/Fiji RRID:SCR_002285 TrackMate Plugin
Software
/Algorithm
Motion Sensing
Superpixels
This paper RRID:SCR_016839 https://github.com/fyz11/MOSES

Cell lines and tissue culture

EPC2 (from Prof. Hiroshi Nakagawa, University of Pennsylvania, Perelman School of Medicine, Department of Gastroenterology, USA) and CP-A (ATCC) cells were grown in full KSFM (Thermo Fisher), AGS (ATCC) and OE33 (ECACC) in full RPMI with 10% FBS. Both were supplemented with glutamine and Penicillin streptomycin at 37°C and 5% CO2 until 80% confluent. To passage EPC2 and CP-A, cells were resuspended after trypsinization for 5 min with PBS supplemented with soybean trypsin inhibitor (0.25 g/L, Sigma) to prevent cell death prior to resuspension in KSFM. To store, cells were resuspended at a concentration of 106 cells/ml with 90% FBS +10% DMSO freezing media following centrifugation and stored at −80°C before passing to liquid nitrogen storage. All cell lines were tested monthly for mycoplasma infection using MycoAlertTM PLUS Mycoplasma Detection Kit (Catalog #: LT07-705 from Lonza) at the Ludwig Cancer Institute, Oxford, UK and have not shown evidence of Mycoplasma. Cell lines have been authenticated by Eurofins.

Fluorescent labelling

Cells were labelled using Celltracker Green CMFDA and Celltracker Orange CMRA dyes (Life Technologies) according to protocol. Two different concentrations 2.5 µM and 10 µM were used. The lower concentration still permits tracking but has fewer toxicity concerns.

Immunofluorescence staining

Samples were washed twice with PBS and fixed with 4% PFA for 15 min. The samples were then washed twice and permeabilised with 0.1% Triton-X for 15 min. Samples were blocked with image-iT Fx signal enhancer for 1 h before incubation with E-cadherin (610181 Becton Dickinson U.K. Limited) overnight at a 1:400 dilution in Agilent Dako Antibody Diluent with Background Reducing Components (S3022, Agilent Dako) at 4°C. Following 3 washes of 10 min in PBS, samples were incubated for 2 h in the dark with Alexa 488 goat anti-mouse (A11001, Thermo Fisher) secondary antibody and Alexa Fluor 647 Phalloidin (A22287, ThermoFisher), both at a 1:400 dilution and DAPI (1 mg/mL) at 1:1000 dilution (62248, Thermo Fisher) in Agilent Dako Antibody dilutent (S0809, Agilent Dako). Finally, samples were washed three times with PBS and mounted using Fluoromount-G (0100–01, SouthernBiotech).

Temporary divider Co-culture assay

In 70 µL of culture media, 70,000 labelled cells were seeded into each side of a cell culture insert (Ibidi) in one well of a 24-well plate. After 12 h, inserts were removed and the well washed three times with PBS to remove non-attached cells before adding the desired media (KSFM (0% serum in the text) or 1:1 mixture of KSFM:RPMI + 5% FBS (5% serum in the text)) for filming. For the perturbations, the effector, for example EGF, was also added to the media at the stated concentrations.

Spike-in experiments

A third population of cells dyed with 2.5 µM of CellTracker DeepRed (Life Technologies) was diluted 1:200 into one of the other two populations dyed with either Celltracker Green CMFDA (Life Technologies) or Celltracker Orange CMRA (Life Technologies) of the same cell line. The mix was then added to one side of the insert (Ibidi) as with the two cell population experiments described in the main text.

Image acquisition

The different conditions were filmed on a Nikon microscope for 96 or 144 h at a frequency of 1 image per hour. 2x and 4x objectives were used. The microscope filter wavelengths used to visualise the red and green dyes were 546 nm and 488 nm, respectively.

Automated cell counting with convolutional neural networks

The convolutional neural network (CNN) density counting approach of Xie et al., 2016 was used to automatically count cells in confocal DAPI-stained nuclei images or fluorescent and phase contrast video frames. Once trained, given an input image the CNN model outputs a dot-like image with the property that the sum of all pixel intensities in the output dot-like image equals the number of cells within the image (Figure 1—figure supplement 1A). We describe the training method for DAPI images only. Other image modalities were trained in a similar manner details of which are given below under ‘Migration independent cell counting in videos’. To generate the training data for DAPI cell counting, 200 image patches (size 256 pixels x 256 pixels) were first randomly extracted from the acquired DAPI images of resolution 4096 pixels x 4096 pixels (Figure 1—figure supplement 1A). For each extracted 256 × 256 image patch, the stained nuclei centroids were manually marked using a ‘dot’. Then for each 256 × 256 image patch, 50 randomly sampled 64 pixel x 64 pixel patches were extracted to yield a total training set of 10,000 image patches (Figure 1—figure supplement 1A). The CNN training settings used for a 70:30 train-test split were 200 epochs, batch size 100, RMSprop (lr = 0.001, rho = 0.9, epsilon = 1e-08, decay = 0.0) with a mean absolute error (MAE) loss. The final test accuracy was reported as the mean absolute deviation (MAD) between manually counted and predicted cell counts on the result of applying the learnt CNN to the larger 256 × 256 manually labelled image patches (Figure 1—figure supplement 1B,C). To count specific cell types stained with different coloured dyes from confocal images, epithelial sheets were segmented from their respective channels by applying k-means clustering on the RGB image pixel intensity values with k = 3 (retaining the two clusters of highest intensity) after downsampling the full-size images (4096 × 4096 pixels) by a factor of 4 (to 1024 × 1024 pixels). Small objects (<200 pixels) were removed, holes were filled and the largest connected component kept before upsampling the binary mask to its original resolution (4096 × 4096). The respective final binary mask was used to mask and count cells specific to each colour channel (Figure 1—figure supplement 1C). Counting of specific cell types in timelapse video frames, which are more variable in quality, is similar but uses the more optimized image segmentation protocol detailed below to segment individual epithelial sheets.

Migration-independent cell counting in videos

To count the migrating cells in the videos, two different approaches were used. The first and default approach used in this paper is the automatic CNN counting described above trained on manual annotations of the fluorescent video frames (Figure 4—figure supplement 1A,B). To bypass the issue of moving areas, we first produced a binary mask based on image segmentation as described below to identify the respective ‘red’ and ‘green’ cell sheet areas to quantify. Using the mask, equal sized image patches of 64 × 64 were randomly sampled. These were then fed into the trained CNN to produce cell count estimates for each image patch. The average cell density over 100 random such image patches were taken as an estimate of cell density for the entire frame. This operation was repeated for each channel separately and for every time point to yield a temporal cell count profile. The proliferation rate was subsequently estimated as the average absolute cell density change in successive frames normalised by the mean cell density (computed over the desired time frame) (Figure 4—figure supplement 1C). To check results were unaffected by the imaging modality, CNN based counting was also applied to the corresponding phase contrast images where the fluorescence channels were used to identify the individual cell types (Figure 4—figure supplement 1D). The second approach exploits the change in dye fluorescence as cells proliferate (the cell dye intensity decreases as they divide) (Figure 4—figure supplement 1E–H). For our videos, the image intensity exhibits too discrete a transition between time points (Figure 4—figure supplement 1F) for accurate extraction of the fluorescence decay. Instead, given that all videos of EGF addition to EPC2:CP-A were of the same temporal length (144 h), we found that the modal image intensity and modal normalised frequency of the intensity histogram peak at every time point served as an alternative quantification of fluorescence decay that yields the same conclusions (data not shown) but could be robustly fitted with a linear best-fit line (Figure 4—figure supplement 1I). The faster the fluorescence decay, the higher was the cell proliferation and the higher the absolute value of the fitted linear gradient that we used as a proxy proliferation coefficient (Figure 4—figure supplement 1J,K).

Image segmentation of epithelial sheets in timelapse video

Red and green channel images were anisotropically filtered (Perona et al., 1994) to enhance image edges and suppress stochastic image noise. The entropy image was then computed to enhance the epithelial sheet. Otsu thresholding was subsequently applied to obtain red and green binary masks of the sheet. Finally ‘holes’ in the resultant masks were filled using a line rastering approach.

Computing the distance displaced of the boundary following gap closure

A custom image edge finding script using Sobel filters to detect edges on downsampled fluorescent images was used to find the leading sheet edge (in terms of an average x-coordinate) of both cell populations for each time frame (the averaged y-coordinate of the boundary did not change significantly over the filmed duration). The gap closure point was determined by the intersection of the two x-coordinate curves. The displaced distance of the boundary following gap closure was computed as the absolute distance between the final frame averaged x-position and the gap-closure frame averaged x-position.

MOSES framework

MOSES was developed using the Python Anaconda 2.7 distribution, in particular it uses Numpy, the Scipy-stack and OpenCV libraries. It comprises separately a cell tracking and data analysis component.

Motion extraction

Regular superpixels were generated by applying the SLIC (Achanta et al., 2012) algorithm in scikit-image on a blank image, the same size as the video frame. 1000 superpixels were used throughout in this paper. Motion fields for updating the superpixel centroid position over time were computed with OpenCV Farnebäck optical flow (Farnebäck, 2003) using default parameters. For ease of implementation, displacement vectors were rounded to the nearest integer. Superpixels passing out of the frame progressively lose pixels and retain their last motion position for the tracking duration. For simplicity, lost area is not recovered. PIV was computed using the Python openpiv package, using the closest equivalent parameters to MOSES; window size of 16 pixels, overlap of 8 pixels and search area of 32 pixels.

Motion feature-based sheet segmentation and superpixel assignment

Step 1: Given the tracks of one colour for example red, identify all superpixels that initially move by thresholding on the cumulatively moved distance within the first few frames, (here we used 2 frames, equivalent to 2 h). Step 2: Form the superpixel neighbourhood graph by connecting together all identified moving superpixels from step 1 to any other identified moving superpixel within a specified radial distance cutoff (1.2 x average superpixel width here) using their initial (x,y) centroid positions. The largest connected graph component is then found to approximate the covered area of the epithelial sheet at frame 0. Step 3: In some videos, image artefacts such as autofluorescence or the presence of isolated cells contributes superpixel tracks that are biologically irrelevant and affects quantification of the migrating sheet dynamics. Tracks associated with these noise sources must be removed. The need for such removal is automatically evaluated through a user-set cutoff based on prior knowledge of the expected maximum fraction of the field-of-view covered by any one of the red or green populations at time t = 0 (e.g. for a 50:50 plating of red and green cells, a conservative cut-off fraction of 0.70 is used here). Assuming that no red and green superpixel can jointly occupy the same (x,y) position in frame 0, joint filtering based on the degree of movement is applied to clean segmentation errors from previous steps. Step 4 (not required if running MOSES using dense tracking): The kept superpixels after steps 1-3, are then iteratively dynamically propagated to identify ‘activated’ superpixels (those that lie in the joint area occupied by the kept superpixels) frame-by-frame. Step 5 (potentially optional): In case the dynamic superpixel propagation identifies superpixels that do not move much over the entire video and to ensure the temporal continuity of superpixel positions, steps 1-2 are repeated. Step 6: To ensure the same number of superpixel tracks across all videos for statistical comparison, finally we assign constant tracks for all unused or inactivated superpixel tracks where for all frames their (x,y) positions are fixed to their initial centroid position. The procedure described is illustrated more concisely in Figure 2—figure supplement 1A.

Dynamic mesh generation

To generate meshes, each superpixel is connected to its nearest ‘neighbours’. The notion of ‘neighbour’ is mathematically defined by the user (see below). For the MOSES mesh used here for visualisation and stability analysis, we defined ‘neighbours’ using a pre-set distance cut-off based on the distance between individual superpixel centroids at the start of tracking. The mesh strain curve thus measures the relative distortion between connected superpixels with respect to their initial mesh geometry over time. For computation of the boundary formation index, a different mesh was used where neighbours were independently determined frame-by-frame by a pre-set distance threshold. The specified threshold in both meshes are given in Euclidean distance as a multiplicative factor of the average superpixel width used. A factor of 1.2 for the mesh strain and 5.0 was used for the boundary formation index throughout.

MOSES dynamic meshes – mining contextual relationships of spatio-temporal tracks using geometry and graph theory

Numerous ways exist to connect a collection of (x,y) points to form a graph or mesh. Depending on the mesh formed, different aspects of the movement can be enhanced and probed in interesting ways. In this paper, the presence of collective motion biologically motivates the mesh concept. More generally, meshes are ‘abstract’ constructs to assess relationships. In terms of motion analysis, in this paper, we recognise that different spatially located points may be correlated particularly if they are spatially close. In view of this, we first describe and explain the rationale of the two meshes constructed in the main text for visualising and quantifying epithelial sheet dynamics before describing possible extensions and how these may be more useful for specific experiments. The first and the primary mesh used throughout is what we term the MOSES mesh. It is constructed by joining each superpixel track with all superpixel tracks whose initial (x,y) centroids are within a user-specified constant Euclidean distance cut-off. Implicitly this assumes that under collective sheet migration, initially spatially close superpixels continue to remain spatially close. Violation of this assumption leads to large stretching or compression of the MOSES mesh, which can be used to derive a measure of motion collectiveness (see mesh order below). The second mesh is used to generate the motion saliency maps for localising boundary formation, a dynamic state that varies frame to frame. Each superpixel track is again connected to all superpixel tracks whose (x,y) centroids are within a user-specified Euclidean distance cut-off. However, contrary to the first mesh, which uses only the centroid positions in frame 0, here the distances are determined using the current position of all superpixels in the current frame. Thus the ‘neighbours’ continuously change frame-to-frame. We refer to this mesh as the radius neighbour graph and the motion saliency map is the result of counting the number of neighbours for each superpixel (i.e. the node degree). The number of surrounding neighbours increases in spatial areas where motion concentrates e.g. there exists a local chemoattractant or a physical impedance to cell movement such as a boundary. Thus boundaries are natural attraction motion centres; leading superpixels at the boundary cannot advance whilst those behind continue to move towards the boundary. This leads to an overall accumulation of superpixels at the boundary (c.f. motion saliency maps in Figure 3—figure supplement 2). In both meshes, tuning the distance cut-off threshold tunes the length-scale of the spatial interaction one wishes to analyse. For boundary formation, a phenomenon that spans the entire height of the image, one should choose a relatively large distance cut-off such as 5x the average superpixel width compared to, for example, identifying the localisation of a macrophage to a cell apoptosis site. In the latter, the attraction site is more point-like and a radius cut-off 1x the average superpixel width may be more accurate. Finally, whilst both of the presented meshes in our paper only utilise physical distance to define neighbours this is by no means the only possibility. In some applications such as flocking, the topological distance (i.e. the number of points away) not the physical distance may be more relevant (Ballerini et al., 2008). In this situation, it is more common to construct the local kNN (k-nearest neighbour) graph, designating the closest k superpixels as neighbours. The kNN is also frequently used when the magnitude of the ‘interaction’ between points is not known a priori as a way to propagate local information and mine patterns in data c.f. t-SNE for dimensionality reduction, spectral clustering and similarity network fusion (Wang et al., 2014) for combining multimodal datasets. Finally, superpixels may be joined not only spatially but also temporally to enforce consistent temporal neighbour relations (Chang et al., 2011) as well as according to more ‘semantic’ notions of similarity for example similar image appearance, similar instantaneous velocities (Chang et al., 2011). In short, by formalising motion analysis under the framework of dynamical meshes that connects together ‘neighbouring’ superpixels, we can analyse complex motion not just in terms of instantaneous speed and orientation but can additionally leverage powerful established tools developed in the fields of computational graph theory, network theory, topology etc. to effectively quantify and mine increasingly complex motion phenotypes in high-content screens.

Mean squared displacement (MSD)

As a measure of cellular motions, MSD was computed as a function of time interval, Δt as in (Park et al., 2015).

MSD(Δt)=|ri(t+Δt)ri(t)|2

where ri(t) is the position of the superpixel i at time t and is the average over all time t and all superpixels. For small Δt, the MSD increases as a power law Δtα, where the exponent α is determined empirically by fitting. For unity exponent (α=1), the movement is uncorrelated random Brownian motion and cellular motion is diffusive. When α>1, cellular motions are super-diffusive, and when α=2, motions are ‘ballistic’.

Root Mean Squared Displacement (RMSD)

As a summary of the whole video motion and a measure of movement, the root mean squared displacement was computed as a function of time relative to the initial time t0RMSD(t)=|ri(t)ri(t0)|2 where ri(t) is the position of the superpixel i at time t and  is the average over all time t and all superpixels. For multi-channel videos, the average RMSD was used to describe video motion. Unless otherwise stated in the text, the normalised RMSD (here division by maximum value within the common time window of comparison) was plotted to permit comparison across different conditions. To compare across videos of different duration, in Figure 1—figure supplement 4B we instead compute the RMSD divided by its value at 96 h, the maximum timepoint shared by both our 96 h and 144 h videos.

Normalised spatial correlation

For each superpixel i we define its neighbourhood, Ni as all superpixels j which lie within a specified distance, r. Given the time-dependent velocity function VVt=rt+1-rt where rt is the track (all (x,y) positions) up to time t, the normalised spatial correlation of a video with a total of n superpixel tracks is defined as

1ni=1nEjNi[cov(ViVj)σViσVj]

where E is the mean function averaging over the superpixel neighbourhood, cov() is the covariance function and σVi is the standard deviation of Vit. Computing spatial correlation as a function of r for our videos yields an exponential decay which can be fitted to an equation of the form y=ae-x/b from which the initial correlation a and characteristic correlation distance b can be determined for plotting, (Figure 1—figure supplement 5). In our plots, the distance r is in terms of normalised units (i.e. the number of average superpixel widths away).

Manual vs MOSES comparison on cell tracking challenge datasets

MOSES does not explicitly handle individual cells leaving and entering the field of view or cell divisions during filming. For fair comparison of motion capture ability with manually annotated tracks, the tracks were only compared for cells present in the initial frame as depicted with coloured masks (Figure 2B). Single-cell tracks were generated from MOSES (1000 superpixels) by identifying the superpixel track that has moved the greatest distance over the video duration amongst all superpixel tracks whose initial (x,y) position lies within the area of the individual cell of interest as marked out by manual annotation at = 0 (Video 3). 

TrackMate single-cell tracking

The Fiji TrackMate plugin (Tinevez et al., 2017) was run on the third blue image channel containing only the sparse population of DeepRed dyed cells with the following settings: estimated blob diameter, 10 pixel (default); threshold, 2.5; linking max distance, 50; gap-closing max distance, 50; and gap-closing max frame gap, 100. All other parameters were left at their default values.

TrackMate vs MOSES comparison

As with the cell tracking challenge dataset for fair comparison of motion capture ability with single cell trackers like TrackMate, tracks were only compared for cells present in the initial frame as detected by TrackMate. To generate single-cell tracks using MOSES (with 10,000 superpixels) for the sparse DeepRed dyed cells, the nearest four superpixel tracks to each cell centroid were averaged to produce a single track. Similarly, to generate single cell tracks using MOSES (with 1000 superpixels) from the Green CMFDA dyed or Orange CMRA dyed sheet, for each cell, the nearest four green/red superpixel tracks were found to compute a mean track. Track similarity was computed by evaluating the normalised velocity cross-correlation (value between 0 and 1 as defined below) between each MOSES track and its corresponding TrackMate track with the average normalised velocity cross-correlation over all tracks reported for each video (Figure 2—figure supplement 2B, denoted M. for matched tracks). To assess the statistical significance of the resultant value, the track similarity from random pairing of the tracks were computed and the average of 10 permutations were reported (labelled P. for permuted in Figure 2—figure supplement 2B). All three combinations of cell types (EPC2:EPC2, EPC2:CP-A and EPC2:OE33) and all red/green dye combinations were tested, a total of 23 videos (each 144 h acquired with one image per h). The frame size of each acquired video was 512 × 672 pixels. As such, the mean spiked-in cell diameter was 5 pixels, the average superpixel width was 6 pixels (10,000 superpixels) and 19 pixels (1000 superpixels). From Figure 2—figure supplement 2B and Video 4, MOSES achieves near perfect similarity compared to TrackMate. In some cases, the produced MOSES tracks are more desirable, guaranteeing a continuous track whereas TrackMate requires explicit linking of cell detections across frames and thus often tends to produce many ‘broken’ tracks when the same cell is unable to be detected across all time points.

Motion saliency map

The motion saliency map illustrates in a heat map format spatial areas of motion sources and sinks, and was constructed using the MOSES dynamic meshes and superpixel tracks. It is inspired by Lagrangian fluid mechanics (Shadden et al., 2005; Ali and Shah, 2007). To compute this map for each frame, the radius neighbour graph was constructed (see above paragraph) using the spatial positions of superpixels in that frame and a blank image was populated at the (x,y) centroid position of each superpixel with the count of the number of surrounding neighbours according to the radius neighbour graph. This yielded an ‘image’ of size n_frames x n_rows x n_cols, where n_rows, n_cols are the video frame dimensions. We now have a multidimensional spatial heat map for each frame that captures the spatial-temporal motion saliency. To reveal long-time temporally persistent behaviour, we averaged the heatmap both spatially and temporally using the superpixel partition of the initial video frame as illustrated in Figure 3—figure supplement 2. By construction, this spatial map is general for studying any phenomenon where spatial localisation plays a role.

Quantification cut-offs

We assume normally/t- distributed statistics for all measures. Boundary formation cut-off was set one standard deviation above the pooled mean of 0% serum samples. Mesh stability index was set one standard deviation below the pooled mean of 0% serum EPC2:EPC2, EPC2:CP-A samples.

Boundary formation index

The boundary formation index (Figure 3—figure supplement 2) quantifies the extent to which motion concentrates into localised spatial regions in the motion saliency map as a signal-to-noise ratio with value from 0 to 1 suitable for global comparison across video datasets. For example a boundary concentrates motion along a ‘line’ whilst cell death may generate multiple spot-like concentrations (Figure 3—figure supplement 3). The higher the index, the more that motion is concentrated into a single spatial region. To compute the boundary formation index from the visual motion saliency image, the motion saliency image was segmented into ‘high’ and ‘low’ intensity using Otsu thresholding and the normalised signal-to-noise ratio was computed, defined by mean(high)mean(low)mean(high)

(Figure 3—figure supplement 2). The mean was used as the motion saliency map was computed from a sparse set of points given by the number of superpixels. Individual pixel statistics such as the maximum intensity are therefore noisy and not robust measures. The denominator was set to be the mean of the ‘high’ intensity region in order to give a numerical value bounded between 0–1 for standardised comparison. As this measure captures the ‘peakiness’ of the spatial distribution, it can also be used to quantify other localised spatial processes with adaptation for example cell death.

Normalised mesh strain curves

For a superpixel, i at time t we define the mesh strain, εi(t) of the local neighbourhood, Ni with n neighbours as the mean of the absolute difference in the distance between superpixel i and superpixel j in its neighbourhood at time t, rijt and at the start at t=0, rij0 so that εi(t)=1nΣjNi|rij(t)rij(0)| where || is the absolute value or L1-norm. The time-dependent mesh strain for one mesh is the mean local neighbourhood mesh strain over all superpixels for each time frame. The result is a vector the same length as the number of frames in the video. For multi-channels, the average vector is used to describe video motion. The absolute value of the strain curve is susceptible to the image acquisition conditions and geometry whilst the motion information is primarily encoded by the shape of the resulting curve. To permit comparison across different conditions, the normalised strain curve (here division by maximum strain within the common time window of comparison, 0-96 h here) up to 96 h (the maximum timepoint shared between 96 h and 144 h videos in this paper) was used as a simple signature to describe the global video motion pattern when computing motion maps.

Normalised mesh strain, L1-norm and robustness

Here we provide more details as to why we chose the L1-norm for computing the mesh strain. When computing the ‘distance’ between two vectors, x1,x2 both of length n one can define the notion of distance in different ways. A popular family of distances or norms to use is the Lp-norm denoted ||x1x2||p defined mathematically as follows:

||x1x2||p=(i=1n|x1ix2i|p)1p where i=1,,n is the ith element of the vector

|| denotes the absolute difference. Within this family the most popular is the L1 (p=1) and L2 (p=2) norms. The L1-norm is also called the mean absolute deviation and the L2-norm the Euclidean or mean squared distance. The quadratic function of L2 amplifies large differences (> 1) and reduces the effect of smaller differences close to zero thus where differentiability is not a concern L1 is the more robust choice. In our case where most of the distances are larger than 1 (as all superpixels are initially seeded at a distance > 1), we chose to use L1 which is more resistant to the effect of extremal values as a result of errors in motion extraction.

Mesh stability index

The mesh stability index attempts to quantify the global stationarity (Figure 3—figure supplement 4) across a moving collective such as an epithelial sheet by measuring the change in the average distance between each superpixel and its neighbours. Specifically, it measures the flatness of the normalised mesh strain curve as defined above. Note whilst a curve is flat only if its gradient is flat that is 0, it is more visually appealing to report increasing stability with increasing values therefore we define the mesh stability index as 1 minus the end gradient. The end gradient is most stable when it has a value of 0 therefore this index is upper bounded by 1. To standardise the value of the gradient for comparison, we normalise also across time such that for any length video, time is from 0 to 1. To compute the end gradient stably without curve fitting procedures, we assume the end point is locally linear with respect to time and average the first-order differences over the last few frames. For 96 h videos, the period of stability given by the curve plateau is shorter, therefore the last 10 frames (10 h) were used for computing the gradient. For 144 h videos, the last 24 frames (24 h) were used.

Mesh order

Inspired by the definition of the velocity order parameter, the mesh order is identically defined but uses the local resultant mesh strain vector instead of instantaneous velocity vectors for computation (Figure 3—figure supplement 5). In this paper, we used the MOSES mesh but any similar mesh construction is equally valid. Given a mesh, the mesh strain vector for a superpixel is the sum of the displacement vector of the superpixel relative to each neighbouring superpixel (Figure 3—figure supplement 5). The mesh order is computed for each frame accordingly for a video. The mean value over all frames was reported for statistical comparison.

Normalised velocity cross-correlation

Cross-correlation measures the similarity between two signals taking into account time delay, (Figure 3—figure supplement 6). As ‘signals’, we use the time-dependent velocity V(t) computed from the spatial displacement between the location rt+1 at time t+1 and location rt at time t, Vt=rt+1-rt which is spatially location-independent. Velocity is a vector quantity with both x and y components. Letting  denote the scalar product, the normalised velocity cross-correlation (VCC) of the tracks of red superpixel i and green superpixel j at time t and time lag m such that the extremas are bound by [-1,1] is

VCCij(m,t)=1Tm=TTVi^(t+m)Vj^(t)

where Vi^=ViViσi , Vi and σi is the mean and standard deviation of Vi respectively. T is the maximum time lag and is set to be the length of Vi. VCC can be either positive or negative. We report the maximum absolute value for all red-green pairings and the average over all pairings as evidence of interaction between two epithelial sheets. In the main text, this is computed with tracks before (up to - 5 frames) and after (from + 5 frames) the gap closure point. The offset of 5 frames either side is based on the accuracy to which we could determine the gap closure point automatically (see below).

Automatic gap closure determination

The frame of gap closure when the two epithelial sheets contact was found by finding the video frame in which the average distance between the migrating fronts of the two epithelial layers was minimised. To compute this distance as a function of time (frame number), first each epithelial sheet was independently segmented based on their respective colour channel pixel intensity (Figure 3—figure supplement 9A). For each sheet, images were preprocessed using a median filter (square kernel, the same size as the average superpixel width) and segmented using two class k-means clustering (for 0% serum, three class k-means was used to include the weaker pixel intensity of leading cells). The resulting segmentation was post-processed by binary morphological operations (binary closing with disk kernel of 5 pixels, removal of small objects (< 5% total image size) and binary filling). Sheet boundary points were efficiently identified using a sweepline algorithm. The image was evenly divided into 100 horizontal strips or sweeps in the y-direction. For each sweep, one boundary point was identified by selecting either the right-most point (largest x-coordinate) if the sheet was moving right, or left-most point (smallest x-coordinate) if the sheet was moving left (Figure 3—figure supplement 9B). Doing this for both sheets, each of the boundary points was paired in the red/green sheet to the closest in the opposing colour by physical distance. The average distance between the migrating fronts of the two epithelial layers for a particular frame was then the average euclidean distance of all red-green boundary point pairs in the frame. Computing the average distance between the two sheets frame-by-frame, there was a change in the rate of decrease as the gap closed (Figure 3—figure supplement 9C). To estimate the frame at which this transition occurs, asymmetric least means squares (Eilers and Boelens, 2005) was used to first fit the baseline (representing the contribution to the distance measurement due to image segmentation errors) and second a linear spline (smoothing factor 0.1*variance(curve)) was used to approximate the temporal average distance curve. The gap closure frame was then found as the first frame for which the fitted spline value falls below the fitted baseline + 2*std(fitted baseline), where std is the standard deviation operation. We validated the method using a total of n = 246 videos of different cell combinations in different media by comparing the inferred frame to the consensus (average frame) of two manual annotators. A strong Pearson correlation coefficient of r = 0.902 (Figure 3—figure supplement 9D) and an accuracy of 94% within ± 5 frames (Figure 3—figure supplement 9E) compared to an accuracy of 97% within ± 5 frames between two human annotators (Figure 3—figure supplement 9F) was found.

Boundary detection

Boundaries were detected either i) by image segmentation (early timepoints) or ii) from the MOSES tracks (late time points). For image segmentation, red and green epithelial sheets were segmented as described above for timelapse video frames. The boundary binary mask was then found as the mathematical set intersection of red and green binary masks. To derive a boundary line, the non-zero image coordinates of the binary boundary mask were forced to form a line given by unique (x,y) coordinates by returning the average x-coordinate (along image horizontal) for each unique y-coordinate (along image vertical). A piece-wise cubic spline was subsequently fitted to enable interpolation of the boundary line. To derive the boundary line from the MOSES superpixel tracks, first all non-moving tracks that is all tracks that do not move a total distance greater than a predefined threshold was removed. Then at each frame, t we considered all superpixels that have moved since the previous frame t-1. We then attempted to match the red superpixels to green superpixels where we defined a match when the distance between two points is smaller than a predetermined cut-off. All red and green superpixel points that have at least one successful match were kept. Together these points form the boundary candidate points. As discussed, a boundary is a motion attractor thus we can robustly find the boundary points from the candidate points using an asymmetric least squares filter (Eilers and Boelens, 2005). We modified the original formulation of Eilers and Boelens, 2005 to enforce density based filtering by using a cut-off based on the number of neighbours within a predefined distance. Finally, given the boundary points the boundary line was found as in the case of image segmentation.

Cell infiltration

For a single video frame, the infiltration of the first colour cell type into the second colour cell type is defined as the fraction of first colour cells that lie on the side of the boundary line of the second coloured sheet. The boundary line was located from image segmentation as described above.

Boundary shape

The boundary shape is defined as the total length of the boundary line (L) divided by the equivalent straight line length (L0) illustrated as solid white and yellow lines respectively in Figure 3—figure supplement 10C.

Intermixing coefficient

We computed two different measures to reflect the ‘intermixing’ behaviour of cells at the boundary, which we denoted either ‘(Image)’ or ‘(MOSES)’ in Figure 3—figure supplement 10. The image intermixing coefficient reflects the ‘spread’ of the wound after we perform image segmentation on the video frame. A heavily infiltrated and wavy boundary yields a larger binary mask and occupies a larger fraction of the image than a sharp, linear boundary. We capture this idea by defining the intermixing coefficient (image) as the area of the boundary mask relative to the total image area. This is however a static measure of intermixing. It does not account for the fact that the boundary is not static and can be highly dynamic such as in EPC2:EPC2 or how EPC2 cells appear to ‘flee’ OE33 cells with little coordination. We thus propose a second dynamic measure of intermixing coefficient based on measuring the spread in the motion behaviour. This intermixing coefficient (MOSES) is the area of the binary mask after thresholding the motion saliency map relative to the total image area. Of note, the two intermixing coefficients are identical if the final boundary is stable and the motion at the boundary during boundary formation is coordinated such as in EPC2:CP-A (Figure 3—figure supplement 10E).

Kymographs

Boundaries were detected using the MOSES tracks as described. The kymograph of one time slice shows the median values of the projected x-direction velocities of all pixels as a function of the x-position. For superpixel tracks, the x-axis was binned with two times the number of unique x-coordinates based on the initial superpixel centroids (in line with Nyquist sampling theorem). This procedure was repeated for each timepoint to build the full kymograph over time.

Generating feature descriptors for motion map generation

A feature descriptor for an image/video is a 1D numerical vector designed to summarise the image/video content in a compact code or signature. No universally optimal method exists to generate such a descriptor for all applications. In the main text, for simplicity to avoid the introduction of new concepts we used the normalised strain curve used to compute the mesh stability index as an example video feature vector for PCA. As we showed, this was sufficient to distinguish the different migration behaviour of the investigated cell combinations. However, the normalised strain curve is a coarse 1D approximation of the local MOSES mesh stretching and only one characteristic of the complex mesh dynamics. More generally, one can exploit other mesh constructions as discussed above to derive a plethora of graph theoretic measures such as the algebraic connectivity, the Laplacian spectrum and centrality, or supplement mesh-based statistics with trajectory-based measures such as turning angle and speed for a more comprehensive unbiased description of the spatio-temporal motion. In circumstances where cellular appearance exhibits large temporal changes such as in the case of migrating cells with lamellipodium, motion features alone may not provide a sufficiently descriptive signal for quantifying phenotypic differences. Here one could additionally supplement motion signatures with appearance-based features such as image texture descriptors (e.g. LBP, Haralick, HoG (histogram of oriented gradients) and SIFT (and its variants)) and construct the mesh semantically with the augmented motion-appearance descriptor.

Dimensionality reduction experiments

We used the Python scikit-learn implementation of PCA (principal components analysis), MDS (multi-dimensional scaling) and TSNE (t-distributed stochastic neighbour embedding) and applied them to the ‘raw’ normalised mesh strain curves (no pre-processing). PCA was applied with n_components = 2 without input whitening. MDS used default scikit-learn parameters with n_components = 2, random_state = 0. TSNE used n_components = 2, learning_rate = 100, random_state = 0, init = ‘random’. A 97-20-2 fully connected neural network autoencoder was implemented and trained with Keras (Tensorflow backend) with mean squared error loss using the Adam optimizer (lr = 0.001, beta_1 = 0.5, beta_2 = 0.999, epsilon = 1e-08, decay = 0.0). 48/77 of the 5% serum videos were used for training and the remaining 29/77 for validation to check for model overfitting. Tanh activations were used throughout. To maximise gradient propagation in the linear range of the ‘tanh’ activation function, we subtract 0.5 from the input normalised strain curves (values between 0 and 1) as a pre-processing step for the neural network.

Automatic clustering of superpixel tracks

To automatically cluster superpixel tracks, we first computed for each superpixel its local mesh strain curve with respect to its neighbours. This yields a matrix N rows by T columns for N superpixel tracks and a total of T timepoints. Gaussian mixture model (GMM) was then used to generate clusters using BIC (Bayesian Information Criterion) to select the optimal number of clusters (Fraley, 1998).

Software availability

The MOSES code is available open-source under a Ludwig non-commercial and academic license at GitHub, https://github.com/fyz11/MOSES.git (Zhou, 2019; copy archived at https://github.com/elifesciences-publications/MOSES) where it is maintained and updated. Example data for testing can be downloaded from Google Drive, https://drive.google.com/open?id=0BwFVL6r9ww5BaTh6NExLR1JMUXM. The full video dataset can be found under DOIs, https://dx.doi.org/10.17632/j8yrmntc7x.1https://dx.doi.org/10.17632/vrhtsdhprr.1 for videos with and without EGF addition, respectively.

Acknowledgements

We thank Prof. Hiroshi Nakagawa for generous donation of EPC2 cells and Profs. Roberto Cerbino and Xavier Trepat for making available the raw videos in their publications. We thank Dr. Mary Muers, Dr. Françoise Howe, Profs. Sebastian Nijman, Francis Szele, Colin Goding and Shankar Srinivas for critical reading of the manuscript; Mark Shipman for technical assistance with timelapse microscopy. This work is mainly funded by the Ludwig Institute for Cancer Research (LICR) with additional support from a CRUK grant to XL (C9720/A18513). FYZ is mainly funded through the EPSRC Life Sciences Interface Doctoral Training Centre EP/F500394/1, CRP and XL are funded by LICR, RPO is supported by LICR, the Oxford Health Services Research Committee and Oxford University Clinical Academic Graduate School supported by the National Institute for Health Research (NIHR) Biomedical Research Centre based at the Oxford University Hospitals Trust, Oxford, MJW is supported by CRUK (C5255/A19498, through an Oxford Cancer Research Centre Clinical Research Training Fellowship), and JR is funded by LICR and the EPSRC SeeBiByte Programme Grant (EP/M013774/1). The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Felix Y Zhou, Email: felix.zhou@ludwig.ox.ac.uk.

Jens Rittscher, Email: jens.rittscher@eng.ox.ac.uk.

Xin Lu, Email: xin.lu@ludwig.ox.ac.uk.

Hamid Mohammadi, Francis Crick Institute, United Kingdom.

Didier Y Stainier, Max Planck Institute for Heart and Lung Research, Germany.

Funding Information

This paper was supported by the following grants:

  • Ludwig Institute for Cancer Research to Felix Y Zhou, Carlos Ruiz-Puig, Richard P Owen, Jens Rittscher, Xin Lu.

  • Engineering and Physical Sciences Research Council EP/F500394/1 to Felix Y Zhou.

  • Oxford Health Services Research Committee to Richard P Owen.

  • Oxford University Clinical Academic Graduate School to Richard P Owen.

  • Cancer Research UK C5255/A19498 to Michael J White.

  • Engineering and Physical Sciences Research Council EP/M013774/1 to Jens Rittscher.

  • Cancer Research UK C9720/A18513 to Xin Lu.

Additional information

Competing interests

A patent is pending for MOSES (UK application no. GB1716893.1, International application no. PCT/GB2018/052935)). The code is available open-source and free for academic and non-profit users under a Ludwig academic and non-profit license.

No competing interests declared.

A patent is pending for MOSES (UK application no. GB1716893.1, International application no. PCT/GB2018/052935)). The code is available open-source and free for academic and non-profit users under a Ludwig academic and non-profit license.

Author contributions

Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—original draft, Project administration, Writing—review and editing, Conceived and implemented all of MOSES and analysed all data.

Conceptualization, Resources, Data curation, Validation, Investigation, Methodology, Writing—review and editing, Performed all live cell imaging videos. Established the 24-well plate based co-culture system.

Resources, Methodology, Writing—review and editing, Aided initial prototyping of MOSES with mixed co-culture timelapse videos.

Resources, Methodology, Writing—review and editing, Aided initial prototyping of MOSES with mixed co-culture timelapse videos.

Conceptualization, Supervision, Methodology, Project administration, Writing—review and editing, Provided computational supervision.

Conceptualization, Supervision, Funding acquisition, Methodology, Writing—original draft, Project administration, Writing—review and editing, Provided overall project supervision.

Additional files

Supplementary file 1. Table summary of video datasets and experiments analysed for 0% and 5% serum.
elife-40162-supp1.docx (14.1KB, docx)
DOI: 10.7554/eLife.40162.039
Supplementary file 2. Table summary of video datasets and experiments analysed for EGF addition.
elife-40162-supp2.docx (13.7KB, docx)
DOI: 10.7554/eLife.40162.040
Transparent reporting form
DOI: 10.7554/eLife.40162.041

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. Source code files for MOSES are available and maintained on GitHub (copy archived at https://github.com/elifesciences-publications/MOSES). Analysed videos are made available online with unique DOIs ( http://dx.doi.org/10.17632/j8yrmntc7x.1, http://dx.doi.org/10.17632/vrhtsdhprr.1) through Mendeley datasets.

The following datasets were generated:

Zhou FY. 2018. Normal 2 Cell Population Migration Dataset. Mendeley Data.

Zhou FY, Puig CR. 2018. EGF Addition to EPC2:CP-A. Mendeley Data.

References

  1. Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2012;34:2274–2282. doi: 10.1109/TPAMI.2012.120. [DOI] [PubMed] [Google Scholar]
  2. Ali S, Shah M. A lagrangian particle dynamics approach for crowd flow segmentation and stability analysis. IEEE Conference On Computer Vision and Pattern Recognition; 2007. [DOI] [Google Scholar]
  3. Ballerini M, Cabibbo N, Candelier R, Cavagna A, Cisbani E, Giardina I, Lecomte V, Orlandi A, Parisi G, Procaccini A, Viale M, Zdravkovic V. Interaction ruling animal collective behavior depends on topological rather than metric distance: evidence from a field study. PNAS. 2008;105:1232–1237. doi: 10.1073/pnas.0711437105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Boonstra JJ, van Marion R, Beer DG, Lin L, Chaves P, Ribeiro C, Pereira AD, Roque L, Darnton SJ, Altorki NK, Schrump DS, Klimstra DS, Tang LH, Eshleman JR, Alvarez H, Shimada Y, van Dekken H, Tilanus HW, Dinjens WNM. Verification and unmasking of widely used human esophageal adenocarcinoma cell lines. JNCI: Journal of the National Cancer Institute. 2010;102:271–274. doi: 10.1093/jnci/djp499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boutros M, Heigwer F, Laufer C. Microscopy-based high-content screening. Cell. 2015;163:1314–1325. doi: 10.1016/j.cell.2015.11.007. [DOI] [PubMed] [Google Scholar]
  6. Bray MA, Singh S, Han H, Davis CT, Borgeson B, Hartland C, Kost-Alimova M, Gustafsdottir SM, Gibson CC, Carpenter AE. Cell painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nature Protocols. 2016;11:1757–1774. doi: 10.1038/nprot.2016.105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brox T, Bruhn A, Papenberg N, Weickert J. High accuracy optical flow estimation based on a theory for warping. European Conference on Computer Vision. 2004;3024:25–36. [Google Scholar]
  8. Brox T, Bregler C, Malik J. Large displacement optical flow. IEEE International Conference on Computer Vision and Pattern Recognition; 2009. pp. 41–48. [Google Scholar]
  9. Chang M-C, Krahnstoever N, Ge W. Probabilistic group-level motion analysis and scenario recognition .  IEEE International Conference On Computer Vision; 2011. pp. 747–754. [DOI] [Google Scholar]
  10. Clark R. The Molecular and Cellular Biology of Wound Repair. Springer Science & Business Media; 2013. [Google Scholar]
  11. Dahmann C, Oates AC, Brand M. Boundary formation and maintenance in tissue development. Nature Reviews Genetics. 2011;12:43–55. doi: 10.1038/nrg2902. [DOI] [PubMed] [Google Scholar]
  12. Dixon MF, Neville PM, Mapstone NP, Moayyedi P, Axon AT. Bile reflux gastritis and Barrett's oesophagus: further evidence of a role for duodenogastro-oesophageal reflux? Gut. 2001;49:359–363. doi: 10.1136/gut.49.3.359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Eilers PH, Boelens HF. Baseline Correction with Asymmetric Least Squares Smoothing. Leiden University Medical Centre; 2005. [Google Scholar]
  14. Farnebäck G. Two-Frame Motion Estimation Based on Polynomial Expansion. Springer; 2003. [Google Scholar]
  15. Fraley C. How many clusters? which clustering method? answers via model-based cluster analysis. The Computer Journal. 1998;41:578–588. doi: 10.1093/comjnl/41.8.578. [DOI] [Google Scholar]
  16. Gaddam S, Singh M, Balasubramanian G, Thota P, Gupta N, Wani S, Higbee AD, Mathur SC, Horwhat JD, Rastogi A, Young PE, Cash BD, Bansal A, Vargo JJ, Falk GW, Lieberman DA, Sampliner RE, Sharma P. Persistence of Nondysplastic Barrett's esophagus identifies patients at lower risk for esophageal adenocarcinoma: results from a large multicenter cohort. Gastroenterology. 2013;145:548–553. doi: 10.1053/j.gastro.2013.05.040. [DOI] [PubMed] [Google Scholar]
  17. Goers L, Freemont P, Polizzi KM. Co-culture systems and technologies: taking synthetic biology to the next level. Journal of the Royal Society Interface. 2014;11:20140065. doi: 10.1098/rsif.2014.0065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Harada H, Nakagawa H, Oyama K, Takaoka M, Andl CD, Jacobmeier B, von Werder A, Enders GH, Opitz OG, Rustgi AK. Telomerase induces immortalization of human esophageal keratinocytes without p16INK4a inactivation. Molecular Cancer Research : MCR. 2003;1:729–738. [PubMed] [Google Scholar]
  19. Held M, Schmitz MH, Fischer B, Walter T, Neumann B, Olma MH, Peter M, Ellenberg J, Gerlich DW. CellCognition: time-resolved phenotype annotation in high-throughput live cell imaging. Nature Methods. 2010;7:747–754. doi: 10.1038/nmeth.1486. [DOI] [PubMed] [Google Scholar]
  20. Lačević N, Starr FW, Schrøder TB, Glotzer SC. Spatially heterogeneous dynamics investigated via a time-dependent four-point density correlation function. The Journal of Chemical Physics. 2003;119:7372–7387. doi: 10.1063/1.1605094. [DOI] [Google Scholar]
  21. Landsberg KP, Farhadifar R, Ranft J, Umetsu D, Widmann TJ, Bittig T, Said A, Jülicher F, Dahmann C. Increased cell bond tension governs cell sorting at the Drosophila anteroposterior compartment boundary. Current Biology. 2009;19:1950–1955. doi: 10.1016/j.cub.2009.10.021. [DOI] [PubMed] [Google Scholar]
  22. Leoni G, Neumann PA, Sumagin R, Denning TL, Nusrat A. Wound repair: role of immune-epithelial interactions. Mucosal Immunology. 2015;8:959–968. doi: 10.1038/mi.2015.63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Major RJ, Irvine KD. Influence of Notch on dorsoventral compartmentalization and actin organization in the Drosophila wing. Development. 2005;132:3823–3833. doi: 10.1242/dev.01957. [DOI] [PubMed] [Google Scholar]
  24. Major RJ, Irvine KD. Localization and requirement for myosin II at the dorsal-ventral compartment boundary of the Drosophila wing. Developmental Dynamics. 2006;235:3051–3058. doi: 10.1002/dvdy.20966. [DOI] [PubMed] [Google Scholar]
  25. Malinverno C, Corallino S, Giavazzi F, Bergert M, Li Q, Leoni M, Disanza A, Frittoli E, Oldani A, Martini E, Lendenmann T, Deflorian G, Beznoussenko GV, Poulikakos D, Haur OK, Uroz M, Trepat X, Parazzoli D, Maiuri P, Yu W, Ferrari A, Cerbino R, Scita G. Endocytic reawakening of motility in jammed epithelia. Nature Materials. 2017;16:587–596. doi: 10.1038/nmat4848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Maška M, Ulman V, Svoboda D, Matula P, Matula P, Ederra C, Urbiola A, España T, Venkatesan S, Balak DM, Karas P, Bolcková T, Streitová M, Carthel C, Coraluppi S, Harder N, Rohr K, Magnusson KE, Jaldén J, Blau HM, Dzyubachyk O, Křížek P, Hagen GM, Pastor-Escuredo D, Jimenez-Carretero D, Ledesma-Carbayo MJ, Muñoz-Barrutia A, Meijering E, Kozubek M, Ortiz-de-Solorzano C. A benchmark for comparison of cell tracking algorithms. Bioinformatics. 2014;30:1609–1617. doi: 10.1093/bioinformatics/btu080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Meijering E, Dzyubachyk O, Smal I. Methods for cell and particle tracking. Methods in Enzymology. 2012;504:183–200. doi: 10.1016/B978-0-12-391857-4.00009-4. [DOI] [PubMed] [Google Scholar]
  28. Merlo LM, Kosoff RE, Gardiner KL, Maley CC. An in vitro co-culture model of esophageal cells identifies ascorbic acid as a modulator of cell competition. BMC cancer. 2011;11:461. doi: 10.1186/1471-2407-11-461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Milde F, Franco D, Ferrari A, Kurtcuoglu V, Poulikakos D, Koumoutsakos P. Cell image velocimetry (CIV): boosting the automated quantification of cell migration in wound healing assays. Integrative Biology. 2012;4:1437–1447. doi: 10.1039/c2ib20113e. [DOI] [PubMed] [Google Scholar]
  30. Monier B, Pélissier-Monier A, Brand AH, Sanson B. An actomyosin-based barrier inhibits cell mixing at compartmental boundaries in Drosophila embryos. Nature cell biology. 2010;12:60–65. doi: 10.1038/ncb2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Neumann B, Held M, Liebel U, Erfle H, Rogers P, Pepperkok R, Ellenberg J. High-throughput RNAi screening by time-lapse imaging of live human cells. Nature Methods. 2006;3:385–390. doi: 10.1038/nmeth876. [DOI] [PubMed] [Google Scholar]
  32. Nketia TA, Sailem H, Rohde G, Machiraju R, Rittscher J. Analysis of live cell images: methods, tools and opportunities. Methods. 2017;115:65–79. doi: 10.1016/j.ymeth.2017.02.007. [DOI] [PubMed] [Google Scholar]
  33. Padfield D, Rittscher J, Roysam B. Coupled minimum-cost flow cell tracking for high-throughput quantitative analysis. Medical image analysis. 2011;15:650–668. doi: 10.1016/j.media.2010.07.006. [DOI] [PubMed] [Google Scholar]
  34. Park JA, Kim JH, Bi D, Mitchel JA, Qazvini NT, Tantisira K, Park CY, McGill M, Kim SH, Gweon B, Notbohm J, Steward R, Burger S, Randell SH, Kho AT, Tambe DT, Hardin C, Shore SA, Israel E, Weitz DA, Tschumperlin DJ, Henske EP, Weiss ST, Manning ML, Butler JP, Drazen JM, Fredberg JJ. Unjamming and cell shape in the asthmatic airway epithelium. Nature Materials. 2015;14:1040–1048. doi: 10.1038/nmat4357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Pau G, Walter T, Neumann B, Hériché JK, Ellenberg J, Huber W. Dynamical modelling of phenotypes in a genome-wide RNAi live-cell imaging assay. BMC Bioinformatics. 2013;14:308. doi: 10.1186/1471-2105-14-308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Peng J, Peng S, Jiang A, Wei J, Li C, Tan J. Asymmetric least squares for multiple spectra baseline correction. Analytica Chimica Acta. 2010;683:63–68. doi: 10.1016/j.aca.2010.08.033. [DOI] [PubMed] [Google Scholar]
  37. Perona P, Shiota T, Malik J. Geometry-Driven Diffusion in Computer Vision. Springer; 1994. [Google Scholar]
  38. Petitjean L, Reffay M, Grasland-Mongrain E, Poujade M, Ladoux B, Buguin A, Silberzan P. Velocity fields in a collectively migrating epithelium. Biophysical Journal. 2010;98:1790–1800. doi: 10.1016/j.bpj.2010.01.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Progatzky F, Dallman MJ, Lo Celso C. From seeing to believing: labelling strategies for in vivo cell-tracking experiments. Interface Focus. 2013;3:20130001. doi: 10.1098/rsfs.2013.0001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Rodríguez-Franco P, Brugués A, Marín-Llauradó A, Conte V, Solanas G, Batlle E, Fredberg JJ, Roca-Cusachs P, Sunyer R, Trepat X. Long-lived force patterns and deformation waves at repulsive epithelial boundaries. Nature Materials. 2017;16:1029–1037. doi: 10.1038/nmat4972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Schaffer CJ, Nanney LB. Cell biology of wound healing. International Review of Cytology. 1996;169:151–181. doi: 10.1016/s0074-7696(08)61986-5. [DOI] [PubMed] [Google Scholar]
  42. Schiegg M, Hanslovsky P, Haubold C, Koethe U, Hufnagel L, Hamprecht FA. Graphical model for joint segmentation and tracking of multiple dividing cells. Bioinformatics. 2015;31:948–956. doi: 10.1093/bioinformatics/btu764. [DOI] [PubMed] [Google Scholar]
  43. Schmitz MH, Held M, Janssens V, Hutchins JR, Hudecz O, Ivanova E, Goris J, Trinkle-Mulcahy L, Lamond AI, Poser I, Hyman AA, Mechtler K, Peters JM, Gerlich DW. Live-cell imaging RNAi screen identifies PP2A-B55alpha and importin-beta1 as key mitotic exit regulators in human cells. Nature Cell Biology. 2010;12:886–893. doi: 10.1038/ncb2092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Shadden SC, Lekien F, Marsden JE. Definition and properties of Lagrangian coherent structures from finite-time Lyapunov exponents in two-dimensional aperiodic flows. Physica D: Nonlinear Phenomena. 2005;212:271–304. doi: 10.1016/j.physd.2005.10.007. [DOI] [Google Scholar]
  45. Shishika D, Manoukis NC, Butail S, Paley DA. Male motion coordination in anopheline mating swarms. Scientific reports. 2014;4:6318. doi: 10.1038/srep06318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Souza RF. The role of acid and bile reflux in oesophagitis and Barrett's metaplasia. Biochemical Society Transactions. 2010;38:348–352. doi: 10.1042/BST0380348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Szabó B, Szöllösi GJ, Gönci B, Jurányi Z, Selmeczi D, Vicsek T. Phase transition in the collective migration of tissue cells: experiment and model. Physical Review E. 2006;74:061908. doi: 10.1103/PhysRevE.74.061908. [DOI] [PubMed] [Google Scholar]
  48. Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, Kuhn M, Bork P, Jensen LJ, von Mering C. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Research. 2015;43:D447–D452. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Tinevez JY, Perry N, Schindelin J, Hoopes GM, Reynolds GD, Laplantine E, Bednarek SY, Shorte SL, Eliceiri KW. TrackMate: an open and extensible platform for single-particle tracking. Methods. 2017;115:80–90. doi: 10.1016/j.ymeth.2016.09.016. [DOI] [PubMed] [Google Scholar]
  50. Wang H, Kläser A, Schmid C, Liu C-L. Action recognition by dense trajectories . IEEE Conference on Computer Vision and Pattern Recognition; 2011. pp. 3169–3176. [DOI] [Google Scholar]
  51. Wang B, Mezlini AM, Demir F, Fiume M, Tu Z, Brudno M, Haibe-Kains B, Goldenberg A. Similarity network fusion for aggregating data types on a genomic scale. Nature Methods. 2014;11:333–337. doi: 10.1038/nmeth.2810. [DOI] [PubMed] [Google Scholar]
  52. Xie W, Noble JA, Zisserman A. Microscopy cell counting and detection with fully convolutional regression networks. Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization. 2016;6:283–292. [Google Scholar]
  53. Zaritsky A, Natan S, Ben-Jacob E, Tsarfaty I. Emergence of HGF/SF-induced coordinated cellular motility. PLOS ONE. 2012;7:e44671. doi: 10.1371/journal.pone.0044671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Zaritsky A, Kaplan D, Hecht I, Natan S, Wolf L, Gov NS, Ben-Jacob E, Tsarfaty I. Propagating waves of directionality and coordination orchestrate collective cell migration. PLOS Computational Biology. 2014;10:e1003747. doi: 10.1371/journal.pcbi.1003747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Zaritsky A, Welf ES, Tseng YY, Angeles Rabadán M, Serra-Picamal X, Trepat X, Danuser G. Seeds of locally aligned motion and stress coordinate a collective cell migration. Biophysical Journal. 2015;109:2492–2500. doi: 10.1016/j.bpj.2015.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Zaritsky A, Tseng YY, Rabadán MA, Krishna S, Overholtzer M, Danuser G, Hall A. Diverse roles of guanine nucleotide exchange factors in regulating collective cell migration. The Journal of Cell Biology. 2017;216:1543–1556. doi: 10.1083/jcb.201609095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Zhou B, Tang X, Wang X. Measuring crowd collectiveness. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; United States. 2013. pp. 3049–3056. [Google Scholar]
  58. Zhou FY. MOSES (Motion Sensing Superpixels) 9ab910fGitHub. 2019 https://github.com/fyz11/MOSES

Decision letter

Editor: Hamid Mohammadi1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

[Editors’ note: a previous version of this study was rejected after peer review, but the authors submitted for reconsideration. The first decision letter after peer review is shown below.]

Thank you for submitting your work entitled "Motion Sensing Superpixels (MOSES) is a systematic framework to quantify and discover cellular motion phenotypes" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and a Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Roberto Cerbino (Reviewer #3).

As you will note, the three expert reviewers find some merit in the MOSES method but raise many issues that preclude publication. As we feel the additional work needed to address the issues raised by the reviewers would take more than two months to complete, we are returning your submission to you now in case you wish to submit elsewhere for speedy publication. Further, the number and complexity of these issues make the editorial team reluctant to encourage a resubmission unless a very substantial amount of additional work was performed. In this case, eLife would be willing to look at a revised paper. Please note that it would be treated as a new submission with no guarantees of acceptance. The two most critical factors from an editorial perspective are:

i) Demonstrating the utility of the method on a diverse range of imaging inputs (reviewer #3 point #5 – there is also a reference to some suggested data) and;

ii) Increased biological measurements, including analysis of proliferation and/or cell density. Given that this study is focused on a new technique, the final decision on this work will be highly dependent on convincingly demonstrating the broad utility of the methodology to the cell migration and tissue morphogenesis community. In addition, it will be important to address the technical issues raised, including out of plane movement or loss of cells, further analysis of the 'boundary', and improving the readability of the study.

Our decision has been reached after consultation between the reviewers.

Reviewer #1:

In the manuscript "Motion Sensing Superpixels (MOSES): A systematic framework to quantify and discover cellular motion phenotypes", Zhou et al. present a new method for measuring cell motility in epithelial sheets and apply this to boundary formation in Barrett's Esophagus.

The approach presented fills the technical gap between PIV/Optical flow measurements and single-cell tracking and provides a method to characterise, and through PCA, discover, motion phenotypes within 2D epithelia. This is certainly a technically interesting concept that is relatively simple to implement and provides insight into motion phenotypes within a tissue. This method will be broadly applicable to many problems in characterising cellular motion. In general I find the manuscript well written and comprehensive in its description of the technique, but not particularly easy to read.

One of the difficulties is that it is unclear to me whether the authors intend to reveal new insight into the fundamental phenomena or simply provide a framework for automating the analysis for e.g. drug discovery. I would argue that cell state (sub cellular protein levels, cell cycle, cell death/extrusion, as examined in Schmitz et al., 2010, Pau et al., 2013, Held et al., 2010) are equally important and are not addressed by the current method.

In the present manuscript, only the cell motion is considered and yet cell proliferation/density is likely to be very important also. Can the authors provide cell density measurements (a proxy for cell proliferation) as a function of time, perhaps using the CNN counting approach, to give some estimate of motion versus proliferation?

This would be particularly interesting in light of the experiments involving titration of EGF. These are very interesting and provide some insight into how collective dynamics/cell adhesion are important in the tissue mechanics in general.

Another consideration is that of axial/upward motion at the interface between the two epithelial monolayers as they meet. Presumably the MOSES approach would register very little motion in a region where cells are being extruded upward (as in Figure 1B). How do the authors envisage addressing this?

Overall, I think that this is a technically interesting manuscript, which provides a set of tools for measuring and characterising cell motion phenotypes from high-throughput time-lapse imaging. This is likely to be a broadly useful tool for the community and addresses a long-standing problem of characterising mesoscale behaviour with close to single-cell accuracy. In my opinion, a revised manuscript would need to provide some additional information regarding the proliferation/density/state of cells within the tissue also.

Reviewer #2:

In this study, the authors utilized their newly developed mesh-based computational framework, MOSES, to analyze the collective motion of the cells and interactions between epithelial monolayers before and after their lateral collision. They have examined three main epithelial interactions that occur in normal esophagus to esophageal adenocarcinoma progression. These interactions (Video 2) lead to formation of a stable boundary with highly dynamic motion of the two cell populations after collision (in squamous-squamous interactions), a stable boundary with less dynamic motion of cell populations after collision and having squamous monolayer pushed by columnar monolayer (squamous-columnar), and no boundary formation with retraction of squamous cell population after collision (squamous-cancer). Authors mainly define two measurements (boundary formation index and motion stability index) to quantify the differences of these boundaries. Both measurements result in selection of squamous-columnar as stable boundary while leave the other two cell combinations in the "unstable boundary" category. Further, authors investigate the effect of activation of EGFR signaling pathway in boundary formation using MOSES. While the approach is appropriate, the current level of analyses does not provide substantial enough improvement relative to other works to justify publication in eLife. There are a number of critiques that authors should address before considering this manuscript for publication.

1) Although the nature of boundaries between squamous-squamous and squamous-cancer are distinct, both are considered as "no boundary" based on boundary index analysis and "unstable boundary" based on stability index analysis. Authors should define new measurements to extract the differences between these two boundaries.

2) As stated by the authors, PIV can be also used for motion extraction in dense monolayers. Authors should clearly state how potential users might benefit using MOSES that otherwise would not be possible using PIV.

3) How would the mesh analysis be affected in case that there would be no continuity in monolayers after collision (i.e. occurrence of detached patches of cells after collision)? Could it be captured in disorder index analysis?

4) Authors have used no serum condition in order to disrupt the cell-cell contact within the monolayers. As serum may affect numerous cell functions, it is not clear if loss of collective sheet migration is because of the reduced cell-cell contacts. Authors are encouraged to use cells that have deleted intercellular adhesion (e.g. α catenin).

5) It would be beneficial to measure migratory behavior of each superpixel with respect to its distance from the boundary before and after collision.

6) How may the analyses change in cases that a combination of stable and unstable boundaries exist in the same field of view?

7) Authors state that in squamous-squamous interactions when cells are exposed to 20 ng/ml of EGF the values of boundary and stability indices are similar to that of squamous-cancer interactions. However, there is no retraction observed in squamous-squamous interactions (Video 7). Again, this needs to be resolved.

Reviewer #3:

The manuscript under review describes a method (MOSES) to quantify cell dynamics. The method is tested with epithelial monolayers made of different cell types. MOSES appears to be an interesting approach to the quantification of cell dynamics. However, for the reasons outlined below, I cannot recommend publication of the manuscript in the present form.

1) The paper is too technical to be really useful for potential users. In particular:

1a) The description of the track filtering and mesh formulation steps are so involved that in the end, I did not understand how a mesh is generated and used.

1b) In a similar manner, the boundary formation index (and to some extent also the motion stability index) is introduced in a way that requires a bona fide effort from the reader to believe that it really measures what is claimed. In my view, a methodological paper like this one, does not benefit from having all the important definitions in the Materials and methods section. Also, it would be greatly appreciated to be able to understand why the proposed parameters for the quantification of the cell dynamics outperform other possible choices.

1c) There are parts that are incomprehensible, such as for instance the paragraph entitled Automated cell counting with convolutional neural networks. This is a pity because the principal component analysis represents one of the most promising features of the method.

1d) Some quantities are not sufficiently defined. For instance, how is TrackMate similarity in Figure 2—figure supplement 2C defined? How is the distance defined in the vertical axis of Figure 1 - figure supplement 5B?

1e) Some choices are not clearly explained: why the cut-off for boundary formation and stability are defined by using the standard deviation? This choice seems to me to be an ambiguous one. For instance, a cut-off for boundary formation is "defined statistically as one standard deviation higher than the pooled mean of all three combinations (Figure 3C). Above this cut-off, cell combinations are categorised as forming a boundary". Surprisingly, a few lines below the authors write "Similarly, experiments in 0% serum were used to set the global motion stability threshold (0.87), one standard deviation below the pooled mean" (Figure 3D). Can the authors explain this asymmetry? The use of standard deviations for fixing thresholds would need in my view that a band exists (pooled mean +/- standard deviation) where the behavior is not determined. Only below mean + sd and above mean - sd a clear behavior could be attributed.

I suggest that the authors consider seriously revising the structure of their paper to make it understandable to a wide audience and to enable the potential reader to properly evaluate the powerfulness and the limitations of the proposed approach.

2) The literature review presented in the Introduction is also not clear, being a puzzling mix of methods to quantify the cell dynamics (such as tracking and PIV) and models (such as the vertex model). To me, the authors fail in merging successfully these two branches of literature and, in particular, in explaining where does MOSES fit and why the previously available tools, are not as good as MOSES in terms of robustness, sensitivity, automatization, and unbiasedness.

3) There are some claims that seem not to be supported by the experimental results. Typical examples are:

3a) Subsection “In-vitro model to study the spatio-temporal dynamics of boundary formation between different cell populations”, first paragraph: Figure 1—figure supplement 2 is cited in support of the fact that cells labeled with different color dye move in a similar way. Inspection of SF2(Figure 1 - figure supplement 2) shows that in some cases the MSD can differ by about one order of magnitude. How is it that the authors consider this a proof of similarity?

3b) The method is allegedly working perfectly in an automatic and unbiased fashion. How comes that the boundary in Figure 3—figure supplement 9A does not seem to be adequately determined?

3c) The statement "The mesh disorder index showed statistically significant increases with EGF concentration" does not seem to be supported by the data in Supplementary Figure 9, where a non-monotonic behavior would also be compatible with the data.

4) There are some claims whose general validity (i.e. in other experiments) is rather dubious.

4a) The Authors claim that "individual cells behave similarly to their neighbors so global motion patterns can be used as a proxy to study single cell behavior". I doubt that this is always true. For instance, for the liquid states found in [Nature Materials 16, 587-596 (2017)] I believe that this claim wouldn't be true.

4b) Given that the L1-norm at is not defined, what is the general validity of the statement "We use L1 for robustness as this value is > 1"?

5) The authors propose MOSES as a robust, sensitive, automatic and unbiased method and I trust them that this might be true. However, the current version of the manuscript proves that MOSES works fairly well in the few cases selected by the authors and it is not clear to me that it could be really used successfully in all other cases. Maybe, the authors could comment on this by suggesting cases in which they expect MOSES to work and cases where they think it would not. For instance, do the authors think that MOSES would work for the experiments in [Nature Materials 16, 1029 (2017)]?

In summary, I do see some potential in MOSES but the current version is not making a good job in explaining clearly what the method does, why is it better than other approaches and when it should be applicable. I hope that the authors will be able to address the above issues.

[Editors’ note: what now follows is the decision letter after the authors submitted for further consideration.]

Thank you for submitting your article "Motion Sensing Superpixels (MOSES): A systematic framework to quantify and discover cellular motion phenotypes" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Didier Stainier as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Roberto Cerbino (Reviewer #2); Alison McGuigan (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

This manuscript is well improved and of high scientific value; however two of the reviewers raised several significant points that need clarification. These should be addressed in detail to resolve their concerns. The issue of serum-free condition as negative control may require additional experimentation for validation.

Reviewer #1:

The second version of the manuscript has been improved extensively and reads better than the first one. However, there are a few criticisms the authors should address before considering this manuscript for publication.

An important feature of the tissue boundaries is their shape. It would be beneficial if authors could also incorporate the analysis of boundary roughness in their computational framework.

Authors stated that the "cancer cell line OE33 pushed EPC2 out of the field of view". Since the traction forces applied at the interface between the two sheets is not presented in the manuscript, it is unclear whether the EPC2 cells retracted upon contact with OE33 cells or are continuously pushed by application of physical forces exerted by OE33 cells at the interface.

Authors have conducted new experiments, where the impact of depletion of Ca2+ on boundary formation is examined, to elucidate how the absent of cell-cell contact may disrupt the collective motion of the epithelial sheets. However, it is still unclear how serum depletion impacts the proliferation rate of the cells and that affects the collective motion of the sheets.

In conclusion, authors should present how much their computational framework can potentially improve our knowledge in biological processes and how it could be used to tackle new challenges.

Reviewer #2:

The authors have done an extensive amount of work to try to address all the reviewers' comments. As a result, the manuscript is now more clear and convincing in describing the proposed MOSES approach for quantifying the motility of cells belonging to a collective. I thus recommend publication in eLife.

Reviewer #3:

This paper presents a potentially useful tool to enable quantification of cell movement from large numbers of movies to better identify molecules that modulate collective cell migration dynamics. The scale of the data collected is impressive but I found this paper incredibly challenging to read and to understand what the data measurements physically represented and what these measurements meant biologically for our understanding of cell cooperation during boundary formation. Furthermore, I did not really understand the argument made by the authors that serum free represents no boundary formation versus for example delayed boundary formation since cells will move slower and proliferate less (and have a gap to fill before a boundary can form). This manuscript is interesting but is not currently accessible to a broad readership in my opinion.

The major comment I had that I think could significantly improve the impact of the work is can the authors better highlight the functional importance of the metrics they are quantifying in terms of the biological behaviours. For example is a boundary the best thing to be describing here or is wound healing a better example to be able to extract out different cell movement regimes to highlight the power of the tool? I could not understand how the metrics highlighted here could give me new insight into how a boundary is formed therefore it was not clear what I could learn using the tool.

The use of serum free medium to prevent boundary formation does not seem the most robust approach as this also likely impacts cell movement and proliferation, which will impact the timing required to form the boundary in the model being used (since the cells have to proliferate to fill the open space between the two cell domains. I am not sure of the logic behind specifying that doing things in serum free media gives a negative control that corresponds to a no-boundary formation case.

eLife. 2019 Feb 26;8:e40162. doi: 10.7554/eLife.40162.048

Author response


[Editors’ note: the author responses to the first round of peer review follow.]

Reviewer #1:

In the manuscript "Motion Sensing Superpixels (MOSES): A systematic framework to quantify and discover cellular motion phenotypes", Zhou et al. present a new method for measuring cell motility in epithelial sheets and apply this to boundary formation in Barrett's Esophagus.

The approach presented fills the technical gap between PIV/Optical flow measurements and single-cell tracking and provides a method to characterise, and through PCA, discover, motion phenotypes within 2D epithelia. This is certainly a technically interesting concept that is relatively simple to implement and provides insight into motion phenotypes within a tissue. This method will be broadly applicable to many problems in characterising cellular motion. In general I find the manuscript well written and comprehensive in its description of the technique, but not particularly easy to read.

One of the difficulties is that it is unclear to me whether the authors intend to reveal new insight into the fundamental phenomena or simply provide a framework for automating the analysis for e.g. drug discovery.

MOSES is a computational framework that was developed with an aim to reveal new insights into fundamental phenomena of cellular motion dynamics. However, the biological data generated in this study are best suited to demonstrate the potential of MOSES rather than to conclude any new insight into the fundamental biological principles of cellular motion dynamics. In the revised manuscript, we have carried out a comprehensive comparison between MOSES and a widely used biological motion analysis framework, PIV (Particle Image Velocimetry). We illustrated a number of advanced features of MOSES over PIV and additionally have now emphasised the potential of MOSES in automated high-throughput screening applications. The ability of MOSES to employ a systematic analysis pipeline to generate motion maps in an unbiased way that captures local and global motion patterns through a dynamic mesh construction, will enable future study of novel biological insights in cellular motion dynamics. These points have now been clarified in the revised manuscript.

I would argue that cell state (sub cellular protein levels, cell cycle, cell death/extrusion, as examined in Schmitz et al., 2010, Pau et al., 2013, Held et al., 2010) are equally important and are not addressed by the current method.

General methods for quantifying cell states are indeed important and pattern recognition methods are being applied to analyse protein levels. When cells are sparse, existing published software such as TrackMate, CellProfiler, CIV can be used for single cell or particle tracking. Our work here, however, primarily focuses on the extraction and analysis of collective biological motion and complex phenotypes regardless of cellular density for which software is rarely available and current analysis capabilities are limited. We have now mentioned this in the Discussion.

In the present manuscript, only the cell motion is considered and yet cell proliferation/density is likely to be very important also. Can the authors provide cell density measurements (a proxy for cell proliferation) as a function of time, perhaps using the CNN counting approach, to give some estimate of motion versus proliferation?

This would be particularly interesting in light of the experiments involving titration of EGF. These are very interesting and provide some insight into how collective dynamics/cell adhesion are important in the tissue mechanics in general.

Thank you for the suggestion. We considered that for cell counting to accurately reflect cell proliferation rate, the whole field of view should be covered or the cell population should be largely static (not migratory). In our videos of migrating epithelial sheets, the cells are both moving and proliferating. In these circumstances we propose two approaches to estimate cell proliferation independent of migration.

i) Cell density counting using CNNs (Materials and methods). To avoid issues of moving areas, cells were counted from randomly sampled equal sized image window patches of 64x64 pixels with full cell coverage inside. The average cell density for each cell population was reported as the average over 100 random such windows per time point (Materials and methods).

ii) Quantification of the rate in fluorescence decay. As fluorescently stained cells divide, over time the dye intensity exponentially decays. The faster the division rate, the faster the decay rate. (Materials and methods).

As requested, we applied this to the EGF experiments. Results are shown in new Figure 4—figure supplement 1. Both methods show no differences in the cell proliferation rates of a confluent epithelial sheet with increasing EGF concentration in 5% serum. This is different from a sparse culture in which EGF concentration is more likely to influence cell proliferation rate.

Another consideration is that of axial/upward motion at the interface between the two epithelial monolayers as they meet. Presumably the MOSES approach would register very little motion in a region where cells are being extruded upward (as in Figure 1B). How do the authors envisage addressing this?

Our primary interest in developing a scalable approach for high-throughput screening led us to focus our work on monolayers of cells and 2D timelapse videos, which are the most readily available. However we note that Boquet-Pujadas et al. (BioFlow: a non-invasive, image-based method to measure speed, pressure and forces inside living cells. Scientific reports 7.1 (2017): 9178) have shown already that the optical flow method of motion extraction can be extended with additional mathematical constraints to capture out-of-plane motion. Although the current version of MOSES does not contain such a feature however it can be extended and applied on 3D+time imaging to capture such extrusion processes in the future. This possibility has been discussed in our revised manuscript (see revised main text).

Reviewer #2:

[…] 1) Although the nature of boundaries between squamous-squamous and squamous-cancer are distinct, both are considered as "no boundary" based on boundary index analysis and "unstable boundary" based on stability index analysis. Authors should define new measurements to extract the differences between these two boundaries.

To address this reviewer’s comment, we quantified the observed differences between the squamous-squamous and squamous-cancer boundaries displaced distance and the results is now shown in the new Figure 1G.

2) As stated by the authors, PIV can be also used for motion extraction in dense monolayers. Authors should clearly state how potential users might benefit using MOSES that otherwise would not be possible using PIV.

We include a new section and a summary table in the revised manuscript that explicitly compares the benefits of MOSES with respect to PIV using two published datasets. Please see Results section “Comparison between MOSES and PIV” and new Figure 6.

3) How would the mesh analysis be affected in case that there would be no continuity in monolayers after collision (i.e. occurrence of detached patches of cells after collision)? Could it be captured in disorder index analysis?

To address this reviewer’s comments to perform the mesh analysis when there is no continuity in monolayers after collision we used a video of EPC2 (R):OE33 (G) cells with 5ng/ml of Lapatnib (EGF inhibitor). At this concentration, after the two sheets collide, EPC2 cells rapidly die and shrivel up, creating numerous ‘gaps’ in the epithelial sheet. The results are shown in the new Figure 3—figure supplement 3, where these irregular movements in EPC2 are clearly captured in snapshots of the mesh (Figure 3—figure supplement 3A), and registered in the corresponding mesh strain curve (Figure 3—figure supplement 3B).

4) Authors have used no serum condition in order to disrupt the cell-cell contact within the monolayers. As serum may affect numerous cell functions, it is not clear if loss of collective sheet migration is because of the reduced cell-cell contacts. Authors are encouraged to use cells that have deleted intercellular adhesion (e.g. α catenin).

Although we agree with this reviewer that depleting α catenin will disrupt cell/cell contact, to be able to disrupt and re-establish cell-cell contacts with high efficiency and reproducibility is crucial to test the robustness of MOSES. This is the reason why we did not use genetic manipulation to disrupt cell-cell contacts because the efficacy of RNAi or CRISPR mediated depletion of any given gene can be highly variable among different experimental conditions and it is not easily reversible.

Depletion of Ca2+ is very well known to disrupt the cell-cell contacts in epithelial cells and addition of Ca+ can also induce the re-establishment of the cell/cell contacts. Thus we measured the corresponding boundary formation index in cells with Ca+ depletion or with various concentrations of Ca+ added back to culture mediums as indicated. Additionally we performed the experiment with different FBS concentrations and different media; KSFM, KSFM+RPMI and KSFM+RPMI+FBS, to control for potential media effects. To measure motion collectiveness we used the standard velocity order parameter. The results are shown in the Author response image 1. Across all samples a significant correlation between the motion collectiveness as measured by the velocity order parameter and the boundary formation index (Pearson correlation coefficient, r = 0.41) was found when all n = 33 samples were pooled together (right hand scatter plot in Author response image 1).

Author response image 1.

Author response image 1.

All numerical concentrations without units in the figure refer to ng/ml, thus Cal:1000 denotes a 1000ng/ml concentration of Ca2+.

5) It would be beneficial to measure migratory behavior of each superpixel with respect to its distance from the boundary before and after collision.

Due to the difficulty of representing this information for all the superpixels (1000) across 144 frames and in 2D with a moving boundary, we approximated the interface between the two cells as a vertical line and constructed velocity kymographs from the extracted dense optical flow which the superpixels use to update their (x,y) positions, as in the paper of Trepat (Rodríguez-Franco et al., 2017). These are included as the new Figure 3A for the three principal cell combinations EPC2:EPC2, EPC2:CP-A and EPC2:OE33 in 0% and 5% serum, and as the new Figure 4B for EGF addition to EPC2:CP-A in serum.

6) How may the analyses change in cases that a combination of stable and unstable boundaries exist in the same field of view?

Unfortunately we do not have a video dataset where we have exclusively studied the coexistence of stable and unstable boundaries in the same field of view. However our experiments with EGF inhibition when applied to the cell combinations of EPC2:CP-A (squamous-columnar) and EPC2:OE33 (squamous-cancer) contain a small set of examples for illustration of the required changes and we report our results in Author response image 2.

Author response image 2.

Author response image 2.

In Author response image 2A, we observed partial rescue of the EPC2:OE33 phenotype upon EGF inhibition with motion behaviour more similar to EPC2:CP-A however at these concentrations the EGF inhibitor (Lapatnib) is toxic to the cells and leads to death. This effectively induces the formation of ‘multiple unstable boundaries’ in the EPC2 cells with a moving OE33 front. The boundary formation index cut-offs established in the paper in new Figure 3C implicitly assume the presence of one boundary. Applied here, whilst the boundary formation index still captures the motion concentration in the motion saliency map, as stated in the new main text, it will no longer be specific in only quantifying the interface (the boundary) between the two cell populations (Author response image 2B). To handle the presence of multiple boundaries, one can segment the different unique boundaries by thresholding on the motion saliency map and deriving more specific measures such as counting the number of ‘spot-like’ sites.

7) Authors state that in squamous-squamous interactions when cells are exposed to 20 ng/ml of EGF the values of boundary and stability indices are similar to that of squamous-cancer interactions. However, there is no retraction observed in squamous-squamous interactions (Video 7). Again, this needs to be resolved.

We assume by retraction the reviewer is referring to the squamous-columnar (EPC2:CP-A) combination where the CP-A cells push the squamous EPC2 cells out of the field of view. We have quantified this displaced distance relative to the size of the image in the new Figure 4D with the EPC2:OE33 videos as reference. Grouping the displaced distance in terms of <5ng/ml and >=5ng/ml, the difference in displaced distance is highly significant, and there was no statistically significant difference in the displaced distance between EPC2:CP-A with 20ng/ml EGF and EPC2:OE33.

Reviewer #3:

The manuscript under review describes a method (MOSES) to quantify cell dynamics. The method is tested with epithelial monolayers made of different cell types. MOSES appears to be an interesting approach to the quantification of cell dynamics. However, for the reasons outlined below, I cannot recommend publication of the manuscript in the present form.

1) The paper is too technical to be really useful for potential users. In particular:

1a) The description of the track filtering and mesh formulation steps are so involved that in the end, I did not understand how a mesh is generated and used.

We have simplified the definitions and abstracted the details more in the main text and the Materials and methods section, and refer interested/more specialised users to the Materials and methods and source code. We have added a schematic diagram of MOSES as a new Figure 2, and the new Figure 2—figure supplement 1 illustrates the technical explanation of the track filtering, which is detailed in the Materials and methods.

Track Filtering:

Our regular spaced superpixels across the whole image act like mini-cameras that assess the local velocity. In the initial frames, only those superpixels that cover the epithelial sheet will move. The rest will remain static as they cover the background image pixels that are black. The objective of the track filtering is to only identify those ‘wanted’ superpixels that cover the movement of the epithelial sheet. Using the above observation, briefly the wanted superpixels can be captured by thresholding on the observed velocities of superpixels in the initial few frames. We have added new text on this in the Materials and methods section of the main text.

Mesh formulation:

The mesh formulation aims to capture the movement of individual superpixels with respect to its local neighbouring superpixels. This involves taking each individual superpixels and connecting it to its neighbours. What constitutes a neighbour? Mathematically this is user defined based on the particular application. For example the nearest k superpixels based on distance gives rise to the popular k-NN neighbour graph when considering flocking behaviour, or one can choose all superpixels within a cutoff distance dc as we have done here. The definition of neighbour can be recalculated at every time frame but here we only compute the neighbour at the initial timepoint, frame = 0 and assume the same neighbours are retained in all subsequent frames. By doing so if the distance between the neighbours increases dramatically this suggests disruptive motion (a group of cells that started close do not end up in the same spatial area). We have added new text also on this in the Materials and methods section of the main text.

1b) In a similar manner, the boundary formation index (and to some extent also the motion stability index) is introduced in a way that requires a bona fide effort from the reader to believe that it really measures what is claimed.

We apologise for being too complex in our descriptions, and have worked to improve them throughout the manuscript. The boundary formation index is derived from the motion saliency map which is theoretically motivated by Lagrangian mechanics (subsection “Quantitative measurement of

squamous and columnar epithelial boundary formation using MOSES”, second paragraph) in the new main text and above discussion regarding divergence of a motion field).

The mesh stability index (called motion stability index in the original manuscript) is introduced to measure the rate of movement of neighbouring superpixels relative to each other. If the distances between adjacent superpixels are constantly moving then this leads to an unstable mesh and a low stability index. On the other hand if the distance between adjacent cell groups do not change with respect to their initial configuration then the stability index is high. We highlight this property with simulated data (see new Figure 3—figure supplement 4), and have endeavoured to make all the descriptions easier for readers outside the field to follow.

In my view, a methodological paper like this one, does not benefit from having all the important definitions in the Materials and methods section. Also, it would be greatly appreciated to be able to understand why the proposed parameters for the quantification of the cell dynamics outperform other possible choices.

We have now ensured that the important definitions are explained in the main text and refer only to the very detailed definitions and methods in the supplementary material. We also now include a table (new Table 1) whereby each statistical indices in the paper has a biological definition/interpretation (e.g., mesh order measures the collectiveness of local cellular migration) and example biological applications. Finally a number of new figures have been introduced to illustrate these more complicated concepts (see new Figure 3—figure supplements 2-5).

1c) There are parts that are incomprehensible, such as for instance the paragraph entitled Automated cell counting with convolutional neural networks. This is a pity because the principal component analysis represents one of the most promising features of the method.

We apologise for the confusion. The paragraph entitled ‘automated cell counting with convolutional neural networks’ in the Materials and methods has now been edited for clarity. A new supplementary figure has been added (see new Figure 1—figure supplement 1) that clarifies better the training and application procedure of the convolutional neural network.

1d) Some quantities are not sufficiently defined. For instance, how is TrackMate similarity in Figure 2—figure supplement 2C defined? How is the distance defined in the vertical axis of Supplementary Figure 9D?

Similarity of MOSES tracks with TrackMate tracks are measured using the normalised cross-correlation between the paired tracks (see new Figure 2—figure supplement 2 legend). The potentially confusing original Supplementary Figure 9 showing the mesh disorder index is now removed and no longer part of the manuscript. The distance was in number of superpixels as in the spatial correlation computation of Figure 1—figure supplement 5B.

1e) Some choices are not clearly explained: why the cut-off for boundary formation and stability are defined by using the standard deviation? This choice seems to me to be an ambiguous one.

The metrics exhibit a unimodal distribution (see violin plots of new Figure 3C-E). Correspondingly it is statistically justified to use the standard deviation and the mean to set cut-offs. This practice is equivalent to using a t-distribution/normal distribution and in line with the standard practice of one-tailed t-tests. We have added a new section in the Materials and methods to explain this more clearly.

For instance, a cut-off for boundary formation is "defined statistically as one standard deviation higher than the pooled mean of all three combinations (Figure 3B). Above this cut-off, cell combinations are categorised as forming a boundary".

The higher the boundary formation index, the greater the probability of forming a boundary. Thus we set one standard deviation higher than the pooled mean. This corresponds to a one-tailed t-test where we need to observe values at least as high as this threshold to confidently predict a boundary. The null hypothesis is the absence or insufficient evidence for boundary formation with the pooled combinations.

Surprisingly, a few lines below the authors write "Similarly, experiments in 0% serum were used to set the global motion stability threshold (0.87), one standard deviation below the pooled mean". Can the authors explain this asymmetry?

Similarly the lower the mesh stability the more the epithelial sheet moves as a whole. We set one standard deviation lower than the pooled mean corresponding to the one-tailed test where we need to observe values at least as low as this to predict instability. The null hypothesis is that with the pooled combinations we already see mesh stability. Therefore the interesting phenomena to test for here is instability.

By choosing the cut-offs in this way we are being more stringent with what constitutes ‘boundary-forming’ and ‘unstable’ which are the two ‘interesting’ phenotypes contrary to the norm.

The use of standard deviations for fixing thresholds would need in my view that a band exists (pooled mean +/- standard deviation) where the behavior is not determined. Only below mean+ sd and above mean-sd a clear behavior could be attributed.

Our threshold choices are equivalent to fitting the data with a 2-class classifier. The reviewer’s suggestion of +/- cutoffs corresponds to a 3-class classifier is also a plausible model where the extra class is used as a ‘miscellaneous’ class where we are unsure of the phenotype. We believe that both models are justified. We chose to use the former which is more fitting in the context of drug screening.

I suggest that the authors consider seriously revising the structure of their paper to make it understandable to a wide audience and to enable the potential reader to properly evaluate the powerfulness and the limitations of the proposed approach.

We have improved the readability of the manuscript during our revision and the revised manuscript have been read by various colleagues including a former journal editor outside the field. We hope the revised manuscript is now more understandable to the broader reader.

2) The literature review presented in the Introduction is also not clear, being a puzzling mix of methods to quantify the cell dynamics (such as tracking and PIV) and models (such as the vertex model). To me, the authors fail in merging successfully these two branches of literature and, in particular, in explaining where does MOSES fit and why the previously available tools, are not as good as MOSES in terms of robustness, sensitivity, automatization, and unbiasedness.

We have carried out extensive modifications to the Introduction by providing more technical context for our work with respect to the existing literature. We have also included a new comprehensive comparison between MOSES and PIV, the most commonly used method and demonstrate the more advanced features of MOSES over PIV in the main text and Discussion.

3) There are some claims that seem not to be supported by the experimental results. Typical examples are:

3a) Subsection “In-vitro model to study the spatio-temporal dynamics of boundary formation between different cell populations”, first paragraph: Figure 1—figure supplement 2 is cited in support of the fact that cells labeled with different color dye move in a similar way. Inspection of SF2 (Figure 1—figure supplement 2) shows that in some cases the MSD can differ by about one order of magnitude. How is it that the authors consider this a proof of similarity?

Thank you for the comments. We have investigated the issue and found a batch effect inherent in the MSD calculation. In one batch of videos, one cell type was plated more asymmetrically (occupying ~70% of the image) as opposed to the normal 50% and we didn’t filter these tracks out in the MSD computation. The corrected quantification is presented in the new Figure 1—figure supplement 2 and statistical tests have been carried out on the extracted MSD components.

3b) The method is allegedly working perfectly in an automatic and unbiased fashion. How comes that the boundary in Figure 3—figure supplement 9A does not seem to be adequately determined?

Figure 3—figure supplement 9A illustrates the process by which we infer the gap closure frame starting from image segmentation of the fluorescent images. The objective was not to delineate perfectly the sheet boundaries but to accurately infer the time of gap closure for downstream analysis. Comparison of the inferred gap closure times over n = 246 videos with the corresponding consensus manual annotations of 3 independent researchers demonstrate high correlation (0.902 Pearson correlation) and an agreement of 94% within 5 frames, (new Figure 3—figure supplement 9E) which is very comparable to the 97% accuracy between individual humans. This justifies the initial quality of image segmentation used for the intended purpose of inferring the frame in which the gap between the two sheets are closed.

3c) The statement "The mesh disorder index showed statistically significant increases with EGF concentration" does not seem to be supported by the data in Supplementary Figure 9, where a non-monotonic behavior would also be compatible with the data.

During the revision, we carried out extensive testing using our own and published datasets and concluded that the newly proposed mesh order in the revised manuscript is more general than the initially proposed mesh order index as a measure of collective motion based on our MOSES mesh construction. The previously proposed mesh disorder index was only applicable between videos with identical initial configuration i.e. same type of cells and plating as in the case of the EGF titration videos. The results shown in Supplementary Figure 9 (original manuscript) was intended to highlight that the mesh disorder index deviates from uniformity when EGF concentrations > 5ng/ml. Statistical tests were carried out to identify this deviation from a constant line model which is reasonable for 0-5ng/ml. Thus overall the behaviour is non-monotonic. With our new improved mesh order measurement, Supplementary Figure 9 of the original manuscript is now removed as we no longer use the mesh disorder index.

4) There are some claims whose general validity (i.e. in other experiments) is rather dubious.

4a) The Authors claim that "individual cells behave similarly to their neighbors so global motion patterns can be used as a proxy to study single cell behavior". I doubt that this is always true. For instance, for the liquid states found in [Nature Materials 16, 587-596 (2017)] I believe that this claim wouldn't be true.

We address this point together with point 5 (see below).

4b) Given that the L1-norm at is not defined, what is the general validity of the statement "We use L1 for robustness as this value is > 1"?

We apologise for neglecting to define this norm in our previous version of the manuscript. We have added a new section “Normalised Mesh Strain, L1-norm and robustness” in the Materials and methods explaining our design choice.

5) The authors propose MOSES as a robust, sensitive, automatic and unbiased method and I trust them that this might be true. However, the current version of the manuscript proves that MOSES works fairly well in the few cases selected by the authors and it is not clear to me that it could be really used successfully in all other cases. Maybe, the authors could comment on this by suggesting cases in which they expect MOSES to work and cases where they think it would not. For instance, do the authors think that MOSES would work for the experiments in [Nature Materials 16, 1029 (2017)]?

We thank the reviewer for their help in making available the raw videos of [Nature Materials 16, 587-596 (2017)] available. Together with the videos in [Nature Materials 16, 1029 (2017)] as our validation datasets we have refined our claims in the revised manuscript. In addition we profiled the similarity and differences between MOSES and PIV. The new results are included in the revised manuscript as a new section with an additional Figure 6 and accompanying summary table.

[Editors' note: the author responses to the re-review follow.]

Reviewer #1:

The second version of the manuscript has been improved extensively and reads better than the first one. However, there are a few criticisms the authors should address before considering this manuscript for publication.

An important feature of the tissue boundaries is their shape. It would be beneficial if authors could also incorporate the analysis of boundary roughness in their computational framework.

We quantified the ‘boundary roughness’ according to the deviation of the geometrical boundary shape from a straight line, L/L0 where L is the length of the boundary and L0 is the length of the straight line joining the endpoints of the boundary as in Javaherian, Sahar, et al (Modulation of cellular polarization and migration by ephrin/Eph signal-mediated boundary formation." Integrative Biology 9.12 (2017): 934-946), (Figure 3—figure supplement 10). No significant departures from a straight line (L/L0=1) were found. A new paragraph for extracting the boundary to calculate boundary shape has been added to the Materials and methods.

Authors stated that the "cancer cell line OE33 pushed EPC2 out of the field of view". Since the traction forces applied at the interface between the two sheets is not presented in the manuscript, it is unclear whether the EPC2 cells retracted upon contact with OE33 cells or are continuously pushed by application of physical forces exerted by OE33 cells at the interface.

We acknowledge that without traction force measurements we can’t draw a clear conclusion about the forces at play in our assay, whether it was EPC2 cells which retracted or was continuously pushed by OE33 cells. The wording in our previous manuscript, “cancer cell line OE33 pushed EPC2 out of the field of view”, was used to describe the phenomenon of OE33 cells expanding into EPC2 cells resulting in their eventual disappearance from the field-of-view. This is supported by the derived motion field (Figure 1—figure supplement 3). Just before gap closure, EPC2 and OE33 cells are moving in opposite directions. After gap closure EPC2 cells near the interface continue to move in opposite directions to OE33 cells initially before they appear to be dominated by OE33 cells. This is suggestive of ‘pushing’. Unfortunately due to the large fluorescence decay in OE33 cells near the interface it is not fully conclusive whether this ‘pushing’ is continuously maintained for the full duration of the timelapse video. Confocal image of cell junctions in fixed samples at 72 h together with evidence of motion suggest continual application of physical forces. However further experimentation is required to confirm this under timelapse microscopy study. In the main we now state “in the squamous-cancer EPC2:OE33 combination, the cancer cell line OE33 expanded continuously, resulting in the disappearance of EPC2 from the field of view (Video 2, Figure 1E) as assessed by the motion field and confocal images (Figure 1—figure supplement 3). The forces that govern the behaviour of the two cell lines on contact are unknown and traction force microscopy is required to investigate the ‘retracting’ or ‘pushing’ behaviour of EPC2 or OE33 cells respectively in future studies”.

Authors have conducted new experiments, where the impact of depletion of Ca2+ on boundary formation is examined, to elucidate how the absence of cell-cell contact may disrupt the collective motion of the epithelial sheets. However, it is still unclear how serum depletion impacts the proliferation rate of the cells and that affects the collective motion of the sheets.

To address this question, we used cell density as an indicator of cell proliferation. The mean cell density and mean change in cell density over the first 48 h (≈ 16 h before and 32 h after gap closure) were estimated from cells grown in serum (5% FBS) and no serum (0% FBS) conditions from individual video frames using the established convolutional neural network approach (Materials and methods). Our results, shown in the new Figure 1—figure supplement 4, suggest that under the experimental conditions used in these studies, the lack of serum has undetectable impact on cell density. This is in complete contrast to the observed increase in collective cell migration and presence of boundary formation in 5% serum. Together these results illustrate that serum-free medium has a profound impact on cell motion dynamics. We therefore used 0% serum in the paper as a computational negative control for the detection of boundary formation. We have modified wording in the text to provide explicit clarification to avoid confusion (see subsection “In-vitro model to study the spatio-temporal dynamics of boundary formation between different cell populations”, last paragraph).

In conclusion, authors should present how much their computational framework can potentially improve our knowledge in biological processes and how it could be used to tackle new challenges.

We thank the reviewer for the constructive comment. To address this we have rewritten the Introduction to provide additional rationale for the need of developing a computational framework like MOSES. In particular we highlight the advantage of MOSES over single-cell tracking and existing PIV/CIV type methods, “we developed Motion Sensing Superpixels (MOSES), a computational framework that aims to provide a flexible and general approach for biological motion extraction, characterisation and phenotyping. We empowered PIV-type methods with a mesh formulation that enables systematic measurement and unbiased extraction of rich motion features for single and collective cell motion suitable for high-throughput phenotypic screens.”

In the revised Discussion, the potential of MOSES to advance biological knowledge and its ability to tackle new challenges is stated as the following.

1) Single-cell tracks are notoriously problematic over long times; the track of a single cell may be lost or broken into many separate tracks. MOSES superpixel tracks avoids this and recovers the global motion patterns (c.f. motion saliency maps, derived measures and motion signatures). Whilst side-by-side comparison of MOSES and the standard PIV method using published datasets demonstrates that MOSES not only enables all the measurements of PIV, but by further exploiting long-time tracks and neighbourhood relationships, delivers greater physical and biological insights. (Discussion, second paragraph).

2) Complex salient spatio-temporal motion patterns and events such as boundary formation, deformation waves due to cell jamming between two cell populations and cell death can all be quantitatively captured by MOSES. Critically, the ability of MOSES to perform long-time tracking (up to 6 days demonstrated in this study) enabled spatial localisation of the cell populations involved in a particular motion phenotype. (Discussion, second paragraph).

3) MOSES does not require complex user settings to facilitate reproducibility in analyses because it does not aim to threshold or cluster out the moving objects or phenotypes during analysis, which would introduce intermediate processing errors. Rather its philosophy is to facilitate systematic generation of many motion-related measurements based on trajectory and mesh statistics sufficient for applying machine learning methods for data-driven object segmentation, video classification and phenotype detection in large video collections (e.g. Figure 5, motion map) with minimum prior information. The main parameter the user specifies is the number of initial superpixels, which determines the spatial resolution of analysis. No complicated fitting of complex models and no special hardware such as GPUs are required. (Discussion, last paragraph).

All of these illustrate the potential of MOSES as a powerful and systematic computational framework that is particularly useful for unbiased explorative high-content screening with an aim to discover fundamental principles of cellular motion dynamics in biology and to identify factors or drugs that can alter cellular motion dynamics in disease aetiology and treatment.

Reviewer #3:

This paper presents a potentially useful tool to enable quantification of cell movement from large numbers of movies to better identify molecules that modulate collective cell migration dynamics. The scale of the data collected is impressive but I found this paper incredibly challenging to read and to understand what the data measurements physically represented and what these measurements meant biologically for our understanding of cell cooperation during boundary formation. Furthermore, I did not really understand the argument made by the authors that serum free represents no boundary formation versus for example delayed boundary formation since cells will move slower and proliferate less (and have a gap to fill before a boundary can form). This manuscript is interesting but is not currently accessible to a broad readership in my opinion.

The major comment I had that I think could significantly improve the impact of the work is can the authors better highlight the functional importance of the metrics they are quantifying in terms of the biological behaviours. For example is a boundary the best thing to be describing here or is wound healing a better example to be able to extract out different cell movement regimes to highlight the power of the tool? I could not understand how the metrics highlighted here could give me new insight into how a boundary is formed therefore it was not clear what I could learn using the tool.

The aim of developing MOSES is as “a computational framework that aims to provide a flexible and general approach for biological motion extraction, characterisation and phenotyping. We empowered PIV-type methods with a mesh formulation that enables systematic measurement and unbiased extraction of rich motion features for single and collective cell motion suitable for high-throughput phenotypic screens.”. We have provided additional rationale for the need of developing such a computational framework in the revised Introduction. Accordingly in this paper the metrics are presented to highlight how MOSES offers a more flexible platform to define customized motion measures related to a specific biological phenomena compared to existing methods that better discriminates between the resulting motion phenotypes. Here we did not aim to specifically shed insight into how a boundary is formed. However it is important to note that further studies using our method to conduct a high-content screen following genetic or other experimental manipulations would enable us to reveal new mechanistic insights into boundary formation in an unbiased manner.

The use of serum free medium to prevent boundary formation does not seem the most robust approach as this also likely impacts cell movement and proliferation, which will impact the timing required to form the boundary in the model being used (since the cells have to proliferate to fill the open space between the two cell domains. I am not sure of the logic behind specifying that doing things in serum free media gives a negative control that corresponds to a no-boundary formation case.

This is a miscommunication on our part. In this paper our ‘negative’ control serves only to demonstrate the sensitivity of MOSES to distinguish between the clear boundary-like behaviour of EPC2:CP-A in 5% serum and the lack of in the corresponding ‘negative’ no-serum case. We do not attempt to claim any experimental conditions used in this study are biologically important in boundary formation as some of the conditions such as serum-free condition cannot be used as a rigorous biological control. Here we used serum-free medium as a condition to test the sensitivity of MOSES because it has a profound impact on cell motion dynamics with absence of boundary formation. We have now clearly stated this in the revised manuscript (subsection “In vitro model to study the spatio-temporal dynamics of boundary formation between different cell populations”, last paragraph, subsection “Quantitative measurement of squamous and columnar epithelial boundary formation using MOSES”, third paragraph).

To address this reviewer’s specific comments about the impact of serum on the timing and speed of boundary formation, we carried out further measurement during the revision and our results (new Figure 1—figure supplement 4, Figure 4—figure supplement 2) showed that:

1) The presence of serum causes variation in gap closure time (within +/- 5 h) but is not significant and consistent across all cell-line combinations (Figure 1—figure supplement 4A,B). This is in contrast to the notable average increased speed in serum across all cell combinations shown in Figure 3B.

2) The presence of serum does not have significant impact on the cell movement at the assay endpoint. All videos (96 h and 144 h) have the same normalized RMSD curves at late timepoints, (Figure 1—figure supplement 4B).

3) Consistent with our findings, serum has profound impact on cell movement. In 5% serum, cells moved more collectively (Figure 3F,G, Figure 1—figure supplement 5C) and moved quicker (Figure 3A,B). However increased cell movement speed alone does not contribute to boundary formation. Addition of EGF to EPC2:CP-A cells in no-serum conditions caused notable increase in cell movement speed but had minimal impact on collective migration and boundary formation (Figure 4—figure supplement 2C,G,J,K).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Zhou FY. 2018. Normal 2 Cell Population Migration Dataset. Mendeley Data. [DOI]
    2. Zhou FY, Puig CR. 2018. EGF Addition to EPC2:CP-A. Mendeley Data. [DOI]

    Supplementary Materials

    Supplementary file 1. Table summary of video datasets and experiments analysed for 0% and 5% serum.
    elife-40162-supp1.docx (14.1KB, docx)
    DOI: 10.7554/eLife.40162.039
    Supplementary file 2. Table summary of video datasets and experiments analysed for EGF addition.
    elife-40162-supp2.docx (13.7KB, docx)
    DOI: 10.7554/eLife.40162.040
    Transparent reporting form
    DOI: 10.7554/eLife.40162.041

    Data Availability Statement

    All data generated or analysed during this study are included in the manuscript and supporting files. Source code files for MOSES are available and maintained on GitHub (copy archived at https://github.com/elifesciences-publications/MOSES). Analysed videos are made available online with unique DOIs ( http://dx.doi.org/10.17632/j8yrmntc7x.1, http://dx.doi.org/10.17632/vrhtsdhprr.1) through Mendeley datasets.

    The following datasets were generated:

    Zhou FY. 2018. Normal 2 Cell Population Migration Dataset. Mendeley Data.

    Zhou FY, Puig CR. 2018. EGF Addition to EPC2:CP-A. Mendeley Data.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES