Abstract
The process of cell-sorting is essential for development and maintenance of tissues. Mathematical modeling can provide the means to analyze the consequences of different hypotheses about the underlying mechanisms. With the Differential Adhesion Hypothesis, Steinberg proposed that cell-sorting is determined by quantitative differences in cell-type-specific intercellular adhesion strengths. An implementation of the Differential Adhesion Hypothesis is the Differential Migration Model by Voss-Böhme and Deutsch. There, an effective adhesion parameter was derived analytically for systems with two cell types, which predicts the asymptotic sorting pattern. However, the existence and form of such a parameter for more than two cell types is unclear. Here, we generalize analytically the concept of an effective adhesion parameter to three and more cell types and demonstrate its existence numerically for three cell types based on in silico time-series data that is produced by a cellular-automaton implementation of the Differential Migration Model. Additionally, we classify the segregation behavior using statistical learning methods and show that the estimated effective adhesion parameter for three cell types matches our analytical prediction. Finally, we demonstrate that the effective adhesion parameter can resolve a recent dispute about the impact of interfacial adhesion, cortical tension and heterotypic repulsion on cell segregation.
Keywords: differential adhesion hypothesis, differential migration model, cell sorting, cellular automaton, statistical learning methods, high heterotypic interfacial tension hypothesis, pattern formation
1. Introduction
During the development of multicellular organisms, mechanical forces affect the internal states of cells as well as the interaction between cells, and they are, therefore, an integral part of all morphogenetic processes [1]. These forces are typically driven by molecular motors and transmitted via cytoskeleton elements and adhesion molecules within the cells and between them. Together, molecular motor driven movement and force transmission via adhesion complexes constitute two major self-organizing phenomena that drive tissue morphogenesis.
Since it constitutes a paradigmatic process for tissue morphogenesis, cell-sorting has found much attention both from experimental and theoretical research. It describes the tissue-scale segregation phenomenon that is observed when mixed heterotypic cell populations unmix into spatially confined homotypic cell clusters [2,3]. Over many decades this process has been studied in a number of in-vitro cell-sorting experiments for many different cell types [2,3,4,5,6,7]. To resolve which types of intercellular interaction guide individual mobile cells to find their homotypic neighbors, several theoretical hypotheses have been proposed and studied. Steinberg put forward the Differential Adhesion Hypothesis (DAH) which exploits the similarity of cell segregation with the unmixing of immiscible fluids such as water and oil. It states that cell-sorting is driven by the minimization of tissue surface tensions which result from quantitative differences in the strengths of cell-type specific intercellular adhesion [8]. However, Harris (1976) [9] remarked that tissue surface tension can result from several cellular mechanisms besides differential adhesion and suggested the differential contractility of cells as a major driver of cell-sorting. This is also acknowledged by the Differential Interfacial Tension Hypothesis (DITH) of Brodland and Chen (2000) [10]. There, the effects of intercellular adhesion and cellular contractility at cell–cell contacts are subsumed to the concept of differential interfacial tension at homo- and heterotypic cell contacts which determines the degree of mutual attachment between cells and thus drives cell-sorting. Canty et al. (2017) [11] demonstrated experimentally that repulsion at heterotypic cell–cell contacts with little to no contribution from adhesive or tensile differences between cell types can lead to cell-sorting and tissue separation. They propose the High Heterotypic Interfacial Tension Hypothesis (HIT) which asserts that heterotypic repulsion creates a situation where tension is strongly increased at heterotypic contacts compared to homotypic contacts and constitutes the major driver of cell-sorting and tissue separation.
The competing explanatory models DAH, DITH and HIT are unified within the Differential Migration Model (DMM) of Voss–Böhme and Deutsch (2010) [12]. They established that all the above hypotheses have in common that cell–cell contacts affect type-specifically the migration properties of the involved cells and parametrized this effect, independent of its specific nature - adhesion or repulsion, cortical tension or even signaling cascades. For the segregation of two cell types, the existence and form of an effective adhesion parameter (EAP) has been derived mathematically, which predicts the asymptotic sorting behavior [12].
However, while most previous studies focus on the simplest case of two cell types [7,13,14], the case of three and more cell types is more relevant in real biological systems. Studies of cell segregation and pattern formation with three or more cell types are still rare [15]. Although the DMM can be directly generalized to more than two cell types, it has only been studied for the minimal case of two cell types. In particular, the existence and form of the EAP for systems with three or more cell types remains an open question.
We study numerically the impact of the EAP and related parameters on the degree of tissue separation in terms of clear tissue boundaries as well as the separation time scale for mixtures of two cell types. We present new arguments to analytically predict the form of the EAP, thus generalizing the concept of the EAP to mixtures of arbitrary many cell types. We show numerically for three cell types that the analytically predicted EAP guides the asymptotic sorting behavior in the DMM, i.e., the EAP determines the asymptotic value of the normalized sum of heterotypic contacts. The form of the EAP on the asymptotic segregation behavior is independently quantified via two statistical learning methods, Support Vector Machines (SVM) and Logistic Regression model (Logit) [16]. The form estimated by the statistical learning methods matches our analytical prediction both for two and three cell types. Finally, we demonstrate that the EAP can explain different segregation behaviors in two cell- type Cellular Potts models which was previously attributed to the competing hypotheses DAH/DITH and HIT [11].
2. Materials and Methods
2.1. Differential Migration Model
The DMM is a special probabilistic cellular automaton (PCA) based on Voss-Böhme and Deutsch [12], which has been analyzed for two cell types. Here, we present the key features of this model for the case of arbitrary many cell types:
Lattice: The PCA works on a squared lattice with periodic boundaries and we set throughout our numerical analysis.
Cells: To emulate biological cells, every lattice site is assigned a cell type of all possible cell types in the system. In this way each cell regardless of its type, has approximately the same size and occupies exactly one lattice site. The number of cells of each type remains constant during the sorting process.
Configurations: A configuration of the lattice S represents the state of the model at time t. It belongs to the set of all possible configurations .
-
Migration: A configuration is changed by a cell position switch involving two adjacent lattice sites x and y with :
(1) Position switches between two cells of the same type do not change the configuration and are therefore neglected. Thus, neighboring cells situated at lattice sites x and y only switch their positions if they are heterotypic, i.e., .
- Differential adhesion: We assume that stronger bonds to neighboring cells hinder cell motility. Accordingly, a cell switch occurs with rate . The rate depends on the parameters and , which represent the binding strengths between cells at lattice sites x and y to positions from the von-Neumann-1 neighborhoods and , see Figure 1. The von-Neumann-1 neighborhood of a lattice site corresponds to all neighboring lattice sites with Manhattan distance one. The cell switch rate of the two adjacent lattice sites with and is as follows:
(2)
Notice that the particular mechanism which affects the cellular motility on intercellular contact is not specified. The parameters can be interpreted as repulsion, if , enhancing cellular motility, and it can be interpreted as binding strength resulting from the interplay of adhesion and relaxed cortical tension at cell–cell contact, if , inhibiting cellular motility. The details on the numerical implementation of this PCA are elaborated in the Appendix A. Depending on how many cell types are considered the number of intercellular adhesion parameters varies. For this, the vector of all intercellular adhesion parameters occurring in a system with N cell types is introduced, for instance
For a 2-cell-type model, an effective adhesion parameter (EAP)
was analytically predicted in Voss-Böhme and Deutsch [12]. The EAP determines the asymptotic cell segregation behavior of systems with cell types regardless of the value combinations of . The higher the EAP the more segregated the cell types become.
For simulation of the DMM model a Gillespie-related algorithm is used where independent cell switches, see Equation (1), are chosen and the waiting times between these events are calculated based on the cell switch rates, see Equation (2), within lattice configuration . For details about this algorithm, see Appendix A, Algorithm A1. Each numerical simulation yields a time-series of lattice configurations , since every cell switch, i.e., whenever two neighboring cells switch their position, see Equation (1), results in a new configuration. The variable t refers to the number of cell switches performed.
2.2. Order Indicator
To quantify the level of cell segregation an order indicator is introduced that quantifies the level of cell segregation of a lattice configuration . The order indicator is the normalized sum of homotypic cell–cell contacts within a configuration . This indicator increases the more cells of the same type are clustered together. In detail, the set of all homotypic von-Neumann-1 neighbors of a cell of type at position is given by
With , the total sum of homotypic connections is defined as
(3) |
The highest amount and the lowest amount of homotypic lattice site connections are reached in the case of a fully sorted and chessboard-patterned configuration, respectively. The exact values for 2- and 3-cell-type systems are listed in Appendix B. With and , the order indicator value for configuration is defined as the normalization of :
Thus, takes values in , where configurations with almost perfect sorted patterns, such as in Figure 2I, yield values above . In the case of chessboard configurations as in Figure 2III, the order indicator values are below . For a random lattice configuration, the order indicator value is , in the case of a 2-cell-type system, and it is for a 3-cell-type system.
With the order indicator , a time-series of configurations can be converted into a time-series of order indicator values , in short . Furthermore, the asymptotic cell segregation level for a simulation is estimated as the average over the last of the time-series and is denoted , see Figure 2 for an illustration.
3. Results
3.1. Cell System Parameters for Two Cell Types
We present a new simple argument for the form of the effective adhesion parameter guiding the asymptotic behavior in the DMM for two cell types [12] to generalize it to systems with arbitrary many cell types. For this, we introduce two additional system parameters and besides , which determine how changes to the intercellular adhesion parameters affect the asymptotic cell segregation behavior as well as the dynamics. Although the parameters and set the temporal scale of the waiting times and the asymptotic level of segregation, respectively, we argue that the parameter determines the number of cell switches required to reach the asymptotic level of segregation.
Indeed, the addition of a constant to all intercellular adhesion parameters simultaneously like
(4) |
rescales the cell switch rates in the DMM by a factor , since
(5) |
independent of the actual configuration and the considered lattice sites . This factor does not alter the relation between the cell switch rates, but only rescales all waiting times between cell switches by . Thus, the spatio-temporal order of the cell sorting dynamics and the asymptotic behavior are not affected. In order to parametrize this rescaling, a scaling parameter
(6) |
is introduced, where the functional, which maps , is given as a scalar product with the vector .
Secondly, we notice that the choice which cell type is denoted by type and which one by is arbitrary and the dynamics in the model is invariant against relabeling of the cell types. This invariance is reflected in the cell switch rates as well as in the order indicator , which quantifies the level of segregation of a configuration as sum of homotypic lattice site connections. Accordingly, the functional which maps the intercellular adhesion parameters to the effective adhesion parameter must reflect this invariance as well. This implies that an effective adhesion parameter , which controls the asymptotic sorting behavior, assuming it exists and is determined by a linear functional on the intercellular adhesion parameters, has the form
(7) |
with two real-valued constants , and a vector introduced analogously to Equation (6) to express the functional. The constants , are set by the condition that the asymptotic level of segregation and thus the differential adhesion parameter must not be affected by the temporal scaling parameter , which implies
(8) |
Without loss of generality, we can neglect an additional offset and factor in Equation (7) and choose for which follows
(9) |
We present an additional heuristic argument to make the assumption of a linear dependency in Equation (7) plausible and to estimate that the critical value of the effective adhesion parameter, at which the system remains randomly mixed, should be zero: Consider two limit scenarios of a cell switch, first two cells at a straight interface between clusters and secondly a single cell of one type moving inside a cluster of opposite type. As illustrated in Figure 1a, two cells of different type at the interface switch their position and thus cause a less segregated configuration with a rate . In contrast, a cell of type inside a cluster of type switches position with a given neighbor with rate , see Figure 1b. Analogously, for a single cell of type in a cluster of type the rate is . Both types of switches are necessary to move a single cell of opposite type out of a cluster. Since the former switch with rate reduces the level of segregation while the latter two switches with rates , are required to increase segregation, the ratio between these types of switches should determine the level of the asymptotic segregation. In particular, the point at which the system remains mixed is expected where mixing and unmixing rates are equal, i.e., where the ratio of the rates becomes 1. Since the exponential function is monotonous, it is sufficient to focus on the exponent, which is a multiple of the effective adhesion parameter . Thus, the critical value 1 of the ratio of rates translates into a critical parameter for which a two-cell-type system is expected to remain mixed.
After establishing the system parameters and , which are defined via scalar products with the corresponding vectors , , we point out that there is a third system parameter defined by the vector perpendicular to and
(10) |
The parameter quantifies the difference between the homotypic adhesion strengths of the two cell types. It becomes zero if both types have the same strength of homotypic adhesion. We use the absolute value in the definition of the parameter as we are not interested in which cell type has the stronger homotypic adhesion. We find numerically that the parameter affects how many cell switches are required to reach the asymptotic level of segregation, see Figure 3. For a fixed effective adhesion parameter , which means for the same asymptotic level of cell segregation, Figure 3 shows the number of cell switches required to reach certain thresholds of cell segregation, here , ≥0.8 and ≥0.9. This number increases with increasing independently of the chosen thresholds.
This numerical observation is supported by a heuristic argument: Consider again the limit scenarios of a cell switch, i.e., two cells at a straight interface between clusters which switches with a particular neighbor with rate and a single cell of one type within a cluster of opposite type which switches with rate , or respectively. In the symmetric case of equal homotypic adhesion parameters , which implies and , cell switches of the two single cells in either cluster are equally frequent and these switches occur in a certain ratio with cell switches at the interface. If the symmetry is broken by increasing at constant and , for instance , implying that , the single cell of type is favored to leave the cluster of opposite type in comparison to the symmetric case and in particular compared to the other single cell of type . Thus, segregation is now limited by the slower process of the single cell of type moving out of clusters of type . Moreover, the rate of a cell switch at the interface is constant if only the parameter is altered. Thus, in between two switches of a single cell of type (rate ) out of a type cluster there are more cell switches at the interface (rate ) than in the symmetric case. Please note that these back-and-forth switches at the interface do not progress the segregation but increase the number of total cell switches. This scenario illustrates how an increase of the parameter increases the number of cell switches required to reach the asymptotic level of segregation. Thus, we denote as convergence speed parameter.
Please note that the convergence speed parameter in Equation (10) is proportional to the Euclidean distance between and the related symmetric adhesion parameters with the same values of and . Indeed, for a given vector of adhesion parameters with system parameters , , and , the corresponding symmetric adhesion parameters have the same system parameters and but a convergence speed parameter of zero. The Euclid distance of the adhesion parameters to this symmetric case is
(11) |
Since the convergence speed parameter is zero, the symmetric case leads to the fastest convergence to the asymptotic level of segregation for the given system parameters , . In contrast to the definition of the convergence speed parameter in Equation (10), the definition of via Equation (11) can be generalized to N cell types, see below.
3.2. The Effective Adhesion Parameter for Arbitrary Number of Cell Types
For a mix of more than two cell types neither the form nor the existence of an effective adhesion parameter are known. Assuming that for an arbitrary number N of cell types an effective adhesion parameter exists, we postulate its form based on the generalization of our arguments for the case of two cell types. We also postulate the form of the convergence speed parameter as introduced for two cell types.
The scaling parameter is directly generalized from Equation (6) to N cell types
(12) |
and only scales all waiting times between cell switches as in the case of two cell types.
The invariance against relabeling of the cell types applies as well to N cell types which implies, analogously to Equation (7),
(13) |
where the constants , are set again by the condition
(14) |
Choosing again , the effective adhesion parameter has the form
(15) |
with corresponding vector . For this equals the result for two cell types from Equation (9) and for this takes the form
(16) |
In contrast to the case of two cell types, the space perpendicular to the vectors and for is not one- but -dimensional, i.e., two-dimensional for . Thus, the definition of a convergence speed parameter , analogous to Equation (10), is ambiguous. One option is to consider all pairs of cell types and their corresponding convergence speed parameters, e.g., , , and for . Since the segregation of all cell types requires the segregation of each pair of cell types, these pair-wise parameters should predict the convergence speed on subsets of intercellular adhesion parameters analogously to Figure 3. For instance, when two vectors of intercellular adhesion parameters are equal except they differ in one of these pair-wise convergence speed parameters, the one with the smaller parameter leads to a faster convergence. However, these pair-wise parameters do not allow comparison if several of them differ between two vectors of adhesion parameters and the influence of the heterotypic parameters is not even considered. Thus, the convergence speed parameter is instead defined by the generalization of Equation (11)
(17) |
We postulate, in analogy to the case of two cell types, that the parameter determines the number of cell switches required to reach the asymptotic level of segregation for a given effective adhesion parameter .
3.3. Numerical Evidence for the Effective Adhesion Parameter
We test our analytical prediction for the effective adhesion parameter for more than two cell types, by comparison with numerical simulations of the DMM model for two and three cell types. We find in both cases that the proposed effective adhesion parameter determines the asymptotic level of cell segregation. However, there are deviations due to the limited simulation time, from which the asymptotic level of the order indicator is extrapolated. This is revealed by additionally linking the simulation results to the convergence speed parameter , introduced above. For two cell types, we observe that for small values of , which refers to simulations which converge quickly to their asymptotic value, the dependency between and agrees almost perfectly with the analytically proven prediction. Deviations from the analytical prediction are the most pronounced the larger the convergence speed parameter is. For three cell types, we observe an analogous relation between deviations from the postulated dependency on the effective adhesion parameter and a convergence speed parameter as introduced above.
3.3.1. The Asymptotic Level of Cell Segregation Depends on the Effective Adhesion Parameter
At first the dependence of the level of asymptotic cell segregation on the effective adhesion parameter is tested. For the 2-cell-type, the effective adhesion parameter clearly determines the asymptotic sorting state, see dark blue points in Figure 4a, i.e., increases with increasing , as predicted by Voss-Böhme and Deutsch [12], and all configurations stay randomly mixed at , i.e., (see black dashed lines), as predicted by the heuristic argument above. The monotonic dependence of the asymptotic order parameter on the proposed effective parameter is supported by a value of of the spearman rank correlation coefficient (Kendall rank correlation coefficient of ). Additionally, Figure 4a shows the asserted influence of the convergence speed parameter on level of cell segregation dynamics. More precisely, the higher the value is the slower is the sorting process and the closer the respective value remains to until the end of the simulation. For these intercellular adhesion parameters , the asymptotic value of the order indicator has not yet been reached, which is why they are scattered above and below the increasing line, indicated by the dark blue points in Figure 4a. Note, that also the effective adhesion parameter shows impact on the amount of cell switches needed for a system to reach its asymptotic cell segregation . This is further investigated in Appendix D.
Analogous results for the case of three cell types with the postulated effective adhesion parameter are shown in Figure 4b. Qualitatively the same functional relation between the hypothesized effective adhesion parameter and is visible as in the case of two cell types, see dark blue points in Figure 4b. This includes the prediction that all configurations stay randomly mixed at , i.e., (see black dashed lines). The monotonic dependence of the asymptotic order parameter on the proposed effective parameter is supported by a value of of the Spearman rank correlation coefficient (Kendall rank correlation coefficient of ), which is of similar quality as in the case of two cell types. Based on the analogy of the numerical results for two and three cell types, we conclude that predicts the level of sorting reached asymptotically such as does in the 2-cell-type case. Correspondingly, the asymptotic cell segregation exhibits the postulated influence of the convergence speed parameter . Note that the restriction in Figure 4b to simulations with an ensures a sufficient sampling of data points with smaller parameter values. Smaller values result in more simulations reaching their asymptotic cell segregation.
For the 2-cell-type system, we can additionally visualize the effective adhesion parameter as a plane in 3D, see Figure 5, with for and its normal vector . This plane separates simulated data points based on their level of asymptotic cell segregation . This means, the direction indeed describes the direction of order progression, see color transition in Figure 5.
We present analogous analysis for 4- and 5-cell-type systems, see Appendix C, Figure A1 and find a strong support of the analytically predicted effective adhesion parameter as well.
3.3.2. Estimating the Effective Adhesion Parameter Using Statistical Learning Methods
We confirmed qualitatively the compliance between our prediction and the numerical data in the previous section. Now we confirm additionally our prediction by comparing the analytical effective adhesion parameters for two and three cell types to the corresponding estimates obtained via two statistical learning methods, Support Vector Machines (SVM) and Logistic Regression (Logit-Model) [16]. More precisely, for the general case of N cell types, the learning methods provide a hyperplane equation of the effective adhesion parameter . Therefore, we quantify statistically the deviation from our analytical prediction. This is done for and based on the numerical data referred to in the previous section and displayed in Figure 4a,b, respectively. Both methods are suited for this task because they fit the linear dependency of the parameter on the intercellular adhesion parameters , are well documented and directly available from libraries such as scikit-learn for Python [17], and, in contrast to some other approaches like deep learning, both quantify the factors that lead to classification.
The learning methods provide a hyperplane equation as a linear predictor to separate classified data points. Thus, the numerical data points are first classified based on their asymptotic order indicator and a classification threshold. As thresholds, we chose the random configurations, i.e. for two cell types and for three cell types. Due to the symmetry of the effective adhesion parameter , see Equations (7) and (13), the number of coefficients, which must be estimated, can be reduced to the constant for the homotypic and the constant for the heterotypic adhesion parameters. The resulting hyperplane equations for both 2-cell-type and 3-cell-type cases are presented in the headlines of Table 1a,b.
Table 1.
(a) 2-cell-type systems |
Model | |||
---|---|---|---|---|
Accuracy | ||||
SVM | 1.0000 | −1.9764 | −0.0374 | 0.9965 |
Logit | 1.0000 | −1.9763 | −0.0455 | 0.9965 |
Theoretical prediction | 1 | −2 | 0 | |
(b) 3-cell-type
systems |
Model | |||
Accuracy | ||||
SVM | 1.0000 | −0.9994 | 0.2835 | 0.9895 |
Logit | 1.0000 | −0.9935 | 0.2624 | 0.9900 |
Theoretical prediction | 1 | −1 | 0 |
For the 2-cell-type case in Table 1a, both relative coefficients, for homotypic and heterotypic intercellular adhesion parameters, are close to their predicted values 1 and , respectively. The relative intercept , i.e. the value of which corresponds to the chosen threshold , is for both learning methods near zero, as predicted by our heuristic argument. These predictions of the SVM and the Logit-Model are of high quality as indicated by model accuracy values close to one. This quantitative result of the statistical learning methods in Table 1a support the analytical prediction for .
Analogous for three cell types, the results of Table 1b support the analytical results by estimating with convincing model accuracy of about for both learning methods. The relative intercept , i.e., the value of which corresponds to the chosen threshold , is for both learning methods close to zero, as predicted by our heuristic argument. We attribute the deviation of the relative intercept from zero as well as the lower model accuracy compared to two cell types to insufficient convergence of the order indicators as predicted by the convergence speed .
We present analogous analysis for 4- and 5-cell-type systems, see Appendix C, Table A2 and again find a strong support of the analytically predicted parameters defining the effective adhesion parameter.
3.4. The Impact of Interfacial Tension, Adhesion or Repulsion on Cell Segregation
For two cell types, the EAP resolves previous discussions about the impact of interfacial tension on cell segregation compared to interfacial adhesion or repulsion [11]: We predict that a higher level of segregation is reached when the EAP is large, which can be achieved by both heterotypic repulsion or differential adhesion. In fact, transforming the relative contact tensions used in ref. [11] to the corresponding EAP by identifying high tensions in the experiments and the Cellular Potts model (CPM) with low adhesion parameters in the DMM, we can predict the experimental and numerical observations there, see Figure 6. In particular, we find that low lengths of heterotypic interfaces (LHI) are observed for high EAP values, analogous to the DMM. Figure 6 shows that the EAP is a better predictor for the asymptotic segregation behavior than the classification according to the DAH/DITH and HIT hypotheses.
Thus, we propose that rather than differentiating between intercellular adhesion and contact tension, the combined effect of intercellular contact on cellular motility, which is quantified by the EAP, should be focused on. Furthermore, the correct prediction of the asymptotic level of segregation in the CPM simulations of Ref. [11] using the effective adhesion parameter derived for the DMM suggests that our results may be directly applicable to CPM models of cell segregation. This is plausible due to the analogous structure of the exponent of the cell switch rates in the DMM and the energy functional in the CPM.
4. Discussion
We study analytically and numerically the differential migration model of Voss-Böhme and Deutsch (2010) [12], which is a cell-based model incorporating differential hypothesis [8], differential interfacial hypothesis [10], and High Heterotypic Interfacial Tension Hypothesis [11] within a unified framework. We generalize the existence and form of an effective adhesion parameter (EAP) guiding the asymptotic level of segregation, which is already known for systems with two cell types [12], to systems with an arbitrary number of cell types. We additionally predict the critical value of this effective adhesion parameter at which the system remains randomly mixed. For the case of two and three cell types, we confirm these theoretical predictions numerically and quantify the form of the effective adhesion parameter independently using statistical learning methods. The analogous results for 4- and 5-cell types suggest that our findings are valid in systems with even higher number of cell types. For two cell types, we show that the EAP resolves previous discussions about the impact of interfacial tension on cell segregation compared to interfacial adhesion or repulsion [11]. Thus, we propose that rather than differentiating between intercellular adhesion and contact tension, the combined effect of intercellular contact on cellular motility, which is quantified by the EAP, should be focused on.
Most previous studies focus on the simplest case of two cell types [7,13,14]. Although three and more cell types are more relevant in real biological systems, studies of cell segregation and pattern formation with three or more cell types are still rare [15]. By analytically and numerically demonstrating the existence of cell segregation for three cell types and characterizing its dynamics, we extent results from two cell types to the more general case.
The differential migration model has recently been applied successfully to reproduce experimental cell segregation data for two cell types [18]. In particular, the match between model and experimental observation demonstrated that contact tension and adhesion are sufficient to explain segregation data without additional mechanisms of collective motion. Analogously, our analytical and numerical results can be applied to cell segregation experiments with three or more cell types.
Acknowledgments
We thank Fabian Schwarzenberger for helpful discussions.
Appendix A. PCA Implementation
The Differential Migration Model (DMM) [12] is implemented as a rule-based probabilistic cellular automaton (PCA). Instead of the classical discrete-time implementation via Monte-Carlo steps, a Gillespie-like algorithm is used which simulates the PCA in continuous time. For this, a support data structure is employed which tracks all heterotypic cell switches. These switches correspond to all possible events of the system for the corresponding lattice configuration at a given point in time. The code effort is higher compared to more intuitive time-discrete algorithms due to the necessary update mechanics of the data structure but it allows the performance of one heterotypic cell switch per iteration step, since there are iteration steps without actual events in the analogous time-discrete Monte-Carlo simulation. At each iteration step, one entry from the support data structure is randomly chosen according to a weight which is calculated from the corresponding cell switch rate , see Equation (2). Afterwards, the associated lattice sites interchange their occupations. Then all entries of the support data structure that are affected by the configuration change are updated.
Algorithm A1 The PCA Algorithm. |
Appendix A.1. initializeLattice
Initially, the square lattice S has the start configuration . The lattice S is described by where . The parameter called "cellTypeDistribution" determines the total amount of cells for each cell type within the lattice.
Appendix A.2. initializeSupportDataStructure
The support data structure holds the information about the neighbors of all lattice sites together with the corresponding cell switch rates and whether a cell–cell contact is heterotypic or not. The parameter named "cellContactBindingStrength" provides the intercellular adhesion parameters to calculate the cell switch rate.
Appendix A.3. checkSimulationRunCondition
To finalize a simulation a run condition is met. For this, either an iteration step count or a normalized order indicator value is used. In most cases, the simulation is stopped at 312,500 cell switches except for the data in Figure 3 where thresholds of the order indicator are used.
Appendix A.4. getOneRandomCellSwitch
One heterotypic entry is drawn from the support data structure. This heterotypic cell switch identified by k has the corresponding cell switch rate . This rate belongs to the set of all heterotypic cell switch rates, given the lattice configuration . The probability that a cell switch k is chosen correlates directly with . That is shown in Equation (A1):
(A1) |
Appendix A.5. getLatticeSitesAssociatedToCellSwitch
The lattice sites from the chosen cell switch are obtained.
Appendix A.6. interchangeLatticeOccupationsAtSites
The new occupation of lattice site becomes the former occupation of lattice site and vice versa, according to Equation (1).
Appendix A.7. getAffectedLatticeSites
The set of all lattice sites affected by the cell switch and the resulting configuration change , see Equation (1), is composed of the direct neighbors of the lattice sites associated with the cell switch and their neighbors. Thus, it is
Appendix A.8. getNeighborsOfLatticeSite
For a lattice site the neighborhood is obtained, see description to Equation (2).
Appendix A.9. getCellSwitchRate
The cell switch rate between two lattice sites x and y is calculated according to Equation (2).
Appendix A.10. checkLatticeSitesAreHeterotypic
Two lattice sites and are heterotypic if their corresponding cell types are not equal.
Appendix A.11. updateSdsEntryForLatticeSiteWithData
The updated data in terms of cell switch rate and heterotypic cell contact of the neighborhood member is assigned to the neighborhood host z within the support data structure.
Appendix B. Order Indicator
Table A1.
2-Cell-Type Systems | 3-Cell-Type Systems | |
---|---|---|
1200 | 1175 | |
50 | 0 |
If we consider a lattice then the ideal case of a fully sorted configuration consists of stripes for each of the N cell types, each with a height of and a length of L. In this scenario the highest amount of homotypic lattice site connections is , since every lattice site has two unique homotypic connections except for those at cell-type interfaces. These only have one homotypic connection. For the same ideal scenario, a chessboard configuration where no homotypic connections exist can be achieved for the right lattice side length L. Therefore, .
In practice, a discrete lattice with lattice sites cannot be separated into even stripes. Due to this imperfection, small deviations from these values occur in data. In the case of 2-cell-type systems, a lattice side of causes because every cortical lattice site has one homotypic connection due to periodicity.
In other publications such as Canty et al. (2017) [11], the level of cell segregation and clustering is quantified by the length of the heterotypic interface, here we use the amount of homotypic lattice site connections, see Equation (3).
Appendix C. Numerical Evidence for the Effective Adhesion Parameter with Extended Cell-Type Number
Table A2.
(a) 4-cell-type systems |
Model | |||
---|---|---|---|---|
Accuracy | ||||
SVM | 1.0000 | −0.6553 | 0.8022 | 0.9615 |
Logit | 1.0000 | −0.6625 | 0.7330 | 0.9610 |
Theoretical prediction | 1 | −2/3 | 0 | |
(b) 5-cell-type
systems |
Model | |||
Accuracy | ||||
SVM | 1.0000 | −0.5030 | 0.5889 | 0.9555 |
Logit | 1.0000 | −0.4913 | 0.5992 | 0.9565 |
Theoretical prediction | 1 | −0.5 | 0 |
Appendix D. The Impact of the Effective Adhesion Parameter on the Convergence Speed
Figure A2 shows that the larger the EAP is the more cell switches are needed before a simulation reaches its asymptotic segregation state which is a fully sorted pattern in the case of and according to the DMM, see Section 2.1. This effect is visible from about and the cause is still unknown.
Appendix E. Statistical Learning Methods
All methods, SVM and Logistic regression as well as the 5-fold cross-validation, are executed in default mode and as recommended by the library scikit-learn. This includes among others the hyperparameter . The smaller this inverse regularization parameter is the more regularization is applied and the lower is the risk of model over fitting. This also can increase the numerical stability. The default penalization is in both cases l2-penalty [17]. Since all intercellular adhesion parameters are from the same distribution, see caption of Figure 4, preprocessing steps such as data scaling are omitted. Other execution details are:
SVM: used algorithm implementation “SVC”
SVM: kernel “linear”
Logit: used training algorithm/solver “lbfgs”
Author Contributions
Conceptualization, H.-J.B., S.L. and A.V.-B.; Data curation, P.R. and S.L.; Formal analysis, P.R., S.L. and A.V.-B.; Funding acquisition, H.-J.B. and A.V.-B.; Investigation, P.R., S.L. and A.V.-B.; Methodology, S.L. and A.V.-B.; Project administration, H.-J.B., S.L. and A.V.-B.; Resources, H.-J.B. and A.V.-B.; Software, P.R. and S.L.; Supervision, H.-J.B., S.L. and A.V.-B.; Validation, P.R., S.L. and A.V.-B.; Visualization, P.R., S.L. and A.V.-B.; Writing—original draft, P.R., S.L. and A.V.-B.; Writing—review & editing, P.R., H.-J.B., S.L. and A.V.-B. All authors have read and agreed to the published version of the manuscript.
Funding
This research is co-financed by the EU, by the European Social Fund (ESF) and by tax funds on the basis of the budget passed by the Saxon state parliament: SAB-Nr. 100382145. This article is additionally funded by the Open Access Publication Fund of Hochschule für Technik und Wirtschaft Dresden University of Applied Sciences and the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—432908064.
Data Availability Statement
The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are contained within the manuscript and its appendix.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Heisenberg C.P., Bellaïche Y. Forces in Tissue Morphogenesis and Patterning. Cell J. 2013;153:948–962. doi: 10.1016/j.cell.2013.05.008. [DOI] [PubMed] [Google Scholar]
- 2.Holtfreter J. Gewebsaffinität, ein Mittel der embryonalen Formbildung. Arch. Exp. Zellforsch. 1939;23:169–209. [Google Scholar]
- 3.Townes P.L., Holtfreter J. Directed movements and selective adhesion of embryonic amphibian cells. J. Exp. Zool. 1955;128:53–120. doi: 10.1002/jez.1401280105. [DOI] [PubMed] [Google Scholar]
- 4.Armstrong P.B. Light and electron microscope studies of cell sorting in combinations of chick embryo neural retina and retinal pigment epithelium. Wilhelm Roux Arch. Entwickl. Mech. Org. 1971;168:125–141. doi: 10.1007/BF00581804. [DOI] [PubMed] [Google Scholar]
- 5.Armstrong P.B. Cell sorting out: The self-assembly of tissues in vitro. Crit. Rev. Biochem. Mol. Biol. 1989;24:119–149. doi: 10.3109/10409238909086396. [DOI] [PubMed] [Google Scholar]
- 6.Beysens D.A., Forgacs G., Glazier J.A. Cell sorting is analogous to phase ordering in fluids. Proc. Natl. Acad. Sci. USA. 2000;97:9467–9471. doi: 10.1073/pnas.97.17.9467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Méhes E., Mones E., Németh V., Vicsek T. Collective Motion of Cells Mediates Segregation and Pattern Formation in Co-Cultures. PLoS ONE. 2012;7:e31711. doi: 10.1371/journal.pone.0031711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Steinberg M.S. On the Mechanism of Tissue Reconstruction by Dissociated Cells, Iii. Free Energy Relations and the Reorganization of Fused, Heteronomic Tissue Fragments. Proc. Natl. Acad. Sci. USA. 1962;48:1769–1776. doi: 10.1073/pnas.48.10.1769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Harris A.K. Is Cell sorting caused by differences in the work of intercellular adhesion? A critique of the Steinberg hypothesis. J. Theor. Biol. 1976;61:267–285. doi: 10.1016/0022-5193(76)90019-9. [DOI] [PubMed] [Google Scholar]
- 10.Brodland G.W., Chen H.H. The mechanics of heterotypic cell aggregates: Insights from computer simulations. J. Biomech. Eng. 2000;122:402–407. doi: 10.1115/1.1288205. [DOI] [PubMed] [Google Scholar]
- 11.Canty L., Zarour E., Kashkooli L., François P., Fagotto F. Sorting at embryonic boundaries requires high heterotypic interfacial tension. Nat. Commun. 2017;8:157. doi: 10.1038/s41467-017-00146-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Voss-Böhme A., Deutsch A. The cellular basis of cell sorting kinetics. J. Theor. Biol. 2010;263:419–436. doi: 10.1016/j.jtbi.2009.12.011. [DOI] [PubMed] [Google Scholar]
- 13.Ventrella R., Kaplan N., Getsios S. Asymmetry at cell-cell interfaces direct cell sorting, boundary formation, and tissue morphogenesis. Exp. Cell Res. 2017;358:58–64. doi: 10.1016/j.yexcr.2017.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ko J.M., Lobo D. Continuous Dynamic Modeling of Regulated Cell Adhesion: Sorting, Intercalation, and Involution. Biophys. J. 2019;117:2166–2179. doi: 10.1016/j.bpj.2019.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sivakumar N., Warner H.V., Peirce S.M., Lazzara M.J. Agent-Based Model of Multicellular Spheroid Pattern Formation Driven by Synthetic Cell Adhesion Signaling Circuits. bioRxiv. :2021. doi: 10.1101/2021.08.05.455232. [DOI] [Google Scholar]
- 16.Hastie T., Tibshirani R., Friedman J. The Elements of Statistical Learning: Data Mining, Inference and Prediction. 2nd ed. Springer; Berlin/Heidelberg, Germany: 2009. [Google Scholar]
- 17.Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
- 18.Franke F., Aland S., Böhme H.J., Voss-Böhme A., Lange S. Is Cell segregation just like oil and water? arXiv. 2021 doi: 10.1371/journal.pcbi.1010460.2109.00364 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The authors confirm that all data underlying the findings are fully available without restriction. All relevant data are contained within the manuscript and its appendix.