Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2012 Sep 11;7(9):e42852. doi: 10.1371/journal.pone.0042852

Multi-Scale Modeling in Morphogenesis: A Critical Analysis of the Cellular Potts Model

Anja Voss-Böhme 1,*
Editor: Christof Markus Aegerter2
PMCID: PMC3439478  PMID: 22984409

Abstract

Cellular Potts models (CPMs) are used as a modeling framework to elucidate mechanisms of biological development. They allow a spatial resolution below the cellular scale and are applied particularly when problems are studied where multiple spatial and temporal scales are involved. Despite the increasing usage of CPMs in theoretical biology, this model class has received little attention from mathematical theory. To narrow this gap, the CPMs are subjected to a theoretical study here. It is asked to which extent the updating rules establish an appropriate dynamical model of intercellular interactions and what the principal behavior at different time scales characterizes. It is shown that the longtime behavior of a CPM is degenerate in the sense that the cells consecutively die out, independent of the specific interdependence structure that characterizes the model. While CPMs are naturally defined on finite, spatially bounded lattices, possible extensions to spatially unbounded systems are explored to assess to which extent spatio-temporal limit procedures can be applied to describe the emergent behavior at the tissue scale. To elucidate the mechanistic structure of CPMs, the model class is integrated into a general multiscale framework. It is shown that the central role of the surface fluctuations, which subsume several cellular and intercellular factors, entails substantial limitations for a CPM's exploitation both as a mechanistic and as a phenomenological model.

Introduction

Motivation

Understanding the mechanisms that control tissue organization during development belongs to the most fundamental goals in developmental biology. Quantitative approaches and mathematical models are essential to deduce the consequences of existing morphogenetic hypotheses, thus providing the basis for experimental testing and theoretical understanding. One approach to questions concerning patterning in developing organisms is to consider tissues as huge populations of cells which behave according to certain rules that depend on their genetic programs and inner structure as well as on environmental influences. To a large extent, the environmental influences are constituted of the states and actions of directly neighboring cells. Then, tissue organization can be understood as emergent behavior that results from local intercellular interaction, being the result of processes at different spatio-temporal scales. To understand the relevance of particular factors on the subcellular or cellular scale for tissue organization, the development and analysis of suitable mathematical models is indispensable.

Physics has a long history in modeling and analyzing problems where multiple spatio-temporal scales are involved, so-called multi-scale problems. Therefore, existing frameworks often originate in statistical physics. For instance, so-called equilibrium models are designed to study the macroscopic-scale characteristics of many particles which interact on a microscopic scale at equilibrium, that is when the temporal evolution has relaxed to a stationary state. Often these models are analyzed with the help of Markov chain Monte Carlo methods, for instance the Metropolis algorithm [1]. In these cases, an auxiliary dynamics is constructed which drives the system from an arbitrary initial state towards the equilibrium state that shall be studied.

One attempt to tackle patterning processes in development has been to adopt a suitable equilibrium model of statistical physics together with an auxiliary dynamics and modify it such that the needs of developmental biology are met. This approach was pursued by Glazier and Graner in a series of papers such as [2][7]. They took a model which was originally developed in solid state physics to study ferromagnetism. Adapting the term which describes the interdependence structure of the individual units at the lower spatial scale and modifying the updating algorithm of the Metropolis algorithm, they obtained a dynamical system that mimics observed biological behavior seemingly realistically. The thus proposed model has been called cellular Potts model (CPM) or Glazier-Graner-Hogeweg model. It was first used in computational biology for a theoretical study of cell sorting, a phenomenon where an initially mixed cell population segregates into homotypic clusters presumably due to type-specific differences in the strength of intercellular adhesion. Subsequently, the model has been extended more and more to address a variety of biological questions in different contexts including tumor formation and progression, see for instance [5], [8][15]. In general, CPM-based models are used to simulate the collective behavior of interacting cells and to predict the emergent behavior at the tissue scale.

In a CPM, biological cells are described as spatially extended but internally structureless objects that cover several nodes of a regular lattice. Cells move or change their shape by annexing or rejecting single nodes according to a rule which is dependent on a pre-specified cellular and subcellular interdependence structure. The resulting cell behavior in a CPM visually resembles membrane fluctuations and pseudopod protrusions as observed for biological cells. Due to the cells' subdivision into subcellular parts, a CPM is capable to model cells with type-specific sizes and morphologies.

There are only a few model classes besides the CPM that allow to study interacting cell populations with non-isotropic and type-specific cell morphologies. Established models with a similar spatial resolution are the Vertex Model [16][18] and the Subcellular Element Model [19], [20]. They are spatially continuous models of different origin and nature than the CPM. Their specific advantages and drawbacks render them in some respects comparable to the CPM, however a detailed analysis of the similarities and differences is left to further study. In this paper, the focus is laid solely on the properties of the CPM.

Notice that, despite its popularity in theoretical biology, the modeling framework of cellular Potts models has received little attention yet from mathematical and modeling theory. Though the CPM has its origin as an equilibrium model – for which the mathematical properties are well-understood – , the theoretical fundament of the CPM framework and area of sound application need further clarification. This is because there has been a paradigm shift when devising the CPM for patterning processes in developmental biology. CPMs are utilized as kinetic models, that is to study or identify major dynamical determinants of a temporarily evolving process. However, being no longer equilibrium models, they are not automatically appropriate kinetic models. To distinguish the problems where CPMs can be effectively applied as kinetic models for tissue organization and to assess the mathematical properties of this model type, the modeling framework of cellular Potts models is subjected to a theoretical study in this paper. It is explored under which conditions a CPM is an appropriate dynamical model for intracellular interaction and what its principal behavior at different time scales characterizes.

It is shown that the modifications in the Metropolis algorithm have a dramatic impact on the long-time behavior of the model. In the long run, the cells consecutively die out, independent of the specific interdependence structure that characterizes the model. At smaller time scales, when spatial correlations have already established but the stationary state is not yet reached, the model outcome is the result of an interplay between behavior that is controlled by the modeler via the specification of the cellular interdependence structure and an additional, hardly controllable impairment that is due to the modification of the Metropolis updating scheme. Dwelling deeper into the question to which extent the model can be exploited to derive reliable predictions of the macroscopic behavior that can be expected from particular microscopic interactions, the CPM is integrated into a general multiscale modeling framework. It is argued, that the CPM's resolution below cell level allows to overcome the lattice anisotropy and to model cells with flexible and adaptive morphologies. However, the characteristic to model intercellular interaction exclusively via surface fluctuations entails substantial constraints with respect to the level of detail from the subcellular scale that can be traced by the model. In addition, the cells' subdivision into subcellular parts necessitates non-local interaction rules to control the cellular morphology. These rather technical terms hinder the application of some powerful mathematical methods, such as rigorous spatio-temporal limit procedures, for the analysis of the emergent macroscopic behavior. Thus, the flexibility in the cells' morphologies comes at the price of less control over the model's cellular behavior and intercellular interaction and of limited analytic tractability, both leading to a reduced mechanistic understanding. It is clear from many successful applications of CPMs to deep biological questions, e.g. [4], [5], [9], [21], that the CPM frameworks is an expedient modeling approach if cell size, cell shape, or cell polarity essentially affect the intercellular interaction rules and, in particular, if the cellular morphology is considered adaptive to the surrounding cellular environment. However, to value the contribution of a model to the underlying biological problem, it is essential to understand the model's theoretical basis and construction and to discuss openly its power and limitations. There exists considerable empirical knowledge of how CPMs behave for certain choices of the parameter values which has been obtained from extensive CPM simulations. Nevertheless, it is necessary to complement empirical experience with rigorous analytical arguments to provide more clarity about the structural properties of CPMs and to distinguish reliable facts from mere beliefs. This also helps to expose existing inconsistencies and drawbacks of the model class as a basis and encouragement for further discussions and developments. This paper shall be a contribution towards this objective.

Mathematical model description

A CPM assigns a value Inline graphic from a set Inline graphic to each site Inline graphic of a countable set Inline graphic, cp. Fig. 1. The set Inline graphic resembles the discretized space and is usually chosen as a two- or three-dimensional regular lattice. The set Inline graphic contains so-called cell indices, where Inline graphic is the absolute number of cells that are considered in the model. The state of the system as a whole is described by configurations Inline graphic Given a configuration Inline graphic, a cell in the CPM is the set of all points in Inline graphic with the same cell index, Inline graphic The value Inline graphic is assigned to a given node, if this node is not occupied by a cell but by medium. Each cell is of a certain cell type, which determines the migration and interaction properties of the cell, the set of all possible cell types being denoted by Inline graphic Denote by Inline graphic the map that assigns each cell its cell type. A cell with index Inline graphic has volume

Figure 1. Cell-surface interaction in the cellular Potts model is regulated by the surface energy coefficients.

Figure 1

Three cells with cell indices 1, 2 and 3, respectively, each one covering several lattice sites, interact with each other at the cell surfaces. The cells 1 and 3 are of type A, depicted in dark grey, the cell 3 is of type B, depicted in light grey. The strength Inline graphic of the interaction depends on the cell types. There are also interactions between the cells and the medium (white, cell index 0). Possible boundary interactions are not shown.

graphic file with name pone.0042852.e017.jpg

where the Kronecker symbol Inline graphic is defined by Inline graphic if Inline graphic and Inline graphic otherwise, and surface length

graphic file with name pone.0042852.e022.jpg

The sum in the last term is taken over all interfaces of a given configuration Inline graphic, that are all pairs of lattice neighbors which do not belong to the same cell.

A cellular Potts model (CPM) is a time-discrete Markov chain with state space Inline graphic, where the transition probabilities are specified with the help of a Hamiltonian or energy. The latter is a function Inline graphic which often has a special structure. Usually it is the sum of several terms which are supposed to control single aspects of the cells' interdependence structure. Basically two terms are used in most CPM studies. First a surface interaction term

graphic file with name pone.0042852.e026.jpg (1)

is specified. Here Inline graphic, the matrix of so-called surface energy coefficients, is assumed to be symmetric. Second a volume constraint

graphic file with name pone.0042852.e028.jpg (2)

is used. Here Inline graphic, the target volume, and Inline graphic, the strength of the volume constraint, are cell-type specific parameters, Inline graphic Dependent on the actual situation that shall be studied with the help of a CPM, further addends can be included, for instance a surface constraint [3]

graphic file with name pone.0042852.e032.jpg (3)

Again Inline graphic, the target surface length, and Inline graphic, the strength of the surface constraint, are parameters, Inline graphic Thus, the typical structure of a CPM-Hamiltonian is

graphic file with name pone.0042852.e036.jpg (4)

where Inline graphic are given in (1) and (2) and Inline graphic is a model-specific addend. Transitions from one configuration to another follow a special rule which is called modified Metropolis algorithm. First two additional parameters Inline graphic, the so-called temperature, and Inline graphic, a transition threshold, are specified. Note that the transition threshold was set to Inline graphic in the original model proposed by [2] but it turned out that in some applications Inline graphic is a better choice [7], [22]. Then the following algorithm is performed:

  1. Start with configuration Inline graphic

  2. Pick a target site Inline graphic at random with uniform distribution.

  3. Pick a neighbor Inline graphic of Inline graphic at random with uniform distribution among all lattice neighbors of Inline graphic

  4. Calculate the energy gain, Inline graphic, that is reached if the present configuration Inline graphic is replaced by the trial configuration Inline graphic. The latter is obtained from Inline graphic by copying the index Inline graphic onto the node Inline graphic, that is Inline graphic if Inline graphic and Inline graphic otherwise.

  5. If the energy gain is below the transition threshold, that is if Inline graphic, accept the trial configuration and put Inline graphic; go to step 1. Otherwise, put Inline graphic with chance Inline graphic and keep Inline graphic unchanged with chance Inline graphic.

Consequently, only such transitions are possible where the index of at most one lattice site is changed, resulting in a shift of the cell's center of mass. The new assignment to this lattice site is chosen from the cell indices of the neighboring lattice sites. These dynamics are interpreted to resemble membrane fluctuations, where one cell shrinks in volume by one lattice site and a neighboring cell increases in volume by occupying this site.

To complete the model, appropriate boundary conditions must be specified. If the influence of the boundary shall be neglected, periodic boundary conditions are used. This means that the space can be thought of as being mapped onto a torus. However, fixed boundary conditions, where the interaction between the cell surfaces and the confining environment is explicitly modeled, can be defined within this modeling framework, as well.

It will turn out, that most of the properties of the CPM that shall be discussed within this article do not depend on the specific structure of the Hamiltonian Inline graphic. Therefore, it is assumed in the following that the Hamiltonian is a real function on Inline graphic without stipulating a special structure such as (4). This approach has the additional advantage that boundary conditions can be included by adjusting the Hamiltonian accordingly.

Definition

Let Inline graphic be a real function on Inline graphic and suppose that Inline graphic. A cellular Potts model is a discrete-time Markov process with state space Inline graphic and with transitions following the modified Metropolis algorithm with respect to Inline graphic and Inline graphic.

The CPM model formalism has been used for several problem-specific extensions. In general, this is done by including additional terms into the Hamiltonian (4). For instance, elongated cell shapes can be modeled in a CPM by imposing a cell length constraint which renders the major axis of the ellipsoidal approximation of the cell's shape to be close to some prescribed target value [23]. Rod cell shapes with particular stiffness have been modeled using a compartmentalized cell concept, where each cell consists of a row of standard CPM cells [12]. In some cases, the kinetics of the original CPM is altered by directly modulating the transition rates that are calculated in step 4. of the modified Metropolis algorithm. Specific control terms that may depend on the configuration of the system but also on addition system parameters, like the position Inline graphic of the target site, the position Inline graphic of the trial spin or the velocity increment of the affected cell, are added in step 3. of the modified Metropolis algorithm to the energy gain Inline graphic that is calculated from the Hamiltonian. Notice that these models cannot be represented within the classical model since the control terms cannot be derived from a Hamiltonian. Therefore these models with kinetic extensions will be referred to as extended CPMs. Examples comprise the explicit modeling of inertia by constraining the cell velocity increment [24] or the inclusion of chemotactic responses to some field Inline graphic of signals into the model as in [3], [25].

Another extension of the CPM framework comprises hybrid CPM models. The standard CPM treats cells as internally structureless lattice domains. However, several studies have adapted the CPM to allow the modeling of subcellular structures, as well. The latter are derived from models of the intracellular biochemistry, typically modeled in terms of ordinary differential equations. In this way, the spatial configuration of the cells, their sizes, shapes, motility properties as well as the intercellular interaction can be coupled with cell-intrinsic processes. For instance, the effect of intracellular actin dynamics on membrane protrusions and retractions is modeled via a hybrid CPM in [21].

Results

Equilibria of the cellular Potts model

The standard Metropolis algorithm, see for instance ([1], §4.3), differs from the rules 0.-4. described in the previous section Steps 0. and 1. remain unchanged. In 2., the cell index Inline graphic that is chosen to replace Inline graphic with some probability, is drawn uniformly from Inline graphic without considering the neighborhood of Inline graphic. Then, in 3., a trial configuration Inline graphic with

graphic file with name pone.0042852.e080.jpg

is used and the energy difference is calculated via Inline graphic. Afterwards an attempt is made to replace Inline graphic by Inline graphic according to the decision rule in step 4.

The standard Metropolis algorithm is a Markov chain Monte Carlo method to explore the equilibrium model corresponding to the Hamiltonian Inline graphic. It samples typical configurations of the so-called Gibbs measure corresponding to Inline graphic. The latter is a measure Inline graphic on Inline graphic defined by

graphic file with name pone.0042852.e088.jpg

where Inline graphic is the normalizing factor. This measure is a widely accepted model of statistical physics for the equilibrium behavior of particle systems at temperature Inline graphic whose microscopic interdependence structure is described by Inline graphic. The transition rates Inline graphic for transitions Inline graphic of the standard Metropolis algorithm satisfy the detailed balance conditions w.r.t. Inline graphic, that is

graphic file with name pone.0042852.e095.jpg

See ([1], §4.3) for details. Therefore, the Markov chain generated by the standard Metropolis algorithm has the Gibbs measure Inline graphic as a reversible measure. Since this chain is finite and irreducible, Inline graphic is its only invariant measure and the distribution at time Inline graphic converges to Inline graphic for Inline graphic. Thus, starting from an arbitrary initial configuration, the standard Metropolis algorithm produces realizations which are distributed approximately according to Inline graphic after a sufficiently long relaxation time.

The modified Metropolis algorithm does not satisfy the detailed balance conditions w.r.t. Inline graphic. Indeed, it is easily verified by applying the results in ([26], Thm. 4.1) that the transitions in the CPM cannot satisfy detailed balance w.r.t. any measure neither related nor non-related to Inline graphic. The main argument behind this statement is as follows. A cell in the CPM that covers only a single node has a positive chance to disappear during a transition, while the probability to reappear is zero. This behavior is a direct consequence of the modification in the Metropolis algorithm and is not present in the classical method. Therefore, the Markov chain defined by the CPM dynamics has several absorbing states, namely all those configurations that consist of only one cell that covers all nodes of the lattice. As it is shown rigorously in the Methods section, a CPM is eventually trapped in one of these absorbing states regardless of the special structure of its Hamiltonian. Its distribution Inline graphic at time Inline graphic converges as Inline graphic to a measure Inline graphic that is a convex combination of point measures Inline graphic, each of them concentrated on a constant configuration Inline graphic with Inline graphic. In detail, Inline graphic converges towards

graphic file with name pone.0042852.e112.jpg

where Inline graphic, and Inline graphic. The weight Inline graphic depends on the initial configuration Inline graphic and equals the probability that the CPM started in Inline graphic is absorbed by the constant configuration Inline graphic. The values Inline graphic, can be calculated explicitly, see the Theorem 1 in the Methods section. The time until absorption depends on the structure and parameter values of the Hamiltonian. Estimates of the time till absorption are provided in the Methods section.

The differences between standard and modified Metropolis algorithm become particularly blatant when Inline graphic (or Inline graphic). In this case, the CPM is a multi-type voter model [27], since in each transition the cell index of the target site is replaced by the cell index of a randomly chosen neighboring site. Consequently, the longtime behavior agrees with that of the voter model, where the cells consecutively die out. In contrast, the standard Metropolis algorithm decouples for Inline graphic, that means the evolution of each lattice site is independent from that of the other lattice sites and one observes a uniform distribution of spins in the long-time limit. Note that, for large temperatures, it was observed but not studied in detail in [2] that in a CPM ‘the pattern loses energy simply by eliminating cells’, which is a clear cue towards absorption.

Thus, it turns out that the modifications in the updating algorithm change the longtime behavior of the corresponding Markov chain dramatically in comparison with the standard, equilibrium model. Since detailed balance w.r.t. the Gibbs measure related to Inline graphic is broken and absorbing states are present, the long-time behavior is no longer controlled by the Hamiltonian Inline graphic. The modifications in the Metropolis algorithm, which could seem to be marginal, produce a qualitatively different behavior. In the long run, the evolution in the CPM is not directed towards the minimization of the energy Inline graphic but the cells in the CPM consecutively die out.

Impact of the Hamiltonian on the model dynamics at different time scales

It is pointed out in subsection Equilibria of the cellular Potts model that the modification of the Metropolis algorithm has a major impact on the dynamics. By relating the transition mechanism to the cell indices in the neighborhood of the target site, the impact of the Hamiltonian on the actual transition probabilities is reduced and even vanishes in the long-time limit. Still, it might be objected that the phenomenon of successive cell extinction in a CPM only marginally affects its behavior in parameter ranges that are of interest in the applications and that the above considerations are of theoretical value only. The main arguments in this direction are as follows. Firstly, a pragmatic ‘no-extinction’-rule for the CPM cells could be implemented. Secondly, it might be argued that the disappearance of CPM cells is so seldom that it can be neglected and that interesting and complex behavior is observed in the CPM before the process of cell extinction becomes manifest. The third objection could be that the role of the Hamiltonian Inline graphic is purely technical to give the transition rates a bias and that the focus of interest in CPM studies is put neither on the long-time behavior nor on the minimization of the energy Inline graphic.

However, there is a methodological problem with these arguments. A modeler controls the structure and the parameters of the Hamiltonian. With the help of the Hamiltonian, he implements his ideas about the underlying biological process into the model. The transition probabilities and thus the model kinetics, however, depend not only on this controlled term but also on a non-controlled voter-like part, stemming from the modification of the Metropolis dynamics, which depends on the geometric composition of a target spin's neighborhood. The latter part of the dynamics is sensible, for instance, to the topology of the underlying lattice, to the size of the chosen neighborhood structure and to the number of subcellular parts a CPM cell is divided into. These are technical parameters that have no mechanistic role for the biological process of interest. Thus, the CPM dynamics is characterized by a constant conflict of controlled behavior specified by the Hamiltonian and a non-controlled impairment.

The impact of the Hamiltonian, the controlled behavior, vanishes in favor of the non-controlled behavior for certain – long – time scales, as is discussed in subsection Equilibria of the cellular Potts model. To exploit the model in possibly well-behaved parameter regions, it is essential to address the following questions. What characterizes those regimes of a CPM, where the non-controlled, voter-like part of the transitions is marginal for the emergent behavior compared to the part of the transitions that is controlled by the Hamiltonian? To which extent does this regime depend on the structure and parameters of the Hamiltonian? To which extent does this regime depend on the dimension and topology of the underlying lattice and the grain size of the subcellular segmentation? What are the typical time scales that separate ‘good’, controlled behavior from ‘bad’, largely uncontrolled behavior? These theoretical questions have not been addressed in depth so far but need to be thoroughly discussed, if the results that are obtained from the analysis of a CPM are to be carried over into biological understanding.

Thus, coming back to the above objections, it can be argued that, firstly, a pragmatic ‘no-extinction’ rule for the CPM cells – as it is implemented in many applications, sometimes without explicit notice [28] – conceals the underlying conflict between controlled and uncontrolled behavior in the CPM. It has no mechanistic biological interpretation since it aims at a symptom that is of model-technical origin. Secondly, if the complex behavior observed in the CPM before the process of cell extinction becomes manifest shall be exploited for biological comprehension, it is necessary to validate the model appropriately. The interpretation of the model outcomes in biological terms is valuable in those cases where the extent of possible non-controlled influences is clearly assessed. This is particularly important, when quantitative predictions are to be derived. Thirdly, the focus of interest in CPM studies is predominantly in identifying the distinctive signature at the tissue level that emerges from specific intercellular interactions. It is characteristic for emergent phenomena, that the effect of the super-positioned microscopic interactions becomes not evident at the macroscopic spatial scale until a certain time – also measurable at a macroscopic scale – has elapsed. Therefore, numerical studies of the long-time behavior of CPMs and approximative descriptions of the their dynamics particularly at long, macroscopic time scales are indispensable.

Thus, the Hamiltonian Inline graphic has a technical role by favoring those transitions which lower the energy. However, the dynamics is not driven by the Hamiltonian alone but there is a constant conflict with a non-controlled voter-like portion in the transition rates. This conflict becomes particularly blatant in the long-time behavior. The consequences of this conflict within other parameter regimes are not yet well-understood. The ambiguity in the model's dynamics constitutes a serious restraint for utilizing the CPM as a mechanistic model.

Utility for spatio-temporal limit procedures

Typical properties of a spatially explicit dynamical model are often revealed by applying suitable spatio-temporal limit procedures. For instance, the ensemble Inline graphic of Gibbs measures on increasing cubes Inline graphic is studied in statistical physics. The cluster points of these measures are Gibbs measures on the infinite lattice Inline graphic. By studying the latter objects the phenomenon of phase transitions in the original model can be understood. In the case of the two-type voter model, the longtime behavior of spatially bounded systems is always trivial, that is the system is absorbed in one of the two constant configurations. However the extension of the voter model to infinite lattices shows a more complex behavior in dimensions Inline graphic [27], [29], [30]. Exploiting the fact that spatially large but bounded systems are ‘close’ to infinite systems, the characteristics of the clustering process before absorption can be derived [31]. Another important tool of analysis is the derivation of a spatially and temporally continuous description of the considered process by sending the lattice spacing and the time unit to zero. The resulting model, which is often a partial differential equation, can be considered to be a description of the emergent macroscopic behavior that arises from the microscopic interaction [32], [33].

To carry out spatio-temporal limit procedures rigorously, it is necessary that the original Markov chain model on Inline graphic can be extended to the infinite, spatially unbounded lattice. Assuming, for simplicity, that S is a Inline graphic-dimensional cubic lattice, an extension of the model to Inline graphic is required. The specification of transition probabilities is no longer sufficient for the description of the model, since the state space Inline graphic is innumerable then. This becomes apparent, for instance, if it is tried to perform the Metropolis algorithm on an infinite lattice. Actually, it is not self-evident that an extension of the model to spatially unbounded domains exists as a mathematically well-defined object. However, such an extension is straightforward, if the model can be interpreted as an interacting particle system (IPS) in the sense of Liggett [29]. This is the case, if the following two conditions are satisfied.

  1. The original Markov chain model is temporally continuous.

  2. The transitions are local.

Condition (1) is no serious restriction, since a time-continuous Markov chain can be constructed from a time-discrete Markov chain by a standard procedure. Indeed, let Inline graphic be the transition matrix of the original Markov chain model and define

graphic file with name pone.0042852.e138.jpg

where Inline graphic is the unit matrix on Inline graphic. Then Inline graphic satisfies Inline graphicand Inline graphic. Hence Inline graphic is a rate matrix which generates a time-continuous Markov chain. The evolution of this chain is very close to that of the original chain. The only difference is that the time to the next attempted transition is now exponentially distributed instead of constant.

Condition (2) is essential for defining a Markov process on Inline graphic. Transitions are local, if

(2-I) the configuration is changed only locally when a transition is performed;

(2-II) to calculate the transition rate, it is sufficient to know the configuration within a local neighborhood of the region that is to be changed.

A mathematically precise formulation of these conditions is given in the Methods section. In a CPM, the transition rates of the time-continuous algorithm satisfy (2-I) since only one site is changed in an (infinitesimal) transition. However, (2-II) is not satisfied, since the volume constraint Inline graphic (2) is a non-local function. Indeed, to assess the impact of a cell-index change at location Inline graphic on Inline graphic, the volumes of the affected cells need to be known. These volumes can be determined only if the configuration on the whole lattice is examined. It is not enough to scan an appropriate neighborhood of Inline graphic. A detailed discussion of this issue can be found in the Methods section.

Thus, a CPM would be qualified for the application of spatio-temporal limit procedures, if it had a natural extension to spatially unbounded systems. However, the non-local nature of the transition rates blocks the integration of a CPM with Hamiltonian (4) into the model class of interacting particle systems in the sense of Liggett [29]. The methods that are available for CPM analysis so far comprise essentially numerical simulation studies, such as [7], [34], and heuristic approximations as in [35], [36], for instance. This reveals a present challenge when exploiting CPMs, since the extent of additional insight that can be gained by applying the model as well as the stringency of the conclusions within the model depend strongly on the capability and the rigor of the available analytical tools.

Multiscale modeling within the CPM framework

CPMs are typically utilized to study the tissue scale properties that result from specific intercellular interactions. In extended CPM models such as [8], [10], [15], [21], [23], [25], [37], intracellular or molecular details are included additionally. Therefore, since multiple spatial and temporal scales are coupled into one description, CPMs are considered to be multi-scale models.

There are two principal classes of such models that need to be distinguished. Mechanistic models evolve according to rules that have been abstracted from the underlying biological process. These rules represent a proposed or hypothetical mechanism concerning the intercellular interaction. The latter may depend on cellular characteristics and intracellular processes. The goal of developing a mechanistic model is essentially to provide a proof-of-principle for a proposed mechanism or to ‘verify’/falsify a hypothetical mechanism. This can be accomplished by determining – with the help of the model – the distinctive characteristics at the tissue level which emerge from the assumed intercellular interaction and their comparison with experimental observation. A scheme of this mechanistic multi-scale framework is depicted in Fig. 2. The main challenge of mechanistic models lies in accounting for the appropriateness of the model class by assessing the possible impact of simplifying model assumptions on the intended mechanism.

Figure 2. A mechanistic multiscale framework is characterized by the coupling of multiple spatial and temporal scales on the basis of abstracted rules.

Figure 2

The assumed intercellular interaction may depend on an interplay with cellular characteristics and intercellular details. By determining the distinctive characteristics at the tissue level and their comparison with experimental observation, it can be tested wether a specific mechanism explains the behavior of an experimentally studied cell system.

In contrast, a model of interacting cells is phenomenological, sometimes also called empirical or data-based, if it agrees with observed biological behavior in a statistical sense but its intrinsic mechanism of evolution is secondary. The models principal qualification for the studied question needs to be verified on an appropriate data basis. If adequate agreement is reached, phenomenological models can be exploited to explore theoretically the tissue scale outcome at conditions that resemble or complement the experimentally tested ones, thereby describing, inter- or even extrapolating existing biological data. The biological experimentalist can thus be guided towards the most interesting behavior. The main challenge of phenomenological models lies in validating the model appropriately, that is to identify and match particularly those characteristics that are critical for accordant model behavior.

Thus, mechanistic and phenomenological models are used for different purposes. Sufficiently validated phenomenological models can be utilized to summarize and visualize data, to make predictions and as heuristics for designing experiments. Mechanistic models with well-founded model assumptions can also provide predictions of the system's behavior but the focus is put rather on an satisfactory explanation of the observed phenomena. Notice that phenomenological models might be constructed partly by mechanistic considerations. However, if the exploited model behavior is not robust with respect to additional, purely technical model assumptions or a full parameter variation within the biologically explained constraints, its explanatory power becomes ambiguous.

The mechanistic construction of a CPM is displayed schematically in Fig. 3. The central device in a CPM are the CPM cells' surface fluctuations. They are mainly regulated by the Hamiltonian Inline graphic, but the actual control of the Hamiltonian over the intensity of the surface fluctuations is attenuated by a voter-like portion in the transition rates, as is discussed in the subsection on the Impact of the Hamiltonian. The Hamiltonian is a sum of terms, typically at least the surface interaction term Inline graphic and the volume constraint Inline graphic, see (1) and (2), respectively, which are assumed to reflect simultaneously the effect of the intercellular interaction and of the cellular characteristics. Supplementary terms, such as the surface constraint Inline graphic, see (3), are integrated into the Hamiltonian to further enforce phenomenologically realistic behavior. All these terms are, however, not derived from a mechanistic assumption about the behavior and interaction of the subcellular parts that are resembled by the single nodes of a CPM cell. Instead, these terms describe heuristically the effect of all determinants – from the subcellular to the intercellular scale – which are assumed to become ‘somehow’ manifest as cell surface fluctuations. Thus, the parameters of a CPM Hamiltonian can be dived into (i) directly biologically interpretable or measurable parameters, like the cells' target volumes Inline graphic, (ii) effective parameters that subsume various intercellular processes and cellular details, such as the surface interaction strengths Inline graphic, and (iii) merely technical parameters with ambiguous biological interpretation, like the parameters Inline graphic and Inline graphic, Inline graphic, which determine the impact of the volume and surface constraints. The temperature Inline graphic, which weights the overall impact of the Hamiltonian on the dynamics, is also a purely technical parameter. In the model, the parameter Inline graphic controls the strength of interaction between neighboring lattice nodes. The higher Inline graphic, the less dependent they evolve. Thus, it affects at the same time the subcellular cohesion, the intercellular interaction and the degree of control that is exerted via the Hamiltonian onto the surface fluctuations. There is a continuing obscurity concerning the interpretation that can be given to this parameter [2], [3], [11], [15]. It seems that it remained from the physical origin of the model. Notice, that the parameter Inline graphic of the CPM can be omitted by a scaling Inline graphic.

Figure 3. Cell surface fluctuations are the central device in the realization of the multiscale concept in CPMs.

Figure 3

Both the rules of intercellular interaction and the considered cellular characteristics are eventually coded, via the Hamiltonian or directly for extended models, into an expression that regulates the intensity of CPM-cells' surface fluctuations. Additional technical parameters are integrated into the Hamiltonian to be able to suppress phenomenologically unrealistic behavior. The actual impact of the Hamiltonian on the intensity of CPM cells' surface fluctuations is attenuated by a voter-like portion in the transition rates. The surface fluctuations drive simultaneously the actual behavior of a CPM at the cellular scale, the specifics of intercellular interaction and the emerging behavior at the tissue scale. Single aspects of the cellular properties in the model, for instance the cell shape flexibility, the magnitude of random cell displacements or the cells' surface roughness, and of the intercellular interaction, like the strength of intercellular adhesion, cannot be controlled individually but are interlinked with each other. Likewise, purely model-technical control parameters such as the cellular integrity, that is the property of CPM cells to span over connected, essentially convex lattice domains, are coupled indirectly with biologically interpretable cellular and intercellular properties. The emerging tissue scale behavior is solely rooted in the specified characteristics of the CPM cells' surface fluctuations and not linked directly to cellular and intracellular specifics.

The surface fluctuations drive simultaneously the actual behavior of a CPM at the cellular scale, the specifics of intercellular interaction and the emerging behavior at the tissue scale. Single aspects of the cellular properties in the model, for instance the cell shape flexibility, the magnitude of random cell displacements or the emerging cells' surface roughness, and of the intercellular interaction, like the strength of intercellular adhesion, cannot be controlled individually but are interlinked with each other. Likewise, purely model-technical control parameters such as the cellular integrity, that is the property of CPM cells to span over connected, essentially convex lattice domains, are coupled indirectly with biologically interpretable cellular and intercellular properties. The emerging tissue scale behavior is solely rooted in the specified characteristics of the CPM cells' surface fluctuations and not directly linked to cellular and intracellular specifics.

If a CPM shall be utilized as an explanatory mechanistic model, the central role of the surface fluctuations constitute a handicap for a thorough understanding and interpretation of the model outcome. This is because the single aspects of cellular behavior in a CPM cannot be assessed individually by the modeler but are coupled to each other in an indirect fashion. An observed signature at the tissue scale might be traced down to the underlying intensity of surface fluctuations. However, it is hardly possible to identify and separate the effect of single components in the model's interplay of intercellular interaction, cellular characteristics and even intracellular or molecular details that is apparent as cell surface fluctuations.

Thus, a CPM's spatial resolution below the cellular level, which is the basis for modeling deformable cells, and the central role of the surface fluctuations entail substantial drawbacks for its mechanistic construction and, consequently, the explanatory power of the model. If flexible, adaptive and non-isotropic cell shapes or a variability in the cells' sizes are assumed to essentially affect the intercellular interaction, a CPM is one of a few existing models that can be applied to show that a proposed mechanism ‘somehow’ produces the observed behavior. Disagreement with the biological data, however, could mean both: the incapability of the model to correctly capture the postulated mechanism or the inappropriateness of the hypothetical mechanism for the biological system at hand. Therefore, mechanistic conclusions drawn from CPM models are only limitedly reliable. A validation of the results with the help of alternative models which operate at differing levels of complexity and thus represent different modeling compromises is worthwhile. If cell sizes and shapes are of minor importance for the interaction, more coarse-grained individual-based approaches such as interacting particle systems, e.g. [29], [38][40] may lead to mechanistically better understood and analytically easier tractable models.

If a CPM shall be exploited as a phenomenological model, it is necessary to empirically match the kinetic properties of the model with the respective data from the relevant biological experiments. The benefit of this approach depends on the availability of representative data for parameter estimation and model validation as well as on theoretical knowledge about the fundamental dynamical properties of CPMs. Although blatantly unrealistic cell behavior is prevented by modifying the standard Metropolis algorithm, it is not self-evident that the kinetic properties of a given CPM with Hamiltonian Inline graphic really resemble the dynamical behavior of the considered experimental system. The challenge consists in (I) identifying the key determinants that have to be empirically matched in model and experiment to maximize the descriptive power of the model and in (II) defining suitable measures of the descriptive and predictive performance with respect to specific traits. To progress in this direction, it is necessary to have a substantial knowledge of the model's principal behavior in the respective parameter regions. So far, there are a few studies that analyze the kinetic properties of CPM's and compare them to corresponding characteristics found in experimental data. For example, [34] studied numerically the relaxation kinetics for clustering in the original CPM introduced by [2] for cell sorting, thus providing a reference framework for the model's comparison with experimental data. Other examples include [12] who assess the biological plausibility of single cell dynamics before they turn towards their actual focus of study, the collective behavior that results from alignment rules based on elongated cell shapes, and [41] who provide empirical guidelines about how to tune a CPM in order to optimize its behavior with respect to certain biophysical characteristics.Notice that the necessity to empirically match simulations and experiments has already been recognized in [42]: ‘A first step towards quantitative cell-based modeling is to ensure the cell behavior modeled by the CPM matches experiments exactly.’ Here, it shall be emphasized that tuning the parameters of a model such that it empirically matches the observations with respect to certain key characteristics – regardless of the qualitative or quantitative nature of the conformance measures – implies that the model is utilized predominantly as a phenomenological model. Those models describe or visualize rather than explain experimentally observed behavior.

To describe a given experimental situation by a CPM, it is necessary that there is a protocol of how to adjust the model parameters in such a way, that the ‘in silico’ condition is comparable with the actual preparation of the biological system. Since the CPM parameters interfere with each other in a complex way, a simultaneous parameter fitting is often applied. At present there is no standard algorithm for the model adjustment. Instead, it is a very intricate task that requires much intuition and skill by the user [28].

Vice versa, if the behavior at conditions that have not yet been explored experimentally shall be predicted, it is essential that a concrete ‘in-silico’ setting can be translated into a biologically condition that is defined by the states of certain experimentally manageable quantities. However, several CPM parameters, in particular those which subsume various intercellular and cellular details into unspecific effects becoming manifest as surface fluctuations, like the surface energy coefficients Inline graphic, are hard to match with biological traits. Varying Inline graphic, for instance, affects the details of intercellular interaction and, simultaneously, the cell motility, the magnitude of surface fluctuations and the smoothness of the cell surface: ‘more cohesive cells [in a CPM] have more crumbled surfaces, larger membrane fluctuations and diffuse further than less cohesive cells’ [3]. Even the actual cell size in the CPM is affected: ‘... cells with a higher surface energy (but the same target volume) overall are smaller ...’ [8]. There are a number of observed tissue scale characteristics in the CPM that have a direct biological interpretation, such as the average magnitude of cell center displacements within the cell population, the actual average magnitude of surface fluctuations, the apparent smoothness of cell surfaces, or the observed distribution of cell sizes. However, the attribution of these emergent characteristics to manageable quantities at the cellular and intercellular scale, such as the strength of intercellular adhesion between two cells, the degree of the intrinsic motility of an individual cell, or even the expression profiles of certain molecules at the cell surfaces, is sometimes rather vague.

Thus, the central role of the surface fluctuations for the CPM dynamics entails substantial drawbacks for its exploitation as a phenomenological model of collective cell behavior. If the key characteristics of the studied biological system that have to be matched by the model system include flexible, adaptive and non-isotropic cell shapes or a considerable variability in the cells' sizes, a CPM is one of a few existing models that can be applied.

The exploitation of a CPM as a phenomenological model is also reasonable if the morphometric composition and the spatial arrangement of the CPM cells and their dynamic reorganizations solely constitute the spatial structure for the study of coupled intracellular and molecular processes. In this case, the focus of interest is put on analyzing the patterns and structures that emerge from the interaction of these processes, for instance by modeling them as coupled ODE systems. If the underlying spatial structure shall not be static but temporarily varying or even be slightly adaptive to the modeled intracellular occurrences, a CPM can provide such a spatial framework. A CPM that is empirically adjusted to match the key determinants of the morphometric cell composition and the spatial cell arrangement in the given experimentally assay, can be utilized then as the spatial basis for an additional, mechanistic modeling stacked on top on it.

Discussion

CPMs are typically applied if the tissue scale properties that emerge from specific intercellular interactions shall be described, predicted or explained. The model class originates in statistical physics, where Markov chain Monte Carlo methods are utilized to study the behavior of many interacting particles at equilibrium. The model's adaption to the requirements of modeling morphogenetic processes involves a paradigm shift from equilibrium to non-equilibrium, kinetic modeling. The temporal evolution in the model – which before has been an auxiliary tool to drive the system towards the equilibrium state of interest – turns out to be the core of the transition mechanism in a CPM. Correction terms in the CPM Hamiltonian and additional modifications in the original transition mechanism help to eliminate biologically unrealistic behavior. The modifications in the transition mechanism dramatically alter the long-time behavior of a CPM compared to its counterpart in statistical physics. In the long-run, the temporal evolution of a CPM is not directed towards the minimization of the Hamiltonian or energy but instead the CPM cells consecutively die out. The correction terms in the Hamiltonian render the transition mechanism to be non-local, thus hindering the application of powerful analysis methods from statistical physics such as spatio-temporal limit procedures. Thus, the descent from a well-studied model class in physics can hardly be exploited for CPMs.

To assess whether CPMs constitute good dynamical models for multi-scale problems in morphogenesis, it is helpful to distinguish between the intended purpose of modeling: mechanistic modeling can be applied to explain an observed phenomenon, while phenomenological modeling rather describes biological observations. In both cases, the mechanistic construction of the CPM dynamics, where the intensity of the surface fluctuations is the central device that subsumes the effect of all cellular and intercellular details, entails substantial limitations for a CPM's exploitation in the respective direction. If considerable variability in the cell sizes and shapes or flexible cellular neighborhood relations are supposed to essentially determine the intercellular interaction, the CPM framework is one of a few model classes that can be utilized. The application of CPMs is also reasonable, if a non-static, dynamically changing spatial structure shall be simulated that forms the cellular basis for interacting intercellular and molecular processes. Then the focus is put on the patterns and structures that emerge from the interaction of these processes and the tissue rearrangement described by the CPM is of minor importance, utilized rather to represent the fluctuations in the spatial composition of the cell population. In those cases, however, where essentially isotropic, non-polarized cells of uniform size are considered, it is worthwhile to validate the results by comparing them to the outcomes of more coarse-grained modeling approaches, like Cellular Automata or Interacting Particle Systems, that are mechanistically better understood and analytically more accessible although they may look visually less appealing.

In most cases where a CPM is used, an important biological problem is addressed which is characterized by an interplay of several factors from different scales, acting at the intracellular, the intercellular and the tissue level. The developed CPM usually incorporates much detail and substantiates deep biological insight. Computer simulations can be an important tool for a deeper understanding. However, already the original system, which underlies all more elaborate CPMs, is still poorly understood in its theoretical and mathematical properties. The methods that are available for its analysis so far comprise essentially numerical studies and heuristic approximations. Since the stringency of the conclusion that can be gained by applying a model depends substantially on the capability and rigor of the available analytical tools, this presents a considerable challenge. The more details from the cellular, intracellular and possibly intracellular scale are included into the model the more pronounced are the challenges which are encountered when adjusting the model specifics to the biological situation at hand or when analyzing the model outcomes theoretically. To value the contribution of a CPM to the understanding of an underlying biological problem, it is essential that the theoretical characteristics of the model class are well-understood. Artifacts and non-robustness of the model behavior deserve particular attention, since laying them open helps to define the good of the model. This study provides a starting point for such work. It also constitutes a theoretical basis for developing assistance in constructing and choosing expedient model parameters and to give practical advice for cellular Potts implementations. Quantitative estimates for choosing the parameter values such that a CPM behaves as intended within certain time scales depend largely on the specific model that shall be applied. While the formulas to calculate the time to extinction for a given CPM are derived explicitly here, the development of further quantitative support for the CPM construction, the appropriate parameter choices and the determination of reasonable time scales for conclusive simulations is left to future studies.

Further theoretical analysis of the CPM class is worthwhile and shall be encouraged by this work. In particular, the study of highly simplified models may lead to mathematically well-founded assessments of the principal behavior of CPMs under various conditions concerning the temporal scales and the specific structure of the Hamiltonian as well as the parameter regimes. Besides this, the simultaneous representation and theoretical as well as empirical analysis of the same biological mechanism by various models which differ in their spatial resolution and particular model structure can help to distinguish the factors that are robustly described from the effects that must be attributed to the model's peculiarities. In this respect, the comparison between CPMs and non-lattice models, such as the Vertex model, or between CPM and cellular automata or interacting particle systems seems to be most promising.

Methods

Absorption for cellular Potts models

First, it shall be shown that any cellular Potts model as defined in subsection Mathematical model description is eventually absorbed by a constant configuration. Recall that Inline graphic and Inline graphic, where Inline graphic is a finite set. For Inline graphic, let be Inline graphic the set of non-empty proper subsets of Inline graphic. Define further

graphic file with name pone.0042852.e173.jpg (5)

the set of all configurations where exactly the cells with cell indices from Inline graphic are present. Notice that, for Inline graphic, the set Inline graphic contains only the constant-Inline graphic configuration Inline graphic.

Since the probability for the next transition in a given CPM is determined solely by the present state but not the past ones, the temporal evolution of a CPM is a Markov chain ([43], Def.2.1.1). The behavior of the latter is completely characterized by the transition matrix Inline graphic, where Inline graphic is the probability of a transition Inline graphic by one step of the modified Metropolis algorithm, Inline graphic.

In the following, the assertion that any CPM is eventually absorbed by one of the constant configurations is derived from considerations about the structure of the transition matrix. The findings presented here are based on results in the theory of finite Markov chains. See, for instance, ([44], §4-§5) or [43] for more detail.

Proposition 1

1. The sets Inline graphic, are the communication classes associated with the transition matrix Inline graphic.

2. For Inline graphic, the class Inline graphic is closed if and only if Inline graphic.

3. The elements of

graphic file with name pone.0042852.e188.jpg

are the absorbing states associated to Inline graphic while the states that belong to Inline graphic are the transient states.

4. If the configurations in Inline graphic are arranged appropriately, the transition matrix has the form

graphic file with name pone.0042852.e192.jpg (6)

where Inline graphic is the unit matrix on Inline graphic, Inline graphic is the null-matrix on Inline graphic, Inline graphic is a non-negative, non-vanishing matrix on Inline graphic and Inline graphic is a substochastic matrix on Inline graphic.

Notice that a matrix Inline graphic is substochastic if it has non-negative entries with row sums less or equal than one but strictly less than one for at least one row.

Proof

(1) Given two configurations Inline graphic, it is possible to reach Inline graphic from Inline graphic by performing a finite number of transitions each with positive transition probability, and vice versa. Thus, the elements of Inline graphic are communicating ([43], Def.2.4.1). If a configuration Inline graphic is given, each set Inline graphic with Inline graphic can be reached by performing a finite number of transitions each with positive transition probability, but the sets Inline graphic where Inline graphic cannot be reached in this way. Therefore, the sets Inline graphic, are the communication classes associated with the transition matrix Inline graphic ([43], §2.4.1).

(2) One observes that

graphic file with name pone.0042852.e213.jpg

Therefore, the states Inline graphic, are absorbing and the singletons Inline graphic are closed communication classes ([43], Def.2.4.2). Since the class Inline graphic is accessible from the class Inline graphic, if Inline graphic and since Inline graphic for Inline graphic, it can be concluded that for Inline graphic, there exists an Inline graphic such that

graphic file with name pone.0042852.e223.jpg

Therefore, the classes Inline graphic are not closed if Inline graphic.

(3) Any state of a Markov chain is either recurrent or transient, compare ([43], §3.1.1). Both properties are class properties ([43], Thm.3.1.2), that means either all elements of a communication class are recurrent or all class members are transient. A recurrent communication class is closed ([43], §3.1.3). Thus, one finds that

graphic file with name pone.0042852.e226.jpg

is the set of absorbing states associated to Inline graphic, and Inline graphic are the transient states.

(4) It follows from (3) that the transition matrix Inline graphic has the structure described in (4) if the configurations in Inline graphic are arranged in such a way that the absorbing configurations from Inline graphic are followed by the transient configurations from Inline graphic.

Next, asymptotic properties of the matrices Inline graphic and Inline graphic are derived, where Inline graphic is the Inline graphic-th matrix power of the matrix Inline graphic and Inline graphic is defined by

graphic file with name pone.0042852.e239.jpg (7)

both Inline graphic and Inline graphic given by (6). This is motivated by the fact that the long-time behavior of a Markov chain with transition probability Inline graphic is completely determined by the Inline graphic-th matrix power Inline graphic of the transition matrix Inline graphic for sufficiently large Inline graphic. Indeed, if Inline graphic is the initial distribution, then

graphic file with name pone.0042852.e248.jpg

is the distribution at time Inline graphic ([43], §3). Notice that

graphic file with name pone.0042852.e250.jpg (8)

Proposition 2

(1) For any Inline graphic, it holds that

graphic file with name pone.0042852.e252.jpg

(2)

graphic file with name pone.0042852.e253.jpg

Proof

(1) ([44], Prop. 5.1(i)).

(2) Define Inline graphic. Since

graphic file with name pone.0042852.e255.jpg

one obtains by letting Inline graphic that

graphic file with name pone.0042852.e257.jpg

Thus, Inline graphic. The assertion (2) follows now from (7) by letting Inline graphic. Indeed, one finds that

graphic file with name pone.0042852.e260.jpg

Now the results about eventual absorption and the probabilities of absorption by a particular configuration can be stated. Suppose that the Markov chain with initial distribution Inline graphic and transition matrix Inline graphic is denoted by Inline graphic and the underlying probability space is denoted by Inline graphic. Absorption is the event that the Markov chain reaches one of the absorbing states within finite time, that is

graphic file with name pone.0042852.e265.jpg

The event of being absorbed by a particular constant-Inline graphic configuration Inline graphic is defined by

graphic file with name pone.0042852.e268.jpg

Denote by Inline graphic the Dirac or point measure in Inline graphic.

Theorem 1

(1) For any initial distribution, absorption occurs almost surely, that is

graphic file with name pone.0042852.e271.jpg

(2) If the Markov chain Inline graphic is started in the configuration Inline graphic, then the probability of absorption by the constant configuration Inline graphic is given by

graphic file with name pone.0042852.e275.jpg

(3) The set of stationary distributions of a CPM is given by

graphic file with name pone.0042852.e276.jpg

Proof

(1) One finds that the event Inline graphic implies the event Inline graphic, since the Markov chain cannot escape from an absorbing state once it was captured there. Consequently,

graphic file with name pone.0042852.e279.jpg

by the continuity of the measure Inline graphic.

For the distribution Inline graphic of the Markov chain at time Inline graphic, it holds that Inline graphic, where Inline graphic is the initial distribution and Inline graphic is the Inline graphic-th matrix power of the transition matrix Inline graphic. Since

graphic file with name pone.0042852.e288.jpg

with Inline graphic defined by (7), one obtains

graphic file with name pone.0042852.e290.jpg (9)

and

graphic file with name pone.0042852.e291.jpg (10)

Thus on can conclude

graphic file with name pone.0042852.e292.jpg
graphic file with name pone.0042852.e293.jpg
graphic file with name pone.0042852.e294.jpg
graphic file with name pone.0042852.e295.jpg

where the last equality follows from (9). By Proposition 2 (1), each addend converges to zero for Inline graphic. Taking into account that the sum consists only of a finite number of addends, the assertion is proven.

(2) Fix Inline graphic. The event Inline graphic implies the event Inline graphic, since the Markov chain cannot escape from an absorbing state Inline graphic once it was captured there. Thus Inline graphic by the continuity of the measure Inline graphic. Since Inline graphic, one obtains from (10) and Proposition 2 (2) that

graphic file with name pone.0042852.e304.jpg

(3) Since the Markov chain is almost sure captured by one of the absorbing states Inline graphic, the set of stationary distributions is the convex hull of the point measures Inline graphic concentrated on the constant-Inline graphic configurations Inline graphic.

Time till absorption for cellular Potts models

A CPM's development towards absorption proceeds from an initial state where all cell indices of Inline graphic are present via the consecutive disappearance of single cell indices until the final absorbing state is reached. Therefore, the time of absorption can be estimated if the time until the first disappearance of a cell index, that is the time of the Markov chain exit time from the set Inline graphic, can be estimated. Let the random variable Inline graphic denote this time of exit from the set Inline graphic,

graphic file with name pone.0042852.e313.jpg

where Inline graphic is given by (5). Thus, Inline graphic is the time where the first CPM cell dies out.

To state the results about the distribution of Inline graphic, some additional notation is necessary. Notice that the transition matrix Inline graphic has the structure

graphic file with name pone.0042852.e318.jpg

where Inline graphic, Inline graphic and Inline graphic, if the rows and columns of Inline graphic are arranged appropriately. Consequently, it holds that

graphic file with name pone.0042852.e323.jpg

where Inline graphic and Inline graphic are the Inline graphic-th matrix power of Inline graphic and Inline graphic, respectively, and Inline graphic is some non-negative non-vanishing matrix composed from Inline graphic and Inline graphic. Further, Inline graphic shall represent a function of Inline graphic such that there exist Inline graphic, with Inline graphic for all sufficiently large Inline graphic.

Proposition 3

There exists a real eigenvalue Inline graphic of Inline graphic such that Inline graphic for any other eigenvalue Inline graphic of Inline graphic. Moreover, the left eigenvector Inline graphic and the right eigenvector Inline graphic associated with Inline graphic can be chosen positive and such that Inline graphic, where Inline graphic denotes the transpose of a vector Inline graphic. Suppose that Inline graphic is an eigenvalue of Inline graphic with multiplicity Inline graphic such that Inline graphic for all other eigenvalues which are different from Inline graphic and Inline graphic. Then it holds that

graphic file with name pone.0042852.e354.jpg (11)

Proof

The matrix Inline graphic is substochastic and primitive. The latter property means that Inline graphic is both irreducible and aperiodic ([43], Def.6.1.2). Therefore, the Perron-Frobenius Theorem can be applied ([43], Thm.6.1.1), which proves the assertion.

A direct consequence is the following theorem.

Theorem 2

The distribution of Inline graphic is approximately geometric with parameter Inline graphic, that is

graphic file with name pone.0042852.e359.jpg (12)

where Inline graphic is the eigenvalue of Inline graphic which is the largest in absolute value.

Proof

Define Inline graphic and Inline graphic. The distribution of Inline graphic is computed by

graphic file with name pone.0042852.e365.jpg
graphic file with name pone.0042852.e366.jpg
graphic file with name pone.0042852.e367.jpg
graphic file with name pone.0042852.e368.jpg

Inline graphic, where the latter equality follows from (11). Since Inline graphic is a constant independent of Inline graphic which does not vanish, the assertion is proven.

Thus, the path towards absorption is as follows. If a CPM started on Inline graphic leaves this set, it runs next into one the sets Inline graphic, Inline graphic. Almost surely, this happens within a finite number of time steps. The distribution of the exit time is approximately geometric. Once the chain has entered a set Inline graphic with Inline graphic, it cannot go back to Inline graphic. It stays in Inline graphic until the next cell dies out. In this way it successively visits classes Inline graphic where Inline graphic. The differences between two consecutive classes are singletons. The set Inline graphic is absorbing, since it consists only of one constant configuration.

The time until absorption can therefore be estimated by consecutive application of Theorem 2. Notice that the future evolution of a CPM which has already entered a class Inline graphic with Inline graphic is the same as that of a CPM restricted to Inline graphic. To restrict a CPM to the set Inline graphic, consider the restriction Inline graphic of the original Hamiltonian and perform the modified Metropolis algorithm on Inline graphic. It turns out that the associated transition probabilities are described by Inline graphic. Consequently, the time of exit from Inline graphic is again approximately geometrically distributed and the parameter of this distribution can be obtained as the leading eigenvalue of Inline graphic by applying the above arguments to Inline graphic.

Locality of transition rates

The locality conditions (2-I) and (2-II) stated above represent a slightly informal interpretation of the finite range conditions for interacting particle systems (IPS) as defined in ([29], Def.I.4.17). Notice that IPS are continuous-time models while CPM evolve in discrete time steps. However, as explained in the section Utility for spatio-temporal limit procedures, a continuous-time Markov chain can be constructed from a temporally discrete chain by choosing the rate matrix Inline graphic according to Inline graphic. The matrix entries of Inline graphic and Inline graphic differ only in their diagonal entries. The diagonal entries, however, are determined by the off-diagonal elements, since, for stochastic and rate matrices, the row sums are equal to one and zero, respectively. This implies that the locality conditions, which are stated precisely in the following, can be examined on the basis of the transition matrix Inline graphic as well as on the basis of the corresponding rate matrix Inline graphic.

Conditions (2-I') and (2-I'') as stated below are exactly the finite range conditions for IPS ([29], Def.I.4.17), however, they are adapted to the notation of this paper. Notice that locality conditions are important for ensuring the existence of a process on spatially unbounded lattices. Therefore, when considering a mechanism on a finite lattice which shall be extended to an infinite lattice, it is essential that the constants Inline graphic and Inline graphic in (2-I') and (2-I''), respectively, are independent from the lattice size. In the following, Condition (2-I) and (2-II) shall be formalized. For this, define the diameter of a set by Inline graphic, with Inline graphic denoting the metric on Inline graphic which is induced by the Euclidian norm on Inline graphic. The set of points where two configurations Inline graphic differ is given by . Further, denote by Inline graphic the distance of to sets. Then the precise locality conditions are as follows.

(2-I') There is a Inline graphic such that Inline graphic, unless Inline graphic.

(2-II') There is a Inline graphic such that Inline graphic for all Inline graphic with Inline graphic.

Proposition 4

(1) A CPM mechanism satisfies condition (2-I').

(2) A CPM with volume constraint does not satisfy condition (2-II')

Proof

(1) The transition rates of the time-continuous CPM algorithm satisfy (2-I') since only one site is changed in an transition. Indeed, Inline graphic, unless Inline graphic for suitable Inline graphic with Inline graphic. Thus Inline graphic for Inline graphic with Inline graphic.

(2) The rate for a transition Inline graphic is a function of Inline graphic,

graphic file with name pone.0042852.e422.jpg

where Inline graphic. Thus the locality properties of Inline graphic depend on the properties of Inline graphic. The typical structure of a CPM-Hamiltonian as given in (4) includes a volume constraint Inline graphic with

graphic file with name pone.0042852.e427.jpg

The difference Inline graphic must be calculated to determine the transition rate for a cell-index change at location Inline graphic. Due to the quadratic term, this difference depends explicitly on the volumes Inline graphic of the affected cells Inline graphic and not solely on the volumes' increase or decrease. However, to determine the volume of a cell in a CPM, it is not enough to scan an appropriate neighborhood of Inline graphic. Therefore, the constant Inline graphic in condition (2-II') would depend on the lattice size, which means that (2-II') is not satisfied.

Two remarks are in order. First, it is easy to see by the above arguments, that any mechanism where the transition probabilities can be determined only if the configuration on the whole lattice is examined. This applies, in particular, to a surface constraint, but also to some proliferation or shape control mechanisms of CPMs. Second, the rates would be local, if the Hamiltonian Inline graphic were constructed solely from a finite range potential. The latter is a family Inline graphic of functions

graphic file with name pone.0042852.e436.jpg

which satisfy for each Inline graphic

(I) Inline graphic if Inline graphic and

(II) Inline graphic for all Inline graphic with Inline graphic.

Notice that the constant must be independent of the lattice size, if a potential on a finite lattice is studied. Given such a potential, a Hamiltonian can be constructed via

graphic file with name pone.0042852.e443.jpg

The Hamiltonians that are used in statistical physics are usually constructed from finite range potentials. Notice that the surface interaction term Inline graphic of a CPM can be derived from the finite range potential

graphic file with name pone.0042852.e445.jpg

However, as soon as there is a non-local function like the volume constraint added, the CPM does not satisfy (2-II'). Actually, the locality condition on both the transition rates of an IPS as well as the finite range condition for the potential can be slightly relaxed ([19], Thm.I.3.9). Nevertheless the CPM does not fall into this model class.

Acknowledgments

Discussions with Andreas Deutsch, Walter de Back, Jörn Starruß and Lutz Brusch are gratefully acknowledged.

Funding Statement

This study was supported by the German Federal Ministry of Education and Research (Virtual Liver Network). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Madras NN (2002) Lectures on Monte Carlo methods. Fields Institute monographs. American Mathematical Society.
  • 2. Glazier JA, Graner F (1993) Simulation of the differential adhesion driven rearrangement of biological cells. Phys Rev E 47: 2128–2154. [DOI] [PubMed] [Google Scholar]
  • 3.Glazier JA, Balter A, Poplawski NJ (2007) Magnetization to morphogenesis: A brief history of the glazier-graner-hogeweg model. In A R A Anderson, M A J Chaplain, and K A Rejniak, editors, Single Cell-Based Models in Biology and Medicine, Mathematics and Biosciences in Interaction: 79–106.
  • 4. Käfer J, Hayashi T, Marée AF, Carthew RW, Graner F (2007) Cell adhesion and cortex contractility determine cell patterning in the drosophila retina. PNAS 104: 18549–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Krieg M, Arboleda-Estudillo Y, Puech PH, Käfer J, Graner F, et al. (2008) Tensile forces govern germ-layer organization in zebrafish. Nat Cell Biol 10: 429–436. [DOI] [PubMed] [Google Scholar]
  • 6. Merks RMH, Glazier JA (2005) A cell-centered approach to developmental biology. Physica A 352: 113–130. [Google Scholar]
  • 7. Ouchi NB, Glazier JA, Rieu J, Upadhyaya A, Sawada Y (2003) Improving the realism of the cellular Potts model in simulations of biological cells. Physica A 329: 451–458. [Google Scholar]
  • 8. Käfer J, Hogeweg P, Marée AFM (2006) Moving forward moving backward: Directional sorting of chemotactic cells due to size and adhesion differences. PLoS Comput Biol 2: e56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Köhn-Luque A, de Back W, Starruß J, Mattiotti A, Deutsch A, et al. (2011) Early embryonic vascular patterning by matrix-mediated paracrine signalling: A mathematical model study. PLoS ONE 6: e24175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Merks RMH, Perryn ED, Shirinifard A, Glazier JA (2008) Contact-inhibited chemotaxis in de novo and sprouting blood-vessel growth. PLoS Comp Biol 4: e1000163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Sottoriva A, Vermeulen L, Tavaré S (2011) Modeling evolutionary dynamics of epigenetic mutations in hierarchically organized tumors. PLoS Comput Biol 7: e1001132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Starruss J, Bley T, Sogaard-Andersen L, Deutsch A (2007) A new mechanism for collective migration in myxococcus xanthus. J Stat Phys 128: 269–286. [Google Scholar]
  • 13. Savill N, Sherratt JA (2003) The control of epidermal stem cell clusters by Notch mediated lateral induction. Dev Biol 258: 141–153. [DOI] [PubMed] [Google Scholar]
  • 14. Szabo A, Ünnep R, Mehes E, Twal WO, Argraves WS, et al. (2010) Collective cell motion in endothelial monolayers. Phys Biol 7: 046007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Zhang Y, Thomas GL, Swat M, Shirinifard A, Glazier JA (2011) Computer simulations. of cell sorting due to differential adhesion. PLoS ONE 6: e24999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Nagai T, Kawasaki K, Nakamura K (1988) Vertex dynamics of two-dimensional cellular patterns. J Phys Soc Jpn 57: 2221–2224. [Google Scholar]
  • 17. Weliky M, Oster G (1990) The mechanical basis of cell rearrangement. Development 109: 373–386. [DOI] [PubMed] [Google Scholar]
  • 18. Landsberg KP, Farhadifar R, Ranft J, Umetsu D, Widmann TJ, et al. (2009) Increased cell bond tension governs cell sorting at the drosophila anteroposterior compartment boundary. Curr Biol 19: 1950–1955. [DOI] [PubMed] [Google Scholar]
  • 19. Newman TJ (2005) Modeling multicellular systems using subcellular elements. Math Biosci Eng 2: 611–622. [DOI] [PubMed] [Google Scholar]
  • 20.Sandersius SA, Weijer CJ, Newman TJ (2011) Emergent cell and tissue dynamics from subcellular modeling of active biomechanical processes. Phys Biol 8: 045007+. [DOI] [PubMed]
  • 21. Marée AFM, Jilkine A, Dawes A, Grieneisen VA, Edelstein-Keshet L (2006) Polarization and movement of keratocytes: A multiscale modelling approach. Bull Math Biol 68: 1169–1211. [DOI] [PubMed] [Google Scholar]
  • 22. Hogeweg P (2000) Evolving mechanisms of morphogenesis: on the interplay between differential adhesion and cell di_erentiation. J Theor Biol 203: 317–333. [DOI] [PubMed] [Google Scholar]
  • 23. Zajac M, Jones GL, Glazier JA (2003) Simulating convergent extension by way of anisotropic differential adhesion. J Theor Biol 222: 247–259. [DOI] [PubMed] [Google Scholar]
  • 24.Balter A, Merks RMH, Poplawski NJ, Swat M, Glazier JA (2007) The Glazier-Graner-Hogeweg model: Extensions, future directions, and opportunities for further study. In A R A Anderson, M A J Chaplain, and K A Rejniak, editors, Single Cell-Based Models in Biology and Medicine, Mathematics and Biosciences in Interaction : 151–167.
  • 25. Saville NJ, Hogeweg P (1997) Modelling morphogenesis: From single cells to crawling slugs. J Theor Biol 184: 229–235. [DOI] [PubMed] [Google Scholar]
  • 26. Voss-Böhme A (2009) Gibbsian characterization for the reversible measures of interacting particle systems. Mark Proc Rel Fields 15: 441–476. [Google Scholar]
  • 27. Suzuki Y (1991) Invariant measures for the multitype voter model. Tokyo J Math 14: 61–72. [Google Scholar]
  • 28.Starruß J (priv. communication).
  • 29.Liggett TM (1985) Interacting Particle Systems. Springer.
  • 30.Liggett TM (1999) Stochastic interacting systems: contact, voter and exclusion processes. Springer.
  • 31. Cox JT, Greven A (1990) On the long term behavior of some finite particle systems. Probab Th Rel Fields 85: 195–237. [Google Scholar]
  • 32.DeMasi A, Presutti E (1991) Mathematical methods for hydrodynamic limits. Springer.
  • 33. Varadhan SRS, Yau H (1997) Di_usive limit of lattice gas with mixing conditions. Asian J Math 1: 623–678. [Google Scholar]
  • 34. Nakajima A, Ishihara S (2011) Kinetics of the cellular potts model revisited. New Journal of Physics 13: 033035. [Google Scholar]
  • 35. Alber M, Chen N, Lushnikov PM, Newman SA (2007) Continuous macroscopic limit of a discrete stochastic model for interaction of living cells. Phys Rev Lett 99: 168102. [DOI] [PubMed] [Google Scholar]
  • 36. Turner S, Sherratt JA, Painter KJ, Savill NJ (2004) From a discrete to a continuous model of biological cell movement. Phys Rev E 69: 021910. [DOI] [PubMed] [Google Scholar]
  • 37. Harrison NC, Diez del Corral R, Vasiev B (2011) Coordination of cell di_erentiation and migration in mathematical models of caudal embryonic axis extension. PLoS ONE 6: e22700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Deroulers C, Aubert M, Badoual M, Grammaticos B (2009) Modeling tumor cell migration: From microscopic to macroscopic models. Phys Rev E 79: 031917. [DOI] [PubMed] [Google Scholar]
  • 39. Simpson MJ, Towne C, McElwain DLS, Upton Z (2010) Migration of breast cancer cells: Understanding the roles of volume exclusion and cell-to-cell adhesion. Phys Rev E 82: 041901. [DOI] [PubMed] [Google Scholar]
  • 40. Voss-Böhme A, Deutsch A (2010) On the cellular basis of cell sorting kinetics. J Theor Biol 263: 419–436. [DOI] [PubMed] [Google Scholar]
  • 41.Marée AFM, Grieneisen VA, Hogeweg P (2007) The cellular Potts model and biophysical properties of cells, tissues and morphogenesis. In A R A Anderson, M A J Chaplain, and K A Rejniak, editors, Single Cell-Based Models in Biology and Medicine, Mathematics and Biosciences in Interaction : 107–136.
  • 42. Merks RMH, Koolwijk P (2009) Modeling morphogenesis in silico and in vitro: towards quantitative, predictive, cell-based modeling. Math Model Nat Phenom 4: 149–171. [Google Scholar]
  • 43.Brémaud P (1999) Markov chains. Gibbs fields, Monte Carlo simulation and queues. Springer.
  • 44.Behrends E (2000) Introduction to Markov chains. With special emphasis on rapid mixing. Advanced Lectures in Mathematics. Vieweg.

Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES