Abstract
We examine the practical identifiability of parameters in a spatio-temporal reaction–diffusion model of a scratch assay. Experimental data involve fluorescent cell cycle labels, providing spatial information about cell position and temporal information about the cell cycle phase. Cell cycle labelling is incorporated into the reaction–diffusion model by treating the total population as two interacting subpopulations. Practical identifiability is examined using a Bayesian Markov chain Monte Carlo (MCMC) framework, confirming that the parameters are identifiable when we assume the diffusivities of the subpopulations are identical, but that the parameters are practically non-identifiable when we allow the diffusivities to be distinct. We also assess practical identifiability using a profile likelihood approach, providing similar results to MCMC with the advantage of being an order of magnitude faster to compute. Therefore, we suggest that the profile likelihood ought to be adopted as a screening tool to assess practical identifiability before MCMC computations are performed.
Keywords: identifiability analysis, Bayesian inference, profile likelihood, reaction–diffusion, cell cycle
1. Introduction
Combined cell migration and cell proliferation leads to moving fronts of cells, often called cell invasion [1], which is essential for tissue repair and wound healing [2]. While sophisticated experimental techniques are continually developed to interrogate cell invasion, traditional scratch assays remain widely used because they are simple, fast and inexpensive [3]. Scratch assays involve growing (figure 1a,b) and then scratching (figure 1c) a cell monolayer, and imaging a small region around the scratch as the wounded area closes (figure 1d,e).
Figure 1.
Experiments in a six-well plate (a) are initiated with a uniform cell monolayer (b). A scratch is made (c), and the 1296 μm × 1745 μm field of view is observed between t = 0 h (d) and t = 48 h (e). The cell cycle is labelled using fluorescent ubiquitination-based cell cycle indicator (FUCCI) [4] so that cells in G1 phase fluoresce red and cells in S/G2/M phase fluoresce green (f). A freely cycling cell in G1 phase (red) will transition into S/G2/M phase (green) at rate kr > 0. A freely cycling cell in S/G2/M phase (green) will divide into two cells in G1 phase (red) at rate kg > 0, provided there is sufficient space to accommodate division (g). Images of the field of view at t = 0, 16, 32 and 48 h (h–k), with the non-dimensional densities of G1 (red) and S/G2/M (green) subpopulations shown as a function of space and time (l–o). Scale bars in (h–k) show 200 μm.
The first mathematical models to interpret experimental observations of cell invasion were reaction–diffusion models [5]. Sherratt & Murray [6] modelled a wound-healing-type assay using the Fisher–Kolmogorov model, and many subsequent studies have related solutions of mathematical models to experimental observations, including Maini et al. [2], Sengers et al. [7] and Nardini et al. [8]. By matching the solution of a mathematical model with certain experimental observations, these studies provide both parameter estimates and mechanistic insight.
One challenge of using scratch assays is that there is no standard, widely accepted experimental protocol. There are many differences in: (i) the initial monolayer density [9]; (ii) the width and shape of the scratch [1,10]; and (iii) the experimental time scale [3]. These differences make it difficult to compare different experiments, and it is unclear whether different mathematical models of varying complexity (i.e. varying numbers of unknown parameters or varying the model structure) are required under different experimental protocols.
It is reasonable to argue that simple experimental data ought to be modelled using simple mathematical models with a small number of unknown parameters, and that more sophisticated mathematical models with additional parameters should be used only when greater amounts of experimental data are available. However, it is difficult to use this kind of qualitative reasoning to justify particular decisions about how complicated a mathematical model ought to be. Methods of identifiability analysis [11–21] provide more systematic means to determine the appropriate model complexity relative to available data. A model is identifiable when distinct parameter values imply distinct distributions of observations, and hence when it is possible to uniquely determine the model parameters using an infinite amount of ideal data [22–25]. In the systems biology literature that focuses on models formulated as ordinary differential equations, identifiability has also been referred to as structural identifiability [19,20]. Working with ordinary differential equations that describe temporal processes, it is possible to formally analyse the structural identifiability [19,20,26,27]. By contrast, here we work with reaction–diffusion partial differential equations since we are interested in spatio-temporal processes. In this case, formal analysis of structural identifiability is more difficult and so we explore structural identifiability numerically (electronic supplementary material).
In contrast to predictive performance measures, such as Akaike’s information criterion (AIC) [28], identifiability analysis focuses on distinguishing parameter values and, as a result, understanding underlying mechanisms. Shmueli [29] characterizes this general difference as the choice of whether to ‘explain or predict’. The formal definition of identifiability requires an infinite amount of ideal data. Thus, in addition to the strict technical concept of (structural) identifiability, the terms practical identifiability [16] and estimability [22,25] are used to describe whether it is possible to provide reasonably precise estimates using finite, non-ideal data.
Since practical identifiability is a finite, non-ideal data concept, it depends more strongly on the inferential framework adopted and is less clearly defined in the literature than structural identifiability. We consider practical identifiability analysis to encompass a set of precision-based pre-data planning and post-data analysis tools similar to those advocated by Cox [30], Bland [31] and Rothman & Greenland [32]. This is consistent with the systems biology literature [11–18,21], where it is typically characterized in terms of the ability to provide finite, sufficiently precise interval estimates at various levels of confidence or credibility given finite amounts of non-ideal, real data. This is the approach we take here; we consider both the typical precision achieved under repeated simulation of ideal synthetic data (electronic supplementary material) and the post-data, realized precision given real data. It is important to note that, if a model is found to be practically non-identifiable, this does not mean that the model is incorrect. Instead it simply indicates that additional model complexity is not required, and that such a model would be difficult to falsify in realistic experiments.
In systems biology, experimental data often take the form of time series describing temporal variations of molecular species in a chemical reaction network, and these data are modelled using ordinary or stochastic differential equations [33]. Here, we focus on applying methods of practical identifiability analysis to spatio-temporal partial differential equation models, which represent both spatial and temporal variations in quantities of interest. It is instructive to consider both a Bayesian inference framework [14,15] and profile-likelihood analysis [16,17].
Bayesian inference is widely adopted within the mathematical biology community [34–39]. A drawback of the Bayesian approach is that it can be computationally expensive, particularly when Markov chain Monte Carlo (MCMC) methods are used to sample the distributions of interest. By contrast, profile likelihood analysis is a more standard tool of statistical inference [30,40], but is far less familiar within the mathematical biology community. Being optimization-based, profile likelihood can be computationally inexpensive.
Here, we consider data from a scratch assay where cells are labelled using two fluorescent probes to show real-time progression through the cell cycle [4]. Such labelling allows us to describe the total population of cells as two subpopulations according to the two cell cycle labels [41]. This experimental approach provides far more information than a standard scratch assay without cell cycle labels. We model the experiments using a system of reaction–diffusion equations that are an extension of the Fisher–Kolmogorov model. Using these data and this model, we explore the practical identifiability of the model parameters using both Bayesian inference and profile likelihood.
One of the main outcomes of this work is that, instead of performing potentially computationally expensive Bayesian inference to assess practical identifiability and determine parameter estimates, we show that a profile likelihood analysis can provide a fast and reliable preliminary assessment of practical identifiability. Both analysis methods provide good insight into the identifiability of the spatio-temporal models considered here. In summary, we consider minimal model scratch assay data generated using fluorescent cell cycle labelling and show that it is practically identifiable, whereas a simple extension of that model is not, despite the fact that numerical experiments indicate that it is structurally identifiable in the limit of infinite data. In all cases, the Bayesian analysis is consistent with profile likelihood results. However, the profile likelihood analysis is an order of magnitude faster to implement.
2. Experimental data
We consider a scratch assay performed with the 1205Lu melanoma cell line with fluorescent ubiquitination-based cell cycle indicator (FUCCI) labelling [4]. The scratch assay is initiated by seeding 1205Lu melanoma cells into six-well plates and allowing sufficient time for the cells to attach, to begin to proliferate (figure 1a), and to form an approximately uniform monolayer (figure 1b). A scratch is made in the monolayer (figure 1c) and a small region of the experiment is imaged (figure 1c,d) [41]. The geometry of the experiment means that the distribution of cells in the imaged region is independent of vertical position, and so we treat the cell density as a function of horizontal position, x, and time, t [41]. The boundaries around the imaged region are not physical boundaries and cells are free to migrate across these boundaries. Since the density of cells is approximately constant away from the scratch, the net flux of cells locally across each boundary of the imaged region is approximately zero [41].
Cells in G1 phase fluoresce red and cells in S/G2/M phase fluoresce green (figure 1f). We represent FUCCI labelling by treating the total population as two subpopulations: (i) red cells in G1 and (ii) green cells in S/G2/M. Cells in G1 (red) transition into S/G2/M cells (green) at rate kr > 0. This red-to-green transition does not involve any change in cell number so we assume this transition is unaffected by the availability of physical space. Cells in S/G2/M (green) undergo mitosis at rate kg > 0 to produce two daughter cells in G1 (red) (figure 1f). Since the green-to-red transition involves the production of two daughter cells we assume this transition only occurs provided there is sufficient space.
Images of the experiment (figure 1h–k) show that cells move into the initially scratched region while simultaneously progressing through the cell cycle. At the beginning of the experiment, the scratch is approximately 500 μm wide, and by the end of the experiment the initially vacant wound space becomes occupied as cells have migrated into the gap. Visual examination of the density of cells away from the scratched region shows that the cell density increases with time, and this is driven by cell proliferation. To quantify these visual observations we note that each experimental image is 1296 μm wide, and we discretize each image into 24 equally spaced columns of width 54 μm. We count the number of red cells and green cells per column and divide by the area of the column and by the theoretical maximum packing density of [41] to give an estimate of the non-dimensional density of red cells and green cells per column. Assuming that each experimental density estimate represents the centre of each column, these data (figure 1l–o) show the spatial variation in density of G1 (red) and S/G2/M (green) cells as a function of space and time.
We summarize the experimental data using the following notation. Our observations are vectors containing the observed non-dimensional density of red cells, , and the observed non-dimensional density of green cells, , per column at each position xi and time ti, i.e.
| 2.1 |
We use an explicit superscript ‘o’ to distinguish (noisy) observations of r and g from their modelled counterparts introduced below, while we assume that the space and time coordinates x and t, respectively, are noise-free. In particular, we use
| 2.2 |
to denote the ith combined vector of observations of r and g and the ith combined vector of spatio-temporal coordinates x and t, respectively. Observations are taken at 24 equally spaced spatial locations, where the spacing between the observations is 54 μm, at four equally spaced time points, t = 0, 16, 32 and 4 h. Thus, yo consists of 24 × 4 = 96 observations of both of r and g, giving N = 24 × 4 × 2 = 192 measurements of cell density in total. Experimental data are provided in the electronic supplementary material.
3. Mathematical model
We treat our mathematical model as having two components: (i) a deterministic spatio-temporal process model governing the dynamics of the experiment and (ii) a probabilistic observation model. This approach invokes the reasonable assumption that observations are noisy versions of a deterministic latent model [14]. We describe both models and note that the same model components are used in the Bayesian and the profile likelihood analyses.
3.1. Process model
A reasonably simple process model for the FUCCI scratch assay can be written as
| 3.1 |
and
| 3.2 |
where Dr > 0 is the diffusivity of cells in G1, Dg > 0 is the diffusivity of cells in S/G2/M, kr > 0 is the rate at which cells in G1 transition into S/G2/M, and kg > 0 is the rate at which cells in S/G2/M undergo mitosis to produce two cells in G1 [41]. The solution of equations (3.1) and (3.2), r(x, t) and g(x, t), are taken to represent the (continuous) underlying mean densities, while represent noisy observations of these mean densities at the measurement locations si = (xi, ti).
The process model has four parameters: θ = (Dr, Dg, kr, kg). Given suitable initial conditions, r(x, 0) and g(x, 0), and parameter values θ, equations (3.1) and (3.2) are solved numerically (electronic supplementary material).
Despite the abundance of experimental data in figure 1 and the apparent simplicity of equations (3.1) and (3.2), it is unclear whether these experimental data are sufficient to estimate the four parameters: θ = (Dr, Dg, kr, kg). Therefore, we also consider a simpler model whereby we set D = Dr = Dg, implying that cells in G1 diffuse at the same rate as cells in S/G2/M. This simpler model is characterized by three parameters: θ = (D, kr, kg).
3.2. Observation model
We assume that observations are noisy versions of the latent model solutions, y(si, θ). A standard approach in the mathematical biology literature is to further assume that this observation error is independent and identically distributed (iid), and that this noise is additive and normally distributed, with zero mean and common variance σ2 [9,14]. Therefore, we have
| 3.3 |
where I is the 2 × 2 identity matrix. Since our main interest is in θ = (Dr, Dg, kr, kg), we directly specify σ = 0.05 [9,14], and we will discuss this choice later.
4. Practical identifiability analysis
We consider two approaches for practical identifiability analysis. The first is based on Bayesian inference and MCMC, and follows the approach outlined by Hines et al. [14]. The second approach is based on profile likelihood and follows Raue et al. [16]. Both approaches have been considered in the context of identifiability of non-spatial models, and, as we will show, the same ideas can deal with models of spatio-temporal processes.
As discussed by Raue et al. [42], estimation results from MCMC and profile likelihood are typically similar in the case of identifiable parameters (and in the low-dimensional setting), but can be very different in the presence of non-identifiability. In particular, Raue et al. [42] demonstrate that results from MCMC can be potentially misleading in the presence of non-identifiability. On the other hand, Hines et al. [14] and Seikmann et al. [15] argue that appropriate MCMC diagnostics can indicate the presence of non-identifiability. Hence we use both methods and consider the extent to which they agree.
4.1. Parameter bounds
Before outlining the identifiability methodologies, we note that the parameters in equations (3.1) and (3.2) have a physical interpretation and we can formulate some biologically motivated bounds. Practical identifiability, given observed data, can be evaluated by comparing the width of (realized) interval estimates relative to these simple bounds.
Previous estimates of melanoma cell diffusivity in similar experiments found that typical values are often less than 1000 μm2 h−1 [43], so we take conservative bounds on Dr and Dg to be 0 < Dr, Dg < 2000 μm2 h−1. Bounds on kr and kg can be inferred from previous experimental measurements. The duration of time 1205Lu melanoma cells remain in the G1 phase varies between 8 and 30 h, whereas the time that 1205Lu melanoma cells remain in the S/G2/M phase varies between 8 and 17 h [4]. These measurements imply 0.033 < kr < 0.125/h and 0.059 < kg < 0.125/h, so we take conservative bounds on kr and kg to be 0 < kr, kg < 0.2/h.
4.2. Bayesian inference
Following Hines et al. [14], we take a Bayesian approach to assess parameter identifiability. Bayesian inference relies on Bayes’ theorem, written here as
| 4.1 |
where p(yo|θ) is the likelihood function, p(θ) is the prior and p(θ| yo) is the posterior. The posterior distribution is the inferential target; this summarizes the information about the parameter θ in light of the observed data, yo, and the prior information specified by p(θ). The likelihood function represents the information contributed by the data, and corresponds to the probabilistic observation model in (3.3) evaluated at the observed data.
We explore the parameter space θ by sampling its posterior distribution using a Metropolis–Hastings MCMC algorithm [14]. The Markov chain starts at position θi, and a potential move to θ* is accepted with probability α,
| 4.2 |
This Markov chain tends to move towards regions of high posterior probability, but it also provides a mechanism to move away from local minima by allowing transitions to regions of lower posterior probability. The Markov chain produced by this algorithm explores the parameter space in proportion to the posterior probability and provides a finite number of independent, identically distributed samples from the posterior distribution. Proposals in the MCMC algorithm are made by sampling a multivariate normal distribution with zero mean and covariance matrix Σ. When θ = (Dr, Dg, kr, kg) we specify Σ = diag(102, 102, 10−6, 10−6), whereas when θ = (D, kr, kg) we specify Σ = diag(102, 10−6, 10−6).
Poor identifiability in a Bayesian setting using MCMC is typically characterized by poorly converging chains, label-switching, multimodal or overly broad distributions, and similar phenomena that can be diagnosed either graphically or by computing various diagnostic statistics [14,15]. We discuss these diagnostic methods in more detail in the Results and discussion section.
4.3. Profile likelihood
Here we describe how a profile likelihood identifiability analysis can be undertaken. This is based on the same likelihood as above, p(yo|θ), but here we will present results in terms of the normalized likelihood function, denoted
| 4.3 |
This is considered as a function of θ for fixed data yo. It is also common to take the likelihood as only defined up to a constant, but here we fix the proportionality constant by presenting results in terms of the normalized likelihood function as above.
We assume our full parameter θ can be partitioned into an interest parameter ψ and a nuisance parameter λ, i.e. θ = (ψ, λ). These can also be considered functions ψ(θ) and λ(θ) of the full parameter θ, specifying the partition components. Here we will restrict attention to a scalar interest parameter and a vector nuisance parameter, i.e. multiple scalar nuisance parameters. Then the profile likelihood for the interest parameter ψ can be written as [30,40]
| 4.4 |
where λ is ‘optimized out’ for each value of ψ, and this implicitly defines a function λ*(ψ) of optimal λ-values for each value of ψ. For example, given the full parameter θ = (Dr, Dg, kr, kg), we may consider the diffusion constant of red cells as the interest parameter and the other parameters as nuisance parameters, i.e. ψ(θ) = Dr and λ(θ) = (Dg, kr, kg). This would give
| 4.5 |
Given the assumption of normally distributed errors, and assuming a fixed, common standard deviation σ, profiling reduces to solving a series of nonlinear least-squares problems, one for each value of the parameter of interest, by using log-density functions. Additional details and example calculations are provided in the electronic supplementary material.
We implement this optimization using Matlab’s lsqnonlin function with the ‘trust-region-reflective’ algorithm to implement bound constraints [44]. For each value of the interest parameter, taken over a sufficiently fine grid, the nuisance parameter is optimized out and the previous optimal value is used as the starting guess for the next optimization problem. Uniformly spaced grids of 50 points, defined between previously discussed lower and upper bounds, are used. Results are plotted in terms of the normalized profile likelihood functions; cubic interpolations are used to define the profiles for all parameter values. Solutions did not exhibit any notable dependence on initial guesses.
The likelihood function is often characterized as representing the information that the data contain about the parameters, and the relative likelihood for different parameter values as indicating the relative evidence for these parameter values [40,45,46]. As such, a flat profile is indicative of non-identifiability, therefore a lack of information in the data about a parameter [16]. In general, the degree of curvature is related to the inferential precision [16,17,42]. Likelihood-based confidence intervals can be formed by choosing a threshold-relative profile likelihood value, which can be approximately calibrated via the χ2-distribution (or via simulation). We use a threshold of 0.15 as a reference, which corresponds to an approximate 95% confidence interval for sufficiently regular problems [40]. The points of intersection were determined using the interpolated profile likelihood functions.
5. Results and discussion
Here we consider the results of the above methods of identifiability analysis under the two scenarios introduced previously. In scenario 1, we consider equations (3.1) and (3.2) under the assumption that both subpopulations have the same diffusivity, Dr = Dg = D, so that θ = (D, kr, kg). In scenario 2, we consider the same model without this assumption so that θ = (Dr, Dg, kr, kg). We first consider the Bayesian analysis using MCMC, and then the profile likelihood analysis. All units of diffusivities are micrometres squared per hour and all units for rate parameters are per hour.
5.1. Bayesian analysis
As we are focused on the question of identifiability, here we use uniform priors on all parameters. This is a natural choice when we are interested in identifiability since we want to focus on the information about the parameter values inherent within the data rather than from some imposed prior [14].
In the first scenario where Dr = Dg = D, we see rapid convergence of the Markov chain for D, kr and kg (figure 2a–c). Importantly, after the Markov chain moves away from the initial location, θ0, it remains within the previously stated conservative bounds. Additional results in the electronic supplementary material confirm similar results for different θ0. A plot matrix representation of the univariate and bivariate marginal densities is given (figure 2d–i) [47], where the univariate marginals (figure 2d,g,i) are unimodal and approximately symmetric. The univariate posterior modes are , confirming that each component lies within the biologically feasible range, and the univariate 95% credible intervals are relatively narrow. A posterior predictive check [47] with 30 randomly sampled parameter choices from the converged region of the Markov chain provides visual confidence in the MCMC results and the appropriateness of the model (figure 2j–m) [47]. Additional comparison of the experimental data and solution of the model parametrized with also confirms this (electronic supplementary material). In summary, setting D = Dr = Dg, our model parameters appear to be identifiable according to the Bayesian approach.
Figure 2.
Typical Markov chain iterations, of length 51 000, for D, kr and kg in (a)–(c), respectively. In this case, the Markov chain is initiated with θ0 = (500, 0.05, 0.05). Results in (d)–(i) show a plot matrix representation of the univariate marginals and bivariate marginals estimated using the final 50 000 iterations of the Markov chain in (a)–(c). For the univariate distribution, the posterior modes are , and , and the 95% credible intervals are D ∈ [455, 1007], kr ∈ [0.0187, 0.0362] and kg ∈ [0.0628, 0.0727]. In the univariate marginals, the 95% credible intervals are shown in red dashed vertical lines; in the bivariate marginals the region of maximum density is shown in the darkest shade. Results in (j)–(m) superimpose 30 solutions of equations (3.1) and (3.2) where θ is randomly sampled from the Markov chain. MCMC results use σ = 0.05.
In the second scenario, where Dr ≠ Dg, the Markov chains for Dr, Dg, kr and kg (figure 3a–d) are very different from the first scenario. The Markov chains for kr and kg remain within biologically feasible bounds. However, the Markov chains for Dr and Dg often move beyond the previously stated conservative bounds with no obvious convergence. Additional results for different choices of θ0 (electronic supplementary material) indicate that these poor results for Dr and Dg are consistent across a number of choices of θ0. A plot matrix representation confirms that we obtain well-behaved posterior distributions for kr and kg, but that the results for Dr and Dg are problematic since the univariate posterior distributions are not unimodal and the 95% credible intervals are relatively wide (figure 3e–n). A posterior predictive check with 30 randomly sampled parameter choices from the latter part of the Markov chain are provided for completeness (figure 3o–r). The posterior predictive check compares model predictions with the experimental data, with additional comparisons using the posterior median (electronic supplementary material). However, these comparisons are less useful since they are based on parameters arising from a Markov chain that has not settled to a well-defined posterior. In summary, without assuming Dr = Dg, our model parameters are practically non-identifiable according to the Bayesian approach.
Figure 3.
Typical Markov chain iterations, of length 51 000, for Dr, Dg, kr and kg in (a)–(d), respectively. In this case, the Markov chain is initiated with θ0 = (228, 803, 0.0334, 0.0790). Results in (e)–(n) show a plot matrix representation of the univariate marginals and bivariate marginals estimated using the final 50 000 iterations of the Markov chain in (a)–(d). For the univariate distribution the posterior modes are , , and . The 95% credible intervals are Dr ∈ [86, 1100], Dg ∈ [104, 1939], kr ∈ [0.0188, 0.0371] and kg ∈ [0.0617, 0.0863]. In the univariate marginals, the 95% credible intervals are shown in red vertical dashed lines; in the bivariate marginals the region of maximum density is shown in the darkest shade. Results in (o)–(r) superimpose 30 solutions of equations (3.1) and (3.2) where θ is randomly sampled from the Markov chain. MCMC results use σ = 0.05.
5.2. Profile likelihood analysis
Here we consider the results from our profile likelihood analysis. Each profile corresponds to taking a particular parameter as the interest parameter and the remaining parameters as nuisance parameters to be optimized out. More general partitions into an interest parameter and nuisance parameters are also considered in the electronic supplementary material. Here we present results for scenario 1 (figure 4a–c) and results for scenario 2 (figure 4d–f).
Figure 4.
Profile likelihoods. Each profile likelihood is computed using a grid of 50 equally spaced points; cubic interpolation is used for display and to determine confidence intervals. Approximate 95% confidence intervals are indicated based on a relative likelihood threshold of 0.15 [40]. Results for the first scenario where Dr = Dg = D (a–c), and results for the second scenario where Dr ≠ Dg (d–f). In (d), the profile likelihood for Dg and Dr are in red (solid) and green (dashed), respectively. Interval estimates are: D ∈ [439, 1018], kr ∈ [0.0212, 0.0366], kg ∈ [0.0601, 0.0853] for D = Dr = Dg. Maximum-likelihood estimates are: D*, , = 691, 0.0289, 0.0719 for D = Dr = Dg. Interval estimates are: Dr ∈ [1.4, 1647], Dg ∈ [0, 2000] (i.e. the whole interval), kr ∈ [0.0209, 0.0373], kg ∈ [0.0597, 0.0861] for Dr ≠ Dg. Maximum-likelihood estimates are: , , , = 612, 842, 0.0290, 0.0717 for Dr ≠ Dg.
In the first scenario where Dr = Dg = D, we see regular-shaped profiles with clearly defined peaks for D, kr and kg (figure 4a–c). These results are very similar to those obtained from the Bayesian analysis using MCMC, in terms of both interval estimates and maximum-likelihood estimates. The maximum-likelihood estimates are .
In the second scenario, where Dr ≠ Dg, we see very similar estimates for kr and kg. The diffusivities, Dr and Dg, are much less well determined, however. In particular, the profile-likelihood-based confidence interval for Dg covers the entire range of the previously stated conservative bounds, while that for Dr only excludes a negligible part of this region. Thus, while kr and kg are well identified, Dr and Dg are not. These results are again very similar to those obtained from the Bayesian analysis using MCMC, though the diffusivity estimates are even more conservative here. The maximum-likelihood estimates are . We also generated profiles for the difference Dr − Dg (electronic supplementary material). The associated interval encompasses a broad range of values, including zero, but also includes very large positive and negative values. This is consistent with a lack of practical identifiability of Dr − Dg. Despite the fact that the diffusivities are practically non-identifiable with our relatively abundant experimental data, further experimentation with synthetic data indicates that all parameters are, in fact, strictly identifiable provided sufficient data (electronic supplementary material). These additional results (electronic supplementary material) allow us to explore how the choice of data resolution affects practical identifiability.
In both cases, the results of the profile likelihood analysis are consistent with the MCMC-based Bayesian analysis. However, the profiles are far cheaper to compute than full MCMC posterior distributions. For example, each profile here was computed in less than 1 min on a standard laptop, whereas the MCMC analysis took approximately 30 min of computation. Each optimization for a given value of the target parameter in the profile analysis took less than 1 s to solve, and profiles can be computed in parallel.
6. Conclusion and outlook
In this work, we assess the practical identifiability of a spatio-temporal reaction–diffusion model of a scratch assay where we have access to a relatively large amount of experimental data. In the literature, most scratch assays are reported by providing single images at the beginning and the conclusion of the experiment without making measurements of spatial distributions of cell density at multiple time points (e.g. [48,49]). By contrast, we consider detailed cell density measurements of two subpopulations at four time points with a relatively high spatial resolution of cell density at 24 spatial locations. This means that we work with 24 × 4 × 2 = 192 measurements of cell density. Given this dataset, a key decision when using a mathematical model is to determine how complicated the model ought to be. We first explored this question through a Bayesian, MCMC-based framework, and then with a profile likelihood-based approach.
MCMC-based Bayesian analysis is relatively common in the systems biology literature for temporal models [34]. Bayesian MCMC approaches for spatio-temporal models are less common, but increasingly of interest. While it is established that practical parameter identifiability can often be diagnosed using an MCMC framework, many applications of Bayesian MCMC in the mathematical biology literature never explicitly consider the question of practical identifiability [35]. Furthermore, MCMC can be misleading in the presence of true non-identifiability [42], as can other methods such as the bootstrap or Fisher-information-based approaches [17]. Even assuming that MCMC can reliably determine non-identifiability, MCMC can be computationally expensive, especially for models with large |θ|. Thus, we also assess identifiability using an optimization-based, profile likelihood approach. In contrast to previous work on temporal data and temporal processes, here we apply this approach to spatio-temporal reaction–diffusion models and spatio-temporal data. Algorithms are available on GitHub (https://github.com/ProfMJSimpson/Identifiability).
A key feature of our study is that we consider two slightly different modelling scenarios. In the first scenario, we assume D = Dr = Dg so that the diffusivity of cells in G1 phase is identical to the diffusivity of cells in S/G2/M phase. In this case, there are three unknown parameters, θ = (D, kr, kg). By contrast, for the second scenario we do not invoke this assumption, and there are four unknown parameters, θ = (Dr, Dg, kr, kg). Despite the relative abundance of experimental data and the apparent simplicity of the reaction–diffusion model, we find that the parameters are identifiable in the first scenario but are practically non-identifiable in the second.
Our results show that the profile likelihood provides similar results to the Bayesian MCMC approach, with the advantage of being an order of magnitude faster to implement. Typical MCMC results in figures 2 and 3 require approximately 30 min of computation time, whereas the profile likelihood results in figure 4 require approximately 1 min to compute on a standard laptop (Dell, Intel Core i7 processor, 2.8 GHz, 16 GB RAM). Both methods indicate practical non-identifiability of the diffusivities in the second scenario. As mentioned, however, profile likelihood has previously been shown to be more reliable in the presence of true non-identifiability [17,42]; this is an important consideration as the reaction–diffusion models we used here are already relatively simple and neglect certain mechanisms. Potential mechanisms that might be incorporated in a further extended model include cell–cell adhesion [8,50] or directed motion such as chemotaxis [51,52].
Overall, we find that MCMC provides additional information about practical identifiability but it is more expensive to compute, whereas the profile likelihood analysis provides less information but is computationally cheaper. There are situations where this additional information, such as determining if parameter estimates are correlated, is useful and in this sense the MCMC is preferred. However, there are also other situations where MCMC is challenging or computationally impossible, such as if we are dealing with models that are very expensive to simulate, or models with a much larger |θ|, but perhaps a smaller number of ‘interest’ parameters. In this case, MCMC could be computationally expensive, making the profile likelihood preferred. Our aim in this work is to present a balanced view, and not to promote one method over the other. Since the profile likelihood is often simpler and/or faster to compute, our general recommendation is that the profile likelihood makes an excellent screening tool that can provide rapid insight before more complicated MCMC results are generated, if necessary.
Visual inspection of the posterior predictive check in figure 2 shows that, while the overall comparison between the model and the experimental data is reasonable, there are some minor discrepancies, such as in the G1 (red) subpopulation near the centre of the scratch at t = 32 h, suggesting some model inadequacy. This observation is consistent with the fact that we neglect some mechanisms that could potentially improve the quality of the match. Had we taken a more standard approach without considering an identifiability analysis, we might have been tempted to extend equations (3.1) and (3.2) to improve the quality of match between the data and the solution of the model. Such extensions (e.g. adhesion, chemotaxis), while conceptually straightforward to implement, involve increasing the dimension of the parameter space. We caution against naively implementing such extensions since our analysis shows that even the simpler models we consider are not always practically identifiable even with the extensive dataset that we work with here. A profile likelihood analysis of particular, lower dimensional interest parameters within a more complex model may still be feasible, however.
There are many ways our analysis could be extended, in terms of the process model, the observation model and the experimental model. A key assumption here is that the observation model is Gaussian with a fixed variance. While this is a standard assumption [9,14], one possible extension would be to treat the observation model parameter σ as a variable to be estimated at the same time as the process model parameters θ. Another extension would be to recognize that, since we are dealing with count data, a multinomial observation model could be more appropriate. Both of these extensions could be dealt with directly in the same MCMC framework but the profile likelihood optimization problems would become slightly more complicated than simple least squares. In terms of the process model, a further extension would be to apply the same reaction–diffusion models in a different geometry. For example, data reported by Jin et al. [53] use a different experimental approach to produce square- and triangular-shaped scratches. While the same reaction–diffusion models as those used here could be used to model Jin et al.’s [53] experiments, such models would need to be solved in two spatial dimensions because of the difference in initial conditions [54]. This extension could be handled by the same MCMC and the same profile likelihood approaches without increasing the dimension of the parameter space. However, this extension would increase the computational effort required to solve the reaction–diffusion model and this would amplify the computational advantage of the profile likelihood method over the Bayesian MCMC approach. In terms of the experimental model, in this work we use cell cycle labelling with two colours, red and green. More advanced cell cycle labelling technologies, such as FUCCI4 [55,56], are now available, and these technologies label the cell cycle with four fluorescent colours [55,56]. Data from a scratch assay where cells are labelled with FUCCI4 could be interpreted with an extended system of reaction–diffusion equations, similar to equations (3.1) and (3.2). Such an extended system of equations would be similar to equations (3.1) and (3.2), but the dimensionality of the parameter space would increase. With four colours we would have the possibility of working with four distinct cell diffusivities and four distinct rate parameters, thereby doubling the size of the parameter space, hence we leave this for future consideration.
Supplementary Material
Supplementary Material
Data accessibility
Experimental data are available in the electronic supplementary material.
Authors' contributions
All authors designed the investigation. M.J.S. and O.J.M. conducted the analysis, S.T.V. performed the experiments, M.J.S. and O.J.M. wrote the manuscript and all authors approved the final version of the manuscript.
Competing interests
We declare we have no competing interest.
Funding
M.J.S. is supported by the Australian Research Council (DP170100474) and the University of Canterbury Erskine Fellowship. O.J.M. is supported by the University of Auckland, Faculty of Engineering James and Hazel D. Lord Emerging Faculty Fellowship. R.E.B. is supported by a Royal Society Wolfson Research Merit Award, a Leverhulme Trust Research Fellowship and BBSRC(BB/R000816/1). We thank three referees for helpful feedback.
References
- 1.Simpson MJ, Treloar KK, Binder BJ, Haridas P, Manton KJ, Leavesley DI, McElwain DLS, Baker RE. 2013. Quantifying the roles of cell motility and cell proliferation in a circular barrier assay. J. R. Soc. Interface 10, 20130007 ( 10.1098/rsif.2013.0007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Maini PK, McElwain DLS, Leavesley DI. 2004. Traveling wave model to interpret a wound-healing cell migration assay for human peritoneal mesothelial cells. Tissue Eng. 10, 475–482. ( 10.1089/107632704323061834) [DOI] [PubMed] [Google Scholar]
- 3.Grada A, Otero-Vinas M, Prieto-Castrillo F, Obagi Z, Falanga V. 2016. Research techniques made simple: analysis of collective cell migration using the wound healing assay. J. Invest. Dermatol. 137, e11–e16. ( 10.1016/j.jid.2016.11.020) [DOI] [PubMed] [Google Scholar]
- 4.Haass NK, Beaumont KA, Hill DS, Anfosso A, Mrass P, Munoz MA, Kinjyo I, Weninger W. 2014. Realtime cell cycle imaging during melanoma growth, invasion, and drug response. Pigment Cell Melanoma Invasion 27, 764–776. ( 10.1111/pcmr.12274) [DOI] [PubMed] [Google Scholar]
- 5.Murray JD. 2002. Mathematical biology I: an introduction. Heidelberg, Germany: Springer. [Google Scholar]
- 6.Sherratt JA, Murray JD. 1990. Models of epidermal wound healing. Proc. R. Soc. Lond. B 241, 29–36. ( 10.1098/rspb.1990.0061) [DOI] [PubMed] [Google Scholar]
- 7.Sengers BG, Please CP, Oreffo ROC. 2007. Experimental characterization and computational modelling of two-dimensional cell spreading for skeletal regeneration. J. R. Soc. Interface 4, 1107–1117. ( 10.1098/rsif.2007.0233) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nardini JT, Chapnick DA, Liu X, Bortz DM. 2016. Modeling keratinocyte wound healing dynamics: cell–cell adhesion promotes sustained collective migration. J. Theor. Biol. 400, 103–117. ( 10.1016/j.jtbi.2016.04.015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Warne DJ, Baker RE, Simpson MJ. 2019. Using experimental data and information criteria to guide model selection for reaction-diffusion problems in mathematical biology. Bull. Math. Biol. 81, 1760–1804. ( 10.1007/s11538-019-00589-x) [DOI] [PubMed] [Google Scholar]
- 10.Johnston ST, Ross RV, Binder BJ, McElwain DLS, Haridas P, Simpson MJ. 2016. Quantifying the effect of experimental design choices for in vitro scratch assays. J. Theor. Biol. 400, 19–31. ( 10.1016/j.jtbi.2016.04.012) [DOI] [PubMed] [Google Scholar]
- 11.Audoly S, Bellu G, D’Angiò L, Saccomani MP, Cobelli C. 2001. Global identifiability of nonlinear models of biological systems. IEEE Trans. Biomed. Eng. 48, 55–65. ( 10.1109/10.900248) [DOI] [PubMed] [Google Scholar]
- 12.Chis O-T, Banga JR, Balsa-Canto E. 2011. Structural identifiability of systems biology models: a critical comparison of methods. PLoS ONE 6, e27755 ( 10.1371/journal.pone.0027755) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cobelli C, Distefano JJ. 1980. Parameter and structural identifiability concepts and ambiguities: a critical review and anaysis. Am. J. Physiol. Regul. Integr. Comp. Physiol. 8, R7–R24. ( 10.1152/ajpregu.1980.239.1.R7) [DOI] [PubMed] [Google Scholar]
- 14.Hines KE, Middendorf TR, Aldrich RW. 2014. Determination of parameter identifiability in nonlinear biophysical models: a Bayesian approach. J. Gen. Physiol. 143, 401 ( 10.1085/jgp.201311116) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Siekmann I, Sneyd J, Crampin EJ. 2012. MCMC can detect nonidentifiable models. Biophys. J. 103, 2275–2286. ( 10.1016/j.bpj.2012.10.024) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, Timmer J. 2009. Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics 25, 1923–1929. ( 10.1093/bioinformatics/btp358) [DOI] [PubMed] [Google Scholar]
- 17.Fröhlich F, Theis FJ, Hasenauer J. 2014. Uncertainty analysis for non-identifiable dynamical systems: profile likelihoods, bootstrapping and more. In Computational Methods in Systems Biology: Proc. 12th Int. Conf., CMSB 2014, Manchester, UK, 17–19 November 2014, pp. 61–72. Berlin, Germany: Springer.
- 18.Villaverde AF, Tsiantis N, Banga JR. 2019. Full observability and estimation of unknown inputs, states and parameters of nonlinear biological models. J. R. Soc. Interface 16, 20190043 ( 10.1098/rsif.2019.0043) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Janzén DLI, Bergenholm L, Jirstrand M, Parkinson J, Yates J, Evans ND, Chappell MJ. 2016. Parameter identifiability of fundamental pharmacodynamic models. Front. Physiol. v7, 590 ( 10.3389/fphys.2016.00590) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Raue A, Karlsson J, Saccomani MP, Jirstrand M, Timmer J. 2014. Comparison of approaches for parameter identifiability analysis of biological systems. Bioinformatics v30, 1440–1448. ( 10.1093/bioinformatics/btu006) [DOI] [PubMed] [Google Scholar]
- 21.Campbell DA, Chkrebtii O. 2013. Maximum profile likelihood estimation of differential equation parameters through model based smoothing state estimate. Math. Biosci. 246, 283–292. ( 10.1016/j.mbs.2013.03.011) [DOI] [PubMed] [Google Scholar]
- 22.Greenland S. 2011. Identifiability. In International encyclopedia of statistical science (ed. Lovric M.), p. 645 Berlin, Germany: Springer-Verlag. [Google Scholar]
- 23.Bickel PJ, Doksum KA. 2015. Mathematical statistics: basic ideas and selected topics, vol. 1 Boca Raton, FL: CRC Press. [Google Scholar]
- 24.Evans SN, Stark PB. 2002. Inverse problems as statistics. Inverse Prob. 18, R55 ( 10.1088/0266-5611/18/4/201) [DOI] [Google Scholar]
- 25.Maclaren OJ, Nicholson R. 2019. What can be estimated? Identifiabiliy, estimability, casual inference and ill-posed inverse problems. (http://arxiv.org/abs/1904.02826).
- 26.Bellu G, Saccomani MP, Audoly S, D'Angio L. 2007. DAISY: a new software tool to test global identifiability of biological and physiological systems. Comput. Methods Programs Biomed. 88, 52–61. ( 10.1016/j.cmpb.2007.07.002) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Eisenberg MC, Robertson SL, Tien JH. 2013. Identifiability and estimation of multiple transmission pathways in cholera and waterborne disease. J. Theor. Biol. 234, 84–102. ( 10.1016/j.jtbi.2012.12.021) [DOI] [PubMed] [Google Scholar]
- 28.Akaike H. 1974. A new look at the statistical model identification. IEEE Trans. Automat. Control 19, 716–723. ( 10.1109/TAC.1974.1100705) [DOI] [Google Scholar]
- 29.Shmueli G. 2010. To explain or to predict? Stat. Sci. 25, 289–310. ( 10.1214/10-STS330) [DOI] [Google Scholar]
- 30.Cox DR. 2006. Principles of statistical inference. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 31.Bland JM. 2009. The tyranny of power: is there a better way to calculate sample size?. Br. Med. J. 339, b3985 ( 10.1136/bmj.b3985) [DOI] [PubMed] [Google Scholar]
- 32.Rothman KJ, Greenland S. 2018. Planning study size based on precision rather than power. Epidemiology 29, 599–603. ( 10.1097/EDE.0000000000000876) [DOI] [PubMed] [Google Scholar]
- 33.Warne DJ, Baker RE, Simpson MJ. 2019. Simulation and inference algorithms for biochemical stochastic reaction networks: from basic concepts to state-of-the-art. J. R. Soc. Interface 16, 20180943 ( 10.1098/rsif.2018.0943) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Toni T, Welch D, Strelkowa N, Ipsen A, Stumpf MPH. 2009. Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems. J. R. Soc. Interface 6, 187–202. ( 10.1098/rsif.2008.0172) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Vo BN, Drovandi CC, Pettitt AN, Simpson MJ. 2015. Quantifying uncertainty in parameter estimates for stochastic models of collective cell spreading using approximate Bayesian computation. Math. Biosci. 263, 133–142. ( 10.1016/j.mbs.2015.02.010) [DOI] [PubMed] [Google Scholar]
- 36.Browning AP, Haridas P, Simpson MJ. 2019. A Bayesian sequential learning framework to parameterise continuum models of melanoma invasion into human skin. Bull. Math. Biol. 81, 676–698. ( 10.1007/s11538-018-0532-1) [DOI] [PubMed] [Google Scholar]
- 37.Maclaren OJ, Parker A, Pin C, Carding SR, Watson A, Fletcher AG, Byrne HM, Maini PK. 2017. A hierarchical Bayesian model for understanding the spatiotemporal dynamics of the intestinal epithelium. PLoS Comput. Biol. 13, e1005688 ( 10.1371/journal.pcbi.1005688) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lambert B, MacLean AL, Fletcher AG, Combes AN, Little MH, Byrne HM. 2018. Bayesian inference of agent-based models: a tool for studying kidney branching morphogenesis. J. Math. Biol. 76, 1673–1697. ( 10.1007/s00285-018-1208-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wilkinson RD. 2013. Approximate Bayesian computation (ABC) gives exact results under the assumption of model error. Stat. Appl. Genet. Mol. Biol. 12, 129–141. ( 10.1515/sagmb-2013-0010) [DOI] [PubMed] [Google Scholar]
- 40.Pawitan Y. 2001. In all likelihood: statistical modelling and inference using likelihood. Oxford, UK: Oxford University Press. [Google Scholar]
- 41.Vittadello ST, McCue SW, Gunasingh G, Haass NK, Simpson MJ. 2018. Mathematical models for cell migration with real-time cell cycle dynamics. Biophys. J. 114, 1241–1253. ( 10.1016/j.bpj.2017.12.041) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Raue A, Kreutz C, Theis FJ, Timmer J. 2013. Joining forces of Bayesian and frequentist methodology: a study for inference in the presence of non-identifiability. Phil. Trans. R. Soc. A 371, 20110544 ( 10.1098/rsta.2011.0544) [DOI] [PubMed] [Google Scholar]
- 43.Treloar KK, Simpson MJ, Haridas P, Manton KJ, Leavesley DI, McElwain DLS, Baker RE. 2013. Multiple types of data are required to identify the mechanisms influencing the spatial expansion of melanoma cell colonies. BMC Syst. Biol. 7, 137 ( 10.1186/1752-0509-7-137) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Mathworks. 2020. lsqnonlin. Solve nonlinear least-squares. https://au.mathworks.com/help/optim/ug/lsqnonlin.html (accessed February 2020).
- 45.Edwards AW. 1992. Likelihood, expanded edn Baltimore, MD: Johns Hopkins University Press. [Google Scholar]
- 46.Royall R. 2017. Statistical evidence: a likelihood paradigm. London, UK: Routledge. [Google Scholar]
- 47.Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. 2014. Bayesian data analysis, 3rd edn London, UK: Chapman & Hall. [Google Scholar]
- 48.Fattahi F. et al. 2016. Deriving human ENS lineages for cell therapy and drug discovery in Hirschsprung disease. Nature 531, 105–109. ( 10.1038/nature16951) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wang X. et al. 2019. UDP-glucose accelerates SNAI1 mRNA decay and impairs lung cancer metastasis. Nature 571, 127–131. ( 10.1038/s41586-019-1340-y) [DOI] [PubMed] [Google Scholar]
- 50.Armstrong NJ, Painter KJ, Sherratt JA. 2006. A continuum approach to modelling cell-cell adhesion. J. Theor. Biol. 243, 98–113. ( 10.1016/j.jtbi.2006.05.030) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Keller EF, Segel LA. 1971. Model for chemotaxis. J. Theor. Biol. 30, 225–234. ( 10.1016/0022-5193(71)90050-6) [DOI] [PubMed] [Google Scholar]
- 52.Simpson MJ, Landman KA, Newgreen DF. 2006. Chemotactic and diffusive migration on a nonuniformly growing domain: numerical algorithm development and applications. J. Comput. Appl. Math. 192, 282–300. ( 10.1016/j.cam.2005.05.003) [DOI] [Google Scholar]
- 53.Jin W, Lo K-Y, Chou S-E, McCue SW, Simpson MJ. 2018. The role of initial geometry in experimental models of wound closing. Chem. Eng. Sci. 179, 221–226. ( 10.1016/j.ces.2018.01.004) [DOI] [Google Scholar]
- 54.Simpson MJ, Landman KA, Hughes BD. 2009. Multi-species simple exclusion processes. Physica A 388, 399–406. ( 10.1016/j.physa.2008.10.038) [DOI] [Google Scholar]
- 55.Bajar BT. et al. 2016. Fluorescent indicators for simultaneous reporting of all four cell cycle phases. Nat. Methods 13, 993–996. ( 10.1038/nmeth.4045) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Liccardi G, Fava L. 2017. Fiat lux: illuminating the cell cycle. Cell Death Discovery 3, 17042 ( 10.1038/cddiscovery.2017.42) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Experimental data are available in the electronic supplementary material.




