Skip to main content
Journal of the Royal Society Interface logoLink to Journal of the Royal Society Interface
. 2019 Jul 10;16(156):20190293. doi: 10.1098/rsif.2019.0293

Model selection and parameter estimation for root architecture models using likelihood-free inference

Clare Ziegler 1,2, Rosemary J Dyson 2,3, Iain G Johnston 1,2,4,5,
PMCID: PMC6685013  PMID: 31288651

Abstract

Plant root systems play vital roles in the biosphere, environment and agriculture, but the quantitative principles governing their growth and architecture remain poorly understood. The ‘forward problem’ of what root forms can arise from given models and parameters has been well studied through modelling and simulation, but comparatively little attention has been given to the ‘inverse problem’: what models and parameters are responsible for producing an experimentally observed root system? Here, we propose the use of approximate Bayesian computation (ABC) to infer mechanistic parameters governing root growth and architecture, allowing us to learn and quantify uncertainty in parameters and model structures using observed root architectures. We demonstrate the use of this platform on synthetic and experimental root data and show how it may be used to identify growth mechanisms and characterize growth parameters in different mutants. Our highly adaptable framework can be used to gain mechanistic insight into the generation of observed root system architectures.

Keywords: likelihood-free inference, root system architecture, root growth, approximate Bayesian computation

1. Introduction

Root systems are essential to plants’ structure and uptake of water and nutrients and constitute more than 5% by mass of the total global carbon budget [1]. They stabilize plants [2], stabilize soils [3], foster beneficial microbes [4] and are the entry point for water and nutrients to the plant [5]. The shape of a plant’s root system is generated by a variety of physiological and signalling pathways within the plant, and understanding the generation of this system opens paths to its optimization to maximize crop yield [6].

Despite this importance, the mechanisms underlying root growth remain challenging to quantitatively understand [79]. The complexity of root systems and their below-ground nature poses observational challenges. Experimental techniques aiming to elucidate root architecture have historically included sketches of root systems and the use of hydroponics, then images of cleaned root systems. More recent advances have facilitated the imaging of plants in situ through the use of X-ray µ-computed tomography [10], magnetic resonance imaging scanning [11,12] and transparent soil [13], which have been used to investigate root soil exploration and uptake of water and nutrients.

In parallel with this experimental elucidation, in an effort to understand how root systems grow, many physical and mathematical models of root growth and structure have been produced. These models attempt to solve the ‘forward problem’: given knowledge of the parameters governing growth processes in plants, they produce the details and dynamics of a likely simulated root system. For example, the Lockhart equation described the elongation of a cell under turgor pressure [14] and has been widely adapted to describe the growth of many plant organs, including roots [15,16]. Hackett & Rose produced the first root system model in the 1970s [17] based on the growth and branching of barley roots, while Lungley [18] produced a computational model which generated root systems represented using ASCII characters. Fitter [19] introduced a topological model of root architecture where a root system was considered as a set of links. This idea was extended in the three-dimensional modelling of Pagès et al. [20] and Diggle, whose ROOTMAP model could be applied to a variety of plant species [21], while Tatsumi et al. represented variation in root systems using fractal analysis [22,23]. Lynch et al. modelled a root system as a network with nodes as branches and inter-branch distances as edges. The model also included root radius and volume changes along the growing root [24]. Advances in computation have led to a further plethora of root system architecture models, which produce a three-dimensional reproduction of a root system using a detailed parameter set [7]. Previous modelling approaches were combined in the production of Root Typ in 2004 [25]. This has allowed the underlying model structure to be adapted for use by other researchers [26,27]. Another key root architecture model is RootBox [28], which is designed to be combined with soil and water uptake models along with allowing for the simulation of roots grown in containers of user-defined shape and dimensions. This model has been recently updated to produce CRootBox, and there are plans to eventually extend the modelling approach to also consider above-ground plant growth [29]. Many of these effective models are able to reproduce root systems of many different plant species, which necessitates the incorporation of root data collected in situ [8].

While a great number of advances have been made in simulating root systems from a set of parameters, relatively little work has been done on the inverse problem: extracting the growth parameters from an observed root system. Model parametrization is often limited by difficulties in root observation [27,30]; however, this step is crucial to gain biological insight from these root models. It is vital that we can validate root models and the predictions they make, and quantify uncertainty in their mechanisms and parameters. A manual approach to the inverse problem, feeding specific measured parameters into generative models for root systems and assessing their ability to reproduce observations, has been used to gain biological insight and validate generative models [31]. However, without an automated approach, it remains challenging to explore the full ranges of parameters and mechanisms that could give rise to observed structures and the likelihoods of each. Advancing technologies are allowing observation of root systems in increasing detail, making it even more important to bridge the gap between theory and observation.

A major challenge in solving the inverse problem with traditional statistical methods is finding a likelihood function for an observed root system. Modern statistical approaches allow this challenge to be circumvented through the use of stochastic simulation and approximate Bayesian computation (ABC) techniques [32], which produce a computational approximation replacing the likelihood and remove the need for its explicit calculation. Another strength of these techniques lies in their natural capacity for model selection and the inclusion of prior knowledge about the system in an inference setting. Here, we report a novel pipeline by which ABC, embedded in a sequential Monte Carlo (SMC) framework [33], can be used to learn the values of, and uncertainty in, generative, mechanistic parameters underlying root growth and architecture, and to compare different root architecture models. Arabidopsis thaliana (thale cress) is used in both computational and experimental investigation throughout as a model plant, but this process can readily be extended to any root system, as we also demonstrate with Lupinus angustifolius (narrowleaf lupin). We demonstrate how this framework can be used to identify generative parameters according to a given model, distinguish phenotypic differences, and evaluate the comparative effectiveness of different models for root elongation and root branching processes, providing insight into the underlying mechanisms.

2. Results

2.1. An ABC SMC framework for inferring mechanistic parameters from root systems

For generality, we begin by considering a highly simplified model for root growth (see Material and methods). Starting from an infinitesimal initial condition, a primary root elongates according to a growth law. Branches from this primary root occur stochastically according to a branching law. Branches elongate according to the primary root growth law multiplicatively scaled (allowing, for example, branches to grow at a slower rate than the primary root). Branching is for now restricted to first-order branches from the primary root, though nothing in our framework is dependent on this or any other structural choice.

This coarse-grained model was chosen to reflect the core behaviour shared at the intersection of several contemporary root models [25,28,29]. Its computational simplicity is an advantage but not a necessity for our inference framework; we later consider an alternative generative model to demonstrate the transferability of our approach. The details of the model are described in Material and methods, but in this section we consider constructing the inference framework for a general mechanistic model, the parameters of which we denote θ.

The platform proceeds by simulating outputs from this model with different trial parametrizations, using a distance function to compare these outputs to summary statistics of experimental observations, and iterating this process within a Bayesian framework to build up posterior distributions on model structures and parameters given the observed data.

To compare simulation to experiment, we focus on a mechanistically informative set of summary statistics. For a given observation of root structure data d, these are the number of branches B, length of the primary root L and average length of the lateral roots l^. Within our scheme, these lengths are for convenience measured in centimetres; different scalings of these features can be used to emphasize different aspects of root architecture in the simulation-data comparison. The distance function we use to compare two structures d1, d2 is based on the Euclidean distance

ρ(d1,d2)=13((B1B2)2+(L1L2)2+(l^1l^2)2). 2.1

A dataset D may consist of a set of structures dij, where i labels individual plants and j labels longitudinal observations. In this case, for each observed plant i, a model plant is simulated and its structure recorded at each of the times corresponding to the longitudinal observations. We will call these recorded structures dij and are interested in the comparison between each recorded structure and its observed counterpart

ρ(D,D)=ij|iρ(dij,dij). 2.2

We deliberately choose this model and summary statistics to focus on the topological aspects of root architecture and ignore any specific physical embeddings (for example, branching angles). This focus on topological degrees of freedom increases the generality of the approach, but features like branching angles and higher-order topological statistics can readily be included in the modelling and distance calculation if they reflect important degrees of freedom for the scientific question under consideration.

Equation (2.1) balances the ability to capture the fine detail of the root system against the computational time required to obtain a reasonable number of samples from the posterior. Including more detail and/or degrees of freedom in the distance function will allow more detailed matching of observations but will increase the sampling effort required to find regions of parameter space that match these criteria.

ABC involves accepting a trial set of parameters as a sample from the posterior distribution when ρ(D,D)<ϵ, or, in other words, when the summary statistics of the structure emerging from the simulation are ‘close’ to those arising from the experimental data. The posterior distribution on parameters θ built up from a set of samples taken in this way is P(θ|D;ρ<ϵ), which forms an increasingly good approximation to the true posterior P(θ|D) as ε is decreased [34].

For parametric inference within a fixed model, a simple rejection-sampling pipeline is then given by algorithm 1 (Material and methods). This approach would be sufficient to identify generative parameters from data, but rejection sampling is an inefficient paradigm, as any ‘good’ regions of parameter space are immediately forgotten when the next draw from the prior is made. To facilitate more efficient parametric inference as well as model selection, we use ABC embedded in a SMC framework as in Toni et al. [33]. ABC SMC first enforces only a relaxed fit to the data then sequentially uses the inferred parameter distributions as effective priors while enforcing a tighter fit to data. This sequential process is parametrized by a sequence of ε values describing the fit threshold required at each step in the sequence. Model selection can proceed by including a ‘model index’ parameter describing which model structure is to be used, applying a prior to this parameter (thus incorporating prior knowledge about which model structures are more likely), then treating this index as a parameter to be inferred through SMC. Following Toni et al. [33], the coupled inference and model selection pipeline is then given by algorithm 2 (Material and methods).

2.2. Inferring parameters from a simulated root system

We first sought to test the applicability of our likelihood-free inference process on synthetic root data, to confirm its ability to identify known generative parameters. To this end, the CRootBox root simulation model [29] was used to produce an example of an Arabidopsis thaliana root system. The governing parameters were mean growth rates of 0.49 cm day−1 for the primary root, 0.08 cm day−1 for the lateral roots, and an inter-lateral distance of 0.2 cm, although the inter-lateral distance is not an explicit parameter in our model (see next section). CRootBox adds an element of stochasticity to its generative parameters; in the default Arabidopsis case, this corresponded to a coefficient of variation of 0.1 in the growth rates and 0.45 in the lateral spacing parameters. The simulation was run over 15 simulated days, yielding the structure in figure 1a.

Figure 1.

Figure 1.

Validating root-inference platform with synthetic Arabidopsis data. (a) Output from CRootBox simulation of Arabidopsis root growth; black scale bar is 1 cm. (bd) Output posteriors from an ABC SMC framework run on CRootBox output, with final ABC SMC tolerance ε = 0.5 (Material and methods). (b) Posterior distribution on branching rate b in the growth model. (c) Two-dimensional posterior on primary root growth rate g and lateral root growth scaling α. (d) Posterior distribution on lmax, the maximum length parameter in the negative-exponential growth model used.

To mirror the pipeline that will be used for experimental data, we analysed this simulation output with SmartRoot image analysis software [35], obtaining the statistics of tap and lateral root length and placement. We then applied our ABC SMC framework to estimate posterior distributions on the mechanistic parameters of our simple growth model (Material and methods). These parameters are g (primary root growth rate), lmax (primary root scaling constant), b (branching rate) and α (lateral root growth scaling).

As shown in figure 1bd, the growth rates and, notably, their variability are well captured in the resultant posteriors, with g inferred to lie around 0.55 ± 0.10 cm day−1, compatible with the true growth rate parametrizing the synthetic data. The branching rate parameter is more broadly spread, with a mean of 0.6 day−1 corresponding to the observed number of branches, and flexibility in the posterior reflecting the stochastic nature of this parameter’s influence. α was inferred to lie around 0.021 ± 0.02, corresponding to a lateral growth rate around 00.02cm day1; this is rather lower than the value used in the simulation, reflecting the rather limited lateral growth occurring in the specific simulated instance of the model. The posterior for lmax is close to recovering the prior, which suggests that the model output is minimally dependent on the value of this parameter. We found this limited lmax dependence to generally be the case, and in subsequent sections will omit lmax from the posterior plots; all lmax posteriors, generally recovering priors, are plotted in the electronic supplementary material, figure S1. This assessment of the relative importance of, and flexibility in, generative mechanistic parameters reflects a powerful aspect of this inverse modelling approach.

2.3. Inferring mechanistic parameters for other synthetic phenotypes and root simulation models

To test the wider applicability of our likelihood-free inference process, we next tested the ability to identify known generative parameters when using a different, existing root simulation model, and for different plant species. RootBox [28] was chosen for its wide application in the field. We embedded RootBox as the generative model in our inference framework, which was then applied as in §2.2 to the previous synthetic Arabidopsis thaliana data and a simulated Lupinus angustifolius root system. The Lupinus simulation involved an initial growth rate of 1 cm day−1 for the primary root, 0.2 cm day−1 for the laterals and an inter-root distance of 0.9 cm, and proceeded for 15 simulation days.

RootBox employs a different branching protocol from our simple model above. Rather than allowing stochastic branching anywhere on the primary root, RootBox allows lateral branches to emerge at specified intervals along the primary structure. This interval d, and a value bmax governing the maximum number of allowed lateral branches, are parameters of the model and we therefore seek posterior distributions on these quantities as well as the other mechanistic parameters which directly map to those in our simple model.

Figure 2 shows the resultant posteriors after applying our inference approach using RootBox as the core mechanistic model. Once more, the original generative parameters are well supported by the resulting posteriors, which also agree with the inferred values for primary and lateral growth rates using our simplified core model above (figure 1). The inter-lateral distances d, present in RootBox but not above, are also well recovered by the inference process. The maximum number of branches bmax is not tightly constrained by the synthetic data (electronic supplementary material, figure S2).

Figure 2.

Figure 2.

Posterior distributions on Arabidopsis thaliana and Lupinus angustifolius roots generated using RootBox. (a) Output from RootBox simulation of Lupinus angustifolius root growth; black scale bar is 1 cm. (b,c) Output posteriors from an ABC SMC framework run on RootBox output of Lupinus angustifolius, with final ABC SMC tolerance ε = 0.4 (Material and methods). (b) Two-dimensional posterior on primary root growth rate g and lateral root growth scaling α. (c) Posterior distribution on branch separation d in the RootBox growth model. (d) Output from RootBox simulation of Arabidopsis thaliana root growth; black scale bar is 1 cm. (e,f) Output posteriors from an ABC SMC framework run on RootBox output of Arabidopsis thaliana, with final ABC SMC tolerance ε = 0.4 (Material and methods). (e) Two-dimensional posterior on primary root growth rate g and lateral root growth scaling α. (f) Posterior distribution on branch separation d in the RootBox growth model. (Online version in colour.)

2.4. Inferring mechanistic parameters for wild-type Arabidopsis thaliana root systems

To test the pipeline on experimental data, we grew Arabidopsis Col-0 plants on vertical (1/2) MS agar plates (Material and methods) and used a digital camera to capture their root system structure over several days. We used SmartRoot image analysis software [35] to extract the lengths and placements of tap and lateral roots from these digital images at each sampled timepoint. An example of the digitized data is shown in figure 3.

Figure 3.

Figure 3.

Example data and simulation output for the root-inference framework. (a) Arabidopsis seedlings grown vertically on agar (Material and methods) provide example root systems for the analysis pipeline. (b) Digitization of the root systems using SmartRoot [35] provides the quantitative data used in the inference process. (c) Example outputs from the stochastic growth model with parameters identified through the ABC SMC inference process. Black scale bar is 1 cm. (Online version in colour.)

We applied our ABC SMC framework to estimate the posterior distributions of the mechanistic parameters underlying the development of these root systems. The earlier, high-ε populations of the SMC process gave a diverse range of simulated root structures; by the final population, the simulation outputs provide excellent visual matches to the observed experimental structures (figure 3) given the deliberate simplicity of the model. This intuitive snapshot matching is supported by the good agreement between the experimentally observed time series of summary statistics and those arising from simulation with the final posteriors (figure 4). Here, both the mean and the variability in the experimental statistics over time are captured by the distributions of simulated behaviour arising from the posteriors.

Figure 4.

Figure 4.

Summary statistic comparison between data and parametrized model. Individual line traces in each plot show time series of the summary statistic from observed Arabidopsis seedlings in figure 3; boxplots give the range of values arising from stochastic model simulation after parameter values have been learned. (a) Number of branch points; (b) primary root length; (c) average lateral root length. (Online version in colour.)

The posterior distributions themselves are shown in figure 5. The primary root growth rate g is reasonably well constrained, with a mean that intuitively falls around the total growth average. Notably, the posterior distribution on g is tighter than for the synthetic data example. This refinement reflects the strength of including time-course data in the inference platform. Observations of systems at different times provide more information on dynamic rate parameters, allowing better estimates than are available from single-instance observations alone.

Figure 5.

Figure 5.

Posterior distributions on mechanistic parameters for Arabidopsis seedling roots. Posteriors from our ABC SMC framework run on the data from figure 3 with final tolerance ε = 2.5. (a) Posterior distribution on branching rate b. (b) Two-dimensional posterior on primary root growth rate g and lateral root growth scaling α. (Online version in colour.)

The scaling of lateral growth rate α has a broader variance, reflecting the greater variability in average lateral root length observed in the data, and is correlated to some extent, as expected, with the value of g. The distribution of branching rate b is also broad, reflecting a greater variability in the experimental observation of branch number over time, and also the stochastic nature of this process: as b reflects the mean rate of a Poisson process, the same branching structure can be achieved with a variety of different b values. The modal value of b matches the average branching rate observed in the data. Overall, therefore, the ABC SMC framework gives reliable and intuitive readouts linked to both the average observed behaviour and plant-to-plant variability in the root structure.

2.5. Model selection for root growth and branching mechanisms

We next asked whether our approach could select between competing generative models, given time course data on the evolution of a root system. To this end, we considered a range of possible generative mechanisms for root growth and branching. We will employ uniform priors over competing models, reflecting the fact that, before any observations are made, we have no belief that one mechanism is more likely than another. This prior belief can of course be arbitrarily changed within our Bayesian framework to reflect prior information. We then use our ABC SMC framework to identify the posterior support for each mechanistic model, given the observed data [33] (Material and methods).

First, we consider different elongation laws for root growth. The first model involves root growth at a constant rate; the second involves a negative-exponential growth law supported by [29] of the form

l(t)=lmax(1egt/lmax), 2.3

where l(t) is the length of the root at time t, parametrized by a rate constant g and scaling constant lmax. The posterior distribution over model index through the SMC process is shown in figure 6a,b. The most permissive population (highest ε) shows less support for the exponential model, favouring model parsimony. As a better fit to the data is required, the support for the exponential model increases until it overcomes the lower weightings due to the additional parameter and is preferentially selected.

Figure 6.

Figure 6.

ABC SMC allows selection of competing mechanistic models for root growth. (a) Model selection posteriors from comparing a simple, constant-rate growth model to the negative-exponential model used in CRootBox [29] for decreasing ABC tolerance ε. The negative-exponential model acquires greater support as the tolerance decreases. (b) Model selection posteriors from comparing a uniform branching model to a model imposing a minimum separation from existing branches. The more parsimonious uniform model experiences higher support with relaxed tolerance but the minimum-separation model gains support with tighter tolerances (tolerances are higher for this model selection run, reflecting a tradeoff with the greater computational resource required). (c) Posterior for the minimum branch separation, δ, required when the minimum spacing model is implemented. (Online version in colour.)

Next, we explore a more nuanced mechanistic question underlying root architecture. We compared two models for branch placement positions. First, a uniform branching model, where the branching location was chosen at random anywhere along the primary root. Second, a minimally spaced model, which imposed a distance parameter δ around each existing branch where no further branching could occur. If a branch was attempted within this distance no branch was implemented and the algorithm continues. Figure 6b,c shows the model selection posteriors with decreasing tolerance, and the posterior on δ when the minimally spaced model was implemented. Here, the posteriors for the spacing model are lower for the more permissive populations, reflecting the increased model complexity—the extra parameter δ makes the model less parsimonious. The support for the model increases as a better fit to data is required in the subsequent populations. By the final tolerance, the models have comparable support. Hence, the dataset suggests roughly equal support for both models despite their difference in complexity.

These simple experiments serve to illustrate the ability of ABC SMC to provide statistical support for competing mechanistic hypotheses (for example, linear versus negative-exponential root elongation laws). There is, however, nothing to prevent other targeted mechanistic questions being addressed using this framework (see Discussion).

2.6. Comparison between root structures

Next, we asked whether the ABC SMC framework could distinguish between two phenotypes—those corresponding to wild-type Arabidopsis and the friendly mutant line. FRIENDLY is a mitochondrial fusion gene that when compromised has a range of bioenergetic effects which lead to reduced root growth [36].

Wild-type and friendly plants were grown under the same conditions as above (Material and methods), and the inference pipeline was run as before, with exponential growth and uniform branching. The output posteriors in figure 7 reflect the differing root systems shown in the tracings, with a clear separation in the parameter space between the two phenotypes. The branching rate b is fairly unconstrained as observed in §2.4 due to inherent stochasticity in the branching mechanism. The values of g vary significantly between wild-type and friendly, as reflected in the tracings, with little change in the value of α. The distribution of g is substantially shifted towards lower values for the friendly plants, reflecting the known challenge to root growth resulting from this mutation. lmax shows a wide variability in both phenotypes, while representing clear differences consistent with the reduced root growth observed in the friendly mutant line. There is a very little constraint in the value of lmax, suggesting little reliance on the value, although smaller values appear to be favoured for the friendly phenotype.

Figure 7.

Figure 7.

Distinguishing phenotypes with mechanistic inference. (a) (left) Wild-type and (right) friendly Arabidopsis seedlings grown in agar as described in §4.1, demonstrating the root growth phenotype of friendly. The tracings were produced using SmartRoot software [35] and the colours adjusted; black scale bar is 1 cm. (b,c) Output posteriors for branching rate b show similar distributions for wild-type and friendly. (d,e) Two-dimensional posteriors on primary root growth rate g and lateral root growth scaling α demonstrate clear separation, reflecting the reduced root growth observed in friendly. (Online version in colour.)

Taken together, these results demonstrate that the physical parameters governing root architecture growth can be learned using this ABC SMC approach, and uncertainty in these learned outcomes quantified. The mechanistic model within our inference process both allows us to harness time-course data and dissects which parameters change (here, growth rate g) and which remain similar (here, lateral root scaling α) in different cases. This is a key strength of this method and would be difficult to obtain through traditional parameter inference methods. The platform readily identifies the physical different mechanisms underlying root architecture in a mutant line and identifies accepted physical model structures for root growth.

3. Discussion

We have presented a framework for the inference of parameter values and mechanisms in root growth models when applied to an observed root system. While there has been much work undertaken producing plant root simulations from given parameters, our approach addressed the much less-studied ‘inverse problem’: that of finding generative parameter values and mechanisms that can reproduce a given root system. Knowledge about these mechanisms and parameters, and their ranges, flexibility and relative importance, is necessary for an understanding of root growth processes such as growth and branching decisions, and how these may relate to biological processes within the plant. We hope that this highly general approach will allow for a more mechanistic understanding of root growth and to quantify the efficacy of existing models.

We first used a very general growth model to (i) retain consistency with the ‘core’ of the maximum possible number of existing growth models and (ii) focus on parameters related to the growing plant and its phenotype, rather than the specifics of its physical embedding. We have demonstrated using RootBox [28] that our approach can readily be adapted to other specific existing root models to allow the quantification of values of and uncertainty in generative parameters, furthering understanding of root system architecture. We also illustrated how alternative hypothesized mechanistic models can straightforwardly be compared, using SMC model selection. A strength of the Bayesian embedding here is that the most parsimonious model that is capable of explaining observations is naturally selected in the case of models with different numbers of parameters [33].

Advances in imaging techniques are allowing for greater insight into root system architecture [8] and specially designed image analysis software [37] allows for increasingly efficient data collection from images. The combination of root models, advanced imaging techniques, image analysis software and an SMC framework could allow further advances in our understanding of root growth. We anticipate that, with the increasing developments in root imaging, this technique will find application in a growing variety of datasets, allowing for the investigation of generative parameters for a wide variety of root system phenotypes. A natural future extension for this work would be to perform inference based directly on image data, rather than statistics of these data. This approach would require simulation of the imaging process as well as the generation of model root systems, for example, embedding the idealized root system in a simulated soil substrate and simulating the artefacts and noise involved in the imaging process. While (much) more computationally intensive, this approach would allow a more direct leveraging of phenotype data from experimental studies.

Notably, our approach allows inference based on time-course measurements of a developing root system, which increase the power and precision with which parameters and mechanisms can be identified. As demonstrated with our synthetic examples, this approach can readily be applied to single-instance observations but also naturally leverages dynamic information to refine posterior distributions on physical rate parameters.

A stochastic modelling framework for root growth allows for a wide variety of possible outputs to be considered in the inference process, reflecting the variation between root systems in the real world. In this way, the modelling approach allows for investigation into the underlying mechanisms which are widely applicable, while avoiding a reliance on specificities and overfitting to a particular phenotype or growth environment. Predicting a branching event would require the consideration of processes such as genetics, cellular interactions and organism-scale resource partitioning [38], necessitating the development of a multiscale framework. As such multiscale approaches develop, we anticipate the use of likelihood-free inference to be further embraced to resolve inverse problems in parameter identification.

While the generality of our approach is appropriate for the scope of this study, greater specificity is required to gain a true understanding of plant processes. Care needs to be taken in the application of ABC techniques: choices must be made over elements such as the tolerance, priors and summary statistics to achieve a balance between convergence rate and specificity of results. As specific choices for these values can be hard to interpret, simulation outputs must be verified to provide a reasonable match to genuine behaviour (as we have attempted throughout). We have worked with different models to explore the behaviour of our method under different generative assumptions. In Bayesian model selection, prior beliefs about models can strongly affect their support and interpretation must take this into account [33]. However, we have aimed to demonstrate the strength of this approach when carefully applied and interpreted.

Overall, we have demonstrated a technique to allow for greater insight into model parameters for root systems, which could aid in increasing understanding of root growth mechanisms. The generalized approach allowed for investigation of the key aspects underlying root topology while being highly adaptable for use with existing root architecture models.

4. Material and methods

4.1. Plant growth

Arabidopsis Col-0 and friendly seeds were sterilized with three 3 min washing steps in 50% domestic bleach and water rinses, then plated on (1/2) MS agar in vertical plates. Plants were grown at constant 25°C on a 16 h light/8 h dark cycle. Plates were photographed over a time course of 2, 5, 7 and 10 days to produce time-series images of the seedling growth. Summary statistics were extracted from the images using SmartRoot [35], an imageJ plugin. The root systems were traced manually using thresholding, producing a skeleton over the original image. Summary statistics on root length and branch placement were then recorded from this skeleton.

4.2. Model structure

Root growth was simulated using a hybrid stochastic–deterministic algorithm. Primary root growth, by default, was assumed to follow a negative-exponential growth law

l(t)=lmax(1egt/lmax), 4.1

parametrized by a rate constant g and a scaling constant lmax. The alternative uniform growth model simply took the form l(t) = gt. Lateral roots grow according to the same growth law as the primary root, but with a multiplicative factor α applied to g so that for lateral roots gl = αg.

Branching was treated as a Poisson event with rate parameter b. The time until the next branching event is found using the Gillespie algorithm [39], and the length of existing branches is updated from the current time until the time of the branching event. The branching location was then determined by a specified branching model, initially specified as a uniform probability distribution along the length of the primary root. In visualizing structures, branching angle was always set to an angle of 45° from the growth direction, with the equal change of being placed each side of the primary root, although these angles and positions play no role in the simulation. These steps are repeated until the time of the next branch exceeds the maximum simulation time, at which point the branch lengths are updated up to the maximum simulation time, and no branching event occurs. Once a branching event has occurred, the sidebranch grows according to the same growth law as the main root, scaled by parameter α; variability in lateral root length thus corresponds to variability in initial branching times and positions.

4.3. ABC SMC implementation

An ABC framework was implemented in Matlab. Model parameters were drawn from specified distributions and passed to the model as described in model structure above. Broadly, the simulated root systems are then compared to data, and the parameter values accepted if the simulation is sufficiently close to the data, with tolerance ε defined at the time of implementation. If the previous values were accepted, the parameter values are perturbed with a perturbation kernel Kt. If the previous values were not accepted, the parameter values were drawn from the priors as previously described. This process was repeated until 1000 hits were obtained at the specified tolerance.

We follow [33] in our ABC SMC implementation. For completeness, algorithm 2 introduces a simple rejection-sampling scheme under ABC. Algorithm 2 embeds this scheme in an SMC framework for parameter inference and model selection.

Algorithm 1.

ABC rejection sampling for parameter inference.

  • (1)

    Given Np plant structures and Nt(i) longitudinal observations for plant i, characterize the summary statistics dij={B,L,l^} from every plant i and observation j in the dataset.

  • (2)

    Draw a trial set of parameters θ from the prior distribution π(θ).

  • (3)

    Simulate Np instances of root growth, recording the state of structure i at each of the Nt(i) time points corresponding to an experimental observation.

  • (4)

    Compute ρ using equation (2.2) above, to give the separation between each recorded structure and its simulated counterpart.

  • (5)

    If ρ<ϵ, where ϵ is a given tolerance, accept θ as a sample from the posterior.

  • (6)

    If a termination condition is not met, return to 2.

Algorithm 2.

ABC SMC for parameter inference and model selection.

  • (1)

    Given Np plant structures and Nt(i) longitudinal observations for plant i, characterize the summary statistics dij={B,L,l^} from every plant i and observation j in the dataset.

  • (2)

    Initialize tolerance vector E containing T elements. Set population indicator t=0.

  • (3)

    Set particle indicator i=1.

  • (4)

    Sample model indicator m from prior π(m).

  • (5)

    If t=0, sample θ from π(θ(m)). If t>0, sample θ from the previous population {θ(m)t1} with weights w(m)t1, and set θKt(θ|θ).

  • (6)

    If π(θ)=0, go to 4.

  • (7)

    Simulate Np instances of root growth using θ, recording the state of structure i at each of the Nt(i) time points corresponding to an experimental observation.

  • (8)

    Compute ρ using equation (2.2) above.

  • (9)

    If ρE[t], go to 4.

  • (10)
    Set mt(i)=m and add θ to the population {θ(m)t}. If t=0, set weights wt(i)=0, otherwise
    wt(i)=π(θ)j=1Nwt1(j)Kt(θt1(j),θ).

    If i<N, set i=i+1, go to 4.

  • (11)

    For every m, normalize the weights. If t<T, set t=t+1, go to 3.

For a single model, the prior π(m) associated with that model is unity and the choice of model indicator m* plays no role in the inference process.

We used uniform priors over all model structures for π(m) and uniform priors between 0−1.4 day−1 for g, 0−1.4 day−1 for b, 0–1 for α and 5–40 cm for lmax. The perturbation kernel we used was KtN(0, 0.1P), where P is the width of the uniform prior. The tolerance vector was E = ε{5, 3, 2, 1.5, 1}.

Supplementary Material

Supplementary Figure 1
rsif20190293supp1.svg (267.6KB, svg)

Supplementary Material

Supplementary Figure 2
rsif20190293supp2.svg (47KB, svg)

Acknowledgements

All authors acknowledge support from the Birmingham Institute of Forest Research and thank Prof. David Logan for the friendly line. I.G.J. acknowledges support from a Birmingham Fellowship for the University of Birmingham and Turing Fellowship from the Alan Turing Institute.

Data accessibility

The data and code used are freely available in Github repository https://github.com/StochasticBiology/root-inference.

Authors' contributions

C.Z. developed the code, performed the plant growth experiments and drafted the manuscript. I.G.J. conceived the project. R.J.D. and I.G.J. supervised the project. All authors wrote the final manuscript.

Competing interests

The authors declare that no competing interests exist.

Funding

I.G.J. is funded by fellowships from the University of Birmingham and the Alan Turing Institute, R.J.D. is funded by EPSRC grant no. EP/M00015X/1, C.Z. is funded by the Birmingham Institute for Forest Research.

References

  • 1.Jackson R-B, Mooney HA, Schulze E-D. 1997. A global budget for fine root biomass, surface area, and nutrient contents. Proc. Natl Acad. Sci. USA 94, 7362–7366. ( 10.1073/pnas.94.14.7362) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yang M, Défossez P, Danjon F, Dupont S, Fourcaud T. 2017. Which root architectural elements contribute the best to anchorage of pinus species? insights from in silico experiments. Plant Soil 411, 275–291. ( 10.1007/s11104-016-2992-0) [DOI] [Google Scholar]
  • 3.Vannoppen W, Vanmaercke M, De Baets S, Poesen J. 2015. A review of the mechanical effects of plant roots on concentrated flow erosion rates. Earth Sci. Rev. 150, 666–678. ( 10.1016/j.earscirev.2015.08.011) [DOI] [Google Scholar]
  • 4.Reinhold-Hurek B, Bünger W, Burbano CS, Sabale M, Hurek T. 2015. Roots shaping their microbiome: global hotspots for microbial activity. Annu. Rev. Phytopathol. 53, 403–424. ( 10.1146/annurev-phyto-082712-102342) [DOI] [PubMed] [Google Scholar]
  • 5.Dotaniya ML, Meena VD. 2015. Rhizosphere effect on nutrient availability in soil and its uptake by plants: a review. Proc. Natl Acad. Sci., India, Sect. B 85, 1–12. ( 10.1007/s40011-013-0297-0) [DOI] [Google Scholar]
  • 6.Shahzad Z, Amtmann A. 2017. Food for thought: how nutrients regulate root system architecture. Curr. Opin Plant Biol. 39, 80–87. ( 10.1016/j.pbi.2017.06.008) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dunbabin VM. et al. 2013. Modelling root–soil interactions using three-dimensional models of root growth, architecture and function. Plant Soil 372, 93–124. ( 10.1007/s11104-013-1769-y) [DOI] [Google Scholar]
  • 8.Roose T, Keyes SD, Daly KR, Carminati A, Otten W, Vetterlein D, Peth S. 2016. Challenges in imaging and predictive modeling of rhizosphere processes. Plant Soil 407, 9–38. ( 10.1007/s11104-016-2872-7) [DOI] [Google Scholar]
  • 9.Sievänen R, Godin C, DeJong TM, Nikinmaa E. 2014. Functional–structural plant models: a growing paradigm for plant studies. Ann. Bot. 114, 599–603. ( 10.1093/aob/mcu175) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Mairhofer S, Zappala S, Tracy S, Sturrock C, Bennett MJ, Mooney SJ, Pridmore TP. 2013. Recovering complete plant root system architectures from soil via X-ray µ-computed tomography. Plant Methods 9, 1 ( 10.1186/1746-4811-9-8) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Metzner R, Eggert A, van Dusschoten D, Pflugfelder D, Gerth S, Schurr U, Uhlmann N, Jahnke S. 2015. Direct comparison of mri and X-ray CT technologies for 3D imaging of root systems in soil: potential and challenges for root trait quantification. Plant Methods 11, 17 ( 10.1186/s13007-015-0060-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.van Dusschoten D, Metzner R, Kochs J, Postma JA, Pflugfelder D, Bühler J, Schurr U, Jahnke S. 2016. Quantitative 3D analysis of plant roots growing in soil using magnetic resonance imaging. Plant Physiol. 170, 1176–1188. ( 10.1104/pp.15.01388) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Downie H, Holden N, Otten W, Spiers AJ, Valentine A, Dupuy LX. 2012. Transparent soil for imaging the rhizosphere. PLoS ONE 7, e44276 ( 10.1371/journal.pone.0044276) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lockhart JA. 1965. An analysis of irreversible plant cell elongation. J. Theor. Biol. 8, 264–275. ( 10.1016/0022-5193(65)90077-9) [DOI] [PubMed] [Google Scholar]
  • 15.Dyson RJ. et al. 2014. Mechanical modelling quantifies the functional importance of outer tissue layers during root elongation and bending. New Phytol. 202, 1212–1222. ( 10.1111/nph.12764) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Geitmann A, Ortega JKE. 2009. Mechanics and modeling of plant cell growth. Trends Plant Sci. 14, 467–478. ( 10.1016/j.tplants.2009.07.006) [DOI] [PubMed] [Google Scholar]
  • 17.Hackett C, Rose DA. 1972. A model of the extension and branching of a seminal root of barley, and its use in studying relations between root dimensions in the model. Aust. J. Biol. Sci. 25, 669–680. ( 10.1071/BI9720669) [DOI] [Google Scholar]
  • 18.Lungley DR. 1973. The growth of root systems–a numerical computer simulation model. Plant Soil 38, 145–159. ( 10.1007/BF00011223) [DOI] [Google Scholar]
  • 19.Fitter AH. 1987. An architectural approach to the comparative ecology of plant root systems. New Phytol. 106, 61–77. ( 10.1111/nph.1987.106.issue-s1) [DOI] [Google Scholar]
  • 20.Pagès L, Jordan M-O, Picard D. 1989. A simulation model of the three-dimensional architecture of the maize root system. Plant Soil 119, 147–154. ( 10.1007/BF02370279) [DOI] [Google Scholar]
  • 21.Diggle AJ. 1988. ROOTMAP – a model in three-dimensional coordinates of the growth and structure of fibrous root systems. Plant Soil 105, 169–178. ( 10.1007/BF02376780) [DOI] [Google Scholar]
  • 22.Tatsumi J, Yamauchi A, Kono Y. 1989. Fractal analysis of plant root systems. Ann. Bot. 64, 499–503. ( 10.1093/oxfordjournals.aob.a087871) [DOI] [Google Scholar]
  • 23.Pagès L, Asseng S, Pellerin S, Diggle A. 2000. Modelling root system growth and architecture. In Root Methods (eds Smit AL, Bengough AG, Engels C, van Noordwijk M, Pellerin S, van de Geijn SC), pp. 113–146. Berlin, Germany: Springer. [Google Scholar]
  • 24.Lynch JP, Nielsen KL, Davis RD, Jablokow AG. 1997. SimRoot: modelling and visualization of root systems. Plant Soil 188, 139–151. ( 10.1023/A:1004276724310) [DOI] [Google Scholar]
  • 25.Pagès L, Vercambre G, Drouet J-L, Lecompte F, Collet C, Le Bot J. 2004. Root Typ: a generic model to depict and analyse the root system architecture. Plant Soil 258, 103–119. ( 10.1023/B:PLSO.0000016540.47134.03) [DOI] [Google Scholar]
  • 26.Collet C, Löf M, Pagès L. 2006. Root system development of oak seedlings analysed using an architectural model effects of competition with grass. Plant Soil 279, 367–383. ( 10.1007/s11104-005-2419-9) [DOI] [Google Scholar]
  • 27.Garré S. et al. 2012. Parameterizing a dynamic architectural model of the root system of spring barley from minirhizotron data. Vadose Zone J. 11, 179 ( 10.2136/vzj2011.0179) [DOI] [Google Scholar]
  • 28.Leitner D, Klepsch S, Bodner G, Schnepf A. 2010. A dynamic root system growth model based on l-systems. Plant Soil 332, 177–192. ( 10.1007/s11104-010-0284-7) [DOI] [Google Scholar]
  • 29.Schnepf A. et al. 2018. CRootBox: a structural–functional modelling framework for root systems. Ann. Bot. 121, 1033–1053. ( 10.1093/aob/mcx221) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Pagès L, Pellerin S. 1994. Evaluation of parameters describing the root system architecture of field grown maize plants (Zea mays L.). Plant Soil 164, 169–176. ( 10.1007/BF00010068) [DOI] [Google Scholar]
  • 31.Chen YL, Dunbabin VM, Postma JA, Diggle AJ, Palta JA, Lynch JP, Siddique KHM, Rengel Z. 2011. Phenotypic variability and modelling of root structure of wild lupinus angustifolius genotypes. Plant Soil 348, 345 ( 10.1007/s11104-011-0939-z) [DOI] [Google Scholar]
  • 32.Beaumont MA, Zhang W, Balding DJ. 2002. Approximate Bayesian computation in population genetics. Genetics 162, 2025–2035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Toni T, Welch D, Strelkowa N, Ipsen A, Stumpf MPH. 2009. Approximate Bayesian computation scheme for parameter inference and model selection in dynamical systems. J. R. Soc. Interface 6, 187–202. ( 10.1098/rsif.2008.0172) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Csilléry K, Blum MGB, Gaggiotti OE, François O. 2010. Approximate Bayesian computation (abc) in practice. Trends Ecol. Evol. 25, 410–418. ( 10.1016/j.tree.2010.04.001) [DOI] [PubMed] [Google Scholar]
  • 35.Lobet G, Pagès L, Draye X. 2011. A novel image-analysis toolbox enabling quantitative analysis of root system architecture. Plant Physiol. 157, 29–39. ( 10.1104/pp.111.179895) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.El Zawily AM. et al. 2014. Friendly regulates mitochondrial distribution, fusion, and quality control in Arabidopsis. Plant Physiol. 166, 808–828. ( 10.1104/pp.114.243824) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Lobet G, Draye X, Périlleux C. 2013. An online database for plant image analysis software tools. Plant Methods 9, 38 ( 10.1186/1746-4811-9-38) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Band LR, Fozard JA, Godin C, Jensen OE, Pridmore T, Bennett MJ, King JR. 2012. Multiscale systems analysis of root growth and development: modeling beyond the network and cellular scales. Plant Cell 24, 3892–3906. ( 10.1105/tpc.112.101550) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Gillespie DT. 1977. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361. ( 10.1021/j100540a008) [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figure 1
rsif20190293supp1.svg (267.6KB, svg)
Supplementary Figure 2
rsif20190293supp2.svg (47KB, svg)

Data Availability Statement

The data and code used are freely available in Github repository https://github.com/StochasticBiology/root-inference.


Articles from Journal of the Royal Society Interface are provided here courtesy of The Royal Society

RESOURCES