Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Mar 31;17(3):e1008852. doi: 10.1371/journal.pcbi.1008852

Personalized computational model quantifies heterogeneity in postprandial responses to oral glucose challenge

Balázs Erdős 1,2,*,#, Bart van Sloun 1,2,#, Michiel E Adriaens 1,2, Shauna D O’Donovan 3, Dominique Langin 4, Arne Astrup 5, Ellen E Blaak 1,6, Ilja C W Arts 1,2,, Natal A W van Riel 7,
Editor: James Gallo8
PMCID: PMC8011733  PMID: 33788828

Abstract

Plasma glucose and insulin responses following an oral glucose challenge are representative of glucose tolerance and insulin resistance, key indicators of type 2 diabetes mellitus pathophysiology. A large heterogeneity in individuals’ challenge test responses has been shown to underlie the effectiveness of lifestyle intervention. Currently, this heterogeneity is overlooked due to a lack of methods to quantify the interconnected dynamics in the glucose and insulin time-courses. Here, a physiology-based mathematical model of the human glucose-insulin system is personalized to elucidate the heterogeneity in individuals’ responses using a large population of overweight/obese individuals (n = 738) from the DIOGenes study. The personalized models are derived from population level models through a systematic parameter selection pipeline that may be generalized to other biological systems. The resulting personalized models showed a 4-5 fold decrease in discrepancy between measurements and model simulation compared to population level. The estimated model parameters capture relevant features of individuals’ metabolic health such as gastric emptying, endogenous insulin secretion and insulin dependent glucose disposal into tissues, with the latter also showing a significant association with the Insulinogenic index and the Matsuda insulin sensitivity index, respectively.

Author summary

The postprandial glucose and insulin responses to an identical meal can vary greatly across individuals. Certain dynamic features of these response curves have been shown to be indicative of the state of the glucose homeostasis and therefore relevant for targeted lifestyle intervention. In this study, we implement personalized computational models of the insulin regulated glucose homeostasis that take advantage of the complete time-courses of postprandial glucose and insulin trajectories following an oral glucose challenge in a large population of overweight/obese individuals. We show that the personalized models capture the responses more accurately compared to population-level models. In addition, the physiological basis of the models provides insight into the variability in the glucose homeostasis of individuals. The model parameters represent relevant features of the metabolic health such as insulin secretion or insulin mediated disposal of glucose into tissues that also correlated with independent measures of insulin secretion and whole-body insulin resistance, respectively. Furthermore, the models can be quantified from comparatively non-invasive sampling techniques and may be readily transferable to the clinic.

Introduction

Glucose homeostasis is maintained by the complex interplay between several tissues and organs including the brain, pancreas, skeletal muscle, liver and adipose tissue. The evolution of plasma glucose and insulin concentrations during an oral glucose tolerance test (OGTT) provide a readout of the metabolic health of these underlying tissues capturing the effects of insulin sensitivity and metabolic resilience status [1]. Certain features of the standard 75g OGTT response curves are widely accepted to be representative of glycemic control, most notably the fasting and 2 hour post-load glucose values are used in the diagnosis of prediabetes and type 2 diabetes mellitus (T2DM) [2]. The area under the plasma glucose/insulin response curve (AUC) is an extensively employed measure to compare responses [3] and has been successfully used in targeted and even personalized nutrition approaches [46]. However, the AUC is a somewhat crude measure that may often lead to ambiguous classifications [7]. Therefore, certain dynamic properties of the glucose response curves e.g. peak time, have been nominated as relevant for pathophysiological characterization [6, 811]. In addition, the post-load glucose and insulin trajectories may be used to derive proxy measures of whole-body and tissue-specific insulin sensitivity to serve as a surrogate to the hyperinsulinemic-euglycemic clamp. The HOMA-IR [12] and Matsuda insulin sensitivity indices [13, 14] have been widely utilized to quantify whole-body insulin resistance from fasting and average postprandial glucose/insulin levels, respectively. In recent years, the increased recognition of tissue-specific insulin resistance [15] leading to metabolically distinct phenotypes, has resulted in the development of the HIRI and MISI indices, quantifying hepatic and skeletal muscle insulin resistance from OGTT responses [1618]. While these measures capture certain aspects of metabolic resilience, they rely upon single time-point or average glucose and insulin values taken from the response curves, as a result the dynamics of the time-courses are largely disregarded. Recently, Hulman et al. have shown that using a latent class mixed models framework, the glucose trajectories of healthy individuals following an OGTT may be classified into four distinct insulin sensitive phenotypes [19]. This approach—making use of the complete time-courses—highlights the importance of the dynamics of the glucose responses, however it does not allow for an individualized exploration due to the limited number of prospective classes.

The move towards personalized interventions requires the characterization of the large heterogeneity in individuals’ glycemic regulation. Therefore, a holistic approach, accounting for the dynamic properties of the response curves is needed on the individual level. Furthermore, the close interplay between plasma glucose and insulin concentrations calls for the evaluation of glucose and insulin trajectories as a whole, rather than as disjoint indicators.

Physiology-based mathematical models of the human glucose-insulin regulatory system can provide quantitative information on the dynamics while capturing the mechanistic link between glucose and insulin. Such models are built to describe the physiological processes by which insulin regulates glucose levels using a priori understanding of the underlying biological system. The detail to which the model can accurately simulate the glucose-insulin response mechanism depends on the desired scope and the availability of quantitative data. The Bergman model, a simplistic model of glucose disappearance containing only 5 parameters, has been extensively used to approximate insulin sensitivity and β-cell function using plasma glucose and insulin values following a frequently sampled intravenous glucose tolerance test [20]. The integrated glucose-insulin model has been used to describe population as well as individual responses to a frequently sampled OGTT [21], however its applicability (to nutritional and metabolic studies) is limited due to the complexity in the model’s glucose absorption term that is made possible by an unusually frequent sampling strategy. A more complex model built by Dalla-Man et al. provides a detailed account of the underlying processes governing glucose utilization following a meal [22]. Here, the complexity of the model is enabled by the availability of triple tracer glucose data, quantifying the glucose fluxes between tissues. While the Bergman model can be applied to individual data, the Dalla-Man model has mostly been applied to population average data for in silico simulation and testing of insulin pump systems. The Eindhoven-Diabetes Education Simulator (E-DES) is a comparatively simple multi-compartmental model containing 12 parameters that has been used to describe the dynamics of the glucose homeostasis in healthy, type 1 and type 2 diabetic populations [23, 24].

Quantifying uncertainty in model parameters is essential to understand the limitations and predictive power of the model [25]. It is particularly important to consider parameter identifiability when estimating model parameters on the individual level—where sensitivity to measurement error may be high—to retain parameters that can be reliably estimated. Identifiability analysis may be carried out through methods such as Profile Likelihood Analysis (PLA) to evaluate how well parameter values can be determined given the available data [26, 27].

The aim of the current work is to explore the heterogeneity in the glucose and insulin responses to an OGTT in a large population of individuals by developing personalized dynamic models of the insulin mediated glucose metabolism, using an adapted E-DES model. The model parameters are estimated from measured postprandial trajectories of both glucose and insulin, and represent physiologically relevant properties that in turn may be used in the early identification of deterioration in the glucose homeostasis. Furthermore, the workflow presented here for transitioning a dynamic model away from describing population averages and towards individual response patterns may prove useful in numerous other applications, as it is generalizable to other biological models and systems.

Materials and methods

Ethics statement

The Medical Ethical Committees of the respective countries approved the DIOGenes study protocol. Participants provided informed written consent, and all procedures were conducted in accordance with the Declaration of Helsinki. Trial registration number: NCT00390637.

Data

Data from the DIOGenes study (NCT00390637), a pan-European, multi-center, randomized controlled dietary intervention study were used in this work [28]. At the baseline of the intervention, following an overnight fast (n = 1118) overweight/obese (BMI > 27kg/m2) but otherwise healthy adult participants underwent a two hour 75g OGTT, with plasma samples taken at the fasting state (t = 0) and 30, 60, 90 and 120 minutes after the glucose ingestion. The plasma samples were subsequently analyzed for glucose and insulin concentrations. Responses at the baseline of the intervention were used. Individuals with an incomplete set of glucose and/or insulin measurements were excluded from the analysis.

Adapted E-DES model

The Eindhoven-Diabetes Education Simulator is a physiology-based mathematical model of the human insulin mediated glucose regulatory system in healthy, type 1 diabetes, and T2DM phenotypes [24]. The two compartment model describes the following physiological processes through coupled differential equations (see model schematic and details in S1 Fig, S1 and S2 Appendices): Glucose mass is emptied into the gut according to an exponential decay function, followed by uptake into the plasma proportionally to the amount of glucose present in the gut. Both glucose and insulin fluxes are considered in the plasma compartment. Insulin secretion from the pancreas is modelled through a proportional-integral-derivative (PID) controller, responding to elevated plasma glucose levels. The insulin response facilitates the insulin-dependent glucose disposal to tissues such as the muscle. In addition, there is a constant glucose removal from the plasma by obligate glucose oxidizers such as the brain or the red blood cells. While the plasma glucose levels are elevated, endogenous glucose production (EGP) in the liver is suppressed. Finally, insulin is cleared by the liver proportionally to the plasma insulin concentration, as well as by a transfer and degradation in the interstitial fluid. The parameters corresponding to these physiological parameters control the rate of change in glucose or insulin concentrations. Through modulation of the parameters, responses of metabolically different phenotypes may be simulated in silico. The model has been previously parameterized and validated on multiple OGTT data sets from healthy populations [24]. The E-DES model was implemented and analyzed in MATLAB 2018b (The Mathworks, Inc., Natick, Massachusetts, United States). For the current study population, an adapted E-DES model is used. Model equations, including a description of parameters and modifications are described in detail in S1 Appendix.

Parameter estimation

Parameters were estimated through minimizing the combined sum of squared residual (SSR) in the model prediction for glucose and insulin (Eq 1) using lsqnonlin, a non-linear least squares solver in MATLAB. To avoid becoming trapped in erroneous local minima, the optimal parameter sets were obtained following fifty initializations of the optimization algorithm with 25% random noise starting from the original parameter value for the average healthy population from the publication [24].

SSR=j=1mi=1N(γ((yi,j|θ)-di,j))2 (1)

Where m, and N represent the number of metabolites and the number of time-points, respectively. The measured data point is denoted by d, while y is the corresponding model prediction given the parameter vector θ. A weight factor γ = 0.1 was used in the case of insulin (γ = 1 in case of glucose) to account for the unit difference (mmol/L, mU/L for glucose and insulin, respectively) between the molecules.

Model selection

A model selection approach was implemented in order to systematically identify parameters for personalized model fitting. The aim of the approach is to maintain parameter functionality and identifiability when transitioning from modelling population average responses to individual responses. The workflow involves systematically reducing the number of parameters to estimate for each individual response to ensure reliable, accurate parameter estimates. The parameters that are not selected in the approach for personalized fitting are fixed to reference population values [24] across all individuals. The subset of model parameters to be estimated was selected based on the following criteria: the selected parameters had to (i) exhibit high sensitivity, (ii) demonstrate good model fit while maintaining parsimony, (iii) be identifiable. The steps to meet these criteria are detailed below and a flowchart of the approach is shown on Fig 1.

Fig 1. Flowchart of the model selection approach.

Fig 1

  • (i)

    Sensitivity analysis. We carried out local parameter sensitivity analysis (LPSA) by varying one parameter in both directions while maintaining the others at a constant value, inspecting the effect on the resulting model outcome. A threshold of 75% in both directions compared to the average healthy population values was selected as the limit of sensitivity. Parameters that exhibit sensitivity at this level are expected to have a large modulatory effect on the model outcome. Parameters that are not sensitive at the threshold were fixed as a constant to their respective values (reported in [24]), as they have little impact on the model behavior. Only sensitive parameters were considered in subsequent steps of the model selection.

  • (ii)

    Model fit. The set of all possible combinations of 3 or more sensitive parameters were generated. Subsequently models were fit on a set of representative responses from the DIOGenes data set with these candidate parameter sets estimated from the data, while the rest of the parameters were fixed to the population reference values. The representative responses comprised of the median normal glucose tolerant (NGT), impaired fasting glucose (IFG), impaired glucose tolerant (IGT), both IFG and IGT (IFG&IGT) and T2DM responses in the data, based on the American Diabetes Association (ADA) diagnosis criteria [29]. The median responses were calculated by taking the median glucose and insulin values per time-point across all individuals in the respective groups. In addition to the median responses, both extreme responses (largest and smallest response in the data set by area under the glucose curve) were also included. The model with the candidate parameter set that showed the lowest Akaike Information Criterion (AIC) score across the set of representative curves (i.e. NGT, IFG, IGT, IFG&IGT, T2DM, Min, Max) was selected as most parsimonious model.

  • (iii)
    Parameter identifiability. The parameter set that produced the most parsimonious model was finally evaluated for identifiability using Profile Likelihood Analysis (PLA) [27]. In PLA the value of one parameter is changed iteratively from its optimal value and the remaining parameters are re-estimated. An increase in the cost function (SSR) for the model fit indicates that a reliable parameter estimate has been obtained and the parameter is identifiable given the model structure and data. Confidence intervals were derived using a Chi-squared threshold on the likelihood (Eq 2).
    -2log(L(θPL)L(θopt))χ2(α,df) (2)
    Where χ2(α, df) is the α quantile of the χ2-distribution with df degrees of freedom, θPL and θopt are the profiled path and optimal parameter vectors, respectively. The threshold α was set to 0.95 and df equals one or the number of parameters (see S3 and S4 Figs)

Principal component analysis

The parameter space of the personalized E-DES model is visualized by reducing the number of dimensions from the number of estimated parameters to two dimensions using principal component analysis (PCA). Prior to PCA, the parameter values were normalized to zero mean and unit standard deviation.

Results

A total of 738 participants were included in the analysis, after excluding participants with incomplete OGTT measurements (n = 373) and participants with physiologically implausible responses (i.e. where the OGTT failed; n = 7). The identification of physiologically implausible responses was carried out by independent experts. The remaining 738 responses were characterized by the ADA criterion for prediabetes and diabetes as summarized in Table 1.

Table 1. Classification of participant’s responses based on ADA diabetes criteria.

Diagnosis1 NGT IFG IGT IFG&IGT T2DM
N 496 42 41 119 40
Age2[years] 40.7 (6.4) 42.0 (5.3) 43.6 (4.8) 41.8 (6.2) 45.0 (6.7)
Sex [%female] 65.8 42.9 53.7 68.1 55.0
BMI2 [kg m−2] 34.5 (4.8) 34.7 (4.5) 36.5 (5.8) 34.2 (4.5) 35.1 (5.1)

1 NGT: normal glucose tolerant, IFG: impaired fasting glucose, IGT: impaired glucose tolerant, T2DM: type 2 diabetes mellitus. For details about the criteria, see S1 Table.

2 Age and BMI are reported as mean and (standard deviation).

In general, DIOGenes contains overweight/obese but otherwise healthy participants [28]. However, characterization by the ADA guidelines shows that in fact, several participants may be diagnosed as prediabetic or type 2 diabetic.

Model selection

Out of the eleven parameters contained in the adjusted E-DES model, only the physiological parameters k1 to k9 (defined in Table C of S1 Appendix) were considered in the personalization, while the two remaining parameters, the shape factor and Michaelis-Menten constant for glucose uptake, were fixed to their respective population values. As the first step of the model selection approach local parameter sensitivity analysis was carried out. The outcome of the sensitivity analysis on the plasma glucose response is shown in Fig 2. The effect on plasma insulin can be seen in S2 Fig.

Fig 2. Local parameter sensitivity analysis demonstrated on simulated plasma glucose response.

Fig 2

Parameters k1, k4, k5, k6, k8, k9 were found to be sensitive at the ±75% threshold, and therefore were considered for further analyses, while the remainder of the parameters were kept constant in all subsequent analysis. As the second step of the model selection, the set of all possible combinations of 3,4,5, and 6 sensitive parameters were generated and models with these parameters estimated from data were assessed. This way a total of 42 different models were examined for model fit according to AIC on the median NGT, IFG, IGT, IFG&IGT, and T2DM responses as well as the largest and smallest response in the data set. The ten best performing candidate models with the resulting SSR and AIC values are shown in S2 Table.

The highest scoring model according to our criteria contained the parameters k1, k5, k6, and k8 with a SSR of 41.39. Visual inspection of the model output displayed good accordance with the majority of the data on the various group median and extreme responses as seen in Fig 3. The extreme responses are simulated less accurately compared to the median responses. Specifically, the model struggles with accurately capturing the part of the response that goes below basal. The best scoring model was subsequently evaluated for parameter identifiability in the last step of the model selection approach.

Fig 3. Plasma glucose and insulin simulation of the set of representative responses in the DIOGenes study with estimated parameters k1, k5, k6, k8.

Fig 3

The median responses were calculated as the median plasma glucose value of the ADA diabetes classification group at each time point (A). The ‘Min’ and ‘Max’ are the smallest and largest glucose responses in the data set, determined by area under the curve (B). Median measured values are shown as black dashes with the interquartile range and measured responses are indicated by black crosses.

The identifiability of the parameters k1, k5, k6, k8 was assessed on the median NGT, IFG, IGT, IFG&IGT, T2DM and the extreme responses to infer the reliability in estimating the selected parameters. The PLA profiles indicated that parameters were identifiable, with the exception of parameters k6 and k8, which were practically non-identifiable [27] for the lowest response. The parameter profiles can be found in S3 Fig. Additionally, to verify the choice of the 4 parameter model, the best performing 5 parameter candidate model (with parameters k1, k5, k6, k8, k9) was also evaluated for parameter identifiability. The PLA profiles from the 5 parameter model indicate that k9 was structurally non-identifiable in 6 out of 7 representative responses, with further two parameters (k6 and k8) proving to be non-identifiable in 2 out of 7 cases (S4 Fig).

The model selection pipeline resulted in a model with parameters k1, k5, k6, k8 to be estimated from experimental data in personalized models. The selected parameters describe the rate constant of glucose appearance in the gut (k1), the rate constant of insulin-dependent glucose uptake (k5), the proportional rate constant of insulin secretion due to the difference in the actual plasma glucose level compared to baseline (k6), and the insulin secretion dependent of the rate of change in plasma glucose (k8).

Individual simulation

A population of 738 personalized models were generated in silico through estimating the selected parameters on post-load time series of glucose and insulin in participants from the DIOGenes study. To evaluate the success of simulating individual responses, we compared the discrepancy of population specific simulations to that of individualized simulations. The median response was calculated in every group (NGT, IFG, IGT, IFG&IGT, and T2DM) and the selected 4 parameter model was used to simulate the median glucose and insulin trajectories on the calculated responses. The individuals’ measured data were then compared to the median simulations per group. The SSRs in the personalized model simulations were substantially lower than those of the median simulations in every group (Table 2), indicating that the personalized models were able to capture a wide range of response curves.

Table 2. Mean (standard deviation) of sum of squared residuals in the model simulations.

NGT IFG IGT IFG&IGT T2DM
Group simulation 149.87 213.52 205.15 153.46 195.93
(153.11) (205.15) (134.92) (126.97) (181.90)
Individual simulation 32.37 44.29 35.33 43.54 36.69
(36.26) (32.11) (47.89) (29.92) (33.56)

The best and worst personalized model simulations by SSR are shown in Fig 4A and 4B, respectively. While the measured glucose and insulin responses ranged from 1.8 to 18.3 mmol/L and from 2.0 to 749.0 mU/L, the simulations show good agreement with the measured data in most cases. To highlight other striking model behavior, additional, hand selected example responses and their corresponding simulations are shown in Fig 4C.

Fig 4. Time courses of plasma glucose and insulin concentrations for individuals with the lowest and highest simulation error (quantified by the SSR) in each ADA category, and other interesting model behaviour (A, B and C respectively).

Fig 4

Black crosses and orange/blue lines correspond to measurement and model simulation of glucose/insulin, respectively.

In particular, metabolite responses with an intermediate dip between two values were found to be difficult to capture using the 4 parameter model (e.g. participant 183, Fig 4C). However, there were cases of such bi-phasic curves, that the model could replicate accurately (e.g. participant 513, Fig 4C). In some cases, the model predicted a fast response, with a probable peak between the 0 and 30 minute measurements (e.g. participants 129, Fig 4C). Furthermore, the success of simulating complex shapes appeared to depend on the scale of the insulin values in the responses, where higher insulin values lead to difficulties in accurately fitting the glucose response (e.g. participant 175, Fig 4C).

The distribution of parameter values estimated from individuals’ responses are shown by subgroup in Fig 5. In general, the range of estimated parameters was greatest in the group that was NGT according to the ADA diabetes criteria, with values spanning the whole range of the other groups. The rate constant of glucose appearance in the gut (k1) was largest in the NGT and IFG groups. Similarly, insulin-dependent glucose uptake (k5), and glucose-dependent insulin production (k6) were lower in the IGT, IFG&IGT, and T2DM groups compared to the NGT and the IFG groups. The plasma glucose rate of change-dependent insulin production (k8) was lower in the IFG&IGT, and T2DM groups compared to the other groups. Additionally, the association of the parameter values with frequently used measures of insulin secretion and insulin resistance were evaluated to assess model structure. Parameters k6 and k8 associated with the insulinogenic index (r = 0.56, p < 0.001 and r = 0.49, p < 0.001, respectively; S7 Fig), a frequently used measure of first-phase insulin secretion [30]. Additionally, parameter k5, describing insulin mediated uptake of glucose into the periphery showed a significant positive correlation with the Matsuda index (Pearson r = 0.68, p < 0.001; S7 Fig).

Fig 5. The distribution of the estimated parameters k1, k5, k6, k8 by subgroup based on ADA diabetes criteria.

Fig 5

k1—rate constant of glucose appearance in the gut, k5—rate constant of insulin-dependent glucose uptake, k6—rate constant of insulin secretion proportional to glucose elevation, k8—rate constant of insulin secretion by the rate of change in glucose elevation. For details, see S1 Appendix. Observations outside the interquartile range of the 25th and 75th percentile of each group are visualized in S5 Fig. The boxes represent the 25th and 75th percentiles, the whiskers represent the min, and max values, and the horizontal line represents the median.

A better grasp of the parameter space of the model can be obtained by visualizing it after reducing the four dimensional space to two dimensions via principal component analysis. The personalized models in the resulting space are shown in Fig 6. The unique parameter set in each model defines the model’s place in the parameter space, where the model is colored according to the ADA criteria (A) and the participants’ Matsuda index (B). The explained variance and the loading vectors indicate that the parameters pertain to distinct mechanisms and retain their functionality (Fig 6B insert). The spread over the first two principal components demonstrate the large heterogeneity in the modelled population. Furthermore, the additional insight gained by the personalized models is illustrated when coloring the parameter space by the ADA criterion for prediabetes and diabetes (Fig 6A). The featured examples highlight the large heterogeneity that remains hidden when considering only the single time-point measures of the diagnosis categories, but is captured by the person specific models. Fig 6B shows examples of participants with different states of insulin sensitivity. The examples positioned along the direction of the loading vectors of k5 show responses with increasing insulin sensitivity noted by the rapid clearance of glucose from plasma (i.e. curve rapidly approaching baseline) as well as lower insulin secretion. Responses of individuals with varying levels of first-phase insulin secretion are shown on S8 Fig with the corresponding models in the parameter space colored by the insulinogenic index. Responses along the loading vectors of both k6 and k8 indicate increasing first-phase secretion of insulin, with highlighted examples of low (e.g. participant 347) moderate (e.g. participant 693) and high (e.g. participant 51) secretion. Additionally, S9 Fig highlights examples in the parameter space colored by the error in the simulation of the individual as measured by SSR.

Fig 6. Individual parameter sets in the parameter space of the model colored by the ADA diabetes criterion (A) and the Matsuda index (B) after reduction to 2d via principal component analysis.

Fig 6

The personalized model simulations of five participants’ responses are highlighted (individuals 21, 388, 378, 715, 339 on panel A and 522, 308, 730, 693, 605 on panel B). Orange and blue lines correspond to glucose and insulin model simulation, while crosses represent measured data. The loading vectors of k1, k5, k6, and k8 are shown in the purple insert in panel B.

Discussion

In this work, we implemented a pipeline to convert a physiological model of the postprandial glucose-insulin dynamics describing population averages into a personalizable model. A key aspect of the parameter selection process was to maintain certainty in the parameter estimates and consequent model predictions by systematically reducing the number of parameters to be estimated in personalized models, taking into account the availability of quantitative data. Furthermore, our approach allows the comparison of personalized models due to retaining the same model structure across individuals. The protocol does not require biological knowledge and may be generalized to models of other systems. The resulting personalized models were able to accurately describe a wide variety of responses from the DIOGenes data set, a large population of overweight/obese but otherwise healthy individuals. Furthermore, the estimated parameter values of the model allowed mechanistic insight into the differences in individuals’ glucose metabolism.

Parameters exhibiting high sensitivity exert a large effect on model outcome, whereas not sensitive parameters may be fixed to a constant value. Sensitivity of the parameters was assessed through their modulatory effect on glucose and insulin concentrations of an average healthy simulation, to keep in line with the study population. Due to the structure of the model, different responses might indicate different parameters to be sensitive, such as the parameter controlling the rate of endogenous glucose production, which is expected to behave differently when the glucose response goes below the basal level. However, in the current study population such behavior is rare (66/738 responses) and thus an average healthy simulation was considered adequate for sensitivity analysis. Following the selection of the sensitive parameters, parsimony and model fit was considered. The ADA group median and extreme responses were chosen to promote the model to be able to fit a wide range of responses. We hypothesize, that these responses are representative of the parameter space that we aim to capture with the model. Thus, if the model is able to capture these responses accurately, it is likely to be able to simulate arbitrary responses in intermediate states as well. In order to impose a criterion towards parsimony, we used the AIC to introduce a penalty term on the number of parameters in the model. As shown in S2 Table, candidate models with more parameters had a lower SSR. However, the top 5 and 6 parameter models performed only marginally better than the 4 parameter ones, with a SSR of 37.02, 35.20, 41.39 respectively. The best scoring 3 parameter model (k1, k5, k9) had more than twice the SSR (87.00) of the best 4 parameter model. Based on the AIC, the 4 parameter candidate model containing parameters k1, k5, k6 and k8 to be estimated was selected as the most parsimonious model. Finally, the identifiability of the candidate model was examined via PLA on the representative median and extreme responses. Besides the smallest response, PLA profiles showed that the parameters were identifiable (S3 Fig) indicating that a unique solution exists in the tested range.

The parameters identified in the model selection process indicate that the most discriminating processes in this population are transition of glucose from the stomach to the gut (k1), the insulin-dependent glucose uptake to the periphery (k5), and the processes representative of insulin secretion (k6 and k8). It is known that gastric emptying is a major determinant of postprandial glycemia that has been shown to exhibit large intra- and interindividual variability [31, 32]. The decline in insulin mediated glucose disposal into tissues such as the muscle, liver or adipose tissue is acknowledged as one of the key process leading to the development of T2DM [15]. Furthermore, defects in the first phase insulin secretion are known to appear in the early stages of deteriorating glucose control compared to abnormalities in second phase insulin secretion, which is more prevalent in advanced stages of T2DM [33]. The ability to potentially quantify these otherwise difficult to measure processes from time-series of postprandial glucose and insulin using a computation model may prove incredibly useful in the advent of personalized medicine and targeted nutritional interventions.

Following the identification of the model structure to be personalized, we elected to generate our population of personalized models by fitting the model to the individuals’ corresponding meal response data via maximum likelihood estimation (MLE) while fixing other parameters to population averages. Here, it is important to note that, approaches such as non-linear mixed effects modelling, where population and individual level dynamics are estimated simultaneously, may provide a valuable alternative to our approach [34, 35]. In addition, future applications may benefit from integrating regularization in the MLE as proposed by Dolejsch et al. [36].

The personalized models showed a 4-5 fold decrease in SSR in all groups compared to group simulations, confirming that the model personalization was successful, as well as re-enforcing the need for a personalized approach when assessing such dynamic responses. A good accordance with data was further confirmed by visually inspecting the model output (Fig 3 and S6 Fig). However, the SSR does not always give a realistic overview of the model fit, for instance, it can be susceptible to bias towards responses with extreme glucose and especially insulin values. Thus, to further highlight the limits of the model, a manual selection of responses and corresponding model simulations were shown in Fig 4C. The model frequently struggled with accurately predicting an intermediate dip in the glucose response (e.g. participant 183, Fig 4C). The more complex bi-phasic shapes were only accurately modelled in a few cases (e.g. participants 513, Fig 4C), although it is thought that, this lack of fit could be avoided by estimating additional parameters. Responses with high insulin values drove the model to fail at accurately capturing the glucose response. This is partly due to the combined glucose and insulin error function used in model fitting, in which insulin values were multiplied by 0.1 to account for the difference in scale compared to glucose. However, in case of extremely high insulin responses, the insulin values are still favored during the optimization (e.g. participants 175, 556, Fig 4B and 4C). By estimating additional glucose parameters, such as the parameter handling endogenous glucose production (k3), these responses might be captured more accurately. Additionally, in some cases where the glucose levels quickly returned below the basal value the model exhibited oscillatory behaviour (e.g. participants 129, Fig 4C). This may originate from the parameter estimates relating to the insulin secretion term in the model, however further examination of this was outside the scope of the current study. Furthermore, it is worth noting that the outlying parameter estimates not necessarily indicate erroneous simulations but rather unusual or extreme responses as can be seen on Fig 6, S8 and S9 Figs.

Each of the 738 personalized models contain a unique parameter set pertaining to the physiological state of the participant’s glucose homeostasis. The largest range for all of the estimated parameters was found in the NGT group, which could partly be explained due to the data set containing more normo-glycemic individuals (see Table 1) resulting in a larger variability. Furthermore, normo-glycemic individuals are also known to be more likely to exhibit bi-phasic responses [11], raising the variability of responses, and thereby the range of estimated parameters values in this category. In addition, the groupings defined by the ADA criteria only consider the fasting and 2h plasma glucose values while ignoring the insulin levels. Thus, individuals that exhibit normal glucose levels at the fasting and 2h time-points due to unusually high insulin values still end up in the NGT group. This lack of consideration for the dynamics and insulin values make it difficult to detect early deterioration in individuals’ responses indicative of insulin resistance. However, taking into account the complete dynamics of both glucose and insulin the personalized models outlined here are able to indicate such transitions before they are detected by steady state or single time-point measures (e.g. 221, 522 Fig 6). By screening for the parameter estimates of k6 and k8 one can identify cases where the glucose response appears normoglycemic, however the insulin levels are abnormally high.

Variation in gastric emptying linked to obesity has been previously reported, however we found no difference in the parameter estimates for k1 between overweight and obese participants as well as no association between the parameter k1 and BMI [37]. Importantly, the model parameters corresponding to insulin secretion (k6, k8) were found to be lower for individuals with more severe metabolic conditions (IGT, IFG&IGT, T2DM). The insulin secretion parameters also showed a significant association with the insulinogenic index, a frequently used measure of insulin secretion. While the parameter controlling the insulin-dependent glucose uptake k5 was lower in IGT, IFG&IGT and T2DM compared to the other groups and showed a significant association with the Matsuda index. These findings reinforce that the model structure captures relevant features of the insulin mediated glucose homeostasis and the personalized models can distinguish between divergent impairments in the insulin regulated glucose control. Therefore, our modeling framework might prove beneficial in revealing nuanced behaviour specifically for the early detection of decline in the glucose homeostasis from a standard five time-point OGTT. Moreover, the personalized models may be used to assess the effects of lifestyle and diet interventions, where the observed effects can be quite subtle. Our results also highlights the possibility of using such an approach to generate cohorts of virtual patients with varying glucose homeostasis for potential in silico testing.

The population in the study may be considered relatively homogeneous in terms of glucose homeostasis, as measured by current single time-point measures such as the ADA criterion. However, the personalized models utilizing the dynamic, intertwined plasma glucose and insulin responses of individuals, allowed the quantification of an immense heterogeneity in the responses even within the ADA groups. Furthermore, the mechanistic nature of the model promotes the identification and allows comparison of distinctive processes underlying individuals’ metabolic health. We believe that such personalized modelling approaches will be essential in advancing personalized nutrition.

Conclusion

The systematic model selection pipeline implemented in this work allows the personalization of a mathematical model through reducing the number of parameters to be estimated in personalized models. The approach results in the most parsimonious model that contains identifiable parameters. The selection pipeline is generalizable in the sense, that it does not require biological insight to implement, therefore it may be applied to other systems or models to gain insight on the individual level. The E-DES model, a computational model of the human glucose-insulin system, was personalized using the approach and subsequently a population of personalized models were simulated from a large data set of overweight/obese but otherwise healthy individuals. The personalized models, consisting of only four parameters estimated from experimental data were capable of simulating a wide variety of postprandial glucose and insulin responses to a standard OGTT from the DIOGenes data set. Taking advantage of a frequently sampled time-series of both glucose and insulin the dynamic models were able to capture a large, previously overlooked heterogeneity in the population. The mechanistic aspect of the model allows the description and comparison of the physiological state of the individuals’ glucose homeostasis and provide mechanistic insight into the glycemic variability observed in the responses.

Supporting information

S1 Fig. Schematic of the E-DES model in use.

(TIF)

S2 Fig. Local parameter sensitivity analysis on the simulated plasma insulin response.

(TIFF)

S3 Fig. Profile Likelihood Analysis results of the 4 parameter model (k1, k5, k6, k8) on the median NGT, IFG, IGT, IFG&IGT, T2DM, min and max responses.

The red star indicates the SSR of the model fitted using the optimal parameter values estimated from data, while the blue line corresponds to the error as the other parameter values are being re-estimated after adjusting the parameter value iteratively. The dashed lines indicate confidence intervals where the degrees of freedom equals one (lower), and the number of parameters (upper), respectively.

(TIF)

S4 Fig. Profile Likelihood Analysis results of the 5 parameter model (k1, k5, k6, k8, k9) on the median NGT, IFG, IGT, IFG&IGT, T2DM, min and max responses.

The red star indicates the SSR of the model fitted using the optimal parameter values estimated from data, while the blue line corresponds to the error as the other parameter values are being re-estimated after adjusting the parameter value iteratively. The dashed lines indicate confidence intervals where the degrees of freedom equals one (lower), and the number of parameters (upper), respectively.

(TIF)

S5 Fig. The distribution of parameters k1, k5, k6, k8 by subgroup (based on ADA diabetes criteria) with the outliers highlighted.

The boxes represent the 25th and 75th percentiles, the whiskers represent the min, and max values, and the horizontal line represents the median.

(TIFF)

S6 Fig. Pooled residuals in the personalized models per time-point per metabolite, colored by the ADA prediabetes and diabetes diagnosis criteria.

(TIFF)

S7 Fig. Pairwise scatter plots and density plots of the personalized model parameters, the Insulinogenic index and the Matsuda index from the DIOGenes study colored by the ADA prediabetes and diabetes diagnosis criteria.

(TIF)

S8 Fig. Personalized models colored by the Insulinogenic index in the parameter space of the model after reduction to 2d via principal component analysis.

The personalized model simulations of five participants with varying first-phase insulin secretion are highlighted (individuals 51, 45, 347, 430, 693). Orange and blue lines correspond to plasma glucose and insulin model simulation, while crosses represent measured data.

(TIFF)

S9 Fig. Personalized models colored by SSR in the parameter space of the model after reduction to 2d via principal component analysis.

The personalized model simulations of five participants with varying first-phase insulin secretion are highlighted (individuals 175, 738, 556, 676, 445). Orange and blue lines correspond to plasma glucose and insulin model simulation, while crosses represent measured data.

(TIFF)

S1 Appendix. E-DES model structure, fluxes, inputs, parameters and constants.

(PDF)

S2 Appendix. MATLAB implementation of the model used in the manuscript.

(ZIP)

S1 Table. Criteria for prediabetes and diabetes classification used in this study.

Based on the standard two hour OGTT by the American Diabetes Association.

(PDF)

S2 Table. Results of step two of the model selection approach.

Sum of squared residuals (SSR) and Akaike Information Criterion (AIC) of the ten best performing candidate models.

(PDF)

Acknowledgments

The authors would like to thank the staff and participants of the DIOGenes study.

Data Availability

The data supporting the findings in this study are available to eligible researchers from Gabby Hul [g.hul@maastrichtuniversity.nl]. The data are not publicly available due to ethical restrictions and privacy of participant data.

Funding Statement

The project is organized by and executed under the auspices of TiFN [https://www.tifn.nl/], a public - private partnership on precompetitive research in food and nutrition. Funding for this research was obtained by E.B., M.A., I.A. and N.vR. from DSM Nutritional Products [https://www.dsm.com], FrieslandCampina [https://www.frieslandcampina.com/], Danone Nutricia Research [https://www.nutriciaresearch.com/] and the Topsector Agri&Food [https://topsectoragrifood.nl/] (grant number: TiFN 16NH04). Additional funding by the Netherlands Organisation for Scientific Research [https://www.nwo.nl/] (grant number: ALWTF.2016.021) was awarded to I.A and N.vR. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. van Ommen B, van der Greef J, Ordovas JM, Daniel H. Phenotypic flexibility as key factor in the human nutrition and health relationship. Genes & Nutrition. 2014;9(5):423. 10.1007/s12263-014-0423-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Tabák AG, Herder C, Rathmann W, Brunner EJ, Kivimäki M. Prediabetes: a high-risk state for diabetes development. The Lancet. 2012;379(9833):2279–2290. 10.1016/S0140-6736(12)60283-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Meng H, Matthan NR, Ausman LM, Lichtenstein AH. Effect of macronutrients and fiber on postprandial glycemic responses and meal glycemic index and glycemic load value determinations. The American Journal of Clinical Nutrition. 2017;105(4):842–853. 10.3945/ajcn.116.144162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Zeevi D, Korem T, Zmora N, Israeli D, Rothschild D, Weinberger A, et al. Personalized Nutrition by Prediction of Glycemic Responses. Cell. 2015;163(5):1079–1094. 10.1016/j.cell.2015.11.001 [DOI] [PubMed] [Google Scholar]
  • 5. Astrup A, Hjorth MF. Classification of obesity targeted personalized dietary weight loss management based on carbohydrate tolerance. European Journal of Clinical Nutrition. 2018;72(9):1300–1304. 10.1038/s41430-018-0227-6 [DOI] [PubMed] [Google Scholar]
  • 6. Berry SE, Valdes AM, Drew DA, Asnicar F, Mazidi M, Wolf J, et al. Human postprandial responses to food and potential for precision nutrition. Nature Medicine. 2020. 10.1038/s41591-020-1130-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Potteiger J, Jacobsen D, Donnelly J. A comparison of methods for analyzing glucose and insulin areas under the curve following nine months of exercise in overweight adults. International Journal of Obesity. 2002;26(1):87–89. 10.1038/sj.ijo.0801839 [DOI] [PubMed] [Google Scholar]
  • 8. Tschritter O, Fritsche A, Shirkavand F, Machicao F, Haring H, Stumvoll M. Assessing the Shape of the Glucose Curve During an Oral Glucose Tolerance Test. Diabetes Care. 2003;26(4):1026–1033. 10.2337/diacare.26.4.1026 [DOI] [PubMed] [Google Scholar]
  • 9. Tura A, Morbiducci U, Sbrignadello S, Winhofer Y, Pacini G, Kautzky-Willer A. Shape of glucose, insulin, C-peptide curves during a 3-h oral glucose tolerance test: any relationship with the degree of glucose tolerance? American Journal of Physiology-Regulatory, Integrative and Comparative Physiology. 2011;300(4):R941–R948. 10.1152/ajpregu.00650.2010 [DOI] [PubMed] [Google Scholar]
  • 10. Frøslie KF, Røislien J, Qvigstad E, Godang K, Bollerslev J, Voldner N, et al. Shape information from glucose curves: Functional data analysis compared with traditional summary measures. BMC Medical Research Methodology. 2013;13(1). 10.1186/1471-2288-13-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Cheng X, Yang N, Li Y, Sun Q, Qiu L, Xu L, et al. The shape of the glucose response curve during an oral glucose tolerance test heralds β–cell function in a large Chinese population. BMC Endocrine Disorders. 2019;19(1):119. 10.1186/s12902-019-0446-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Matthews DR, Hosker JP, Rudenski AS, Naylor BA, Treacher DF, Turner RC. Homeostasis model assessment: insulin resistance and beta-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia. 1985;28(7):412–419. 10.1007/BF00280883 [DOI] [PubMed] [Google Scholar]
  • 13. Matsuda M, DeFronzo RA. Insulin sensitivity indices obtained from oral glucose tolerance testing: comparison with the euglycemic insulin clamp. Diabetes Care. 1999;22(9):1462–1470. 10.2337/diacare.22.9.1462 [DOI] [PubMed] [Google Scholar]
  • 14.Blaak EE. Current metabolic perspective on malnutrition in obesity: towards more subgroup-based nutritional approaches? Proceedings of the Nutrition Society. 2020; p. 1–7. [DOI] [PMC free article] [PubMed]
  • 15. Roden M, Shulman GI. The integrative biology of type 2 diabetes. Nature. 2019;576(7785):51–60. 10.1038/s41586-019-1797-8 [DOI] [PubMed] [Google Scholar]
  • 16. Abdul-Ghani MA, Matsuda M, Balas B, DeFronzo RA. Muscle and Liver Insulin Resistance Indexes Derived From the Oral Glucose Tolerance Test. Diabetes Care. 2007;30(1):89–94. 10.2337/dc06-1519 [DOI] [PubMed] [Google Scholar]
  • 17. Blanco-Rojo R, Alcala-Diaz JF, Wopereis S, Perez-Martinez P, Quintana-Navarro GM, Marin C, et al. The insulin resistance phenotype (muscle or liver) interacts with the type of diet to determine changes in disposition index after 2 years of intervention: the CORDIOPREV-DIAB randomised clinical trial. Diabetologia. 2016;59(1):67–76. 10.1007/s00125-015-3776-4 [DOI] [PubMed] [Google Scholar]
  • 18. O’Donovan SD, Lenz M, Goossens GH, van der Kallen CJH, Eussen SJMP, Stehouwer CDA, et al. Improved quantification of muscle insulin sensitivity using oral glucose tolerance test data: the MISI Calculator. Scientific Reports. 2019;9(1):9388. 10.1038/s41598-019-45858-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Hulman A, Witte DR, Vistisen D, Balkau B, Dekker JM, Herder C, et al. Pathophysiological Characteristics Underlying Different Glucose Response Curves: A Latent Class Trajectory Analysis From the Prospective EGIR-RISC Study. Diabetes Care. 2018;41(8):1740–1748. 10.2337/dc18-0279 [DOI] [PubMed] [Google Scholar]
  • 20. Bergman RN, Ider YZ, Bowden CR, Cobelli C. Quantitative estimation of insulin sensitivity. The American journal of physiology. 1979;236(6):E667–677. [DOI] [PubMed] [Google Scholar]
  • 21. Silber HE, Frey N, Karlsson MO. An Integrated Glucose-Insulin Model to Describe Oral Glucose Tolerance Test Data in Healthy Volunteers. The Journal of Clinical Pharmacology. 2010;50(3):246–256. 10.1177/0091270009341185 [DOI] [PubMed] [Google Scholar]
  • 22. Dalla Man C, Rizza RA, Cobelli C. Meal Simulation Model of the Glucose-Insulin System. IEEE Transactions on Biomedical Engineering. 2007;54(10):1740–1749. 10.1109/TBME.2007.893506 [DOI] [PubMed] [Google Scholar]
  • 23. Maas AH, Rozendaal YJW, van Pul C, Hilbers PAJ, Cottaar WJ, Haak HR, et al. A Physiology-Based Model Describing Heterogeneity in Glucose Metabolism: The Core of the Eindhoven Diabetes Education Simulator (E-DES). Journal of Diabetes Science and Technology. 2015;9(2):282–292. 10.1177/1932296814562607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Maas AH. Playing with numbers: the development of an educational diabetes game. Technische Universiteit Eindhoven; 2017.
  • 25. Kreutz C, Raue A, Kaschek D, Timmer J. Profile likelihood in systems biology. FEBS Journal. 2013;280(11):2564–2571. 10.1111/febs.12276 [DOI] [PubMed] [Google Scholar]
  • 26. Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingmüller U, et al. Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics. 2009;25(15):1923–1929. 10.1093/bioinformatics/btp358 [DOI] [PubMed] [Google Scholar]
  • 27. Vanlier J, Tiemann CA, Hilbers PAJ, van Riel NAW. Parameter uncertainty in biochemical models described by ordinary differential equations. Mathematical Biosciences. 2013;246(2):305–314. 10.1016/j.mbs.2013.03.006 [DOI] [PubMed] [Google Scholar]
  • 28. Larsen TM, Dalskov S, Van Baak M, Jebb S, Kafatos A, Pfeiffer A, et al. The Diet, Obesity and Genes (Diogenes) Dietary Study in eight European countries—a comprehensive design for long-term intervention. Obesity Reviews. 2010;11(1):76–91. 10.1111/j.1467-789X.2009.00603.x [DOI] [PubMed] [Google Scholar]
  • 29. American Diabetes Association. 2. Classification and Diagnosis of Diabetes. Diabetes Care. 2015;38(Supplement_1):S8–S16. 10.2337/dc15-S005 [DOI] [PubMed] [Google Scholar]
  • 30. Yeckel CW, Weiss R, Dziura J, Taksali SE, Dufour S, Burgert TS, et al. Validation of Insulin Sensitivity Indices from Oral Glucose Tolerance Test Parameters in Obese Children and Adolescents. The Journal of Clinical Endocrinology & Metabolism. 2004;89(3):1096–1101. 10.1210/jc.2003-031503 [DOI] [PubMed] [Google Scholar]
  • 31. Marathe CS, Rayner CK, Jones KL, Horowitz M. Relationships Between Gastric Emptying, Postprandial Glycemia, and Incretin Hormones. Diabetes Care. 2013;36(5):1396–1405. 10.2337/dc12-1609 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Grimm M, Koziolek M, Kühn JP, Weitschies W. Interindividual and intraindividual variability of fasted state gastric fluid volume and gastric emptying of water. European Journal of Pharmaceutics and Biopharmaceutics. 2018;127:309–317. 10.1016/j.ejpb.2018.03.002 [DOI] [PubMed] [Google Scholar]
  • 33. Seino S, Shibasaki T, Minami K. Dynamics of insulin secretion and the clinical implications for obesity and diabetes. Journal of Clinical Investigation. 2011;121(6):2118–2125. 10.1172/JCI45680 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. P Denti, A Bertoldo, P Vicini, C Cobelli. Nonlinear Mixed Effects to Improve Glucose Minimal Model Parameter Estimation: A Simulation Study in Intensive and Sparse Sampling. IEEE Transactions on Biomedical Engineering. 2009;56(9):2156–2166. 10.1109/TBME.2009.2020171 [DOI] [PubMed] [Google Scholar]
  • 35. Fröhlich F, Reiser A, Fink L, Woschée D, Ligon T, Theis FJ, et al. Multi-experiment nonlinear mixed effect modeling of single-cell translation kinetics after transfection. npj Systems Biology and Applications. 2018;4(1):42. 10.1038/s41540-018-0079-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Dolejsch P, Hass H, Timmer J. Extensions of l1 regularization increase detection specificity for cell-type specific parameters in dynamic models. BMC Bioinformatics. 2019;20(1):395. 10.1186/s12859-019-2976-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Oberoi A, Giezenaar C, Jensen C, Lange K, Hausken T, Jones KL, et al. Acute effects of whey protein on energy intake, appetite and gastric emptying in younger and older, obese men. Nutrition & Diabetes. 2020;10(1):37. 10.1038/s41387-020-00139-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008852.r001

Decision Letter 0

Jason A Papin, James Gallo

14 Dec 2020

Dear Mr. Erdos,

Thank you very much for submitting your manuscript "Individualized computational model quantifies heterogeneity in postprandial responses to oral glucose challenge" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

James Gallo

Associate Editor

PLOS Computational Biology

Jason Papin

Editor-in-Chief

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Reproducibility report has been uploaded as an attachment.

Reviewer #2: Comments for the Author

The manuscript by Erdos B. et al. simulates the data of oral glucose tolerance test for overweight / obese participants in DIOGene study. The authors previously launched E-DES model using already existing dataset of OGTT and in this research by fitting to individual data the weight of each parameter and its correlation to physiological findings. This model seems to be useful for estimating mechanistic insight such as individual insulin resistance or endogenous insulin secretion and evaluating the effect of interventions.

1 In previous paper (Mass AH, et al. 2015), the parameters k5, k6, k7 and k8 were the most sensitive to change by a multiparametric sensitivity analysis. In this research, the parameter k7 is omitted on the basis of LPSA while the parameter k1 is employed to following analysis.

1.1 What is the reason of these discrepancies between previous research and this research? Does it reflect the difference in backgrounds between previously used dataset and the data of DIOGenes study?

1.2 The parameter k1 changes the peak concentration and time of gut glucose mass resulting in plasma glucose concentration. The authors should discuss about the association between the factor regulating parameter k1 (e.g. intestinal motility) and the variation of overweight / obesity subjects.

1.3 Is the parameter k1 also higher level in NGT or IFG than ones in IGT, IFG&IGT and T2DM? As well as the parameters k5, k6 and k8 (line 250-254), this should be mentioned.

2 Although the authors classify the participant in DIOGenes according to ADA diagnosis criteria (line 144-147), the median of blood glucose in figure 2 seems strange. Since there may be some errors in column annotations, please explain with actual median values and correct if needed. When corrected, please mention about the rightfulness of following analysis.

2.1 Plasma glucose level at 120 min in IGT seems lower than 140mg/dL.

2.2 Plasma glucose level at 120 min in IFG seems higher than 140mg/dL.

2.3 Plasma glucose levels at 0 min in IGT and IGF are higher than one in IFG&IGT.

3 Can this model predict the OGTT patterns of new subjects? In other words, are these significant parameters of k1, k5, k6 and k8 able to be predicted by some other physical parameters (e.g. Body mass index, HOMA-IR / Matsuda index, HbA1c, etc)?

4 PCA analysis for these four parameters shown in figure 5 is very interesting. At a glance, NGT is a most heterogenous population but no significant difference between groups (e.g. no group make clear cluster or tendency). In other words, no differential tendency of diagnostic classification along with vectors of the parameter k1, k5, k6 and k8 is shown. In terms of the pathogenesis of diabetes, T2DM will show lower insulin secretion (k6 and k8) and/or insulin resistance (k5). How the authors explain this discrepancy? The opposite direction of k5 and one of k6 or k8 may be explanation for it, is it a feature of this modeling or used data set?

5 In figure S5, the association between SSR and the four parameters are well described. Participant 175 shows higher SSR resulting incorrect simulation for the parameters. While participant 556 shows notably high SSR but the parameters locates within an average range. This may be a result of the difference in absolute value of plasma glucose concentration. The authors may be better to normalize the SSR using absolute value.

6 The parameters visualized to reflect the physical status are most important and interesting in this research.

6.1 The authors visualized the correlations between these parameters and insulin sensitivity in figure 5. While endogenous insulin secretion from pancreatic beta cells is also important on the pathogenesis of type 2 diabetes. Insulinogenic index calculated from OGTT result is often good indicator of endogenous insulin secretion. The association between these parameters and the indicator of endogenous insulin secretion should be shown.

6.2 In this study, can any alteration in these parameters after intervention be observed?

Reviewer #3: In this manuscript, the authors fitted their previous well-established physiology-based dynamics model of glucose-insulin system to individuals' data to elucidate the heterogeneity in individuals' responses. In order to reduce the number of estimated parameters for individualized models, the authors did sensitivity analysis, model selection based on Akaike information criterion (AIC), examined the parameter identifiability using profile likelihood, and finally chose the model with four parameters (k1, k5, k6, k8) to be estimated. The individualized model fitted pretty good, although the authors also reported its inability in some cases. Based on the estimated parameters of individualized models, the authors also did principal component analysis (PCA) and showed correlation between k5 (rate constant of insulin-dependent glucose uptake) with Matsuda index.

Overall, this is a pretty good study. The manuscript looks quite clear and easy for understanding. The purpose of this study developing individualized models seems to elucidate the heterogeneity in individuals' responses. It might be better if the authors could provide some more details in the application of these individualized models. And I listed some other comments as follows.

Major comments:

1. Page 10/22 Line 200, 'inability to go below fasting value', is it possible that this behavior is due to magnitude difference in k10 and k3 (as shown in Appendix S1 Table 3). In fact, I did not find k10 = 2.60 in ref [23]; I am not sure if I missed something.

With similar thoughts, since these rate constants k1-k10 are all multiplicative in the dynamics equations, maybe the authors should explore some magnitude of possible values (such as 0.01X, 0.02X, 0.05X, 0.1X, ... to 100X) instead of current explred range 25% to 175%? And maybe that is part of the reason why some of the rate constants are not sensitive (as shown in Fig 1).

2. Page 12/22 Line 255, 'significant positive correlation with the Matsuda index'. Could the authors show a scatter plot between the two variables for the reported correlation?

Similarly, for better understanding of the heterogeneity, it might be better to also show the pairwise scatter plot with density contour between k1, k5, k6 and k8, in additional to the boxplot in Fig 4 and the PCA plot in Fig 5.

3. Page 14/22 Line 318, 'as this type of reponse is rare in modelled population'. How many cases are with this type of response? Will it help if the authors first select out these cases, and then estimate the parameters on this group of people?

4. Page 15/22 Line 335, 'The individualized models showed a 4-5 fold derease in SSR in all groups compared to group simulations, indicating that the model individualization was successful'. I am wondering the individualized model is estimating 4 parameters to fit 8 data points, right? It might be expected that individualized models would have smaller SSR. Although the authors have shown some cases of under-fitting (Fig 3-B, 3-C, and Fig S5), is it possible for the individualized models over-fitting the data? In other words, how reliable are the estimated parameters of individualized models?

If it is estimating 4 parameters to fit 8 data points, and over-fitting might be an issue, then two possible suggestions might be: (1) using Bayesian framework on parameter estimation; (2) exploring the relationship between individuals by similarity in their data points (e.g. hierarchical clustering or tSNE embedding), and then estimating and possibly smoothing the paramters considring the group of neighbors.

5. Fig 3B, participant 556, it looks weird to see a sharp angle on the right of the red curve when it reaches the fasting level, which might indicate that the parameters for glucose level less than its fasting level are wrongly specified in magnitude difference, which might be the reason for the inability of the model going below fasting value.

Minor comments:

1. Page 6/22 Line 118, 'a weight of 0.1', is the weight inside or outside the square of formula (1)? And m and N in formula (1) might be better defined or explained.

Instead of fitting to median (or min or max) responses, is it possible or better to estimate the parameters with confidence interval of the model by fitting it to all the data for each group (NGT, IGF, IGT, IGF&IGT, and T2DM)?

2. Page 7/22 Line 132, 'original publication' might be better have the citation to ref [23].

3. Page 8/22 Line 152, 'seven curves' looks unclear and confuses me, until I saw Fig 2.

4. Page 8/22 Line 161, 'X^2(alpha, df)', what number are used as alpha and df? And this term 'X^2(alpha, df)' is not shown exactly the same in formula (2).

5. Page 11/22 Line 241, 'the model could replicate accurately (e.g. participants 110, 513, Fig 3-C).' The fitting to data points of participant 110 looks not so accurate, while the fitting to data points of participant 513 looks much more accurate.

6. Fig 4 legend, box-plot elements might be better defined, such as median, box limits, and whiskers. And 'Fig S3' in Fig 4 legend might be typo of 'Fig S4'.

7. Page 16/22 Line 367, 'individuals ... due to unually high insulin values still end up in the NGT group'. How many such cases in 496 NGT individuals?

8. Fig 1, the label of y-axis might be better as 'Plasma glucose, mmol/L'; or mention it in figure legend.

9. Fig 5A-B, the color coding might be too close for distinguishing (e.g. IGT vs IFG); and for matsuda index, it might be better colored in monocolor, e.g. white-green-black.

10. Fig 5B, The loading vector subplot is not mentioned in figure legend; and no explanation of the color of the gray scatter points in that subplot.

11. Table S2, it might be better if the authors could show some percentiles (e.g. 5%, 25%, 50%, 75%, 95%) in addition to the mean and standard deviation of each parameter. And since the parameters are multiplicative, and Fig 4 shows near normal after log-transformation, the standard deviation without log-transformation might not be very meaningful.

In addition, it might be helpful if the authors could compare Table S2 with the estimates with confidence intervals of the models fitted to the median responses.

Reviewer #4: Uploaded as an attachment.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: None

Reviewer #2: Yes

Reviewer #3: No: The authors stated that the original data are not publicly available due to ethical restrictions and privacy, but are available on request from the DIOGenes study data manager. The MATLAB code has been provided as Appendix S2.

Reviewer #4: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Reviewer #4: Yes: Lindsay Clegg

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

Attachment

Submitted filename: Reproducible_report_PCOMPBIOL_D_20_01381.pdf

Attachment

Submitted filename: PCB Nove 2020 Response.docx

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008852.r003

Decision Letter 1

Jason A Papin, James Gallo

3 Mar 2021

Dear Mr. Erdos,

We are pleased to inform you that your manuscript 'Personalized computational model quantifies heterogeneity in postprandial responses to oral glucose challenge' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

James Gallo

Associate Editor

PLOS Computational Biology

Jason Papin

Editor-in-Chief

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The reproducibility report has been uploaded as an attachment.

Reviewer #2: The manuscript has been revised satisfactory.

Reviewer #3: The authors have made great effort to improve their manuscript, and have addressed all my concerns.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: None

Reviewer #2: None

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Anand K. Rampadarath

Reviewer #2: No

Reviewer #3: Yes: Adam Yongxin Ye

Attachment

Submitted filename: Reproducible_report_PCOMPBIOL_D_20_01381R1 (1).pdf

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008852.r004

Acceptance letter

Jason A Papin, James Gallo

16 Mar 2021

PCOMPBIOL-D-20-01381R1

Personalized computational model quantifies heterogeneity in postprandial responses to oral glucose challenge

Dear Dr Erdos,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Alice Ellingham

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Schematic of the E-DES model in use.

    (TIF)

    S2 Fig. Local parameter sensitivity analysis on the simulated plasma insulin response.

    (TIFF)

    S3 Fig. Profile Likelihood Analysis results of the 4 parameter model (k1, k5, k6, k8) on the median NGT, IFG, IGT, IFG&IGT, T2DM, min and max responses.

    The red star indicates the SSR of the model fitted using the optimal parameter values estimated from data, while the blue line corresponds to the error as the other parameter values are being re-estimated after adjusting the parameter value iteratively. The dashed lines indicate confidence intervals where the degrees of freedom equals one (lower), and the number of parameters (upper), respectively.

    (TIF)

    S4 Fig. Profile Likelihood Analysis results of the 5 parameter model (k1, k5, k6, k8, k9) on the median NGT, IFG, IGT, IFG&IGT, T2DM, min and max responses.

    The red star indicates the SSR of the model fitted using the optimal parameter values estimated from data, while the blue line corresponds to the error as the other parameter values are being re-estimated after adjusting the parameter value iteratively. The dashed lines indicate confidence intervals where the degrees of freedom equals one (lower), and the number of parameters (upper), respectively.

    (TIF)

    S5 Fig. The distribution of parameters k1, k5, k6, k8 by subgroup (based on ADA diabetes criteria) with the outliers highlighted.

    The boxes represent the 25th and 75th percentiles, the whiskers represent the min, and max values, and the horizontal line represents the median.

    (TIFF)

    S6 Fig. Pooled residuals in the personalized models per time-point per metabolite, colored by the ADA prediabetes and diabetes diagnosis criteria.

    (TIFF)

    S7 Fig. Pairwise scatter plots and density plots of the personalized model parameters, the Insulinogenic index and the Matsuda index from the DIOGenes study colored by the ADA prediabetes and diabetes diagnosis criteria.

    (TIF)

    S8 Fig. Personalized models colored by the Insulinogenic index in the parameter space of the model after reduction to 2d via principal component analysis.

    The personalized model simulations of five participants with varying first-phase insulin secretion are highlighted (individuals 51, 45, 347, 430, 693). Orange and blue lines correspond to plasma glucose and insulin model simulation, while crosses represent measured data.

    (TIFF)

    S9 Fig. Personalized models colored by SSR in the parameter space of the model after reduction to 2d via principal component analysis.

    The personalized model simulations of five participants with varying first-phase insulin secretion are highlighted (individuals 175, 738, 556, 676, 445). Orange and blue lines correspond to plasma glucose and insulin model simulation, while crosses represent measured data.

    (TIFF)

    S1 Appendix. E-DES model structure, fluxes, inputs, parameters and constants.

    (PDF)

    S2 Appendix. MATLAB implementation of the model used in the manuscript.

    (ZIP)

    S1 Table. Criteria for prediabetes and diabetes classification used in this study.

    Based on the standard two hour OGTT by the American Diabetes Association.

    (PDF)

    S2 Table. Results of step two of the model selection approach.

    Sum of squared residuals (SSR) and Akaike Information Criterion (AIC) of the ten best performing candidate models.

    (PDF)

    Attachment

    Submitted filename: Reproducible_report_PCOMPBIOL_D_20_01381.pdf

    Attachment

    Submitted filename: PCB Nove 2020 Response.docx

    Attachment

    Submitted filename: Response_to_Reviewers.pdf

    Attachment

    Submitted filename: Reproducible_report_PCOMPBIOL_D_20_01381R1 (1).pdf

    Data Availability Statement

    The data supporting the findings in this study are available to eligible researchers from Gabby Hul [g.hul@maastrichtuniversity.nl]. The data are not publicly available due to ethical restrictions and privacy of participant data.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES