Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Feb 1.
Published in final edited form as: Med Decis Making. 2024 Dec 25;45(2):127–142. doi: 10.1177/0272989X241305414

Microsimulation Estimates of Decision Uncertainty and Value of Information Are Biased but Consistent

Jeremy D Goldhaber-Fiebert 1,2, Hawre Jalal 3, Fernando Alarid-Escudero 4,5
PMCID: PMC12185039  NIHMSID: NIHMS2087991  PMID: 39720850

Abstract

Purpose.

Individual-level state-transition microsimulations (iSTMs) have proliferated for economic evaluations in place of cohort state transition models (cSTMs). Probabilistic economic evaluations quantify decision uncertainty and value of information (VOI). Previous studies show that iSTMs provide unbiased estimates of expected incremental net monetary benefits (EINMB), but statistical properties of iSTM-produced estimates of decision uncertainty and VOI remain uncharacterized.

Methods.

We compare iSTM-produced estimates of decision uncertainty and VOI to corresponding cSTMs. For a 2-alternative decision and normally distributed incremental costs and benefits, we derive analytical expressions for the probability of being cost-effective and the expected value of perfect information (EVPI) for cSTMs and iSTMs, accounting for correlations in incremental outcomes at the population and individual levels. We use numerical simulations to illustrate our findings and explore the impact of relaxing normality assumptions or having >2 decision alternatives.

Results.

iSTM estimates of decision uncertainty and VOI are biased but asymptotically consistent (i.e., bias approaches 0 as number of microsimulated individuals approaches infinity). Decision uncertainty depends on 1 tail of the INMB distribution (e.g., P[INMB <0]), which depends on estimated variance (larger with iSTMs given first-order noise). While iSTMs overestimate EVPI, their direction of bias for the probability of being cost-effective is ambiguous. Bias is larger when uncertainties in incremental costs and effects are negatively correlated since this increases INMB variance.

Conclusions.

iSTMs are useful for probabilistic economic evaluations. While more samples at the population uncertainty level are interchangeable with more microsimulations for estimating EINMB, minimizing iSTM bias in estimating decision uncertainty and VOI depends on sufficient microsimulations. Analysts should account for this when allocating their computational budgets and, at minimum, characterize such bias in their reported results.

Keywords: bias, consistency, cost-effectiveness acceptability curve, decision uncertainty, microsimulation, Monte Carlo, probabilistic analysis, sampling, value of information


Model-based economic evaluations are increasingly used to inform decisions about which interventions to implement and whether there is value in conducting potential future research. The use of probabilistic analyses within economic evaluations has gained increasing traction and is now the recommended approach for base-case analyses.13 Probabilistic analyses permit unbiased estimates of expected outcomes for nonlinear models and quantification of decision uncertainty stemming from uncertainty in population-level model inputs (i.e., secondorder parameter uncertainty, the uncertainty in quantities such as probabilities estimated using finite, empirical data).4 Because uncertainty in model parameters can be correlated, prior work has developed approaches for estimating, sampling from, and propagating the joint uncertainty distribution of model parameters.411

While cohort state-transition models (cSTMs) (i.e., mean-process models) were the standard modeling technique in the past, individual-level state-transition microsimulations (iSTMs) are increasingly used for economic evaluations.12,13 For any cSTM, a corresponding and equivalent iSTM that models the same underlying processes can be formulated.12 The expected outcomes produced by simulations using an iSTM are estimates of the corresponding mean-process outcomes obtained from simulations with the equivalent cSTM.14

Several trends in medicine and public health have driven the increased use of iSTMs. These include 1) the proliferation of high-performance computing,10,12 2) the focus on sophisticated and personalized interventions coupled with the rise of sophisticated risk prediction models that depend on a complex set of dynamic individual characteristics,15,16 and 3) the move to proactively address health disparities, with health equity defined across multiple dimensions (e.g., race/ethnicity, socioeconomic status, and social determinants of health).1719

When a sufficiently large number of individuals are simulated using an iSTM, the outcomes produced converge with those that would have been produced by a corresponding cSTM.12,2023 However, when smaller numbers of individuals are simulated, modeled outcomes produced by the iSTM can differ appreciably from the cSTM’s outcomes due to Monte Carlo noise (i.e., firstorder uncertainty). Prior work conceptualizes the rate of convergence and hence the necessary number of microsimulated individuals needed to estimate expected outcomes and expected incremental outcomes within a given error bound.2023

Prior work also considers probabilistic analyses using iSTMs,2123 which involves both first-order and secondorder uncertainty. Specifically, this prior work developed methods to produce estimates with as few samples as possible from the joint model parameter uncertainty distribution because each sample then involves computationally costly microsimulations of many individuals. A key finding is that for a given number of samples from the joint model parameter uncertainty distribution, estimates of outcomes made with an iSTM will be unbiased but that their uncertainty will be systematically overestimated because they also contain Monte Carlo noise.

While the prior work cited here has focused on making estimates of expected outcomes and expected incremental outcomes providing guidance on the number of samples to take from the joint uncertainty distribution and number of individuals to microsimulate, it does not provide guidance for other quantities of interest related to decision uncertainty and the value of information. The present study provides analytic results and numerical simulations to characterize the precision and bias of estimating such quantities using microsimulation models.

Theoretical Model

We explore the precision and bias of estimates of decision uncertainty and the value of information using a simple decision model with 2 alternatives (treatment v. no treatment). We make these estimates using an iSTM with various numbers of sampled individuals (nmicro) and compare them to estimates produced with a cSTM (i.e., an iSTM where nmicro=).

Cohort State-Transition Markov Model

Let μ=μΔQ,μΔC be the hyperparameters determining the incremental quality-adjusted life-years (QALYs) and costs of the decision model with σΔQ,σΔC reflecting the current level of second-order uncertainty regarding these values. A realization of our parameters would then be a sample from our second-order uncertainty distribution

θΔQ~NμΔQ,σΔQθΔC~NμΔC,σΔCΘ=θΔQ,θΔC (1)

We first specify the cSTM, which is a function of our parameters (f(Θ)).

At the population level, we assume that the uncertainty in the incremental discounted lifetime QALYs, ΔQpsa, and incremental discounted lifetime costs, ΔCpsa, with treatment compared with with no treatment are characterized by

ΔQpsa,ΔCpsa~fNμΔQ,σΔQ,NμΔC,σΔC (2)

While the random variables ΔQpsa and ΔCpsa that we will consider below need not be normally distributed, we make this assumption because doing so will permit the analytic expressions we derive later to be more concrete. Furthermore, for simplicity, we assume that f(Θ) is the identity map, which implies that the second-order uncertainty in our model inputs is equal to the second-order uncertainty of our model output (i.e., σΔQ2,σΔC2, describing the variance of both θΔQ,θΔC and ΔQpsa,ΔCpsa.

More generally, as the uncertainty of incremental QALYs and incremental costs may be correlated, we can rewrite their joint uncertainty as

ΔQpsa,ΔCpsa~fMVNμΔQ,μΔC,Σpsa, (3)

where Σpsa is the variance-covariance matrix, where the diagonal consists of σΔQ2 and σΔC2, and the off-diagonal elements are defined as ρpsaσΔQσΔC, with correlation in the population uncertainty of incremental QALYs and incremental costs equal to ρpsa (i.e., the second-order correlation).

For the cSTM, a given draw from the population distribution of incremental QALYs and incremental costs (i.e., Probabilistic Sensitivity Analysis [PSA] sample) can be expressed in terms of incremental net monetary benefit:

INMBpsa=λΔQpsa-ΔCpsa (4)

where λ is the willingness to pay for QALYs gained. The variance of the incremental net monetary benefit across PSA samples is then

VarINMBpsa=VarλΔQpsaΔCpsaλ2VarΔQpsa+VarΔCpsa2λCovΔQpsa,ΔCpsa=λσΔQ2+σΔC22λρpsaσΔQσΔC (5)

Note that equation (5), the variance of INMBpsa, does not rely on the normality assumptions above.

For the cSTM, the expected incremental net monetary benefit of treatment for a given finite number of PSA samples (npsa) can be estimated as

EINMBpsa=EΘINMBpsaEINMBpsa¯=i=1npsaINMBpsa,inpsa (6)

As the number of PSA samples is finite npsa, there is uncertainty in this unbiased estimate of EINMBpsa, given by

VarEINMBpsa¯=VarINMBpsanpsa1=λσΔQ2+σΔC22λρpsaσΔQσΔCnpsa1=λ2seΔQ2+seΔC22λρpsaseΔQseΔC, (7)

where seΔQ and seΔC are standard errors for the estimated incremental QALYs and costs, respectively. The variance of EINMBpsa¯ in equation (7) does not rely on the normality assumptions above.

In terms of decision uncertainty, that is, the probability that a given strategy is the most cost-effective (often represented as a cost-effectiveness acceptability curve [CEAC]), the probability that treatment is the optimal strategy for the cSTM is

CEACpsa=PINMBpsa>0=1Φ0EINMBpsaVarINMBpsa=1Φ0EINMBpsaλσΔQ2+σΔC22λρpsaσΔQσΔC (8)

where Φ(.) is the cumulative density function of the standard normal distribution.

The expected value of perfect information (EVPI) for the cSTM is defined as

EVPIpsa=Emax0,-INMBpsaifEINMBpsa>0Emax0,INMBpsaifEINMBpsa<0 (9)

We require both cases because, at some values of λ, the expected incremental net monetary benefit of treatment (EINMBpsa) could be negative, implying that the preferred strategy on expectation is no treatment. This, in turn, implies that information will be valuable for PSA samples in which the incremental benefit of treatment (INMBpsa) is positive.

In the case of EINMBpsa>0, we can express the value of information as

EVPIpsa=0INMBpsa1VarINMBpsaφINMBpsaEINMBpsaVarINMBpsadINMBpsa (10)

where φ(.) is the probability density function of the standard normal distribution. This expression is a form of the unit normal loss integral function, which was first discussed by Raiffa and Schlaifer in the 1960s.2426 Likewise, in the case of EINMBpsa<0, this expression is

EVPIpsa=0INMBpsa1VarINMBpsaφINMBpsaEINMBpsaVarINMBpsadINMBpsa (11)

Individual-Level Microsimulation Model

If an iSTM equivalent to the above-described cSTM is used to estimate the quantities described for the cSTM using a given number of microsimulated individuals (nmicro) for each PSA sample, we arrive at the corresponding set of equations.

For any PSA sample from the population distribution, the distribution of incremental QALYs with treatment compared to with no treatment that is estimated from an iSTM defined in terms of ΔQpsa, the population incremental QALYs with treatment sampled from the second-order uncertainty distribution, and σΔQ,micro, the square root of the variance of incremental QALYs with treatment across individuals in the microsimulation, can be written as

ΔQmicro~NΔQpsa,seΔQ,microwhereseΔQ,micro=σΔQ,micronmicro1 (12)

For our simple example, we assume that the first-order noise from the microsimulations is homoscedastic with respect to the level of incremental QALYs with treatment, that is, that the variance across microsimulated individuals in terms of incremental QALYs from treatment (σΔQ,micro) is independent of ΔQpsa. Hence, equation (12) can then be expressed equivalently as

ΔQmicro=ΔQpsa+εΔQ,microwhereεΔQ,micro~N0,seΔQ,micro (13)

In other words, an estimate of the population incremental QALYs generated from an iSTM will have the same expectation as the value produced by the cSTM with the addition of zero-centered noise due to the finite number of simulated individuals in the iSTM.

Likewise, for any PSA sample from the population distribution, the distribution of incremental costs with treatment compared to with no treatment estimated from an iSTM is

ΔCmicro~NΔCpsa,seΔC,microwhereseΔC,micro=σΔC,micronmicro1 (14)

For equation (14), the assumptions are parallel to those for equation (12) regarding incremental QALYs, which we then express as

ΔCmicro=ΔCpsa+εΔC,microwhereεΔC,micro~N0,seΔC,micro (15)

Since the uncertainty of incremental QALYs and incremental costs may be correlated within an individual in the microsimulation (i.e., some individuals could gain more than average number of QALYs with treatment and also have higher than average incremental costs with treatment in the case of positive within-individual correlation), we can rewrite the joint distribution as

ΔQmicro,ΔCmicro~MVNΔQpsa,ΔCpsa,Σmicro (16)

where Σmicro is the variance-covariance matrix with diagonal elements defined by seΔQ,micro2 and seΔC,micro2 and off-diagonal elements defined as ρmicroseΔQ,microseΔC,micro, with correlation in the incremental QALYs and incremental costs experienced by individuals equal to ρmicro.

For the iSTM, given a draw from the population distribution of incremental QALYs and incremental costs (i.e., PSA sample), the microsimulated estimate of the incremental net monetary benefit can be expressed as

INMBmicro=λΔQmicro-ΔCmicroINMBmicro=λΔQpsa+εΔQ,micro-ΔCpsa+εΔC,micro (17)

where λ is the willingness to pay for QALYs gained. The variance of the microsimulated incremental net monetary benefit across PSA samples is

Var(INMBmicro=VarλΔQmicro-ΔCmicro=VarλΔQpsa+εΔQ,micro-ΔCpsa+εΔC,micro=λ2VarΔQpsa+λ2VarεΔQ,micro+VarΔCpsa+VarεΔC,micro-2λCovΔQpsa,ΔCpsa-2λCovεΔQ,micro,εΔC,micro=λσΔQ2+σΔC2+λseΔQ,micro2+seΔC,micro2-2λρpsaσΔQσΔC-2λρmicroseΔQ,microseΔC,micro=VarINMBpsa+λseΔQ,micro2+seΔC,micro2-2λρmicroseΔQ,microseΔC,micro (18)

This expression for the variance of INMBmicro does not rely on the normality assumptions above. Note that for simplicity, in the example we are considering here, we assume that the covariances between the microsimulation noise terms (εΔQ,micro and εΔC,micro) and the PSA sampled quantities (ΔQpsa and ΔCpsa) are 0. In principle, if this were not the case, then we would have additional terms in the variance equation of the form -2ρX,YXY, which when the correlation was less than 0 would lead to even larger variances. From equation (18), as the number of microsimulated individuals increases, that is, nmicro,VarINMBmicroVarINMBpsa.

The expected incremental net monetary benefit (EINMB) estimated with the iSTM, where expectations are taken over the parameter uncertainty (Θ), is then

EINMBmicro=EΘINMBmicro=EλΔQpsa+εΔQ,microΔCpsa+εΔC,micro=λEΔQpsa+εΔQ,microEΔCpsa+εΔC,micro=λEΔQpsa+E0EΔCpsaE0=EλΔQpsaΔCpsa=EINMBpsa=EINMBpsa (19)

Note that the final equation above implies that the expected incremental net monetary benefit estimated with the iSTM is equal to the expected incremental net monetary benefit estimated with the cSTM.

The uncertainty of the expected incremental net monetary benefit estimated with the microsimulation over the PSA samples is

VarEINMBmicro=VarINMBmicronpsa1=λσΔQ2+σΔC2+λseΔQ,micro2+seΔC,micro22λρpsaσΔQσΔC2λρmicroseΔQ,microseΔC,micronpsa1=λ2seΔQ2+seΔC22λρpsaseΔQseΔC+λ2seΔQ,micro2+seΔC,micro22λρmicroseΔQ,microseΔC,micronpsa1=VarEINMBpsa¯+λ2seΔQ,micro2+seΔC,micro22λρmicroseΔQ,microseΔC,micronpsa1 (20)

This expression for the variance of EINMBmicro in equation (20) does not rely on the normality assumptions above.

In terms of decision uncertainty and the value of information, the probability that treatment is the optimal strategy for the iSTM is

CEACmicro=PλΔQpsa+εΔQ,microCpsa+εΔC,micro>0=1Φ0EINMBmicroVarINMBmicro=1Φ0EINMBmicroλσΔQ2+σΔC2+λseΔQ,micro2+seΔC,micro22λρpsaσΔQσΔC2λρmicroseΔQ,microseΔC,micro (21)

The EVPI for the iSTM is defined as

EVPImicro=Emax0,INMBmicroifEINMBmicro>0Emax0,INMBmicroifEINMBmicro<0 (22)

In the case of EINMBmicro>0, we can express the value of information as

EVPImicro=0INMBmicro1VarINMBmicroφINMBmicroEINMBmicroVarINMBmicrodINMBmicro (23)

Likewise, in the case of EINMBmicro<0, this expression is then

EVPImicro=0INMBmicro1VarINMBmicroφINMBmicroEINMBmicroVarINMBmicrodINMBmicro (24)

Definition of Bias

As noted in the introduction, the expected outcomes from simulations with the iSTM provide estimates of corresponding outcomes from the mean-process represented by cSTM. We define bias as

Biasmicro(Outcomemicro)=E(OutcomemicroOutcomecSTM) (25)

In other words, the iSTM estimate of an expected outcome for the mean-process it represents is biased if it differs systematically from that produced by the meanprocess model (cSTM). In (25), since it is typical to compute outcomes of both the iSTM and cSTM with finite numbers of PSA samples, what is produced is an estimate of the bias. However, if one replicated the process many times (repeatedly generating a finite number of samples from the PSA distribution and then computing outcomes with an iSTM and cSTM), one could compute the expected bias by averaging across these replicates. It is possible, as we have noted above, that for some outcomes, the iSTM estimate may be unbiased (i.e., the expected Biasmicro=0), while for others, it may be biased. Furthermore, if for some outcome where the iSTM estimate is biased on expectation and the expected Biasmicro approaches 0 as nmicro approaches , then the iSTM will be biased but consistent.

Numerical Simulations

To illustrate the results implied by the equations above, we perform numerical simulations. The goal is to show that there exist cases in which the decision uncertainty and VOI outcomes generated by the iSTM are biased estimates of the outcomes from the corresponding cSTM and to show that the direction of bias for the decision uncertainty outcome can be either upwards or downwards.

First, we replicate outcomes from a cSTM accounting for parameter uncertainty by sampling from various distributions directly defined for the INMBpsa (normal distribution or mixture of normal distributions [that might occur due to bimodal priors or heterogeneity]) for decision problems involving a given number of strategies (2 strategies or 3 strategies), which then implies either 1 or 2 INMB distributions relative to the comparator strategy. For each of these, to replicate outcomes from a corresponding iSTM, we then simulate samples from the corresponding distributions of INMBmicro by adding normally distributed zero-mean noise to the samples of INMBpsa where the variance of this zero-mean noise decreases with more microsimulated individuals (see the online appendix for further details). With these samples, we compute the EINMB, the probability of being costeffective, and the EVPI for the cSTM and iSTM per the expressions above for these outcomes.

Next, we illustrate these results at various numbers of PSA samples and microsimulated individuals and with various correlation structures for the incremental costs and incremental QALYs at the population and/or individual levels. We perform additional numerical simulations using the parameter values defined in Table 1. As Table 1 shows, there are, in fact, 9 sets of parameter values as we consider combinations of second-order correlation of incremental QALYs and incremental costs with treatment (i.e., nonzero covariance in the joint parameter uncertainty distribution, where higher average incremental QALYs implies higher [or lower] average incremental costs) as well as how incremental QALYs and incremental costs may be correlated within microsimulated individuals (i.e., people who have higher than average incremental QALYs from treatment are also more likely to have higher [or lower] than average incremental costs with treatment). For each set of parameter values, we compute cSTM results with two sizes of PSA samples npsa{1000,4000}. For each PSA sample size, we then compute iSTM results with different numbers of microsimulated individuals (nmicro{10,40,160,640,2560}). For each PSA sample size and number of microsimulated individuals, we bootstrap uncertainty intervals for the iSTM results by repeating the microsimulations 400 times and then taking the 2.5th and 97.5th percentile of the results.

Table 1.

Parameter Values for Numerical Simulations

Parameter Description Value for Example
ΔQpsa Mean incremental QALYs with treatment for second-order uncertainty distribution 0.02
ΔCpsa Mean incremental costs with treatment for second-order uncertainty distribution 250
σΔQ Standard deviation of incremental QALYs with treatment for second-order uncertainty distribution 0.01
σΔC Standard deviation of incremental costs with treatment for second-order uncertainty distribution 20
ρpsa Correlation in the second-order uncertainty of incremental QALYs and incremental costsa {−0.9, 0, 0.9}
σΔQ,micro Standard deviation of incremental QALYs with treatment across individuals in the microsimulation 0.20
σΔC,micro Standard deviation of incremental costs with treatment across individuals in the microsimulation 400
ρmicro Correlation within individuals of incremental QALYs and incremental costsa {−0.9, 0, 0.9}
a

Correlations show multiple values as we conduct numerical simulations with all combinations of positive, negative, and no correlations in the second-order uncertainty distribution and the within-person outcomes.

To enable reproducibility, all code for the numerical simulations is included in an online repository accessible via persistent URL (https://purl.stanford.edu/sk494zp7768).

Results

Overall, in probabilistic analyses, while individual-level microsimulations produced unbiased estimates of EINMB when compared with equivalent cohort state transition Markov models, their estimates of decision uncertainty (e.g., the probability of being cost-effective at a given willingness-to-pay threshold) and EVPIs are systematically biased (but asymptotically consistent).

Figure 1A provides the underlying intuition. In the case of a normally distributed INMB in a 2-strategy decision problem, 1) the EINMB is the same for both the cSTM and iSTM, 2) there is greater variance in the INMB (and hence greater uncertainty in the estimate of the EINMB) with the iSTM due to the added Monte Carlo noise of microsimulating a finite number of individuals, 3) the iSTM estimate of the probability of not being cost-effective differs from the cSTM’s estimate, and 4) the iSTMs overestimates EVPI compared with the cSTM. The misestimates and overestimates occur because while the distributions for the cSTM and iSTM are symmetric and have equal centers, statistics concerning decision uncertainty depend on one tail of the distribution, and hence, the differentially large variance of the iSTM’s INMB distribution generates bias in its estimates of these statistics.

Figure 1.

Figure 1

Distributions of samples from the incremental net monetary benefit (INMB) distribution of treatment compared with no treatment using a cohort state transition model (cSTM; red lines) versus using an individual-level state-transition microsimulation model (iSTM; blue lines). (A) Results when the population uncertainty distribution of the INMB is normal. (B) Results when the distribution is bimodal (nonnormal). In both panels, estimates of expected INMB (EINMB) are the same for the cSTM and iSTM, but the distribution generated with the iSTM has a higher variance, which leads to bias in estimates of the probability that treatment is costeffective compared with the cSTM estimates (1 minus the shaded areas to the left of the INMB = 0 under the red and blue curves) as well as to biased estimates of expected value of perfect information (EVPI; see table inset). The direction of the bias for the probability of being cost-effective with normally distributed INMB differs from the direction of bias in the nonnormally distributed case.

Figure 1B extends the intuition for a case in which the INMB distribution is a mixture of normal distributions (i.e., not normally distributed). As with the normally distributed INMB, the EINMB for the CSTM and ISTM is equal, and the EVPI from the iSTM is overestimated. However, unlike the normally distributed case, the probability of being cost-effective is now underestimated with the iSTM. Hence, while this statistic is biased in both cases, the direction of bias is not obvious a priori.

Biases in the Case of 2 Decision Alternatives

Our numerical simulations for the iSTM estimates of the EINMB show them to be unbiased estimates of those produced by a corresponding cSTM (Figure 2A and B). This is the case regardless of whether incremental costs and incremental QALYs are correlated at the individual level or at the population level (Appendix Figure 1). One can increase the precision of an iSTM’s estimate of the EINMB (i.e., reduce its variance) by increasing both the number of PSA samples and the number of individuals microsimulated. Notably, the estimates of the EINMB produced by an iSTM are less precise (i.e., have higher variance) with greater individual-level anticorrelation between incremental QALY gains with treatment and incremental costs with treatment (Figure 2C).

Figure 2.

Figure 2

Distributions of estimates of the expected incremental net monetary benefit (EINMB) derived from a cohort state transition model (cSTM; black vertical line) versus an individual-level state-transition microsimulation model (iSTM) using different numbers of PSA samples and different numbers of microsimulated individuals per PSA sample (the colors denote the number of microsimulated individuals per sample). (A) Results at a higher willingness-to-pay threshold. (B) Results at a lower willingness-to-pay threshold. Variance in the estimate of the EINMB made with an iSTM is reduced by having larger numbers of PSA samples and by having larger numbers of microsimulated individuals per PSA sample. (C) Because the INMB involves subtracting costs from the monetary equivalent of quality-adjusted life-years (QALYs), a higher negative individual-level correlation between incremental QALYs and incremental costs leads to larger variance in the iSTM estimate of EINMB. For all panels, the distributions of EINMB are bootstrapped by replicating the entire procedure (PSA sampling and microsimulation for each PSA sample) 400 times. PSA, Probabilistic Sensitivity Analysis.

While iSTM estimates of EINMB are unbiased, their estimates of the probability of being cost-effective are biased (Figure 3A). In the case of a 2-alternative decision problem and normally distributed incremental costs and QALYs, iSTM estimates are downward biased when the cSTM estimate of the probability of being cost-effective is greater than 50% and upward biased when the cSTM estimate is less than 50%. This is due to additional first-order Monte Carlo noise leading to increased variance, increasing the proportion of PSA samples for which the INMB is on the opposite below (above) 0 when treatment is the optimal (nonoptimal) action. More generally, for decisions with larger numbers of alternatives and different distributions, patterns of bias can differ and are more complex (see below).

Figure 3.

Figure 3

Cost-effectiveness acceptability curves estimated for the probability that treatment is cost-effective at different willingness-to-pay thresholds using a cohort state transition model (cSTM; black line) versus an individual-level state-transition microsimulation model (iSTM) using different numbers of PSA samples and different numbers of microsimulated individuals per PSA sample (colored lines denote the number of microsimulated individuals per sample; panel A). Increasing the number of PSA samples does not decrease the bias of the iSTM estimates; only increasing the number of microsimulations per PSA sample does. (B) Because the incremental net monetary benefit (INMB) involves subtracting costs from the monetary equivalent of quality-adjusted life-years (QALYs), a higher negative individual-level correlation between incremental QALYs and incremental costs leads to larger expected bias in the iSTM estimates. For each PSA sample size and number of microsimulated individuals, we bootstrap uncertainty intervals (shaded colored regions around each line in A) for the iSTM results by repeating the microsimulations 400 times and then taking the 2.5th and 97.5th percentile of the results. PSA, Probabilistic Sensitivity Analysis.

While the iSTM estimates of the probability of being cost-effective are biased, they are asymptotically consistent, in that as the number of microsimulated individuals is increased, the size of the bias is expected to decrease to 0. Importantly, the bias is not decreased by having more PSA samples because those represent second-order uncertainty. In addition, bias is larger when the within-person incremental QALYs with treatment and within-person incremental costs with treatment are more negatively correlated (Figure 3B). Appendix Figure 2 shows that there are some differences in the bias in the probability of being cost-effective due to correlation at the individual and population level in incremental QALYs and incremental costs, but across all cases, the number of microsimulated individuals exerts an important influence on the size of this bias.

The iSTM estimates of EVPI are also biased (Figure 4). In our example, the direction of the bias is upward. The estimates, while biased, are asymptotically consistent, in that as more individuals are microsimulated, the bias tends to 0. Since information has value when it allows one to shift from a suboptimal decision to an optimal decision and since with iSTMs there is a larger fraction of PSA samples for which INMB estimates fall on the opposite side of 0 from the EINMB, a greater proportion of PSA samples will appear to have a positive value of information. Furthermore, since the variance of the INMB estimates with the iSTM is greater, there will be a higher proportion of samples with larger magnitude values of information. As can be seen in Appendix Figure 3, especially at lower WTP thresholds (e.g., below ~ $12,500/QALY gained in the example), the bias in the EVPI is larger when the within-person incremental QALYs with treatment and within-person incremental costs with treatment are more negatively correlated. Appendix Figure 3 shows that there are some differences in the EVPI bias due to correlation at the individual and population level in incremental QALYs and incremental costs, but across all cases, the number of microsimulated individuals exerts an important influence on the size of this bias.

Figure 4.

Figure 4

The figure shows the expected value of perfect information at different willingness-to-pay thresholds using a cohort state transition model (cSTM; black line) versus an individual-level state-transition microsimulation model (iSTM) using different numbers of PSA samples and different numbers of microsimulated individuals per PSA sample (colored lines denote number of microsimulated individual per sample). Increasing the number of PSA samples does not decrease the bias of the iSTM estimates; only increasing the number of microsimulations per PSA sample does. For each PSA sample size and number of microsimulated individuals, we bootstrap uncertainty intervals (shaded colored regions around each line) for the iSTM results by repeating the microsimulations 400 times and then taking the 2.5th and 97.5th percentile of the results. PSA, Probabilistic Sensitivity Analysis.

Bias in the Case of 3+ Decision Alternatives

We examine the case of a decision problem with 3 alternatives where the INMBs for 2 of the alternatives (strategies 1 and 2) are defined with respect to the status quo (strategy 0). To illustrate some of the potentially complex patterns of bias in the probability of cost-effectiveness and EVPI, we simulate many cases of this decision problem that differ in terms of the means and variances of the INMB distributions (normal) defined for the alternatives and the amount of 0-mean Monte Carlo noise that exists for each alternative when using an iSTM to simulate it. In all cases, strategy 1 has the highest EINMB and is therefore the preferred option with current information. Strategy 2 has a lower EINMB, which can either be positive (better than the status quo strategy 0) or negative (worse than the status quo). In total, we simulate 5,760 different combinations of EINMBs and amount of Monte Carlo noise in the INMB distributions due to microsimulation (Appendix Table 1).

Figure 5 shows patterns of bias for 3 illustrative cases. Given the normality assumption for the INMB distributions, the probability that strategy 1 (the preferred strategy on expectation) is optimal computed with the iSTM is downward biased in these cases. Because of this, the probability of another alternative being optimal is overestimated for at least 1 of the 2 other alternatives. However, the cases show that sometimes the probabilities for both strategies 0 and 2 can be overestimated, but at other times, for one alternative, the probability is overestimated, and for the other, the probability is underestimated. For these cases, just as with the 2-alternative decision problems considered above, the bias in EVPI estimated with the iSTM is ≥0 (upward biased; see Appendix Figures 46 for full details of the patterns of bias for the probabilities of being cost-effective and the EVPI for the 5,760 combinations we simulated; the appendix figures show that the patterns in the illustrative examples are present in many of the combinations we simulated).

Figure 5.

Figure 5

Complex patterns of bias that can emerge when using an individual-level state-transition microsimulation model (iSTM) to assess decision uncertainty for decision problems involving 3 alternatives. The figure shows summary results for 3 example problems. All 3 examples have expected incremental net monetary benefit (EINMB) for strategy 1 (the strategy preferred on expectation) of 0.2 with the EINMB for strategy 0 (status quo) and strategy 1 being 0 (a toss-up for second best). The 3 examples differ in terms of how much stochastic (Monte Carlo) noise there is for the iSTM’s INMB estimates for strategy 1 (from least to most noise going down the panels of the figure) while holding the noise for the strategies fixed and equal to or less than for strategy 1. The probability of being cost-effective is underestimated for strategy 1 in all 3 examples; however, strategy 2’s probability of being cost-effective can be either overestimated or underestimated such that, in some examples, the probabilities of being cost-effective for both strategy 0 and strategy 2 are overestimated whereas in other cases only 1 of them is overestimated (see the appendix for results for the 5,760 variants of the 3-alterantive decision problem that we explored numerically, which further illustrate the complexity of patterns of bias that can arise).

Discussion

Economic evaluation in health and medicine is witnessing 3 converging trends. The first is a shift from cohort state transition models (i.e., Markov cohort models, cSTMs) to individual-level state transition models (i.e., microsimulation or iSTMs) to enable modelers to reflect more complex and heterogeneous diseases and intervention strategies. The second is the use of probabilistic analyses to appropriately estimate the expectations of outcomes simulated using these decision models and to propagate population-level parameter uncertainty in these expectations. The third is a growing interest in gauging current levels of decision uncertainty and the value of collecting additional information to resolve that uncertainty.

Given these trends, our study considers the potential for bias introduced into estimates of decision uncertainty and value of information through iSTMs whose outputs also necessarily contain first-order Monte Carlo noise. We confirm both analytically and through a range of numerical simulations for iSTMs that while their estimates of the expected incremental value of decision alternatives are unbiased with respect to corresponding cSTMs, their estimates of decision uncertainty and the value of information are biased but asymptotically consistent.

A key implication of the study findings involves how analysts conducting economic evaluations with iSTMs allocate their computational budgets.2123 For computing EINMB, the number of parameter sets sampled from the parameter uncertainty distributions (i.e., outer loops) and the number of microsimulated individuals simulated for each parameter set sampled (i.e., inner loops) are interchangeable. However, for decision uncertainty and value of information, taking more samples from the parameter uncertainty distribution does not reduce the bias in the estimates produced by the iSTM; only simulating larger numbers of microsimulated individuals will reduce this bias. Hence, the appropriate balance between inner and outer loop samples depends on the quantities that the analyst is interested in estimating and the precision required for each of them requiring tradeoffs in their accuracy given the computation budget.

Given the study findings, one area of future research is developing efficient diagnostics to gauge the magnitude of the bias for given combinations of inner and outer loop samples, which can be used to suggest efficient computational budget allocations. For example, while more samples are better if one ignores computational budgets, how many samples would be sufficient to make the iSTM bias suitably small depends on the model used and the decision being considered. A model focused on comparing 2 highly effective strategies for preventing a relatively rare disease outcome (e.g., 2 aggressive screening strategies for a relatively rare cancer) may have to simulate many more individuals to form stable estimates of incremental net benefits of one strategy versus the other than if the outcome were common or the strategies being compared were substantially different in their costs and the health benefits they produced. Approaches might use a fraction of the overall computational budget to perform a probabilistic analysis with a given number of outer (e.g., 1,000) and inner loops (e.g., 400) and then act as if all of these samples were not available to consider how the model estimates change if there were 1,000 outer and 100 inner samples or 1,000 outer and 200 inner samples. One could then use a form of convergence analysis to examine how and at what rate the estimates changed as a function of the number of inner loop samples. In principle, one could adapt value-of-information approaches concerning optimal sample size and study design to this situation as well.27

While our study has established that there exist cases in which iSTMs generate biased but asymptotically consistent estimates of decision uncertainty and value-of-information quantities with respect to equivalent cSTMs, the characterization of the situations in which these biases are most pronounced and most consequential will require additional work. One important extension is greater consideration of various distributions of incremental net monetary benefit as well as the distribution of the incremental cost and incremental health benefits and their correlations within and between the population and individual levels. While the present analysis explored a wide range of correlations, empirical characterization of these correlations from data and from outputs of simulation models used in practice would be informative. Another important extension is considering these biases as the number of interventions increases. While the analyses visualized in Figure 5 demonstrate that biases exist for decision problems involving 3 alternatives like they do in the case of 2-decision alternatives, the complexity in the potential patterns of bias appears to grow with the number of alternatives. A third extension would be to consider the biases with iSTMs for estimating other measures of value of information such as partial perfect information and the expected net benefit of sampling and optimal sample size. A fourth area would be to gauge the potential size of biases in the existing applied literature by examining the estimates of these quantities from published model-based analyses using iSTMs and reimplementing the equivalent cSTMs to calculate the biases and perhaps re-performing the iSTM analyses with different inner and outer loop sample sizes. In summary, there are numerous opportunities for promising future work in this area.

Our study is careful to define bias as a systematic deviation in estimating decision-analytic outcomes of the mean process (cSTM) using a corresponding iSTM. The mean-process outcomes are those relevant to a risk-neutral decision maker and in situations in which the actual population for whom the decision is relevant is large. In this case, the additional noise from microsimulation is essentially a nuisance that should be reduced and removed to the extent possible.2830 However, in the case of risk-averse decision makers in situations where the population for whom the decision is relevant is small (i.e., so that the number of microsimulated individuals might be equal to the number of actual people in that population itself), the noise from the iSTM may be relevant. For example, if one were deciding between 2 alternative actions for an institution housing 100 people, the fraction of time that 1 alternative action will produce the highest INMB should include Monte Carlo noise, although such an estimate systematically differs from what one would have estimated for a very large institution using a cSTM. In other words, in these situations, the iSTM is not introducing bias but rather telling us something very important that we may well want to know in addition to what meanprocess estimates might tell us.

In conclusion, despite being unbiased in identifying the best strategy on expectation, iSTMs introduce bias in estimates of decision uncertainty and value of information. Fortunately, as the bias is asymptotically consistent, the size of the bias can be reduced by using sufficiently large numbers of microsimulated individuals per probabilistic sample. Analysts using iSTMs for economic evaluations that include probabilistic analyses should explicitly demonstrate that the number of microsimulated individuals per probabilistic sample is sufficiently large to make the bias in these estimates small.

Supplementary Material

Supplementary Material

Supplementary material for this article is available online at https://doi.org/10.1177/0272989X241305414.

Highlights.

  • Individual-level state-transition microsimulation models (iSTMs) produce biased but consistent estimates of the probability that interventions are cost-effective.

  • iSTMs also produce biased but consistent estimates of the expected value of perfect information.

  • The biases in these decision uncertainty and value-of-information measures are not reduced by more parameter sets being sampled from their population-level uncertainty distribution but rather by more individuals being microsimulated for each parameter set sampled.

  • Analysts using iSTMs to quantify decision uncertainty and value of information should account for these biases when allocating their computational budgets and, at minimum, characterize such bias in their reported results.

Acknowledgments

An earlier version of this work was presented at the Society for Medical Decision Making’s 2024 Annual North American Meeting

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Dr. Alarid Escudero is supported by grant U01CA253913 from the National Cancer Institute (NCI) as part of the Cancer Intervention and Surveillance Modeling Network (CISNET), Dr. Jalal is supported by a Canada Research Chair, and Drs. Alarid Escudero and Jalal are supported by grant U01CA265750 from NCI as part of CISNET. The funders had no role in designing the study, interpreting the data, writing, or publishing the report.

Footnotes

Ethical Considerations

As this study does not involve human subjects, it is deemed exempt from institutional review board review. There were no patients or participants involved, and hence, no consent was obtained. No patient or participant materials are involved, and hence, no permission for publication is required.

Contributor Information

Jeremy D. Goldhaber-Fiebert, Department of Health Policy, Stanford School of Medicine, Stanford, CA, USA; Center for Health Policy, Freeman Spogli Institute, Stanford University, Stanford, CA, USA;

Hawre Jalal, School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada..

Fernando Alarid-Escudero, Department of Health Policy, Stanford School of Medicine, Stanford, CA, USA;; Center for Health Policy, Freeman Spogli Institute, Stanford University, Stanford, CA, USA;

Data Availability

All code and data used to carry out the analysis have been deposited in a public archive to facilitate transparency and replicability (https://purl.stanford.edu/sk494zp7768).

References

  • 1.Neumann PJ, Sanders GD, Russell LB, Siegel JE, Ganiats TG, eds. Cost-Effectiveness in Health and Medicine. 2nd ed. New York (NY): Oxford Academic; 2016. DOI: 10.1093/acprof:oso/9780190492939.001.0001 [DOI] [Google Scholar]
  • 2.National Institute for Health and Care Excellence. NICE Health Technology Evaluations: The Manual (PMG36). Available from: https://www.nice.org.uk/process/pmg36/resources/nice-health-technology-evaluations-the-manual-pdf-72286779244741 [Accessed 2 September, 2024].
  • 3.Canadian Agency for Drugs and Technologies in Health. Guidelines for the economic evaluation of health technologies: Canada—4th edition. Available from: https://www.cda-amc.ca/guidelines-economic-evaluation-health-technologies-canada-4th-edition [Accessed 2 September, 2024]
  • 4.Briggs AH, Weinstein MC, Fenwick EA, et al. Model parameter estimation and uncertainty analysis: a report of the ISPOR-SMDM Modeling Good Research Practices Task Force Working Group-6. Med Decis Making. 2012;32(5):722–32. [DOI] [PubMed] [Google Scholar]
  • 5.Goldhaber-Fiebert JD, Stout NK, Ortendahl J, Kuntz KM, Goldie SJ, Salomon JA. Modeling human papilloma-virus and cervical cancer in the United States for analyses of screening and vaccination. Popul Health Metr. 2007;5: 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Goldhaber-Fiebert JD, Brandeau ML. Modeling and calibration for exposure to time-varying, modifiable risk factors: the example of smoking behavior in India. Med Decis Making. 2015;35(2):196–210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Goldhaber-Fiebert JD, Jalal HJ. Some health states are better than others: using health state rank order to improve probabilistic analyses. Med Decis Making. 2016;36(8):927–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Alarid-Escudero F, MacLehose RF, Peralta Y, Kuntz KM, Enns EA. Nonidentifiability in model calibration and implications for medical decision making. Med Decis Making. 2018;38(7):810–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Jalal H, Trikalinos TA, Alarid-Escudero F. BayCANN: streamlining Bayesian calibration with artificial neural network metamodeling. Front Physiol. 2021;12:662314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Alarid-Escudero F, Knudsen AB, Ozik J, Collier N, Kuntz KM. Characterization and valuation of the uncertainty of calibrated parameters in microsimulation decision models. Front Physiol. 2022;13:780917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chrysanthopoulou SA, Rutter CM, Gatsonis CA. Bayesian versus empirical calibration of microsimulation models: a comparative analysis. Med Decis Making. 2021;41(6):714–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Krijkamp EM, Alarid-Escudero F, Enns EA, Jalal HJ, Hunink MGM, Pechlivanoglou P. Microsimulation modeling for health decision sciences using R: a tutorial. Med Decis Making. 2018;38(3):400–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mertens E, Genbrugge E, Ocira J, Peñalvo JL. Microsimulation modeling in food policy: a scoping review of methodological aspects. Adv Nutr. 2022;13(2):621–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Iskandar R Adding noise to Markov cohort state-transition model in decision modeling and cost-effectiveness analysis. Stat Med. 2020;39(10):1529–40. [DOI] [PubMed] [Google Scholar]
  • 15.Basu A, Meltzer D. Value of information on preference heterogeneity and individualized care. Med Decis Making. 2007;27(2):112–27. [DOI] [PubMed] [Google Scholar]
  • 16.Kasztura M, Richard A, Bempong NE, Loncar D, Flahault A. Cost-effectiveness of precision medicine: a scoping review. Int J Public Health. 2019;64(9):1261–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Pitt AL, Goldhaber-Fiebert JD, Brandeau ML. Public health interventions with harms and benefits: a graphical framework for evaluating tradeoffs. Med Decis Making. 2020;40(8):978–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Podolsky MI, Present I, Neumann PJ, Kim DD. A systematic review of economic evaluations of COVID-19 interventions: considerations of non-health impacts and distributional issues. Value Health. 2022;25(8):1298–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Avanceña ALV, Prosser LA. Examining equity effects of health interventions in cost-effectiveness analysis: a systematic review. Value Health. 2021;24(1):136–43. [DOI] [PubMed] [Google Scholar]
  • 20.Kuntz KM, Weinstein MC. Modelling in economic evaluation. In: Drummond M, McGuire A, eds. Economic Evaluation in Health Care: Merging Theory with Practice. New York (NY): Oxford Academic; 2023. p 141–71. DOI: 10.1093/oso/9780192631770.003.0007 [DOI] [Google Scholar]
  • 21.O’Hagan A, Stevenson M, Madan J. Monte Carlo probabilistic sensitivity analysis for patient level simulation models: efficient estimation of mean and variance using ANOVA. Health Econ. 2007;16(10):1009–23. [DOI] [PubMed] [Google Scholar]
  • 22.Hatswell AJ, Bullement A, Briggs A, Paulden M, Stevenson MD. Probabilistic sensitivity analysis in cost-effectiveness models: determining model convergence in cohort models. Pharmacoeconomics. 2018;36(12):1421–6. [DOI] [PubMed] [Google Scholar]
  • 23.Yaesoubi R How many Monte Carlo samples are needed for probabilistic cost-effectiveness analyses? Value Health. 2024;27(11):1553–63. [DOI] [PubMed] [Google Scholar]
  • 24.Lee TY, Gustafson P, Sadatsafavi M. Closed-form solution of the unit normal loss integral in 2 dimensions, with applications in value-of-information analysis. Med Decis Making. 2023;43(5):621–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schlaifer RO. Probability and Statistics for Business Decisions: An Introduction to Managerial Economics under Uncertainty. New York (NY): McGraw-Hill; 1959. [Google Scholar]
  • 26.Jalal H, Goldhaber-Fiebert JD, Kuntz KM. Computing expected value of information from probabilistic sensitivity analysis using linear regression metamodeling. Med Decis Making. 2015;35(5):584–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Conti S, Claxton K. Dimensions of design space: a decision-theoretic approach to optimal research design. Med Decis Making. 2009;29(6):643–60. [DOI] [PubMed] [Google Scholar]
  • 28.Reitsma MB, Salomon JA, Goldhaber-Fiebert JD. Common random numbers for stochastic network-based transmission dynamic models (Abstract: #23940 [QMTD]). Presented to the 2023 Society of Medical Decision Making Annual Meeting; October 22–25, 2023, Philadelphia, PA. [Google Scholar]
  • 29.Li Z, Knowlton G, Wheatley MM, Jenness SM, Enns EA. Linear regression metamodeling for variance reduction in probabilistic sensitivity analysis (Abstract: #23897 [QMTD]). Presented to the 2023 Society of Medical Decision Making Annual Meeting; October 22–25, 2023, Philadelphia, PA. [Google Scholar]
  • 30.Stout NK, Goldie SJ. Keeping the noise down: common random numbers for disease simulation modeling. Health Care Manag Sci. 2008;11(4):399–406. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material

Data Availability Statement

All code and data used to carry out the analysis have been deposited in a public archive to facilitate transparency and replicability (https://purl.stanford.edu/sk494zp7768).

RESOURCES