Skip to main content
Sage Choice logoLink to Sage Choice
. 2025 May 8;45(5):569–586. doi: 10.1177/0272989X251333398

Evaluating Semi-Markov Processes and Other Epidemiological Time-to-Event Models by Computing Disease Sojourn Density as Partial Differential Equations

Joachim Worthington 1,, Eleonora Feletto 2, Emily He 3, Stephen Wade 4, Barbara de Graaff 5, Anh Le Tuan Nguyen 6,7, Jacob George 8, Karen Canfell 9, Michael Caruana 10
PMCID: PMC12166149  PMID: 40340615

Abstract

Introduction

Epidemiological models benefit from incorporating detailed time-to-event data to understand how disease risk evolves. For example, decompensation risk in liver cirrhosis depends on sojourn time spent with cirrhosis. Semi-Markov and related models capture these details by modeling time-to-event distributions based on published survival data. However, implementations of semi-Markov processes rely on Monte Carlo sampling methods, which increase computational requirements and introduce stochastic variability. Explicitly calculating the evolving transition likelihood can avoid these issues and provide fast, reliable estimates.

Methods

We present the sojourn time density framework for computing semi-Markov and related models by calculating the evolving sojourn time probability density as a system of partial differential equations. The framework is parametrized by commonly used hazard and models the distribution of current disease state and sojourn time. We describe the mathematical background, a numerical method for computation, and an example model of liver disease.

Results

Models developed with the sojourn time density framework can directly incorporate time-to-event data and serial events in a deterministic system. This increases the level of potential model detail over Markov-type models, improves parameter identifiability, and reduces computational burden and stochastic uncertainty compared with Monte Carlo methods. The example model of liver disease was able to accurately reproduce targets without extensive calibration or fitting and required minimal computational burden.

Conclusions

Explicitly modeling sojourn time distribution allows us to represent semi-Markov systems using detailed survival data from epidemiological studies without requiring sampling, avoiding the need for calibration, reducing computational time, and allowing for more robust probabilistic sensitivity analyses.

Highlights

  • Time-inhomogeneous semi-Markov models and other time-to-event–based modeling approaches can capture risks that evolve over time spent with a disease.

  • We describe an approach to computing these models that represents them as partial differential equations representing the evolution of the sojourn time probability density.

  • This sojourn time density framework incorporates complex data sources on competing risks and serial events while minimizing computational complexity.

Keywords: cost-effectiveness analysis, Markov models, probabilistic sensitivity analysis, simulation methods, survival analysis, semi-markov models, time-to-event modelling


Graphical Abstract.

Graphical Abstract

This is a visual representation of the abstract.


Health economic modeling evaluates the impact and cost-effectiveness of interventions by synthesizing and extrapolating data on health and cost outcomes. Almost all health economic models are state-transition models, 1 capturing a set of discrete health states an individual can be in and the likelihood of transitioning between these states. These models can simulate health care scenarios to predict outcomes, aiding informed decision making in resource allocation. There are various mathematical and computational approaches to state-transition modeling, each with their own strengths and limitations. 2

Many health economic analyses use Markov models, which are easy to comprehend, implement, and compute, and they often have high parameter identifiability (i.e., transition rates can be supported by the available data) due to the clear delineation of health states. However, Markov models have notable disadvantages. The Markov property restricts the likelihood of a state transition to be either constant (time homogeneous) or dependent on only the current model time (time inhomogenous); they cannot depend on the time spent in the current state (the sojourn time). Consequently, shorter sojourn times for any particular state transition at a given time are more likely than any longer times. 3 While this may be addressed with the addition of hidden states, this can lead to serious problems such as overfitting 4 due to lack of identifiability, 5 leading to poorly fitted models and spurious findings. A poor choice of time discretization/cycle length 6 can also dramatically bias a Markov model’s outcomes, and altering the cycle length can lead to numerical errors. 7

Semi-Markov models (or equivalently, Markov renewal processes)810 can capture significantly more detail than Markov models can, allowing the time between state transitions to vary according to any distribution 11 rather than being constrained to a geometric distribution. This can dramatically increase model fidelity. This means the risk of developing a disease can depend directly on the length of time spent with a precursor condition,12,13 the sojourn time. Detailed survival analyses from prior studies can be used directly to inform disease risk, which has been identified as a priority in health technology assessments.14,15 This approach conserves identifiability by explicitly including states for which survival data are available. By further generalizing to time-inhomogeneous semi-Markov processes, 11 transitions can change over time, allowing the model to also capture risks that vary by age and secular trends. Semi-Markov models are also used in fields such as actuarial science1618 and engineering. 19 For example, semi-Markov models of cancer control can be used to increase the fidelity of estimates of disease progression and the impact of interventions.20,21

Unlike Markov models, the dynamics of semi-Markov models cannot usually be analytically computed. Instead, Monte Carlo sampling methods are typically used to estimate patient-level dynamics.2224 In this formulation, the semi-Markov model will describe a patient-level simulation approach 25 equivalent to a microsimulation or discrete event simulation,2629 which allows for a significant level of depth and detail.2,30 However, Monte Carlo sampling introduces stochastic uncertainty to the model (first-order uncertainty).25,31 In general, there is no consistent method of estimating the number of samples required to estimate a particular outcome. As a first-order approximation, if outcomes are assumed to be binomially distributed by the model, the length of confidence interval for the outcome is approximately O(1/N) , meaning an increasing number of simulations is required to accurately estimate an outcome, particularly for rare events. This can dramatically increase the computation time required for modeling, which both slows model development and limits the ability to conduct extensive sensitivity analyses, a priority for health economic estimates. 32 The tradeoff between discrete-time Markov-type models and continuous-time semi-Markov simulation models can lead to many potential sources of bias and inaccuracy. 33

We developed the sojourn density model framework for explicit evaluation of semi-Markov and related models. This approach models the dynamics of the probability density of the sojourn time as a system of partial differential equations (PDEs), tracking both the likelihood of being in a given state and the time spent in that state, avoiding the need for stochastic sampling methods. The modeling framework is parametrized by the cause-specific hazard rates34,35 (analogous to transition intensities 36 or the force of increment 37 ), allowing us to exploit survival data and methods 38 such as Kaplan–Meier estimators34,39,40 and risk ratios. 41 This captures the same detail as survival models, with the addition of serial state transitions. This approach is adapted from existing methods used in actuarial science,11,1618 which use the Kolmogorov forward equation. 42 We have modified this for an epidemiology and health economic context, including adapting the notation and simplifying the construction where possible.

In this study, we describe the model structure for sojourn density models, compare their strengths and weaknesses versus other modeling approaches, and describe a numerical scheme for calculating these models. To demonstrate the advantages of this approach, we include an illustrative model of liver disease. In Australia, liver cancer is the seventh most common cause of cancer-related death, 43 with increasing incidence and mortality trends. 44 Routine surveillance of patients with liver cirrhosis (late-stage liver scarring) can increase detection at early stages and improve survival 45 but requires an understanding of the competing risk of liver cancer versus other-cause mortality. Our modeling approach allows us to incorporate competing risks of decompensation, cancer, and other-cause mortality and complex survival data,46,47 which have been noted as key considerations in liver disease modeling. 48

Methods

We now describe the sojourn density model framework and demonstrate its useful mathematical properties. We also develop a numerical scheme for the calculation of these models.

Sojourn Density Model

Consider a state transition model with N possible discrete states labeled 1,2,...,N , such that at any time tTR (typically T=[0,) ), an individual is in exactly 1 of these states. We not only describe the likelihood of an individual being in a given state but also track the distribution of the length of time spent in that state, the sojourn time τT .

Doing so defines a continuous-time random process {Xt}tT with discrete state space {1,2,...,N} such that P(Xt=i)=P(individualinstateiattimet) . We can equivalently define the jump process {Yn,Tn}n=0,1,2, such that Yn=Xtt[Tn,Tn+1) , YnYn+1 . 49

The cause-specific hazard rates

λi,j(t,τ), (1)

depending on both the model time t and sojourn time τ , are the key parameters for the model. These are the instantaneous transition rates for an individual in state i to transition to state j at time t , given that the individual has so far spent time τ in i without any transitions (i.e, entered state i at time tτ ). Hazard rates are widely used parameters in public health modeling. 22 In some cases, h is used for hazard rates. 50 They are equivalent to the force of increment discussed in equation 4.6 of Hoem 11 in the case s=t (i.e., the reference time when the model is in state i is the same as the start of the period over which the transition may occur).

As the transition likelihoods depend on only the current state and the sojourn time, the random process Xt is a semi-Markov proccess, 36 as the sequence of transitions is Markov, but the time between transitions is not neccesarily exponentially distributed. By allowing the hazard rate to depend on t , this generalizes to time-inhomogeneous semi-Markov models. This t dependency can capture temporal trends as well as evolving risk with age a=a0+t , where a0 is the age at t=0 . While Xt is a semi-Markov process, the jump process (Yn,Tn) is a Markov renewal process. 51

For the hazard rate λi,j(t,τ) to represent the transition rate from state i to state j at time t given sojourn time τ , by equation (3) in Król et al., 22 the following should hold:

λi,j(t,τ)=limΔt01ΔtP(Yn+1=j,Tn+1[t,t+Δt)|Yn=i,Tn=tτ)=limΔt01ΔtP(Xt+Δt=j|Xs=is[tτ,t),Xtτi). (2)

For convenience, assume λi,i(t,τ)=0 (i.e., there are no self-loops). If there is no possible transition between i and j , then λi,j=0 .

To track the model state and the sojourn time, we introduce the dynamic density functions for the sojourn time:

fi(t,τ):T×T[0,) (3)

for i=1,2,,N . For a fixed time tT , these are probability density functions on the space of sojourn times and states (τ,i)T×{1,2,...N} , describing the likelihood of being in a particular state with a particular sojourn at the present time. This notation is based on the notation introduced by Asanjarani et al., 36 who described an equivalent density function with a single time/sojourn time variable.

Then we can define the process Xt by the total probability mass in that state:

P(Xt=i)=0fi(t,s)ds=:gi(t). (4)

We should expect the total probability mass to be 1, that is,

igi(t)=1. (5)

Our goal now is to describe the dynamics of fi(t,τ) and then show this definition satisfies (2) and (5). Let fi(t,τ) satisfy the partial differential equation

tfi(t,τ)+τfi(t,τ)=(jλi,j(t,τ))fi(t,τ) (6)

with boundary condition

fi(t,0)=j(0λj,i(t,τ)fj(t,τ)dτ). (7)

This equation is equivalent to the Kolmogorov forward equation.17,42 The left-hand side of (6) represents the transport equation along lines of constant tτ , representing the likelihood of remaining in state i while both t and τ increase, whereas the right-hand side captures the likelihood of making a transition to a subsequent state. Along the directional derivative (1,1)fi , the likelihood decays proportionally to the hazard rate (see the “Method of Characteristics and Survival Analysis” section). More rigorous derivations of this PDE are included elsewhere.16,17 The findings here have been simplified for a health economic context and assume the hazard rates λi,j(t,τ) are smooth.

The boundary condition (7) represents the accumulation of probability mass in state i post-transition. By (5), the initial conditions fi(0,τ) should satisfy

i0fi(0,τ)dτ=1. (8)

In addition,

limτfi(t,τ)=0 (9)

(i.e., supported on finite time).

Compare (6) to equation (2.37.1) in Helwich. 17 Helwich considered the likelihood pij(s,t,u,v) of transition from state y to state z such that Xs=i with sojourn time u and Xt=j with sojourn time less than v . In our notation, this is equivalent to fixing the model at state i with sojourn time u at time s (i.e., fk(s,τ)=δ(τu) if k=i else 0 ), then tracking the evolution of fj(t,τ) for t in a small neighbourhood of s .

We now have a complete description of the random process Xt through (4), (6), (7) and appropriate initial conditions. This constitutes what we will call a sojourn density model. With this current definition, sojourn density models are a type of semi-Markov model as a specific parametrization using hazard rates and computed through the sojourn density functions. In the sections “Sojourn-Preserving Transitions” and “Inhomogeneous Hazard Rates and Infectious Disease Modeling,” we introduce modifications that expand the sojourn density framework to include some non-semi-Markov models.

We now show that the process defined by (4) is well-defined and driven by the hazards λi,j(t,τ) as expected.

  • Theorem 2.1. The random process defined by (4) with dynamics defined by (6) and (7) satisfies the conditions (2) and (5).

Proof. The boundary condition (7) can be justified by calculating the derivative of gi(t) , that is, the rate of change in likelihood of being in state i at time t :

ddtgi(t)=ddt0fi(t,τ)dτ=0tfi(t,τ)dτ=0τfi(t,τ)dτ0(jλi,j(t,τ))fi(t,τ)dτ=fi(t,0)0(jλi,j(t,τ))fi(t,τ)dτ=j(0λj,i(t,τ)fj(t,τ)dτenteringstatei)0(jλi,j(t,τ)leavingstatei)fi(t,τ)dτ. (10)

This satisfies the definition of the hazard rate: the likelihood of being in state i decreases proportionally to the hazard of every transition leaving state ijλi,j(t,τ) and increases for every transition entering state ijλj,i(t,τ) proportional to the weight of those states fj(t,τ) . In addition,

ddt(igi(t))=0, (11)

that is, that the total probability mass is conserved. Therefore, with initial conditions satisfying equation (8) and boundary conditions (7), (5) is satisfied.

The propagation of hazards (2) through (6) can be observed through analogy with corollary 2.37 in Helwich 17 as noted above; a proof in the current notation is included in Appendix A.    ■

Method of Characteristics and Survival Analysis

We now compute the survival curves using method of characteristics. We refer to survival in the sense of the time until an event of interest, which may or may not be mortality, as is often used in epidemiological analyses.34,52 The use of characteristics is equivalent to the approach in section 2.5 of Buchardt et al. 16 or the explicit solution in equation 9 from Asanjarani et al. 36 As noted, equation (6) is best interpreted along characteristics of constant tτ corresponding to the state entry time (Figure 1). This allows explicit solutions through method of characteristics as well as analogies to other survival modeling approaches.

Figure 1.

Figure 1

Characteristics of constant t*=tτ for fi(t,τ) (left) and the corresponding elements Fim,n with constant mn for the numerical scheme (right).

Consider the characteristic t=t*+τ for fixed t*R (Figure 1). Then, along this characteristic,

ddτfi(t*+τ,τ)=tfi(t*+τ,τ)ddτ(t*+τ)+τfi(t*+τ,τ) (12)
=(jλi,j(t*+τ,τ))fi(t*+τ,τ). (13)

If t>0 , then t>τ , which corresponds to a characteristic starting from the boundary τ=0 ; if t*0 , then tτ , which corresponds to a characteristic from the initial condition t=0 .

Defining t0=max(t*,0) , τ0=max(t*,0) (so that t*=t0τ0 , t0,τ00 ), the initial condition (at the observable start of the characteristic; not necessarily at t=0 ) is at (t0,τ0) and so (12) has solution

fi(t*+τ,τ)=fi(t0,τ0)eλi(t*+τ,τ) (14)

where

λi(t,τ)=j(τ0τλi,j(tτ+s,s)ds). (15)

The function λi(t,τ) is the overall cumulative hazard 41 along the characteristic tτ=t* . Then,

Si(t,τ)=eλi(t,τ), (16)

is the overall survival (i.e., the likelihood no transitions occur) along this characteristic, and fi(t+τ,τ)=fi(t0,τ0)Si(t+τ,τ) .

We can correspondingly calculate the time-to-event distribution after entering state i at time tτ as the distribution

λi,j(t,τ)Si(t,τ) (17)

across j=1,2,,N . This allows translation to time-to-event epidemiological models and access to time-to-event methodology.35,53,54

It follows that the probability of the next transition being from ij before time t=t*+τ given entry to state i at time t0 , is

P(nexttransitiontime<tnexttransitionij|enteredstateiattimet0)=P(Yn+1=j,Tn+1<t|Yn=i,Tn=t0)=t0tλi,j(s,st*)Si(s,st*)ds. (18)

The limit of this is the likelihood of the next transition being to a particular state:

P(nexttransitionij|enteredstateiattimet0)=t0λi,j(s,st*)Si(s,st*)ds. (19)

We can also calculate the cause-specific survival, assuming other transitions (or events) are censored:

Si,j(t,τ)=eτ0τλi,j(tτ+s,s)ds. (20)

Where cause-specific survival data (equivalently, cause-specific cumulative incidence) is available with competing events censored, this may provide a calibration target for the hazard rates, which can be directly identified from the survival model (see “Hazard and Cumulative “Hazard in Clark et al. 34 ); parametrization of λi,j(t,τ) across t* can then be identified.

Compare this with the common approach in analytical epidemiological models in which the likelihood of transition is computed explicitly as the convolution of the τ -dependent hazard rate and the time of entry.5557 In our formulation, this is reflected in the boundary conditions

fi(t,0)=j(0λj,i(t,τ)fj(t,τ)dτ)=j(0fj(tτ,0)λj,i(t,τ)Sj(t,τ)dτ). (21)

If we define entry time distribution Fj(x)=fj(x,0) and survival-adjusted cause-specific hazard rate H(j,i)(x)=λj,i(t,x)Sj(t,x) , then Fi(t)=j(Fj*Hj,i)(t) , the convolution of these. Our approach generalizes this to incorporate serial transitions.

Initial Conditions

The appropriate initial conditions are highly dependent on the system being modeled. The Dirac delta function around τ=0 is a potential initial condition, corresponding to starting in state i with no existing sojourn time; explicitly,

fj(0,τ)={δ(τ)ifi=j0ifij. (22)

In practice, modeling may start in situ, that is, after some sojourn time, informing a distribution for the initial condition. For example, this could reflect the time an individual would have spent in a health state by the start of a trial (“Incorporating Lead-Time Biases” section).

Numerical Scheme

The characteristic solutions developed in the “Method of Characteristics and Survival Analysis” section can be used to develop a numerical scheme. For a discretization of T by fixed time steps ΔtR+ , define

Fim,n:=fi(mΔt,nΔt)={fi((mn)Δt,0)eλi(mΔt,nΔt)formnfi(0,(nm)Δt)eλi(mΔt,nΔt)form<n={Fimn,0eλi(mΔt,nΔt)formnFi0,nmeλi(mΔt,nΔt)form<n (23)

for m,n+ . Each value of mnZ corresponds to a characteristic with t*=(mn)Δt . For m<n , the initial condition is Fi0,nm=fi(0,(nm)Δt) . For mn , the initial condition Fimn,0=fi((mn)Δt,0) is the boundary condition of the full PDE.

The cumulative hazard functions λi(t,τ) may be computed by (15) in simple cases or otherwise approximated. Equivalently, each Fim,n element can be calculated iteratively by

Fim,n=Fm1,n1ej0Δtλi,j((m1)Δt+s,(n1)Δt+s)ds. (24)

(see Figure 1 for an illustration of this; compare Buchardt et al. 16 ). In many cases, the integral in (24) can be explicitly computed; otherwise, it can be approximated by the midpoint 0δtλi,j((m1)Δt+s,(n1)Δt+s)dsΔtλi,j((m0.5)Δt,(n0.5)Δt) . The exponential function could similarly be approximated using the appropriate Taylor polynomial for improved performance.

A first-order scheme for the boundary conditions can be computed as

Fim,0=fi(mΔt,0)=j(0λj,i(mΔt,τ)fj(mΔt,τ)dτ)j(k>0Δtλj,i(mΔt,kΔt)fj(mΔt,kΔt))=k>0,j(Δtλj,i(mΔt,kΔt)Fjm,k) (25)

for m0 .

One can subsequently estimate the evolution of the random process Xt as

P(Xt=i)=gi(mΔt)k=0Δtfi(mΔt,kΔt)k=0ΔtFim,k. (26)

An improvement can be made to this numerical scheme by adjusting the boundary conditions to exactly preserve the total mass at all timesteps:

Fim,0=j,kh~j,im,k(Fjm1,k1Fjm,k) (27)

where

h~i,jm,k=λi,j((m12)Δt,(k12)Δt)lλi,l((m12)Δt,(k12)Δt). (28)

We can then verify that i,kFim,k is constant; that is, the total probability mass igi(t) is conserved. We have here used the midpoint estimates for λi,j . This scheme can improve accuracy when there is a significant scale difference between hazards for competing events.

In epidemiological contexts, hazards are often modeled as continuous and smooth functions, so this scheme will have high accuracy and be convergent due to the inclusion of attractor death states.

A straightforward way to approach the numerical scheme is to calculate Fim,n across all i and all m<0 , tracking the accumulated boundary conditions Fm,0 for m>0 , and then computing Fim,n for each m0 across all i,n . This has been implemented in Supplementary Material: Example Code.

Hazard Functions and Common Parametric Distributions

The sojourn time density modeling scheme is determined by the hazard rates λi,j(t,τ) . It is common to describe these rates parameterically25,58 to define a manageable set of model parameters. For example, constant hazard rates λi,j(t,τ)=c correspond to an exponential time-to-event distribution, while λi,j(t,τ)=bkτk1 and λi,j(t,τ)=perτ correspond to Weibull and Gompertz distributions, respectively. 59 There are many other potential choices for λi,j(t,τ) , which may vary with τ , t , or both.

The choice of hazard function is highly dependent on the nature of the system being modeled and the available data sources. 60 van Wijk et al. 58 provided a tutorial and general overview of determining appropriate parametrized hazard functions, including periodic functions that can represent seasonal variations or other cyclical patterns. Hazard rates can be estimated from processed survival data, such as smoothed 61 Kaplan–Meier or Fine–Grey estimates or time-to-event distributions.36,62 Covariates can be directly included in hazard rates; for instance, Cox proportional hazard ratios can be used to define λ~i,j(t,τ|x)=λi,j(t,τ)eβ·x for some vector of covariates xRn and a baseline hazard function λi,j(t,τ) .41,63 Constant multipliers for the hazards can be used as calibration targets, to maintain the shape of the distribution. 64

Asanjarani et al. 36 showed how time-to-event/discrete event simulation distributions38,65 can be translated into hazard rates. Hazard rate can often be defined piecewise, 64 for example, from life tables.66,67 There are also more advanced methods for calculating and representing hazard rates, including nonparametric smoothing 61 and rational polynomial fraction parametrization. 68 Arbitrarily detailed and complex hazard rates can be used in sojourn density models; an appropriate level of detail supported by the data must be chosen to avoid overfitting.

Sojourn-Preserving Transitions

In an epidemiological model, there are often transitions that are independent of the underlying growth of disease. For instance, in a model of cancer development and screening, entry into an “surveillance” cohort would not affect the progress of the disease. This can be reflected in our model framework by introducing transitions that preserve the sojourn time.

To achieve this, we can introduce a new set of hazard rates μi,j(t,τ) that are the transition rates for an individual in state i to transition to state j , at time t , given that the individual has already spent time τ in the state without any transitions occurring, while conserving the sojourn time. These rates μi,j will depend on the type of transition being modeled but would typically have form similar to the λi,j hazard rates (“Hazard Functions and Common Parametric Distributions” section). Then, equation (6) can be modified as follows:

tfi(t,τ)+τfi(t,τ)=(jλi,j(t,τ))fi(t,τ)(jμi,j(t,τ))fi(t,τ)+(kμk,i(t,τ)fk(t,τ)) (29)

with boundary condition (7). With this formulation, theorem 2.1 holds with minor alterations.

This approach may be useful for models in which there are multiple categories of states; for example, a health economic model may have a “health outcome” state space {1,2,3,} (e.g., no lesions, precancerous lesions, cancer) as well as an “intervention” state space {a,b,c...} (e.g., screened, unscreened). The state space corresponding to the Cartesian product of these (1,a),(1,b),(2,a) and so forth can be used for a model that tracks both, with transitions between intervention states governed by the λ parameters and transitions between the intervention states governend by μ .

This can be further extended by continuing to track the sojourn time across multiple independent types of transitions. For instance, one could classify k independent “types” of transitions λi,j1 , λi,j2 , … λi,jk corresponding to k independent sojourn times τ1,τ2,...,τk which are each tracked by the model. Then (29) can be generalized to keep track of multiple sojourn times as

tfi(t,τ)+kτkfi,j(t,τ)=(k,jλi,jk(t,τ))fi(t,τ) (30)

where τ=(τ1,τ2,,τk) , with new boundary conditions for each sojourn time τi

fi(t,τ1(1δ1,n),τ2(1δ2,n),...,τk(1δk,n))=k,j(0λi,jk(t,τ)fj(t,τ)dτk) (31)

for n=1,2,k where δx,y is the Kronecker delta function. Then, each transition type λk “sends” the density to the corresponding boundary condition at τk=0 . This expansion allows for a more general and class of models.

Computing Health Economic Outcomes

In addition to the likelihood of being in each state, there are other outcomes relevant for health economic analyses that can be computed from these models. These can usually be assessed via either the expected time spent in each state or the likelihood of transition into a state.

For example, the quality-adjusted life-years (QALYs 69 ) can be calculated based on the expected time spent in each state. If a person in state i has a QALY value of xi , the expected QALY value over the period t0 to t1 is the weighted sum

ixit0t1gi(t)dt. (32)

Outcomes such as costs associated with entering a particular state can be calculated via the boundary condition. For instance, the likelihood of entering state i over the period t0 to t1 is

t0t1fi(t,0)dt. (33)

If there is a cost αi associated with entering state i , the total expected cost across this period is

iαit0t1fi(t,0)dt. (34)

If there are ongoing costs associated with the time spent in a particular state, these can be calculated in the same way as QALYs above.

Inhomogeneous Hazard Rates and Infectious Disease Modeling

By relaxing the assumptions on the form of the hazard functions, the sojourn time density models can be used to simulate a more general class of models than semi-Markov models. For example, we could allow the hazard to depend on the current model state, that is, have the form

λi,j(t,τ,g(t)) (35)

where g(t)={gi(t)} is a vector of the values (4). One can then simulate models where the effect size is dependent on the current state of the system.

For example, the standard SIR model 70 has states S (susceptible), I (infectious), R (recovered). The risk of infection is proportional to the currently infected population, leading to the hazard

λS,I(t,τ,g(t))=βP(Xt=I)=β0fI(t,τ)dτ=βgI(t). (36)

The recovery rate for infected individuals is constant, that is,

λI,R(t,τ,g(t))=γ. (37)

These hazards can be used to define a sojourn density model; we can confirm that

dgSdt=βgS(t)gI(t),dgIdt=βgS(t)gI(t)γgI(t),dgRdt=γgI(t), (38)

that is, the defining equations for the standard SIR model. Using this starting point, we could modify this to introduce more complex risks of infection and recovery. For instance, a distribution for time to disease recovery could be simulated by modifying λI,R to be a function of τ , the time in infectious state, to reflect an inflection that can clear after only a few days minimum. In the sojourn density model framework, new dynamics can be included by modifying the hazard functions rather than by adjusting the design of the model.

This addition of heterogeneous generalizes the sojourn density model framework and could allow for the synthesis of different model types into one framework. For example, a model of both a viral infection and a resulting disease (such as hepatitis and liver cancer) could be directly modeled. The addition of proportional terms to the hazard does not affect theorem 2.1 but does mean the method of characteristics described in the Method of Characteristics and survival analysis section may no longer be applied.

Incorporating Lead-Time Biases

As the sojourn time is modeled explicitly in this framework, this allows us to incorporate lead-time bias. Lead-time biases occur in many contexts; for instance, screen detection of cancer artificially increases survival times due to the increased observation window.71,72 A related bias occurs in modeling that has been informed by survival data measured from the start of a trial rather than onset of disease.

These biases can be addressed by modifying the hazard rate. A highly simplistic approach may be to define the adjusted hazard rate

h~i,j(t,τ)={0forτ<τ0λi,j(t,ττ0)ττ0 (39)

where hi,j(t,τ) is the observed cause-specific hazard rate from a trial that may include a lead-time bias and τ0 is a cutoff time. This assumes that there is no risk of transition until some given time τ0 . Alternatively, the hazard could be assumed to increase to the observed rates at time τ0 , linearly or otherwise. In practice, a useful estimate for τ0 may be difficult to determine, requiring a sensitivity analysis over a range; there are many useful methods for estimating the lead time.7376

Example Model: Liver Disease

To demonstrate our new approach to modeling, we now describe a model of liver disease and development of cancer. A key intervention in the prevention of liver cancer death is early detection via routine surveillance. Recommendations like 6-monthly ultrasound for people with cirrhosis (late-stage liver scarring) can increase the likelihood of detection at early stages when curative treatment is possible. 77 However, people with cirrhosis are also at high risk of both liver decompensation (irreversably worsened liver damage) and noncancer death. People who develop liver decompensation or die before the onset of liver cancer cannot benefit from early detection. It is therefore critical to understand the competing risks of liver cancer, liver decompensation, and other-cause death when assessing whether routine surveillance is advisable or whether the costs and patient discomfort caused by surveillance outweigh the health benefits. This model shows how the risk of liver cancer, and competing risk of decompensation, can be captured in a way that evolves with the length of time spent with liver cirrhosis.

The structure of the simplified model is shown in Figure 2 and captures individuals with liver cirrhosis, decompensation events, onset of liver cancer, and death. As the model is designed to illustrate the potential utility of surveillance for liver cancer in the presence of competing risks of decompensation and death, liver cancer and death were used as terminal states. Hazard rates for relevant transitions (decompensation, onset of liver cancer, death) were calibrated by identifying appropriate parametric forms and calibrating to relevant data sources. With this approach, the design of the model states and flow is kept simple, while the complexity is shifted to computing the hazard rates determining the transitions between states. Each cause-specific hazard was fitted in turn, and the overall fit assessed. For this iteration of the model, a cohort aged 55 y was simulated, based on the mean age of cirrhosis diagnosis.

Figure 2.

Figure 2

Schematic of the example liver disease model and the sojourn- and time-dependent hazards for competing risks for people with compensated cirrhosis.

The modeling described in this article is part of a larger model developed to inform the “Clinical Practice Guidelines for Liver Cancer Surveillance for People at High Risk in Australia.” 78 The development of these guidelines, including the modeling, was led by clinicians in the area of liver disease and liver cancer alongside a multidisciplinary working party, which included health care and clinical representatives, representatives with lived experience, and other community representatives. The full model includes detailed liver cancer stage progression and survival as well as economic details. For further details, see Worthington et al. 79

For this analysis, published survival curves were used as target data for the modeling. Where patient-level data are available, this could be used as a calibration target instead, using methods in the existing literature on semi-Markov models. 22

Results

Performance of Sojourn Density Models

The described model structure can capture the same level of detail as semi-Markov models and many other patient-level simulations, increasing the level of detail while allowing for deterministic evaluation. The computational complexity of the numerical algorithm described in the “Numerical Scheme” section is O(T2S2) , where T is the number of timesteps and S is the number of model states. Vectorization across characteristics can improve performance considerably, for example, via the Python NumPy library or other scientific computing libraries. For models in which not every state has a transition to every other state, the complexity is O(T2U) where U<S2 is the number of possible transitions. Compare this with a patient-level simulation, including most implementations of semi-Markov models, in which the computational complexity is driven by the number of patients simulated O(TN) , where N is the number of simulated patients. As noted previously, the precision of such models is approximately O(1/N) , and so in some applications, N may need to be very large for precise estimates. This is avoided in the sojourn time density model. For comparison, a standard numerical algorithm for a Markov model would have computational complexity O(TS2) .

In practice, sojourn density models can run in less than 1 s on a standard computer. This allows for runs with a variety of parameter values to assess sensitivity analyses, particular with parallel computing. In most cases, the optimal timestep Δt can be identified by assessing a particular parameter set, selecting a small enough value for the results of interest to converge. This choice will be highly dependent on the problem being assessed, roughly scaling with the expected transition time. Future work is planned to assess the effect of the timestep.

With an appropriate implementation of the numerical algorithm described in the “Numerical Scheme” section, computing a sojourn density model requires specifying only the model states and the form of the hazard rates between them. Listing 1 shows the syntax of the code for this model; the full code, including the implementation of the numerical scheme, has been included in Supplementary Material: Example Code.

Listing 1.

Excerpt Showing Code Syntax for the Liver Disease Model a

Liver_Disease_Model=Sojourn_Model.Model()
state_names=[“Compensated cirrhosis”,“Decompensated cirrhosis”,“Liver
↪ cancer”,“Death”]
Liver_Disease_Model.add_state(state_names)
#Liver cancer onset transition
Liver_Cancer_Hazard=Hazard_Functions.Hazard( Hazard_Type =
↪ “gompertz_sojourn”, Parameters={“p”:1.197e-02,“r”:6.654e-02})
Liver_Disease_Model.add_transition(“Compensated cirrhosis”,“Liver
↪ cancer”,Liver_Cancer_Hazard)
///Other transitions omitted for brevity
res_df=Liver_Disease_Model.run_model(dt=0.01, max_t=10,
↪ init_conds=[“Compensated cirrhosis”])
a

Full code is included in Supplementary Material: Example Code.

Liver Disease Model Calibration

The model was calibrated to reproduce key targets, with model parameters shown in Table 1. For each target, an appropriate parameterized hazard rate was selected, and optimal parameters were calculated using the BFGS algorithm in the SciPy package. We also calibrated distributions for hazard ratios for each of the hazard rate functions. These were calibrated to reproduce the 95% confidence intervals in each data source across the range of hazard ratio values and act as multipliers for each distribution for the probabilistic sensitivity analysis. The model was well-fitted to the target data, reproducing terminal values for the survival curves and remaining within the 95% confidence intervals for the duration. Key targets are shown in Figure 3. For all-cause death rates, a piecewise constant hazard was used based on age-specific rates published by the Australian Bureau of Statistics. 82

Table 1.

Calibrated Parameters for Example Model of Liver Disease

Parameter Model Value (95% PI) Target (95% CI) Note
Cirrhosis decompensation hazard λ=0.0129t+0.00668fort<6,0.0359fort6 Calibrated to Vilar-Gomez et al. 80
Hazard ratio distribution for PSA Lognormal(μ=0.0213,σ2=0.1172) Calibrated to Vilar-Gomez et al. 80
Annual decompensation risk (target) 1.03% (0.9%–1.2%) 1.59% (0.17%–3.01%) Calibration target 80
Ten-year decompensation risk 34.5% (29.5%–40.2%) 33.7% (28.4%–39.1%)
Liver cancer onset hazard λ=0.0102τ1.377 Calibrated to Vilar-Gomez et al. 80
Hazard ratio distribution for PSA Lognormal(μ=0.00171,σ2=0.1582) Calibrated to Vilar-Gomez et al. 80
Annual liver cancer risk 0.99% (0.8%–1.3%) 1.53% (0.14%–2.93%) Calibration target 80
Ten-year liver cancer risk 15.2% (11.8%–19.2%) 15.6% (11.5%–19.7%)
Death hazard (compensated cirrhosis) λ=0.0336τ0.238 Calibrated to D’Amico et al. 81
Hazard ratio distribution for PSA Lognormal(μ=0.0254,σ2=0.1472) Calibrated to D’Amico et al. 81
Annual death risk (compensated) 4.72% (3.8%–5.8%) 3.97% (2.62%–5.31%) Calibration target 81
Ten-year death risk (compensated) 39.4% (35.2%–43.6%)* 39.4% (36.1%–42.8%)
Death hazard (decompensated cirrhosis) λ=0.4470.150τ+0.0237τ2 Calibrated to D’Amico et al. 81
Hazard ratio distribution for PSA Lognormal(μ=0.0478,σ2=0.08952) Calibrated to D’Amico et al. 81
Annual death risk (decompensated) 37.8% (33.1%–42.4%)* 38.3% (35.0%–41.6%) Calibration target 81
Ten-year death risk (decompensated) 90.6% (89.0%–91.9%)* 91.1% (89.2%–93.0%)
Death hazard (other causes) Age-specific annual rates Australian life tables 82

CI, confidence interval; PSA, probabilistic sensitivity analysis.

*

95% PI: 95% prediction interval of model outcomes, based on samples from the specified hazard ratio distributions.

Figure 3.

Figure 3

Selected calibration outcomes. Top row: Liver cancer incidence rates (left; target Vilar Gomez et al. 80 ) and decompensation rates (right). Bottom row: all-cause death rates in patients with compensated (left; includes any decompensation events) and decompensated (right) cirrhosis.

A code snippet showing the syntax to define a model using the included code is shown in Listing 1. Examples of the kinds of outputs that can be generated are shown in Figure 4. This includes the likelihood of Xt being in a given state by time (through g(t) ). This was used to calculate the 5-y risk of liver cancer or non–liver cancer death in people who have not yet developed liver cancer or died, a key outcome for assessing the benefits of liver surveillance.

Figure 4.

Figure 4

Outcomes from the example model of liver disease. Left: evolving likelihood of liver decompensation, liver cancer, and non–liver cancer death. Right: 5-y risk of liver cancer versus 5-y risk of non–liver cancer death for patients who have not yet developed liver cancer/died, including 95% model range based on probabilistic sensitivity analysis.

A probabilistic sensitivity analysis was run by sampling hazard ratios from the distributions given in Table 1. The sensitivity analysis ran 100,000 parameter sets, which was completed in under 2 h. In the 5-y risk outcomes in Figure 4, this was used to develop a prediction interval; 95% of the simulated parameter sets led to outcomes in the indicated range.

Discussion

We described a novel approach to epidemiological modeling by calculating the evolution of the probability density of the sojourn time. This allows detailed survival data sources to be used to model competing risks. We demonstrated this sojourn density model approach by presenting a model of liver disease. The model was calibrated and reproduced the targets accurately with low computational burden.

The novel sojourn density model structure allows for flexible analyses that rely on detailed survival data and flexible time scales. This gives advantage over typical epidemiological modeling approaches, improving detail without sacrificing interprability while limiting computational burden. Designing models that reduce computational burden to facilitate probabilistic sensitivity analyses has been identified as a priority for health technology assessments, 32 often requiring surrogate methods such as meta-modeling/emulators to reduce the reliance on computationally expensive models.8386

By using the hazard rates as parameters, the models can be interpreted directly in the context of source epidemiology studies, and parameter values can be directly inferred from survival data. In the context of liver disease modeling, this means the model can reflect the true health benefits of early-stage diagnosis by accurately capturing disease progression. The ability to numerically evaluate our model at any time scale also allows us to accurately model phenomena such as surveillance intervals, which were not been included in real-world trials with a high degree of precision. Avoiding a fixed time discretization reduces the likelihood of numerical errors if chosen poorly 87 and improves the ability to capture short time scale events.

There is great potential to refine survival analysis estimates designed to guide policy recommendations, 15 and it is hoped that this modeling approach allows for the development of straightforward models that incorporate existing data sources in a clear and flexible way, reducing the need for model calibration and minimizing data postprocessing and model design effort. The benefits of this approach are demonstrated in the closeness of fit to the calibration targets (Figure 3) as well as the flexibility to evaluate surveillance at a wide range of intervals. 79 By providing a numerical implementation, we have also reduced the implementation burden on health economic practitioners, requiring only the specification of the model states and the hazard rates associated with the transitions between them.

The limitations of this model structure include the semi-Markov property; each state transition depends only on the current state, the sojourn time, and the model time. This approach may therefore not be appropriate in contexts where multiple prior events are required to inform future likelihoods.

The liver disease model described here is a part of a larger model of liver disease, 79 which was used to evaluate surveillance recommendations for the “Clinical Practice Guidelines for Hepatocellular Carcinoma Surveillance for People at High Risk in Australia.” 78 The flexible sojourn density model allowed us to analyze combinations of surveillance technologies and intervals to analyze more complex surveillance algorithms and optimize these algorithms through iterative design enabled by the fast computation time and high precision.

There are many potential future developments to build on the work presented here. Higher-order approximations for the integrals in (25) and (24) could be used to derive more accurate numerical schemes, which may help when dealing with pathological hazards such as those that are nonsmooth or have discontinuities. It may also be possible to develop spectral methods to approximate solutions without time discretization. 88 Further work could also use these models to assess the potential impact of lead-time biases, and the loss in precision when these are not considered in modeling. It is hoped that the method described here will be of general use to health economic practitioners.

Conclusions

To inform health policy and make an impact, epidemiological models must be able to provide reliable, timely, and interpretable results. Sojourn density models can incorporate published survival data and generate competing risk estimates quickly, allowing for simple design and fast calibration. We hope this approach will be of use to modelers in public health.

Supplemental Material

sj-pdf-1-mdm-10.1177_0272989X251333398 – Supplemental material for Evaluating Semi-Markov Processes and Other Epidemiological Time-to-Event Models by Computing Disease Sojourn Density as Partial Differential Equations

Supplemental material, sj-pdf-1-mdm-10.1177_0272989X251333398 for Evaluating Semi-Markov Processes and Other Epidemiological Time-to-Event Models by Computing Disease Sojourn Density as Partial Differential Equations by Joachim Worthington, Eleonora Feletto, Emily He, Stephen Wade, Barbara de Graaff, Anh Le Tuan Nguyen, Jacob George, Karen Canfell and Michael Caruana in Medical Decision Making

Acknowledgments

The authors would like to acknowledge the Expert Advisory Group for the Optimising Liver Cancer Control in Australia project convened by Cancer Council Australia. We would also like to thank Cancer Council Australia for coordination of this project.

Materials reported in this article, including the modeling and results, were developed to support the Optimising Liver Cancer Control in Australia project and were originally developed and reported in “Clinical Practice Guidelines for Hepatocellular Carcinoma Surveillance for People at High Risk in Australia: Appendix D7,” a non–peer-reviewed report.

The authors would like to acknowledge the working group for the “Clinical Practice Guidelines for Hepatocellular Carcinoma Surveillance for People at High Risk in Australia.” A full list of working group members is available in “Clinical Practice Guidelines for Hepatocellular Carcinoma Surveillance for People at High Risk in Australia: Appendix I.”

The authors would like to thank the Daffodil Centre Systematic Review team, who provided assistance in reviewing the literature on existing models of hepatocellular carcinoma and surveillance.

The authors would also like to thank the reviewers and editors at Medical Decision Making for their handling of this article and the substantial contributions and improvements they collectively made to the direction and style of the final article.

This article uses custom data sets from the Australian Institute of Health and Welfare and the Cancer Institute NSW. We thank the Australian Institute of Health and Welfare and the population-based cancer registries of New South Wales, Victoria, Queensland, Western Australia, South Australia, Tasmania, the Australian Capital Territory, and the Northern Territory for the provision of data from the Australian Cancer Database. We would also like to thank the Cancer Institute NSW for their assistance with the data request.

Karen Canfell receives salary funding from the National Health and Medical Research Council Australia (NHMRC Leadership Fellowship APP1194679).

Footnotes

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Karen Canfell is co-principal investigator (PI) of an investigator-initiated trial of human papillomavirus (HPV) screening in Australia (Compass), which is conducted and funded by the Australian Centre for the Prevention of Cervical Cancer (ACPCC), a government-funded health promotion charity. The ACPCC has previously received equipment and a funding contribution for the Compass trial from Roche Molecular Systems USA. She is also co-PI on a major implementation program, “Elimination of Cervical Cancer in the Western Pacific,” which receives support from the Minderoo Foundation and equipment donations from Cepheid Inc. Michael Caruana is an investigator on an investigator-initiated trial of cytology and primary HPV screening in Australia (“Compass”; ACTRN12613001207707 and NCT02328872), which is conducted and funded by the Australian Centre for the Prevention of Cervical Cancer, a government-funded health promotion charity. The Australian Centre for the Prevention of Cervical Cancer has received equipment and a funding contribution for the Compass trial from Roche Molecular Systems and operational support from the Australian government. However, neither MC nor his institution on his behalf (the Daffodil Centre, a joint venture between Cancer Council NSW and The University of Sydney) receive direct or indirect funding from industry for Compass Australia. All other authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article. The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Cancer Council Australia financially supported this study as part of the Optimising Liver Cancer Control in Australia project, which was funded through the Australian Government Department of Health and Aged Care. The funding agreement ensured the authors’ independence in designing the study, interpreting the data, writing, and publishing the report.

Author’s Note: Anh Le Tuan Nguyen is also affiliated to Hanoi Medical University, Vietnam

Ethical Considerations: Not applicable. This article does not report on animal or human studies and did not use any individual-level data to inform the analysis.

Consent to Participate: Not applicable. This article does not report on animal or human studies and did not use any individual-level data to inform the analysis.

Consent for Publication: Not applicable. This article does not report on animal or human studies and did not use any individual-level data to inform the analysis.

Data Availability: The code used to generate these figures has been provided as a supplementary file at https://doi.org/10.6084/m9.figshare.28004642, including all relevant parameters.

Contributor Information

Joachim Worthington, The Daffodil Centre, The University of Sydney, a joint venture with Cancer Council NSW, Sydney, NSW, Australia.

Eleonora Feletto, The Daffodil Centre, The University of Sydney, a joint venture with Cancer Council NSW, Sydney, NSW, Australia.

Emily He, The Daffodil Centre, The University of Sydney, a joint venture with Cancer Council NSW, Sydney, NSW, Australia.

Stephen Wade, The Daffodil Centre, The University of Sydney, a joint venture with Cancer Council NSW, Sydney, NSW, Australia.

Barbara de Graaff, Menzies Institute for Medical Research, The University of Tasmania, Hobart, TAS, Australia.

Anh Le Tuan Nguyen, Menzies Institute for Medical Research, The University of Tasmania, Hobart, TAS, Australia; WHO Collaborating Centre for Viral Hepatitis, The Peter Doherty Institute for Infection and Immunity.

Jacob George, Storr Liver Centre, The Westmead Institute for Medical Research, Westmead Hospital and University of Sydney, Sydney, NSW, Australia.

Karen Canfell, School of Public Health, Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, Australia.

Michael Caruana, The Daffodil Centre, The University of Sydney, a joint venture with Cancer Council NSW, Sydney, NSW, Australia.

References

  • 1. Siebert U, Alagoz O, Bayoumi AM, et al. State-transition modeling: a report of the ISPOR-SMDM modeling good research practices task force-3. Value Health. 2012;15(6):812–20. DOI: 10.1016/j.jval.2012.06.014 [DOI] [PubMed] [Google Scholar]
  • 2. Brennan A, Chick SE, Davies R. A taxonomy of model structures for economic evaluation of health technologies. Health Econ. 2006;15(12):1295–310. [DOI] [PubMed] [Google Scholar]
  • 3. Shappell H, Caffo BS, Pekar JJ, Lindquist MA. Improved state change estimation in dynamic functional connectivity using hidden semi-Markov models. NeuroImage. 2019;191:243–57. DOI: 10.1016/j.neuroimage.2019.02.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Rousseau J, Mengersen K. Asymptotic behaviour of the posterior distribution in overfitted mixture models. J R Stat Soc Ser B Stat Methodol. 2011;73(5):689–710. [Google Scholar]
  • 5. An Y, Hu Y, Shum M. Identifiability and Inference of Hidden Markov Models. Technical Report, Citeseer; 2013. [Google Scholar]
  • 6. Chrosny W, Stevenson M, Munzer A. Comparison of Markov and discrete event simulation modeling techniques with application to cost effectiveness analyses. Value Health. 2013;16(7):A587. DOI: 10.1016/j.jval.2013.08.1623 [DOI] [Google Scholar]
  • 7. Chhatwal J, Jayasuriya S, Elbasha EH. Changing cycle lengths in state-transition models: challenges and solutions. Med Decis Making. 2016;36(8):952–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Smith WL. Regenerative stochastic processes. Proc R Soc A. Math Phys Sci. 1955;232(1188):6–31. [Google Scholar]
  • 9. Janssen J, Limnios N. Semi-Markov Models and Applications. New York: Springer Science & Business Media; 2013. [Google Scholar]
  • 10. Barbu VS, Limnios N. Semi-Markov Chains and Hidden Semi-Markov Models toward Applications: Their Use in Reliability and DNA Analysis. Vol. 191. New York: Springer Science & Business Media, 2009. [Google Scholar]
  • 11. Hoem JM. Inhomogeneous semi-Markov processes, select actuarial tables, and duration-dependence in demography. In: Greville TNE, eds. Population Dynamics. Philadelphia: Elsevier; 1972. p :251–96. [Google Scholar]
  • 12. Commenges D. Multi-state models in epidemiology. Lifetime Data Anal. 1999;5:315–27. [DOI] [PubMed] [Google Scholar]
  • 13. Cao Q, Buskens E, Feenstra T, Jaarsma T, Hillege H, Postmus D. Continuous-time semi-markov models in health economic decision making: an illustrative example in heart failure disease management. Med Decis Making. 2016;36(1):59–71. DOI: 10.1177/0272989x15593080 [DOI] [PubMed] [Google Scholar]
  • 14. Latimer NR. Survival analysis for economic evaluations alongside clinical trials—extrapolation with Patient-Level Data: Inconsistencies, Limitations, and a Practical Guide. Medl Decis Making. 2013;33(6):743-754. DOI: 10.1177/0272989X12472398 [DOI] [PubMed] [Google Scholar]
  • 15. Bell Gorrod H, Kearns B, Stevens J, et al. A review of survival analysis methods used in NICE technology appraisals of cancer treatments: consistency, limitations, and areas for improvement. Med Decis Making. 2019;39(8):899–909. DOI: 10.1177/0272989X19881967 [DOI] [PubMed] [Google Scholar]
  • 16. Buchardt K, Møller T, Schmidt KB. Cash flows and policyholder behaviour in the semi-Markov life insurance setup. Scand Actuar J. 2015;2015(8):660–88. [Google Scholar]
  • 17. Helwich M. Durational Effects and Non-smooth Semi-Markov Models in Life Insurance [doctoral dissertation]. Rostock (Germany): University of Rostock; 2008. [Google Scholar]
  • 18. Christiansen M. Multistate Models in Health Insurance [PhD thesis]. Ulm (Germany): Institut für Versicherungswissenschaften, Universität Ulm; 2010. Available from: https://www.uni-ulm.de/fileadmin/website_uni_ulm/mawi2/dokumente/preprint-server/2010/1015_multmodinhealthins.pdf [Google Scholar]
  • 19. Limnios N, Oprisan G. Semi-Markov Processes and Reliability. New York: Springer Science & Business Media; 2012. [Google Scholar]
  • 20. Castelli C, Combescure C, Foucher Y, Daures JP. Cost-effectiveness analysis in colorectal cancer using a semi-Markov model. Stat Med. 2007;26(30):5557–71. DOI: 10.1002/sim.3112 [DOI] [PubMed] [Google Scholar]
  • 21. Bullement A, Cranmer HL, Shields GE. A review of recent decision-analytic models used to evaluate the economic value of cancer treatments. Appl Health Econ Health Policy. 2019;17(6):771–80. DOI: 10.1007/s40258-019-00513-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Król A, Saint-Pierre P. SemiMarkov: an R package for parametric estimation in multi-state semi-Markov models. J Stat Softw. 2015;66:1–16. DOI: 10.18637/jss.v066.i06 [DOI] [Google Scholar]
  • 23. Zucchini W, MacDonald IL, Langrock R. Hidden Markov Models for Time Series: An Introduction Using R. Boca Raton (FL): CRC Press; 2017. [Google Scholar]
  • 24. Wu B, Maya BIG, Limnios N. Using semi-Markov chains to solve semi-Markov processes. Methodol Comput Appl Probab. 2021;23:1419–31. [Google Scholar]
  • 25. Davis S, Stevenson M, Tappenden P, Wailoo A. NICE DSU technical support document 15: cost-effectiveness modelling using patient-level simulation. Report by the NICE Decision Support Unit; 2014. [PubMed]
  • 26. Caro JJ. Pharmacoeconomic analyses using discrete event simulation. Pharmacoeconomics. 2005;23:323–32. [DOI] [PubMed] [Google Scholar]
  • 27. Degeling K, Franken MD, May AM, et al. Matching the model with the evidence: comparing discrete event simulation and state-transition modeling for time-to-event predictions in a cost-effectiveness analysis of treatment in metastatic colorectal cancer patients. Cancer Epidemiol. 2018;57:60–7. DOI: 10.1016/j.canep.2018.09.008 [DOI] [PubMed] [Google Scholar]
  • 28. Degeling K, IJzerman MJ, Groothuis-Oudshoorn CGM, et al. Comparing modeling approaches for discrete event simulations with competing risks based on censored individual patient data: a simulation study and illustration in colorectal cancer. Value Health. 2022;25(1):104–15. DOI: 10.1016/j.jval.2021.07.016 [DOI] [PubMed] [Google Scholar]
  • 29. Çağlayan C, Terawaki H, Chen Q, Rai A, Ayer T, Flowers CR. Microsimulation modeling in oncology. JCO Clin Cancer Inform. 2018;2:1–11. DOI: 10.1200/CCI.17.00029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Barton P, Bryan S, Robinson S. Modelling in the economic evaluation of health care: selecting the appropriate approach. J Health Serv Res Policy. 2004;9(2):110–8. [DOI] [PubMed] [Google Scholar]
  • 31. Groot Koerkamp B, Stijnen T, Weinstein MC, Hunink MG. The combined analysis of uncertainty and patient heterogeneity in medical decision models. Med Decis Making. 2011;31(4):650–61. [DOI] [PubMed] [Google Scholar]
  • 32. Claxton K, Sculpher M, McCabe C, et al. Probabilistic sensitivity analysis for NICE technology assessment: not an optional extra. Health Econ. 2005;14(4):339–47. DOI: 10.1002/hec.985 [DOI] [PubMed] [Google Scholar]
  • 33. Soares MO, Canto E, Castro L. Continuous time simulation and discretized models for cost-effectiveness analysis. Pharmacoeconomics. 2012;30:1101–17. [DOI] [PubMed] [Google Scholar]
  • 34. Clark TG, Bradburn MJ, Love SB, Altman DG. Survival analysis part I: basic concepts and first analyses. Br J Cancer. 2003;89(2):232–8. DOI: 10.1038/sj.bjc.6601118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Hosmer DW, Jr, Lemeshow S, May S. Applied Survival Analysis: Regression Modeling of Time-To-Event Data. New York: John Wiley & Sons; 2011. [Google Scholar]
  • 36. Asanjarani A, Liquet B, Nazarathy Y. Estimation of semi-Markov multi-state models: a comparison of the sojourn times and transition intensities approaches. Int J Biostat. 2021;18(1):243–62. DOI: 10.1515/ijb-2020-0083 [DOI] [PubMed] [Google Scholar]
  • 37. Hoem JM. Markov chain models in life insurance. Blätter der DGVFM. 1969;9(2):91–107. [Google Scholar]
  • 38. Latimer NR. Survival analysis for economic evaluations alongside clinical trials—extrapolation with patient-level data: inconsistencies, limitations, and a practical guide. Med Decis Making. 2013;33(6):743–54. DOI: 10.1177/0272989X12472398 [DOI] [PubMed] [Google Scholar]
  • 39. Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. J Am Statist Assoc. 1958;53(282):457–81. [Google Scholar]
  • 40. Goel MK, Khanna P, Kishore J. Understanding survival analysis: Kaplan-Meier estimate. Int J Ayurveda Res. 2010;1(4):274. DOI: 10.4103/0974-7788.76794 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Cox DR, Oakes D. Analysis of Survival Data. Vol. 21. Boca Raton (FL): CRC Press; 1984. [Google Scholar]
  • 42. Kolmogoroff A. Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung. Math Ann. 1931;104:415–58. [Google Scholar]
  • 43. Wallace MC, Preen DB, Short MW, Adams LA, Jeffrey GP. Hepatocellular carcinoma in Australia 1982–2014: increasing incidence and improving survival. Liver Int. 2019;39(3):522–30. DOI: 10.1111/liv.13966 [DOI] [PubMed] [Google Scholar]
  • 44. Luo Q, O’Connell DL, Yu XQ, et al. Cancer incidence and mortality in Australia from 2020 to 2044 and an exploratory analysis of the potential effect of treatment delays during the COVID-19 pandemic: a statistical modelling study. Lancet Public Health. 2022;7(6):e537–48. DOI: 10.1016/S2468-2667(22)00090-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Huang DQ, Singal AG, Kanwal F, et al. Hepatocellular carcinoma surveillance — utilization, barriers and the impact of changing aetiology. Nat Rev Gastroenterol Hepatol. 2023;20(12):797–809. DOI: 10.1038/s41575-023-00818-8 [DOI] [PubMed] [Google Scholar]
  • 46. Gupta A, Rana R, Agarwal S, et al. Assessing the risk of further decompensation and survival in patients with cirrhosis with variceal bleeding as their first decompensation event. Am J Gastroenterol. 2022;118(5):833–9. DOI: 10.14309/ajg.0000000000002018 [DOI] [PubMed] [Google Scholar]
  • 47. Fleming KM, Aithal GP, Card TR, West J. The rate of decompensation and clinical progression of disease in people with cirrhosis: a cohort study. Aliment Pharmacol Ther. 2010;32(11–12):1343–50. DOI: 10.1111/j.1365-2036.2010.04473.x [DOI] [PubMed] [Google Scholar]
  • 48. Jepsen P, Vilstrup H, Andersen PK. The clinical course of cirrhosis: the importance of multistate models and competing risks analysis. Hepatology. 2015;62(1):292–302. DOI: 10.1002/hep.27598 [DOI] [PubMed] [Google Scholar]
  • 49. Georgoulas A, Hillston J, Sanguinetti G. Unbiased Bayesian inference for population Markov jump processes via random truncations. Stat Comput. 2017;27:991–1002. DOI: 10.1007/s11222-016-9667-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Lee ET, Wang J. Statistical Methods for Survival Data Analysis. Vol. 476. New York: John Wiley & Sons; 2003. [Google Scholar]
  • 51. Medhi J. Stochastic Processes. New Delhi: New Age International; 1994. [Google Scholar]
  • 52. Nahhas RW. Introduction to Regression Methods for Public Health Using R. Boca Raton (FL): CRC Press; 2024. [Google Scholar]
  • 53. Seaman SR, Presanis A, Jackson C. Estimating a time-to-event distribution from right-truncated data in an epidemic: a review of methods. Stat Methods Med Res. 2022;31(9):1641–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Tierney JF, Stewart LA, Ghersi D, Burdett S, Sydes MR. Practical methods for incorporating summary time-to-event data into meta-analysis. Trials. 2007;8:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Hirschmann H. The Semi-Markov Process. Generalizations and Calculation Rules for Application in the Analysis of Systems. Technical report Eidgenoessisches Inst. fuer Reaktorforschung; 1983. [Google Scholar]
  • 56. Pinsky PF. An early-and late-stage convolution model for disease natural history. Biometrics. 2004;60(1):191–8. DOI: 10.1111/j.0006-341X.2004.00023.x [DOI] [PubMed] [Google Scholar]
  • 57. Flaxman S, Mishra S, Gandy A, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–61. DOI: 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]
  • 58. van Wijk RC, Simonsson USH. Finding the right hazard function for time to event modelling: a tutorial and Shiny application. CPT Pharmacometrics Syst Pharmacol. 2022;11:991–1001. DOI: 10.1002/psp4.12797 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Juckett DA, Rosenberg B. Comparison of the Gompertz and Weibull functions as descriptors for human mortality distributions and their intersections. Mech Ageing Dev. 1993;69(1–2):1–31. DOI: 10.1016/0047-6374(93)90068-3 [DOI] [PubMed] [Google Scholar]
  • 60. Kleinbaum DG, Klein M. Parametric survival models. In: Survival Analysis: A Self-Learning Text. New York: Springer; 2012. p 289–361. [Google Scholar]
  • 61. Wang J-L. Smoothing hazard rates. In: Armitage P, Colton T, eds. Encyclopedia of Biostatistics. Vol. 7. New York: Wiley; 2005. p :4986–97. [Google Scholar]
  • 62. Guo C, So Y. Cause-Specific Analysis of Competing Risks Using the PHREG Procedure. Paper No.: SAS2159-2018. Cary (NC): SAS Institute Inc.; 2018. [Google Scholar]
  • 63. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. 1999;94(446):496–509. DOI: 10.2307/2670170 [DOI] [Google Scholar]
  • 64. Coldman AJ, Phillips N, Brisson J, et al. Using the cancer risk management model to evaluate colorectal cancer screening options for Canada. Curr Oncol. 2015;22(2):41–50. DOI: 10.3747/co.22.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Crowther MJ, Lambert PC. Parametric multistate survival models: flexible modelling allowing transition-specific distributions with application to estimating clinically useful measures of effect differences. Stat Med. 2017;36(29):4719–42. [DOI] [PubMed] [Google Scholar]
  • 66. World Health Organisation. WHO Methods and Data Sources for Life Tables 1990–2019. Geneva: World Health Organisation; 2020. [Google Scholar]
  • 67. Australian Bureau of Statistics. Life Expectancy Methodology. Technical report. Sydney: Australian Bureau of Statistics; 2022. Available from: https://www.abs.gov.au/methodologies/life-expectancy-methodology/2020-2022 [Google Scholar]
  • 68. Bebbington M, Lai C-D, Murthy DNP, Zitikis R. Rational polynomial hazard functions. Int J Performability Eng. 2010;6(1):35. [Google Scholar]
  • 69. Gold MR, Stevenson D, Fryback DG. HALYS and QALYS and DALYS, oh my: similarities and differences in summary measures of population health. Annu Rev Public Health. 2002;23(1):115–34. [DOI] [PubMed] [Google Scholar]
  • 70. Hethcote HW. Three basic epidemiological models. In: Levin SA, Hallam TG, Gross LJ, eds. Applied Mathematical Ecology. Berlin: Springer; 1989. p :119–44. [Google Scholar]
  • 71. Hutchison GB, Shapiro S. Lead time gained by diagnostic screening for breast cancer. J Natl Cancer Inst. 1968;41(3):665–81. [PubMed] [Google Scholar]
  • 72. Jansen RJ, Alexander BH, Anderson KE, Church TR. Quantifying lead-time bias in risk factor studies of cancer through simulation. Ann Epidemiol. 2013;23(11):735–41. DOI: 10.1016/j.annepidem.2013.07.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Wu D, Rosner GL, Broemeling LD. Bayesian inference for the lead time in periodic cancer screening. Biometrics. 2007;63(3):873–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Draisma G, van Rosmalen J. A note on the catch-up time method for estimating lead or sojourn time in prostate cancer screening. Stat Med. 2013;32(19):3332–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Muehleisen RT, Bergerson J. Bayesian calibration—what, why and how. In: International High Performance Buildings Conference. West Lafayette, Indiana, USA. 10–14 July, 2022. [Google Scholar]
  • 76. Wade S, Sarich P, Vaneckova P, et al. Using Bayesian evidence synthesis to quantify uncertainty in population trends in smoking behaviour. Stat Methods Med Res. 2025;34(3):545–560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Hong TP, Gow PJ, Fink M, et al. Surveillance improves survival of patients with hepatocellular carcinoma: a prospective population-based study. Med J Aust. 2018;209(8):348–54. DOI: 10.5694/mja18.00373 [DOI] [PubMed] [Google Scholar]
  • 78. Cancer Council Australia Hepatocellular Carcinoma Surveillance Working Group. Clinical Practice Guidelines for Hepatocellular Carcinoma Surveillance for People at High Risk in Australia. Sydney: Cancer Council Australia; 2023. Available from: https://www.cancer.org.au/clinical-guidelines/liver-cancer/hepatocellular-carcinoma [Google Scholar]
  • 79. Worthington J, He E, Caruana M, et al. An economic evaluation of routine hepatocellular carcinoma surveillance for high-risk patients. medRxiv. 2024. Available from: https://www.medrxiv.org/content/10.1101/2024.02.20.24303111v1
  • 80. Vilar-Gomez E, Calzadilla-Bertot L, Wai-Sun Wong V, et al. Fibrosis severity as a determinant of cause-specific mortality in patients with advanced nonalcoholic fatty liver disease: a multi-national cohort study. Gastroenterology. 2018;155(2):443–57. DOI: 10.1053/j.gastro.2018.04.034 [DOI] [PubMed] [Google Scholar]
  • 81. D’Amico G, Garcia-Tsao G, Pagliaro L. Natural history and prognostic indicators of survival in cirrhosis: a systematic review of 118 studies. J Hepatol. 2006;44(1):217–31. DOI: 10.1016/j.jhep.2005.10.013 [DOI] [PubMed] [Google Scholar]
  • 82. Australian Bureau of Statistics. Life Tables. Australian Bureau of Statistics; 2022. Available from: https://www.abs.gov.au/statistics/people/population/life-tables/latest-release#cite-window1 [Google Scholar]
  • 83. Koffijberg H, Degeling K, IJzerman MJ, Coupé VMH, Greuter MJE. Using metamodeling to identify the optimal strategy for colorectal cancer screening. Value Health. 2021;24(2):206–15. DOI: 10.1016/j.jval.2020.08.2099 [DOI] [PubMed] [Google Scholar]
  • 84. Degeling K, IJzerman MJ, Koffijberg H. A scoping review of metamodeling applications and opportunities for advanced health economic analyses. Expert Rev Pharmacoecon Outcomes Res. 2019;19(2):181–7. DOI: 10.1080/14737167.2019.1548279 [DOI] [PubMed] [Google Scholar]
  • 85. Vahdat V, Alagoz O, Chen JV, Saoud L, Borah BJ, Limburg PJ. Calibration and validation of the Colorectal Cancer and Adenoma Incidence and Mortality (CRC-AIM) microsimulation model using deep neural networks. Med Decis Making. 2023;43(6):719–36. DOI: 10.1177/0272989X231184175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Pineda-Antunez C, Seguin C, van Duuren LA, et al. Emulator-based Bayesian calibration of the CISNET colorectal cancer models. Med Decis Making. 2024;44(5):543–53. DOI: 10.1177/0272989x241255618 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Standfield L, Comans T, Scuffham P. Markov modeling and discrete event simulation in health care: a systematic comparison. Int J Technol Assess Health Care. 2014;30(2):165–72. DOI: 10.1017/S0266462314000117 [DOI] [PubMed] [Google Scholar]
  • 88. Delgado-Vences F, Flandoli F. A spectral-based numerical method for Kolmogorov equations in Hilbert spaces. Infin Dimension Anal Quantum Probab Relat Top. 2016;19(3):1650020. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sj-pdf-1-mdm-10.1177_0272989X251333398 – Supplemental material for Evaluating Semi-Markov Processes and Other Epidemiological Time-to-Event Models by Computing Disease Sojourn Density as Partial Differential Equations

Supplemental material, sj-pdf-1-mdm-10.1177_0272989X251333398 for Evaluating Semi-Markov Processes and Other Epidemiological Time-to-Event Models by Computing Disease Sojourn Density as Partial Differential Equations by Joachim Worthington, Eleonora Feletto, Emily He, Stephen Wade, Barbara de Graaff, Anh Le Tuan Nguyen, Jacob George, Karen Canfell and Michael Caruana in Medical Decision Making


Articles from Medical Decision Making are provided here courtesy of SAGE Publications

RESOURCES