Skip to main content
BMJ Open logoLink to BMJ Open
. 2017 Nov 15;7(11):e018374. doi: 10.1136/bmjopen-2017-018374

Dynamic treatment selection and modification for personalised blood pressure therapy using a Markov decision process model: a cost-effectiveness analysis

Sung Eun Choi 1, Margaret L Brandeau 1, Sanjay Basu 2,3
PMCID: PMC5695480  PMID: 29146652

Abstract

Objective

Personalised medicine seeks to select and modify treatments based on individual patient characteristics and preferences. We sought to develop an automated strategy to select and modify blood pressure treatments, incorporating the likelihood that patients with different characteristics would benefit from different types of medications and dosages and the potential severity and impact of different side effects among patients with different characteristics.

Design, setting and participants

We developed a Markov decision process (MDP) model to incorporate meta-analytic data and estimate the optimal treatment for maximising discounted lifetime quality-adjusted life-years (QALYs) based on individual patient characteristics, incorporating medication adjustment choices when a patient incurs side effects. We compared the MDP to current US blood pressure treatment guidelines (the Eighth Joint National Committee, JNC8) and a variant of current guidelines that incorporates results of a major recent trial of intensive treatment (Intensive JNC8). We used a microsimulation model of patient demographics, cardiovascular disease risk factors and side effect probabilities, sampling from the National Health and Nutrition Examination Survey (2003–2014), to compare the expected population outcomes from adopting the MDP versus guideline-based strategies.

Main outcome measures

Costs and QALYs for the MDP-based treatment (MDPT), JNC8 and Intensive JNC8 strategies.

Results

Compared with the JNC8 guideline, the MDPT strategy would be cost-saving from a societal perspective with discounted savings of US$1187 per capita (95% CI 1178 to 1209) and an estimated discounted gain of 0.06 QALYs per capita (95% CI 0.04 to 0.08) among the US adult population. QALY gains would largely accrue from reductions in severe side effects associated with higher treatment doses later in life. The Intensive JNC8 strategy was dominated by the MDPT strategy.

Conclusions

An MDP-based approach can aid decision-making by incorporating meta-analytic evidence to personalise blood pressure treatment and improve overall population health compared with current blood pressure treatment guidelines.

Keywords: hypertension, cardiovascular disease, personalised medicine


Strengths and limitations of this study.

  • This analysis provides a computational tool to operationalise personalised medicine for blood pressure therapy and determine the optimal treatment for an individual, incorporating a complex variety of individual-level covariates, treatment effect modifiers and risks and benefits of treatment alternatives.

  • This framework can approximate optimal treatment decisions in complex and uncertain environments and can be applied to other disease processes.

  • Based on published data, the health benefits of medications were assumed to be mediated through changes in blood pressure as per current physiological understanding.

  • This study used survey data that are subject to recall biases, acceptability biases and under-reporting that may lead to mis-estimation of baseline covariates, and limits the analysis to the non-institutionalised US population.

Introduction

Personalised medicine seeks to select and modify treatments based on individual patient characteristics and preferences.1 While much of personalised medicine focuses on genetics, an increasing number of studies have suggested that personalised medicine could also assist in improving overall population health outcomes even when genetics are not considered or relevant, by using large-scale data synthesis approaches to improve medical decision-making, particularly when numerous patient features and multifaceted data must be considered.2–4 For example, blood pressure therapy guidelines (the Eighth Joint National Committee, JNC8) currently recommend treating large categories of people (eg, older adults, those with diabetes and those with chronic kidney disease) to attain specific targets.5 A few general classes of medications are recommended based on race and comorbid conditions such as diabetes and chronic kidney disease.

Such simplification poses the dilemma of recommending suboptimal treatment choices with submaximal benefit and substantial side effect risk among some patients, in favour of an approach that may direct appropriate therapies to a majority of patients.3 Any particular individual patient may have some or all of these features (eg, not only features suggesting the need for intensive treatment but also features posing risk for severe side effects), requiring clinicians to judge how intensively to treat patients who have multiple features. The target levels of treatment are themselves also subject to controversy. Lower targets for treatment have been advocated for some patients with high cardiovascular disease (CVD) risk following the Systolic Blood Pressure Intervention Trial (SPRINT) study in which a target of 120 mm Hg for systolic blood pressure (SBP) was found to reduce CVD event risk among patients with very high risk6–8; yet post hoc analyses found considerable heterogeneity in the treatment effects and harms, with some researchers proposing decision rules to find groups of patients more likely to experience benefit or harm from the lower blood pressure target.9

Furthermore, side effects of therapy vary among patients with different features, and the complex matrix of different side effect probabilities is challenging for a clinician to remember and calculate for any given patient.10 Hence, automated synthesis of complex data to enable personalised therapy may be helpful to maximise benefits and minimise risks among patients.

We developed and tested a generalisable computational strategy to help personalise treatment when multiple treatment options, benefits and risks must be considered, using a Markov decision process (MDP) model—a model in which outcomes are partly under the control of a decision-maker and partly based on probabilistic calculations from high-quality meta-analytic data. Prior work suggests that personalising optimal treatment policies using an MDP framework could improve patient health outcomes compared with JNC7 treatment guidelines.11 This prior work focuses on assessing the impact of the MDP-based policies on certain cohorts of patients, not on population health. Our study advances this prior literature by incorporating large-scale meta-analytic and network meta-analytic data sources, providing a full life-course simulation to evaluate long-term impact and using population-representative data to assess the overall national implications of personalised blood pressure treatment selection. We open-sourced the model code to permit its replication, application and potential modification for solving other, similar treatment problems.

Methods

Model overview

We modelled the process of sequentially choosing blood pressure treatment medications with a discrete-time, finite time-horizon MDP, which obtains input data from meta-analyses describing medication choices, their potential treatment effects and side effects among patients with different characteristics. Accounting for patient’s characteristics, our MDP performs optimisation using additional data to compute the quality-adjusted life-years (QALYs) that would be gained from averted myocardial infarctions (MIs) or strokes (the primary benefits of lowering blood pressure) or lost to specific side effects, over a simulated lifetime.12 13 The online supplementary appendix file contains all input data, equations and complete technical details consistent with international model reporting guidelines,14 along with a link to program code for replication and extension of our analysis. Here, we provide an overview of the model’s key components (figure 1, table 1).

Figure 1.

Figure 1

Model schematic. *‘High-risk’ was defined the same as SPRINT and ACCORD trial enrolment eligibility criteria except for type 2 diabetes status. ACCORD, action to control cardiovascular risk in diabetes; BP, blood pressure; CVD, cardiovascular disease; JNC8, Eighth Joint National Committee; MDP, Markov decision process; SBP, systolic blood pressure; SPRINT, Systolic Blood Pressure Intervention Trial; yrs, years.

Table 1.

Model parameters and sources

Parameters Source
Population size of demographic cohorts NHANES 2003–2014
Risk of MI or stroke by demographic group (online supplementary appendix eText 3) Model-based estimates from meta-analysis data19–21
Baseline MI history prevalence (online supplementary appendix eTable 5) NHANES 2003–2014
Baseline stroke history prevalence (online supplementary appendix eTable 6) NHANES 2003–2014
Baseline hypertension medication use prevalence (online supplementary appendix eTable 7) NHANES 2003–2014
Baseline systolic blood pressure (online supplementary appendix eTable 8) NHANES 2003–2014
Baseline total cholesterol (online supplementary appendix eTable 9) NHANES 2003–2014
Baseline HDL cholesterol (online supplementary appendix eTable 10) NHANES 2003–2014
Baseline smoking prevalence (online supplementary appendix eTable 11) NHANES 2003–2014
Baseline type 2 diabetes prevalence (online supplementary appendix eTable 12) NHANES 2003–2014
Baseline chronic kidney disease prevalence (online supplementary appendix eTable 13) NHANES 2003–2014
MI or stroke mortality rate (online supplementary appendix eText 4) Model calibration to national data19–21
All-cause mortality rate CDC22

CDC, Centers for Disease Control and Prevention; HDL, high-density lipoprotein; MI, myocardial infarction; NHANES, National Health and Nutrition Examination Survey.

Supplementary file 1

bmjopen-2017-018374supp001.pdf (1.3MB, pdf)

Model formulation

At each monthly time step t=1,…, T over a patient’s simulated lifetime, the MDP model (online supplementary appendix eText 1) is described with four components: state space (S), action space (A), transition probabilities P(st+1| st, at) and rewards R(s). The cycle length of 1 month was chosen to be in accordance with current clinical guidelines and with the timing of observations for SBP reduction in randomised trials used to inform the model and to capture rapid response in blood pressure to antihypertension medication.6 15

The action space (A) consisted of a finite set of possible actions (treatment decisions). For each state st at time t, three actions, at, were possible: the patient could stop a medication treatment, remain on the current medication treatment(s) and dose level(s) or change medication treatment (by increasing a dosage of a current medication and/or changing the medication) (online supplementary appendix eFigure 1). The state space S, comprising the states at time t (st, consisted of demographic information (age, sex, race/ethnicity), the patient’s CVD-related covariates (age-, sex- and race-dependent chronic kidney disease status, type 2 diabetes status, lipid profile and tobacco smoking status) and the patient’s health state. A patient could enter one of seven health states (online supplementary appendix eFigure 2): (a) no CVD history (no history of MI or stroke); (b) adverse medication side effect but no CVD history; (c) acute MI; (d) acute stroke; (e) post-CVD event (survived MI or stroke); (f) adverse medication side effect with a CVD history or (g) dead from any cause. Transition probabilities P(st+1| st, at) for a certain action were determined from the action and the state, which contains a patient’s characteristics (demographic information and CVD-related covariates) in the current state.

The objective of the MDP was to determine the optimal treatment policy for a patient, π, that maximised the patient’s expected discounted QALYs over the patient’s simulated lifetime, using a standard 3% annual discount rate (online supplementary appendix eText 1). The MDP determines treatment dosage and medication type tailored to individual patients, accounting for their demographic information and CVD-related covariates, rather than following the current treatment guidelines to treat based on set blood pressure targets (online supplementary appendix eFigure 3).

Input data

Input data for the model were taken from previously validated risk calculations and meta-analyses of randomised trials. The effect of blood pressure medication on lowering SBP was estimated as a function of the number of prescribed drugs and the dosage level (full or half dose) of each drug, using equations from previously published studies.16 In our model, dose levels from 0 (no medication) to 4 (four drugs at full dose) in increments of 0.5 were modelled: thus, for example, a dose of 0.5 means a half dose of one particular medicine, and 1 means a full dose of one particular medicine. Patients were allowed to take up to full doses of four different medicines, given evidence of no incremental benefit and substantial harm from side effects when escalating therapy beyond four full doses (online supplementary appendix eText 2).3

The probability that a person transitioned from healthy to either MI or stroke was based on equations previously validated in several diverse cohorts containing more than 23 000 subjects and subsequently against prospective patient-level data on blood pressure and CVD mortality from more than 1 million adults (online supplementary appendix eText 3).17–21 The equations are a function of the set of covariates: age, sex, race/ethnicity (Centers for Disease Control and Prevention categories of non-Hispanic White, non-Hispanic Black, Mexican American and Other), initial SBP and diastolic blood pressure, total and high-density lipoprotein cholesterol, tobacco smoking status, current treatment with blood pressure medication, type 2 diabetes status and previous history of MI or stroke. Covariate values were updated annually to reflect linear age and secular time trends. The degree to which a given SBP reduction from a medication at a particular dosage reduced the risk of MI or stroke was calculated based on a prior meta-analysis of randomised trials.19 Deaths attributable to MI/stroke and competing risks were taken into account as a function of age and sex (online supplementary appendix eText 4).21 22 The probability of severe adverse events (significant side effects leading to clinically significant disability or hospitalisation) was determined by a prior meta-analysis, specific to patient covariates, blood pressure medication choice and dosage (online supplementary appendix eTable 1).

We simulated 10 000 adults aged 18–85 years old over their lifetimes, by repeated sampling with replacement from the correlated covariates in the National Health and Nutrition Examination Survey (NHANES, 2003–2014, n=74 501). We used NHANES survey weights to generate sampling distributions for each covariate,23 and used multivariate sampling with copulas to capture the correlations among the covariates.24 To estimate untreated blood pressure for those NHANES participants on antihypertensive medication, we back-calculated the pretreatment blood pressure for those reporting current blood pressure treatment, using a previously published procedure.3 To ensure face validity, MI and stroke incidence rates were compared through backwards projection while maintaining current blood pressure medications listed in NHANES to years 2003 and 2014, to compare to estimates from the Atherosclerosis Risk in Communities study, the Greater Cincinnati/Northern Kentucky Stroke Study and independent cohort studies from the National Heart Lung and Blood Institute.20 25 26 The validation exercise showed that the model was within 5% absolute error from actual observed incidence rates of MI and stroke (online supplementary appendix eFigures 4 and 5, eTable 2).

Comparative effectiveness analysis

We compared three treatment strategies: (1) JNC8 based on current US guidelines, targeting blood pressure <140/90 mm Hg for persons with chronic kidney disease or diabetes or persons <60 years old and blood pressure <150/90 mm Hg for persons >60 years old and without chronic kidney disease or diabetes5; (2) Intensive JNC8 (JNC8 modified by results of the recent SPRINT study), specifically lowering SBP to 120 mm Hg among high-risk patients (those with >15% 10-year risk of combined MI and stroke) but using the same blood pressure targets as in JNC8 for others6 15; and (3) an MDP-based treatment (MDPT) strategy that determines optimal medication choices based on the MDP (figure 1). To restrict our analysis to the range of possibilities considered safe and effective, we set the number of blood pressure medications to a maximum of 4 per person, and stopped augmentation of blood pressure treatment if a simulated patient’s SBP fell below 120 mm Hg.3 27

As suggested in JNC8 guidelines, for all treatment strategies, initial blood pressure medication choice included a thiazide, calcium channel blocker (CCB), ACE inhibitor (ACE-I) or angiotensin receptor blocker (ARB) in the general non-Black population or a thiazide or CCB for the general Black population. In the JNC8 and Intensive JNC8 strategies, if the target blood pressure was not reached within 1 month, the dosage of the medication was increased, or another medication was added.

We used two outcome metrics to compare the treatment strategies: (i) total discounted QALYs over the lifetime of each simulated individual; and (ii) the incremental cost-effectiveness ratio of the MDPT strategy versus JNC8 and that of Intensive JNC8 versus JNC8.

Cost-effectiveness analysis

Following current cost-effectiveness guidelines,28 29 costs and QALY estimates were integrated over the life-course for all simulated individuals, starting in 2017. Costs of medications and associated adverse events were obtained from the National Inpatient Sample survey and Red Book (online supplementary appendix eTable 3).30 31 Annual disease-specific healthcare costs and the disutility of disease states and side effects to calculate QALYs were based on large-scale survey data (online supplementary appendix eTable 3).32 33 Costs were expressed in 2017 US Dollars using the Consumer Price Index,34and QALYs were discounted at 3% annually.

Sensitivity and uncertainty analyses

We performed sensitivity analyses to assess key sources of variability in the outcome estimates, specifically by varying disutility weights for disease states: treatment-related disutility (QALYs per medication per year of treatment associated with serious adverse events) was varied from 0.001 to 0.02. The disutility from CVD events (separately for MI and stroke) was varied across the range from 0.5 to 0.9 based on variations observed in patient survey data.3 35–37 We performed uncertainty analyses across all simulations by re-running the full model while repeatedly Monte Carlo sampling with replacement 10 000 times (thus, 10 000 discrete runs) from the probability distributions of all input parameters.38

All analyses were performed in R (V.3.2.1, The R Foundation for Statistical Computing, Vienna), with model code available at Stanford Digital Repository (https://sdr.stanford.edu) concurrent with publication.

Results

MDPT strategy versus JNC8

Over a simulated life-course, the MDPT strategy tended to treat patients more intensively than JNC8 (figure 2). The MDPT strategy prescribed 2.34 (95% CI 2.33 to 2.34) medication doses per person, versus 2.22 (95% CI 2.21 to 2.22) under JNC8 (table 2). The greatest increases in dosages from the JNC8 to the MDPT strategy were among high-risk individuals aged less than 60 years; patients were treated more intensively earlier in life under the MDPT strategy when their initial 10-year CVD risk was higher (figure 2 and table 3). These increases in dosages were more prominent among Blacks (online supplementary eFigure 6). By starting to treat patients more intensively earlier in life based on individual risks and estimated benefits, the MDPT strategy averted more CVD events over the course of life, preventing more intensive treatment requirements later in life and thereby saving QALYs (online supplementary eFigure 7 and table 3).

Figure 2.

Figure 2

Treatment dose levels under each treatment strategy. JNC8, Eighth Joint National Committee; MDP, Markov decision process.

Table 2.

Differences in treatment (mean (SE)) and cost-effectiveness analysis

Treatment strategy Treatment dose MI events (%) Stroke events (%) Total QALYs Total cost (US$) Incremental per person QALYs versus JNC8 Incremental per person cost (US$) versus JNC8 ICER versus JNC8
JNC8 2.22
(0.001)
33.21
(0.11)
35.60
(0.11)
18.97
(0.01)
16 459
(10.68)
Intensive JNC8 3.18
(0.001)
32.79
(0.11)
34.99
(0.11)
18.99
(0.01)
17 385
(11.52)
0.02 926 Dominated
MDP-based treatment (MDPT) 2.34
(0.002)
29.99
(0.11)
32.63
(0.11)
19.03
(0.01)
15 272
(9.81)
0.06 −1187 Cost-saving

ICER, incremental cost-effectiveness ratio; JNC8, Eighth Joint National Committee; MDP, Markov decision process; MDPT, MDP-based treatment; MI, myocardial infarction; QALY, quality-adjusted life-year.

Table 3.

Comparison of MDPT versus JNC8 strategy

People treated similarly* by both JNC8 and MDPT strategies People treated more intensively by MDPT strategy People treated more intensively by JNC8 strategy
% of population 12.4 47.8 39.8
Mean initial 10-year CVD risk 13.4 15.2 13.1
Mean post-10 years of treatment of 10-year CVD risk 14.1 14.7 14.7
Averted QALY loss from CVD events, per 1000 patients treated, compared with JNC8 1.39 3.01 1.54
Averted QALY loss from adverse events, per 1000 patients treated, compared with JNC8 −0.03 −0.09 −0.02
Total QALYs saved, per 1000 patients treated, compared with JNC8 40.0 74.9 44.6

*‘Treated similarly’ was defined as rounded mean treatment dose level over the patient’s lifetime being the same between the JNC8 and MDPT strategies.

CVD, cardiovascular disease; JNC8, Eighth Joint National Committee; MDPT, Markov decision process-based treatment; QALY, quality-adjusted life-year.

Treated individuals experienced 19.03 (95% CI 19.01 to 19.05) and 18.97 (95% CI 18.95 to 18.98) total discounted QALYs per person under the MDPT and JNC8 strategies, respectively. Compared with JNC8, the MDPT strategy gained 0.06 (95% CI 0.04 to 0.08) QALYs per person and was cost-saving with estimated discounted savings of US$1187 (95% CI 1168 to 1206) per person. The QALY gains would be realised largely from averting CVD events among high-risk individuals by treating them earlier with relatively lower treatment dosages than they would have received later in life.

The projected QALY gains were larger among men than women because larger shifts from the JNC8-recommended to MDPT-recommended treatments tended to intensify men’s blood pressure treatments due to their higher risk and higher benefit and, in turn, further reduce men’s MI and stroke risks (online supplementary appendix eFigures 7 and 8).

MDPT strategy versus Intensive JNC8 strategy

The Intensive JNC8 strategy tended to treat patients more intensively later in life than the MDPT strategy (figure 2). Over the course of life, individuals under the Intensive JNC8 strategy received 3.18 (95% CI 3.18 to 3.19) doses on average per person per day as compared with 2.34 (95% CI 2.33 to 2.34) doses under the MDPT strategy. Compared with the MDPT strategy, 63.3% of the simulated population received higher dosages over the course of life under the Intensive JNC8 strategy although the initial 10-year CVD risk of those people was significantly lower than that of people treated more intensively by the MDPT strategy (online supplementary appendix eTable 4). By treating based on individual risks, the MDPT strategy tended to treat patients more intensively earlier in life and less intensively at older ages (figure 2).

The Intensive JNC8 strategy was dominated by the MDPT strategy. Treated individuals under the Intensive JNC8 strategy experienced 0.04 (95% CI 0.02 to 0.06) fewer total discounted QALYs per person than those treated under the MDPT strategy, and at higher costs. By waiting to control blood pressure later in life under the Intensive JNC8 strategy, cumulative exposure to high blood pressure from young adulthood to middle age resulted in higher QALY loss due to CVD events (online supplementary appendix eTable 7).

Medication choices and SBP levels

Among Black populations, CCBs and ARBs were the most prescribed medications as the first and second medications in both strategies. In addition, thiazides and ACE-Is (or beta-blockers and thiazides for people with prior CVD history) were prescribed the most as the third and fourth medications, respectively, in the MDPT strategy (online supplementary appendix eFigure 7).

We measured individuals’ SBP levels achieved under each treatment strategy over the simulated period. SBP levels of individuals under the MDPT strategy remained the lowest among individuals aged less than 60 years, at an average of 118.1 mm Hg (95% CI 117.6 to 118.6) in patients aged 18–39 years old and 126.4 mm Hg (95% CI 125.6 to 127.2) in patients aged 40–59 years old (figure 3). For patients older than 60 years old, SBP levels were lowest under the Intensive JNC8 strategy, 139.4 mm Hg (95% CI 138.8 to 140.0) as compared with 148.9 mm Hg (95% CI 148.0 to 149.8) under the MDPT strategy.

Figure 3.

Figure 3

Mean SBP levels of individuals achieved under each treatment strategy. JNC8, Eighth Joint National Committee; MDP, Markov decision process; SBP, systolic blood pressure; trt, treatment.

Results of sensitivity analyses

None of the sensitivity analyses substantially changed our fundamental finding of benefits and cost-savings from the MDPT strategy compared with the Intensive JNC8 and JNC8 strategies (online supplementary appendix eFigures 10 and 11). The QALY weight for the adverse event states in our base-case analysis was 0.999, which is a conservative estimate for serious adverse effects of treatment.3 36 When we varied adverse event state QALY weights from 0.98 to 0.999, the lowest estimated QALY gain from the MDPT strategy compared with the JNC8 strategy was 41.0 (95% CI 21.4 to 60.6) QALYs per 1000 persons (online supplementary appendix eFigure 10). When a QALY weight of 0.98 was assumed for the adverse event states, treated individuals under the Intensive JNC8 did not experience significantly different QALYs than JNC8. With lower QALY weights for adverse events, patients with a CVD history did not have substantially different dose levels under the MDPT strategy (online supplementary appendix eFigure 10), but patients without CVD history had lower dose levels than under Intensive JNC8 and JNC8.

Next, we varied MI and stroke QALY weights from 0.5 to 0.9 for either or both of the disease states. The QALY gains from the MDPT strategy compared with the JNC8 strategy ranged from 56.7 (95% CI: 36.4 to 75.6) to 59.0 (95% CI 39.4 to 78.6) QALYs per 1000 persons for MI and from 56.0 (95% CI 37.1 to 76.3) to 61.0 (95% CI 41.4 to 80.6) QALYs per 1000 persons for stroke (online supplementary appendix eFigure 10). Treatment dose levels were not significantly affected by varying QALY weights for adverse events, stroke or MI (online supplementary appendix eFigure 11).

Discussion

Personalising medical decisions will require considering an increasingly complex variety of individual-level covariates, treatment effect modifiers and risks and benefits of treatment alternatives. Personalising decisions is recommended by current CVD management guidelines, as with guidelines for many other conditions, but how to operationalise such guideline statements remains unclear because guidelines generally recommend therapy based on broad categories of features rather than the complex combinations of features that any individual would have.39 As a result, simple rules of thumb (such as specific blood pressure levels or medication of choice for all people with diabetes) are favoured by physicians due to the burden of performing complex risk/benefit calculations at each step of the treatment decision-making process.3 40Personalising decisions may therefore be made more optimal through the use of automated computational strategies to incorporate individual patient data into individualised treatment recommendations. While several studies have previously suggested that blood pressure therapy should involve calculations of patient risks and benefits, using a Markov simulation model, this is the first time—to our knowledge—that a systematic, comprehensive and automated calculation approach has been proposed to achieve this objective by incorporating patient heterogeneity in risk factors and responses to treatment (probabilistically) and recommending detailed treatment suggestions (doses and types of medication).3 4 From a clinical perspective, our analysis advances the current treatment guidelines beyond the decision of whether to treat high blood pressure because both type and dosage of medication are critical decision points.5 41

We developed an MDP-based model to optimise treatment of blood pressure medication and dose selection, based on currently best-available meta-analytic data and risk calculations. While there remains considerable uncertainty in any model-based treatment selection, we found that even when repeatedly sampling over uncertainty ranges of blood pressure treatment benefit and risk, a substantial improvement could be made over current blood pressure management guidelines not only for individuals but also for the overall population. Our study suggests that initiating blood pressure management earlier for young and middle-aged adults with prehypertension substantially reduces MI and/or stroke occurrences over the course of life; the cumulative QALY benefit from the MDP-based approach was primarily from long-term chronic outcomes of high blood pressure leading to long-term sequelae of MI and strokes. This is particularly notable in light of doubts that personalised medicine can improve population health.31 42 We open-sourced our code to permit replication and extension to other disease processes that similarly involve risk/benefit calculations among a large number of potential treatments with different treatment effects and side effect probabilities among different types of patients, as is currently the case with type 2 diabetes, cancer and HIV.43 44

As with any modelling exercise, our study is limited by the quality of its input data and assumptions. First, we modelled the effects of blood pressure treatment medication based on published data, assuming that the health benefits of medications were mediated through changes in blood pressure as per current physiological understanding. Second, we used data from NHANES, which are subject to the limitations of survey studies, including recall biases, acceptability biases and under-reporting that may lead to mis-estimation of baseline covariates and limits the analysis to the civilian (non-institutionalised) US population.45 Third, we did not incorporate compliance/adherence parameters in the model because our purpose in this model is to compare how a change in guidelines from the current guidelines to the MDP-based strategy would affect overall population-level outcomes under the ideal treatment condition. Adding in a compliance/adherence parameter would simply linearly scale the outcomes to the proportion of patients who adhere, unless we have further data from (not yet extant) randomised trials suggesting that patients would adhere differently to treatment regimens found using the MDP-based approach than to regimens identified under the current guidelines. We suggest that such randomised trials and real-world studies should be performed now that a proof-of-concept model is available, to gather empirical data comparing adherence rates and observed outcomes between the MDP-based and current guideline approaches. Fourth, although we performed uncertainty analyses by sampling from distributions around the input parameter data sources, we cannot capture all possible uncertainties in the model. We focused on MI and stroke because of high-quality, validated risk equations for predicting these outcomes and their risk reduction through therapy. Although congestive heart failure and kidney diseases may also be averted, we did not model them due to the lack of robust and validated risk equations; however, their impact is considered proportional to the impact on MI and stroke3 and, thus, should not affect our comparative effectiveness analysis. Ignoring these potential benefits makes our cost-effectiveness estimates conservative. Fifth, one of the largest ongoing debates in personalised medicine is the fact that model-based personalisation may offer incremental gains beyond large standardised guidelines, but at the risk of producing a ‘black box’. A larger discussion must take place in the literature about the benefits and risks of such model-based personalisation before such measures are considered for practice. A next logical step for research is to perform a head-to-head randomised pilot trial to compare usability, interpretability, patient and provider reactions and patient safety of current guidelines versus our personalised blood pressure treatment selection tool. We note that the result of a small gain in QALYs is expected in the case of blood pressure treatment because only a small subset of patients experience a CVD event; this gain, although small, is cost-saving from a societal perspective. Lastly, we chose MDP over other reinforcement learning approaches due to its simplicity and flexibility. Perfect knowledge about a patient’s states under MDP is a strong assumption, but we chose to use MDP since it is often tractable to solve (exact solution) and relatively easy to specify and because CVD health states are objectively observable by clinicians under nearly all circumstances (unlike with some other diseases).46–50 Partially observed MDPs (POMDPs) are more complex than MDPs and are often computationally intractable. The use of POMDP for determining personalised blood pressure treatment guidelines, which may yield a more realistic but less tractable model, is an area for future research.

The next logical step for analysis is to prospectively test the MDP-based model in clinical settings to identify safety, adoptability and impact on patient outcomes. Existing clinical guidelines are typically easy for clinicians to interpret because they involve univariate decisions. To make use of the MDP-based approach, clinicians would need to shift conceptually from a univariate to a multivariate decision process as well as accept computationally complex ‘black box’ results (as they do currently with some imaging and pathological diagnostic guidelines). An additional logical step for future research is to identify whether QALY values covary meaningfully among individuals as a function of their risk, age and prior treatment. Use of personalised QALYs would enable more personalised decision-making rather than assuming that QALY weights of CVD outcomes or serious adverse events are consistent across the population. For example, some persons may favour lower risk of side effects than CVD events if they are concerned about short-term suffering rather than long-term mortality (eg, if they are near the end of life).

While these remain important topics for future research, our current findings indicate that blood pressure treatment policies informed by a Markov decision process framework may improve patient outcomes compared with the use of standardised target-based guidelines, by accounting for individual patient covariates in treatment decision-making processes, and are likely to be cost-saving compared with current guidelines.

Supplementary Material

Reviewer comments
Author's manuscript

Footnotes

Contributors: SEC, MLB and SB: study conception and design, analysis and interpretation of data, and critical revision of the manuscript. SEC and SB: statistical analysis, acquisition of data and drafting of the manuscript. SB obtained funding.

Funding: Research reported in this publication was supported by the National Heart, Lung and Blood Institute and National Institute on Minority Health And Health Disparities of the National Institutes of Health under award numbers K08HL121056 and DP2MD010478.

Disclaimer: The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Competing interests: None declared.

Provenance and peer review: Not commissioned; externally peer reviewed.

Data sharing statement: No additional data are available.

References

  • 1.Kotchen TA, Cowley AW, Liang M. Ushering hypertension into a new era of precision medicine. JAMA 2016;315:343–4. 10.1001/jama.2015.18359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Basu S, Yudkin JS, Sussman JB, et al. Alternative strategies to achieve cardiovascular mortality goals in China and India: a microsimulation of target- versus risk-based blood pressure treatment. Circulation 2016;133:840–8. 10.1161/CIRCULATIONAHA.115.019985 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sussman J, Vijan S, Hayward R. Using benefit-based tailored treatment to improve the use of antihypertensive medications. Circulation 2013;128:2309–17. 10.1161/CIRCULATIONAHA.113.002290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Muntner P, Whelton PK. Using predicted cardiovascular disease risk in conjunction with blood pressure to guide antihypertensive medication treatment. J Am Coll Cardiol 2017;69:2446–56. 10.1016/j.jacc.2017.02.066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.James PA, Oparil S, Carter BL, et al. 2014 Evidence-based guideline for the management of high blood pressure in adults: report from the panel members appointed to the Eighth Joint National Committee (JNC 8). JAMA 2014;311:507–20. 10.1001/jama.2013.284427 [DOI] [PubMed] [Google Scholar]
  • 6.Wright JT, Williamson JD, Whelton PK, et al. A randomized trial of intensive versus standard blood-pressure control. N Engl J Med 2015;373:2103–16. 10.1056/NEJMoa1511939 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Perkovic V, Rodgers A. Redefining blood-pressure targets—SPRINT starts the marathon. N Engl J Med 2015;373:2175–8. 10.1056/NEJMe1513301 [DOI] [PubMed] [Google Scholar]
  • 8.Mente A, O’Donnell MJ, Rangarajan S, et al. Association of urinary sodium and potassium excretion with blood pressure. N Engl J Med Overseas Ed 2014;371:601–11. 10.1056/NEJMoa1311989 [DOI] [PubMed] [Google Scholar]
  • 9.Patel KK, Arnold SV, Chan PS, et al. Personalizing the intensity of blood pressure control: modeling the heterogeneity of risks and benefits from SPRINT (Systolic Blood Pressure Intervention Trial). Circ Cardiovasc Qual Outcomes 2017;10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kent DM, Alsheikh-Ali A, Hayward RA. Competing risk and heterogeneity of treatment effect in clinical trials. Trials 2008;9:30 10.1186/1745-6215-9-30 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Schell GJ, Marrero WJ, Lavieri MS, et al. Data-driven Markov decision process approximations for personalized hypertension treatment planning. MDM Policy Pract 2016;1:1–9. 10.1177/2381468316674214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Choi SE, Brandeau ML, Basu S. Expansion of the National Salt Reduction Initiative: a mathematical model of benefits and risks of population-level sodium reduction. Med Decis Making 2016;36:72–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Choi SE, Seligman H, Basu S. Cost effectiveness of subsidizing fruit and vegetable purchases through the supplemental nutrition assistance program. Am J Prev Med 2017;52:e147–e155. 10.1016/j.amepre.2016.12.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rahmandad H, Sterman JD. Reporting guidelines for simulation-based research in social sciences. Syst Dyn Rev 2012;28:396–411. 10.1002/sdr.1481 [DOI] [Google Scholar]
  • 15.Cushman WC, Evans GW, Byington RP, et al. Effects of intensive blood-pressure control in type 2 diabetes mellitus N Engl J Med 2010;362:1575–85. 10.1056/NEJMoa1001286 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Law MR, Morris JK, Wald NJ. Use of blood pressure lowering drugs in the prevention of cardiovascular disease: meta-analysis of 147 randomised trials in the context of expectations from prospective epidemiological studies. BMJ 2009;338:b1665 10.1136/bmj.b1665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.D’Agostino RB, Grundy S, Sullivan LM, et al. Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. JAMA 2001;286:180–7. [DOI] [PubMed] [Google Scholar]
  • 18.D’Agostino RB, Vasan RS, Pencina MJ, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation 2008;117:743–53. 10.1161/CIRCULATIONAHA.107.699579 [DOI] [PubMed] [Google Scholar]
  • 19.Lewington S, Clarke R, Qizilbash N, et al. Age-specific relevance of usual blood pressure to vascular mortality: a meta-analysis of individual data for one million adults in 61 prospective studies. Lancet 2002;360:1903–13. [DOI] [PubMed] [Google Scholar]
  • 20.National Heart Lung and Blood Institute. Incidence and prevalence: chart book on cardiovascular and lung diseases. 2006. http://www.nhlbi.nih.gov/resources/docs/cht-book_ip.htm (accessed 15 Jun 2017).
  • 21.Smith-Spangler CM, Juusola JL, Enns EA, et al. Population strategies to decrease sodium intake and the burden of cardiovascular disease: a cost-effectiveness analysis. Ann Intern Med 2010;152:W170–3. 10.7326/0003-4819-152-8-201004200-00212 [DOI] [PubMed] [Google Scholar]
  • 22.Centers for Disease Control and Prevention. United States life tables. 2013. http://www.cdc.gov/nchs/products/life_tables.htm (accessed 15 May 2017).
  • 23.Centers for Disease Control and Prevention. Continuous NHANES web tutorial. http://www.cdc.gov/nchs/tutorials/Nhanes/index_continuous.htm (accessed 5 Jan 2016).
  • 24.Ripley B, Venables B, Bates DM, et al. MASS: Support functions and datasets for Venables and Ripley’s MASS. https://cran.r-project.org/web/packages/MASS/MASS.pdf (accessed 13 Jun 2017).
  • 25.ARIC Investigators. The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. Am J Epidemiol 1989;129:687–702. [PubMed] [Google Scholar]
  • 26.Broderick J, Brott T, Kothari R, et al. The Greater Cincinnati/Northern Kentucky Stroke Study: preliminary first-ever and total incidence rates of stroke among blacks. Stroke 1998;29:415–21. 10.1161/01.STR.29.2.415 [DOI] [PubMed] [Google Scholar]
  • 27.Basu S, Sussman JB, Hayward RA. Detecting heterogeneous treatment effects to guide personalized blood pressure treatment: a modeling study of randomized clinical trials. Ann Intern Med 2017;166:354–60. 10.7326/M16-1756 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Weinstein MC, Siegel JE, Gold MR, et al. Recommendations of the panel on cost-effectiveness in health and medicine. JAMA 1996;276:1253–8. 10.1001/jama.1996.03540150055031 [DOI] [PubMed] [Google Scholar]
  • 29.World Health Organization. WHO guide to cost-effectiveness analysis. http://www.who.int/choice/publications/p_2003_generalised_cea.pdf?ua=1 (accessed 2 Feb 2016).
  • 30.RED BOOK Drug References. http://www.micromedexsolutions.com/micromedex2/4.34.0/WebHelp/RED_BOOK/Introduction_to_REDB_BOOK_Online.htm (accessed 17 Apr 2017).
  • 31.Khoury MJ, Galea S. Will precision medicine improve population health? JAMA 2016;316:1357–8. 10.1001/jama.2016.12260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Agency for Healthcare Research and Quality. Disparities in health care quality among racial and ethnic minority groups. http://www.ahrq.gov/research/findings/nhqrdr/nhqrdr10/minority.pdf.
  • 33.Institute for Health Metrics and Evaluation. Global Burden of Disease Study 2010 (GBD 2010) Disability Weights. 2010. http://ghdx.healthdata.org/record/global-burden-disease-study-2010-gbd-2010-disability-weights (accessed 22 Feb 2016).
  • 34.Bureau of Labor Statistics. Consumer Price Index (CPI) Inflation Calculator. http://www.bls.gov/data/inflation_calculator.htm (accessed 5 Jan 2016).
  • 35.Salomon JA, Vos T, Hogan DR, et al. Common values in assessing health outcomes from disease and injury: disability weights measurement study for the Global Burden of Disease Study 2010 (vol 380, pg 2129, 2012). Lancet 2013;381:628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Pignone M, Earnshaw S, Pletcher MJ, et al. Aspirin for the primary prevention of cardiovascular disease in women: a cost-utility analysis. Arch Intern Med 2007;167:290–5. 10.1001/archinte.167.3.290 [DOI] [PubMed] [Google Scholar]
  • 37.Fryback DG, Dasbach EJ, Klein R, et al. The Beaver Dam Health Outcomes Study: initial catalog of health-state quality factors. Med Decis Making 1993;13:89–102. [DOI] [PubMed] [Google Scholar]
  • 38.Briggs AH, Weinstein MC, Fenwick EA, et al. Model parameter estimation and uncertainty analysis: a report of the ISPOR-SMDM Modeling Good Research Practices Task Force Working Group-6. Med Decis Making 2012;32:722–32. [DOI] [PubMed] [Google Scholar]
  • 39.Dahabreh IJ, Hayward R, Kent DM. Using group data to treat individuals: understanding heterogeneous treatment effects in the age of precision medicine and patient-centred evidence. Int J Epidemiol 2016:dyw125 10.1093/ije/dyw125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chobanian AV. Hypertension in 2017-What is the right target? JAMA 2017;317:579–80. 10.1001/jama.2017.0105 [DOI] [PubMed] [Google Scholar]
  • 41.Chobanian AV, Bakris GL, Black HR, et al. Seventh report of the Joint National Committee on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure. Hypertension 2003;42:1206–52. 10.1161/01.HYP.0000107251.49515.c2 [DOI] [PubMed] [Google Scholar]
  • 42.Bayer R, Galea S. Public Health in the precision-medicine era. N Engl J Med 2015;373:499–501. 10.1056/NEJMp1506241 [DOI] [PubMed] [Google Scholar]
  • 43.Palmer SC, Mavridis D, Navarese E, et al. Comparative efficacy and safety of blood pressure-lowering agents in adults with diabetes and kidney disease: a network meta-analysis. Lancet 2015;385:2047–56. 10.1016/S0140-6736(14)62459-4 [DOI] [PubMed] [Google Scholar]
  • 44.Palmer SC, Mavridis D, Nicolucci A, et al. Comparison of clinical outcomes and adverse events associated with glucose-lowering drugs in patients with type 2 diabetes: a meta-analysis. JAMA 2016;316:313–24. 10.1001/jama.2016.9400 [DOI] [PubMed] [Google Scholar]
  • 45.Kreider B, Pepper JV, Gundersen C, et al. Identifying the effects of SNAP (food stamps) on child health outcomes when participation is endogenous and misreported. J Am Stat Assoc 2012;107:958–75. 10.1080/01621459.2012.682828 [DOI] [Google Scholar]
  • 46.Collaborative Health Studies Coordinating Center. MESA overview and protocol. https://www.mesa-nhlbi.org/aboutMESAOverviewProtocol.aspx (accessed 25 Sep 2017).
  • 47.Keku E, Rosamond W, Taylor HA, et al. Cardiovascular disease event classification in the Jackson Heart Study: methods and procedures. Ethn Dis 2005;15:S6-62–70. [PubMed] [Google Scholar]
  • 48.Framingham Heart Study. Phenotypic Data and Table of Exams. https://www.framinghamheartstudy.org/researchers/description-data/ (accessed 25 Sep 2017).
  • 49.University of Alabama at Birmingham Division of Preventive Medicine. CARDIA follow-up contacts and manual of operations. https://www.cardia.dopm.uab.edu/images/more/recent/CARDIA_Follow-up_Contacts_MOO_05-11-2016.pdf (accessed 25 Sep 2017).
  • 50.Collaborative Studies Coordinating Center. Surveillance procedures—coronary heart disease. Updated manual 3. http://www2.cscc.unc.edu/aric/sites/default/files/public/manuals/UpdatesManual3_20151112.pdf (accessed 25 Sep 2017).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary file 1

bmjopen-2017-018374supp001.pdf (1.3MB, pdf)

Reviewer comments
Author's manuscript

Articles from BMJ Open are provided here courtesy of BMJ Publishing Group

RESOURCES