Abstract
Background
Marginal structural models are an important tool for observational studies. These models typically assume that variables are measured without error. We describe a method to account for differential and non-differential measurement error in a marginal structural model.
Methods
We illustrate the method estimating the joint effects of antiretroviral therapy initiation and current smoking on all-cause mortality in a United States cohort of 12,290 patients with HIV followed for up to 5 years between 1998 and 2011. Smoking status was likely measured with error, but a subset of 3686 patients who reported smoking status on separate questionnaires composed an internal validation subgroup. We compared a standard joint marginal structural model fit using inverse probability weights to a model that also accounted for misclassification of smoking status using multiple imputation.
Results
In the standard analysis, current smoking was not associated with increased risk of mortality. After accounting for misclassification, current smoking without therapy was associated with increased mortality [hazard ratio (HR): 1.2 (95% CI: 0.6, 2.3)]. The HR for current smoking and therapy (0.4 (95% CI: 0.2, 0.7)) was similar to the HR for no smoking and therapy (0.4; 95% CI: 0.2, 0.6).
Conclusions
Multiple imputation can be used to account for measurement error in concert with methods for causal inference to strengthen results from observational studies.
Keywords: Bias (Epidemiology), epidemiologic measurements, HIV, acquired immunodeficiency syndrome, Highly Active Antiretroviral Therapy, Confounding Factors (Epidemiology)
Marginal structural models have become widely used tools to estimate the effects of exposures in the presence of time-varying confounding and selection bias1. These models have been essential in HIV, where time-varying confounding by biomarkers limits the use of standard methods when estimating the effects of antiretroviral therapy and other exposures in observational studies2–6. Similarly, joint marginal structural models have been proposed7 and applied8,9 to estimate the joint effects of two or more exposures when one or more exposures are subject to time-varying confounding affected by prior exposure.
With few exceptions10–12, most implementations of both standard and joint marginal structural models assume that exposures are measured without error, but exposure measurement is often imperfect. Here, we estimate the joint effects of therapy initiation and current smoking (a known risk factor for death among patients in treatment for HIV13,14) on all-cause mortality in the Centers for AIDS Research Network of Integrated Clinical Systems (CNICS), a large clinical cohort of patients infected with HIV in the United States. In this cohort, smoking status reported by health care providers in the medical record was prone to measurement error, but information on smoking status was also available from a subgroup of patients who completed a clinical assessment of smoking status using detailed self-report instruments. We estimate the joint effects of therapy initiation and current smoking by combining multiple imputation to account for misclassification of smoking status with inverse probability weighted estimation of a joint marginal structural model. We also illustrate the finite sample properties of the proposed method using a limited Monte Carlo simulation.
METHODS
We are interested in the joint effects of initiation of antiretroviral therapy and current smoking on mortality, as parameterized by a set of three hazard ratios. This set of parameters can be identified using a 2-by-2 factorial randomized trial in which therapy-naïve patients were randomized to 1 of 4 arms: 1) initiate therapy and smoke; 2) initiate therapy and do not smoke; 3) do not initiate therapy and smoke; and 4) do not initiate therapy and do not smoke. The three hazard ratios of interest compare arms 1–3 above with arm 4. Because the prevalence of smoking among patients with HIV is high, many patients recruited for such a trial would likely have smoked prior to study entry, but the trial would estimate the effect of smoking only during the study period. Of course, if trial participants had exposure to smoking prior to randomization, then there would be interest in exploring how smoking history possibly modifies the effect of current smoking.
To consistently estimate these parameters using observational data, we must additionally assume that we can adequately account for confounding of both exposures, including confounding by smoking history at study entry.
Study population
CNICS was developed to support population-based HIV research in the United States15. The CNICS cohort includes over 29,000 HIV-positive adults engaged in clinical care from January 1, 1995 to the present (2014) at 8 Centers for AIDS Research sites (Case Western Reserve University; Fenway Community Health Center of Harvard University; Johns Hopkins University; University of Alabama at Birmingham; University of California, San Diego; University of California, San Francisco; University of North Carolina; and University of Washington). All patients attending two primary HIV medical care visits at one of the eight study sites are eligible for CNICS and followed for clinical events, lab measurements, and medications while they remain in care at study sites. Institutional review boards at each site approved study protocols. Participants provided written informed consent to be included in the CNICS cohort or contributed administrative and/or clinical data with a waiver of written informed consent where approved by local Institutional Review Boards.
CNICS includes 13,241 patients who entered care at a CNICS site prior to initiation of antiretroviral therapy and had a viral load of at least 500 copies/mL at cohort entry between January 1, 1998 and December 31, 2011. We excluded patients who did not have both CD4 cell count and viral load recorded within one year of study entry (n=152), and who were missing information on race (n=438) or transmission risk factor (n=361). The remaining 12,290 patients were included in this analysis.
Therapy initiation and smoking status
Therapy initiation was defined as the date on which patients were first prescribed three or more antiretroviral agents. After therapy initiation, patients were assumed to be on therapy throughout follow-up under an observational analog to the intention-to-treat assumption4. As in prior observational analyses3,16, we estimate the effect of initiating therapy rather than being on therapy.
Patients classified as current smokers at any time during the study period were considered current smokers for the entire study period. Information on smoking status was available for all patients through provider documentation on patient medical records. However, smoking status as documented in the medical records was likely measured with error, and the absence of a positive report was assumed to imply nonsmoking. Moreover the validity of health care provider-reported smoking status was possibly influenced by predictors of mortality.
A subset of 3686 patients from five sites self-reported smoking status on the CNICS clinical assessment using a web-based survey software application implemented during clinical care. These patients composed an internal validation subgroup. Patient-reported current smoking status was considered less error-prone and used as a “gold standard” measure of smoking classification17.
Mortality ascertainment
The outcome of interest was all-cause mortality. Deaths were obtained through clinic sources, death certificates, and the United States vital status databases. Each CNICS site maintains a registry of deaths among patients at that site and regularly queries the Social Security Death Index and/or National Death Index to confirm mortality data.
Statistical methods
The 12,290 patients were followed from study entry until death, loss to follow-up, December 31, 2011, or 5 years after study entry. Patients were considered to be lost to follow-up after 12 consecutive months without a documented clinic visit including a CD4 cell count or viral load measurement.
We aim to estimate the joint effects of current smoking and therapy initiation using the joint marginal structural Cox model
(1) |
where the effects of interest are the hazard ratios exp(β1), exp(β2), and exp(β1 + β2 + β3) for the joint effects of no therapy and smoking, therapy and no smoking, and therapy and smoking on mortality, respectively, relative to the hazard among nonsmoking patients not on therapy. S represents the gold standard (patient-reported) smoking status and A(t) is an indicator of therapy initiation by time t.
Because gold-standard smoking status S is not available for all patients, we must classify patients according to provider-reported smoking status if we do not wish to limit our analysis to the subset of patients who filled out detailed smoking status information on the electronic questionnaires. The standard analysis estimates the parameters β = {β1, β2, β3}of equation 1 as θ = {θ1, θ2, θ3} in equation 2 by fitting the marginal structural Cox model
(2) |
where smoking is measured by provider-reported smoking status (S*) rather than the gold-standard, patient reported smoking status (S). We fit the marginal structural model using inverse-probability weights to create a pseudo-population in which neither exposure is associated with the confounders measured prior to the exposure2,3,18,19. Details on the implementation of the standard marginal structural model shown above are provided in section A of the eAppendix.
Analysis accounting for measurement error
To estimate hazard ratios for the joint effects of current smoking and therapy initiation that were not biased by misclassification of provider-reported smoking status, we used multiple imputation to impute S, the patient-reported measure of current smoking status, prior to fitting the marginal structural model. Multiple imputation is a technique to account for missing data21–24. Viewing misclassification of current smoking status as a missing data problem in which S is missing for patients outside the validation subgroup facilitates the use of missing data methods, like multiple imputation, to account for measurement error25,26.
We used information on the relationship between variables that were measured for all patients and patient-reported current smoking status in the validation subgroup to impute S for all other patients in each of M imputations. Details are provided in the Appendix.
In each of the M imputations, we fit the marginal structural Cox model
(6) |
by maximizing the weighted partial likelihood in each imputation
(7) |
where Yi represents the time to the first of mortality or censoring and δi is an event indicator for patient i. Note that distinct values of the parameters γ are estimated in each imputation m.
Weights were estimated for each month in each imputation as Wm(t) = WSmWA(Sm)(t). Smoking weights were estimated as WSm = f(Sm)/f(Sm|L) and treatment weights were estimated as
(8) |
where ⌊t⌋ is the count of completed months from study entry to time t and L is a vector of time-fixed covariates including age, sex, race, ethnicity, and year, HIV transmission risk factor (indicators of being a man who has sex with men and injection drug use), CD4 cell count, and viral load at study entry.
Assuming that inclusion in the subset of patients with detailed smoking information depends only on the observed covariates, the parameters of interest, exp(β1), exp(β2), and exp(β1 + β2 + β3) can be consistently estimated by exp(γ̄1), exp(γ̄2), and exp(γ̄1 + γ̄2 + γ̄3), respectively, where γ̄p is the average γp across all imputations,
(9) |
A large-sample estimate of the variance for γ̄p is given by
(10) |
where the within imputation variance, , is the robust variance for 3,27. In this example, we set M = 20.
We also graphically present the cumulative incidence of mortality for patients in each of the four groups in the standard analysis and analysis accounting for measurement error19,28. Statistical analyses were performed using SAS version 9.3 (SAS Institute, Inc., Cary, NC), and SAS code is provided in Section B of the eAppendix.
We further explored the finite-sample properties of multiple imputation to account for measurement error in marginal structural models using simulations. The bias, 95% confidence interval coverage, power, and mean squared error were evaluated for the standard marginal structural model and the marginal structural model using multiple imputation to account for measurement error that is non-differential (scenario 1) and differential (scenario 2) with respect to the outcome. Details about the simulations are provided in Section C of the eAppendix.
RESULTS
Table 1 describes the characteristics of the study population at study entry and over 36,418 person-years of follow-up. Of the 12,290 patients included in the study, 82% were male, 39% were black, and 60% were men who have sex with men. At study entry 19% had been diagnosed with AIDS. Provider diagnoses identified 39% of the cohort as current smokers, and 69% of the patients initiated antiretroviral therapy before the end of follow-up. Over the 5 years of follow-up, 935 patients died, and 5137 were lost to follow-up.
Table 1.
Characteristics | All patients at study entry (n=12,290) |
All patients over 36,418 person-years |
Validation subgroup at study entry (n=3686) |
Validation subgroup over 12,157 person-years |
||
---|---|---|---|---|---|---|
n | % | % | n | % | % | |
Male sex | 10034 | 82 | 81 | 3217 | 87 | 87 |
Black race | 4841 | 39 | 39 | 1044 | 28 | 28 |
Hispanic ethnicity | 1342 | 11 | 11 | 533 | 14 | 14 |
Injection drug user | 2295 | 19 | 18 | 538 | 15 | 15 |
MSM | 7340 | 60 | 60 | 2652 | 72 | 71 |
AIDS | 2132 | 19 | 26 | 487 | 13 | 21 |
CD4 cell count | ||||||
<250 | 5131 | 42 | 26 | 1461 | 40 | 23 |
250 – 500 | 4235 | 34 | 40 | 1344 | 36 | 41 |
>500 | 2924 | 24 | 34 | 881 | 24 | 36 |
Suppressed viral load b | 0 | 0 | 52 | 0 | 0 | 40 |
Physician-reported current smoker c | 4823 | 39 | 42 | 1596 | 43 | 45 |
Patient-reported current smoker d | - e | - e | - e | 1687 | 46 | 44 |
On therapy | 0 | 0 | 67 | 0 | 0 | 73 |
No therapy, nonsmoker f | 7467 | 61 | 20 | 1999 | 54 | 13 |
No therapy, smokerf | 4823 | 39 | 14 | 1687 | 46 | 14 |
On therapy, nonsmokerf | 0 | 0 | 39 | 0 | 0 | 42 |
On therapy, smokerf | 0 | 0 | 28 | 0 | 0 | 31 |
CNICS: Centers for AIDS Research Network of Integrated Clinical Systems; MSM: men who have sex with men
<500 copies/mL; only patients with unsuppressed viral load at cohort entry were included in the study
Provider-reported smoking was likely measured with error
Patient-reported smoking was the gold-standard measure of smoking status
Patient-reported smoking status was not available for patients outside the validation subgroup
Smoking status is provider-reported smoking status for all patients and patient-reported smoking status for patients in the validation subgroup
A subset of 3686 patients completed detailed self-reported smoking status as part of the CNICS clinical assessment of patient reported outcomes during the study period and were included in the validation subgroup. Patients in the validation subgroup were similar to all patients at the sites where they were enrolled but more likely to be male and/or men who have sex with men, and less likely to be black, injection drug users, or have AIDS at baseline than patients in the main cohort. In the validation subgroup, 43% of patients had a provider diagnosis as a current smoker, while 46% reported themselves that they were current smokers. Using self-reported current smoking status as the gold standard, the sensitivity of provider-diagnosed smoking was 74% (95% CI: 72%, 76%) and the specificity was 83% (95% CI: 81%, 85%) in the validation subgroup. Of the 3686 patients in the validation subgroup, 52 died and 855 were lost to follow-up over 5 years.
Table 2 presents hazard ratios comparing mortality between exposure groups in the standard analysis and in the analysis accounting for measurement error of smoking status. In the standard joint marginal structural model, the hazard ratio (HR) comparing smoking and not initiating therapy to not smoking and not initiating therapy was a counterintuitive 0.6 (95% CI: 0.6, 0.8). The HR for not smoking and initiating therapy was 0.3 (95% CI: 0.3, 0.4), and the HR for smoking and initiating therapy was 0.3 (95% CI: 0.2, 0.3). These results suggest that smoking (with and without therapy) was protective of 5-year total mortality. Figure 1 presents the risk of mortality over time for each exposure group in the standard analysis.
Table 2.
Model | Standard | Accounting for measurement error |
||||
---|---|---|---|---|---|---|
Mortality | HR | 95% CI | Mortality | HR | 95% CI | |
Crude | ||||||
No therapy, no smoking | 10.5 | 1 | 11.6 | 1 | ||
No therapy, smoking | 10.8 | 0.8 | 0.7, 1.0 | 12.3 | 1.2 | 0.6, 2.5 |
Therapy, no smoking | 13.6 | 0.9 | 0.8, 1.1 | 10.8 | 1.1 | 0.7, 1.5 |
Therapy, smoking | 11.8 | 0.9 | 0.8, 1.2 | 11.8 | 1.2 | 0.6, 2.2 |
Marginal Structural Model | ||||||
No therapy, no smoking | 22.6 | 1 | 22.4 | 1 | ||
No therapy, smoking | 22.1 | 0.6 | 0.6, 0.8 | 22.9 | 1.2 | 0.6, 2.3 |
Therapy, no smoking | 10.0 | 0.3 | 0.3, 0.4 | 9.3 | 0.4 | 0.2, 0.6 |
Therapy, smoking | 9.6 | 0.3 | 0.2, 0.3 | 10.4 | 0.4 | 0.2, 0.7 |
HR: hazard ratio; CI: confidence interval
After accounting for the measurement error in current smoking status, the prevalence of smoking in the entire cohort was 50%. Compared to not smoking and not initiating therapy, the mortality hazard ratio for smoking and not initiating therapy was 1.2 (95% CI: 0.6, 2.3). Compared to not smoking and not initiating therapy, the HR for not smoking and initiating therapy was 0.4 (95% CI: 0.2, 0.6), and the HR for smoking and initiating therapy was similar at 0.4 (95% CI: 0.2, 0.7). Figure 2 presents the risk of mortality over time for each exposure group after accounting for measurement error. The HR for smoking and initiating therapy was similar to the expected HR under multiplicativity of the effects of smoking and therapy (0.4) and farther from the null than the expected HR under additivity (0.6).
Table 3 presents the results from Monte Carlo simulations. In 2000 simulated cohorts each of 2000 patients, the multiple imputation approach produced an HR with little bias and appropriate coverage under conditions mimicking the data used in this analysis. When exposure misclassification was differential with respect to the outcome, the multiple imputation approach that assumed non-differential misclassification yielded results with more bias than the multiple imputation approach accounting for the differential misclassification, but much less bias than the standard marginal structural model.
Table 3.
Method | Sensitivity | Specificity | Average HR | Biasb | SD c (Bias) |
Coverage d | Power e | MSE f |
---|---|---|---|---|---|---|---|---|
Scenario 1 g | ||||||||
Truth | 1 | 1 | 1.73 | |||||
Standard MSM | 0.7 | 0.8 | 1.15 | 41.00 | 4 | 0 | 70 | 1162 |
MSM with MI | 0.7 | 0.8 | 1.73 | 0.09 | 12 | 97 | 99 | 137 |
Scenario 2 h | ||||||||
Truth | 1 | 1 | 1.73 | |||||
Standard MSM | 0.6 (noncases) 0.95 (cases) |
0.7 (noncases) 0.9 (cases) |
1.45 | 17.00 | 5 | 22 | 99 | 314 |
MSM with MI (nondifferential) | 0.6 (noncases) 0.95 (cases) |
0.7 (noncases) 0.9 (cases) |
1.74 | 0.70 | 12 | 96 | 99 | 139 |
MSM with MI (differential) | 0.6 (noncases) 0.95 (cases) |
0.7 (noncases) 0.9 (cases) |
1.73 | 0.36 | 11 | 97 | 99 | 121 |
Se: Sensitivity; Sp: Specificity; MSM: Marginal structural model; MI: Multiple Imputation; CI: Confidence interval
Bias was defined as the average of the difference between the true ln(HR) and the estimated ln(HR) in each simulated scenario times 100
Standard deviation of the bias
95% confidence interval coverage was the proportion of iterations in which the estimated 95% confidence interval contained the true value
Power was the proportion of simulations in which the estimated 95% confidence interval excluded the null value
Mean squared error was the sum of the square of the bias and the square of the standard deviation of the bias
In the first scenario, exposure misclassification was nondifferential with respect to the outcome
In the second scenario, exposure misclassification was differential with respect to the outcome
DISCUSSION
Results from a joint marginal structural model that accounted for misclassification of current smoking status suggested that patients who smoked had slightly higher risk of mortality under no therapy initiation (23% vs. 22%) and under therapy initiation (10% vs. 9%) over 5 years. These findings are consistent with existing literature on risks of smoking among individuals in care for HIV13,14,29,30. In the standard joint marginal structural model, which did not account for measurement error of smoking status, current smoking (counter to expectations) appeared to decrease the risk of mortality.
As in existing studies highlighting the discrepancies between electronic health records and patient report outcomes17, provider classification of current smoking status on electronic health records was a poor proxy for self-reported smoking. After accounting for measurement error of smoking status, the prevalence of current smoking in the CNICS was similar to the prevalence of self-reported smoking reported in other cohorts of patients with HIV13,29,31. Existing studies conducted among therapy initiators report that smoking sometimes is associated with poorer adherence to therapy29,32,33, inferior immunological markers29,34, and increased risk of mortality29, leading investigators to suggest that smoking may make therapy less effective29,34,35. Studies in the era prior to effective therapy found no effect of smoking on mortality36–38. In this analysis, conducted in the current HIV treatment era, we report somewhat increased mortality for smokers under no therapy initiation, which has not been examined in previous studies.
Here, we have described a method to estimate the joint effects of therapy and current smoking from observational data when smoking status may be misclassified. These joint effects can be conceptualized as the effects of joint interventions on therapy and current smoking. Estimating these joint effects in observational data requires accounting for confounding of both therapy initiation and current smoking. Because CNICS is based on data collected at routine clinic visits, we were unable to include information not routinely collected at an HIV patient visit, meaning that we may have residual confounding by unmeasured covariates. In addition, CNICS does not include information for most patients on smoking prior to study entry, which is likely to affect both smoking during the study and mortality. However, in analyses conducted among only patients who completed detailed questionnaires, the effect of smoking on mortality was similar when estimated using weights that included prior smoking (HR: 1.6; 95% CI: 0.8, 3.3) and when estimated using weights that did not include prior smoking (HR: 1.6; 95% CI: 0.9, 2.8), lending support to our results despite possible residual confounding by prior smoking status.
Alternatively, we could consider current smoking as a modifier of the effect of therapy on mortality. Because a modifier is not required to be unconfounded, we could examine modification by current smoking status by fitting the structural model proposed above with only the treatment weights. Numerical results are similar when smoking is considered to be an effect modifier, though we interpret the results as stratum specific effects of therapy, rather than joint effects. The HR for the effect of treatment on mortality was not modified by smoking status; the HR was 0.4 (95% CI: 0.2, 0.6) for nonsmokers and 0.4 (95% CI: 0.2, 0.6) for smokers.
We also assume non-informative censoring. Over the 5-year study period, 5137 of the 12,290 patients in the cohort were lost to follow-up. Note that, when one includes potential confounders in an outcome regression survival model, one simultaneously accounts for confounding and informative censoring based on those variables. Here, because we accounted for confounding using inverse probability weights instead of including confounders in the structural model, we would need inverse probability of censoring weights to produce consistent estimates of the hazard ratios in the presence of covariates that predict both loss to follow-up and death. However, while the proportion of patients lost to follow-up was high, no measured covariates were strongly associated with loss to follow-up. In addition, previous work has shown that loss to follow-up in the CNICS cohort does not predict mortality39, indicating that selection bias resulting from the high proportion of patients lost to follow-up is likely to be modest. As expected, when we applied inverse probability of censoring weights in unpublished analyses, results were unchanged.
We used multiple imputation to account for misclassification of current smoking status using self-reported smoking as the gold standard. However, if patients did not accurately report their smoking status, we could have residual measurement error. Similarly, if patients changed their smoking behaviors over follow-up, we would have misclassified their smoking status for at least some of the study period. Unfortunately, smoking behaviors have changed very slowly among patients with HIV, which is one reason smoking rates among patients with HIV are still over twice as high as among the general population40–44. Finally, we assume all other variables, including date of therapy initiation, are measured without error, though the multiple imputation approach could be extended to account for measurement error of multiple variables.
Multiple imputation is a method to handle missing data that is familiar to epidemiologists and has been previously applied to account for measurement error of exposures25 and outcomes26. For multiple imputation to produce consistent estimates in the presence of exposure measurement error, we must assume transportability of the misclassification probabilities; that is, we assume that inclusion in the validation subgroup is independent of the gold-standard exposure, given the observed exposure and covariates included in the imputation model45. Just as we must assume that a variable is missing at random given the observed data to use multiple imputation for missing data, we must assume that information on the gold standard exposure is missing at random (conditional on observed data) to use multiple imputation to account for measurement error.
The gold standard exposure is guaranteed to be missing at random when the validation subgroup is randomly selected from the study population (or randomly selected within strata of covariates) and participation is perfect. In our example, the validation subgroup was not randomly selected, but gold standard exposure information may still be missing at random if we measured all predictors of being included in the validation subgroup that also affect the misclassification probabilities and included these variables in the imputation model. However, we were not able to include study site as a predictor of patient-reported smoking in the imputation model because the patient-reported outcomes were collected at only 5 of the 8 study sites during the study period. If study site predicted smoking status, we may have residual bias due to measurement error. Finally, the validity of the estimates from the multiple imputation procedure depends not only on including all relevant covariates in the imputation model but also on correctly specifying their functional forms and including all relevant interactions between covariates.
One may wish to estimate the effect of exposure on outcome using patients in the validation subgroup alone. However, if a covariate influences both selection into the validation sample and the outcome, effects estimated from the validation sample alone may not be generalizable to the entire study sample, even if they are internally valid (e.g., not subject to measurement bias or selection bias) for the participants in the validation sample. In this case, multiple imputation can estimate the effect of exposure on outcome that is generalizable to the entire study sample because it allows the investigator to use information from all study participants. On the other hand, if the validation sample is randomly selected from the main study data, we would expect the estimate of effect from the validation sample to be generalizable to the entire study sample. In this case, multiple imputation still offers an advantage over using validation sample alone in terms of precision, though this advantage diminishes as the size of the validation sample relative to the main study increases.
Several alternative methods to account for measurement error are available. Regression calibration has been proposed to account for nondifferential measurement error in marginal structural models11 and can be applied in studies with internal or external validation data, but regression calibration does not appear to work well with differential measurement error46. Differential measurement error is possible if the provider’s classification of smoking status depends on predictors of mortality, a situation that is possible if health care providers are more likely to ask a sick patient about tobacco use, and therefore correctly classify that patient as a smoker or nonsmoker, than a healthy patient. Multiple imputation allows for differential exposure measurement error by including a term for the product between the misclassified exposure and the outcome in the imputation model. Alternatively, Bayesian methods can be used to account for measurement error in studies both with and without internal validation data45,47–49.
We used multiple imputation to account for measurement error in a marginal structural model to estimate the joint effects of current smoking and therapy. These analyses were feasible in a large observational database due to improved measurement of smoking status for a subset of patients leveraging patient reported outcomes collected in addition to information from the electronic health record. Using Monte Carlo simulations, we showed that multiple imputation can be combined with a marginal structural model to produce consistent estimates of the hazard ratio in the presence of confounding and measurement error similar in magnitude to the measurement error of smoking seen in the CNICS cohort.
When estimating causal effects from observational data, with few notable exceptions11,50,51 epidemiologists and biostatisticians have predominantly focused on accounting for bias due to confounding and selection, limiting consideration of measurement error to speculation in the Discussion sections of manuscripts. We have illustrated a technique to account for measurement error quantitatively that can be combined with modern methods to control confounding in observational settings.
Supplementary Material
Acknowledgments
Funding: Supported in part by the National Institutes of Health [NIH R01AI100654, NIH R24AI067039, NIH U01AR057954, and NIH P30AI50410].
Appendix
Imputation details
We used the monotone logistic method for multiple imputation to impute the smoking status S for each patient23. In the validation sample, we fit the logistic regression
where S* is the provider-reported smoking status, δ is an indicator of death during the study period, Y is the time to death or censoring, L is a vector of time-fixed covariates, A is an indicator of initiating therapy during the study period, and H is the time from study entry to the first of therapy initiation, death, or censoring. We included interactions δS* and AS* to allow for differential measurement error of smoking with respect to mortality and therapy, respectively.
Next, for each of M imputations, we draw a set of regression coefficients αm from a multivariate normal distribution with mean vector (α̂0, α̂1, α̂2, α̂3, α̂4, α̂5, α̂6, α̂7, α̂8, α̂9) and covariance matrix Σ̂S*,δ,log(Y),L,A,log (H) estimated from the logistic regression above. In each imputation m, we draw the imputed smoking status Sm for patients not in the validation subgroup from a Bernoulli distribution with probability pm, where and expit(·) = exp(·)/{1 + exp(·)}. For patients in the validation subgroup, Sm = S for all m.
Contributor Information
Jessie K. Edwards, Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC
Stephen R. Cole, Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC
Daniel Westreich, Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC.
Heidi Crane, Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA.
Joseph J. Eron, Department of Medicine, Division of Infectious Diseases, University of North Carolina at Chapel Hill, Chapel Hill, NC
W. Christopher Mathews, School of Medicine, University of California, San Diego, San Diego, CA.
Richard Moore, School of Medicine, Johns Hopkins University, Baltimore, MD.
Stephen L. Boswell, Fenway Health, Boston, MA
Catherine R. Lesko, Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC
Michael J. Mugavero, Department of Medicine, Division of Infectious Diseases, University of Alabama at Birmingham, Birmingham, AB
References
- 1.Suarez D, Borràs R, Basagaña X. Differences between marginal structural models and conventional models in their exposure effect estimates: a systematic review. Epidemiology. 2011;22(4):586–588. doi: 10.1097/EDE.0b013e31821d0507. [DOI] [PubMed] [Google Scholar]
- 2.Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11(5):550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
- 3.Cole SR, Hernán MA, Robins JM, et al. Effect of highly active antiretroviral therapy on time to acquired immunodeficiency syndrome or death using marginal structural models. Am J Epidemiol. 2003;158(7):687–694. doi: 10.1093/aje/kwg206. [DOI] [PubMed] [Google Scholar]
- 4.Cole SR, Hernán MA, Margolick JB, Cohen MH, Robins JM. Marginal structural models for estimating the effect of highly active antiretroviral therapy initiation on CD4 cell count. Am J Epidemiol. 2005;162(5):471–478. doi: 10.1093/aje/kwi216. [DOI] [PubMed] [Google Scholar]
- 5.Cole SR, Hernán Ma, Anastos K, Jamieson BD, Robins JM. Determining the effect of highly active antiretroviral therapy on changes in human immunodeficiency virus type 1 RNA viral load using a marginal structural left-censored mean model. Am J Epidemiol. 2007;166(2):219–227. doi: 10.1093/aje/kwm047. [DOI] [PubMed] [Google Scholar]
- 6.Cain LE, Logan R, Robins JM, et al. When to initiate combined antiretroviral therapy to reduce mortality and AIDS-defining illness in HIV-infected persons in developed countries: an observational study. Ann Intern Med. 2011;154(8):509–515. doi: 10.1059/0003-4819-154-8-201104190-00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hernan MA, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. J Am Stat Assoc. 96(454):440–448. [Google Scholar]
- 8.Howe CJ, Cole SR, Mehta SH, Kirk GD. Estimating the effects of multiple time-varying exposures using joint marginal structural models: alcohol consumption, injection drug use, and HIV acquisition. Epidemiology. 2012;23(4):574–582. doi: 10.1097/EDE.0b013e31824d1ccb. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sander PM, Cole SR, Stall RD, et al. Joint effects of alcohol consumption and high-risk sexual behavior on HIV seroconversion among men who have sex with men. AIDS. 2013;27(5):815–823. doi: 10.1097/QAD.0b013e32835cff4b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Moodie EEM, Delaney JaC, Lefebvre G, Platt RW. Missing Confounding Data in Marginal Structural Models: A Comparison of Inverse Probability Weighting and Multiple Imputation. Int J Biostat. 2008;4(1):1–23. doi: 10.2202/1557-4679.1106. [DOI] [PubMed] [Google Scholar]
- 11.Cole SR, Jacobson LP, Tien PC, Kingsley L, Chmiel JS. Using marginal structural measurement-error models to estimate the long-term effect of antiretroviral therapy on incident aids or death. Am J Epidemiol. 2010;171(1):113–122. doi: 10.1093/aje/kwp329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Robins JM. General methodological considerations. J Econom. 2003;112(1):89–106. [Google Scholar]
- 13.Helleberg M, Afzal S, Kronborg G, et al. Mortality attributable to smoking among HIV-1-infected individuals: a nationwide, population-based cohort study. Clin Infect Dis. 2013;56(5):727–734. doi: 10.1093/cid/cis933. [DOI] [PubMed] [Google Scholar]
- 14.Wada N, Jacobson LP, Cohen M, French A, Phair J, Munoz A. Cause-specific mortality among HIV-infected individuals, by CD4+ cell count at HAART initiation, compared with HIV-uninfected individuals. AIDS. 2014;28(2):257–265. doi: 10.1097/QAD.0000000000000078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kitahata MM, Rodriguez B, Haubrich R, et al. Cohort profile: the Centers for AIDS Research Network of Integrated Clinical Systems. Int J Epidemiol. 2008;37(5):948–955. doi: 10.1093/ije/dym231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Sterne J, Hernán MA, Ledergerber B, et al. Long-term effectiveness of potent antiretroviral therapy in preventing AIDS and death: a prospective cohort study. Lancet. 2005;366(9483):378–384. doi: 10.1016/S0140-6736(05)67022-5. [DOI] [PubMed] [Google Scholar]
- 17.Kozak MS, Mugavero MJ, Ye J, et al. Patient reported outcomes in routine care: advancing data capture for HIV cohort research. Clin Infect Dis. 2012;54(1):141–147. doi: 10.1093/cid/cir727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cole SR, Hernán MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol. 2008;168(6):656–664. doi: 10.1093/aje/kwn164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Westreich D, Cole SR, Tien PC, et al. Time scale and adjusted survival curves for marginal structural cox models. Am J Epidemiol. 2010;171(6):691–700. doi: 10.1093/aje/kwp418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Howe CJ, Cole SR, Westreich DJ, Greenland S, Napravnik S, Eron JJ. Splines for trend analysis and continuous confounder control. Epidemiology. 2011;22(6):874–875. doi: 10.1097/EDE.0b013e31823029dd. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Little RJA, Rubin DB. Statistical Analysis with Missing Data, Second Edition. 2 edition. New York, NY: Wiley-Interscience; 2002. p. 408. [Google Scholar]
- 22.Rubin D, Little R. Statistical Analysis With Missing Data. New York: John Wiley & Sons; 1987. [Google Scholar]
- 23.Rubin DB. Multiple Imputation for Nonresponse in Surveys. New York, NY: Wiley; 1987. p. xxix.p. 287. [Google Scholar]
- 24.Sterne JAC, White IR, Carlin JB, et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338:b2393. doi: 10.1136/bmj.b2393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cole SR, Chu H, Greenland S. Multiple-imputation for measurement-error correction. Int J Epidemiol. 2006;35(4):1074–1081. doi: 10.1093/ije/dyl097. [DOI] [PubMed] [Google Scholar]
- 26.Edwards JK, Cole SR, Troester Ma, Richardson DB. Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data. Am J Epidemiol. 2013;177(9):904–912. doi: 10.1093/aje/kws340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.White H. A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrics. 1980;48:817–838. [Google Scholar]
- 28.Toh S, Hernández-Díaz S, Logan R, Robins JM, Hernán MA. Estimating absolute risks in the presence of nonadherence: an application to a follow-up study with baseline randomization. Epidemiology. 2010;21(4):528–539. doi: 10.1097/EDE.0b013e3181df1b69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Feldman JG, Minkoff H, Schneider MF, et al. Association of cigarette smoking with HIV prognosis among women in the HAART era: a report from the women’s interagency HIV study. Am J Public Health. 2006;96(6):1060–1065. doi: 10.2105/AJPH.2005.062745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Cohen MH, French AL, Benning L, et al. Causes of death among women with human immunodeficiency virus infection in the era of combination antiretroviral therapy. Am J Med. 2002;113(2):91–98. doi: 10.1016/s0002-9343(02)01169-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lifson AR, Lando HA. Smoking and HIV: prevalence, health risks, and cessation strategies. Curr HIV/AIDS Rep. 2012;9(3):223–230. doi: 10.1007/s11904-012-0121-0. [DOI] [PubMed] [Google Scholar]
- 32.Lazo M, Gange SJ, Wilson TE, et al. Patterns and predictors of changes in adherence to highly active antiretroviral therapy: longitudinal study of men and women. Clin Infect Dis. 2007;45(10):1377–1385. doi: 10.1086/522762. [DOI] [PubMed] [Google Scholar]
- 33.Shuter J, Bernstein SL. Cigarette smoking is an independent predictor of nonadherence in HIV-infected individuals receiving highly active antiretroviral therapy. Nicotine Tob Res. 2008;10(4):731–736. doi: 10.1080/14622200801908190. [DOI] [PubMed] [Google Scholar]
- 34.Miguez-Burbano MJ, Burbano X, Ashkin D, et al. Impact of tobacco use on the development of opportunistic respiratory infections in HIV seropositive patients on antiretroviral therapy. Addict Biol. 2003;8(1):39–43. doi: 10.1080/1355621031000069864. [DOI] [PubMed] [Google Scholar]
- 35.Marshall MM, McCormack MC, Kirk GD. Effect of cigarette smoking on HIV acquisition, progression, and mortality. AIDS Educ Prev. 2009;21(3) Suppl:28–39. doi: 10.1521/aeap.2009.21.3_supp.28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Galai N, Park LP, Wesch J, Visscher B, Riddler S, Margolick JB. Effect of smoking on the clinical progression of HIV-1 infection. J Acquir Immune Defic Syndr Hum Retrovirol. 1997;14(5):451–458. doi: 10.1097/00042560-199704150-00009. [DOI] [PubMed] [Google Scholar]
- 37.Burns DN, Hillman D, Neaton JD, et al. Cigarette smoking, bacterial pneumonia, and other clinical outcomes in HIV-1 infection. Terry Beirn Community Programs for Clinical Research on AIDS. J Acquir Immune Defic Syndr Hum Retrovirol. 1996;13(4):374–383. doi: 10.1097/00042560-199612010-00012. [DOI] [PubMed] [Google Scholar]
- 38.Study Group for the MRC Collaborative Study of HIV Infection in Women. Survival and progression of HIV disease in women attending GUM/HIV clinics in Britain and Ireland. Sex Transm Infect. 1999;75(4):247–252. doi: 10.1136/sti.75.4.247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Edwards JK, Cole SR, Westreich D, et al. Loss to Clinic and Five-Year Mortality among HIV-Infected Antiretroviral Therapy Initiators. PLoS One. 2014;9(7):e102305. doi: 10.1371/journal.pone.0102305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lifson AR, Neuhaus J, Arribas JR, van den Berg-Wolf M, Labriola AM, Read TRH. Smoking-related health risks among persons with HIV in the Strategies for Management of Antiretroviral Therapy clinical trial. Am J Public Health. 2010;100(10):1896–1903. doi: 10.2105/AJPH.2009.188664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tesoriero JM, Gieryic SM, Carrascal A, Lavigne HE. Smoking among HIV positive New Yorkers: prevalence, frequency, and opportunities for cessation. AIDS Behav. 2010;14(4):824–835. doi: 10.1007/s10461-008-9449-2. [DOI] [PubMed] [Google Scholar]
- 42.Webb MS, Vanable PA, Carey MP, Blair DC. Cigarette smoking among HIV+ men and women: examining health, substance use, and psychosocial correlates across the smoking spectrum. J Behav Med. 2007;30(5):371–383. doi: 10.1007/s10865-007-9112-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Burkhalter JE, Springer CM, Chhabra R, Ostroff JS, Rapkin BD. Tobacco use and readiness to quit smoking in low-income HIV-infected persons. Nicotine Tob Res. 2005;7(4):511–522. doi: 10.1080/14622200500186064. [DOI] [PubMed] [Google Scholar]
- 44.Moscou-Jackson G, Commodore-Mensah Y, Farley J, DiGiacomo M. Smoking-cessation interventions in people living with HIV infection: a systematic review. J Assoc Nurses AIDS Care. 2014;25(1):32–45. doi: 10.1016/j.jana.2013.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Greenland S. Bayesian perspectives for epidemiologic research: III. Bias analysis via missing-data methods. Int J Epidemiol. 2009;38(6):1662–1673. doi: 10.1093/ije/dyp278. [DOI] [PubMed] [Google Scholar]
- 46.Stürmer T, Thürigen D, Spiegelman D, Blettner M, Brenner H. The performance of methods for correcting measurement error in case-control studies. Epidemiology. 2002;13(5):507–516. doi: 10.1097/00001648-200209000-00005. [DOI] [PubMed] [Google Scholar]
- 47.Greenland S. Relaxation penalties and priors for plausible modeling of nonidentified bias sources. Stat Sci. 2009;24(2):195–210. [Google Scholar]
- 48.MacLehose RF, Olshan AF, Herring AH, Honein Ma, Shaw GM, Romitti Pa. Bayesian methods for correcting misclassification: an example from birth defects epidemiology. Epidemiology. 2009;20(1):27–35. doi: 10.1097/EDE.0b013e31818ab3b0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gustafson P, Greenland S. Curious phenomena in Bayesian adjustment for exposure misclassification. Stat Med. 2006;25(1):87–103. doi: 10.1002/sim.2341. [DOI] [PubMed] [Google Scholar]
- 50.Regier MD, Moodie EEM, Platt RW. The effect of error-in-confounders on the estimation of the causal parameter when using marginal structural models and inverse probability-of-treatment weights: a simulation study. Int J Biostat. 2014:1–15. doi: 10.1515/ijb-2012-0039. [DOI] [PubMed] [Google Scholar]
- 51.Goetghebeur E, Vansteelandt S. Structural mean models for compliance analysis in randomized clinical trials and the impact of errors on measures of exposure. Stat Methods Med Res. 2005;14(4):397–415. doi: 10.1191/0962280205sm407oa. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.