Abstract
The objective of this study was to estimate the cumulative incidences of failure by months 12 (M12) and 24 (M24) for the most prescribed first-line anti-retroviral regimens (ART).
It is retrospective analysis of a prospectively collected database.
All patients who initiated their first ART with the most prescribed regimens between 1st January 2004 and 30th June 2013 in 12 large HIV reference centers in France were included. The outcome was treatment failure—defined by any treatment modification for virological or tolerability reasons—and comparisons between regimens were carried out at M12 and M24. Adjusted and weighted methods via the propensity score (PS) were used to compare the effectiveness of the first antiretroviral regimens. Potential confounders of the treatment-outcome association were used to estimate PS with multinomial logistic regression.
Overall, 3128 and 2690 patients were included in the M12 and M24 analyses, respectively. Patients received 5 different regimens (ABC/3TC with ATV/r or DRV/r, TDF/FTC with ATV/r, DRV/r, or EFV). Failure was reported in 25% and 42% at M12 and M24, respectively. Patients who received TDF/FTC/EFV had a significantly higher proportion of failure at M12 by comparison with TDF/FTC with DRV/r (reference), but not at M24. Patients in the 3 other groups had a trend toward a higher proportion of failure at M12 although not statistically significant. No difference was found at M24.
Using data from a large prospective cohort, we found that boosted atazanavir and darunavir had comparable effectiveness, whatever the associated NRTIs, whereas efavirenz-based regimens were relatively less performing on the short term.
INTRODUCTION
Currently, recommended first-line antiretroviral regimens (ART) include 2 nucleoside analog reverse transcriptase inhibitors (NRTIs) in association with either a non-nucleoside reverse transcriptase inhibitor (NNRTI), a ritonavir boosted protease inhibitor (PI/r), or an integrase strand-transfer inhibitor (INSTI).1 Due to the number of different drugs in each class, multiple potential triple combinations can be prescribed. Randomized controlled trials (RCTs), considered as the methodological gold standard, have been designed to compare virological efficacy of some of the potential triple drug regimens. These studies provide the core knowledge for recommendations and guidelines,1–3 but they have limits. Selection and volunteer biases may compromise the generalization of their results to the general patient population.4,5
Observational cohort studies offer a complementary research design by providing information on comparative effectiveness of different antiretroviral regimens used in clinical practice. These studies allow comparisons between strategies not evaluated by RCTs, and they are representative of treatment strategies used in routine care settings. Reasons for modifying or interrupting ART may be different in this setting by comparison with RCTs, due to a broader use of ART. It is well known that in observational studies, unlike in RCTs, characteristics of patients between the treatment groups are quite different. The potential for confounding by indication may strongly impact the outcome interpretation and reliability of study findings.4–6
Statistical methods have therefore been introduced to provide a causal approach of the analysis of observational data. Most of these methods are based on the propensity score (PS), which is the probability of receiving a treatment given some observed covariates.7 The goal of propensity scores is to balance observed covariates between subjects from the treatment groups in order to mimic what happens in a randomized study.8 In practice propensity scores are unknown and are estimated via regression models. Adjusting and weighting via the PS for estimation of causal treatment effects have been mostly used for comparisons of 2 groups. Recently, however, some studies have used the multiple PS for the comparison of >2 treatment arms.9–11
Using a large prospectively collected observational cohort, we analyzed the effectiveness of the most prescribed first-line regimens. The outcome—treatment failure—was defined as any modification of the regimen because of lack of efficacy or poor tolerability. Crude analyses were compared with analyses adjusted for multiple PS or weighted via marginal structural models and doubly robust estimators.12–14
METHODS
Subjects and Data Collection
The patients were HIV-1-infected adults receiving care in 12 large HIV reference centers from 11 distinct geographical regions in France. These centers maintain prospective cohorts of all HIV-infected patients who received care and provided written consent. The data collection has been approved by the French national commission on informatics and liberty (CNIL). The database collects demographic, clinical, antiretroviral history, viral load, and CD4+ T cell counts data at regular 3 to 6 months intervals during routine clinical assessment.15 For the purpose of this study, we selected all patients who initiated their first ART between 1st January 2004 and 30th June 2013. We then restricted the population to patients receiving a regimen, which has been used by at least 200 patients and which was still recommended as the first regimen in the most recent years (thus regimens including lopinavir were excluded).
Key confounders of the treatment-outcome association included continuous and categorical variables that were assessed at or before the initiation of ART. Continuous variables were age, baseline viral load (log10 copies/ml), baseline CD4+ T cell counts (cells/mm3), and duration of known HIV infection (time in months from HIV diagnosis to ART initiation). Categorical variables were hepatitis B or C co-infection (yes/no), AIDS at ART initiation (yes/no), prior history of depression (yes/no), having a glomerular filtration rate estimated by the abbreviated Modification of Diet and Renal Disease (eGFR) <90 ml/min per 1.73 m2 (yes/no), a combination of sex and route of transmission (men who have sex with men [MSM], other men and women) and year of ART initiation (2004–2008, 2009–2010, or 2011–2013).
The outcome was treatment failure, defined by any modification of the first regimen due to the lack of efficacy or to poor tolerability. This outcome was investigated by month 12 (M12) and month 24 (M24). All other reasons for regimen modification, such as treatment simplification, pregnancy (planned or current), clinical trial participation, or planned antiretroviral interruptions were not considered as treatment failures and were considered as censored. For patients treated with tenofovir/emtricitabine with efavirenz, simplification to the single-pill formulation was not considered as a treatment modification. When the reason for ART modification was not clear, we searched for the last viral load value before modification. If this last viral load was >2.6 log10copies/ml, failure was considered as the reason for the ART modification. Otherwise, the observation was censored.
Two dataset were used for each of the M12 and M24 analyses. Patients who did not experience the outcome and who had a follow-up <12 months were excluded from the M12 analysis. Similarly patients who did not experience the outcome and who had a follow-up shorter than 24 months were excluded from the M24 analysis.
STATISTICAL METHODS
Unadjusted analysis and adjusted/weighted analyses based on the propensity score (PS) were used to compare the effectiveness of the first ART regimens at M12 and M24. Adjusted method used multiple PSs estimated by multivariable logistic regression. Each PS is the conditional probability of assigning a given subject to one of the treatment groups given the observed covariates. Selection for the PS model was an iterative process similar to that suggested previously.10,11 Marginal structural models were also used in a weighting strategy. We used stabilized weights based on the product of Inverse Probability of Treatment Weighted (IPTW) and Inverse Probability of Censoring Weighted (IPCW).14 The double robust (DR) estimator combines 2 approaches to estimate the causal effect of a treatment on an outcome.13 The first is based on an outcome-model and the second is a model for the propensity score. DR produces a consistent estimate of the treatment effect if either of the 2 models has been correctly specified. Generalized doubly robust estimator for multiple treatments has been proposed recently.16 We use the bootstrap method to estimate confidence intervals and P values of doubly robust estimators as recommended.17
Two sensitivity analyses were performed. A first sensitivity analysis excluded censored patients (patients who modified their initial ART regimen due to another reason than lack of efficacy or tolerability). A second analysis included only patients having an estimated PS or stabilized weight between the 1st and 99th percentile. All analyses were done with SAS (version 9.3; SAS Institute Inc, Cary, NC).
RESULTS
On the basis of our inclusion criteria, we selected 3628 patients who initiated ART with 5 different regimens. The 5 regimens were the following: atazanavir /ritonavir (ATV/r) with abacavir/lamivudine (ABC/3TC) (N = 250); ATV/r with tenofovir/emtricitabine (TDF/FTC) (N = 958); TDF/FTC with efavirenz (EFV) (N = 721); darunavir/ritonavir (DRV/r) with ABC/3TC (N = 340); and DRV/r with TDF/FTC (N = 1259). Among the 3628 patients, 500 patients were excluded from the M12 analysis because of a follow-up of <12 months. Thus, the M12 analysis was based on 3128 patients. Similarly, the M24 analysis was based on 2690 patients.
Baseline characteristics of the 3128 patients included in the M12 analysis are summarized in Table 1, showing to what extent the treatment groups initially differed. The use of TDF/FTC with DRV/r was more likely among subjects with high baseline viral load, low baseline CD4+ T cell count, in those with AIDS at ART initiation or among those who had been recently diagnosed. Use of ATV/r was more likely in patients who initiated ART in 2004 to 2008. Similar results were found for the patients included in the M24 analysis (data not shown). The unadjusted P value corresponds to the comparison before correction on multiple PSs. Confounders variables included in the propensity score model were markedly different between the 5 treatment groups with all P values < 0.001. The adjusted P values, corresponding to a balance-check after correction, indicate a good balance between the treatment groups since only the CD4+ T cell count is slightly unbalanced after correction.
TABLE 1.
Treatment Failure at M12
Among the 3128 subjects, 339 (10.8%) patients were censored. Among the 2789 remaining patients, 706 (25.3%) experienced a treatment failure before M12. Fifty six percent of failures occurred during the first 3 months of antiretroviral therapy, and only 53 of all M12 failures (7.5%) were due to the lack of efficacy. Table 2 reports adjusted odds ratios (ORs) based on multivariable logistic regression of treatment failure on potential confounders. Several covariates were associated with increased treatment failure: women versus MSM (OR = 1.46; P < 0.01), having an eGFR <90 ml/min per 1.73 m2 (OR = 1.26; P = 0.05); presence of AIDS at ART initiation (OR = 1.73; P < 0.01); and higher baseline viral load (OR = 1.16 per log10; P < 0.01). Probability of treatment failure was lower in patients treated early in the 2000 s (OR = 0.57 in 2004–2008 vs 2011–2013; P < 0.01; OR = 0.68 in 2009–2010 vs 2011–2013; P < 0.01).
TABLE 2.
Patients who received TDF/FTC with DRV/r had the lowest probability of failure whatever the method used; thus we used them as the reference group. Table 3 shows estimates of the probability of treatment failure in the reference group and risk differences for the other treatment groups. A positive difference in probability indicates a higher estimate of treatment failure by comparison with the reference group. The unadjusted analysis showed no significant difference in proportions of failure between the 4 treatment groups by comparison with TDF/FTC with DRV/r. Among the 2789 uncensored patients with a follow-up >12 months, we estimated a 4.5% to 8.8% higher absolute probability of treatment failure at M12 in the 4 treatment groups by comparison with the TDF/FTC with the DRV/r group. These estimates were higher than the unadjusted associations. In particular, patients receiving TDF/FTC with EFV had a significantly higher probability of failure than patients receiving TDF/FTC with DRV/r. Patients receiving ABC/3TC with ATV/r, TDF/FTC with ATV/r, or ABC/3TC with DRV/r had a trend toward a higher probability of failure with P values ∼ 0.08–0.10.
TABLE 3.
Treatment Failure at M24
Among the 2690 subjects of the M24 analysis, 536 (19.9%) patients were censored. Among the 2154 remaining patients 893 (41.5%) presented with treatment failure before M24. Most of the predictors of M12 treatment failure (Table 2) were also predictive of failure at M24 (Table 4), except for an eGFR <90 ml/min per 1.73 m2. In addition, higher baseline CD4+ T cell count (OR = 1.07 per 100 cells/ml; P = 0.03) was significantly associated with an increased probability of treatment failure at M24.
TABLE 4.
Probabilities of treatment failure varied from 41% to 46% in the TDF/FTC with DRV/r group (Table 5). In the unadjusted analysis, patients receiving an ATV/r-containing regimen had a lower absolute probability of failure compared with the reference group (−8% for ABC/3TC with ATV/r; P = 0.008; −3.8% for TDF/FTC with ATV/r; P = 0.38). The use of appropriate statistical methods showed a quite different picture with a higher probability, although not statistically significant, of failure in patients receiving an ATV/r-containing regimen. Marginal structural models and double robust estimators showed no statistical difference in probability of treatment failure of the 4 treatment groups by comparison with the reference group.
TABLE 5.
All sensitivity analyses described in the method section provided similar results for both M12 and M24 analyses (data not shown).
DISCUSSION
We here provide a comparison of the effectiveness of the 5 most prescribed first-line ART regimens for HIV-infected patients in France between 2004 and 2013. The 2 most frequently used regimens were TDF/FTC with either EFV or DRV/r as recommended in international HIV treatment guidelines.1–3 ABC/3TC with ATV/r was received by >300 patients although it was not a recommended regimen. By July 2013, the number of patients receiving INSTI-based regimens (not recommended as first-line in France before 2014) was not large enough to be included in the present analysis. Crude treatment failure rates at months 12 and 24 were 25 and 42%, respectively. Patients receiving TDF/FTC with DRV/r had the lowest risk of treatment failure at months 12 and 24. After controlling for confounding by propensity score modeling, marginal structural models and double robust we found that, at month 12, patients receiving TDF/FTC with EFV had a significantly higher probability of treatment failure. Patients receiving boosted atazanavir regimens and ABC/3TC with DRV/r had a nonsignificantly higher probability of treatment failure. At month 24, all regimens had a comparable effectiveness.
The observed crude failure rate, ∼25% at 1 year, is in line with the finding that 47% of the patients of our cohort changed their first-line regimen before the end of the first year, whatever the reasons for treatment modification.18 Thus, it is a reasonable estimation of treatment effectiveness. By comparison, the 1 and 2-year cumulative incidences of treatment modification in patients initiating ART in 2002 to 2009 were 25 and 39%, respectively, in a large collaboration of cohort studies.19 As observed in RCTs, treatment modifications were mainly due to adverse events rather than to virological or clinical failures. Fifty percent of failing patients modified their regimen during the first 3 months. Early interruption of ART regimens due to short-term poor tolerability has already been described.19,20
Observational studies provide an insight in how different treatments will be used in “real life” by the patients and their physicians. We studied a large population seeking care in different centers, representative of the patients under care in France during the last 10 years. The large population of our cohort fulfills the necessary condition to allow the selection of regimens for which no data have been produced by RCTs.21 Permanent assessment and control of the quality of the database15 limits the errors that one can encounter with this type of studies. Appropriate analysis of carefully collected prospective observational data can then complement the findings of RCTs22 to avoid that first-line regimens be chosen by the physicians on the basis of self-conviction, indirect comparisons, or other factors.
By choosing a pragmatic definition of treatment failure, we took into account some reasons for failure that may not be accounted for in RCTs. Most trials used virological failure as primary endpoints, but treatment modifications should also be considered as failures. The large number of planned visits in an RCT, especially during the first year, and strict protocol rules are 2 important differences with observational studies. One can suspect that clinicians modified patients’ regimen more easily and early in clinical practice than in an RCT. The surprisingly high probability of failure rate in the most recent calendar years can be explained by the availability of an increasing number of potent drugs leading to treatment modifications for minor toxicities. Comparative effectiveness of initial antiretroviral therapy regimens based on virological failure between RCTs and observational studies have been made and showed a good agreement.23
The methods we used were designed to limit the indication bias, taking into account the major patients characteristics that drive a physician decision while selecting ART. A key feature of the methods used here is the variable selection for propensity score models or for inverse probability weights.24,25 There is trade-off between reducing confounding bias and increasing bias and variance due to a large selection of variables in the propensity model.25 We followed the current recommendations to construct both propensity scores and stabilized weights.10,25 Similar statistical methods have been used to compare the virological efficacy of boosted double versus boosted single protease inhibitor regimens.26
A limitation of our analyses is that some unmeasured confounding may persist, as some socio-economic characteristics were not recorded in the database. We think, however, that major confounders, including baseline viral load and the year of ART initiation, were included in the models used.
In conclusion, using data from a large prospective cohort of patients seeking care in France in the recent years, we found that TDF/FTC with DRV/r had the lowest probability of treatment failure at months 12 and 24. At month 12, patients receiving TDF/FTC with EFV had a significant higher probability of treatment failure whereas no difference was found for the patients receiving boosted atazanavir regimens and ABC/3TC with DRV/r. At month 24, all regimens had a comparable effectiveness. This approach allowed us to address some questions that have not been and will probably not be considered in RCTs, adding important information for physicians and patients that will be making decisions on the choice of the first ART regimen.
ACKNOWLEDGMENTS
All authors contributed significantly to the study. PF and LC designed the study. PF was responsible of the statistical analysis. LC, PP, CA, CK, LC, AC, AC, DR, CC, and FB-S were responsible for the data collection in their centers. PF and LC wrote the first version of the paper, which was amended by all co-authors. All authors approved the final version.
Footnotes
Abbreviations: ABC/3TC = abacavir with lamivudine, ART = antiviral therapy, ATV/r = ritonavir boosted atazanavir, CNIL = French national commission on informatics and liberty, DR = Double Robust estimator, DRV/r = ritonavir boosted darunavir, EFV = efavirenz, eGFR = estimated glomerular filtration rate, HIV = human immunodeficiency virus, INSTI = integrase strand-transfer inhibitor, IPCW = Inverse Probability of Censoring Weighted, IPTW = Inverse Probability of Treatment Weighted, M12 = month 12 after treatment initiation, M24 = month 24 after treatment initiation, MSM = men who have sex with men, NNRTI = non-nucleosidic reverse transcriptase inhibitor, NRTI = nucleosidic reverse transcriptase inhibitor, OR = odd ratio, PS = propensity score, RCT = randomized clinical trial, TDF/FTC = tenofovir with emtricitabine.
Dat’AIDS Study Group: P. Enel, V. Obry-Roguet, O. Faucher, S. Bregigeon, I. Poizot-Martin, (Marseille); B. Marchou, P. Massip, E. Bonnet, M. Obadia, M. Alvarez, L. Porte, L. Cuzin, P. Delobel, M. Chauveau, D. Garipuy, M.Mularczyk, J. Bernard, I. Lepain, M. Marcel, E. Puntis, K. Sauné (Toulouse); P. Pugliese, C. Ceppi, E. Cua, J. Cottalorda, P. Dellamonica, E. Demonchy, B. Dunais, J. Durant, C. Etienne, S. Ferrando, J. G. Fuzibet, R. Garraffo, K. Risso, V. Mondain, A. Naqvi, N. Oran, I. Perbost, S. Pillet, B. Prouvost-Keller, C. Pradier, S. Wehrlen-Pugliese, E. Rosenthal, S. Sausse, P.M. Roger (Nice); C. Allavena, C. Bernaud, E. Billaud, C. Biron, B. Bonnet, S. Bouchez, D. Boutoille, C. Brunet-Cartier, N. Hall, T. Jovelin, P. Morineau, F. Raffi, V. Reliquet, H. Hue, L. Larmet, So. Pineau, V. Ferré, E. André-Garnier, A. Rodallec, F. Vivrel, M. Lefebvre, O. Grossi, C. Biron, P. Point, O. Aubry (Nantes); A. Cheret, P. Choisy (Tourcoing); Y. Yazdanpanah, R. Landman, C. Duvivier, M.A. Valantin, R. Agher, C. Katlama, Lortholary, V. Avettand-Fenoel, C. Rouzioux, P.H. Consigny, G. Cessot, F. Touam, R. Usubillaga, K. Benhadj (Paris); A. Cabié, S. Abel, S. Pierre-François, M. Ouka, J. Martial (Fort de France); D. Rey, E. Ebel, P. Fischer, M. Partisani, C Cheneau, M Priester, ML Batard, C Bernard-Henry, E de Mautort (Strasbourg); C. Chirouze, C. Drobacheff-Thiébaut, J.P. Faller, J.F. Faucher, A. Foltzer, H. Gil, L. Hustache-Mathieu (Besançon), B. Hoen (Pointe à Pitre); C Jacomet, H Laurichesse, O Lesens, M Vidal, N Mrozek, C Aumeran, O Baud, J Beytout, D Coban, S Casanova (Clermont-Ferrand), F. Bani-Sadr, C. Rouger, J.L. Berger, Y. N’Guyen, D. Lambert, I. Kmiec, M. Hentzien (Reims), D. Peyramond, C. Chidiac, F. Ader, F. Biron, A. Boibieux, L. Cotte, T. Ferry, P Miailhes, T. Perpoint, S. Degroodt (Lyon).
The databases are collected via the Nadis® software, Fedialis Medica, France, developed and maintained with financial support from ViiV healthcare. The authors do not have any conflict of interest related to this work.
REFERENCES
- 1.Gunthard HF, Aberg JA, Eron JJ, et al. Antiretroviral treatment of adult HIV infection: 2014 recommendations of the International Antiviral Society—USA Panel. JAMA 2014; 312:410–425. [DOI] [PubMed] [Google Scholar]
- 2.Department of Health and Human Services. Panel on Antiretroviral Guidelines for Adults and Adolescents. Guidelines for the use of antiretroviral agents in HIV-1-infected adults and adolescents. http://aidsinfo.nih.gov/ContentFiles/AdultandAdolescentGL.pdf Accessed January 4, 2015. [Google Scholar]
- 3.European AIDS Clinical Society. Guidelines, version 7.1, November 2014. http://www.eacsociety.org/guidelines/eacs-guidelines/eacs-guidelines.html Accessed January 5, 2015. [Google Scholar]
- 4.D’Agostino RB, Jr, D’Agostino RB., Sr Estimating treatment effects using observational data. JAMA 2007; 297:314–316. [DOI] [PubMed] [Google Scholar]
- 5.Pocock SJ, Elbourne DR. Randomized trials or observational tribulations? N Engl J Med 2000; 342:1907–1909. [DOI] [PubMed] [Google Scholar]
- 6.Wood E, Hogg RS, Heath KV, et al. Provider bias in the selection of non-nucleoside reverse transcriptase inhibitor and protease inhibitor-based highly active antiretroviral therapy and HIV treatment outcomes in observational studies. AIDS 2003; 17:2629–2634. [DOI] [PubMed] [Google Scholar]
- 7.Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika 1983; 70:41–55. [Google Scholar]
- 8.Joffe MM, Rosenbaum PR. Invited commentary: propensity scores. Am J Epidemiol 1999; 150:327–333. [DOI] [PubMed] [Google Scholar]
- 9.Frisco ML, Muller C, Frank K. Parents’ union dissolution and adolescents’ school performance: comparing methodological approaches. J Marriage Fam 2007; 69:721–741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Spreeuwenberg MD, Bartak A, Croon MA, et al. The multiple propensity score as control for bias in the comparison of more than two treatment arms: an introduction from a case study in mental health. Med Care 2010; 48:166–174. [DOI] [PubMed] [Google Scholar]
- 11.Zanutto E, Lu B, Hornik R. Using propensity score subclassification for multiple treatment doses to evaluate a National Antifrug Media campaign. J Educ Behav Stat 2005; 30:59–73. [Google Scholar]
- 12.Hernan MA, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology 2000; 11:561–570. [DOI] [PubMed] [Google Scholar]
- 13.Lunceford JK, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med 2004; 23:2937–2960. [DOI] [PubMed] [Google Scholar]
- 14.Robins JM, Hernan MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology 2000; 11:550–560. [DOI] [PubMed] [Google Scholar]
- 15.Pugliese P, Cuzin L, Cabie A, et al. A large French prospective cohort of HIV-infected patients: the Nadis Cohort. HIV Med 2009; 10:504–511. [DOI] [PubMed] [Google Scholar]
- 16.Tu C, Koh WY, Jiao S. Using generalized doubly robust estimator to estimate average treatment effects of multiple treatments inobservational studies. J Stat Comput Simulation 2013; 83:1518–1526. [Google Scholar]
- 17.Funk MJ, Westreich D, Wiesen C, et al. Doubly robust estimation of causal effects. Am J Epidemiol 2011; 173:761–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Keita M, Perbost I, Pugliese-Wehrlen S, et al. Incidences and risk factors of first-line HAART discontinuation: a limitation to the success of the “seek, test, treat, and retain” strategy? AIDS Care 2014; 26:1058–1069. [DOI] [PubMed] [Google Scholar]
- 19.Abgrall S, Ingle SM, May MT, et al. Durability of first ART regimen and risk factors for modification, interruption or death in HIV-positive patients starting ART in Europe and North America 2002–2009. AIDS 2013; 27:803–813. [DOI] [PubMed] [Google Scholar]
- 20.Willig JH, Abroms S, Westfall AO, et al. Increased regimen durability in the era of once-daily fixed-dose combination antiretroviral therapy. AIDS 2008; 22:1951–1960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Phillips AN, Grabar S, Tassie JM, et al. Use of observational databases to evaluate the effectiveness of antiretroviral therapy for HIV infection: comparison of cohort studies with randomized trials. EuroSIDA, the French Hospital Database on HIV and the Swiss HIV Cohort Study Groups. AIDS 1999; 13:2075–2082. [DOI] [PubMed] [Google Scholar]
- 22.Hlatky MA, Califf RM, Harrell FE, Jr, et al. Comparison of predictions based on observational data with the results of randomized controlled clinical trials of coronary artery bypass surgery. J Am Coll Cardiol 1988; 11:237–245. [DOI] [PubMed] [Google Scholar]
- 23.Mugavero MJ, May M, Ribaudo HJ, et al. Comparative effectiveness of initial antiretroviral therapy regimens: ACTG 5095 and 5142 clinical trials relative to ART-CC cohort study. J Acquir Immune Defic Syndr 2011; 58:253–260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Brookhart MA, Schneeweiss S, Rothman KJ, et al. Variable selection for propensity score models. Am J Epidemiol 2006; 163:1149–1156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cole SR, Hernan MA. Constructing inverse probability weights for marginal structural models. Am J Epidemiol 2008; 168:656–664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Petersen ML, Wang Y, van der Laan MJ, et al. Virologic efficacy of boosted double versus boosted single protease inhibitor therapy. AIDS 2007; 21:1547–1554. [DOI] [PMC free article] [PubMed] [Google Scholar]