ISPOR Health Policy Council proposed Good Research Practices For Comparative Effectiveness Research: Benefit or Harm?

Til Stürmer; Tim Carey; Charles Poole

doi:10.1111/j.1524-4733.2009.00653.x

. Author manuscript; available in PMC: 2014 May 28.

Published in final edited form as: Value Health. 2009 Oct 8;12(8):1042–1043. doi: 10.1111/j.1524-4733.2009.00653.x

ISPOR Health Policy Council proposed Good Research Practices For Comparative Effectiveness Research: Benefit or Harm?

Til Stürmer ¹, Tim Carey ¹, Charles Poole ¹

PMCID: PMC4036452 NIHMSID: NIHMS575377 PMID: 19818061

There are increasing calls for better understanding of “what works” in health care [1]. One of the means for assessing what works is through “comparative effectiveness research” (CER) [2]. Ideally, the needed data would come from randomized controlled trials (RCTs) or from natural experiments. RCTs would need to be large, practical clinical trials that compare interventions head-to-head in real clinical settings [3,4], using novel approaches to assess clinically relevant outcomes.

Non-experimental studies of intended drug effects have been criticized because confounding by indication (selective channeling of patients to treatment modalities based on outcome predictors such as severity of disease) can almost never be ruled out [5,6]. Recent developments in pharmacoepidemiologic methods limit the potential for bias and thus increase the value of non-experimental comparisons of intended drug effects. These methods include instrumental variable methods [7,8], the new user design [9], the use of a comparator drug with a similar indication to that of the index drug [10], propensity scores [11], and simple improvements such as eliminating immortal person-time [12] and reducing selection bias by not censoring follow-up when a person stops taking a drug [13,14]. Much remains to be done, however, including the study of heterogeneity of treatment effects at the intersection between personalized medicine and pharmacoepidemiology. In addition, there remains an unresolved tension between emulating RCTs (increasing internal validity based on increasing restrictions [15]) and enhancing generalizability (external validity).

Comparative effectiveness research (CER) is an interdisciplinary endeavor in which the disciplines are linked by the need for information and the development of methods. The involvement of ISPOR in this enterprise is welcome. Like drugs, the Good Research Practices proposed by the working group of the ISPOR Health Policy Council and published in this issue of the journal [16–18] need to be evaluated by their potential benefits and harms. The potential benefits are obvious. Someone unfamiliar with performing non-experimental comparisons of drugs and their outcomes will find valuable discussion in these documents of issues to be considered. Common to all such documents, however, there is the potential for harm when the recommendations are used as a cookbook without understanding their interplay. References to standard textbooks of pharmacoepidemiology [e.g., 19] could help alleviate this problem.

It is inevitable for any detailed overview to contain questionable or outmoded recommendations. Recommendations of the proposed documents that some experienced pharmacoepidemiologists might find arguable include:

The requirement to report the results from all ex ante analyses. Some such analyses will have been abandoned because the researchers discovered that they are biased.
The assessment of the importance of biases based on how they affect the acceptance or rejection of the null hypothesis. Biases are best measured by their effects on the magnitude, direction and precision of effect-measure estimates.
The recommendation for propensity score models to include variables that are only weakly related to treatment selection (but unrelated to the outcome per the following recommendation). It is unclear why any variable that is unrelated to the outcome should be included in a propensity score [20].

Some core issues in the design and analysis of non-experimental comparisons are not addressed in enough detail. One is the importance of the role of various ‘stakeholders’ in CER. Another is the distinction between confounding (e.g., by indication) and selection bias (due, for instance, to non-adherence, drop-out of “sick stoppers,” etc. [13]). The potential to separate these forms of bias is one of the main advantages of the new user design [9].

Given the expected continuation of the rapid development of pharmacoepidemiologic methods over the past 5 years, the proposed Good Research Practices may become outdated very rapidly [21]. We found no indication of how ISPOR intends to keep these guidelines up to date. In an era of guideline proliferation, one might ask what the proposed ISPOR document will add to the existing ones in this field, especially the Good Pharmacoepidemiologic Practice document published and continuously updated by the International Society for Pharmacoepidemiology (ISPE) [22]. Finally, harmonization of the ISPOR documents with others, including those proposed by ISPE, the US Institute of Medicine (IOM), the US Agency for Healthcare Research and Quality (AHRQ), the UK National Institute for Health and Clinical Excellence (NICE) should be considered. Such harmonization would prevent confusion and nit picking by groups opposed to CER.

In our view, the benefit-to-harm balance of ISPOR’s proposed documents on Good Research Practices favors the benefit side. It will help to spread the news that non-experimental treatment comparisons are possible given careful design, analysis, and interpretation. We congratulate ISPOR for providing guidelines for CER that emphasize the potential benefits without giving CER a black box warning for its potential harms.

References

1.Eden J, et al., editors. Knowing What Works in Health Care: A Roadmap for the Nation. National Academies Press; Washington, DC: 2008. [Google Scholar]
2.Institute of Medicine. Monograph on Comparative Effectiveness Research. 2009. in press. [Google Scholar]
3.Tunis S. Comparative Effectiveness: Basic Terms and Concepts. Center for Medical Technology Policy; San Francisco, CA: 2007. http://www.allhealth.org/briefingmaterials/Tunis4-27-07-699.pdf. [Google Scholar]
4.Kolata Gina. NY Times. Nov 25, 2008. New arena for testing of drugs: real world. [Google Scholar]
5.Miettinen OS. The need for randomization in the study of intended effects. Stat Med. 1983;2:267–271. doi: 10.1002/sim.4780020222. [DOI] [PubMed] [Google Scholar]
6.Strom BL, Miettinen OS, Melmon KL. Post-marketing studies of drug efficacy: how? Am J Med. 1984;77:703–708. doi: 10.1016/0002-9343(84)90369-3. [DOI] [PubMed] [Google Scholar]
7.Greenland S. An introduction to instrumental variables for epidemiologists. Int J Epidemiol. 2000;29:722–729. doi: 10.1093/ije/29.4.722. [DOI] [PubMed] [Google Scholar]
8.Brookhart MA, Wang PS, Solomon DH, Schneeweiss S. Evaluating short-term drug effects using a physician-specific prescribing preference as an instrumental variable. Epidemiology. 2006;17:268–75. doi: 10.1097/01.ede.0000193606.58671.c5. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Ray W. Evaluating medication effects outside of clinical trials: new-user designs. Am J Epidemiol. 2003;158:915–20. doi: 10.1093/aje/kwg231. [DOI] [PubMed] [Google Scholar]
10.Glynn RJ, Schneeweiss S, Stürmer T. Indications for propensity scores and review of their use in pharmacoepidemiology. Basic Clin Pharmacol Toxicol. 2006;98:253–9. doi: 10.1111/j.1742-7843.2006.pto_293.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Stürmer T, Schneeweiss S, Brookhart MA, Rothman KJ, Avorn J, Glynn RJ. Analytic Strategies to adjust confounding using Exposure Propensity Scores and Disease Risk Scores: Nonsteroidal Antiinflammatory Drugs (NSAID) and Short-Term Mortality in the Elderly. Am J Epidemiol. 2005a;161:891–898. doi: 10.1093/aje/kwi106. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Suissa S. Immortal time bias in pharmaco-epidemiology. Am J Epidemiol. 2008;167:492–9. doi: 10.1093/aje/kwm324. [DOI] [PubMed] [Google Scholar]
13.Andersen M, Brookhart MA, Glynn RJ, Støvring H, Stürmer T. Practical issues in measuring cessation and re-initiation of drug use in databases. Pharmacoepidemiol Drug Saf. 2008;17(suppl 1):S27. [Google Scholar]
14.Hernán MA, Alonso A, Logan R, Grodstein F, Michels KB, Willett WC, Manson JE, Robins JM. Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology. 2008;19:766–79. doi: 10.1097/EDE.0b013e3181875e61. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Schneeweiss S, Patrick A, Stürmer T, Brookhart MA, Avorn J, Maclure M, Rothman KJ, Glynn RJ. Increasing levels of restriction in pharmacoepidemiologic database studies of elderly and comparison with randomized trial results. Med Care. 2007;45:S131–S142. doi: 10.1097/MLR.0b013e318070c08e. Supplement 2: Emerging methods in comparative effectiveness and safety. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Berger ML, Mamdani M, Atkins D, Johnson ML. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: DEFINING, REPORTING AND INTERPRETING NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part I. doi: 10.1111/j.1524-4733.2009.00600.x. [DOI] [PubMed] [Google Scholar]
17.Cox E, Martin BC, Van Staa T, Garbe E, Siebert U, Johnson ML. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: APPROACHES TO MITIGATE BIAS AND CONFOUNDING IN THE DESIGN OF NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part II. doi: 10.1111/j.1524-4733.2009.00601.x. [DOI] [PubMed] [Google Scholar]
18.Johnson ML, Crown W, Martin BC, Dormuth CR, Siebert U. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: ANALYTIC METHODS TO IMPROVE CAUSAL INFERENCE FROM NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part III. doi: 10.1111/j.1524-4733.2009.00602.x. [DOI] [PubMed] [Google Scholar]
19.Strom Brian L., editor. Pharmacoepidemiology. 4. John Wiley & Sons Ltd; Chichester, UK: 2005. [Google Scholar]
20.Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163:1149–1156. doi: 10.1093/aje/kwj149. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Rothman KJ, Poole C. Some guidelines on guidelines: they should come with expiration dates. Epidemiology. 2007;18:794–6. doi: 10.1097/EDE.0b013e3181571259. [DOI] [PubMed] [Google Scholar]
22.ISPE. Guidelines for good pharmacoepidemiology practices (GPP) Pharmacoepidemiol Drug Saf. 2008;17:200–208. doi: 10.1002/pds.1471. [DOI] [PubMed] [Google Scholar]

[R1] 1.Eden J, et al., editors. Knowing What Works in Health Care: A Roadmap for the Nation. National Academies Press; Washington, DC: 2008. [Google Scholar]

[R2] 2.Institute of Medicine. Monograph on Comparative Effectiveness Research. 2009. in press. [Google Scholar]

[R3] 3.Tunis S. Comparative Effectiveness: Basic Terms and Concepts. Center for Medical Technology Policy; San Francisco, CA: 2007. http://www.allhealth.org/briefingmaterials/Tunis4-27-07-699.pdf. [Google Scholar]

[R4] 4.Kolata Gina. NY Times. Nov 25, 2008. New arena for testing of drugs: real world. [Google Scholar]

[R5] 5.Miettinen OS. The need for randomization in the study of intended effects. Stat Med. 1983;2:267–271. doi: 10.1002/sim.4780020222. [DOI] [PubMed] [Google Scholar]

[R6] 6.Strom BL, Miettinen OS, Melmon KL. Post-marketing studies of drug efficacy: how? Am J Med. 1984;77:703–708. doi: 10.1016/0002-9343(84)90369-3. [DOI] [PubMed] [Google Scholar]

[R7] 7.Greenland S. An introduction to instrumental variables for epidemiologists. Int J Epidemiol. 2000;29:722–729. doi: 10.1093/ije/29.4.722. [DOI] [PubMed] [Google Scholar]

[R8] 8.Brookhart MA, Wang PS, Solomon DH, Schneeweiss S. Evaluating short-term drug effects using a physician-specific prescribing preference as an instrumental variable. Epidemiology. 2006;17:268–75. doi: 10.1097/01.ede.0000193606.58671.c5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Ray W. Evaluating medication effects outside of clinical trials: new-user designs. Am J Epidemiol. 2003;158:915–20. doi: 10.1093/aje/kwg231. [DOI] [PubMed] [Google Scholar]

[R10] 10.Glynn RJ, Schneeweiss S, Stürmer T. Indications for propensity scores and review of their use in pharmacoepidemiology. Basic Clin Pharmacol Toxicol. 2006;98:253–9. doi: 10.1111/j.1742-7843.2006.pto_293.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Stürmer T, Schneeweiss S, Brookhart MA, Rothman KJ, Avorn J, Glynn RJ. Analytic Strategies to adjust confounding using Exposure Propensity Scores and Disease Risk Scores: Nonsteroidal Antiinflammatory Drugs (NSAID) and Short-Term Mortality in the Elderly. Am J Epidemiol. 2005a;161:891–898. doi: 10.1093/aje/kwi106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Suissa S. Immortal time bias in pharmaco-epidemiology. Am J Epidemiol. 2008;167:492–9. doi: 10.1093/aje/kwm324. [DOI] [PubMed] [Google Scholar]

[R13] 13.Andersen M, Brookhart MA, Glynn RJ, Støvring H, Stürmer T. Practical issues in measuring cessation and re-initiation of drug use in databases. Pharmacoepidemiol Drug Saf. 2008;17(suppl 1):S27. [Google Scholar]

[R14] 14.Hernán MA, Alonso A, Logan R, Grodstein F, Michels KB, Willett WC, Manson JE, Robins JM. Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology. 2008;19:766–79. doi: 10.1097/EDE.0b013e3181875e61. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] 15.Schneeweiss S, Patrick A, Stürmer T, Brookhart MA, Avorn J, Maclure M, Rothman KJ, Glynn RJ. Increasing levels of restriction in pharmacoepidemiologic database studies of elderly and comparison with randomized trial results. Med Care. 2007;45:S131–S142. doi: 10.1097/MLR.0b013e318070c08e. Supplement 2: Emerging methods in comparative effectiveness and safety. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Berger ML, Mamdani M, Atkins D, Johnson ML. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: DEFINING, REPORTING AND INTERPRETING NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part I. doi: 10.1111/j.1524-4733.2009.00600.x. [DOI] [PubMed] [Google Scholar]

[R17] 17.Cox E, Martin BC, Van Staa T, Garbe E, Siebert U, Johnson ML. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: APPROACHES TO MITIGATE BIAS AND CONFOUNDING IN THE DESIGN OF NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part II. doi: 10.1111/j.1524-4733.2009.00601.x. [DOI] [PubMed] [Google Scholar]

[R18] 18.Johnson ML, Crown W, Martin BC, Dormuth CR, Siebert U. GOOD RESEARCH PRACTICES FOR COMPARATIVE EFFECTIVENESS RESEARCH: ANALYTIC METHODS TO IMPROVE CAUSAL INFERENCE FROM NON-RANDOMIZED STUDIES OF TREATMENT EFFECTS USING SECONDARY DATA SOURCES. Report of the ISPOR Retrospective Database Analysis Task Force – Part III. doi: 10.1111/j.1524-4733.2009.00602.x. [DOI] [PubMed] [Google Scholar]

[R19] 19.Strom Brian L., editor. Pharmacoepidemiology. 4. John Wiley & Sons Ltd; Chichester, UK: 2005. [Google Scholar]

[R20] 20.Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163:1149–1156. doi: 10.1093/aje/kwj149. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Rothman KJ, Poole C. Some guidelines on guidelines: they should come with expiration dates. Epidemiology. 2007;18:794–6. doi: 10.1097/EDE.0b013e3181571259. [DOI] [PubMed] [Google Scholar]

[R22] 22.ISPE. Guidelines for good pharmacoepidemiology practices (GPP) Pharmacoepidemiol Drug Saf. 2008;17:200–208. doi: 10.1002/pds.1471. [DOI] [PubMed] [Google Scholar]

PERMALINK

ISPOR Health Policy Council proposed Good Research Practices For Comparative Effectiveness Research: Benefit or Harm?

Til Stürmer, MD, MPH

Tim Carey, MD, MPH

Charles Poole, MPH, ScD

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

ISPOR Health Policy Council proposed Good Research Practices For Comparative Effectiveness Research: Benefit or Harm?

Til Stürmer, MD, MPH

Tim Carey, MD, MPH

Charles Poole, MPH, ScD

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases