The critical evaluation of scientific publications represents a crucial element of physicians’ continuing education and its importance has been emphasized since the early days of evidence-based medicine (1). To this end, Prof. Maria Blettner and her team have been publishing a series of tutorials on appraisal of specialist literature in Deutsches Ärzteblatt (2– 4). The latest article in the series (Part 17) appears in this issue and deals with randomized controlled trials (RCTs) (5).
Advantages of randomization
RCTs have one decisive advantage over non-randomized studies for the evaluation of therapeutic procedures: In all probability, randomization will result in the different groups of patients in the study being comparable with regard to prognostic factors (6). In other words, the groups could be expected to show the same event rate (number of desired or undesired events per unit of time at risk) over the study period if they received the same treatment.
In all probability, randomization renders the groups comparable not only in respect of known prognostic factors, but also with regard to unknown factors, e.g., genetic factors, that might affect the outcome. This structural equivalence of patient groups gives RCTs a conclusive advantage over non-randomized studies, even if certain types of non-randomized trials approach the ideal of structural equivalence. For this reason, RCTs are viewed as the gold standard for appraisal of the efficacy of interventions.
When patient groups in an RCT with a sufficiently high number of cases are treated differently, any differences in the event rates can be attributed to the various treatments—provided the data are free of major distortion by factors such as selection bias and information bias.
Randomization is also beneficial in terms of statistical analysis. The statistical test of significance is readily interpretable—although it often remains poorly understood and is frequently misused, e.g., to support the researchers’ favored hypothesis (7).
RCTs as drug licensing trials
The goal of an RCT performed as a drug licensing trial (i.e., a superiority study) is to show that a new drug is superior to an existing standard treatment or placebo with regard to an endpoint that has been defined a priori—frequently a “hard” endpoint such as survival. The proof of superiority is consciously and intentionally obtained under conditions that do not necessarily reflect everyday clinical practice. These conditions include:
Inclusion criteria, often resulting in selected, homogeneous patient populations with high compliance
Exclusion criteria, which frequently bar patients with comorbidities and comedications from participation
Quality-assurance measures under study conditions, which often cannot be achieved in clinical routine.
There are cogent reasons for enforcing these conditions in drug licensing trials: It is easier to discern the superiority of a new drug if disturbing factors are reduced to a minimum, i.e., if the investigation is conducted under ideal conditions.
Disadvantages of RCTs
The improved detectability of drug efficacy is attended, however, by disadvantages. From the clinician’s perspective, the drug is tested in selected populations under artificial conditions. Furthermore, RCTs generally have hard primary endpoints such as survival, and endpoints that are clinically relevant and important for the patient, e.g., quality of life or rate of hospitalization, may well not be investigated. For this reason, drug licensing trials frequently answer various questions inadequately or not at all. For example, what is the effect of the drug
in patient populations with characteristics other than those in the drug licensing trial (e.g., patients with more comorbidity, patients with comedications, younger and older patients, or pregnant women)?
in patient populations in which the quality assurance measures practiced in the drug licensing trial (e.g., regular blood testing) are not possible?
on other endpoints such as quality of life or hospitalization rate that were not evaluated in the drug licensing trial?
Drug licensing trials are usually optimized for demonstration of superiority. The minimum numbers of cases required for these trials may be oriented on an effect, minimal from the clinical viewpoint—e.g., 5% lower 2-year mortality with the new treatment than with the standard therapy—that the researchers want to be able to confirm statistically. Thus the increased occurrence of adverse effects, which are often less frequent than the primary endpoints on which the case number calculation of the RCT is based, are demonstrated with insufficient statistical power, unless the treatment effects with regard to adverse effects are much greater than the minimal effect used for calculation of case numbers. Moreover, the relatively late occurrence of adverse effects in the observation period of RCTs may lead to increased rates of adverse effects caused by the drug not being detected in drug licensing trials. Therefore, although RCTs are rightly viewed as the gold standard from the epistemiological perspective, in practice they have limitations regarding the evaluation of unwanted effects.
For this reason Germany’s Medicinal Products Act (Arzneimittelgesetz, AMG) demands that approval of a drug for general use be followed by phase-IV studies or observational studies to answer these open questions.
Critics bemoan the artificial nature of drug licensing trials. These detractors should ask themselves whether it is reasonable to discuss the use in routine clinical practice (effectiveness) of a drug that has not even been shown to be effective in ideal conditions. Drug licensing trials (superiority studies) are intended to prevent the licensing of drugs that are ineffective or merely have the same effect as established treatments (comparison with standard therapy).
Appraise studies critically
Every well-trained physician should have the ability to critically appraise scientific publications on clinical research. The tutorials published in Deutsches Ärzteblatt, with its circulation of 400 000 copies, play an important part in reinforcing the readers’ capacity for critical evaluation of medical scientific literature.
Acknowledgments
Translated from the original German by David Roseveare.
Footnotes
Conflict of interest statement
The author declares that no conflict of interest exists.
Editorial accompanying the article “Randomized Controlled Trials: Part 17 of a Series on Evaluation of Scientific Publications” by Maria Kabisch et al. in this issue of Deutsches Ärzteblatt International
References
- 1.Bennett KJ, Sackett DL, Haynes RB, Neufeld VR, Tugwell P, Roberts R. A controlled trial of teaching critical appraisal of he clinical literature to medical students. JAMA. 1987;257:2451–2454. [PubMed] [Google Scholar]
- 2.du Prel J, Röhrig B, Blettner M. Statistical methods in medical research. Dtsch Arztebl Int. 2009;106(7) doi: 10.3238/arztebl.2009.0184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Röhrig B, du Prel du J, Wachtlin D, Blettner M. Types of study in medical research: part 3 of a series on evaluation of scientific publications. Dtsch Arztebl Int. 2009;106(15):262–268. doi: 10.3238/arztebl.2009.0262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Röhrig B, du Prel J, Wachtlin D, Kwiecien R, Blettner M. Sample size calculation in clinical trials: part 13 of a series on evaluation of scientific publications. Dtsch Arztebl Int. 2010;107(31-32):552–556. doi: 10.3238/arztebl.2010.0552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kabisch M, Ruckes C, Seibert-Grafe M, Blettner M. Randomized controlled trials: part 17 of a series on evaluation of scientific publications. Dtsch Arztebl Int. 2011;108(39):663–668. doi: 10.3238/arztebl.2011.0663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Fletcher RH, Fletcher SW. 2. vollständig überarbeitete Auflage. Bern: Verlag Hans Huber; 2007. Klinische Epidemiologie. [Google Scholar]
- 7.Stang A, Poole C, Kuss O. The ongoing tyranny of statistical significance testing in biomedical research. Eur J Epidemiol. 2010;25:225–230. doi: 10.1007/s10654-010-9440-x. [DOI] [PubMed] [Google Scholar]