Highlights
Assessing the certainty of results is an inevitable part of systematic reviews.
Grading of Recommendations Assessment, Development, and Evaluation is a widely used tool for assessing certainty of evidence.
Using Grading of Recommendations Assessment, Development, and Evaluation for observational studies meta-analyses is accompanied by limitations.
Dear Editor,
It is evident in the literature that aside from reporting the synthesis from the predefined questions, one must be aware of the validity and reliability of their estimates. Reporting the level of evidence will guide policymakers and practitioners to apply evidence in routine practice. In 2004, The Grading of Recommendations Assessment, Development, and Evaluation (GRADE) working group developed an approach for systematic reviews to provide transparent information on the certainty of evidence. The approach was initially designed to address the effectiveness of interventions in systematic reviews. However, it has been widely used by researchers as an evidence-ranking scheme to assess the certainty of the evidence of all types of systematic reviews, including prognostic, diagnostic, and prevalence studies. In our experience, the journal reviewers and editors suggested reporting the certainty of the evidence for prognostic/diagnostic systematic reviews using the GRADE guideline. However, applying the GRADE guideline for the rating of the prognostic/diagnostic systematic reviews have important limitations. Here, we will address some issues regarding using this approach in systematic reviews and meta-analyses of observational studies.
The GRADE approach has four levels of evidence: very low, low, moderate, and high. Eight GRADE criteria must be evaluated to report the certainty of each synthesized outcome. Among them, five decreases the grade of evidence (risk of bias, imprecision, inconsistency, indirectness, and publication bias)1. In the case of methodologically robust observational research, GRADE advises raising the quality of the evidence (large magnitude of effect, dose-response gradient, and all plausible residual confounding).
Most of the published systematic reviews have stated they reported their study in line with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline2. According to PRISMA guideline, authors must specify how they assess certainty (or confidence) in the body of evidence for an outcome. The GRADE approach suggests evidence from randomized clinical trials must begin with a high level, while evidence based on observational studies starts with a low level. Therefore, it seems that researchers of meta-analyses of observational studies might not be as eager to utilize this method to assess the certainty of evidence because the quality of a meta-analysis of an observational study will be mostly judged as low or very low. In addition, in the latest revision of GRADE guideline, the risk of bias assessment may cause rating down of evidence three levels3.
For example, O’Keeffe et al.4 published a meta-analysis on the relation of smoking and risk of lung cancer in 2018. In this well-written systematic review, a high risk of bias was observed according to the Newcastle–Ottawa scale, and there was also considerable heterogeneity (89.4–98.8%). According to the GRADE guideline, the certainty of the evidence is rated down five levels: three points due to an extremely serious risk of bias and two points due to very serious heterogeneity. As well, we can rate up the level of evidence two to three levels since there was a large magnitude of effect and potential dose-response gradients (increasing the risk of cancer with increasing the number of cigarettes per day). As a conclusion, the overall certainty of evidence for the relationship between smoking and lung cancer derived from the O’Keeffe and colleagues study is ‘low to very low.’ Therefore, based on the definitions given by the GRADE, this relationship will probably be markedly different from the actual estimated effect. While there is a global consensus on the independent hazardous effect of smoking on the increasing risk of lung cancer.
Although randomized clinical trials provide a higher level of evidence than observational studies to assess the safety and efficacy of intervention, in the assessment of the diagnostic/prognostic value of an indicator, observational studies can provide the optimum evidence available5. Furthermore, based on the Centre for Evidence-Based Medicine (CEBM) recommendation, systematic reviews of observational studies evaluated to address prognosis, diagnosis, and prevalence of a predefined question are among the studies with the highest levels of evidence.
In conclusion, it seems that the prespecified low rating of the quality of evidence derived from observational studies proposed by the GRADE may lead to an excessive decrease in the quality of evidence, and most of the authors of meta-analyses of observational studies tend not to report the certainty of evidence throughout their manuscript. In addition, rating down of certainty three levels for application of new risk of bias tools such as ROBINS-I may not applicable for systematic review of observational studies.
We propose to define a new approach for rating the level of evidence in such a way that the rating for systematic reviews of observational studies is designed based on the CEBM guideline. For example, the CEBM suggests the level of evidence provided by well-designed cohort studies and cross-sectional studies to investigate the prognostic and diagnostic value of an indicator is ‘high.’ Therefore, if a systematic review of well-designed cross-sectional studies is conducted to assess the diagnostic value of an indicator, it is better to start rating its certainty from high-quality evidence. Moreover, ‘extremely serious’ risk of bias due to using the new risk of bias tools should be apply only in systematic reviews of treatment effect.
Ethical approval
Not applicable.
Sources of funding
None.
Author contribution
M.Y. contributed to conception of the manuscript, drafted the manuscript, circulated for review, and revised the final manuscript. A.S. drafted the manuscript, circulated for review, and revised the final manuscript.
Conflict of interest disclosure
The authors declare that they have no financial conflict of interest with regard to the content of this report.
Research registration unique identifying number (UIN)
None.
Guarantor
Mahmoud Yousefifard and Arman Shafiee.
Data statement
None.
Footnotes
Sponsorships or competing interests that may be relevant to content are disclosed at the end of this article.
Published online 16 February 2023
Contributor Information
Mahmoud Yousefifard, Email: yousefifard20@gmail.com.
Arman Shafiee, Email: armanshafieemd@gmail.com.
References
- 1. Guyatt GH, Oxman AD, Kunz R, et al. What is ‘quality of evidence’ and why is it important to clinicians? BMJ 2008;336:995–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Page MJ, McKenzie JE, Bossuyt PM, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021;10:1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Piggott T, Morgan RL, Cuello-Garcia CA, et al. Grading of Recommendations Assessment, Development, and Evaluations (GRADE) notes: extremely serious, GRADE’s terminology for rating down by three levels. J Clin Epidemiol 2020;120:116–20. [DOI] [PubMed] [Google Scholar]
- 4. O’Keeffe LM, Taylor G, Huxley RR, et al. Smoking as a risk factor for lung cancer in women and men: a systematic review and meta-analysis. BMJ Open. 2018;8:e021611. doi: 10.1136/bmj.n71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Howick JH. The Philosophy of Evidence-Based Medicine. John Wiley & Sons; 2011. [Google Scholar]
