Skip to main content

View full-text article in PMC

. 2021 Jan 13;19:6. doi: 10.1186/s12916-020-01873-7

Table 4.

Level of evidence in various studies

Level	I / Convincing	II / Highly suggestive / Probable	III / Suggestive / Possible	IV / Weak / Limited-contrasting
Grosso, 2017 [76]	High: concordance between meta-analyses of RCTs and meta-analyses of observational studies; low: meta-analyses of RCTs with results contrary to those from meta-analyses of observational studies	High: meta-analyses of prospective studies with no heterogeneity, no potential confounding factors identified, and agreement of results over time and among meta-analyses, including studies with different designs; medium: meta-analyses of prospective studies with no heterogeneity and no potential confounding factors identified; low: meta-analyses of prospective and case-control studies with no heterogeneity and no potential confounding factors identified	High: meta-analyses of prospective studies lacking information on heterogeneity and potential confounding factors; medium: meta-analyses of prospective and case-control studies lacking information on heterogeneity and potential confounding factors; low: meta-analyses of case-control studies or meta-analyses of any other study design with significant heterogeneity (I² > 50%) and potential confounding factors	Limited studies included in meta-analyses (n ≤ 3) or evident contrasting results from meta-analyses with the same level of evidence
Veronese, 2018 [77] Veronese, 2019 [83] Li, 2017 [86]	Statistical significance with p < 10^− 6, more than 1000 cases (or > 20,000 participants for continuous outcomes), the largest component study reported statistically significant effect (p < 0.05); 95% PI excluded the null; no large heterogeneity (I² < 50%), no evidence of small-study effects (p > 0.10) and excess significance bias (p > 0.10)	Statistical significance with p < 10^− 6, more than 1000 cases (or > 20,000 participants for continuous outcomes), the largest component study reported statistically significant effect (p < 0.05)	Statistical significance with p < 10^− 3, more than 1000 cases (or > 20,000 participants for continuous outcomes)	The remaining statistically significant associations with p < 0.05.
Dinu, 2018 [81]	Significance threshold reached at p ≤ 0.001 for both random and fixed effects calculation; > 1000 cases (or > 5000 total participants if the metric was continuous); not large heterogeneity between studies (I² < 50%); 95% PI excluding the null value; no evidence of small-study effects (if it could be tested)	Significance threshold reached at p ≤ 0.001 for both random and fixed effects calculation; > 1000 cases (or > 5000 total participants if the metric was continuous); not considerable heterogeneity between studies (I² = 50–75%)	Significance threshold reached at p ≤ 0.001 for random effect calculation; 500–1000 cases (or 2500–5000 total participants if the metric was continuous)	Significance threshold reached at p ≤ 0.05 for random effects calculation
Theodoratou, 2014 [52]	Evidence existed from both observational studies and RCTs, and association/effect was of the same direction, statistically significant at p ≤ 0.001, and free from bias	Evidence existed from both observational studies and RCTs, and association/effect was of the same direction and statistically significant at p ≤ 0.001, but excess significance could not be tested; or evidence existed from RCTs and effect was statistically significant at p ≤ 0.001 and with no contrary results from observational data (that is, systematic reviews, if any exist, are also definitive or suggestive and meta-analyses of observational studies, if any exist, are in the same direction)	Suggestive: Evidence from RCTs with an effect at 0.001 ≤ p ≤ 0.05 and with no contrary results from observational data (same as above); or evidence from meta-analyses of observational studies showing an association at p ≤ 0.001, with no contrary results from randomized data (that is, meta-analysis of RCTs, if present, have effects in the same direction) and, if it could be tested, no evidence of small-study effects (p ≥ 0.10), not very large heterogeneity (I² ≤ 75%), no evidence for excess significance, based on cumulative evidence of more than 500 disease events (or more than 5000 total participants if type of metric was continuous)	[Substantial effect unlikely]: Evidence from observational studies or RCTs enough to conclude that a substantial effect is unlikely based on the magnitude and the significance level
Belbasis, 2016 [78] Belbasis, 2015 [18, 51]	More than 1000 cases, significant summary associations (p < 0.001) per random effects calculations, no evidence of small-study effects, no evidence for excess significance bias, PI not including the null, and not large heterogeneity (I² ≤ 50%)effects and excess significance	[No such category exists in these studies]	Nominally significant summary associations (p < 0.05) per random effects calculations, no evidence of small-study effects, no evidence for excess significance bias, and not large heterogeneity (I² < 50%)	All other risk factors with nominally significant summary associations (p < 0.05);
Bellou, 2017 [49] Bellou, 2016 [50]	The associations that fulfilled all the following criteria: statistical significance according to random effects model at p < 10^− 6; based on more than 1000 cases; without large between-study heterogeneity (I² < 50%); 95% PI excluding the null value; and no evidence of small-study effects and excess significance	Associations with > 1000 cases, p < 10^− 6, and largest study presenting a statistically significant effect (with 95% CI excluding the null value)	The associations supported by > 1000 cases and a significant effect at p < 10^− 3)	All other risk factors with nominally significant summary associations (p < 0.05)
Poole, 2017 [79]	The classification was based on AMSTAR (A Measurement tool to Assess Systematic Reviews), as following: Q1: A-priori design; Q2: Duplicate study selection and data extraction; Q3: Search comprehensiveness; Q4: Inclusion of gray literature; Q5: Included and excluded studies provided; Q6: Characteristics of the included studies provided; Q7: Scientific quality of the primary studies assessed and documented; Q8: Scientific quality of included studies used appropriately in formulating conclusions; Q9: Appropriateness of methods used to combine studies’ findings; Q10: Likelihood of publication bias was assessed; Q11: Conflict of interest-potential sources of support were clearly acknowledged in both the systematic review and the included studies.
McRae, 2017 [85] Galbete, 2018 [84] Posadzki, 2018 [73]	Not available levels of evidence (*)

Abbreviations: RCT randomized controlled trials, PI prediction interval, 95% CI 95% confidence interval

*Non-pragmatic approach was applied in this meta-umbrella review