Abstract
Background:
The limited palliative care evidence base is poorly amenable to existing grading schemes utilized in guidelines. Many recommendations are based on expert consensus or clinical practice standards, which are often considered ‘low-quality’ evidence. Reinforcing provider hesitancy in translating recommendations to practice has implications for patient care.
Aim:
To rationalize the selection of an appropriate grading system for rating evidence to support recommendations made in palliative care clinical practice guidelines.
Design:
Review of the methodology sections of international palliative care guidelines published in English identified five grading systems comparison: Grading of Recommendations, Assessment, Development and Evaluations (GRADE); the Scottish Intercollegiate Guidelines Network (SIGN); Infectious Diseases Society of America-European Society for Medical Oncology (IDSA-ESMO); Confidence in the Evidence from Reviews of Qualitative research (CERQual) and the National Service Framework for Long Term Conditions (NSF-LTC).
Results:
There is heterogeneity among grading systems used in published palliative care or terminal symptom management guidelines. GRADE has been increasingly adopted for its methodological rigour and inter-guideline consistency with other medical associations. CERQual has the potential to support recommendations informed by qualitative evidence, but its role in clinical guidelines is less defined. The IDSA-ESMO system has an intuitive typology with the ability to categorize tiers of lower-quality evidence.
Conclusions:
It is challenging to apply commonly used grading systems to the palliative care evidence base, which often lacks robust randomized controlled trials (RCTs). Adoption of IDSA-ESMO offers a feasible and practical alternative for lower-resourced guideline developers and palliative clinicians without a prerequisite for methodological expertise.
Keywords: Practice guideline, clinical guidelines as topic, evidence-based practice, evidence-based medicine, palliative care, palliative medicine, GRADE approach
Key Statements
What is already known about the topic?
Certain areas within the scope of palliative care lend themselves poorly to randomized controlled trials due to issues related to recruitment, attrition, bias and ethics.
The available evidence informing recommendations on end-of-life issues often ranks low on the evidence hierarchy.
The multiple different grading systems currently in use each have their own advantages and disadvantages, particularly in the context of the available evidence base.
What this paper adds?
The grading system developed by the Infectious Diseases Society of America and adapted by the European Society of Medical Oncology is appropriate for supporting recommendations made within palliative care guidelines because its typology accounts for non-experimental methods common in the field.
We provide a deliberation process for selecting a grading system in palliative care, including strengths and limitations of different systems.
Implications for practice, theory or policy
Guideline development within palliative care would benefit from detailed documentation of the evidence appraisal process for consistency and transparency.
To date, there is no single grading system that effectively integrates critical appraisal of qualitative and quantitative evidence sources alike. Systematically approaching recommendations rooted in psychosocial or spiritual evidence may help enhance clinician confidence in situations where the supporting qualitative data is limited.
Guideline developers should strive for a balance between rigour and accessibility when grading their recommendations.
Introduction
Clinical practice guidelines provide recommendations with the goal of delivering consistent and informed patient care. They typically guide clinical decision-making in specific scenarios based on available evidence that has been systematically appraised 1 and compared to alternative options.1,2 Trustworthy guidelines assure methodological transparency, integrate multi-disciplinary stakeholder input and reconcile conflicting recommendations. 2 The Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach 3 is widely used to ensure rigour and uniformity as part of an international movement towards enhanced quality standards for guideline developers. 2
GRADE is not optimal for the assessment of the evidence base in palliative care, which is often qualitative in nature or based on expert opinion and practice standards, rather than randomized controlled trials (RCTs) and other ‘high-quality’ experimental methods. While these are not scored favourably by GRADE,3 –5 they are pragmatic in providing clinical judgement and navigation in the absence of other evidence.6,7 We acknowledge the hypothetical risk that such ‘low-quality’ evidence may seem synonymous with interventions that do not add value or are equivocal if not harmful. However, this term more commonly signifies a lack of available research. 7 The ability of guidelines to offer direction in the face of inadequate evidence is cited as a strength rather than a weakness, as it would be exceedingly difficult to make clinical decisions on the basis of high-quality evidence alone. 6 Controlled, experimental trials with high participant numbers may not be feasible given certain patient populations or clinical questions, a situation common in palliative care.
We report here the critical evaluation approach we adopted when addressing the need for a holistic, practical evidence appraisal system for use in updating the Palliative Symptom Management Guidelines published by our group at the BC Centre for Palliative Care (BCCPC). While our guidelines are targeted to clinicians, allied health professionals and caregivers for adults with advanced life-limiting illness, 8 the information contained in this paper is applicable to a broader audience of guideline developers and novices to clinical evidence appraisal alike.
Design
Websites of national and international palliative care organizations were searched for clinical guidelines published in English. The academic databases MEDLINE, Embase and CINAHL were searched for additional published palliative care guidelines. Methodology sections, bibliographies and supplemental appendices of the published guidelines were reviewed to identify critical appraisal processes and further guidelines for review. The search strategy outlined in Supplementary Figure 1 was used to expand the search and identify recent publications. We modified our search criteria to include methodological papers on grading systems. Briefly, the databases listed above were searched from inception to May 2023 using a combination of keywords related to palliative care, evidence-based practice and clinical guidelines. We excluded disease-specific guidelines not aimed at a generalist palliative care audience, development guides and guideline appraisal tools such as AGREE II. Our aim was to source publications on grading system methodologies, critiques and comparisons.
We identified five grading systems in use within North America, Australia and Europe. Four of these were selected based on their role in recent palliative care guidelines. The final system, NSF-LTC, was identified during database search and after exhausting the reference lists of methodological papers. It was selected for its theoretical applicability rather than current use.
Results: Summary of grading systems
Five grading systems were identified for review and comparison: GRADE; IDSA-ESMO; Confidence in the Evidence from Reviews of Qualitative research (CERQual); the Scottish Intercollegiate Guidelines Network (SIGN) and the National Service Framework for Long Term Conditions (NSF-LTC). The grading systems used by the identified organizations, and their geographic regions are detailed in Table 1. These were the only distinct grading systems identified during the search process. Other guidelines either did not explicitly name a grading system or relied on expert consensus. These grading systems were compared based on rigour, practicality, clarity of typology and breadth of use. This criteria addresses the systematization of evidence appraisal, feasibility of implementation and congruency with the current clinical landscape.
Table 1.
Systems for scoring or categorizing (grading) recommendations made in clinical guidelines a .
| Organization | System(s) used | Region |
|---|---|---|
| Scottish Intercollegiate Network | SIGN (former), GRADE (current) | UK |
| Registered Nurses Association of Ontario | GRADE (modified), CERQual | Canada |
| European Society for Clinical Oncology | IDSA-ESMO | Europe |
| Alberta Health Services, Cancer Care Alberta | IDSA-ESMO | Canada |
| BC Centre for Palliative Care | GRADE (modified) | Canada |
| Ontario Palliative Care Network | Expert Consensus | Canada |
| The National Institute for Health and Care Excellence | GRADE | UK |
| National Consensus Project Clinical Practice Guidelines for Quality Palliative Care | GRADE | International |
| National Health and Medical Council of Australia | GRADE | Australia |
| National Comprehensive Cancer Network | Expert Consensus | US |
| European Association for Palliative Care | GRADE, Expert Consensus | Europe |
| American Society of Clinical Oncology | Expert Consensus | US |
As used in relevant palliative or end-of-life care guidelines only.
GRADE
Numerous organizations have adopted GRADE, including Cochrane, 9 the World Health Organization, 10 the Canadian Task Force on Preventative Health Care, 11 the National Institute for Health and Care Excellence (UK), 12 the National Consensus Project for Quality Palliative Care (US) 13 and the Registered Nurses Association of Ontario (Canada). 14
The separation of judgements on the strength of recommendation from the quality of evidence is a defining feature of GRADE.3,4 This allows for nuance when high-quality studies are associated with burdensome therapy or low-quality studies with improved patient satisfaction. 4 The quality of evidence rating depends on the level of certainty (high, moderate, low or very low) in how close the true effect is to the effect estimate (Table 2).4,5,15 To assess the quality of evidence supporting each recommendation, five domains are evaluated: study design, risk of bias, consistency, directness, precision and publication bias. Within guidelines, the gestalt of these studies serves as the basis for supporting practice recommendations, interwoven with clinical context.4,5,15 –17 In systematic reviews, GRADE provides structure for conveying judgements about how closely the collective body of evidence for a specific outcome approximates reality, and how influential future research is likely to be.4,5,9,15 The strength of recommendation is dichotomous (Table 2) and is influenced by trade-offs between desirable and undesirable effects, patient values and preferences, evidence quality and equitable resource allocation.4,18,19
Table 2.
Overview of GRADE Quality and Strength of Recommendation ratings.
| Quality of evidence ratings | High | Future research is unlikely to change the effect estimate; benefits clearly outweigh the risks (or vice versa). |
| Moderate Moderate |
Future research may change the effect estimate; sufficient evidence for decision-making. | |
| Low Low |
Future research is likely to change the effect estimate; some evidence for decision-making. | |
| Very Low Very Low |
The effect estimate is uncertain; insufficient evidence for decision-making. | |
| Strength of recommendation ratings | Strong | Most people in this situation would want the recommendation, and only a minority would not. |
| Weak or Conditional | Most people in this situation would want the suggestion, but many would not. |
RCTs are initially assigned a high level of certainty while non-randomized and observational studies are assigned a low level of certainty due to inherent bias,5,9 which can be systematically evaluated using tools developed by the Cochrane Collaboration such as the Risk Of Bias In Non-randomized Studies – of Interventions (ROBINS-I).9,20 ROBINS-I provides a way of comparing the non-randomized study against a ‘target trial’, which is a hypothetical RCT evaluating the same interventions in the same population, where feasibility and ethics can be ignored.9,20
Observational studies may be upgraded if they have a large (i.e. two-fold) or very large (i.e. five-fold) effect size, a dose-response gradient, or residual confounding that inadvertently increases confidence in the estimated effect, whereby the actual effect size would be even greater had the overlooked confounder(s) been accounted for.4,9,17,21 Critical consideration must be given towards the relative contributions of larger scale, higher quality studies. 4
Clinical applications
GRADE cannot be applied to qualitative studies or expert opinion, precluding its scope of use for much of the palliative care evidence base. Retrospective and case control studies represent most of the body of quantitative evidence on symptom management in end-of-life, which are rated as low-quality evidence under the GRADE system. In addition, much evidence from RCTs would be downgraded based on design limitations, such as low sample sizes,9,22 –24 or indirect evidence such as that extracted from systematic reviews on opioids for adjacent populations, such as patients with chronic or cancer pain.25 –29
CERQual
The GRADE-CERQual approach (CERQual) was developed in collaboration with the GRADE Working Group to support the use of qualitative syntheses in guideline development.30,31 The quality of evidence is appraised in a manner analogous to GRADE. Levels of confidence (high, moderate, low or very low) are assigned across four components (methodological limitations, coherence, adequacy of data and relevance) based on how well the review finding reasonably represents the phenomenon of interest.30 –32 Each review finding starts as ‘high confidence’ and is downgraded by one or more levels based on severity of concerns, for which there are no definitive criteria and much is subject to reviewer discretion.30,33
Clinical applications
Although its application is less widespread in the literature compared to GRADE, CERQual offers a framework for clinical areas previously not amenable to traditional grading schemes by standardizing recommendations based on qualitative findings. 31
For example, the mixed-methods systematic review by Edwards et al. 34 was able to state with a high degree of confidence that ‘programmes and services for people at the end-of-life require a comprehensive team approach incorporating symptom relief, psychological and psycho-social support and spiritual care’. This statement arose from the application of CERQual to qualitative studies on palliative care in people with severe mental illness. 34 Being able to demonstrate thematic and contextual congruity conveys a certain degree of rigour. This adds weight to practice recommendations informed by qualitative evidence.
In a separate systematic review on ethical dilemmas faced by palliative care specialists, Schofield et al. 35 held a high degree of confidence in the evidence that: ‘Participants reported ethical challenges related to the ethical principle of truth-telling. This focuses around whether it was appropriate to discuss both the terminal nature of a diagnosis and/or prognosis to both the patient and to family members’. The authors’ rationale was that individual studies had a high level of coherence with respect to this theme. 35 CERQual provides the opportunity to address these ethical challenges within palliative care from an evidence-driven perspective, not just a theoretical one.
SIGN
SIGN is an expert group that independently releases guidelines with a historic grading system of their own. However, the organization began transitioning to GRADE in 2009 and moved away from their ABCD grading of recommendations in 2013.6,36,37 The original SIGN approach of adding ‘+’ or ‘-’ to their level of evidence scores provided a spectrum for conveying strengths or weaknesses, although a significant limitation was their focus on study type rather than study quality.38,39 The clinical applications of SIGN are limited, considering it has fallen out of use.36,37,40
NSF-LTC
NSF-LTC developed a unique evidence typology to address the limitations of experimental quantitative research methods in long-term care based on the premise that research quality is more important than research design.39,41 The intention was to address the challenges of making recommendations for diseases with complex trajectories,39,41 which can be said about the natural history of many life-limiting illnesses under the purview of palliative care. On this basis, NSF-LTC was included in this discussion despite any evidence of its adoption by a recognized palliative care organization.
According to NSF-LTC, systematic assessment of the evidence involves upholding three principles: value in the opinions of experts, patients and families; belief that qualitative, quantitative and mixed-methods studies have equal validity with no implicit hierarchy; and emphasis on study design, integrity of its conclusions and relevance to the population of interest.41,42
Evidence is divided into two categories: ‘E’ for expert evidence (including patients, caregivers and professionals), and ‘R’ for research-based evidence. Five criteria evaluating quality are scored for each study, and the overall study descriptor combines study design, quality and applicability. Research design is categorized into quantitative (P1), qualitative (P2) and mixed methods approaches (P3). For example, a high quality primary qualitative study that is directly applicable to the population of interest would be classified as ‘P2 High Direct’.41,42 A recommendation using an ABC grading scheme is provided after holistic appraisal of the quality and relevance of the total body of evidence. Applicable to research evidence only, this includes quantitative, qualitative and mixed methods designs. The final assessment would be denoted by a combination of the type of evidence along with a rating of research evidence. For example, Grade A research evidence would be listed as ‘RA’, whereas expert evidence would be communicated as ‘E1’ (user expert, i.e. patient or caregiver) or ‘E2’ (professional expert, i.e. clinician) without a letter grade assigned. 42
Clinical applications
NSF-LTC does not award higher points to studies based on their standing within the evidence hierarchy, which is conducive to the type of research available in palliative care. For example, two North American Delphi studies reached expert consensus and developed priority recommendations in palliative medicine and addictions, such as identifying opioid misuse regardless of prognosis and continuing buprenorphine in comorbid opioid use disorder and cancer pain.43,44 Delphi studies use expert consensus to answer a research question and are often considered the lowest tier of evidence. However, each expert brings a unique combination of clinical expertise and theoretical knowledge to the dynamic. 45 These studies strictly adhered to the Delphi process,43,44 and their data is highly applicable to our population of interest. NSF-LTC would grade this favourably, building clinician confidence in safe opioid prescription among this population.
IDSA-ESMO
ESMO adapted the original scheme published in 2001 by the IDSA, modifying it by subdividing the levels of evidence from three to five.46,47 IDSA-ESMO provides a letter grade for the strength of recommendation (Table 3) and a roman numeral for the quality and type of evidence supporting that recommendation (Table 4).46,47 Certain ESMO guidelines explicitly note when clinical trials would be unethical or unfeasible to highlight recommendations supported by clinical expertise and practice standards, by adding an asterisk (*) to the respective grade of recommendation. 48
Table 3.
IDSA-ESMO strength of recommendations.
| Grade | Description |
|---|---|
| A | Strongly recommended; strong evidence for efficacy with a substantial clinical benefit. |
| B | Generally recommended; strong or moderate evidence for efficacy but with a limited clinical benefit. |
| C | Optional; insufficient evidence for efficacy or benefit does not outweigh risks or disadvantages. |
| D | Generally not recommended; moderate evidence against efficacy or for adverse outcomes. |
| E | Never recommended; strong evidence against efficacy or for adverse outcomes. |
Table 4.
IDSA-ESMO levels of evidence.
| Level | Description of evidence |
|---|---|
| I | Evidence from at least one large RCT of good methodological quality with low potential bias; meta-analyses of well-conducted RCTs without heterogeneity |
| II | Small RCTs; phase II RCTs; large RCTs with suspected bias, or meta-analyses of such trials or those with heterogeneity |
| III | Prospective cohort studies; post-hoc and ad-hoc analyses of RCTs |
| IV | Retrospective cohort studies; case-control studies; instrument validation studies |
| V | Studies without a control group; case reports; expert opinions; review articles or narrative reviews; Delphi studies; cross-sectional studies (interviews, focus groups, surveys) |
In 2008, IDSA made the decision to adopt GRADE with the aim of conforming to their international collaborators to facilitate guideline availability.6,50 However, the IDSA-ESMO system remains in use by ESMO.46,47 While the September 2022 ESMO Standard Operating Procedures for guideline developers suggests that the phrasing of recommendations follow those set out by the GRADE Working Group, they still adhere to the letter and Roman numeral typology for level of evidence and strength of recommendation, respectively. 46 Cancer Care Alberta also relies on the IDSA-ESMO system, with their adaptation outlined in their June 2022 Guideline Methodology Handbook. 49
Clinical applications
Most opioid-specific guidelines for cancer pain management within the 2018 ESMO guidelines ranged from level II-III/grade B-C, although certain recommendations on immediate release opioids and transmucosal formulations for breakthrough cancer pain scored high (I, A). 51 There were several level II/grade A-C ratings for the recommendations on opioids in breathlessness. 48
ESMO’s 2021 guidelines on the care of adults with cancer at end-of-life provide psychosocial and spiritual recommendations ranging from level I-III/grade A-B, partly informed by systematic reviews of cross-sectional and longitudinal studies. 52 The ESMO guideline on refractory symptoms and palliative sedation adds a preamble that all assertions are level V evidence because they are based on case series and expert opinion. 53
Comparison of grading systems
Based on the above review, we critically appraised the identified grading systems by comparing the strengths and limitations of each. This was undertaken in the context of our development and review of interdisciplinary palliative care symptom management guidelines as an organization that does not have ample methodological support or extensive guideline development personnel (see Table 5). A summary of this comparison is shown in Table 6.
Table 5.
Comparison of grading systems.
| System | Key features | Advantages a | Limitations a |
|---|---|---|---|
| GRADE | Judgements on both strength of recommendation (2 levels) and quality of evidence (4 levels). | Comprehensive. Two-judgement scales allows for nuance around aspects such as burden of therapy or types of outcomes. Widespread uptake. Systematically upgrades and downgrades evidence based on aspects of the study design and results. Requires reviewers to explicitly outline factors influencing their rating decisions. |
Cannot be applied to qualitative studies or expert opinion. Retrospective and case-control studies are rated as low-quality evidence. Therefore, much of the evidence in palliative care would be rated as low or very low. Limited opportunity for nuance among lower tiers of evidence. Requires both methodological and content expertise to apply (i.e. resource intensive). |
| CERQual | System for appraising evidence in qualitative syntheses. Assigns 4 levels of confidence across 4 components based on how well the phenomenon of interest is represented. |
Offers a framework for adding rigour to assessment of qualitative evidence. Evidence can be downgraded, allowing for nuance. Developed in collaboration with the GRADE Working Group, which confers legitimacy and standardization. |
There are no definitive criteria for assessing levels of confidence among qualitative studies. Requires strong expertise in qualitative methodology. Downgrading of evidence is subject to reviewer discretion, hypothetical risk of low inter-rater reliability. Applicable to qualitative evidence only. |
| SIGN | 4-level scale, with an added option to further indicate strengths or weaknesses (i.e. 1++ for high quality meta-analyses or systematic reviews, 4 for expert opinion). Grade of recommendation (A–D) is based on the highest scoring level of evidence. |
Typology is simple and easy to interpret. Plus (+) and minus (–) symbols added to numerical scale allow for greater nuance in conveying judgement on strength of the level of evidence. |
Considers study type on the evidence hierarchy rather than study quality. Has fallen out of use, with previous guidelines adopting GRADE in lieu which serves a similar purpose but with greater rigour. |
| NSF-LTC | Follows premise that research quality is more important than research design. Two evidence categories – expert evidence (patients, caregivers, professionals) and research evidence. Research evidence is scored based on study design, quality, and applicability. |
Considers challenges of making recommendations for diseases with complex trajectories. No implicit hierarchy to different study designs. Plausible applicability to bodies of knowledge with a relative dearth of experimental evidence, but robust expert opinion (i.e. as seen in palliative care). |
Does not differentiate between study quality and validity. Assesses quantitative and qualitative studies using the same rubric, suggesting inherent ambiguity in its criteria. Unique grading typology that is not widely adopted within other guidelines. This system was not identified in any of the palliative guidelines. |
| ISDA-ESMO | Judgements on strength of recommendation (5 levels, as a letter of the alphabet) and type of evidence (5 levels, as a roman numeral). | 5 levels for strength of recommendation provides greater distinction around lower quality evidence. Typology is recognizable and intuitive. System is in use within published palliative care guidelines. Can be used for both quantitative and qualitative evidence without needing to resort to two different systems. Less resource-intensive from a methodological perspective compared to GRADE. |
Unable to downgrade rating based on biases or methodological flaws within the studies included. Less methodologically rigorous compared to GRADE. |
In the context of palliative care and resource-limited organizations.
Table 6.
Comparison of grading systems based on rigour, resources, typology, breadth and type of evidence.
| System | Rigour | Resources | Typology | Breadth | Evidence type |
|---|---|---|---|---|---|
| GRADE | +++ | +++ | Clear, intuitive | Widespread | Quantitative |
| CERQual | ++ | ++ | Unique, niche | Limited | Qualitative |
| SIGN | + | + | Clear, intuitive | Limited | Quantitative |
| NSF-LTC | + | + | Unique, niche | Very limited | Both |
| IDSA-ESMO | ++ | + | Clear, intuitive | Moderate | Both |
Discussion
Here we have described our identification and characterization of five grading systems relevant to palliative care clinical practice guidelines. Based on this review, we critically appraised the systems by comparing the strengths and limitations of each in the context of interdisciplinary symptom management guidelines developed and reviewed by an organization with limited resources and timelines. Ultimately, we identified IDSA-ESMO as the most pragmatic system in grading practice recommendations made within palliative care guidelines.
Rationale for selecting IDSA-ESMO
Compared to GRADE, IDSA-ESMO does not have the same need for extensive academic methodological expertise. Delphi studies, surveys, interviews and expert opinion are included within its level of evidence scheme,46,49 aligning with the palliative care context and the types of evidence included in our guidelines. Although IDSA-ESMO does not upgrade or downgrade evidence to the same extent as GRADE,6,7,50,54 its level of evidence table considers methodological quality. 46 Conveniently, it avoids having to rely on two distinct grading schemes. Its strengths of recommendation range from A to E,46,49 providing greater distinction around ‘lower quality’ evidence and allowing for more nuance in considering clinical variables and patient preferences. Its typology is recognizable and intuitive, as desired to ensure accessibility among clinician audiences.
The ESMO guidelines on refractory symptom management and end-of-life care in cancer patients48,51 –53 demonstrate versatility and applicability of their system to various palliative care settings, serving as proof of concept and evidence of congruity. The simple addition of an asterisk (*) indicates recommendations that are justified based on expert consensus or standard practice but are not feasibly supported by clinical trials. 48 This notation is both practically useful and assures transparency behind basing recommendations on non-experimental evidence.
Systems for quantitative evidence: GRADE and SIGN
Strengths of the GRADE methodology include its comprehensiveness, extensive adoption and systematic approach to downgrading or upgrading evidence. It enhances transparency by requiring reviewers to explicitly outline the factors influencing their evidence ratings.3,5,7,9,17 It is held in high regard within academic medicine as evidenced by its adoption across various organizations, with an added advantage of familiarity among inter-disciplinary guideline users.
Despite concerns about deviating from the academic standard, we were hesitant to implement GRADE in the absence of sufficient organizational infrastructure. GRADE is resource-intensive, requiring both methodological and content expertise, training and dedicated staff. There are inherent financial and time considerations in assuring adequate technical understanding of this system.
In addition, GRADE has been criticized for having poor inter-rater agreement and lacking internal consistency, which may be attributed to discrepancies between ratings from clinicians and GRADE methodology experts.3,7 This shortcoming may be compounded by inexperience or unfamiliarity.
The GRADE Working Group discourages modification of its methodology due to inconsistencies with best practice standards. 4 The investment required may outweigh the benefits, as palliative care evidence would likely have universally low levels of certainty or be ungradable. There is a lack of distinction between tiers of ‘lower quality’ evidence in GRADE. Even with tools such as ROBINS-I, most of the evidence in palliative care would be downgraded due to unavoidable bias and small sample sizes. There is a risk of reinforcing provider hesitancy and indecision. Implementing a fulsome GRADE approach may exceed the capacity of smaller organizations.
In contrast to GRADE’s complexity, the historic SIGN methodology was simpler in its reliance on the hierarchy of evidence heuristic. However, its ABCD scheme disadvantageously addressed the quality of evidence rather than the clinical importance of the recommendations.38,39 This system would also have provided us with low quality of evidence scores, in addition to being less methodologically rigorous and largely obsolete.
Systems for qualitative or mixed-methods evidence: CERQual and NSF-LTC
CERQual shows promise in adding academic rigour to qualitative evidence, which is prominent in palliative care research and informs many important non-biomedical areas such as ethics, bereavement and spiritual distress. However, proper implementation of CERQual requires strong expertise in qualitative methodology. Its components are inherently vague, lacking strict criteria for upgrading or downgrading evidence.30,32 We came across select systematic reviews 34 but no palliative care guidelines that seamlessly integrated recommendations using a quantitative framework in conjunction with CERQual. Adding a secondary grading system may convolute our process and impair usability among the intended audiences due to the additional cognitive load of interpreting two different systems.
NSF-LTC was designed for use in long-term neurological conditions, which share parallels with other life-limiting illnesses under the purview of palliative care, such as the need to adapt interventions based on disease progression, non-linear trajectory and unquantifiable experiences of pain or other terminal symptoms. However, a major limitation is its relative obscurity; we did not identify any organizations who have adopted NSF-LTC in the palliative care context. It was included in our comparison for its theoretical utility, although we were deterred by a lack of widespread adoption to date. This system also lacks rigour in its ability to differentiate between study quality and validity. 39 NSF-LTC assesses quantitative and qualitative studies using the same rubric, implying ambiguity in its criteria. One study found that three-quarters of assessors used NSF-LTC incorrectly. 39 Finally, we were concerned that clinicians may find its unfamiliar grading typology difficult to decipher.
Conclusion
Acceptance of GRADE as the ‘gold standard’ in rating the evidence supporting practice recommendations is a compelling reason for large scale and well-resourced organizations to adopt it. However, the expertise and capacity requirements can be out of scope for smaller organizations. The lack of granularity for assessing non-RCT study designs does not lend itself well to the evidence base available in palliative care. By comparison, the implementation of IDSA-ESMO is more feasible for lower-resourced guideline development groups. Based on the grading systems we reviewed, the IDSA-ESMO approach was the most succinct and pragmatic. It has practical strengths despite lacking the methodological rigour of GRADE, and is appropriately suited to the palliative care evidence base.
Supplemental Material
Supplemental material, sj-pptx-1-pmj-10.1177_02692163241286658 for A pragmatic approach to selecting a grading system for clinical practice recommendations in palliative care by Sasha Voznyuk, Rachel Z Carter and Julia Ridley in Palliative Medicine
Footnotes
Authorship: Sasha Voznyuk: Made a substantial contribution to conceptualization; drafted the article; revision and editing; approved the final version submitted for publication.
Rachel Z Carter: Contributed to writing and review of drafts of the article; approved the final version submitted to publication.
Julia Ridley: Made a substantial contribution to conceptualization; critically reviewed drafts of the article; approved the final version submitted for publication.
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Ms Voznyuk was supported through the UBC Faculty of Medicine Summer Student Research Program.
Research ethics and patient consent: Not required.
Data management and sharing: All material for this article is available online.
ORCID iDs: Sasha Voznyuk
https://orcid.org/0009-0006-9650-2065
Rachel Z Carter
https://orcid.org/0000-0001-8609-798X
Supplemental material: Supplemental material for this article is available online.
References
- 1. Guerra-Farfan E, Garcia-Sanchez Y, Jornet-Gibert M, et al. Clinical practice guidelines: the good, the bad, and the ugly. Injury 2023; 54: S26–S29. [DOI] [PubMed] [Google Scholar]
- 2. Institute of Medicine. Clinical practice guidelines we can trust. Washington, DC: The National Academies Press, 2011. [Google Scholar]
- 3. Kavanagh BP. The GRADE system for rating clinical guidelines. PLoS Med 2009; 6: e1000094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Schünemann H, Brożek J, Guyatt G, et al. (eds). GRADE handbook for grading quality of evidence and strength of recommendations, https://gdt.gradepro.org/app/handbook/handbook.html (2013, accessed 12 March 2023).
- 5. Dijkers M. Introducing GRADE: a systematic approach to rating evidence in systematic reviews and to guideline development, https://ktdrr.org/products/update/v1n5/dijkers_grade_ktupdatev1n5.pdf (2013, accessed 17 December 2022).
- 6. Deresinski S. Guiding clinical care through evidence-free zones. Clin Infect Dis 2010; 51: 1157–1159. [DOI] [PubMed] [Google Scholar]
- 7. Khan AR, Khan S, Zimmerman V, et al. Quality and strength of evidence of the Infectious Diseases Society of America clinical practice guidelines. Clin Infect Dis 2010; 51: 1147–1156. [DOI] [PubMed] [Google Scholar]
- 8. BC Centre for Palliative Care. B.C. Inter-professional palliative symptom management guidelines, https://bc-cpc.ca/wp-content/uploads/2018/09/SMGs-interactive-final-Nov-30-compressed.pdf (2017, accessed 15 December 2022).
- 9. Higgins J, Thomas J, Chandler J, et al. (eds). Cochrane handbook for systematic reviews of interventions version 6.3, www.training.cochrane.org/handbook (2022, accessed 12 March 2023).
- 10. World Health Organization. WHO guidelines for the pharmacological and radiotherapeutic management of cancer pain in adults and adolescents, https://www.who.int/publications/i/item/9789241550390 (2019, accessed 14 December 2022). [PubMed]
- 11. Canadian Task Force on Preventative Health Care. Chapter 1: overview of the Canadian Task Force on Preventive Health Care, https://canadiantaskforce.ca/chapter-1-overview-of-the-canadian-task-force-on-preventive-health-care/ (2022, accessed 18 December 2022).
- 12. National Institute for Health and Care Excellence (NICE). Developing NICE guidelines: The manual, www.nice.org.uk/process/pmg20 (2014, accessed 17 December 2022). [PubMed]
- 13. Ahluwalia SC, Chen C, Raaen L, et al. A systematic review in support of the national consensus project clinical practice guidelines for quality palliative care. 4th ed. J Pain Symptom Manage 2018; 56: 831–870. [DOI] [PubMed] [Google Scholar]
- 14. Registered Nurses’ Association of Ontario. A palliative approach to care in the last 12 months of life: Best practice guidelines, https://rnao.ca/bpg/guidelines/palliative-approach-care-last-12-months-life (2020, accessed 17 December 2022).
- 15. Balshem H, Helfand M, Schünemann HJ, et al. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol 2011; 64: 401–406. [DOI] [PubMed] [Google Scholar]
- 16. Higgins JPT, Altman DG, Gøtzsche PC, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ 2011; 343: d5928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Siemieniuk R, Guyatt G. What is GRADE? BMJ Best Practice, https://bestpractice.bmj.com/info/toolkit/learn-ebm/what-is-grade/ (accessed 15 December 2022).
- 18. Guyatt GH, Oxman AD, Vist GE, et al. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ 2008; 336: 924–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Baker A, Young K, Potter J, et al. A review of grading systems for evidence-based guidelines produced by medical specialties. Clin Med 2010; 10: 358–363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Schünemann HJ, Cuello C, Akl EA, et al. GRADE guidelines: 18. How ROBINS-I and other tools to assess risk of bias in nonrandomized studies should be used to rate the certainty of a body of evidence. J Clin Epidemiol 2019; 111: 105–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Guyatt GH, Oxman AD, Sultan S, et al. GRADE guidelines: 9. Rating up the quality of evidence. J Clin Epidemiol 2011; 64: 1311–1316. [DOI] [PubMed] [Google Scholar]
- 22. Johnson MJ, Cockayne S, Currow DC, et al. Oral modified release morphine for breathlessness in chronic heart failure: a randomized placebo-controlled trial. ESC Heart Fail 2019; 6: 1149–1160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Verberkt CA, van Den Beuken-Van Everdingen MHJ, Schols JMGA, et al. Effect of sustained-release morphine for refractory breathlessness in chronic obstructive pulmonary disease on health status: a randomized clinical trial. JAMA Intern Med 2020; 180: 1306–1314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Kawaguchi J, Hamatani Y, Hirayama A, et al. Experience of morphine therapy for refractory dyspnea as palliative care in advanced heart failure patients. J Cardiol 2020; 75: 682–688. [DOI] [PubMed] [Google Scholar]
- 25. Schmidt-Hansen M, Bennett MI, Arnold S, et al. Oxycodone for cancer-related pain. Cochrane Database Syst Rev 2022; 6: CD003870. [DOI] [PubMed] [Google Scholar]
- 26. Li Y, Ma J, Lu G, et al. Hydromorphone for cancer pain. Cochrane Database Syst Rev 2021; 8: CD011108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Schmidt-Hansen M, Bromham N, Taubert M, et al. Buprenorphine for treating cancer pain. Cochrane Database Syst Rev 2015; 2015: CD009596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Wiffen PJ, Wee B, Derry S, et al. Opioids for cancer pain – an overview of Cochrane reviews. Cochrane Database Syst Rev 2017; 7: CD012592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Zecca E, Brunelli C, Bracchi P, et al. Comparison of the tolerability profile of controlled-release oral morphine and oxycodone for cancer pain treatment: an open-label randomized controlled trial. J Pain Symptom Manage 2016; 52: 783-794.e6. [DOI] [PubMed] [Google Scholar]
- 30. Lewin S, Glenton C, Munthe-Kaas H, et al. Using qualitative evidence in decision making for health and social interventions: an approach to assess confidence in findings from qualitative evidence syntheses (GRADE-CERQual). PLoS Med 2015; 12: e1001895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Lewin S, Booth A, Glenton C, et al. Applying GRADE-CERQual to qualitative evidence synthesis findings: introduction to the series. Implement Sci 2018; 13(Suppl 1): 2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Wainwright M. The GRADE-CERQual approach for assessing how much confidence to place in the findings of qualitative evidence syntheses and the iSoQ tool, https://training.cochrane.org/sites/training.cochrane.org/files/public/uploads/GRADE%20CERQual.pdf (2022, accessed 13 March 2023).
- 33. Nichols VP, Toye F, Eldabe S, et al. Experiences of people taking opioid medication for chronic non-malignant pain: a qualitative evidence synthesis using meta-ethnography. BMJ Open 2020; 10: e032988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Edwards D, Anstey S, Coffey M, et al. End of life care for people with severe mental illness: mixed methods systematic review and thematic synthesis (the MENLOC study). Palliat Med 2021; 35: 1747–1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Schofield G, Dittborn M, Huxtable R, et al. Real-world ethics in palliative care: a systematic review of the ethical challenges reported by specialist palliative care practitioners in their clinical practice. Palliat Med 2021; 35: 315–334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Scottish Intercollegiate Guidelines Network. Applying the GRADE methodology to SIGN guidelines: core principles, https://www.sign.ac.uk/media/1835/gradeprincipals.pdf (2010, accessed 13 March 2023).
- 37. Scottish Intercollegiate Guidelines Network. Methodology: new developments, https://www.sign.ac.uk/what-we-do/methodology/new-developments/ (2021, accessed 13 March 2023).
- 38. Cooper NAM, Khan KS, Clark TJ. Evidence quality in clinical guidelines: a comparison of two methods. Acta Obstetric Gynecolog Scand 2015; 94: 1283–1289. [DOI] [PubMed] [Google Scholar]
- 39. Baker A, Potter J, Young K, et al. The applicability of grading systems for guidelines. J Eval Clin Pract 2011; 17: 758–762. [DOI] [PubMed] [Google Scholar]
- 40. Scottish Intercollegiate Guidelines Network. SIGN 50: a guideline developer’s handbook, https://www.sign.ac.uk/media/2038/sign50_2019.pdf (2019, accessed 12 March 2023).
- 41. Turner-Stokes L, Harding R, Sergeant J, et al. Generating the evidence base for the National Service Framework for Long Term Conditions: a new research typology. Clin Med 2006; 6: 91–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. DH Long-term Conditions NSF Team. The National Service Framework for Long-term Conditions, https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/198114/National_Service_Framework_for_Long_Term_Conditions.pdf (2005, accessed 12 March 2023).
- 43. Lau J, Mazzotta P, Whelan C, et al. Opioid safety recommendations in adult palliative medicine: a North American Delphi expert consensus. BMJ Support Palliat Care 2022; 12: 81–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Merlin JS, Khodyakov D, Arnold R, et al. Expert panel consensus on management of advanced cancer-related pain in individuals with opioid use disorder. JAMA Netw Open 2021; 4: e2139968. [DOI] [PubMed] [Google Scholar]
- 45. Niederberger M, Spranger J. Delphi technique in health sciences: a map. Front Public Health 2020; 8: 457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. ESMO Guidelines Committee. ESMO Standard Operating Procedures (SOPs) for Clinical Practice Guidelines (CPGs) and ESMO Magnitude of Clinical Benefit (ESMO-MCBS) and ESMO Scale for Clinical Actionability of molecular Targets (ESCAT) scores, https://www.esmo.org/content/download/77789/1426712/1/ESMO-Clinical-Practice-Guidelines-Standard-Operating-Procedures.pdf (2022, accessed 12 March 2023).
- 47. Dykewicz CA. Summary of the guidelines for preventing opportunistic infections among hematopoietic stem cell transplant recipients. Clin Infect Dis 2001; 33: 139–144. [DOI] [PubMed] [Google Scholar]
- 48. Hui D, Maddocks M, Johnson MJ, et al. Management of breathlessness in patients with cancer: ESMO clinical practice guidelines. ESMO Open 2020; 5: e001038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Cancer Care Alberta. Guideline methodology handbook, https://www.albertahealthservices.ca/assets/info/hp/cancer/if-hp-cancer-guide-utilization-handbook.pdf (2022, accessed 12 March 2023).
- 50. Miles KE, Rodriguez R, Gross AE, et al. Strength of recommendation and quality of evidence for recommendations in current Infectious Diseases Society of America guidelines. Open Forum Infect Dis 2021; 8: ofab033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Fallon M, Giusti R, Aielli F, et al. Management of cancer pain in adult patients: ESMO clinical practice guidelines. Ann Oncol 2018; 29: iv166–iv191. [DOI] [PubMed] [Google Scholar]
- 52. Crawford GB, Dzierżanowski T, Hauser K, et al. Care of the adult cancer patient at the end of life: ESMO clinical practice guidelines. ESMO Open 2021; 6: 100225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Cherny NI. ESMO clinical practice guidelines for the management of refractory symptoms at the end of life and the use of palliative sedation. Ann Oncol 2014; 25: iii143–iii152. [DOI] [PubMed] [Google Scholar]
- 54. Lee DH, Vielemeyer O. Analysis of overall level of evidence behind Infectious Diseases Society of America practice guidelines. Arch Intern Med 2011; 171: 18–22. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplemental material, sj-pptx-1-pmj-10.1177_02692163241286658 for A pragmatic approach to selecting a grading system for clinical practice recommendations in palliative care by Sasha Voznyuk, Rachel Z Carter and Julia Ridley in Palliative Medicine
