Are "Complicated Grief" (CG) criteria superior to "Prolonged Grief Disorder" (PGD) criteria in diagnosing grief disorder? – No.
PGD criteria ought to be the standard for diagnosing disordered grief
In 2009, following the completion of an NIH/NIMH-funded investigation of consensus criteria for disordered grief, we published validated criteria for a new diagnostic entity, prolonged grief disorder (PGD).1 Based on the results of this field trial, the Yale Bereavement Study (YBS), we proposed PGD’s inclusion to DSM-5 and ICD-11. The strength of the evidence in support of PGD from the YBS, supplemented by confirmatory findings from multiple international studies, compelled both the DSM and ICD to take steps to include a new disorder of grief.
The DSM’s and the ICD’s tentative formulations of diagnostic criteria for a disorder of grief agree almost perfectly with PGD criteria.2 They also share PGD’s sound psychometric properties. By contrast, complicated grief (CG) criteria 3 do not agree with DSM and ICD criteria, use abstruse syntax, produce more false positive than true positive diagnoses, and lack predictive validity.2 Evidence supports the validity of PGD,1 and its DSM and ICD derivatives,2 but weighs against the validity of CG.2 PGD meets established criteria for a mental disorder;4 CG does not. Based on existing evidence, PGD criteria ought to be the standard for diagnosing disordered grief.
Methodological misgivings
Presumably, medical journals only publish studies that employ standard, scientifically sound methods. Standard information for evaluating diagnostic criteria includes estimates of diagnostic accuracy (e.g., sensitivity and specificity) and prevalence of disorder. Curiously, this information is absent from the Cozza et al. publication “Performance of DSM-5 Criteria for Persistent Complex Bereavement Disorder…” (PCBD),5 which concludes that CG criteria are superior to PGD and PCBD criteria. In the YBS data,2 CG criteria produce more false positive cases (63%) than true (37%) positive cases of disorder and have an unacceptably high (30%) overall rate of diagnosis. Based on YBS results, we expect that the unreported overall rate of diagnosis of CG and false positive CG test rate in the entire Cozza et al. sample are unacceptably high.
The results that Cozza et al. do report are inherently “spectrum biased.” Cozza et al. discard nearly half (n=797, 46%) of their total sample to focus on the most obvious “cases” (n=260, 15%) and “controls” (n=675, 39%). Spectrum bias 6 in estimates of diagnostic accuracy results from excluding less obvious, borderline cases in favor of extreme, easier to evaluate, cases and controls. Spectrum-biased designs overestimate sensitivity and specificity by omitting diagnostic errors from near-threshold cases. The real test of diagnostic performance is not in identifying extremes, but rather in discerning in-between, more difficult to evaluate, cases. Cozza et al.’s spectrum-biased design does not account for the many false positive tests for CG that would have appeared in the large excluded segment (46%) of their analytic sample. Most likely, and, if so, consistent with findings from the YBS,2 there are more false than true positive tests for CG in Cozza et al.’s full sample.
Prevalence rates and rates of false positives and negatives obtained for the full sample (not a spectrum-biased group of cases and controls) are needed to determine which criteria sets have superior performance. Essential, yet missing, information and use of a spectrum-biased design raise questions concerning the soundness of the science behind the CG proposal.
CG criteria pathologize normal grief
Bereavement is a common, natural life event. It is normal to be upset following the loss of someone loved; heightened vigilance is needed to avoid pathologizing normal reactions. For this reason, diagnostic criteria for grief disorder should prioritize diagnostic specificity (minimizing false positives) over sensitivity (minimizing false negatives). CG criteria, lax both in number and severity of symptoms required for diagnosis, produce many false positives, over-diagnose grief disorder, and pathologize normal grief. Applying the “moderate” symptom severity threshold employed by Cozza et al.5 to CG criteria 3 that require only three symptoms for diagnosis, the positive test rate for CG in the YBS sample is 62%. Thus, CG criteria diagnose most bereaved individuals in a community sample as mentally ill – a result undermining their face validity (e.g., laypersons are likely to consider such criteria suspect).
Straw Men arguments
Suggestions that CG criteria are superior to PGD criteria in their embodiment of clinical wisdom or applicability are straw men arguments. These claims strive to shift attention away from issues of paramount importance to diagnostic assessment, i.e., diagnostic validity and accuracy. Simply put, diagnostic validity must be established before proceeding to clinical applications; it does not make sense to discuss clinical applications of invalid diagnoses.
We do not deny that clinical insight is essential to the formulation of psychiatric diagnoses. PGD, like CG, was informed by the clinical insight of prominent psychiatrists (e.g., Drs. Parkes, Horowitz, Jacobs, Shear, and Reynolds). However, clinical opinion alone is insufficient for the validation of diagnostic criteria. Clinical insight was translated into an NIH/NIMH-funded investigation of consensus criteria for PGD that produced compelling evidence of a new diagnostic entity.1 By contrast, CG criteria were introduced in a review article3 (as opposed to an original research article) without any empirical support, and specifically without evidence of predictive validity.2 Given the ease with which CG criteria can be satisfied, they also lack face validity.
The claim that CG criteria, compared with PGD criteria, identify a greater number of individuals who will benefit from treatment is another straw man argument. The main purpose of diagnostic assessment is to determine whether an individual truly has a disorder. To paraphrase DSM-5, accurate diagnosis is a prerequisite for appropriate treatment. Any argument about the superior clinical applicability of CG criteria diverts attention away from diagnostic accuracy to bereaved individuals’ presumed need for clinical care. As Spitzer wrote: “To confuse making a mental disorder diagnosis with demonstrating treatment need [is]… a serious mistake.” 7
Moreover, there is no evidence that CG criteria accurately identify bereaved individuals in need of, or likely to benefit from, treatment. We found that CG criteria, unlike PGD criteria, were unrelated to risk of future mental disorder, functional impairment, or diminished quality of life.2 Thus, CG criteria do not identify bereaved individuals at risk of enduring dysfunction who might be helped by an intervention. In fact, there is no evidence that CG criteria are better than PGD criteria with respect to any clinical process or outcome. These lines of argument are aimed at drawing attention away from fundamental issues of diagnostic performance (e.g. prevalence, false positive rates, predictive validity) to focus, prematurely, on issues of clinical application.
Evidence suggests abandoning CG and supporting PGD criteria as the diagnostic standard
Data from multiple, independent, community-based datasets tell a consistent, compelling story – one that supports the diagnostic validity and accuracy of PGD criteria. This is not the case for CG criteria. CG criteria lack validity,2 produce more false than true positive tests for disorder2 and, because they are too easily satisfied, pathologize normal grief. PGD criteria reliably and validly identify bereaved individuals genuinely in need of and likely to benefit from the help of a mental health professional. CG criteria are inadequate and counterproductive, and should be withdrawn from serious consideration. It is time for scientists and clinicians to agree that PGD should be adopted as the standard for diagnosing disordered grief.
Acknowledgments
Research grant MH56529 from the National Institute of Mental Health (PI: Prigerson) supported the collection of data for testing consensus criteria for disordered grief. Dr. Prigerson’ s Outstanding Investigator Award from the National Cancer Institute (NCI R35: CA197730) supported her effort in the preparation of this manuscript. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Footnotes
Neither Dr. Prigerson nor Dr. Maciejewski has any conflicts of interest to report. All conflict of interest disclosure information for Drs. Prigerson and Maciejewski is accurate, complete, and up-to-date.
References
- 1.Prigerson HG, Horowitz MJ, Jacobs SC, Parkes CM, Aslan M, Goodkin K, et al. Prolonged grief disorder: Psychometric validation of criteria proposed for DSM-V and ICD-11. PLoS Medicine. 2009;6:e1000121. doi: 10.1371/journal.pmed.1000121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Maciejewski PK, Maercker A, Boelen PA, Prigerson HG. "Prolonged grief disorder" and "persistent complex bereavement disorder", but not "complicated grief", are one and the same diagnostic entity: an analysis of data from the Yale Bereavement Study. World Psychiatry. 2016;15(3):266–75. doi: 10.1002/wps.20348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Shear MK, Simon N, Wall M, Zisook S, Neimeyer R, Duanet N, et al. Complicated grief and related bereavement issues for DSM-5. Depress Anxiety. 2011;28:103–17. doi: 10.1002/da.20780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Feighner JP, Robins E, Guze SB, Woodruff RA, Winokur G, Munoz R. Diagnostic criteria for use in psychiatric research. Arch Gen Psychiatry. 1972;26:57–63. doi: 10.1001/archpsyc.1972.01750190059011. [DOI] [PubMed] [Google Scholar]
- 5.Cozza SJ, Fisher JE, Mauro C, Zhou J, Ortiz CD, Skritskaya N, et al. Performance of DSM-5 Persistent Complex Bereavement Disorder Criteria in a Community Sample of Bereaved Military Family Members. Am J Psychiatry. 2016;173:919–29. doi: 10.1176/appi.ajp.2016.15111442. [DOI] [PubMed] [Google Scholar]
- 6.Ransohoff DF, Feinstein AR. Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. N Engl J Med. 1978;299:926–30. doi: 10.1056/NEJM197810262991705. [DOI] [PubMed] [Google Scholar]
- 7.Spitzer RL. Diagnosis and need for treatment are not the same. Arch Gen Psychiatry. 1998;5:120. doi: 10.1001/archpsyc.55.2.120. [DOI] [PubMed] [Google Scholar]
