Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Jul 27.
Published in final edited form as: Psychooncology. 2022 Dec 15;32(2):203–213. doi: 10.1002/pon.6063

Proxy ratings of psychological well-being in patients with primary brain tumors: A systematic review

Timothy S Sannes 1,2, Miryam Yusufov 1,2, Hermioni L Amonoo 1,2,3, Elizabeth G Broden 1, Darcy E Burgers 1,2, Paul Bain 4, Cristina Pozo-Kaderman 1,2, Damien M Miran 1,2, Timothy S Smith 2,5, Ilana M Braun 1,2, William F Pirl 1,2
PMCID: PMC10373343  NIHMSID: NIHMS1906775  PMID: 36371618

Abstract

Objective:

This systematic review examined the agreement of proxy ratings of depression and anxiety in neuro-oncology patients.

Methods:

Searches were conducted across 4 databases (MEDLINE, Embase, PsycINFO, CINAHL, and Web of Science) to identify studies that compared proxy ratings (non-health care providers) of anxiety and depression in patients with brain cancer. Methodological quality and potential risk of bias were evaluated using the Joanna Briggs Institute Critical Appraisal Checklist.

Results:

Out of the 936 studies that were screened for inclusion, 6 were included for review. The findings were mixed in terms of whether patient and proxy ratings were accurate (e.g., deemed equivalent), with many of the selected studies suggesting moderate level of agreement for several of the selected studies and, when both depression and anxiety were included, depression ratings from proxy raters were more accurate than for anxiety. We identified important limitations across the selected articles, such as low sample size, clarity on defining proxy raters and the different instructions that proxy raters are given when asked to assess patients’ mood symptoms.

Conclusions:

Our findings suggest that proxy ratings of depression and anxiety should be interpreted with caution. While there is some agreement in proxy and patients with brain cancer ratings of depression and anxiety (greater agreement for depression), future work should recruit larger samples, while also remaining mindful of defining proxy raters and the instructions given in collecting these ratings.

Keywords: anxiety, cancer, caregiver, depression, neuro-oncology, oncology, proxy ratings, screening

1 |. INTRODUCTION

Patients with cancer are particularly susceptible to suffering from depression and anxiety.1 Prevalence rates vary widely, between 5% and 50% for depression2 and 19%–42% for anxiety.3,4 This mood disturbance can negatively impact adherence to treatments, patients’ perception of other symptoms, and quality of life. There is also suggestion that depression and anxiety contribute to increased cancer incidence, and both cancer-specific and all-cause mortality,5 While screening for more general stress remains a particular focus in psycho-oncology,6 screening and measuring depression and anxiety, specifically, remains critically important as effective supportive interventions exist to treat these conditions.7 Despite the knowledge of this gap,8 depression and anxiety remain widely undetected in much of oncology practice.9,10

Neuro-oncology presents inherent challenges to assessing depression and anxiety. With expressive communication frequently impacted, neuro-oncology patients may be challenged in completion of questionnaires inquiring about these domains. In addition to the neural location of the tumor, treatment often includes surgical resection, radiation to the brain and potential systemic therapy, all of which can also impact patients’ ability to communicate, as well as their mood and well-being.11,12 These challenges extend to patients enrolled in clinical trials, as patients have more motor dysfunction and worsening clinical status, they are less likely to complete questionnaires and more likely to drop out of studies.13 Despite these challenges, many have advocated for the incorporation of questionnaires into the assessment of clinical endpoints, termed RANO (Response Assessment in Neuro-oncology),14 given the importance to patients’ quality of life and ability to distinguish between comparable treatment regimens. These inherent challenges in depression/anxiety measurement in neuro-oncology have led many investigators to focus on capturing proxy ratings of patients’ quality of life and well-being.

The challenges in assessing another’s well-being are multifold. Measuring other’s perceived ability to function or subjective experience can be challenging, and researchers rarely apply appropriate methodology to instruct proxies in measuring patient behavior15 nor apply appropriate analytic techniques to compare the accuracy of proxy ratings16 leading to disagreement in ratings. When clinical care providers attempt to rate quality of life in their patients, agreement is poor17,18 compared to close relatives or spouses in which the agreement is much higher.19 There is more agreement in concrete domains (such as functioning) than in subjective domains such as mood. For instance, when dementia patients rate their own quality of life, they rate this as higher than both their relatives and staff.20 Parents of children with cancer may underestimate their children’s emotional concerns overall, but overestimate the anxiety they have over the course of treatment.21 Collectively, the challenge of consistent agreement between patients’ self-assessment and proxy’s assessment is referred to as the “interrater gap” and acknowledges that imagining another’s experience should be an intentional shift in perspective.15

Examination of proxy raters’ accuracy suggests some agreement between patients and caregivers’ perception of the patients’ quality of life,22 much less attention has been paid to domains of mood.23 We know very little about assessing depression and anxiety in neuro-oncology patients with proxy raters, yet up to 48% of patients and 40% of caregivers experience significant depression or anxiety following diagnosis of a brain tumor.24 Several demographic and treatment-related factors, such as being female, having a lower WHO tumor grade classification, lower education level, and a history of psychiatric illness are significantly related to greater likelihood of clinically significant anxiety or depression.25 To our knowledge, there has not been a systematic review of the agreement of proxy ratings of depression or anxiety in neuro-oncology, despite their high prevalence rates and inherent challenges in assessing mood in this population.

With this background, the current aims of the study were as follows:

  1. What is the level of agreement between proxy ratings of patients’ level of anxiety and depression and patient reported measures of these domains in primary brain cancer?

  2. If both patient and caregiver proxy measures of anxiety/depression are collected and reported, what is the strength of this relationship and are they deemed equivalent?

2 |. METHODS

We followed the statement of Preferred Reporting Items for Systematic Review and Meta-Analyses outlined by Moher and colleagues.26 The systematic review was registered with PROSPERO registration number CRD42021247201.

2.1 |. Search strategy

We conducted a search of the following electronic bibliographic databases: MEDLINE (Ovid), Embase (Elsevier), Web of Science Core Collection (Clarivate), PsycINFO (EBSCO), and CINAHL (EBSCO). The search strategy was designed to capture publications reporting any outcome using a proxy measure in brain cancer patient-caregiver dyads (Appendix 1). “Proxy ratings” were defined by ratings of patients’ well-being from someone outside of the treating medical team. This included any type of informal/unpaid caregiver, involved in the patients’ care; however, to ensure that comparisons were appropriate across the selected article, we extracted the definition of each manuscript’s proxy rater. We relied on a previously applied search strategy to define primary brain tumors.27 For including proxy ratings, we applied the definition of ratings of patients’ well-being from someone outside of the treating medical team. Controlled vocabulary terms were included in the search when available; no date limit was applied. The search was last run on 28 April 2022. Covidence software28 was used to organize all abstracts.

2.2 |. Inclusion criteria

Studies were included if (1) they recruited individuals 18-years-old or older with primary brain tumors diagnosed at or after 18-year of age; (2) examined psychological factors (depression and anxiety) assessed by a caregiver proxy who was not part of the healthcare team; (3) contained depression and anxiety rated by the patient; (4) were published in the English language. Studies were excluded if (1) published as abstract only; (2) review paper only; (3) lacked relevance (e.g., basic science, animal studies, etc); (4) captured other proxy ratings (e.g., from a clinical professional, for instance); (5) psychological factors not quantitatively measured; (6) No comparison of proxy ratings of patients’ anxiety/depression and patient rating of anxiety/depression. There were no limitations on study setting or country of conduct. After employing this search strategy, however, and identifying only five papers meeting inclusion criteria, we opted to also include any paper captured by our search criteria that compared patient and caregiver ratings of anxiety and depression, even if these constructs were part of a larger battery of measuring quality of life, in line with evidence that anxiety and depression significantly overlap with quality of life.2931 We also opted to include studies that measured proxy ratings of depression or anxiety, not restricting our search to having both. In our screening of studies, it also became apparent that several studies did not describe how measures of anxiety and depression were adapted to obtain proxy ratings or, more specifically, the instructions that proxy raters were given which we felt particularly important given the interrater gap referenced above. Thus, we extracted how each study described their approach to administering their specific measures to proxies, or if these details were not provided. Based on these revised criteria, six studies were included in this systematic review (see Figure 1 for reporting according to PRISMA guidelines).26

FIGURE 1.

FIGURE 1

PRISMA describing screening and selection of studies.

2.3 |. Data extraction

Study authors (Hermioni L. Amonoo, Elizabeth G. Broden, Miryam Yusufov and William F. Pirl) extracted the following from each of the selected articles: lead author, publication year, study design, sample size, cancer characteristics of study population, and measures employed (Table 1). Other study authors (Damien M. Miran and Darcy E. Burgers) also extracted statistical values related to the level of agreement of proxy ratings and how studies defined caregivers/proxy raters, further described below. Study authors (Timothy S. Sannes and Miryam Yusufov) assessed methodological quality and potential risk of bias using the Joanna Briggs Institute Critical Appraisal Checklist for Studies Reporting Prevalence Data32 (Table 2). Questions taken from these criteria32 assessed by our team included, “Was the sample frame appropriate to address the target population?” “Were study participants sampled in an appropriate way?” “Were the study subjects and the setting described in detail?” “Was the data analysis conducted with sufficient coverage of the identified sample?” “Was the condition measured in a standard, reliable way for all participants?” “Was there appropriate statistical analysis?” For the question, “Was the sample size adequate?” we consulted criteria for other dyadic studies.33

TABLE 1.

Extracted papers, associated measures and statistical comparisons

Study authors/year Participant sample (N) Instrument Statistical comparison Numerical level of agreement: Group comparison VS. continuous scale of level of agreement p value
Moreale et al. (2017) Low grade glioma patients who underwent a neurosurgical procedure at least 1 year prior (n = 46) Beck depression inventory scalea
State- trait anxiety inventorya
Paired t-test and chi square tests using the pearson index. Depression:
- Patient: 7.76 versus caregiver: 6.23
p = 0.102
Anxiety:
- State: Patients: 44.26 versus caregiver: 40.28
p = 0.031
- Trait: Patient: 37.95 versus caregiver: 35.17 p = 0.047
Rooney et al. (2013) New glioma diagnosis; receiving active treatment (N = 41) Patient health Questionnaire-9 (PHQ-9; for internal validity)a Patient-proxy agreement = PHQ-9 within two points: PHQ-9 agreement: 36.6% (15/41) of cases 6.80 versus proxy ratings of 8.40 p = 0.016
Structured clinical interview for DSM-IV (SCID; for external validity)b Paired sample wilcoxon rank sum test
Intra-class correlation coefficient ICC: 0.69
Cohen’s kappa (K; proxy reports on questionnaire compared to diagnostic interview for patients’ depression) Proxy K = 0.58
Patient K = 0.47
Milbury et al. (2019) Patients with WHO grade I-IV glioma (n = 20) Center for epidemiological studies-depression (CES-D)a Paired t-tests between patient-proxy ratings CES-D: patient (mean = 16.49) versus caregiver (mean = 16.25) t = 0.13 p = 0.90
MD Anderson symptom inventory-brain tumor module (MDASI-BT): subscalesa Intra-class correlation coefficient MDASI-BT: No difference in overall scores, subscales: ICC 0.51–0.82 P = 0.007–0.0001
Papers that compared agreement of emotional well-being without specific measures of anxiety/depression
Armstrong et al. (2012) Primary brain tumor patients receiving neurocognitive testing as part of routir care (n = 115) MD Anderson symptom inventory-brain tumor (MDASI-BT)a Comparison of SD/CI (group differences with >90%) Distress: 0.92
Sadness: 0.62
p = 0.001
p = 0.032
Mean difference Irritability: 0.26
Mood: 0.47
p = 0.28
p = 0.074
Pearson correlation Distress: 0.09
Sadness: 0.14
Irritability: 0.15
Mood: 0.11
p = 0.32
p = 0.14
p = 0.11
p = 0.25
Brown et al. (2008) Patients with grade 3 astrocytomas in 3 treatment protocols (n = 197) Profiles of mood states short form (POMS-SF)a - Paired signed rank test % w/in 10 pts of one another
POMS: 61%
p = 0.30
Baseline: 118 dyads The symptom distress scale (SDS)a SDS: 62%, p = 0.11
- Spearman’s correlation POMS: 0.65
SDS: 0.66
N/A
- Intraclass correlation (ICC) POMS: 0.61
SDS: 0.64
Jacobs et al. (2014) Grade III or IV malignant glioma (n = 45) FACT-Br subscale: Emotional well-being (EWB)a Intraclass correlation coefficient (ICC) EWB: 0.58 N/A

Note: For comparing the accuracy between two groups, paired samples t-tests or, for data that is not normally distributed, paired signed rank test are common. Across both tests, a significant value is interpreted as the two samples being nonequivalent. For assessing the level of agreement between two measurements of a continuous variable common tests include Pearson correlation and ICCs. Correlation coefficients and ICCs provide a single value for the level of agreement and, if statistically significant, then the two ratings are comparable. The scale for the ICC is the same as for other correlations—0 represents no relationship, while 1 represents a perfect linear relationship. Generally, as outlined by Koo and Li34 an ICC of less than 0.50 is poor, agreement between 0.50 and 0.75 is moderate, 0.75 and 0.90 is good and above 0.90 is considered excellent agreement. Cohen's kappa coefficient (Κ) is used to measure the level of agreement for a categorical outcome and has a similar scale: ≤ 0 as indicating no agreement and 0.01–0.20 as none to slight, 0.21–0.40 as fair, 0.41–0.60 as moderate, 0.61–0.80 as substantial, and 0.81–1.00 as almost perfect agreement.51

a

Same scale administered to both patient and caregiver.

b

Proxies “encouraged to participate [in interview] as they saw fit.”

TABLE 2.

Methodological quality, risk of bias, and quality assessment for the 6 included empiric studies

1. Sample 2. Sampling 3. Sample size 4. Description 5. Data analysis 6. Methods 7. Measures 8. Statistical analysis 9. Response rate
Moreale et al. (2017) + + + + + + + +
Rooney et al. (2013) + + + + + + + +
Milbury et al. (2019) + + + + + + + +
Armstrong et al. (2012) + + + + + + + + +
Brown et al. (2008) + + + + + + + + +
Jacobs et al. (2014) + + + + + + + +

Note: +yes; −no; ?unclear; N/A = not applicable. Contents for this table were guided by the “Critical Appraisal Checklist for Studies Reporting Prevalence Data” from Munn, Z., Moola, S., Lisy, K., Riitano, D., Tufanaru, C. (2015). Methodological guidance for systematic reviews of observational epidemiological studies reporting prevalence and incidence data. International Journal of Evidence Based Healthcare, 13(3)147–153.

2.4 |. Extracted statistics from selected publications

We extracted all reported statistics related to within dyad comparisons; included paired samples t-test, paired signed rank tests, correlation coefficient, intraclass coefficients (ICCs) and Cohen’s Kappa coefficients (K). Specifically, we extracted the numerical level of agreement and associated significance level documented in each selected publication. For assessing the level of agreement between two measurements of a continuous variable (e.g., Pearson correlation and ICCs), the correlation coefficients and ICCs were extracted. In this case, this single correlation coefficient, if statistically significant, suggests that there is a linear relationship between the two measurements (patient vs. caregiver). If the extracted coefficient was not significant, we assigned a label of not related (“NR”). Generally, as outlined by Koo and Li34 an ICC of less than 0.50 is poor, agreement between 0.50 and 0.75 is moderate, 0.75 and 0.90 is good and above 0.90 is considered excellent agreement. Cohen’s kappa coefficient (Κ) is used to measure the level of agreement for a categorical outcome.

3 |. RESULTS

3.1 |. Study characteristics

Electronic database searching yielded 930 unique records and 39 full-text articles were read and examined for inclusion. Six studies met our inclusion criteria (Figure 1). Regarding the search criteria within the six selected studies, only two met the original inclusion criteria, 4 met the adjusted inclusion criteria such that these articles compared broader emotional well-being within a larger battery of quality of life (e.g., MD Anderson Symptom Inventory) and not specifically measures of anxiety and depression. Two35,36 of the six studies were longitudinal, whereas the rest were cross-sectional study designs. Finally, we excluded three papers3739 that were captured within the established search criteria of proxy ratings but only reported on comparisons between patients and caregivers’ ratings of their own symptoms.

3.2 |. Description of study participants

There was a wide range of sample sizes across the selected studies. The sample sizes of these studies ranged from 20 patient-caregiver dyads36 to 118 dyads.35 Across the six selected studies, there was a total of 385 patient-caregiver dyads included.

3.3 |. Clinical characteristics

Characteristics of patients’ diagnoses ranged widely, all reporting on patients with a general diagnosis of glioma and were recruited across a wide range of points along the cancer trajectory (e.g., diagnosis, treatment, post-treatment etc).

3.4 |. Proxy rater definitions

Our search criteria specified that proxy raters would be, “ratings of patients’ well-being from someone outside of the treating medical team;” however, during our search, it became apparent that there was wide heterogeneity by which investigators defined proxy raters. As such, we chose to pull the exact wording each investigator used to define proxy raters for their study (see Table 2). The majority of studies relied on patients to identify the proxy rater who could report their symptoms. Definitions of proxy raters were as simple as cohabitating partner,40 patient-identified caregiver,35 whereas other studies further specified that family caregivers included spouses, family members or adult children36 or could include these roles as well as friends of the patient.41 Other selected studies further defined the roles of family caregivers such as providing care are home,42 or, more specifically, home care for physical, emotional and social needs.43 See Table 3 for exact wording of each study’s definition of a proxy rater.

TABLE 3.

Extracted definition of caregivers and instructions given to proxy raters for the 6 selected studies

Study authors/year N Definition of proxy Instructions for proxy raters
Moreale et al. (2017) 46 provided care at home for physical, emotional and social needs “…collected from the caregivers by asking them to describe their relative’s depression state (e.g. “Do you think that your relative doesn’t feel sad, or feels sad, or feels sad all the time …”), coping strategies (e.g. “Do you think that your relative tries to look at the problem objectively and see all sides?”) and anxiety (e.g. “Do you think that he/she feels nervous and restless?).”
Rooney et al. (2013) 41 Co-habiting partner “…their proxy also completed it ‘‘for the patient’’, based on their observations at home.”
Milbury K. et al. (2019) 20 Family caregiver (eg, spouse, sibling, adult child) “Caregivers completed the MDASI-BT as they perceived patients’ symptom severity and interference.”
Armstrong et al. (2012) 115 Caregiver “who are primarily involved in their care in the home setting (biological, legal, or functional relationship) MD Anderson Symptom Inventory-Brain Tumor (MDASI-BT; completed by caregiver on behalf of patient) Prompt for answering questions adapted from Lobchuk et al.15
Brown et al. (2008) 118 Patient-identified caregiver Unclear whether it was on behalf of the patient
Jacobs et al. (2014) 45 Patient identified nonprofessional caregiver (spouse, family member, or friend) “…caregivers were asked to complete the same questionnaire as they perceived the patient would answer the questions. Caregivers could not fill out the questionnaire for patients.”

3.5 |. Instructions for proxy ratings

There was also a wide range in the reporting of instructions that proxy raters were given in their estimation of their patients’ emotional well-being. One study35 did not specify instructions given to proxy raters, despite the authors forming their study rationale around the importance of proxy raters. Other studies referenced that proxy raters completed measures based on “observation”40 and two studies suggested that proxies were instructed to complete measures based on what they “perceived” the patient was experiencing36,41; however, for one of these studies it was unclear whether similar instructions were provided for the scales extracted in our search methods (e.g., CESD). Only two studies42,43 outlined a detailed script provided to proxy raters. The first asked proxy raters, “Do you think that your relative doesn’t feel sad, or feels sad, or feels sad all the time?” for rating depression and, “Do you think that he/she feels nervous and restless?” for assessment of anxiety.43 The second study42 instructed proxy raters, “In your mind’s eye put yourself in the patient’s shoes. Forget yourself,” borrowed from a palliative care study by Lobchuk.15 See Table 3 for exact wording that participants were provided in providing proxy ratings.

3.6 |. Measure of psychological well-being

Selected studies also used a variety of measures to capture depressive and anxious symptoms from the patient and their respective proxies. Measures included the Beck Depression Inventory and the State-Trait Anxiety Inventory,43 the Personal Health Questionnaire-9 and Structured Clinical Interview for the DSM-IV,40 and the Center for Epidemiological Studies-Depression scale.36 Within the expanded criteria that included studies using quality of life measures, two studies36,37 used the MD Anderson Symptom Inventory-Brain Tumor Module (from which information on the subscales pertaining to emotional well-being were extracted) with another study40 using the Functional Assessment of Cancer Therapy—Brain. Also in the category of selected papers that used measures not specific to depression or anxiety, one study35 used the Profile of Mood States and the Symptom Distress Scale. All selected studies administered the same measures (in full; unique to each study) across patients and proxies, save for one study40 that administered a clinical interview, which was only given to patients but for which proxies were, “encouraged to participate as they saw fit.”1

3.7 |. Quantitative data on level of agreement between patient and proxy ratings

We extracted the numerical level of proxy rater agreement with patient reports across two types of comparisons: group comparisons (t-test, chi-squares etc.) versus comparison of continuous ratings (Pearson correlation coefficient, ICC’s etc). These are described in Table 1. As referenced above in our expansion of search criteria during our search, very few selected papers reported on proxy ratings of both anxiety and depression, but in the one paper that included both,43 ratings of depressive symptoms were statistically equivalent to patients, whereas anxiety ratings were not. Further, within the subscales of a commonly used measure of quality of life (MD Anderson Symptom Inventory-Brain Tumor), distress was one of two subscales that was not equivalent between patients and caregivers.42 While not the focus of the current review, this manuscript did have the interesting finding that when patients’ performance status was considered, caregiver and patient reports were equivalent when performance status was high but, when performance status was low, caregivers reported significantly worse symptoms than patients.42

Perhaps the most comprehensive proxy assessment of depression uncovered in our search administered a self-report survey (PHQ-9) to both patients and proxy raters, in addition to a follow-up diagnostic interview to determine whether patients met criteria for major depressive disorder.40 The authors opted not to compare patients and proxies on their raw scores of the PHQ-9, rather classified patients as meeting the threshold for depression if they scored above the cutoff10 and included endorsement of one of the cardinal symptoms (anhedonia or depression). With these criteria, proxy ratings agreed with patient ratings 36.6% of cases with proxy raters generally reporting more depressive symptoms in patients than patients reported. When compared to the diagnostic interview, proxy ratings were slightly more accurate (Κ= 0.58) than patients’ PHQ-9 reports of their own depressive symptoms (Κ= 0.47). Interestingly, within depressive symptom categories rated by proxies, appetite change, and guilt had the highest ICC’s from proxy ratings (0.69 and 0.66, respectively), whereas poor concentration had the lowest level of agreement (ICC = 0.23). The authors concluded that proxy raters were more accurate than patients’ own report in objective symptoms (sleep change, appetite change, fatigue and psychomotor change) compared to subjective symptoms (low mood, anhedonia, guilt).40 This study did not assess anxiety.

Studies that compared continuous measures of proxy and patient ratings reported moderate levels of agreement between patient and proxy ratings. More specifically, there was moderate agreement across measures of depression,40,43 mood and distress,35 and emotional well-being,41 One of these studies36 did not list specific ICCs associated with each subscale (only a range was provided) within the larger measure of quality of life, thus we were unable to extract which scales corresponded to the range of ICCs reported.

3.8 |. Risk of bias and assessment of quality

Based on established criteria,32 most studies met criteria suggesting low risk of bias and high methodologic quality. For the question, “Was the sample size adequate?” criteria for other dyadic studies,33 suggested that sample sizes under 50 may impact differing levels of agreement and thus only two of the selected studies35,42 met this criteria for adequate sample size. See Table 2.

4 |. DISCUSSION

This study aimed to systematically review studies in neuro-oncology that measured proxy-reports of patients’ well-being; specifically, anxiety and depression. Given that patients with brain cancer often have difficulty with expressive language or other communication challenges, there has been significant attention to whether proxy reports are accurate and reliable; however, very little of this work has focused on proxy ratings of patient mood, such as measures of anxiety or depression. Overall, in our systematic review, few studies targeted this direct question, nor did they consistently report quantitative outcomes across domains of depression and anxiety and, instead, gathered proxy ratings of quality of life more generally. As such, we broadened our search slightly to include studies that compared patient and proxy emotional well-being within a larger measure of quality of life. Even with this broader criterion, comparing quantitative outcomes as we originally intended was limited by the unclear definition of proxy ratings, heterogeneity in measures of depression or anxiety and wide range of statistical test reported. Taken together, the findings were mixed in terms of whether patient and proxy ratings were equivalent, and the level of agreement was also mixed, with the suggestion of a moderate level of agreement for several of the selected studies.35,36,40,41

Overall, the findings from this review suggest that investigators in neuro-oncology should be cautious when considering using proxy raters. While few of the selected studies included measures of both depression and anxiety, in the one study that assessed equivalence, depression was deemed equivalent, whereas anxiety was not.43 One possibility is that symptoms of depression are more observable from an outsider’s perspective than symptoms of anxiety. Upon closer inspection of the selected article by Rooney and colleagues, specific depressive symptoms were compared between proxy raters and patients, demonstrating that objective symptoms (sleep, appetite, fatigue and psychomotor changes) were more accurate than subjective symptoms (mood, anhedonia, guilt).40 These findings are consistent with reviews examining the agreement of proxy ratings in pediatric oncology, in which proxy raters can more accurately rate children’s observable functioning than domains that are nonobservable, like emotional functioning,44 as well as patients outside of oncology with expressive aphasia.45 Given that mood is generally considered less observable than functional domains, this may be one explanation why there was a paucity of studies targeting anxiety and depression in our review.

An additional characteristic of studies that may impact the agreement between patient and proxy ratings, is whether the patient is cognitively impaired. While this was not a focus of the extracted metrics in our review, it was part of the rationale for examining proxy rater agreement in neuro-oncology.11,12 Across the selected studies in our systematic review, the findings were mixed regarding the impact, or measuring, of neurocognitive impairment. While three of the studies in our review36,41,43 did not measure cognitive impairment, one study35 suggested that there was stronger agreement between proxy and patient ratings of mood when the patient was did not meet the study’s criteria for cognitive impairment. This parallels other work which suggests greater agreement in proxy and patient ratings of quality life in the absence of patient cognitive impairment.46 Conversely, two other selected studies in our review did not detect any differences in agreement in symptom ratings when cognitive impairment was identified,39 nor in measures of depression.40 This is consistent with recent work demonstrating similar levels of agreement in patient and proxy measures of quality of life.47 These mixed findings—both in our systematic review and across other comparisons of proxy and patient agreement that consider neurocognitive impairment—highlight the multitude of factors that impact how one can perceive the subjective experience of another individual. Administering a neuropsychological battery (or screener), in addition to employing validated measures of anxiety and depression from patients and their proxies is time consuming and thus not be feasible, yet future work should attempt to account for patients’ cognition given its potential impact on proxy and patient agreement.

One unanticipated result from our review was the challenge of clarifying who the proxy rater was across each study. We specified that proxy raters were not health care professionals, but this broad definition did not account for potential differences across whether proxies were a spouse, a friend, or a parent. Definitions in the selected studies ranged from cohabitating partner,40 patient-identified caregiver,35 spouses, family members or adult children,36 friend41 or someone providing care at home,42 or, caring care for physical, emotional and social needs.43 Of course, these distinct relationships may bias proxy ratings in different ways, particularly for subjective domains. Such heterogeneity in reporting makes studies difficult to reproduce.

An additional finding from our study was the heterogeneity in reporting how instructions were given to proxy raters during each investigation. In general, much of the work on assessing the agreement of proxy ratings comes from quality of life and existing frameworks underscore that the “gap” between patient and proxy reports may be explained, in part, by the perspective from which the proxy was asked to rate the patients’ symptoms. This can include asking proxies to estimate how they think the patient would respond (termed proxy-patient perspective) or ask how they think the patient would respond (termed proxy-proxy perspective),48 Prior work has suggested that providing a prompt for proxy raters to imagine themselves as the patient increases the agreement of proxy ratings.15 This may be particularly salient in emotional domains, like depression and anxiety, which can shift from day to day. Future research may benefit from incorporation of repeated assessments over time, such as ecological momentary assessment in which brief measures are administered repeatedly and which appears feasible in cancer patients.49

4.1 |. Study limitations

There are several limitations from this systematic review worth noting. First, there were very few studies that met the original study criteria and only two additional studies met the expanded criteria that included proxy’s assessment of emotional well-being within a more general measurement of emotional well-being. As noted above, there was significant variability in what quantitative outcomes we were able to extract, further limiting comparisons within our group of selected articles. Thus, our original goal to compare proxy agreement across studies quantitatively may be premature. An additional weakness—also contributing to our inability to make quantitative comparisons among studies - is that we identified additional variability in the definition of a proxy rating in the selected articles. In hindsight, we had only specified that a non-healthcare worker (family, friend etc.) could serve as a proxy rater and this likely was not specific enough of a definition. Finally, there was a wide range of sample sizes in our selected studies (N’s from 20 to 118) and other reviews of proxy ratings of health related quality of life highlight that the agreement between patients and proxies is much lower in sample sizes less than 50.33 This remains a significant limitation of our review, in which the majority of our selected studies (4 out of 6 of the studies) fell below this sample size cutoff.

4.2 |. Clinical implications

The results from this systematic review presented herein have many clinical implications. In a clinical setting, we will often turn to family members for collateral information by proxy reports about how patients are doing, sometimes delving into emotional domains such as anxiety and depression. As we’ve outlined, this is more likely in neuro-oncology in particular, where patients may have unique challenges to completing self-report measures. Our results suggest that while these are not always accurate, there is likely more agreement in domains of depression than anxiety. In our search, it also became apparent that there is a much more robust literature base on proxy measures of quality of life, which is not always as clinically actionable as anxiety or depression. If a clinician plans to use proxy ratings, they also should pay special attention to how they instruct the proxy to rate the patients’ well-being (e.g., one protocol suggests, “…put yourself in the patient’s shoes.” [15]). Further consideration of the relationship of the proxy rater to the patient may also be particularly important, as we found significant variability in how proxies were defined or the level of detail in reporting who the proxy was for each study. For patients, they may easily hand off distress screening or other assessments of emotional domains to their caregivers as proxy raters, but our results are a reminder that the most accurate data is from patients themselves. Future work should investigate how different proxy raters provide accurate or inaccurate assessments of mood, how proxy instructions provided and, eventually, develop and validate measurement tools specific to proxy raters in neuro-oncology.

The systematic review described herein uncovered important details about proxy ratings for depressive and anxious symptoms in neuro-oncology, suggesting that findings are mixed regarding the agreement of proxy ratings. Comparisons between studies proved difficult, given differences in measures used, clarity around instructions given to proxy raters and the statistics reported. Future studies should consult the extant literature to identify clinically relevant measures of anxiety or depression (the GAD-7 or PHQ-9, for example) or, alternatively, choose scales that have been promoted to increase consistency in research studies.50 Accurately identifying how and when proxy ratings are appropriate will greatly contribute to our understanding of the emotional impact of brain cancer.

Supplementary Material

Supplementary Appendix

Footnotes

1

Note asterisks in Table 1 designating that each measure was completed by both patient and their respective proxy.

SUPPORTING INFORMATION

Additional supporting information can be found online in the Supporting Information section at the end of this article.

CONFLICT OF INTEREST

IMB reports participates in research that is in small part funded through a structured research agreement between the Brigham & Women’s Hospital & Cannex Scientific. She also receives an honorarium from Elimu Informatics.

DATA AVAILABILITY STATEMENT

The data extracted during the course of this review is available upon reasonable request.

REFERENCES

  • 1.Mitchell AJ, Chan M, Bhatti H, et al. Prevalence of depression, anxiety, and adjustment disorder in oncological, haematological, and palliative-care settings: a meta-analysis of 94 interview-based studies. Lancet Oncol. 2011;12(2):160–174. 10.1016/s1470-2045(11)70002-x [DOI] [PubMed] [Google Scholar]
  • 2.Caruso R, Nanni M, Riba M, et al. Depressive spectrum disorders in cancer: prevalence, risk factors and screening for depression: a critical review. Acta Oncol 2017;56(2):146–155. 10.1080/0284186x.2016.1266090 [DOI] [PubMed] [Google Scholar]
  • 3.Hashemi S-M, Rafiemanesh H, Aghamohammadi T, et al. Prevalence of anxiety among breast cancer patients: a systematic review and meta-analysis. Breast Cancer. 2020;27(2):1–13. 10.1007/s12282-019-01031-9 [DOI] [PubMed] [Google Scholar]
  • 4.Linden W, Vodermaier A, MacKenzie R, Greig D. Anxiety and depression after cancer diagnosis: prevalence rates by cancer type, gender, and age. J Affect Disord. 2012;141(2-3):343–351. 10.1016/j.jad.2012.03.025 [DOI] [PubMed] [Google Scholar]
  • 5.Wang Y-H, Li J-Q, Shi J-F, et al. Depression and anxiety in relation to cancer incidence and mortality: a systematic review and meta-analysis of cohort studies. Mol Psychiatr. 2020;25(7):1487–1499. 10.1038/s41380-019-0595-x [DOI] [PubMed] [Google Scholar]
  • 6.Ehlers SL, Davis K, Bluethmann SM, et al. Screening for psychosocial distress among patients with cancer: implications for clinical practice, healthcare policy, and dissemination to enhance cancer survivorship. Transl Behav Med. 2019;9(2):282–291. 10.1093/tbm/iby123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Osborn RL, Demoncada AC, Feuerstein M. Psychosocial interventions for depression, anxiety, and quality of life in cancer survivors: meta-analyses. Int J Psychiatry Med. 2006;36(1):13–34. 10.2190/eufn-rv1k-y3tr-fk0l [DOI] [PubMed] [Google Scholar]
  • 8.Sanson-Fisher R, Girgis A, Boyes A, et al. The unmet supportive care needs of patients with cancer. Cancer. 2000;88(1):226–237. [DOI] [PubMed] [Google Scholar]
  • 9.Kissane DW. Unrecognised and untreated depression in cancer care. Lancet Psychiatry. 2014;1(5):320–321. 10.1016/s2215-0366(14)70345-1 [DOI] [PubMed] [Google Scholar]
  • 10.Passik SD, Dugan W, McDonald MV, Rosenfeld B, Theobald DE, Edgerton S. Oncologists’ recognition of depression in their patients with cancer. J Clin Oncol. 1998;16(4):1594–1600. 10.1200/jco.1998.16.4.1594 [DOI] [PubMed] [Google Scholar]
  • 11.Armstrong CL, Goldstein B, Cohen B, Jo M-Y, Tallent EM. Clinical predictors of depression in patients with low-grade brain tumors: consideration of a neurologic versus a psychogenic model. J Clin Psychol Med Settings. 2002;9(2):97–107. 10.1023/a:1014987925718 [DOI] [Google Scholar]
  • 12.Weitzner MA. Psychosocial and neuropsychiatric aspects of patients with primary brain tumors. Cancer Invest. 1999;17(4):285–291. 10.3109/07357909909040599 [DOI] [PubMed] [Google Scholar]
  • 13.Renovanz M, Hechtner M, Kohlmann K, et al. Compliance with patient-reported outcome assessment in glioma patients: predictors for drop out. Neuro-Oncol Pract. 2018;5(2):129–138. 10.1093/nop/npx026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dirven L, Armstrong TS, Blakeley JO, et al. Working plan for the use of patient-reported outcome measures in adults with brain tumours: a Response Assessment in Neuro-Oncology (RANO) initiative. Lancet Oncol. 2018;19(3):e173–e80. 10.1016/s1470-2045(18)30004-4 [DOI] [PubMed] [Google Scholar]
  • 15.Lobchuk MM, McClement SE, Daeninck PJ, Shay C, Elands H. Asking the right question of informal caregivers about patient symptom experiences: multiple proxy perspectives and reducing interrater gap. J Pain Symptom Manag. 2007;33(2):130–145. 10.1016/j.jpainsymman.2006.07.015 [DOI] [PubMed] [Google Scholar]
  • 16.Dickinson HO, Hrisos S, Eccles MP, Francis J, Johnston M. Statistical considerations in a systematic review of proxy measures of clinical behaviour. Implement Sci. 2010;5(1):1–8. 10.1186/1748-5908-5-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Blazeby J, Williams M, Alderson D, Farndon J. Observer variation in assessment of quality of life in patients with oesophageal cancer. Br J Surg. 1995;82(9):1200–1203. 10.1002/bjs.1800820916 [DOI] [PubMed] [Google Scholar]
  • 18.Slevin ML, Plant H, Lynch D, Drinkwater J, Gregory W. Who should measure quality of life, the doctor or the patient? Br J Cancer. 1988;57(1):109–112. 10.1038/bjc.1988.20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sigurdardottir V, Brandberg Y, Sullivan M. Criterion-based validation of the EORTC QLQ-C36 in advanced melanoma: the CIPS questionnaire and proxy raters. Qual Life Res. 1996;5(3):375–386. 10.1007/bf00433922 [DOI] [PubMed] [Google Scholar]
  • 20.Griffiths AW, Smith SJ, Martin A, Meads D, Kelley R, Surr CA. Exploring self-report and proxy-report quality-of-life measures for people living with dementia in care homes. Qual Life Res. 2020;29(2): 463–472. 10.1007/s11136-019-02333-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hermont AP, Scarpelli AC, Paiva SM, Auad SM, Pordeus IA. Anxiety and worry when coping with cancer treatment: agreement between patient and proxy responses. Qual Life Res. 2015;24(6):1389–1396. 10.1007/s11136-014-0869-3 [DOI] [PubMed] [Google Scholar]
  • 22.Sneeuw KC, Aaronson NK, Osoba D, et al. The use of significant others as proxy raters of the quality of life of patients with brain cancer. Med Care. 1997;35(5):490–506. 10.1097/00005650-199705000-00006 [DOI] [PubMed] [Google Scholar]
  • 23.Lv E. Proxy ratings of patient quality of life factors related to patient–proxy agreement. Acta Oncol. 2004;43(3):229–234. 10.1080/02841860410029357 [DOI] [PubMed] [Google Scholar]
  • 24.Ford E, Catt S, Chalmers A, Fallowfield L. Systematic review of supportive care needs in patients with primary malignant brain tumors. Neuro-Oncol. 2012;14(4):392–404. 10.1093/neuonc/nor229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Arnold SD, Forman LM, Brigidi BD, et al. Evaluation and characterization of generalized anxiety and depression in patients with primary brain tumors. Neuro-Oncol. 2008;10(2):171–181. 10.1215/15228517-2007-057 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Moher D, Liberati A, Tetzlaff J, Altman DG, Group* P. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Intern Med. 2009;151(4):264–269. 10.7326/0003-4819-151-4-200908180-00135 [DOI] [PubMed] [Google Scholar]
  • 27.Shi C, Lamba N, Zheng LJ, et al. Depression and survival of glioma patients: a systematic review and meta-analysis. Clin Neurol Neurosurg. 2018;172:8–19. 10.1016/j.clineuro.2018.06.016 [DOI] [PubMed] [Google Scholar]
  • 28.Covidence Systematic Review Software. (2022). Veritas Health Innovation. http://www.covidence.org [Google Scholar]
  • 29.Mystakidou K, Tsilika E, Parpa E, Katsouda E, Galanos A, Vlahos L. Assessment of anxiety and depression in advanced cancer patients and their relationship with quality of life. Qual Life Res. 2005; 14(8):1825–1833. 10.1007/s11136-005-4324-3 [DOI] [PubMed] [Google Scholar]
  • 30.Smith E, Gomm S, Dickens C. Assessing the independent contribution to quality of life from anxiety and depression in patients with advanced cancer. Palliat Med. 2003;17(6):509–513. 10.1191/0269216303pm781oa [DOI] [PubMed] [Google Scholar]
  • 31.Grotmol KS, Lie HC, Hjermstad MJ, et al. Depression—a major contributor to poor quality of life in patients with advanced cancer. J Pain Symptom Manag. 2017;54(6):889–897. 10.1016/j.jpainsymman.2017.04.010 [DOI] [PubMed] [Google Scholar]
  • 32.Munn Z, Moola S, Lisy K, Riitano D, Tufanaru C. Methodological guidance for systematic reviews of observational epidemiological studies reporting prevalence and cumulative incidence data. Int J Evid Base Healthc. 2015;13(3):147–153. 10.1097/xeb.0000000000000054 [DOI] [PubMed] [Google Scholar]
  • 33.Sneeuw KC, Sprangers MA, Aaronson NK. The role of health care providers and significant others in evaluating the quality of life of patients with chronic disease. J Clin Epidemiol. 2002;55(11): 1130–1143. 10.1016/s0895-4356(02)00479-1 [DOI] [PubMed] [Google Scholar]
  • 34.Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016; 15(2):155–163. 10.1016/j.jcm.2016.02.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Brown PD, Decker PA, Rummans TA, et al. A prospective study of quality of life in adults with newly diagnosed high-grade gliomas: comparison of patient and caregiver ratings of quality of life. Am J Clin Oncol. 2008;31(2):163–168. 10.1097/coc.0b013e318149f1d3 [DOI] [PubMed] [Google Scholar]
  • 36.Milbury K, Li J, Weathers S-P, et al. Pilot randomized, controlled trial of a dyadic yoga program for glioma patients undergoing radiotherapy and their family caregivers. Neuro-Oncol Pract. 2019;6(4): 311–320. 10.1093/nop/npy052 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Petruzzi A, Finocchiaro CY, Lamperti E, Salmaggi A. Living with a brain tumor. Support Care Cancer. 2013;21(4):1105–1111. 10.1007/s00520-012-1632-3 [DOI] [PubMed] [Google Scholar]
  • 38.Janda M, Steginga S, Langbecker D, Dunn J, Walker D, Eakin E. Quality of life among patients with a brain tumor and their carers. J Psychosom Res. 2007;63(6):617–623. 10.1016/j.jpsychores.2007.06.018 [DOI] [PubMed] [Google Scholar]
  • 39.Goebel S, von Harscher M, Mehdorn HM. Comorbid mental disorders and psychosocial distress in patients with brain tumours and their spouses in the early treatment phase. Support Care Cancer. 2011; 19(11):1797–1805. 10.1007/s00520-010-1021-8 [DOI] [PubMed] [Google Scholar]
  • 40.Rooney AG, McNamara S, Mackinnon M, et al. Screening for major depressive disorder in adults with glioma using the PHQ-9: a comparison of patient versus proxy reports. J Neuro-Oncol. 2013;113(1): 49–55. 10.1007/s11060-013-1088-4 [DOI] [PubMed] [Google Scholar]
  • 41.Jacobs DI, Kumthekar P, Stell BV, et al. Concordance of patient and caregiver reports in evaluating quality of life in patients with malignant gliomas and an assessment of caregiver burden. Neuro-Oncol Pract. 2014;1(2):47–54. 10.1093/nop/npu004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Armstrong TS, Wefel JS, Gning I, et al. Congruence of primary brain tumor patient and caregiver symptom report. Cancer. 2012;118(20): 5026–5037. 10.1002/cncr.27483 [DOI] [PubMed] [Google Scholar]
  • 43.Moreale R, Campanella F, Marin F, Skrap M, Palese A. Emotional concerns and coping strategies in low grade glioma patients and reliability of their caregivers in reporting these concerns: findings from a cross-sectional study. Eur J Oncol Nurs. 2017;30:113–119. 10.1016/j.ejon.2017.08.010 [DOI] [PubMed] [Google Scholar]
  • 44.Eiser C, Morse R. Can parents rate their child’s health-related quality of life? Results of a systematic review. Qual Life Res. 2001;10(4):347–357. 10.1023/a:1012253723272 [DOI] [PubMed] [Google Scholar]
  • 45.Cruice M, Worrall L, Hickson L, Murison R. Measuring quality of life: comparing family members’ and friends’ ratings with those of their aphasic partners. Aphasiology. 2005;19(2):111–129. 10.1080/02687030444000651 [DOI] [Google Scholar]
  • 46.Ediebah DE, Reijneveld JC, Taphoorn MJ, et al. Impact of neurocognitive deficits on patient–proxy agreement regarding health-related quality of life in low-grade glioma patients. Qual Life Res. 2017;26(4):869–880. 10.1007/s11136-016-1426-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Caramanna I, Klein M, van den Bent M, et al. Neurocognitive impairment and patient-proxy agreement on health-related quality of life evaluations in recurrent high-grade glioma patients. Qual Life Res Int J Qual Life Aspects Treat Care Rehabilitation. 2022;31(11): 3253–3266. 10.1007/s11136-022-03197-w [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 48.Pickard AS, Knight SJ. Proxy evaluation of health-related quality of life: a conceptual framework for understanding multiple proxy perspectives. Med Care. 2005;43(5):493–499. 10.1097/01.mlr.0000160419.27642.a8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kampshoff CS, de Verdonck-Leeuw IM, van Oijen MG, Sprangers MA, Buffart LM. Ecological momentary assessments among patients with cancer: a scoping review. Eur J Cancer Care. 2019;28(3):e13095. 10.1111/ecc.13095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wagner LI, Schink J, Bass M, et al. Bringing PROMIS to practice: brief and precise symptom screening in ambulatory cancer care. Cancer. 2015;121(6):927–934. 10.1002/cncr.29104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. 10.2307/2529310 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Appendix

Data Availability Statement

The data extracted during the course of this review is available upon reasonable request.

RESOURCES