Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Nov 1.
Published in final edited form as: J Affect Disord. 2021 Jun 25;294:1–9. doi: 10.1016/j.jad.2021.06.049

Self-Report Depression Screening Measures for Older Hispanic/Latin American Adults: A PRISMA Systematic Review

Andrea Ochoa Lopez 1, Michelle N Martinez 1, Joshua Garcia 1, Mark E Kunik 2, Luis D Medina 1,CA
PMCID: PMC8410643  NIHMSID: NIHMS1718663  PMID: 34252863

Abstract

Background:

Assessing depression symptoms in Hispanic/Latin American (H/Ls) older adults, a group at high risk for depression, is nuanced due to the influence of cultural characteristics in symptom expression and manifestation. Little is known about the psychometric properties of available measures when used with this population.

Methods:

We conducted a two-stage systematic review of available depression assessment tools. We first identified self-report measures designed for use with adults. We then identified studies where at least one of such measures was used in older H/Ls that reported psychometric properties for the measure(s) used.

Results:

Only 3 measures were identified for use with older H/Ls: the BDI, GDS, and CES-D. However, few data were found to support the validity of the BDI, and the CES-D was not consistently valid across cultural groups. The GDS was found appropriate, though its performance varied based on race/ethnicity, nationality, and cutoff scores. The CES-D and GDS also demonstrated varying psychometric properties based on study setting (research versus clinical) and target population (inpatient psychiatric patients versus community-dwelling individuals).

Limitations:

The number of articles that met criteria for inclusion in our review was small, and there was variation among samples of the few studies included.

Conclusions:

Currently available self-report depression screening measures have acceptable applicability among older H/Ls, but their utility may vary based on their intended use. Modified cutoff scores may be beneficial in maximizing the utility of these measures when given to diverse older adults.

Keywords: depression, self-report, Hispanic Americans, outcome assessment, aging, psychometrics

Depression Assessment in Older Hispanic/Latin American Adults: A Systematic Review

The number of individuals that identify as members of a culturally and linguistically diverse (CALD) group has increased from 6.9 million in 2006 to 11.1 million in 2016 and is expected to increase notably by 2030. People that identify as Hispanic or Latin American (H/L) represent the most prevalent CALD group across age groups, including older adults. This population comprised 23% of those age 65 and above in 2016, and by 2030, is projected to comprise 28% of this age group (Administration for Community Living, 2019). Given the expected increase in the older adult population, it is crucial to ensure that the resources to promote the health of these individuals are sufficient and appropriate. Physical and mental health screenings are a fundamental aspect of providing adequate health services, including conditions and disorders for which older individuals are at high risk.

Depression, though less prevalent in older adults than in younger populations, has severe consequences on the well-being of the former. Suicide rates are higher for older than younger individuals and commonly associated with depressive symptoms, which are present in about 15% of community-dwelling older adults (Blazer, 2003). Depression is also associated with symptoms of other cognitive, somatic, and affective disorders in this population, including vascular risk factors and dementia (Hickie et al., 2001; Sweitzer et al., 2002). There is also a high risk for depression for H/L people, particularly among bicultural, immigrant, and less-acculturated individuals (Gerst, 2010; González et al., 2001). Screening for depression is an important prevention strategy to reduce the risk of adverse outcomes related to this disorder in older adults, as it may help detect suicide risk and implement effective treatments that reduce suicidal thoughts (Bruce et al., 2004).

Assessing depression symptoms in the context of aging presents a unique set of challenges, which have been scarcely addressed in research and may bias the clinical evaluation of older adults. Noted issues include limited knowledge of symptom pattern, the inadequacy of tools developed for younger cohorts, the misuse of assessment tools, and deficits in additional assessment (Gonçalves et al., 2009). Depression in older adults is often comorbid with other medical conditions in a complex interplay; this may not always be accounted for in assessment due to factors such as clinical judgement or measurement design. In addition, social and cultural paradigms influence the way in which older adults understand and report their own symptoms. Even with appropriate considerations, the assessment measures used to evaluate older adults should be selected carefully. Although measures like the Geriatric Depression Scale (GDS; Yesavage et al., 1982) are available for this purpose, the studies that have validated them have commonly been conducted with majority non-Hispanic White samples, thus limiting the generalizability and applicability of their findings. A recent systematic review by Balsamo et al. (2018) evaluated the psychometric properties of self-report depression measures among older adults, finding the GDS, the Beck Depression Inventory-II (BDI-II; Beck et al., 1996), and the Center for Epidemiological Studies Depression Scale (CES-D; Radloff, 1977) to be favorable for this purpose; however, little to no demographic data are reported, weakening the generalizability of findings to underrepresented racial and ethnic groups (O’Bryant et al., 2004; Medina et al., 2020).

The assessment of depression symptoms can also represent a challenge in the context of diverse populations. Research has shown that depression is commonly under-recognized among H/Ls in various settings, a tendency partly related to cultural differences in the manifestation and reporting of depressive symptoms listed in currently available depression screening tools (Crockett et al., 2005; MacIntosh & Strickland, 2010). Additional influences include health literacy barriers, somatic presentations, and the use of cultural idioms of distress in depression screening (Lewis-Fernandez et al., 2005). Some studies have examined the psychometric properties of depression measures across multiple ethnic groups, with some finding that specific items (like those related to sleep and appetite disturbances) performed poorly in the detection of depression in these samples (Teresi et al., 2008). Despite this knowledge, little has been done to evaluate the utility of available measures to assess depression symptoms in older H/L adults.

Identifying appropriate assessment tools for older H/L adults is a need yet to be met in research and clinical contexts. The measures that have been validated for use with older adults are not necessarily valid for use with H/Ls, and those that have been validated in H/L populations may be influenced by aging factors when used with older adults. There is a need for critical review of the literature to determine which currently available tests are appropriate for measuring symptoms of depression in this population. Self-report screening measures, in particular, are relevant to evaluate as they represent a crucial aspect of depression assessment (Uher et al., 2012). Addressing this gap may allow for the provision of general recommendations and clinical considerations for use of these tools in the assessment of older H/L adults. Similar work by Wu and Skemp Kelley (2007) provided valuable information for the assessment of depressive symptoms in Chinese older adults, describing variability in the psychometric properties and cutoff scores of available instruments and offering strategies for efficient evaluation of that population, including specific recommended measures. However, to our knowledge, no similar study exists in the context of H/L older adult populations.

The Present Study

We sought to systematically review the literature on depression assessment tools in older Hispanics/Latin Americans. Our search was focused on self-report measures given their widespread use and utility in to assess depressive symptoms in clinical settings (Valente & Saunders, 2005). After identifying commonly used depression measures designed for use with general adult populations, we searched for studies that reported data about the psychometric properties (e.g., reliability, validity) of these materials, using samples that included older H/Ls. Thus, our study had three main aims. We sought to (1) identify depression screening tools available in English and/or Spanish that may be considered appropriate for use with Hispanic/Latin American older adults; (2) examine the psychometric properties of depression screening tools available in English and/or Spanish among older Hispanic/Latin American adults; and (3) provide general guidelines for consideration in the assessment of depression in older H/L adults using screening measures.

Method

We conducted a two-stage systematic review of the available literature following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al., 2009). Stage 1 consisted of a review of three databases focused on behavioral measurement instruments in health and psychosocial science. The purpose of this search was to compile a list of self-report depression measures available for use with adult populations. Stage 2 consisted of an extended review of three additional databases focused on the reporting of findings related to these identified measures in the psychological and health science literature. The purpose of this search was to identify journal articles in which psychometric data for the self-report depression measures found during Stage 1.

Inclusion/Exclusion Criteria

In Stage 1, we included records for measures that (1) were in English or Spanish, (2) were exclusively self-report, (3) were designed for use with adults, and (4) were designed to measure depressive symptoms. Records for tests that measured constructs similar or related to depression, but not depressive symptoms (e.g., negative affect, worry, rumination), were excluded, as were tests that measured any other clinical disorders in addition to depression (i.e., screening questionnaires for identifying a variety of disorders). Measures that were compilations of two or more existent tests were included if all items in one such measure were designed to measure depressive symptoms. There was no limit in publication date.

In Stage 2, we initially screened studies that (1) were in English or Spanish, (2) were conducted in the United States, Spain, or Latin American countries (3) included Hispanics/Latin Americans (H/Ls), (4) sampled older adult participants, (5) included at least one of the depression screening tools identified in Stage 1, and (6) provided data on the psychometric properties of the depression screening tool used. After conducting an initial search and evaluating results based on these criteria, we identified studies that qualified for secondary screening. We screened these articles and applied an additional set of criteria to detect those with samples that were representative of the H/L older adult population; thus, we selected studies in which (1) at least one fifth of participants were H/L and (2) all participants were adults of age 55 or older. The required minimum of H/L participants in studies was determined as a standard of representativeness, with the consideration that H/Ls comprise over 18% of the general population and over 20% of the older adult population in the United States (United States Census, 2016). As in Stage 1, there was no limit in publication date. We excluded unpublished manuscripts (e.g., theses/dissertations), editorial and opinion pieces, systematic reviews or meta-analyses, books, book chapters, and articles from journals that were not peer-reviewed.

Information Sources

We used three academic databases in Stage 1 selected for their focus on behavioral measurement instruments in psychological and health sciences: Health and Psychological Instruments (HAPI), APA PsycTests, and Mental Measurements Yearbook with Tests in Print (MMYTP). We used three additional databases in Stage 2: PsycInfo, PubMed, and MedicLatina. These databases were selected given their offering of scholarly articles from peer-reviewed journals, and the availability of materials covering a wide range of subjects in psychology, including psychometrics.

Search

The process in all three Stage 1 databases included four criteria sets: (1) measure subject (i.e., depression-related keywords), (2) test modality (i.e. “self-report” or “self report”), (3) language (“English” or “Spanish”), and (4) age. We identified appropriate depression-related keywords for each database using their respective lists of index terms. Given our interest in identifying measures designed for use with general adult populations, our age criteria did not specifically request results involving older adults; instead, it discarded all results involving children and/or adolescent samples. The search excluded records containing the terms “children,” “adolescents,” “youth,” “child,” “teenager,” “pediatric,” and “kids.”

The Stage 2 process in all three databases included four criteria sets: (1) specific depression measure identified in Stage 1 (e.g., “Beck Depression Inventory, or BDI”), (2) age, (3) language/ethnicity, and (4) type of data reported. Table 1 displays the terms used for search criteria 2 through 4; these terms were consistently used in combination with each depression measure identified in Stage 2.

Table 1.

Index terms used for Stage 2 Search

Age Language/Ethnicity Required data

older adults or elderly or geriatric or geriatrics or aging or senior or seniors or older people or age 65 or 65+ AND Hispanics or latinos or latinas or latinx or chicano or mexican american or spanish speakers AND psychometrics or validity or reliability or psychometric properties or sensitivity or specificity

Data Collection Process

For each record identified in Stage 1, we documented the database of origin, assessment measure, citation for the measure, language, and reason for discarding if it did not meet inclusion criteria. Records obtained through the Stage 2 database search were similarly recorded to document the assessment measure, title and citation for the article, and a description of the sample used; these records were then reviewed to assess their initial eligibility. Studies that met inclusion criteria at this point were reviewed further to determine their final eligibility. Studies that met inclusion criteria were reviewed for available psychometric data and diagnostic cutoff scores used by authors to diagnose depression with the measures. Records were originally obtained by the lead author (AOL) and then verified in collaboration with the second author (MNM).

Data Items

The following sample characteristics were collected when available from qualifying studies: number of subjects, clinical or nonclinical categorization, source of recruitment, gender proportion, age range, racial/ethnic group distribution, socioeconomic status, and language(s) spoken. We also recorded the following psychometric data when available: reliability and validity statistics, sensitivity and specificity, positive and negative predictive value, and other psychometric qualities (e.g., results of factor structure analyses, measurement bias).

Risk of Bias in Individual Studies/Across Studies

Outcome reporting bias within eligible studies was reported qualitatively. Table 2 displays study characteristics that may implicate risk for bias in eligible studies, as suggested by PRISMA guidelines (Moher et al., 2009). We did not generate mathematical estimates of risk for bias across study as we did not access study data beyond that which was reported in the reviewed manuscripts.

Table 2.

Reported sources of bias within studies

Article Race/ethnicity differences Sex differences Age differences Diagnosis differences Language differences Location differences

Baker & Espino 1997 No
Baker et al., 1993
Baker et al., 1994 No
Carrete et al., 2001 Yes
Espino et al., 1996 No No No
Gatewood-Colwell et al., 1989 No No
Gonzalez et al., 1995 No
Hahn et al., 2011 Yes Yes
MacIntosh & Strickland, 2010 Yes
Mahard, 1988
Robinson et al., 2002 Yes Yes

Note: “Yes” means the study reported significant differences between groups based on a given factor. “No” means the study reported NO significant differences between groups based on a given factor. Empty cells signify a lack of statement in the study about significant differences based on a given factor.

Results

Study Selection

The Stage 1 search yielded a total of 207 results, of which 53 self-report depression measures were identified (Figure 1). This final list of measures (see Table S1, Supplementary Materials) was examined further in Stage 2.

Figure 1.

Figure 1.

PRISMA diagram for Stage 1 Literature Search

The Stage 2 search yielded a total of 383 records, of which 36 articles were eligible for further examination (Figure 2). Only 11 of these records, which collectively employed 3 different measures (Table 3), included H/L older adult samples and were reviewed further for psychometric data.

Figure 2.

Figure 2.

PRISMA diagram for Stage 2 Literature Search

Table 3.

Characteristics of depression screening tools used across included studies

Measure No. of items Score range Type of questions Time to complete Cut-off score Score meaning

BDI 21 0–63 0 to 3 points 5 to 10 minutes 10 0 to 9 = no depression, 10 to 15 = mild depression, 16 to 19 = mild to moderate depression, 20 to 29 = moderate to severe depression, 30 to 63 = severe depression
CES-D 20 0–60 0 to 3 points 10 to 15 minutes 16 Higher scores are indicative of higher depression
GDS-15 15 0–30 Yes/No 5 to 10 minutes 11 0 to 4 = no depressive symptoms, 5–8 = mildly depressed, 12–15 = severely depressed
GDS-30 30 0–30 Yes/No 10 to 15 minutes 11 0 to 9 = no depressive symptoms, 10 to 19 = mildly depressed, 20–30 = severely depressed

Note. BDI = Beck Depression Inventory, original version; CES-D = Center for Epidemiological Studies Depression Scale; GDS-15 = Geriatric Depression Scale, 15-item version; GDS-30 = Geriatric Depression Scale, 30-item version.

Study Characteristics

Table 4 summarizes the final list of studies included, as well as relevant descriptive characteristics and psychometric properties of depression measures.

Table 4.

Psychometric properties of depression screening measures reported in studies representative of Hispanic/Latin American older adults

Journal Article Measure and Setting (Clinical/Research) Cutoff score Psychometrics Descriptive demographics

Baker & Espino, 1997 GDS-15, Clinical setting 0–5 = normal; 6–10 = mildly suggestive of depression; 11–15 = strongly suggestive of depression. Also used a revised cutoff of 4 or above. Cronbach’s alpha: 0.94.
Sensitivity: Using original cut-scores, 39% in patients with MDD and 77% in patients with ODD. Using revised cut-score, 75% for patients with MDD and 85% for patients with ODD.
Specificity: Not determined due to lack of true negatives in sample.
41 Mexican American psychiatric patients age 60–99 with a diagnosis of depressive disorders, 43.9% women.
Baker et al., 1993 GDS-15, Research Setting 6 Sensitivity: 35% for African Americans and 49% for Mexican Americans. 55 African American community residents (mean age 77 years) and 41 Mexican American people (age 65+ years) with a diagnosis of affective illness.
Baker et al., 1994 GDS-15, Clinical setting 6 and 4 Sensitivity: 49% using the original cutoff score of 6 or more. With a revised cutoff of 4, sensitivity improved to 75% for the combined sample of Black and Mexican American participants. 55 Black community-dwelling and 57 Mexican American psychiatric patients. The Mexican American sample, age 62–98, was recruited from the psychiatric unit; hospitalized and depressed, 44% women.
Carrete et al., 2001 GDS-30, face-to-face version (GDS-P) and phone version (GDS-T), Research setting 10 and 11 Cronbach’s alpha: 0.85 for GDS-P and 0.88 for GDS-T.
Sensitivity: 88% for GDS-P and 84% for GDS-T.
Specificity: 82% for GDS-P and 79% for GDS-T.
Random sample of 282 Hispanic elderly individuals, mean age 71.7 years, 56% women, 50% had college or higher education.
Espino et al., 1996 GDS-30, Clinical setting 11 Sensitivity: 0.80
Specificity: 0.61
66 Mexican American participants age 65 or older, 48% women; recruited from private or public community healthcare clinics.
Gatewood-Colwell et al., 1989 BDI, original version, Research setting 16 Cronbach’s alpha: 0.80
Pearson correlation with GDS: 0.79.
51 participants age 60–80 recruited from a community senior service agency. 15 white women, 11 White men, 13 bilingual Mexican American women, & 12 Mexican American men.
Gonzalez et al., 1995 CES-D, Original and Modified versions, Clinical setting 16 Cronbach’s alpha: Original CES-D: 0.90 for the Scaling study, 0.89 for the Replication study, and 0.93 for the Test-Retest study. Modified CES-D: 0.92, 0.91, and 0.92, respectively.
Items that did not meet convergent validity criteria: Item 4, “I felt I was just as good as other people;” correlation of −0.11 with the scale. Item 8, “I felt hopeful about the future;” correlation of 0.35 with the scale.
472 older Hispanic/Latin American adult patients from San Francisco, Los Angeles, San Antonio, Miami, New York City, and Caracas, Venezuela. All recruited from rheumatology clinics. Predominantly women, with a mean 8 years of education.
Hahn et al., 2011 CES-D, Research setting 16 Cronbach’s alpha: 0.73 for baseline data and 0.79 for the second wave of participants. Mexican American, community-dwelling adults age 65 and over, new to caregiving roles, recruited from Texas, New Mexico, Colorado, California, or Arizona. 59.4% female, mean years of education 4.87. Interviewed in English or Spanish.
MacIntosh & Strickland, 2010 CES-D, Research setting 16 Item bias for overall sample was found for one positive affect question in the 10-item subset.

Among the Hispanic sub-population, there was item response bias for the positive affect items across time.
4499 adults age 70–85, 58% Hispanic and 42% non-Hispanic White, 56.7% female
Mahard, 1988 CES-D, Clinical and Research settings 16 Cronbach’s alpha: 0.87 in both patient and nonpatient samples. 60 Puerto Ricans age 55–82 living in New York City in the Summer of 1984. Half diagnosed clinically as depressed and receiving treatment in a community mental health clinic, other half community-dwelling adults. 80% female, predominantly low education (median of 4 years) and low income.
Robinson et al., 2002 CES-D, 20-item and 10-item versions, Clinical setting Original cut points (21 for 20-item and 4 for 10-item) compared to recommended cut points (20 for 20-item and 3 for 10-item). Sensitivity: 73% for 20-item version with original cut point; 81% with recommended cut point. 76% for 10-item version with original cut point; 84% with recommended cut point.
Specificity: 72% for 20-item version with original cut point and 70% with recommended cut point. 70% for 10-item version with original cut point and 64% with recommended cut point.
303 Puerto Rican adults age 55 and over, recruited from primary care clinics in Northeastern U.S. 73% female, urban, low education, low income, interviewed in English or Spanish.
GDS, Clinical Setting Original cut points (9 for 30-item, 6 for 15-item) compared to recommended cut points (13 for 30-item, 7 for 15-item). Sensitivity: 84% for 30-item version with original cut point and 81% with recommended cut point. 76% for 15-item version with original cut point and 73% with recommended cut point.
Specificity: 53% for 30-item version with original cut point and 65% with recommended cut point. 64% for 15-item version with original cut point and 71% with recommended cut point.

Note. BDI = Beck Depression Inventory; CES-D = Center for Epidemiological Studies Depression Scale; GDS = Geriatric Depression Scale.

Robinson et al. (2002) reported psychometric data for both the CES-D and the GDS.

Risk of Bias Within Studies

Risk for bias was identified within studies included in the present review (Table 2). Of eleven studies, only three reported on the presence or absence of significant differences between groups based on race/ethnicity (Espino et al., 1996; Gatewood-Colwell et al., 1989; MacIntosh & Strickland, 2010); such significant differences were absent in two studies (Espino et al., 1996; Gatewood-Colwell et al., 1989) and present in one (MacIntosh & Strickland, 2010). Significant sex differences were reported in two studies (Hahn et al., 2011; Robinson et al., 2002) and not supported in another two studies (Espino et al., 1996; Gatewood-Colwell et al., 1989). Considering that many samples included in the present review were predominantly composed of women, who are at higher risk for depression than men, this source of bias is to be expected. Significant differences between clinical or diagnostic groups were the most commonly mentioned; they were reported absent in two studies and present in three. It is reasonable to assume that the inclusion of individuals with a previous diagnosis of depression and/or other mental disorders in study samples may have influenced their findings. Lastly, one study (Baker et al., 1994) reported a lack of significant differences based on language of test administration, and one (Gonzalez et al., 1995) noted no differences based on location of recruitment and testing.

Results of Individual Studies

The type of psychometric data reported varied across the eleven studies included. Six provided internal consistency as measured by Cronbach’s alpha, six provided sensitivity and/or specificity data, one provided item-level test bias results as measured by differential item analysis (DIF), and one provided convergent validity data as measured by correlation between depression measures. Additionally, one study (Gonzalez et al., 1995) provided information about two items of the original, 20-item CES-D that did not meet convergent validity (see CES-D below).

Beck Depression Inventory (BDI)

Only one study that included psychometric characteristics of the BDI met criteria to be considered representative of older H/Ls. Gatewood-Connell and colleagues (1989) analyzed a sample of 51 older adults, in which about half were older H/L men and women, to evaluate the validity and reliability of the original BDI (Beck et al., 1961). Results indicated a Cronbach’s alpha of 0.80 for this measure, which is appropriate for research settings but lower than the recommended cutoff for clinical purposes (0.90; Nunnally and Bernstein, 1994). The BDI also had a correlation of 0.79 with the Geriatric Depression Scale (GDS), which was included by the authors as evidence for the convergent validity of the BDI. No data were found for the updated (second) version of the BDI – the BDI-II (Beck et al, 1996).

Geriatric Depression Scale (GDS)

The psychometric characteristics of the GDS differed between study samples; notably, some studies identified in this review evaluated the original, 30-item GDS (Yesavage & Brink, 1983) while others used the 15-item GDS (Sheikh & Yesavage, 1986). Among community-dwelling older H/L adults, a Spanish version of the 30-item GDS translated by Carrete and colleagues (2001) yielded internal consistency values ranging from α =0.85 to 0.88; these properties are considered adequate for research purposes, but potentially insufficient in clinical contexts (Nunnally and Bernstein, 1994). The authors also identified sensitivity values ranging from 84% to 88%. Among psychiatric patients in an inpatient clinical facility, this same measure had high internal consistency (α = 0.94); however, its sensitivity was inadequate in such clinical groups when using the original cutoff score of 5; using a revised score of 4, sensitivity improved significantly (from 39% to 75% for patients with major depression disorder and from 77% to 85% for those with other depressive disorders).

Studies that recruited participants without a history of depression seemed to yield more consistent and adequate psychometric values, even if the participants were recruited from health care clinics. Using a cutoff of 11 on the GDS-30, Espino and colleagues (1996) reported sensitivity of 80% and specificity of 61% among Mexican American participants recruited from such clinics. In a sample of over 300 community-dwelling Puerto Rican participants, sensitivity on the GDS-30 ranged from 81% to 84% and specificity from 53% to 65%, depending on the cutoff score used. In addition, in the latter sample, sensitivity and specificity for the GDS-15 ranged from 73% to 76% and 64% to 71%, respectively, depending on the cutoff score used. Two other studies similarly reported that using recommended cutoffs prevented the GDS-15 from demonstrating acceptable sensitivity values and recommended modified cutoff scores. A lower cutoff score of 4 increased sensitivity from 39% to 75% in patients with major depression (Baker & Espino, 1997). Baker and colleagues (1994) reported similar findings using a cutoff score of 4 compared to a sensitivity of 49% when using a cutoff of 6 in a sample of community-dwelling Mexican Americans with a diagnosis of affective illness.

Center for Epidemiological Studies Depression Scale (CES-D)

The CES-D performed appropriately among older, H/L patients recruited from medical clinics, with reported internal consistencies ranging from α=.89 to 0.93. However, Gonzalez and colleagues (1995) identified two items of the original CES-D that did not support convergent validity. In their analyses, item 4 (“I felt I was just as good as other people”) and item 8 (“I felt hopeful about the future”) not only had low or negative correlations (−0.11 and 0.35, respectively) with the scale overall, but removing them improved the measure’s internal consistency alpha. Regarding item 4, the authors suggested that the concept of being “as good as others” may not be a culturally appropriate comparison for Spanish speakers, for whom endorsing this may be equivalent to bragging, while providing a negative response may be a “culturally sanctioned avoidance of self-aggrandizement.” This idea is consistent with the fact that item 4 also correlated similarly with the CES-D and a self-efficacy measure. For item 8, the authors noted that the direct translation of this item to Spanish had a meaning closer to “I felt optimistic about the future,” which participants had difficulty understanding. The authors also reported that these two items also had a higher missing value rate than the other 18 items in the original CES-D evaluation. These findings suggest that older H/L adults may not interact with all questions of the CES-D in the same way as other sociodemographic groups. Alternatively, the vocabulary used in the translation of these items may not be culturally appropriate, as it may be interpreted differently by older H/L adults than was intended in the original measure.

The CES-D showed varying performance among community-dwelling participants from different nationalities. In a study of community-dwelling Mexican Americans, internal consistency ranged from α= 0.73 to 0.79, reflecting internal consistency below recommended cutoffs (Nunnally & Bergstein, 1994). In contrast, two studies where the measure was administered to Puerto Rican older adults yielded internal consistency of α=0.87 as well as a sensitivity of 64% and specificity of 64% using traditional cutoff score of 4; notably, using the authors’ recommended cutoff score of 3 caused an increase in sensitivity to 81% and specificity to 72% (Mahard, 1988; Robinson, 2002). In addition to variation between participants from different nationalities, Hispanic participants overall appeared to express positive affect differently than non-Hispanic Whites based on the results of differential item functioning (DIF) analyses conducted by MacIntosh and Strickland (2010); in other words, item-level bias significantly contributed to differences in total scores depending on ethnoracial group. This difference was greater than the variation expected to occur by chance, and thus, revealed measurement bias in CES-D items for older H/Ls.

Discussion

Summary of Evidence

Although previous research has summarized psychometric characteristics of tools used in late-life depression assessment (Balsamo et al., 2018), a failure of these studies to provide information about the sociodemographic characteristics of the samples included brings into question the generalizability of their findings. Therefore, the goal of this study was to examine available self-report depression screening tools for use with older Hispanics/Latin Americans. We sought to pursue this goal through a systematic review and established three main aims. For our first aim, which involved the identification of potentially useful depression screening tools, we found five measures available in English and Spanish that may be deemed appropriate for use with older adults. Of these, however, our search only yielded studies to support the validity of four such measures for use with older H/Ls, of which two were different versions of the same screening tool (GDS-15 and GDS-30). Notably, despite our broad search, all these are measures created in the United States and validated among Spanish speakers of H/L origin. Our second aim was to examine the psychometric properties of depression screening tools available in English and/or Spanish among older Hispanic/Latin American adults. Overall, we found some information reported on the psychometric characteristics of these measures from studies that included older H/Ls; while these studies are scarce, their samples appear representative and their findings may therefore be generalizable, with considerations described below. However, it is important to delineate the advantages and disadvantages of these measures as inferred from the characteristics of the studies in which their properties have been reported. For this purpose, the performance of these depression measures among different type of samples must be distinguished.

Based on a single study that met criteria for inclusion in this review, the Beck Depression Inventory appeared appropriate for use in research but did not necessarily meet minimal criteria for clinical use. While there were more studies with evidence regarding the Geriatric Depression Scale, these were divided as some reported data on the original 30-item version and others on the 15-item version. Based on internal consistency, the 30-item version was only well-supported among psychiatric patients. The authors of two studies recommended lower cutoff scores for the GDS-15 to improve sensitivity in both clinical and community samples, suggesting that older H/L participants may address the measure differently and therefore demonstrate a different score distribution than the original validation sample. Overall, both versions of the measure performed better among community-residing participants without a history of depression diagnosis in terms of sensitivity and specificity; even then, these values were higher among Puerto Rican than Mexican American participants when using the 30-item GDS, showing a certain variability based on cultural background.

A similar pattern emerged for the CES-D when used in community-dwelling samples as it did for the GDS when used in samples without a history of depression diagnosis; that is, the CES-D showed better internal consistency among Puerto Rican than Mexican participants, though even for the former sample, sensitivity and specificity values were only moderately acceptable and varied notably based on cutoff score. The internal consistency of the CES-D among patients from medical clinics was very good, but the measure had problematic items that threatened the overall validity of the measure when used with older H/Ls. Clinicians evaluating participants from this demographic group in clinical contexts using the CES-D should take the cultural inadequacy of items 4 and 8 into account, whether that implicates reconsidering their relevance in the overall assessment of depression symptoms or using an abbreviated version of the CES-D without these items (notably, this latter option is limited in that the psychometrics of these abbreviated versions in this population may also be unverified).

The small number of studies that met inclusion criteria for this systematic review preclude our ability to make firm recommendations. Therefore, we cautiously provide preliminary recommendations based on the reported results. The data summarized in the present study suggest that the utility of the measures identified may vary depending on their intended purpose. The BDI seems to be an appropriate instrument for the assessment of depression among older H/Ls in research contexts, although this assumption relies on a single data point obtained after a comprehensive systematic review. The best use of the GDS may be a subject of debate; based on internal consistency, it appears appropriate for use with patients who are hospitalized in psychiatric clinics, but various studies have urged to reconsider the use of Cronbach’s alpha values as a single determinant of validity (Dunn et al., 2014; Peters, 2014). When accounting for sensitivity and specificity, the GDS may be better suited for mentally healthy participants; and regardless of the population, professionals evaluating older H/Ls may consider using a different cutoff score to obtain a more desirable balance of sensitivity and specificity, as recommended by Baker and Espino (1997), Baker and colleagues (1994), and Robinson and colleagues (2002). Finally, the CES-D demonstrates weaknesses in regard to its cultural appropriateness, at least among older H/L adults. The acceptability of its psychometric properties is highly variable depending on demographic characteristics, including race/ethnicity and nationality, and it has been found through different methodologies that individuals from diverse backgrounds interact with certain items of this measure inconsistently. These issues may intervene with research findings in studies in which older H/L adults represent a relevant proportion of the sample, and for clinical purposes, providers may do well to select a more robust screening tool when evaluating older H/Ls. If used regardless, clinicians may consider using different cutoffs based on findings by Robinson and colleagues (2002) to maximize the utility of this measure.

Limitations

Although the present study provides a summary of depression measure properties based on a thorough systematic review, there are methodological issues to consider. The number of articles that met criteria for inclusion in our review was small, and there was variation among samples of the few studies included, greatly limiting interpretation and the recommendations that can be made based off these findings. Sample size varied from 44 to over 4000, and there was little homogeneity in terms of gender, education, and nationality of participants. Studies with diverse racial/ethnic samples often reported including specific percentages of “Hispanic” participants without clarifying their country of origin or their level of bilingualism or acculturation, key factors often excluded from psychological assessment literature (O’Bryant et al., 2004; Medina et al., 2020). Overall, as seen in Table 2, there was inconsistency in the reporting of significant differences at the level of relevant sociodemographic characteristics. These limitations increment the risk of bias at the study and outcome level. At the outcome level, it is also worth mentioning that the studies identified do not provide information on the positive or negative predictive values of the depression screening measures we describe, even though these values may be more accurate descriptors of the utility of a measure (Trevethan, 2017).

Conclusions

Overall, the data on the psychometric properties of depression screening measures for use with older H/L adults is variable and reveals a limited awareness among researchers and clinicians of the nuances unique to evaluating depression in this population. Of the three measures considered in this review, the Geriatric Depression Scale shows the most promise as an adequate test for use with diverse older adults, although not without caveats described above. Our findings suggest that using existing depression screening measures when working with these populations involves additional considerations for both clinicians and researchers in order to compensate for the fact that such measures were designed without accounting for cultural differences in the experience and manifestation of depressive symptoms. Future research should place greater emphasis on the development and validation of depression measures created with consideration of the needs of diverse clients and patients, or ideally, designed through an emic approach to aim for items written in culturally relevant terms.

Supplementary Material

1

Highlights.

  • We reviewed depression scales for use with older Hispanic/Latin American adults

  • Only three scales were found to contain psychometric data in this population

  • The validity of these measures may vary based on their intended use

  • Sensitivity of these measures may also be improved with modified cutoff scores

Acknowledgements

The authors of this manuscript have no additional acknowledgements to report.

Conflict of Interest

The authors have no relevant conflicts of interest to disclose. This work was supported by the National Institutes of Health [R24AG065170] and the Alzheimer’s Association [2019-AARGD-642445]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the Alzheimer’s Association.

Footnotes

Declarations of interest: none

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Addington D, Addington J, Maticka. Tyndale E, Iqbal Z, Birchwood M, Hemsley D, Jackson C, & Morris E, 2004. Calgary Depression Scale for Schizophrenia. British Journal of Clinical Psychology, 43, 97–104. [Google Scholar]
  2. Adler M, Liberg B, Andersson S, Isacsson G, & Hetta J, 2008. Development and validation of the Affective Self Rating Scale for manic, depressive, and mixed affective states. Nordic journal of psychiatry, 62(2), 130–135. 10.1080/08039480801960354 [DOI] [PubMed] [Google Scholar]
  3. Administration for Community Living, 2019. 2018 Profile of Hispanic Americans Age 65 and Over 4. [Google Scholar]
  4. Baker FM, Espino DV, 1997. A Spanish Version of the Geriatric Depression Scale in Mexican-American Elders. International Journal of Geriatric Psychiatry, 12, 21–25. [DOI] [PubMed] [Google Scholar]
  5. Baker FM, Espino DV, Robinson BH, Stewart B, 1994. Assessing Depressive Symptoms in African American and Mexican American Elders. Clinical Gerontologist 14, 15–29. 10.1300/J018v14n01_03 [DOI] [Google Scholar]
  6. Beck AT, Steer RA, & Brown GK, 1996. Manual for the Beck Depression Inventory-II. San Antonio, TX: Psychological Corporation. [Google Scholar]
  7. Blatt SJ, D’. Afflitti JP, Quinlan DM, & Bornstein RF, 2002. Depressive Experiences Questionnaire. Journal of Personality Assessment, 78, 47–68.11936212 [Google Scholar]
  8. Blazer D, 2003. Depression in Late Life: Review and Commentary. Journal of Gerontology: Medical Sciences, 58A(3), 249–265. [DOI] [PubMed] [Google Scholar]
  9. Brink TL, Yesavage JA, Brink TL, Yesavage JA, Lum O, Heersema PH, Adey M, & Rose TL, 1982. Geriatric Depression Scale--Long Form. [Also denoted: Geriatric Depression Scale]. Clinical Gerontologist, 1(1), 37–43. [Google Scholar]
  10. Bruce ML, Ten Have TR, Reynolds CF III, Katz II, Schulberg HC, Mulsant BH, Brown GK, McAvay GJ, Pearson JL, Alexopoulos GS, 2004. Reducing Suicidal Ideation and Depressive Symptoms in Depressed Older Primary Care Patients: A Randomized Controlled Trial. JAMA 291, 1081. 10.1001/jama.291.9.1081 [DOI] [PubMed] [Google Scholar]
  11. Burnam MA, Wells KB, Leake B, Landsverk J, Adams RS, Larson MJ, Corrigan JD, Horgan CM, & Williams TV, 2012. MOS Depression Screener. Journal of Head Trauma Rehabilitation, 27(5), 349–360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Burrows AB, Morris JN, Simon SE, Hirdes JP, Phillips C, Burrows AB, Morris JN, Simon SE, Hirdes JP, & Phillips C, 2000. Minimum Data Set Depression Rating Scale. Age and Ageing, 29, 165–172. [DOI] [PubMed] [Google Scholar]
  13. Carrete P, Augustovski F, Gimpel N, Fernandez S, Paolo R, Schaffer I, Rubinstein F, 2001. Validation of a telephone-administered geriatric depression scale in a Hispanic elderly population. J Gen Intern Med 16, 446–450. 10.1046/j.1525-1497.2001.016007446.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Carroll BJ, Boulos PK, Daiwani MS, Tanabe J, Mikulich-Gilbertson SK, Banich MT, Crowley TJ, & Sakai JT, 2016. Carroll Rating Scale for Depression. PLoS ONE, 11(4), 1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Crockett LJ, Randall BA, Shen Y-L, Russell ST, Driscoll AK, 2005. Measurement Equivalence of the Center for Epidemiological Studies Depression Scale for Latino and Anglo Adolescents: A National Study. Journal of Consulting and Clinical Psychology 73, 47–58. 10.1037/0022-006X.73.1.47 [DOI] [PubMed] [Google Scholar]
  16. Delibas H, Kirdok A, & Erol A, 2016. Alexithymia and somatisation in patients with remitted major depression and their impact on social functioning. S Afr J Psychiatr, 22(1), 886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Derogatis LR, Lipman RS, Rickels K, Uhlenhuth EH, Covi L, Burns DD, & Spangler DL, 2000. Hopkins Symptom Checklist--Depression Scale. Journal of Consulting and Clinical Psychology, 68, 46–56.10710839 [Google Scholar]
  18. Dunn TJ, Baguley T, Brunsden V, 2014. From alpha to omega: A practical solution to the pervasive problem of internal consistency estimation. Br J Psychol 105, 399–412. 10.1111/bjop.12046 [DOI] [PubMed] [Google Scholar]
  19. Espino DV, Bedolla MA, Perez M, Baker FM, 1996. Validation of the Geriatric Depression Scale in an Elder Mexican American Population: A Pilot Study. Clinical Gerontologist 16, 55–67. 10.1300/J018v16n04_05 [DOI] [Google Scholar]
  20. First MB, Spitzer RL, Gibbon M, Williams JBW, Hodgins DC, & el-Guebaly N, 2004. Structured Clinical Interview for DSM-IV--Patient Edition. Journal of Consulting and Clinical Psychology, 72, 72–80.14756616 [Google Scholar]
  21. Gatewood-Colwell G, Kaczmarek M, Ames MH, 1989. Reliability and Validity of the Beck Depression Inventory for a White and Mexican-American Gerontic Population. Psychol Rep 65, 1163–1166. 10.2466/pr0.1989.65.3f.1163 [DOI] [PubMed] [Google Scholar]
  22. Gerst K, Al-Ghatrif M, Beard HA, Samper-Ternent R, Markides KS, 2010. High depressive symptomatology among older community-dwelling Mexican Americans: The impact of immigration. Aging & Mental Health 14, 347–354. 10.1080/13607860903292578 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gonçalves DC, Albuquerque PB, Byrne GJ, Pachana NA, 2009. Assessment of depression in aging contexts: General considerations when working with older adults. Professional Psychology: Research and Practice 40, 609–616. 10.1037/a0017305 [DOI] [Google Scholar]
  24. Gonzalez HM, Haan MN, Hinton L, 2001. Acculturation and the Prevalence of Depression in Older Mexican Americans: Baseline Results of the Sacramento Area Latino Study on Aging. J Am Geriatr Soc 49, 948–953. 10.1046/j.1532-5415.2001.49186.x [DOI] [PubMed] [Google Scholar]
  25. González VM, Stewart A, Ritter PL, Lorig K, 1995. Translation and validation of arthritis outcome measures into Spanish. Arthritis & Rheumatism 38, 1429–1446. 10.1002/art.1780381010 [DOI] [PubMed] [Google Scholar]
  26. Hahn EA, Kim G, Chiriboga DA, 2011. Acculturation and Depressive Symptoms Among Mexican American Elders New to the Caregiving Role: Results from the Hispanic-EPESE. J Aging Health 23, 417–432. 10.1177/0898264310380454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Haroz EE, Bass J, Lee C, Oo SS, Lin K, Kohrt B, Michalopolous L, Nguyen AJ, & Bolton P, 2017. Development and cross-cultural testing of the International Depression Symptom Scale (IDSS): a measurement instrument designed to represent global presentations of depression. Global mental health (Cambridge, England), 4, e17. 10.1017/gmh.2017.16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hickie I, Scott E, Naismith S, Ward PB, Turner K, Parker G, Mitchell P, Wilhelm K, 2001. Late-onset depression: genetic, vascular and clinical contributions. Psychol. Med 31, 1403–1412. 10.1017/S0033291701004731 [DOI] [PubMed] [Google Scholar]
  29. Jay M, & John OP, 2004. A depressive symptom scale for the California Psychological Inventory: construct validation of the CPI-D. Psychological assessment, 16(3), 299–309. 10.1037/1040-3590.16.3.299 [DOI] [PubMed] [Google Scholar]
  30. Kellner R, & Kellner R, 1987. Symptom Questionnaire. Journal of Clinical Psychiatry, 48, 268–274. [PubMed] [Google Scholar]
  31. Lewis-Fernandez R, Das AK, Alfonso C, Weissman MM, Olfson M, 2005. Depression in US Hispanics: Diagnostic and Management Considerations in Family Practice. The Journal of the American Board of Family Medicine 18, 282–296. 10.3122/jabfm.18.4.282 [DOI] [PubMed] [Google Scholar]
  32. MacIntosh RC, Strickland OJ, 2010. Differential item responses on CES-D inventory: A comparison of elderly Hispanics and non-Hispanic Whites in the United States and item usage by elderly Hispanics across time. Aging & Mental Health 14, 556–564. 10.1080/13607860903421045 [DOI] [PubMed] [Google Scholar]
  33. McLean PD, Hakstian AR, McLean PD, & Hakstian AR, 1990. Brief Screen for Depression. Journal of Consulting and Clinical Psychology, 58(4), 482–488. [DOI] [PubMed] [Google Scholar]
  34. Medina LD, Torres S, Gioia A, Ochoa Lopez A, Wang J, Cirino PT, 2020. Reporting of Demographic Variables in Neuropsychological Research: An Update of O’Bryant et al.’s Trends in the Current Literature. J Int Neuropsychol Soc 1–11. 10.1017/S1355617720001083 [DOI] [PubMed] [Google Scholar]
  35. Michela B, Cataldi F, Carlucci L, Padulo C, Fairfield B, 2018. Assessment of late-life depression via self-report measures: a review. CIA Volume 13, 2021–2044. 10.2147/CIA.S178943 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mimura C, Murashige M, Oda T, & Watanabe Y, 2011. Development and psychometric evaluation of a Japanese scale to assess depression severity: Himorogi Self-rating Depression Scale. International journal of psychiatry in clinical practice, 15(1), 50–55. 10.3109/13651501.2010.509801 [DOI] [PubMed] [Google Scholar]
  37. Mogge NL, Steinberg JS, Fremouw W, & Messer J, 2008. The Assessment of Depression Inventory (ADI): an appraisal of validity in an outpatient sample. Depression and anxiety, 25(1), 64–68. 10.1002/da.20247 [DOI] [PubMed] [Google Scholar]
  38. Moher D, Liberati A, Tetzlaff J, Altman D, The PRISMA Group, 2009. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Medicine 6, 1–6. [PMC free article] [PubMed] [Google Scholar]
  39. Mahard R, 1988. The CES-D as a Measure of Depressive Mood in the Elderly Puerto Rican Population. Journal of Gerontology, 43(1), P24–P25. 10.1093/geronj/43.1.P24 [DOI] [PubMed] [Google Scholar]
  40. Nunnally JC and Bernstein IH, 1994. The Assessment of Reliability. Psychometric Theory, 3, 248–292. [Google Scholar]
  41. Nyenhuis DL, Luchetta T, Yamamoto C, Terrien A, Bernardin L, Rao SM, Garron DC, Nyenhuis DL, Luchetta T, Yamamoto C, Terrien A, Bernardin L, Rao SM, & Garron DC, 1998. Chicago Multiscale Depression Inventory. Journal of Personality Assessment, 70(2), 386–401. [DOI] [PubMed] [Google Scholar]
  42. Nyenhuis DL, Rao SM, Zajecka JM, Garron DC, Nyenhuis DL, Rao SM, Zajecka JM, Luchetta T, Bernardin L, & Garron DC, 1995. Multiscale Depression Inventory. Journal of the International Neuropsychological Society, 1, 291–296. [DOI] [PubMed] [Google Scholar]
  43. O’Bryant S, O’Jile J, McCaffrey R, 2004. Reporting of Demographic Variables in Neuropsychological Research: Trends in the Current Literature. The Clinical Neuropsychologist 18, 229–233. 10.1080/13854040490501439 [DOI] [PubMed] [Google Scholar]
  44. Parker G, Hilton T, Bains J, Hadzi-Pavlovic D, McHale M, Hendrikz J, Dann F, & Kenardy J, 2008. Depression in the Medically Ill−−10. Psychosomatic Medicine, 70, 869–874. [DOI] [PubMed] [Google Scholar]
  45. Peters G-JY, 2014. The alpha and the omega of scale reliability and validity: Why and how to abandon Cronbach’s alpha and the route towards more comprehensive assessment of scale quality. European Health Psychologist, 16(2),56–69.doi: 10.31234/osf.io/h47fv [DOI] [Google Scholar]
  46. Pilkonis PA, Yu L, Dodds NE, Johnston KL, Maihoefer CC, & Lawrence SM, 2014. Validation of the depression item bank from the Patient-Reported Outcomes Measurement Information System (PROMIS) in a three-month observational study. Journal of psychiatric research, 56, 112–119. 10.1016/j.jpsychires.2014.05.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Post-White J, & Post-White J, 1986. Depression Symptom Survey. Cancer Nursing, 9(1), 15–22. [PubMed] [Google Scholar]
  48. Schweitzer I, Tuckwell V, O’Brien J, Ames D, 2002. Is late onset depression a prodrome to dementia? Int. J. Geriat. Psychiatry 17, 997–1005. 10.1002/gps.525 [DOI] [PubMed] [Google Scholar]
  49. Sheikh JI, & Yesavage JA, 1986. Geriatric Depression Scale (GDS): Recent evidence and development of a shorter version. Clinical Gerontologist, 5, 165–173. [Google Scholar]
  50. Sheikh JI, Yesavage JA, Washington TR, Zimmerman S, Hain DJ, & Carlton-LaNey I, 2018. Geriatric Depression Scale--Short Form. Nephrology Nursing Journal, 45(3), 251–258. [PubMed] [Google Scholar]
  51. Siddiqui S, & Ali Shah SA, 1997. Siddiqui-Shah Depression Scale (SSDS): Development and Validation. Psychology and Developing Societies, 9(2), 245–262. 10.1177/097133369700900205 [DOI] [Google Scholar]
  52. Radloff LS, 1977. The CES-D Scale: A Self-Report Depression Scale for Research in the General Population. Applied Psychological Measurement, 1(3), 385–401. 10.1177/014662167700100306 [DOI] [Google Scholar]
  53. Raskin A, Schulterbrandt J, Reating N, et al. , Sajatovic M, & Ramirez LF (2003). Raskin Depression Rating Scale. [Google Scholar]
  54. Reynolds W, 1994. Self-Report Depression Questionnaire. In Assessment in Rehabilitation and Exceptionality (Vol. 1, Issue 1, pp. 1–20). [Google Scholar]
  55. Reynolds WM, Kobak KA, Murphy DA, Herbeck DM, Marelich WD, & Schuster MA, 2010. Hamilton Depression Inventory. International Journal of Sexual Health, 22, 195–204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Robison J, Gruman C, Gaztambide S, Blank K, 2002. Screening for Depression in Middle-Aged and Older Puerto Rican Primary Care Patients. The Journals of Gerontology Series A: Biological Sciences and Medical Sciences 57, M308–M314. 10.1093/gerona/57.5.M308 [DOI] [PubMed] [Google Scholar]
  57. Rockliff BW, & Radloff LS, 1977. Rockliff Depression Rating Scale. Applied Psychological Measurement, 1, 385–401. [Google Scholar]
  58. Rogers WH, Wilson IB, Bungay KM, Kwon A, Bungay KM, Pei Y, Rogers WH, Wilson IB, Zhou Q, & Adler DA, 2003. Primary Care Screener for Affective Disorders. Medical Care, 41(3), 368–374. [DOI] [PubMed] [Google Scholar]
  59. Romaniuk M, & Khawaja NG, 2013. University student depression inventory (USDI): confirmatory factor analysis and review of psychometric properties. Journal of affective disorders, 150(3), 766–775. 10.1016/j.jad.2013.02.037 [DOI] [PubMed] [Google Scholar]
  60. Rush AJ, Giles DE, Schlesser MA, Fulton CL, Weissenburger J, Burns C, Blumberger DM, Vila-Rodriguez F, Thorpe KE, Feffer K, Noda Y, Giacobbe P, Knyahnytska Y, Kennedy SH, Lam RW, Daskalakis ZJ, & Downar J, 2018. Inventory for Depressive Symptomatology--Self-Report. Lancet, 391, 1683–1692.29726344 [Google Scholar]
  61. Rybarczyk B, Winemiller D, Lazarus L, Haut A, & Hartman C, 1996. Validation of a Depression Screening Measure for Stroke Inpatients. Geriatric Psychiatry, 4(2), 13–139 [DOI] [PubMed] [Google Scholar]
  62. Teresi JA, Ramirez M, Lai J-S, Silver S, 2010. Occurrences and sources of Differential Item Functioning (DIF) in patient-reported outcome measures: Description of DIF methods, and review of measures of depression, quality of life and general health 62. [PMC free article] [PubMed] [Google Scholar]
  63. Trevethan R (2017). Sensitivity, Specificity, and Predictive Values: Foundations, Pliabilities, and Pitfalls in Research and Practice. Front Public Health, 5(307). doi: 10.3389/fpubh.2017.00307 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Tuunainen A, Langer RD, Klauber MR, & Kripke DF, 2001. Short version of the CES-D (Burnam screen) for depression in reference to the structured psychiatric interview. Psychiatry research, 103(2–3), 261–270. 10.1016/s0165-1781(01)00278-5 [DOI] [PubMed] [Google Scholar]
  65. Uher R, Perlis RH, Placentino A, Dernovšek MZ, Henigsberg N, Mors O, Maier W, McGuffin P, Farmer A, 2012. Self-Report and Clinician-Rated Measures of Depression Severity: Can One Replace the Other? Depress Anxiety 29, 1043–1049. 10.1002/da.21993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Valente S & Saunders J, 2005. Screening for depression and suicide: self-report instruments that work. Journal of Psychological Nursing, 43(11). [DOI] [PubMed] [Google Scholar]
  67. Van der Does W, & Van der Does W, 2002. Leiden Index of Depression Sensitivity. Behaviour Research and Therapy, 40, 105–120. [DOI] [PubMed] [Google Scholar]
  68. Wang RIH, Treul S, Alverno L, Wang RIH, Treul S, & Alverno L, 1975. Brief Self-Assessing Depression Scale. Journal of Clinical Pharmacology, 15, 163–167. [DOI] [PubMed] [Google Scholar]
  69. Wells KB, Burnam MA, Camp P, Wells KB, Burnam A, & Camp P, 1995. Adult Outpatients Questionnaire. Medical Care, 33, 350–364. [DOI] [PubMed] [Google Scholar]
  70. Wu C, Kelley LS, 2007. Choosing an Appropriate Depression Assessment Tool for Chinese Older Adults: A Review of 11 Instruments. JGN 33, 12–22. 10.3928/00989134-20070801-04 [DOI] [PubMed] [Google Scholar]
  71. Yesavage JA, Brink TL, Rose TL, Lum O, Huang V, Adey M, & Leirer VO, 1982. Development and validation of a geriatric depression screening scale: a preliminary report. Journal of psychiatric research, 17(1), 37–49. 10.1016/0022-3956(82)90033-4 [DOI] [PubMed] [Google Scholar]
  72. Zimmerman M, 1997. Self-Report Scale to Diagnose Depression. In Psychopharmacology bulletin / (Vol. 33, pp. 13–16). The Clearinghouse,. [Google Scholar]
  73. Zimmerman M, Chelminski I, McGlinchey JB, & Posternak MA, 2008. A clinically useful depression outcome scale. Comprehensive psychiatry, 49(2), 131–140. 10.1016/j.comppsych.2007.10.006 [DOI] [PubMed] [Google Scholar]
  74. Zimmerman M, Coryell W, Corenthal C, Wilson S, Nolen-Hoeksema S, & Morrow J, 1991. Interview to Diagnose Depression. Journal of Personality and Social Psychology, 61, 115–121. [DOI] [PubMed] [Google Scholar]
  75. Zimmerman M, Sheeran T, Young D, Zimmerman M, Chelminski I, McGlinchey J, & Young D, 2006. Diagnostic Inventory for Depression. Journal of Nervous and Mental Disease, 194(8), 565–569. [DOI] [PubMed] [Google Scholar]
  76. Zemore R, Zemore R, Fischer DG, Garratt LS, & Miller C, 1990. Depression Proneness Rating Scale. Current Psychology, 9, 255–263. [Google Scholar]
  77. Zung WWK, Sharpley CF, Bitsika V, Christie DRH, Bradford R, Steigler A, & Denham JW, 2017. Self-Rating Depression Scale. Journal of Psychosocial Oncology, 35(4), 438–450. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES