Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Jan 8.
Published in final edited form as: J Nerv Ment Dis. 2012 Sep;200(9):10.1097/NMD.0b013e318266bcaa. doi: 10.1097/NMD.0b013e318266bcaa

Rater Evaluations for Psychiatric Instruments and Cultural Differences: The PANSS in China and United States

Neil Krishan Aggarwal a,b,*, Xiang Yang Zhang d,e,*, Elina Stefanovics a,c, Da Chun Chen e, Mei Hong Xiu e, Ke Xu a,b, Robert A Rosenheck a,c
PMCID: PMC3885177  NIHMSID: NIHMS527934  PMID: 22922237

Abstract

This article compares Positive and Negative Syndrome Scale (PANSS) data from Chinese and American inpatients with chronic schizophrenia to show how differences in item ratings may reflect cultural attitudes of raters. The Chinese sample (N=504) came from Beijing Huilongguan Hospital. The American sample came from 268 PANSS assessments of CATIE subjects hospitalized for 15 days or more to optimize equivalence of the samples. Controlling for age and gender, the Chinese sample scored significantly lower for total score by 25% (p<.0001), for the positive sub-scale by 35% (p<.0001), and on the general sub-scale by 32% (p<.0001), but not significantly different on the negative sub-scale score (+0.26%, p=0.76). However, the Chinese sample scored 26% higher on the item on poor rapport (p<.0001), 10.2% higher on passive social withdrawal (p=.003), and most notably 46% higher on the item on lack of judgment and insight (p<.0001). These results remain broadly consistent across gender sub-group analyses. Differences seem to be best explained by both cultural differences in patient clinical presentations as well as varying American and Chinese cultural values affecting rater judgment.

Keywords: schizophrenia, PANSS, cultural psychiatry, ethnopsychology, rating scales

INTRODUCTION

Cross-cultural psychiatric researchers have struggled to investigate mental illness across cultures. Cultures possess distinct norms of normal and abnormal behavior, influencing symptom patterns and raising questions about the homogeneity of illnesses across societies (Devereux, 2000). Psychiatrists have typically assumed that emotions are expressed and can be measured in similar ways across populations, although ethnographers have observed cross-cultural variations in their manifestation, apprehension, and sociolinguistic expression (Abu-Lughod and Lutz, 1990). These considerations have guided research in cultural psychiatry on the universality or specificity of psychopathology, the challenges in providing services in inter-cultural patient-physician interactions, and methods of assessment in the absence of universal pathogenic biomarkers (Kirmayer, 2007).

Instrument translation exemplifies dilemmas in cultural psychiatry. Questionnaires from one setting that do not consider idioms of distress or different symptom manifestations in another setting may neglect cultural influences on the manifestation or assessment of illness (Kleinman, 1988). In cultures without medical vocabularies to express psychological states, researchers must decide between strict lexical and broad conceptual translations (Aggarwal, 2007). In addition, questions from European languages may need considerable rethinking in Asian or African languages with different grammatical structures (Ertan and Eker, 2000). Instruments can only be applied cross culturally with appreciation that illnesses and their assessment may vary across cultures and that even well translated instruments may be applied and interpreted differently.

This article confronts these challenges by analyzing data from the widely used Positive and Negative Syndrome Scale (PANSS) among Chinese and American inpatients with schizophrenia over the past decade. The PANSS includes 30 items that generate a total symptom severity score and three sub-scales of mutually exclusive items measuring positive symptoms (i.e. hallucinations, delusions, hostility), negative syndromes (i.e. blunted affect, social withdrawal or poor rapport) and general symptoms of psychological distress (i.e. anxiety, depression, lack of insight) (Kay et al., 1987), though alternative subscale configurations are possible. Since its publication in 1987, the PANSS has been translated into many languages, including Swedish (von Knorring and Lindström, 1992), Japanese (Igarashi et al., 1998), French (Lançon et al., 1999), and Thai (Nilchaikovit et al., 2000). The Chinese translation of the PANSS appeared in 1991 after a four-center study confirmed its inter-rater reliability, short-term test-retest reliability, and internal consistency with principal components analysis validating the separateness of positive and negative factors (Phillips et al., 1991). The authors attempted to reach the publisher of the PANSS, Multi-Health Systems, for a full list of translations that was not provided at the time of this writing.

Few studies have analyzed cross-cultural PANSS data. International, multicenter clinical medication trials typically randomize or stratify PANSS data by treatment group since their interest is drug efficacy (Beasley et al., 1997; Costa e Silva et al., 2001; Findling et al., 2008; Garcia et al., 2009; Kane et al., 2006; Kasper et al., 2003; Lasser et al., 2004; Lee et al., 2002; Olié et al., 2002; Ryckmans et al., 2009; van Os et al., 2004), with little interest in differences in PANSS responses between countries, ethnicities, or languages. In a previous study, Aggarwal and colleagues (2011) found substantial differences between psychiatric inpatients from four hospitals in Changsha, China and inpatients in the US CATIE schizophrenia trial (Lieberman et al., 2005). Generally, Chinese scores were substantially lower than American scores, but Chinese ratings were especially (>20%) higher on items such as hostility, disturbance of volition, lack of insight and poor impulse control, which the authors attributed to rater differences in the application of interpersonal cultural norms emphasizing harmonious relationships and adherence to authority in China. In a rare exception, Mechri et al. (2009) organized PANSS data according to subscale and total scores for patients with schizophrenia from France and Tunisia. They found that neurological soft signs were associated with the PANSS disorganization sub-scale in the French sample, but with the PANSS negative sub-scale in the Tunisian sample. Most studies using the PANSS focus on treatment comparisons, assuming that cross-cultural impacts are negligible.

This study compares differences among inpatients in the CATIE study and a Beijing sample on total and subscale scores. Individual items are identified that differ by sample in the total PANSS scale and subscales. Assuming that schizophrenia manifests similarly in China and the US and that translation from English to Chinese is reasonably equivalent, variant item ratings countering broad trends may reflect cultural differences in clinical presentations, patient responses, or PANSS application by raters.

METHODS

The Chinese sample came from hospitalized inpatients in Beijing (N=504) and included age, gender, days of hospitalization, and PANSS individual item ratings. The study was conducted at the Beijing Hui Long Guan Hospital and HeBei Province Veteran Psychiatric Hospital. Beijing Hui Long Guan Hospital is one of the largest psychiatric hospitals in China with more than 1300 inpatient beds and is located 30 km from central Beijing. This hospital serves a catchment population of 13 million people. These patients had a duration of illness averaging 24.4 ± 9.6 years with 10.0 ± 9.5 years of hospitalization.

This hospital also has an outpatient setting, treating individuals with slight to moderate severity of psychosis or following-up patients after inpatient stay. Patients with moderate to severe severity of symptoms [generally with Clinical Global Impression (CGI) ≥4], first-episode psychosis, and acutely exacerbations are hospitalized. For first-episode patients, the length of hospitalization is short, generally 2–6 months due to good response to antipsychotic treatment. Commonly used antipsychotics include chlorpromazine, haloperidol or perphenazine among first-generation antipsychotics and risperidone, clozapine, olanzapine or aripiprazole among second generation antipsychotics. The study was performed between December 2006 and May 2007.

For the Chinese samples, all schizophrenic patients were of the chronic subtype, with duration of illness for at least 5 years. The recruitment criteria included: 1) age 16–75 years; 2) Han Chinese ethnicity; 3) confirmed DSM-IV diagnosis of schizophrenia; 4) at least 5 years of illness; 5) stable prescriptions of oral antipsychotic drugs for at least 6 months before entry into the study; 6) written informed consent and able to take part in neuropsychological assessment. Patients with medical or psychiatric co-morbidities and those needing other medications were included.

The US sample was derived from the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE), a large, NIMH funded antipsychotic drug trial conducted between January 2001 and December 2004 at 57 US sites based on an algorithmically-determined treatment phases. Patients were initially assigned to olanzapine, perphenazine, quetiapine, risperidone, or ziprasidone under double-blind conditions. Since there were few differences in drug outcomes across multiple measures (Rosenheck and Sernyak, 2009), we pooled results from all treatment arms for these analyses .

In both samples, patients with first-episode psychosis were excluded if they did not meet a DSM-IV diagnosis of schizophrenia to avoid enrollment of subjects whose diagnostic categories might change over time. Since both samples comprise treatment efficacy studies, patients with treatment-refractory schizophrenia were excluded for their low rate of response to multiple trials of adequate doses of antipsychotics.

Measures

In Beijing, four psychiatrists, blind to the clinical status and treatment condition, assessed the patient’s psychopathology using the PANSS upon admission. To ensure consistency and reliability of rating across the study, these four psychiatrists who had worked at least 5 years in clinical practice, were trained with the PANSS before the start of the study. After training, a correlation coefficient greater than 0.8 was maintained for inter-rater reliability on the PANSS total score at repeated assessments during the study.

In the CATIE sample, PANSS assessments were completed at 1 month and then every 3 months after randomization up to 18 months. Hospital days were assessed every month. Data presented here include all baseline assessments of patients hospitalized for at least 15 days during the month prior to assessment (N=268) to maximize comparability to the Beijing sample.

Symptom outcomes were assessed with the PANSS, in which higher scores reflect worse symptoms, and a 20% difference in scores was used in this analysis to represent a clinically important difference (Cramer et al., 2001; Leucht et al., 2005). Information from both data sets was also available on age, gender and duration of illness.

Data Analysis

First, t-tests and Pearson chi-square were used to assess the significance of differences between the samples in age and gender. Next, we used generalized linear models to assess the significance of differences between the two samples on PANSS summary scores, the three subscales, and individual items, adjusting for differences in age and gender. We present least square means and compare the percentage difference in scores between the Beijing and CATIE samples. We could not control for duration of hospitalization because the measurements of duration of hospital stay were different in the two studies. Statistical analyses was performed using SAS 9.1 statistical software (SAS institute Inc, Cary, North Carolina, USA)

RESULTS

The sample included 772 observations. The Chinese sample ranged 16–76 years old, averaging 50 years (SD=10.712), compared to the CATIE sample which ranged in age from 18–65 with a mean of 37.8 years old (SD=11.7) (t=14.53 , DF=764, p<.0001). There were 332 (65.9%) males and 172 (34.1%) females in the Chinese sample compared to the 205 (76.5%) males and 63 females (23.5%) in the CATIE sample (chi-square=7.1, DF=1, p=.007).

Controlling for age and gender, scores in the Chinese sample were significantly lower than the CATIE sample for total score by 25% (p<.0001), for the positive sub-scale by 35% (p<.0001), and on the general sub-scale by 32% (p<.0001) [Table 1]. The Chinese sample also scored 24–49% lower than the CATIE sample on all positive items (all p<.0001). Differences on the negative sub-scale, in contrast were not significantly different. Notably, positive symptom items assess hallucinatory behavior, grandiosity, abstract thinking, stereotypy, somatic concerns, anxiety, guilt, tension, mannerisms, depression, and disorientation. One hypothesis is that the Chinese sample exhibited less psychopathology than the CATIE sample. In this manner, the American sample could be conceived as “more ill” clinically than the Chinese sample. Another hypothesis concerns the rater administrations of the PANSS itself. These ratings are assessed through direct verbal response of the patient as judged by trained raters. Cultural differences in stigma could influence whether American were viewed as more likely to endorse these symptoms than were Chinese patients.

TABLE 1.

Clinical assessment characteristics of patients with early episode (2 years or less) schizophrenia at baseline, controlling for gender and age. Shaded rows are significant at p<.05

BEIJING
SAMPLE
CATIE
SAMPLE
T P Percent
N=504 N=268 diff
PANSS scores and items LS
mean
SE LS mean SE
PANSS TOTAL 60.13 0.75 80.00 1.08 −15.08 <.0001 −24.8%
Subscales
  Positive 13.13 0.26 20.23 0.38 −15.15 <.0001 −35.1%
  Negative 20.33 0.34 20.43 0.49 −0.15 0.87 −0.5%
  General 26.67 0.35 39.35 0.50 −20.72 <.0001 −32.2%
INDIVIDUAL ITEMS
  Positive Items
  P1 (Delusions) 2.79 0.08 3.77 0.12 −6.53 <.0001 −26.0%
  P2 (Conceptual Disorganization) 2.46 0.07 3.27 0.10 −6.50 <.0001 −24.8%
  P3 (Hallucinatory Behavior) 1.80 0.07 3.50 0.11 −13.05 <.0001 −48.6%
  P4 (Excitement) 1.18 0.04 2.07 0.06 −12.05 <.0001 −43.0%
  P5 (Grandiosity) 1.52 0.07 2.27 0.10 −6.20 <.0001 −33.0%
  P6 (Suspiciousness/persecution) 2.11 0.07 3.49 0.11 −10.48 <.0001 −39.5%
  P7 (Hostility) 1.27 0.04 1.86 0.06 −8.11 <.0001 −31.7%
  Negative Items
    N1 (Blunted affect) 3.25 0.07 3.06 0.09 1.60 0.11 6.2%
    N2 (Emotional withdrawal) 3.37 0.06 3.09 0.09 2.50 0.01 9.1%
    N3 (Poor rapport) 3.11 0.06 2.47 0.09 5.55 <.0001 25.9%
    N4 (Passive/apathetic social Withdrawal) 3.34 0.07 3.03 0.09 2.67 0.01 10.2%
  N5 (Difficulty in abstract thinking) 3.25 0.07 3.66 0.10 −3.26 0.0012 −11.2%
  N6( Lack of spontaneity & flow of conversation) 2.56 0.07 2.60 0.10 −0.34 0.73 −1.5%
  N7 (Stereotyped thinking) 1.46 0.05 2.51 0.08 −10.82 <.0001 −41.8%
  General Items
   G1 (Somatic concern) 1.23 0.05 2.46 0.07 −14.99 <.0001 −50.0%
   G2 (Anxiety) 1.36 0.04 2.91 0.06 −19.39 <.0001 −53.3%
   G3 (Guilt feelings) 1.08 0.04 2.5 0.06 −18.82 <.0001 −56.8%
   G4 (Tension) 1.06 0.04 2.46 0.05 −19.51 <.0001 −56.9%
   G5 (Mannerisms & posturing) 1.03 0.03 1.79 0.04 −13.81 <.0001 −42.5%
   G6 (Depression) 1.43 0.05 2.72 0.07 −14.16 <.0001 −47.4%
   G7 (Motor retardation) 1.88 0.06 2.15 0.08 −2.63 0.008 −12.6%
   G8 (Uncooperativeness) 1.39 0.05 1.63 0.06 −2.96 0.003 −14.7%
   G9 (Unusual thought content) 3.05 0.08 3.31 0.12 −1.87 0.06 −7.9%
   G10 (Disorientation) 1.51 0.05 1.93 0.07 −4.57 <.0001 −21.8%
   G11 (Poor attention) 1.74 0.05 2.39 0.08 −6.63 <.0001 −27.2%
   G12 (Lack of judgment & insight 4.83 0.06 3.30 0.09 13.44 <.0001 46.4%
   G13 (Disturbance of volition 1.55 0.05 2.38 0.08 −8.40 <.0001 −34.9%
   G14 (Poor impulse control) 1.06 0.04 1.99 0.05 −14.35 <.0001 −46.7%
   G15 (Preoccupation) 1.15 0.04 2.60 0.06 −18.74 <.0001 −55.8%
   G16 (Active social avoidance) 1.24 0.05 2.82 0.07 −18.07 <.0001 −56.0%

However, the Chinese sample scored 26% higher on the item representing poor rapport or lack of interpersonal empathy (p<.0001), and 12% higher on passive social withdrawal (p=.003). On the general subscale, the Chinese sample scored significantly lower by 20–57% on all items except for lack of judgment and insight where they scored 46% higher than CATIE subjects (p<.0001). Poor rapport, passive social withdrawal, and lack of judgment and insight are scored interpersonally through rater interpretations of patient behavior and deserve further cross-cultural scrutiny through qualitative analysis.

Gender sub-analyses across the CATIE and Chinese samples follow similar trends. Chinese men scored significantly lower than CATIE men on the total score by 26.1% (p<.0001), on the positive subscale by 37.5% (p<.001), and on the general subscale by 34.3% (p<.0001) [Table 2]. Chinese men scored 24–54% lower than CATIE men on all positive items (p<.0001). Similarly, Chinese men scored 24.4% higher on the item representing poor rapport (p<.0001) and 10.4% higher on social withdrawal (p=.008). On the general subscale, Chinese men scored significantly lower by 15–61% on all items except for lack of judgment and insight where they scored 42.2% higher than CATIE men (all p<.0001).

TABLE 2.

Comparing PANSS scores of men across the CATIE and Chinese samples Shaded rows are significant at p<.05)

BEIJING USA T P Percent
N=332 N=205 diff
PANSS scores and items LS mean SE LS mean SE
PANSS TOTAL 60.02 0.86 81.22 1.16 −13.92 <.0001 −26.1%
Subscales
  Positive 12.61 0.29 20.19 0.40 −14.41 <.0001 −37.5%
  Negative 21.32 0.39 21.34 0.53 −0.02 0.98 −0.1%
  General 26.09 0.41 39.69 0.55 −18.84 <.0001 −34.3%
INDIVIDUAL ITEMS
  Positive Items
    P1 (Delusions) 2.70 0.10 3.76 0.13 −5.99 <.0001 −28.2%
    P2 (Conceptual Disorganization) 2.54 0.08 3.35 0.11 −5.41 <.0001 −24.2%
    P3 (Hallucinatory Behavior) 1.60 0.08 3.46 0.11 −12.94 <.0001 −53.8%
    P4 (Excitement) 1.09 −0.05 1.98 0.06 −10.91 <.0001 −44.9%
    P5 (Grandiosity) 1.66 0.08 2.37 0.11 −4.74 <.0001 −30.0%
    P6 (Suspiciousness/persecution) 1.84 0.08 3.51 0.10 −11.66 <.0001 −47.6%
    P7 (Hostility) 1.20 0.04 1.75 0.06 −6.92 <.0001 −31.4%
  Negative Items
    N1 (Blunted affect) 3.51 0.08 3.27 0.10 1.77 0.08 7.3%
    N2 (Emotional withdrawal) 3.17 −0.12 2.99 0.21 0.72 0.47 6.0%
    N3 (Poor rapport) 3.21 0.08 2.58 0.10 4.66 <.0001 24.4%
    N4 (Passive/apathetic social withdrawal) 3.58 0.07 3.23 0.09 2.66 0.008 10.8%
    N5 (Difficulty in abstract thinking) 3.28 0.08 3.65 0.11 −2.41 0.01 −10.1%
    N6 (Lack of spontaneity & flow of conversation) 2.72 0.08 2.75 0.11 −0.20 0.83 −1.1%
    N7 (Stereotyped thinking) 1.56 0.06 2.58 0.08 −9.98 <.0001 −39.5%
  General Items
    G1 (Somatic concern) 1.17 0.05 2.53 0.07 −14.06 <.0001 −53.8%
    G2 (Anxiety) 1.25 0.05 2.82 0.07 −17.13 <.0001 −55.7%
    G3 (Guilt feelings) 1.05 0.05 2.36 0.07 −14.4 <.0001 −55.5%
    G4 (Tension) 1.12 0.04 2.45 0.06 −17.04 <.0001 −54.3%
    G5 (Mannerisms & posturing) 1.05 0.04 1.91 0.05 −12.07 <.0001 −45.0%
    G6 (Depression) 1.42 0.06 2.65 0.08 −11.16 <.0001 −46.4%
    G7 (Motor retardation) 1.95 0.08 2.16 0.09 −1.79 0.07 −9.7%
    G8 (Uncooperativeness) 1.33 0.05 1.64 0.07 −3.43 0.0006 −18.9%
    G9 (Unusual thought content) 2.76 0.09 3.25 0.12 −2.97 0.003 −15.1%
    G10 (Disorientation) 1.58 0.07 1.98 0.08 −3.61 0.0003 −20.2%
    G11 (Poor attention) 1.81 0.07 2.46 0.09 −5.48 <.0001 −26.4%
    G12 (Lack of judgment & insight) 4.85 0.08 3.41 0.10 10.64 <.0001 42.2%
    G13 (Disturbance of volition) 1.45 0.06 2.39 0.09 −8.13 <.0001 −39.3%
    G14 (Poor impulse control) 1.04 0.04 2.08 0.06 −12.78 <.0001 −50.0%
    G15 (Preoccupation) 1.01 0.04 2.61 0.06 −20.26 <.0001 −61.3%
    G16 (Active social avoidance) 1.23 0.06 2.91 0.08 −16.12 <.0001 −57.7%

Likewise, Chinese women scored significantly lower than CATIE men on the total score by 22% (p<.0001), on the positive subscale by 29.1% (p<.001), and on the general subscale by 28.5% (p<.0001) [Table 3]. Chinese women scored 20–42% lower than CATIE women on all positive items (p=.02 or less for all scores). Like Chinese men, Chinese women scored 26.5% higher on the item representing poor rapport (p=.0009). However, some gender differences emerge. Compared to women in the CATIE study, Chinese women did not show statistically significant differences on social withdrawal, though they did show a 38.4% decrease in stereotyped thinking (p<.0001). Furthermore, there are the same dramatic differences within the general subscale among Chinese and CATIE women. Like Chinese men, Chinese women continued to score substantially higher on lack of judgment and insight (53.7%, p<.0001), while scores for other items were lower by 26–61% (p=.004 or less for all scores) except for non-significant differences in motor retardation, uncooperativeness, and unusual thought content.

TABLE 3.

Comparing PANSS scores of women across the CATIE and Chinese samples Shaded rows are significant at p<.05

BEIJING USA T P Percent
N=172 N=63 diff
PANSS scores and items LS mean SE LS mean SE
PANSS TOTAL 60.74 1.31 77.87 2.31 −6.27 <.0001 −22.0%
Subscales
  Positive 13.89 0.49 19.58 0.87 −5.54 <.0001 −29.1%
  Negative 19.34 0.62 19.8 1.09 −0.35 0.72 −2.3%
  General 27.51 0.58 38.49 1.02 −9.08 <.0001 −28.5%
INDIVIDUAL ITEMS
  Positive Items
    P1 (Delusions) 2.92 0.15 3.63 0.26 −2.31 0.02 −19.6%
    P2 (Conceptual Disorganization) 2.39 0.12 3.21 0.21 −3.33 0.001 −25.5%
    P3 (Hallucinatory Behavior) 2.07 0.14 3.21 0.24 −3.93 0.0001 −35.5%
    P4 (Excitement) 1.29 0.08 2.21 0.14 −5.64 <.0001 −41.6%
    P5 (Grandiosity) 1.41 0.10 2.13 0.17 −3.34 0.0007 −33.8%
    P6 (Suspiciousness/persecution 2.48 1.14 3.15 25.00 −2.26 0.02 −21.3%
    P7 (Hostility) 1.33 0.08 2.03 0.14 −4.12 <.0001 −34.5%
  Negative Items
    N1 (Blunted affect) 2.98 0.11 2.86 0.20 0.52 0.6 4.2%
    N2 (Emotional withdrawal) 3.17 −0.12 2.99 0.21 0.72 0.47 6.0%
    N3 (Poor rapport) 3.01 0.11 2.38 0.20 2.62 0.009 26.5%
    N4 (Passive/apathetic social withdrawal) 3.08 0.13 2.93 0.22 0.57 0.57 5.1%
    N5 (Difficulty in abstract thinking) 3.23 0.11 3.67 0.20 −1.84 0.06 −12.0%
    N6 (Lack of spontaneity & flow of conversation) 2.39 0.12 2.54 0.21 −0.61 0.54 −5.9%
    N7 (Stereotyped thinking) 1.49 0.09 2.42 0.17 −4.74 <.0001 −38.4%
  General Items
    G1 (Somatic concern) 1.35 0.07 2.20 0.13 −5.48 <.0001 −38.6%
    G2 (Anxiety) 1.46 0.08 3.04 0.14 −9.43 <.0001 −52.0%
    G3 (Guilt feelings) 1.07 0.06 2.73 0.11 −12.15 <.0001 −60.8%
    G4 (Tension) 1.21 0.06 2.5 0.11 −9.90 <.0001 −51.6%
    G5 (Mannerisms & posturing) 1.03 0.04 1.55 0.07 −6.60 <.0001 −33.5%
    G6 (Depression) 1.43 0.08 2.75 0.14 −7.62 <.0001 −48.0%
    G7 (Motor retardation) 1.81 0.09 2.13 0.17 −1.59 0.11 −15.0%
    G8 (Uncooperativeness) 1.46 0.09 1.58 0.15 −0.63 0.52 −7.6%
    G9 (Unusual thought content) 3.41 0.14 3.03 0.25 1.28 0.2 12.5%
    G10 (Disorientation) 1.43 0.08 1.94 0.15 −2.9 0.004 −26.3%
    G11 (Poor attention) 1.65 0.09 2.9 0.15 −4.42 <.0001 −43.1%
    G12 (Lack of judgment & insight) 4.84 0.11 3.15 0.19 7.35 <.0001 53.7%
    G13 (Disturbance of volition) 1.68 0.09 2.29 0.17 −3.08 0.002 −26.6%
    G14 (Poor impulse control) 1.09 0.05 1.89 0.09 −7.43 <.0001 −42.3%
    G15 (Preoccupation) 1.33 0.09 2.51 0.16 −6.36 <.0001 −47.0%
    G16 (Active social avoidance) 1.29 0.08 2.63 0.14 −7.96 <.0001 −51.0%

DISCUSSION

This study used PANSS data collected from people with schizophrenia in a sample of hospitalized patients from Beijing, China and in the inpatient subgroup of the CATIE study, a large clinical trial enrolling Americans with schizophrenia from multiple sites across the country. We assume from the PANSS ratings that the clinical presentation of patients was comparable with generally lower symptom levels for all domains in the Chinese sample, perhaps due to longer hospitalizations. To maximize validity of this assumption, we used multivariate analyses, adjusting for differences in age and gender.

Ratings from the Chinese sample were consistently and statistically significantly lower than in the American sample for total PANSS score, two of the three subscales, and most individual items. We ascribe these findings to general response differences that are inevitable with non-equivalent sampling strategies. If the Chinese were hospitalized longer and patients achieved clinical stability, their lower positive symptoms may represent residual symptoms, contributing to poor insight, social withdrawal, and poor rapport. A study of first-episode schizophrenic illness found that ethnic Han scored lower than Malay subjects across all PANSS subscales (Lim et al., 2011). It has been claimed that the doctor-patient relationship in China inherits Confucian notions of moral self-cultivation and harmonious relationships in which patients are expected to obey their families and physicians (Tsai, 2001). This may account for lower overall PANSS scores as well. Perhaps most obviously, the Chinese sample could have also demonstrated less severe clinical psychopathology than the American sample.

Of special interest are the two items in which the Chinese sample rated substantially (over 20–40%) higher than the US sample at a statistically significant level (poor rapport and lack of judgment and insight), and to a lesser extent the items that were significantly higher although with a smaller magnitude of difference (blunted affect and emotional withdrawal). From a purely clinical standpoint, it is unclear how to interpret these findings given that Chinese subjects were hospitalized longer: improved – not poor – rapport, judgment, and insight could be expected with longer inpatient stays as symptoms improve and patients received psychoeducation, but such differences were not found. We therefore hypothesize that cultural differences may have affected assessments of these items by the Chinese raters, influenced by distinctive interpersonal norms. These items represent behaviors, especially lack of conventional judgment and insight associated with schizophrenia that are perhaps less tolerated in Chinese culture and more highly scored by Chinese raters, and conversely relatively more tolerated and less highly scored by US raters. PANSS data from inpatients in Changsha, China compared to inpatients in the American CATIE sample showed similar trends of higher ratings for rater-based interpretations of patient behavior (e.g. insight, hostility, disturbance of volition and poor impulse control) in contrast to items scores based more exclusively on patient verbal response (Aggarwal et al., 2011), suggesting a potentially generalizable finding. Bioethics in China has focused on the “relational” nature of the Chinese self, defined, shaped, and situated interpersonally where dignity and humaneness unfold through proper interaction with others (Cong, 2004). Chinese raters may have perceived the behavior of study subjects to represent delinquency in their interpersonal responsibilities, resulting in higher scores for these PANSS items.

In contrast, Chinese raters evaluated symptoms of individual psychopathology lower than American raters on items such as hallucinatory behavior, grandiosity, difficulty in abstract thinking, stereotypy, somatic concerns, anxiety, guilt, tension, mannerisms, depression and disorientation. While, as noted above, there may be non-cultural explanations for these differences, American raters may have scored these symptoms higher given their cultural emphasis on individualism and the cultivation of higher faculties such as reason, abstraction, and volition that may be diminished in schizophrenia. Americans may regard hallucinations, false and idiosyncratic beliefs (delusions, grandiosity), neglect of personal hygiene (disorganization), and beliefs that others want to harm them (paranoia) as representing more severe deficiencies (Roe and Davidson, 2001; Sass, 2001). Differences in Chinese and American scores may therefore reflect cultural differences in norms for personal conduct and its abnormality in schizophrenia.

Differences between the Chinese and CATIE samples followed broadly similar trends when stratified by gender. This Chinese sample has been previously reported to include fewer women who have been previously reported to have lower PANSS scores (Zhang et al., 2009) as well as more positive and affective symptoms than men (Tang et al., 2007) in homogeneous ethnic Han samples. Chinese men and women both scored lower than their American counterparts on the total PANSS, positive subscale, and negative subscale scores and higher on the particular items for lack of judgment and insight and poor social rapport. Interestingly, Chinese women scored lower than American women around stereotyped thinking. “Excessive thinking” as a widespread Chinese idiom of distress seems to de-stigmatize paranoid ideation with related disrupted behavior among patients with schizophrenia (Yang et al., 2010). It is unclear whether Chinese patients exhibited less paranoia in this sample or if their symptoms were rated lower. If rated lower for cultural reasons, this idiom of distress appears to be utilized mostly for women, at least for this sample. Chinese women did not show statistically significant differences compared to American women on motor retardation, uncooperativeness, and unusual thought content. This suggests that either the clinical severity between both sets of women was comparable or that the PANSS was administered in the same way across both groups.

Several methodological limitations require comment. First, the Chinese sample comes from one city whereas the American sample collects patients from over 57 sites. Thus, our study represents findings from one Chinese region and may not represent findings throughout all of China, though they broadly conform to findings from the southern city of Changsha (Aggarwal et al., 2011). The training of PANSS raters is assumed to be uniform so findings from a small number of raters in China could reflect local practice. On the contrary, the large number of CATIE raters may have introduced significant discrepancies in the ways that the American sample was generally rated. Therefore, we cannot estimate the importance of this limitation. Second, the PANSS was administered at different times throughout hospitalization. Variant ratings could be attributed to the differential effect of various phases in the course of illness with higher PANSS scores presumably reflecting earlier intervals in the hospitalization before clinical stability. However, this is not likely to have affected the differential pattern of item responses. Our data do not allow for finer analyses to estimate confounding variables such as length of time in hospital, type and dose of any medication(s), and role of psychotherapeutic interventions such as individual, group, or family therapies in changing PANSS score, although these factors are also unlikely to have affected the differential pattern of item responses. Third, hospitalization criteria differ in China and the US, possibly leading to sample differences, although these may well be reflected in the generally lower PANSS scores in the Chinese samples. Fourth, the Chinese sample is comprised of ethnic Han patients whereas the American sample includes Caucasians, African-Americans, Native Americans, Asians, Pacific Islanders, and those with mixed race (Lieberman et al. 2005). Even though we compared the samples by nationality, we may have overlooked interracial variations by analyzing the American sample together.

Nevertheless, we believe that our results signal an innovative strategy toward quantitative-qualitative analysis of the cultural patterning of responses to symptoms measured through psychiatric instruments. Without laboratory or radiological biomarkers, psychiatry remains an interpretation of an interpretation – the first interpretation, a patient’s report of symptoms, and the second interpretation, a clinician’s assessment from a patient’s report of symptoms (Kleinman, 1996). Psychiatric instruments with accurate linguistic translations can standardize the process of the obtaining the first interpretation. Our data suggests that the second interpretation – here, the clinician’s assessment through rater scores – may be taken for granted within instrument translations when it is not at all clear that the process of rating has been calibrated. This preliminary study calls attention to the cultural shaping of rater application of structured instruments in psychiatric assessment and the opportunity to rethink the international standardization of raters, a focus for future research.

CONCLUSIONS

Several important conclusions can be drawn from this study. First, while instruments appear to be standardized for content, there is not yet a process for standardizing raters in cross-cultural contexts. The data in this paper suggest that this would be a vital area for further research. Second, while the rating of symptoms appears to be clinically influenced, there may also be cultural influences based on local interpretations of normal and abnormal behavior especially in the evaluation of judgment and insight.

Acknowledgments

This work was supported, in part, by the Yale-China Program, New Haven, CT, through the Chia Fellowship Program made possible by the Chia Family Foundation, which facilitated this research collaboration. The funding source did not have a role in study design; in the collection, analysis and interpretation of data; in the writing of the report; and in the decision to submit the paper for publication.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

CONFLICTS OF INTEREST

None of the authors have any conflicts to declare.

REFERENCES

  1. Abu-Lughod L, Lutz CA. Introduction: emotion, discourse, and the politics of everyday life. In: Lutz CA, Abu-Lughod L, editors. Language and the politics of emotion. 1st ed. Cambridge: Cambridge University Press; 1990. pp. 1–23. [Google Scholar]
  2. Aggarwal NK. Exploring identity, culture, and suffering with a Kashmiri Sikh refugee. Soc Sci Med. 2007;65:1654–1665. doi: 10.1016/j.socscimed.2007.06.002. [DOI] [PubMed] [Google Scholar]
  3. Aggarwal NK, Tao H, Xu K, Stefanovics E, Zhening L, Rosenheck RA. Comparing the PANSS in Chinese and American inpatients: Cross-cultural psychiatric analyses of instrument translation and implementation. Schizophr Res. 2011;132:146–152. doi: 10.1016/j.schres.2011.08.003. [DOI] [PubMed] [Google Scholar]
  4. Beasley CM, Jr, Hamilton SH, Crawford AM, Dellva MA, Tollefson GD, Tran PV, Blin O, Beuzen JN. Olanzapine versus haloperidol: acute phase results of the international double-blind olanzapine trial. Eur Neuropsychopharmacol. 1997;7:125–137. doi: 10.1016/s0924-977x(96)00392-6. [DOI] [PubMed] [Google Scholar]
  5. Cong Y. Doctor-family-patient relationship: the Chinese paradigm of informed consent. J Med Philos. 2004;29:149–178. doi: 10.1076/jmep.29.2.149.31506. [DOI] [PubMed] [Google Scholar]
  6. Costa e Silva JA, Alvarez N, Mazzotti G, Gattaz WF, Ospina J, Larach V, Starkstein S, Oliva D, Cousins L, Tohen M, Taylor CC, Wang J, Tran PV. Olanzapine as alternative therapy for patients with haloperidol-induced extrapyramidal symptoms: results of a multicenter, collaborative trial in Latin America. J Clin Psychopharmacol. 2001;21:375–381. doi: 10.1097/00004714-200108000-00004. [DOI] [PubMed] [Google Scholar]
  7. Cramer J, Rosenheck R, Xu W, Henderson W, Thomas J, Charney D Department of Veterans Affairs Cooperative Study Group on Clozapine in Refractory Schizophrenia. Detecting improvement in quality of life and symptomatology in schizophrenia. Schizophr Bull. 2001;27:227–234. doi: 10.1093/oxfordjournals.schbul.a006869. [DOI] [PubMed] [Google Scholar]
  8. Devereux G. Normal and abnormal. In: Littlewood R, Dein S, editors. Cultural psychiatry and medical anthropology: an introduction and reader. 1st ed. London: The Athlone Press; 2000. pp. 213–289. [Google Scholar]
  9. Ertan T, Eker E. Reliability, validity, and factor structure of the geriatric depression scale in Turkish elderly: are there different factor structures for different cultures? Int Psychogeriatr. 2000;12:163–172. doi: 10.1017/s1041610200006293. [DOI] [PubMed] [Google Scholar]
  10. Findling RL, Robb A, Nyilas M, Forbes RA, Jin N, Ivanova S, Marcus R, McQuade RD, Iwamoto T, Carson WH. A multiple-center, randomized, double-blind, placebo-controlled study of oral aripiprazole for treatment of adolescents with schizophrenia. Am J Psychiatry. 2008;165:1432–1441. doi: 10.1176/appi.ajp.2008.07061035. [DOI] [PubMed] [Google Scholar]
  11. Garcia E, Robert M, Peris F, Nakamura H, Sato N, Terazawa Y. The efficacy and safety of blonanserin compared with haloperidol in acute-phase schizophrenia: a randomized, double-blind, placebo-controlled, multicentre study. CNS Drugs. 2009;23:615–625. doi: 10.2165/00023210-200923070-00006. [DOI] [PubMed] [Google Scholar]
  12. Igarashi Y, Hayashi N, Yamashina M, Otsuka N, Kuroki N, Anzai N, Kazamatsuri H. Interrater reliability of the Japanese version of the Positive and Negative Syndrome Scale and the appraisal of its training effect. Psychiatry Clin Neurosci. 1998;52:467–470. doi: 10.1046/j.1440-1819.1998.00425.x. [DOI] [PubMed] [Google Scholar]
  13. Kane JM, Khanna S, Rajadhyaksha S, Giller E. Efficacy and tolerability of ziprasidone in patients with treatment-resistant schizophrenia. Int Clin Psychopharmacol. 2006;21:21–28. doi: 10.1097/01.yic.0000182114.65134.81. 2006. [DOI] [PubMed] [Google Scholar]
  14. Kasper S, Lerman MN, McQuade RD, Saha A, Carson WH, Ali M, Archibald D, Ingenito G, Marcus R, Pigott T. Efficacy and safety of aripiprazole vs. haloperidol for long-term maintenance treatment following acute relapse of schizophrenia. Int J Neuropsychopharmacol. 2003;6:325–337. doi: 10.1017/S1461145703003651. [DOI] [PubMed] [Google Scholar]
  15. Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. 1987;13:261–276. doi: 10.1093/schbul/13.2.261. [DOI] [PubMed] [Google Scholar]
  16. Kirmayer LJ. Cultural psychiatry in historical perspective. In: Bhugra D, Bhui K, editors. Textbook of cultural psychiatry. 1st ed. Cambridge: Cambridge University Press; 2007. pp. 3–19. 2007. [Google Scholar]
  17. Kleinman A. Rethinking psychiatry: from cultural category to personal experience. New York: The Free Press; 1988. [Google Scholar]
  18. Kleinman A. How is culture important for DSM-IV? In: Mezzich JE, Kleinman A, Fabrega H Jr, Parron DL, editors. Culture and psychiatric diagnosis: a DSM-IV perspective. (1st ed. Washington, D.C.: American Psychiatric Press; 1996. pp. 15–29. [Google Scholar]
  19. Lançon C, Reine G, Llorca PM, Auquier P. Validity and reliability of the French-language version of the Positive and Negative Syndrome Scale (PANSS) Acta Psychiatr Scand. 1999;100:237–243. doi: 10.1111/j.1600-0447.1999.tb10851.x. [DOI] [PubMed] [Google Scholar]
  20. Lasser RA, Bossie CA, Gharabawi GM, Turner M. Patients with schizophrenia previously stabilized on conventional depot antipsychotics experience significant clinical improvements following treatment with long-acting risperidone. Eur Psychiatry. 2004;19:219–225. doi: 10.1016/j.eurpsy.2003.11.007. [DOI] [PubMed] [Google Scholar]
  21. Lee CT, Conde BJ, Mazlan M, Visanuyothin T, Wang A, Wong MM, Walker DJ, Roychowdhury SM, Wang H, Tran PV. Switching to olanzapine from previous antipsychotics: a regional collaborative multicenter trial assessing 2 switching techniques in Asia Pacific. J Clin Psychiatry. 2002;63:569–576. doi: 10.4088/jcp.v63n0706. [DOI] [PubMed] [Google Scholar]
  22. Leucht S, Kane JM, Kissling W, Hamann J, Etschel E, Engel R. Clinical implications of Brief Psychiatric Rating Scale scores. Br J Psychiatry. 2005;187:366–371. doi: 10.1192/bjp.187.4.366. [DOI] [PubMed] [Google Scholar]
  23. Lieberman JA, Stroup TS, McEvoy JP, Swartz MS, Rosenheck RA, Perkins DO, Keefe RS, Davis SM, Davis CE, Lebowitz BD, Severe J, Hsiao JK Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) Investigators. Effectiveness of antipsychotic drugs in patients with chronic schizophrenia. N Engl J Med. 2005;353:1209–1223. doi: 10.1056/NEJMoa051688. [DOI] [PubMed] [Google Scholar]
  24. Lim CS, Subramaniam M, Poon LY, Chong SA, Verma S. Cross-ethnic differences in severity of symptomatology of individuals with first-episode schizophrenia spectrum disorder. Early Interv Psychiatry. 2011;5:242–248. doi: 10.1111/j.1751-7893.2011.00260.x. [DOI] [PubMed] [Google Scholar]
  25. Mechri A, Bourdel MC, Slama H, Gourion D, Gaha L, Krebs MO. Neurological soft signs in patients with schizophrenia and their unaffected siblings: frequency and correlates in two ethnic and socioeconomic distinct populations. Eur Arch Psychiatry Clin Neurosci. 2009;259:218–226. doi: 10.1007/s00406-008-0859-y. [DOI] [PubMed] [Google Scholar]
  26. Nilchaikovit T, Uneanong S, Kessawai D, Thomyangkoon P. The Thai version of the Positive and Negative Syndrome Scale (PANSS) for schizophrenia: criterion validity and interrater reliability. J Med Assoc Thai. 2000;83:646–651. [PubMed] [Google Scholar]
  27. Olié JP, Spina E, Murray S, Yang R. Ziprasidone and amisulpride effectively treat negative symptoms of schizophrenia: results of a 12-week, double-blind study. Int Clin Psychopharmacol. 2002;21:143–151. doi: 10.1097/01.yic.0000182121.59296.70. [DOI] [PubMed] [Google Scholar]
  28. Phillips MR, Xiong W, Wang RW, Gao YH, Wang XQ, Zhang NP. Reliability and validity of the Chinese versions of the Scales for Assessment of Positive and Negative Symptoms. Acta Psychiatr Scand. 1991;84:364–370. doi: 10.1111/j.1600-0447.1991.tb03161.x. [DOI] [PubMed] [Google Scholar]
  29. Roe D, Ravidson L. Self and narrative in schizophrenia: time to author a new story. Med Humanit. 2001;31:89–94. doi: 10.1136/jmh.2005.000214. [DOI] [PubMed] [Google Scholar]
  30. Rosenheck RA, Sernyak MJ. Developing a policy for second-generation antipsychotic drugs. Health Aff (MIlwood) 2009;28:w782–w793. doi: 10.1377/hlthaff.28.5.w782. [DOI] [PubMed] [Google Scholar]
  31. Ryckmans V, Kahn JP, Modell S, Werner C, McQuade RD, Kerselaers W, Lissens J, Sanchez R. Switching to aripiprazole in outpatients with schizophrenia experiencing insufficient efficacy and/or safety/tolerability issues with risperidone: a randomized, multicentre, open-label study. Pharmacopsychiatry. 2009;42:114–121. doi: 10.1055/s-0028-1112134. [DOI] [PubMed] [Google Scholar]
  32. Sass LA. Self and world in schizophrenia: three classic approaches. Philos Psychiatr Psychol. 2001;8:251–270. [Google Scholar]
  33. Tang YL, Gillespie CF, Epstein MP, Mao PX, Jiang F, Chen Q, Cai ZJ, Mitchell PB. Gender differences in 542 Chinese inpatients with schizophrenia. Schizophr Res. 2007;97:88–96. doi: 10.1016/j.schres.2007.05.025. [DOI] [PubMed] [Google Scholar]
  34. Tsai DF. How should doctors approach patients? A Confucian reflection on personhood. J Med Ethics. 2001;27:44–50. doi: 10.1136/jme.27.1.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Van Os J, Bossie CA, Lasser RA. Improvements in stable patients with psychotic disorders switched from oral conventional antipsychotics therapy to long-acting risperidone. Int Clin Psychopharmacol. 2004;19:229–232. doi: 10.1097/01.yic.0000122861.35081.16. [DOI] [PubMed] [Google Scholar]
  36. von Knorring L, Lindström E. The Swedish version of the Positive and Negative Syndrome Scale (PANSS) for schizophrenia. Construct validity and interrater reliability. Acta Psychiatr Scand. 1992;86:463–468. doi: 10.1111/j.1600-0447.1992.tb03298.x. [DOI] [PubMed] [Google Scholar]
  37. Yang LH, Phillips MR, Lo G, Chou Y, Zhang X, Hopper K. "Excessive thinking" as explanatory model for schizophrenia: impacts on stigma and "moral" status in Mainland China. Schizophr Bull. 2010;36:836–845. doi: 10.1093/schbul/sbn180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zhang XY, Chen da C, Qi LY, Wang F, Xiu MH, Chen S, Wu GY, Kosten TA, Kosten TR. Gender differences in the prevalence, risk and clinical correlates of tardive dyskinesia in Chinese schizophrenia. Psychopharmacology (Berl) 2009;205:647–654. doi: 10.1007/s00213-009-1590-8. [DOI] [PubMed] [Google Scholar]

RESOURCES