Skip to main content
Springer logoLink to Springer
letter
. 2024 Jun 5;275(5):1531–1532. doi: 10.1007/s00406-024-01834-8

How to assess response

Steffen Zitzmann 1,, Christoph Lindner 2
PMCID: PMC12270965  PMID: 38839710

The term response has been established as a key concept in psychiatry. It is an indispensable part of the jargon and frequently applied in practice and research. By a patient who showed a response, we mean someone whose symptoms improved to an extent that is practically important. Often, it is implied that this improvement has occurred in response to how the patient was treated, such as by taking a drug. In clinical research, however, the term response often refers to the improvement alone without a clear reference to the specific cause for this improvement. The reason is that we can hardly know how this patient would have improved if they had not been treated or had received a placebo.

To assess whether a patient has responded, their percentage symptom reduction from baseline has to be obtained first. This value can be computed as S2/S1-1·100\%, where S1 and S2 stand for the patient's symptoms at baseline and after the treatment, respectively [1]. Then, the patient can be classified into a category of improvement. These categories are based on cutoffs, for example, on steps of 25\% [2]. However, there is an issue with this procedure, which arises because interviews and questionnaires are often used. Such measures are blurred by noise called measurement error [3, 4]. The amount of error is typically expressed by the measure’s reliability, which is defined as the proportion of the variability of the measured score that can be attributed to true differences between patients [5].

To illustrate, consider the Positive and Negative Syndrome Scale (PANSS) [6]—a measure of symptom severity in schizophrenia. Recent research found that the internal consistency of the PANSS (i.e., an estimate of its reliability) ranged up to 0.94 [7]. Before we continue, note that to compute a patient’s percentage PANSS reduction from baseline, their total scores need to be corrected by subtracting 30 points [9, 10]. Now, suppose the patient got a corrected total score of S1=60 points at baseline and S2=40 after treatment. Using the equation for computing percentage symptom reduction from baseline, their symptoms seemingly decreased by 33.3\%. Thus, if measurement error was ignored, the patient would be classified into the 25 to 50\% improvement category, meaning the patient would be described as having improved. However, we may indicate uncertainty due to measurement error by a confidence interval. The interval can be computed as:

S2/S1-1·100\%±1.96·1-ρ·1+S22/S12·σ12/σ22·σ22/S12·100\%

where ρ is the reliability of the PANSS. σ1 and σ2 are the standard deviations at baseline and after treatment [1, 11]. Consider the patient whose symptoms reduced from 60 to 40 points. If, for example, the reliability of ρ=.94 is used in the calculation of the patient’s confidence interval, and the same standard deviation of σ1=σ2=10.93 is assumed at baseline and after treatment, we yield -33.3\%±10.5\%. Thus, we cannot be certain that their actual improvement was not 23\% instead of 33.3\%, for example. More importantly, this interval includes 25\%, indicating that this patient could fall into the 0 to 25\% category (i.e., no practically important improvement). In other words, it cannot be decided whether this patient belongs to the 0 to 25\% or 25 to 50\% category.

It should be noted that the confidence interval involves making assumptions about the measure, particularly regarding quality and distributional aspects. Thus, depending on the context, one may use different values for the reliability and the standard deviations in the calculation. To illustrate how they impact the patient’s confidence interval, see Fig. 1. Panel (a) shows that when the level of reliability ρ is increased, the interval will be narrowed. By contrast, increasing the standard deviation σ1=σ2 will increase the width [Panel (b)]. Finally, given unequal standard deviations, a greater ratio σ1/σ2 will lead to a decrease in width [Panel (c)].

Fig. 1.

Fig. 1

Dependence of the confidence interval on reliability ρ [Panel (a)], standard deviation σ1=σ2 [Panel (b)], and standard deviation ratio σ1/σ2 [Panel (c)]. SD standard deviation

To conclude, the patient whose confidence interval includes a cutoff illustrates the need for a refinement of the assessment of response. Interviews and questionnaires are not error-free measures. As psychiatrists, we should take this message serious and “embrace” the uncertainty. To decide whether a patient has responded, we should not focus solely on their percentage symptom reduction from baseline. In addition, a confidence interval should be placed around this value so that it can be better evaluated into which improvement category the patient falls. If the interval includes a cutoff, a definite categorization cannot be made, and further information has to be gathered.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Declarations

Conflict of interest

The authors declare no conflict of interest.

References

  • 1.Zitzmann S, Lindner C, Leucht C, Leucht S (2023) A potential issue with PANSS responder analysis. Schizophr Res 261:287–290. 10.1016/j.schres.2023.10.009 [DOI] [PubMed] [Google Scholar]
  • 2.Leucht S, Davis JM, Engel RR, Kissling W, Kane JM (2009) Definitions of response and remission in schizophrenia: recommendations for their use and their presentation. Acta Psychiatr Scand 119:7–14. 10.1111/j.1600-0447.2008.01308.x [DOI] [PubMed] [Google Scholar]
  • 3.Novick MR (1966) The axioms and principal results of classical test theory. J Math Psychol 3:1–18. 10.1016/0022-2496(66)90002-2 [Google Scholar]
  • 4.Zitzmann S (2023) A cautionary note regarding multilevel factor score estimates from lavaan. Psych 5:38–49. 10.3390/psych5010004 [Google Scholar]
  • 5.Norman G (2014) When i say … reliability. Med Educ 48:946–947. 10.1111/medu.12511 [DOI] [PubMed] [Google Scholar]
  • 6.Kay SR, Fiszbein A, Opler LA (1987) The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull 13:261–276. 10.1093/schbul/13.2.261 [DOI] [PubMed] [Google Scholar]
  • 7.Lin C-H, Lin H-S, Lin S-C, Kuo C-C, Wang F-C, Huang Y-H (2018) Early improvement in PANSS-30, PANSS-8, and PANSS-6 scores predicts ultimate response and remission during acute treatment of schizophrenia. Acta Psychiatr Scand 137:98–108. 10.1111/acps.12849 [DOI] [PubMed] [Google Scholar]
  • 8.Li L, Ma H, Wang X, Meng E (2021) Validation of Chinese version of positive and negative syndrome scale-6 in clinical setting: a preliminary study. Psychiatry Clin Psychopharmacol 31:386–391. 10.5152/pcp.2021.21060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Leucht S, Kissling W, Davis JM (2010) The PANSS should be rescaled. Schizophr Bull 36:461–462. 10.1093/schbul/sbq016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Obermeier M, Mayr A, Schennach-Wolff R, Seemüller F, Möller H-J, Riedel M (2010) Should the PANSS be rescaled? Schizophr Bull 36:455–460. 10.1093/schbul/sbp124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zitzmann S (2023) Einzelfallbezogene Veränderungsdiagnostik [Diagnostics of individual change]. In: Dohrenbusch R (ed) Psychologische Begutachtung. Springer, Wiesbaden, pp 1–9 [Google Scholar]

Articles from European Archives of Psychiatry and Clinical Neuroscience are provided here courtesy of Springer

RESOURCES