Skip to main content
Schizophrenia Bulletin logoLink to Schizophrenia Bulletin
. 2011 May 16;38(6):1200–1215. doi: 10.1093/schbul/sbr037

Early Recognition and Disease Prediction in the At-Risk Mental States for Psychosis Using Neurocognitive Pattern Classification

Nikolaos Koutsouleris 1,*, Christos Davatzikos 2, Ronald Bottlender 1, Katja Patschurek-Kliche 1, Johanna Scheuerecker 1, Petra Decker 1, Christian Gaser 3, Hans-Jürgen Möller 1, Eva M Meisenzahl 1
PMCID: PMC3494049  PMID: 21576280

Abstract

Background: Neuropsychological deficits predate overt psychosis and overlap with the impairments in the established disease. However, to date, no single neurocognitive measure has shown sufficient power for a prognostic test. Thus, it remains to be determined whether multivariate neurocognitive pattern classification could facilitate the diagnostic identification of different at-risk mental states (ARMS) for psychosis and the individualized prediction of illness transition.

Methods: First, classification of 30 healthy controls (HC) vs 48 ARMS individuals subgrouped into 20 “early,” 28 “late” ARMS subjects was performed based on a comprehensive neuropsychological test battery. Second, disease prediction was evaluated by categorizing the neurocognitive baseline data of those ARMS individuals with transition (n = 15) vs non transition (n = 20) vs HC after 4 years of follow-up. Generalizability of classification was estimated by repeated double cross-validation.

Results: The 3-group cross-validated classification accuracies in the first analysis were 94.2% (HC vs rest), 85.0% (early at-risk subjects vs rest), and, 91.4% (late at-risk subjects vs rest) and 90.8% (HC vs rest), 90.8% (converters vs rest), and 89.0% (nonconverters vs rest) in the second analysis. Patterns distinguishing the early or late ARMS from HC primarily involved the verbal learning/memory domains, while executive functioning and verbal IQ deficits were particularly characteristic of the late ARMS. Disease transition was mainly predicted by executive and verbal learning impairments.

Conclusions: Different ARMS and their clinical outcomes may be reliably identified on an individual basis by evaluating neurocognitive test batteries using multivariate pattern recognition. These patterns may have the potential to substantially improve the early recognition of psychosis.

Keywords: individualized psychosis prediction, multivariate analysis, neurocognitive test battery

Introduction

Neurocognitive deficits have been described as a core feature of schizophrenic psychosis affecting cognitive performance particularly within the domains of processing speed, verbal learning, and executive functioning.1 Some of these deficits seem to emerge early in the course of the disease and may be traced even in different at-risk mental states (ARMS) for the illness. In this context, the state of ultrahigh risk (UHR) for psychosis characterized by attenuated and/or brief limited intermittent psychotic symptoms may involve abnormalities across a range of cognitive abilities including speed of processing, working memory, sustained attention, cognitive flexibility, and verbal memory.27 This cross-domain neurocognitive pattern overlaps with the profound deficits observed in the established disease, albeit not reaching the level of impairment seen in the full clinical picture of the disorder. This latter observation suggests that task performance may either deteriorate within specific cognitive domains during transition to psychosis as shown by cross-sectional and longitudinal findings in different ARMS populations,79 or alternatively may result from a higher conversion rate in the UHR compared with lower risk states. Furthermore, neuropsychological investigations of an early, ie, initial ARMS as defined by subtle cognitive-perceptual “basic” symptoms10,11 found that these individuals already exhibited executive control impairments compared with healthy volunteers.9 Taken together, these data revealed patterns of neurocognitive deficits that predate the onset of the overt disease and hence may substantially improve the difficult clinical detection of initial prodromal stages. This could facilitate an earlier commencement of therapeutic approaches aimed at ameliorating signs and symptoms and even preventing the onset of psychosis.12

This promising perspective has increasingly moved into the research focus because the current risk assessment strategies have to date produced only modest 1-year prediction rates ranging from 9% to 54% by relying solely on operationalized psychopathological criteria.13 In this regard, previous investigations found that reduced baseline performance in verbal IQ, verbal/working memory, and olfactory identification tasks may distinguish subsequent converters from non converters in a clinically defined ARMS.3,14,15 Furthermore, Lencz et al5 and Riecher-Rössler et al16 consistently reported that the prediction of disease transition could be substantially improved to an accuracy of up to 80% by using linear regression models that combined both psychopathological and neurocognitive measures. In contrast to these results, a recent multicenter investigation of the North American Longitudinal Prodromal Study17 found that neuropsychological data did not provide additional predictive power beyond multivariate clinical prediction models.18 Moreover, it is still unclear whether a neurocognition-based prediction of psychosis will generalize to ARMS individuals not used for deriving the respective diagnostic models. This means that diagnostic validity has to be assessed either in independent test populations or estimated using rigorous cross-validation (CV) that separates the training from validation data throughout the model generation process. Thus, it has to be elucidated whether neurocognitive abnormalities in the ARMS reliably enhance the early detection of different prodromal stages and the prediction of subsequent disease manifestation to a level allowing for single-subject inferences.

A valid answer to this question has to address major methodological obstacles. First, the considerable intraindividual variability of single neuropsychological tests may blur diagnostically relevant information. This low signal-to-noise ratio of single test measures could be substantially increased by using multivariate approaches in order to extract high-dimensional discriminative patterns across comprehensive neuropsychological test batteries. However, it will be insufficient to detect discriminative patterns that perfectly describe specific diagnostic boundaries in a given training population. This overfitting problem may be resolved by using a machine-learning framework capable of detecting multivariate patterns of neuropsychological deficits that generalize well to unseen individuals.19 In this context and based on our own experiences with Magnetic Resonance Imaging (MRI)-based multivariate prediction models,2023 we employed the support-vector machine (SVM)24 in conjunction with ensemble learning methods25,26 in order to evaluate whether potentially complex patterns of cognitive ability derived from the combination of several neuropsychological tests may facilitate the individualized recognition of different ARMS and the prediction of frank psychosis.

Methods

Study Participants

Forty-eight individuals in an ARMS for psychosis and 30 healthy controls (HC) matched group-wise for age, gender, and premorbid verbal IQ (tables 3 and 4) were recruited at the Early Detection and Intervention Center for Mental Crises, Department of Psychiatry and Psychotherapy, Ludwig-Maximilian-University, Germany, for neuropsychological testing using operationalized criteria as detailed in9,22,2734 and table 1. These criteria were based on a 2-stage concept of the ARMS, distinguishing between (1) an early ARMS (ARMS-E), mainly defined by the presence of basic symptoms911 and associated with an increased, but not imminent risk of psychosis and (2) a “late” ARMS (ARMS-L), characterized by an ultrahigh risk for psychosis following the Personal Assessment of Crisis Evaluation (PACE) criteria.37,38 Candidate ARMS and HC individuals were carefully screened for the exclusion criteria listed in table 1 by evaluating the personal and familial history using a semistructured clinical interview and the structured clinical interview for Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, (DSM-IV).39 Additionally, ARMS individuals were rated using the Global Assessment of Functioning (GAF) Scale of the DSM-IV, the Positive and Negative Symptom Scale (PANSS),40 and the Montgomery-Åsberg Depression Rating Scale (MADRS).41 It is of note that 80%/0% of the ARMS/HC subjects analyzed in the present study overlapped with the cohort previously used for a MRI-based pattern recognition analysis.22 Moreover, an overlapping population was recently assessed for neuroanatomical correlates of executive dysfunction.42

Table 3.

Analysis of Sociodemographic, Clinical and Global Anatomical Variables

HC ARMS T2 P ARMS-E ARMS-L F2 P ARMS-NT ARMS-T F2 P
Sociodemographic Variables
N 30 48 20 28 20 15
 Age: mean (SD) (years) 26.0 (2.7) 24.7 (5.8) 1.41 .162 25.0 (5.6) 24.5 (5.9) 0.80 .456 25.8 (6.8) 22.8 (3.8) 2.88 .063
 Gender: male/female (%) 18/12 (60/40) 32/16 (66.7/33.3) 0.36 .630 13/7 (65/35) 19/9 (67.9/32.1) 0.40 .834 14/6 (70/30) 11/4 (73.3/26.7) 1.00 .662
 School education: mean (SD) (years) 12.4 (1.2) 12.0 (1.2) 1.72 .090 12.3 (1.0) 11.8 (1.2) 2.55 .085 11.9 (1.2) 11.9 (1.1) 1.69 .192
 Number (%) of first-degree relatives with schizophrenic psychoses 6 (12.5) 4 (20) 2 (7.1) 1.64 .379 4 (20) 1 (7.1) 1.19 .379
 Number (%) of first-degree relatives with affective psychoses 9 (18.8) 3 (15) 6 (22.2) 0.39 .713 4 (20) 4 (28.6) 0.34 .689
HC ARMS T2 P ARMS-E ARMS-L F P ARMS-NT ARMS-T F P
Clinical Variables: mean (SD)
    GAF score 58.6 (10.8) 60.1 (8.4) 57.3 (12.5) 0.76 .451 58.6 (10.9) 59.9 (14.2) −0.24 .814
    PANSS total score 62.6 (18.7) 59.9 (14.5) 64.9 (21.6) −0.81 .422 52.9 (11.8) 65.5 (20.5) −1.96 .062
    PANSS positive score 12.8 (4.5) 10.8 (3.5) 14.5 (4.7) −2.80 .008* 11.4 (4.2) 13.5 (4.2) −1.20 .242
    PANSS negative score 16.5 (7.7) 15.9 (6.7) 17.0 (8.6) −0.42 .676 12.1 (4.9) 19.4 (8.4) −2.78 .011*
    PANSS general score 33.3 (9.5) 33.2 (7.8) 33.4 (11.0) −0.61 .952 29.5 (6.1) 32.6 (11.3) −0.74 .477
    MADRS score 17.5 (9.4) 19.7 (7.8) 15.8 (10.5) 1.18 .247 17.7 (5.4) 11.1 (9.8) 2.02 .057

Note: ARMS, At-Risk Mental State for psychosis; ARMS-E, early ARMS subgroup; ARMS-L, late ARMS subgroup; ARMS-NT, nontransition subgroup; ARMS-T, transition subgroup; GAF, Global Assessment of Functioning; HC, Healthy Control subjects; PANSS, Positive and Negative Symptom Scale; F, main effect's F value; T Student's t-test value; χ2, Pearson χ2 value; MADRS, Montgomery-Åsberg Depression Rating Scale. Schooling years, clinical, and global anatomical variables were assessed using ANCOVA designs, with group entered as main effect and age and gender defined as covariates of no interest. All P values are two-sided and exact in case of nonparametric tests.

*Significant at P<0.05.

Table 4.

Statistical Analysis of Between-Group Differences in the 12 Neurocognitive Test Measures

HC vs ARMS: t-test HC vs ARMS-E vs ARMS-L: ANOVA & Post-Hoc Analyses HC vs ARMS-NT vs ARMS-T: ANOVA & Post-Hoc Analyses
HC ARMS ARMS-E ARMS-L ARMS-NT ARMS-T
Raw Raw Z F P Raw Z Raw Z F P HC vs ARMS-E HC vs ARMS-L ARMS-E vs ARMS-L Raw Z Raw Z F P HC vs ARMS-NT HC vs ARMS-T ARMS-NT vs ARMS-T
MWT-B (IQ) 109.7 (8.3) 106.7 (14.3) −0.2 (1.8) 0.4 .539 107.3 (14.3) −0.2 (1.6) 106.3 (14.6) −0.3 (1.0) 0.2 .819 108.9 (13.8) −0.1 (1.7) 107 (16.8) 0.0 (1.9) 0.0 .977
DST 67.8 (10.5) 60 (12.5) −0.8 (1.3) 7.7 .007 64 (13.8) −0.4 (1.0) 57.2 (10.8) −1.0 (1.1) 6.1 .003* 0.852 0.003 0.133 59.5 (14.3) −0.8 (1.4) 57.7 (8.8) −0.9 (0.8) 4.9 .010
DS 17.4 (3.6) 17 (4.2) −0.1 (1.1) 0.1 .771 17.4 (4.5) 0.0 (2.8) 16.8 (4.1) −0.1 (2.3) 0.2 .845 18.2 (3.7) 0.2 (0.9) 16.8 (4.7) −0.1 (1.2) 0.4 .668
LNS 17.8 (1.3) 16.8 (3.8) −0.7 (2.8) 1.7 .194 17.3 (3.8) −0.4 (1.3) 16.4 (3.8) −0.9 (1.0) 1.2 .303 17.2 (4.9) −0.5 (3.5) 16.9 (2.4) −0.4 (1.7) 0.3 .728
TMT-A 24.8 (5.6) 28.3 (8.4) −0.5 (1.4) 3.0 .089 25.9 (7.5) −0.1 (1.0) 30.1 (8.7) −0.8 (1.5) 3.5 .036 26.1 (8.9) −0.2 (1.3) 29.4 (7.1) −0.5 (1.3) 0.9 .412
TMT-B 48 (13.3) 69.2 (25.3) −1.3 (1.6) 15.3 .000* 63.7 (26.2) −0.9 (1.4) 73.1 (24.5) −1.5 (1.3) 8.8 .000* 0.066 0.000 0.484 68.7 (29.5) −1.3 (1.8) 76.5 (19.6) −1.5 (1.1) 9.0 .000* 0.003 0.002 1.00
TMT-(B−A) 23.2 (12.4) 40.9 (21.7) −1.2 (1.5) 14.0 .000* 37.9 (22.3) −1.0 (1.3) 43 (21.4) −1.3 (1.0) 7.3 .001* 0.042 0.001 1.00 42.6 (23) −1.4 (1.6) 47.2 (19.5) −1.4 (1.3) 9.5 .000* 0.001 0.003 1.00
SOPT 1.5 (1.3) 4.7 (3.3) −2.2 (2.2) 25.6 .000* 5.3 (4.2) −2.6 (1.8) 4.3 (2.5) −1.9 (1.0) 13.8 .000* 0.000 0.001 0.552 5.2 (3.6) −2.4 (2.4) 4.7 (2.5) −2.1 (1.6) 15.4 .000* 0.000 0.010 1.00
RAVLT-IR 64 (5.6) 55.3 (10.8) −1.3 (1.7) 14.8 .000* 57.2 (12.2) −1.0 (0.0) 53.9 (9.8) −1.5 (0.0) 8.1 .001* 0.052 0.001 0.771 55.1 (13.3) −1.4 (2.0) 54.5 (9.4) −1.3 (1.5) 6.8 .002* 0.006 0.019 1.00
RAVLT-DR 14.1 (1.2) 11.8 (3.3) −1.6 (2.5) 11.1 .001* 12.5 (3.1) −1.1 (12) 11.3 (3.4) −1.9 (12) 6.6 .002* 0.191 0.002 0.511 12.2 (3.8) 0.4 (2.6) 11.1 (3.5) 1.5 (2.5) 5.4 .007
RAVLT-Ret 0.3 (0.8) 1.2 (1.7) −0.9 (2.0) 5.4 .023 1.0 (1.3) −0.8 (1.2) 1.3 (1.9) −1.0 (1.2) 2.8 .066 0.6 (1.3) −0.2 (1.6) 1.6 (2.0) −1.3 (2.3) 3.8 .028
VF 35.1 (6.3) 33.3 (10.2) −0.2 (1.7) 0.2 .623 34.1 (11.5) −0.1 (0.0) 32.7 (9.4) −0.2 (0.0) 0.2 .823 32.3 (9.9) −0.4 (1.6) 33.4 (11.5) 0.0 (1.7) 0.7 .516

Note: Neurocognitive test scores were adjusted for the effects of age and gender and standardized according to respective means and SDs of the HC data. For each neurocognitive test variable, statistical comparisons were conducted to evaluate group-level differences between HC vs ARMS (t-test) as well as HC vs ARMS-E vs ARMS-L and HC vs ARMS-NT vs ARMS-T (ANOVA). The Holm-Bonferroni correction was employed to correct the P values for multiple comparisons and significant between-group differences were flagged with an asterisk. In these cases, a Bonferroni post-hoc analysis was carried out to determine the significance of pairwise group differences. Abbreviations of neuropsychological test variables are detailed in table 2.

*Significant at P<0.05.

Table 1.

Inclusion Criteria for the ARMS Subjects/Exclusion Criteria

ARMS-E: ARMS subjects without Attenuated Psychotic Symptoms (APS) and/or Brief Limited Intermittent Psychotic Symptoms (BLIPS) …
    (1) … having one or more of the following basic symptoms appeared first at least 12 mo prior to study inclusion and several times per week during the last 3 mo.
        Thought interferences
        Thought perseveration
        Thought pressure
        Thought blockages
        Disturbances of receptive language, either heard or read
        Decreased ability to discriminate between ideas and perception, fantasy and true memories
        Unstable ideas of reference (subject-centrism)
        Derealization
        Visual perception disturbances
        Acoustic perception disturbances
and/or
 (2) … showing a reduction in the Global Assessment of Functioning Score (DSM-IV) of at least 30 points (within the past year) combined with at least one of the following trait markers:
        First-degree relative with a lifetime-diagnosis of schizophrenia or a schizophrenia spectrum disorder
        Pre- or perinatal complications
ARMS-L: ARMS subjects with or without basic symptoms, with or without global functioning & trait markers …
 (1) … having at least one of the following APS within the last 3 mo, appearing several times per week for a period of at least 1 wk:
        Ideas of reference
        Odd beliefs or magical thinking
        Unusual perceptual experiences
        Odd thinking and speech
        Suspiciousness or paranoid ideation
and/or
 (2) … having at least one of the following BLIPS, defined as the appearance of one of the following psychotic symptoms for less than 1 wk (interval between episodes at least 1 wk), resolving spontaneously:
        Hallucinations
        Delusions
        Formal thought disorder
        Gross disorganized or catatonic behavior
Exclusion criteria
 Disease transition as defined by Yung et al35
    A past or present diagnosis of schizophrenia spectrum and bipolar disorders, as well as delirium, dementia, amnestic, or other cognitive disorders, mental retardation, and psychiatric disorders due to a somatic factor, following the DSM-IV criteria
    Developmental and personality disorders, following the DSM-IV criteria
    Alcohol or drug abuse within 3 mo prior to examination, following the DSM-IV criteria
    A past or present inflammatory, traumatic or epileptic diseases of the central nervous system
    Any previous treatment with antipsychotics prior to neurocognitive assessment
    Healthy controls: positive familial history of schizophrenic or affective psychoses in the first-degree relatives

Note: ARMS, At-Risk Mental State for psychosis; ARMS-E, early ARMS subgroup; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition.

Adopted From Häfner et al.36

Included ARMS individuals were seen weekly in the first month, monthly in the first year, quarterly in the second year, and thereafter annually to detect possible transitions to psychosis according to the criteria of Yung et al35: PANSS scores of 4 or above on the hallucination item (P3) or scores of 5 or above on the unusual thought content (G9), suspiciousness (P6), or conceptual disorganization (P2) items. Symptoms had to occur daily and persist for more than 1 week to be deemed a transition to frank psychosis. ARMS individuals were assigned to the transition group (ARMS-T) if they met these criteria once during the follow-up period of 4 years and had a diagnosis of schizophrenia spectrum disorder 1 year after transition following the International Classification of Diseases-10 research criteria. Follow-up information could be obtained from 35 subjects 3.9 (SD: 1.2) years after study inclusion, including 15 converters (ARMS-T: n = 11, schizophrenia; 4, schizoaffective psychosis; average (minimum–maximum) time to transition: 210.7 (40–777) days) and 20 nonconverters (ARMS-NT: n = 14, no diagnosis; 3, major depression; 1, bipolar disorder; 1, adjustment disorder; 1, conversion disorder). The conversion group consisted of 14 ARMS-L and 1 ARMS-E individual (transition rates: 66.7% of followed ARMS-L, 7.1% of followed ARMS-E cases). No ARMS-NT subject met the transition criteria during the study period. None of the ARMS individuals had received antipsychotic medication prior to neuropsychological testing. No follow-up was available in 13 subjects (27.3%), of whom 6 could not be contacted or refused to participate, and 7 had not completed the follow-up interval. We did not find significant sociodemographic or neurocognitive baseline differences between ARMS individuals with and without available clinical follow-up information (see table 2 in online supplementary material). All subjects provided their written informed consent before study inclusion. The study was approved by the Local Research Ethics Committee of the Ludwig-Maximilian-University.

Neurocognitive Testing

A cross-domain neuropsychological test battery comprising 9 standardized tests was administered to all subjects by trained master-level neuropsychologists (K.K., J.S., and P.D.) to assess premorbid verbal IQ, processing speed, working memory, verbal, and visual memory as well as executive functions (table 2). These tests have been previously employed to study neurocognitive alterations in the ARMS for psychosis.9,50 From the acquired neurocognitive data, 12 test variables were computed (table 2) and adjusted for the effects of age and gender using partial correlations. The adjusted scores were z-transformed based on the respective HC data and entered analyses of variance (ANOVAs) that assessed between-group differences in each neurocognitive measure for (1) HC vs ARMS, (2) HC vs ARMS-E vs ARMS-L, and (3) HC vs ARMS-NT vs ARMS-T. Adjustment for multiple comparisons was performed across the 12 neurocognitive measures using Holm's sequential method.50,51 Significance was defined at P < .05, family-wise error corrected. Significant between-group effects were examined for pairwise differences using post-hoc Bonferroni tests.

Table 2.

Neuropsychological Test Battery

Cognitive Domain Variables
Premorbid verbal IQ
 Mehrfach-Wortschatztest B (MWT-B)43 1. Raw score correct
Processing speed
 Trail-Making Test, part A (TMT-A)44 2. Time to completion (s)
 Digit Symbol Test (DST, WAIS-III)45 3. Raw score correct
Working Memory
 Digit Span Test (DS, WAIS-III)45 4. Raw score correct
 Letter Number Span Test (LNS)46 5. Raw score correct
 Subject-Ordered Pointing Task (SOPT)47 6. Error score
Verbal Learning and Memory
 Rey Auditory Verbal Learning Test (RAVLT)48 7. Sum of raw score correct after trials 1–5 (RAVLT-IR)
8. Raw score correct after delayed recall (RAVLT-DR)
9. Retention: difference between raw score correct in trial 5 and delayed recall (RAVLT-Ret)
Executive Functions
 Trail-Making Test, part B (TMT-B)44 10. Time to completion (s)
11. Difference between TMT-B and TMT-A (TMT-[B−A])
 Verbal Fluency (letters) (VF)49 12. Sum of correct responses

Note: Cognitive domains were defined according to Schultze-Lutter et al.50

Additionally, associations between the ARMS individuals’ neurocognitive test measures and their clinical ratings (GAF, MADRS, and PANSS) were explored in a supplementary analysis (see table 1 in online supplementary material). Again, the exploratory P values of this analysis were corrected for multiple comparisons using Holm's sequential method51 (significance level at P < .05).

Neurocognitive Pattern Classification

SVM are multivariate statistical methods that have been successfully employed as biomedical diagnostic tools because of their primary strength, which is providing optimal methods for classifying single individuals, rather than simply describing statistical group differences (see also the methodological comparison of different classification algorithms in table 5 of the online supplementary material). In our case, the 12 different neuropsychological test scores were used by the SVM to determine the best nonlinear classification model that reliably predicted the study participants’ group membership in 3 different analyses: (1) HC vs ARMS, (2) HC vs ARMS-E vs ARMS-L, and (3) HC vs ARMS-NT vs ARMS-T. As customary in predictive analytics, the SVM model is constructed from one set of subjects (the training sample) and applied to a different set of subjects (the test sample), using CV (see Methods in online supplementary material). This process produces an unbiased estimate of the expected diagnostic and prognostic accuracy of the SVM model on new individuals, rather than merely fitting the current patient group. The principles of generating and validating predictive SVM models on separate training and testing samples of a study population have been detailed in our previous work.22

In summary, the neurocognitive data of the training sample were first adjusted for age and gender effects using partial correlations. The residuals were projected to a high-dimensional feature space using the radial basis functions in order to account for nonlinear relations between the neurocognitive data and the group membership of the training subjects. In this feature space, the SVM algorithm found the optimal between-group boundary by maximizing the geometric distance between the most similar subjects of opposite groups (the support vectors [SV]).24,52,53 It has been shown that this “maximum-margin” principle in conjunction with the nonlinear projection generates classification rules that are adaptive to subtle between-group differences and therefore generalize well to new individuals.24

The group membership of unseen test subjects was predicted by first applying the adjustment and nonlinear transformation parameters of the training sample to the test sample. Then, the trained binary SVM models (eg,: HC vs ARMS-E) determined the geometric positions of the test subjects relative to the “learned” decision boundary, resulting in a decision value and a group membership prediction for each test subject. In the 3-group analyses 1 and 2, we used these decision values to construct multigroup classifiers, where the binary SVM model with the maximum decision value decided about the test subject's group membership.

This SVM training and validation scheme was wrapped in a repeated nested CV analysis (see Methods in online supplementary material).23,54 On the outer loop of this analysis, we performed 50 repetitions of the following CV cycle. First, the order of the subjects was permuted within each group and then the entire population was split into ten nonoverlapping samples. Each of these samples was iteratively held back as validation data, while the 9 remaining samples entered the inner CV loop. This outer CV loop produced a robust and unbiased estimate of classification generalizability based on a large number of validation samples that were strictly separated from the entire training process performed on the inner loop. At this inner loop, we used 10-fold CV with 10 repetitions to generate ensembles of SVM models. More specifically, for each validation sample at the outer CV level, 100 different training data partitions were created at the inner CV level. In each of these 100 training partitions, the most discriminative sets of neurocognitive features were determined. Each of these sets was used to train a separate SVM model. Then, each of these models predicted the group membership of the unseen validation subjects on the outer loop. These predictions were averaged across all 100 training partitions to yield an ensemble decision. Finally, for each validation subject, all SVM ensemble decisions were aggregated across those outer training partitions, in which this subject had not been involved in the training process. Majority voting was used to determine the final out-of-training group membership for each validation subject (tables 5 and 6).

Table 5.

Two-group classification performance: The performance of the binary SVM ensemble classifiers (group “+1” vs group “−1”) was evaluated (1) by constructing a binary SVM ensemble from all SVM base learners of a CV1 partition, in which the respective CV2 test subjects had not been included, (2) by computing the average decision value in each of these binary CV1 ensembles in order to determine the group membership (average decision value > 0 or < 0) of the respective CV2 test subjects and (3) through majority voting across those binary CV1 SVM ensembles, in which the CV2 test subjects had not participated in the training process (see also the Methods section for a detailed explanation of the employed ensemble learning framework).

Binary classifiers TP TN FP FN Sensitivity (%) Specificity (%) BAC (%) FPR (%) PPV (%) NPV (%)
Neurocognitive SVM analysis: 30 HC vs 48 ARMS
    HC vs ARMS 24 46 2 6 95.8 80.0 87.9 4.2 92.3 88.5
Neurocognitive SVM analysis: 30 HC vs 20 ARMS-E vs 28 ARMS-L
    HC vs ARMS-E 30 18 2 0 90.0 100 95.0 10.0 93.8 100
    HC vs ARMS-L 26 25 3 4 89.3 86.7 88.0 10.7 89.7 86.2
    ARMS-E vs ARMS-L 10 28 0 10 100 50.0 75.0 0.0 100 73.7
Neurocognitive SVM analysis: 30 HC vs 20 ARMS-NT vs 15 ARMS-NT
    HC vs ARMS-NT 28 16 4 2 80.0 93.3 86.7 20.0 87.5 88.9
    HC vs ARMS-T 30 13 2 0 86.7 100 93.3 13.3 93.8 100
    ARMS-NT vs ARMS-T 15 12 3 5 80.0 75.0 77.5 20.0 83.3 70.6

Note: ARMS, At-Risk Mental State for psychosis; ARMS-E, early ARMS subgroup; ARMS-L, late ARMS subgroup; ARMS-NT, nontransition subgroup; ARMS-T, transition subgroup; HC, Healthy Control subjects, SVM, support-vector machine, CV, cross-validation.

Balanced accuracy (BAC), false positive rate (FPR), positive predictive value (PPV) and negative predictive value (NPV) were calculated from the confusion matrix containing the number of true positives (TP), false negatives (FN), true negatives (TN), and false positives (FP).

Table 6.

Three-Group Classification Performance: The multigroup SVM ensemble classifier was constructed by (1) generating multigroup SVM base learners though pairwise coupling the average binary CV1 ensembles’ decision values on the CV2 test data and (2) aggregating the multigroup base learners across all CV1 partitions, in which the CV2 test data had not been part of the training samples (out-of-training prediction; OOT).

HC vs ARMS-E vs ARMS-L HC vs ARMS-NT vs ARMS-T
Clinical groups SVM predicted classes Clinical groups SVM predicted classes
HC ARMS-E ARMS-L HC ARMS-NT ARMS-T
HC 29 0 1 HC 29 1 0
ARMS-E 2 14 4 ARMS-NT 3 16 1
ARMS-L 2 0 26 ARMS-T 2 1 12
OOT-performance OOT-performance
    TP 29 14 26 TP 29 16 12
    TN 44 58 45 TN 30 43 49
    FP 4 0 5 FP 5 2 1
    FN 1 6 2 FN 1 4 3
    Sensitivity (%) 96.7 70.0 92.6 Sensitivity (%) 96.7 80.0 80.0
    Specificity (%) 91.7 100.0 90.0 Specificity (%) 85.7 95.6 98.0
    Balanced accuracy (%) 94.2 85.0 91.4 Balanced accuracy (%) 90.8 90.8 89.0
    False positive rate (%) 8.3 0.0 10.0 False positive rate (%) 14.3 4.4 2.0
    Positive predicitive value (%) 87.9 100.0 83.9 Positive predicitive value (%) 85.3 88.9 92.3
    Negative predictive value (%) 97.8 90.6 95.7 Negative predictive value (%) 96.8 91.5 94.2
Overall accuracy: 88.5% Overall accuracy: 87.7%

Note: ARMS, At-Risk Mental State for psychosis; ARMS-E, early ARMS subgroup; ARMS-L, late ARMS subgroup; ARMS-NT, nontransition subgroup; ARMS-T, transition subgroup; HC, Healthy Control subjects; TP, true positives; FN, false negatives; TN, true negatives; FP, false positives; SVM, support-vector machine; CV, cross-validation.

The class membership of an outer cross-validation (CV2) test subject was determined (1) by the binary classifier with the maximum decision value among all binary SVMs within the respective multigroup base learner and (2) through majority voting across all multigroup base learners (see also the Methods section for a detailed explanation of the employed ensemble learning framework). The out-of-training classification performance of the multigroup SVM ensemble was then evaluated for one group against all other groups. For example, in the HC vs ARMS-E vs ARMS-L analysis 29 HC subjects of 30 (sensitivity: 96.7%) were correctly assigned to their group, while 44 of 48 (91.7%) ARMS subjects were correctly not labeled as HC, resulting in a balanced accuracy of (96.7% + 91.7%)/2 = 94.2%. The overall accuracy of the multigroup SVM ensemble was the proportion of all unseen CV2 test subjects correctly classified by the SVM classifier ensembles.

This ensemble learning approach has proven to achieve robust classification because it greatly reduces the risk of unfortunate selections of poorly performing single classifiers by averaging the diagnostic decisions of numerous predictive models. Furthermore, this approach improves classification generalizability particularly in small samples because it increases the classifiers’ ability to detect complex decision boundaries by means of training sample variation.25

The performance of these binary and multigroup ensemble classifiers on the unseen validation data was measured in terms of sensitivity, specificity, balanced accuracy (BAC), positive/negative predictive value (PPV/NPV), and false positive rate. The discriminative neurocognitive patterns that formed the basis for the classifiers’ diagnostic decisions were first evaluated by computing each neurocognitive measure's probability of being selected as discriminative feature across each classification analysis (figure 2). Second, the nonlinear discriminative neurocognitive patterns were approximated by computing for each neurocognitive measure, the difference between the respective model's SV.22 Then, the mean and SE of these SV differences was computed across all SVM models to form discriminative neurocognitive profiles (figure 3). Neurocognitive features with zero-crossing SEs were considered unreliable at the 95% confidence level. Third, we quantified the importance of single neurocognitive variables in the ARMS-NT vs ARMS-T analysis by removing one neurocognitive measure at a time and repeating the entire classification experiment (see figure 1 and table 3 in online supplementary material).

Fig. 2.

Fig. 2.

Neurocognitive feature selection probabilities. In each inner cross-validation (CV1) training sample, the set of neurocognitive features used by an support-vector machine (SVM) ensemble to categorize between the study groups was optimized by means of recursive classifier elimination (ensemble thinning, see Methods in online supplementary material). This procedure removed SVM classifiers containing irrelevant/redundant neurocognitive variables from the respective SVM ensembles. Thus, we were able to compute the feature selection probability of each neurocognitive variable as the ratio between the number of SVM models that used the respective variable as discriminative feature and the total number of SVM models (see table 6 in online supplementary material) across all CV1 training data partitions in each of the following classification analyses: A: healthy controls (HC) vs at-risk mental states (ARMS), B: HC vs ARMS-E vs ARMS-L, C: HC vs ARMS-NT vs ARMS-T.

Fig. 3.

Fig. 3.

Discriminative neurocognitive profiles of the three classification experiments. The support-vector machine (SVM) results of our study were obtained based on complex and subtle patterns of neurocognitive between-group differences that are difficult to visualize due to the nonlinearity of the classification method. Therefore, these nonlinear discriminative neurocognitive patterns were approximated (1) by computing the difference vector between the scaled (0,1) and adjusted (age, gender) neurocognitive features of all nearest-neighbor support-vector pairs that constituted the optimal separating decision boundary of a SVM model, trained on a given inner cross-validation (CV1) training sample and (2) by calculating the arithmetic mean and SE of the mean for these difference vectors obtained across all CV1 training samples in each of three classification experiments: A: healthy controls (HC) vs at-risk mental states (ARMS), B: HC vs ARMS-E vs ARMS-L, C: HC vs ARMS-NT vs ARMS-T. Zero-crossings of SEs indicate unreliable between-group differences.

Fig. 1.

Fig. 1.

Out-of-training prediction probabilities in the three classification analyses. A: HC vs at-risk mental states (ARMS) analysis, B: Binary prediction probabilities in the healthy controls (HC) vs ARMS-E vs ARMS-L analysis (left) and HC vs ARMS-NT vs ARMS-T analysis (right), C: Multigroup prediction probabilities in the HC vs ARMS-E vs ARMS-L analysis (left) and HC vs ARMS-NT vs ARMS-T analysis (right).

Results

Age, gender, education, and premorbid verbal IQ (tables 3 and 4) did not differ significantly in ARMS vs HC, HC vs ARMS-E vs ARMS-L, and HC vs ARMS-NT vs ARMS-T. A trend difference was observed in the mean age of the HC, ARMS-E, and ARMS-L groups (table 3). Thus, in order to minimize age- and also possible gender-related effects, we adjusted the data for these covariates in all classification analyses using partial correlations. Furthermore, we studied the impact of using unadjusted data in the ARMS-NT vs ARMS-T analysis (see table 4 in online supplementary material).

The ARMS samples did not differ with respect to the prevalence of schizophrenic or affective psychosis in the first-degree relatives. Global functioning was similarly reduced in the ARMS-E and ARMS-L, ARMS-T and ARMS-NT groups. A significantly higher PANSS positive score was observed in ARMS-L vs ARMS-E individuals. The ARMS-T group was characterized by a significantly higher PANSS negative score, a trend toward a higher PANSS total score and a lower MADRS score compared with ARMS-NT. Correlations between PANSS total and negative scores and rey auditory verbal learning test (RAVLT)-DR and RAVLT-Ret measures were detected in our supplementary analysis using an exploratory threshold of P < .05 (See table 1 in online supplementary material). However, these correlations did not survive correction for multiple comparisons.

Neurocognitive Analysis of HC vs ARMS Individuals

Univariate Results.

After correction for multiple comparisons, cognitive set shifting (trail-making test [TMT]-B, TMT-[B−A]), visual working memory (subject-ordered pointing task [SOPT]), and verbal learning/memory (RAVLT-IR, RAVLT-DR) was significantly impaired in ARMS vs HC subjects (table 4).

SVM Classification Analysis.

The binary HC vs ARMS classification performance in the CV2 test data was high (BAC = 87.9%, sensitivity = 95.8%, specificity = 80%, PPV = 92.3%, and NPV = 88.5%; table 5, figure 1A). Figures 2A and 3A show that the highest feature selection probabilities and reliable group differences were observed for premorbid verbal IQ (Mehrfach-Wortschatztest [MWT]-B), cognitive set shifting (TMT-B, TMT-[B−A]), visual working memory (SOPT), and verbal learning (RAVLT-IR, RAVLT-DR).

Neurocognitive Analysis of HC vs ARMS-E vs ARMS-L Individuals

Univariate Results.

The performance reductions detected in the entire ARMS group vs HC were also found in the HC vs ARMS-E vs ARMS-L analysis, with the digit symbol test (DST) being additionally involved (table 4). The post-hoc analysis revealed that the ARMS-L group showed significant performance deficits across these neurocognitive measures, ranging 1–2 SD below the HC group. Significant performance reductions in ARMS-E vs HC were confined to the TMT-(B−A) and the SOPT. No significant group-level differences were observed in ARMS-E vs ARMS-L.

SVM Classification Analysis.

The 3-group classification accuracy was 88.5% (table 6; figure 1C, left). Only one HC was mislabeled as ARMS-L (HC vs rest: sensitivity = 96.7%, specificity = 91.7%, BAC = 94.2%). Two ARMS-E were mislabeled as HC and 4 as ARMS-L (ARMS-E vs rest: 70.0%, 100%, and 85.0%). Two ARMS-L were misclassified as HC (ARMS-L vs rest: 92.6%, 90%, and 91.4%). Among the binary classifiers, the highest BAC (sensitivity, specificity) of 95% (90% and 100%) was observed in HC vs ARMS-E, followed by 88% (89.3% and 86.7%) in HC vs ARMS-L and 75.0% (100% and 50%) in ARMS-E vs ARMS-L (table 5; figure 1B, left).

The discriminative pattern of HC vs ARMS-E (figure 2B) showed high selection probabilities in the working memory (Letter number span [LNS], SOPT: >90% of all models) and verbal learning/memory domain (RAVLT-IR, RAVLT-DR, RAVLT-Ret: 70–80%). The between-group SV differences of these neurocognitive variables (except for the LNS) were also reliable at the 95% confidence level (figure 3B). In contrast, the HC vs ARMS-L classification particularly involved the trail-making measures, as well as premorbid verbal IQ (>70%) and to a lesser extent (60–70%) verbal learning/memory measures. Except for premorbid verbal IQ, these neurocognitive variables contributed reliably to the discriminative pattern. Finally, the DST, SOPT, RAVLT-DR, and RAVLT-Ret measures were selected by >70% of the SVM models in the ARMS-E vs ARMS-L analysis, with the most reliable contribution observed in RAVLT-Ret.

Neurocognitive Analysis of HC vs ARMS-NT vs ARMS-T Individuals

Univariate Results.

After correcting for multiple comparisons, the significant performance differences found in the HC vs ARMS-NT vs ARMS-T analysis overlapped with the HC vs ARMS comparison, except for the RAVLT-DR (table 4). Compared with HC, both ARMS-NT and ARMS-T individuals had significant performance deficits across the TMT-B, TMT-(B−A), SOPT, and RAVLT-IR but they did not differ from each other in these measures (table 4).

SVM Classification Analysis.

The 3-group classification accuracy was 87.7% (table 6; figure 1C, right). One HC was misclassified as ARMS-NT (HC vs rest: sensitivity = 96.7%, specificity = 85.7%, and BAC = 90.8%). Three ARMS-NT were wrongly assigned to the HC and 1 to the ARMS-T group (ARMS-NT vs rest: 80.0%, 95.6%, and 90.8%). Two ARMS-T were mislabeled as HC and 1 as ARMS-NT (ARMS-T vs rest: 80%, 98%, 89%). The binary HC vs ARMS-T classifier attained the highest performance (BAC, sensitivity, and specificity: 93.3%, 86.7%, and 100%), followed by HC vs ARMS-NT (86.7%, 80.0%, and 93.3%) and ARMS-NT vs ARMS-T (77.5%, 80%, and 75%) (table 5; figure 1B, right).

The discriminative pattern of HC vs ARMS-NT (figure 2C) was similar to the HC vs ARMS-E profile and showed high selection probabilities (>80%) in the working memory and delayed verbal learning domain (LNS, SOPT, RAVLT-DR), which were reliable for the SOPT and RAVLT-DR measures (figure 3C). The HC vs ARMS-T pattern was characterized by frequently selected executive and working memory measures (digit span, TMT-B, TMT-(B−A), verbal fluency: >75% models). Reliable between-group SV differences were found in cognitive flexibility, visual working memory, and verbal learning abilities (TMT-B, TMT-[B−A], SOPT, and RAVLT).

Finally, the discriminative ARMS-NT vs ARMS-T pattern particularly involved verbal and executive functions, as measured by the MWT-B, DST, TMT-B, RAVLT-DR, and RAVLT-Ret (>60% of the classification models included these measures). The cognitive flexibility and delayed verbal learning measures were reliable at the 95% confidence interval. Balanced classification accuracy dropped 2%–7% when one of these neurocognitive variables was removed from the test battery (see figure 1 and table 3 in online supplementary material).

Discussion

To our knowledge, this is the first study to assess the feasibility of an individualized early recognition of different ARMS and a prediction of disease transition by analyzing neurocognitive data using state-of-the-art machine-learning techniques. These data were acquired from a neuroleptic-naïve at-risk population recruited using operationalized high-risk criteria that were previously employed to study neurobiological and neurocognitive correlates of the ARMS for psychosis.9,22,23,2734,55 Furthermore, the transition rate of 42.9% in our follow-up cases indicates that the ARMS individuals’ risk for transition to psychosis lies within the range of previous UHR studies, eg, the initial findings from the PACE clinic in Australia,37,38 the earlyTreatment of Pre-Psychosis clinic in Norway,56 the Swiss FEPSY study,16 or the North American Prodrome Longitudinal Study.17,18,57

We could demonstrate that a neurocognition-based classification system may achieve high diagnostic accuracies in (1) distinguishing unseen ARMS from HC, (2) recognizing their level of vulnerability, as defined by the operationalized ARMS-E and ARMS-L criteria, and (3) predicting their subsequent outcome regarding transition or nontransition to psychosis over 4 years. Furthermore, the repeated nested CV framework provided an unbiased estimate of classification generalizability to new, unseen cases. Hence, the observation of overall high cross-validated classification accuracies in the 3 SVM experiments may support that the underlying neurocognitive patterns indeed represent distinct and homogeneous neurocognitive signatures of the ARMS and the emerging illness. Therefore, these signatures may be interpreted as neurocognitive markers that allow for a reliable early recognition of the ARMS and the prodromal phase of psychosis at the individual level.

Neurocognitive Markers of ARMS-E and ARMS-L

The (visual) working memory and verbal learning domains were particularly involved in the discriminative pattern underlying the high classification accuracy (96%) of ARMS-E vs HC individuals (table 5). This observation partly overlaps with reports of verbal memory alterations in at-risk populations characterized by predictive basic symptoms.7,9,11 However, these investigations did not find deficits in the (visual) working memory domain (SOPT, LNS) to be associated with an “early” ARMS for psychosis. Furthermore, Simon et al55 did not detect any significant neurocognitive differences in at-risk individuals recruited for basic symptoms compared with help-seeking patient controls. These inconsistencies may be attributed to differences in statistical power and employed methodology.58 Nevertheless, our binary classification results suggest that a pattern of altered verbal and mnemonic functions may reliably distinguish at-risk individuals experiencing predictive basic symptoms11,28 from healthy volunteers on a single-subject basis.

Compared with this pattern, the neurocognitive signature used by the HC vs ARMS-L classifier relied strongly on premorbid verbal IQ, processing speed, cognitive set shifting, and to a lesser degree verbal learning abilities. This pattern facilitated a cross-validated BAC of 88% in the classification of HC vs ARMS-L individuals. The pronounced involvement of executive functioning in this discriminative profile agrees with previous neuropsychological studies of UHR subjects.2,8,9,14,59,60 Furthermore, in keeping with our findings, premorbid verbal IQ,5,55 processing speed,9,60 and verbal learning impairments5,9,55 have been shown to characterize the UHR for psychosis, albeit not as a consistent deficiency pattern throughout the literature. Again, this heterogeneity may be due to differences in sample characteristics, employed neuropsychological measures, and statistical strategies across these studies.58 In this regard, it is noteworthy that the neurocognitive profile of the ARMS-L obtained using univariate statistical procedures (table 4) differs from the discriminative patterns detected by our multivariate methodology. This discrepancy reflects the SVM's ability to perform multivariate prediction, meaning that the algorithm analyzes the relationships of different cognitive features with respect to the conjoint discriminative power they provide. Thus, single predictive features may be redundant when combined into a discriminative profile. In contrast, features that appear to be irrelevant may aid in reliably separating the groups within a high-dimensional nonlinear discriminative space.

This phenomenon was particularly observed in the ARMS-E vs ARMS-L analysis, where univariate methods did not reveal any significant differences. However, the binary SVM classifier was able to separate unseen ARMS-E and ARMS-L with a BAC of 75% by relying on a discriminative pattern that involved premorbid verbal IQ, processing speed, visual working memory, and delayed verbal learning. This discriminative pattern is in keeping with existing studies that showed that these neurocognitive domains are more impaired in ARMS-L vs ARMS-E individuals.7,9,55

In summary, our binary and multigroup classification results (table 6) suggest that an early recognition of ARMS individuals may be feasible on a single-subject basis. If replicated in larger, independent populations, these findings could support the diagnostic application of neurocognitive pattern classification in the early stages of psychosis development. These at-risk states are likely to be more amenable to therapeutic intervention than the established disorder, but at the same time, they elude clinical detection due to the absence of frank psychotic symptoms.

In this regard, prospective studies are needed to trace neurocognitive deficits from the initial prodrome to the first-episode of the disease in order to evaluate whether the discriminative profiles of the ARMS-E and ARMS-L evolve on a single disease trajectory, or alternatively, whether they represent 2 distinct vulnerability states of psychosis, as suggested by the divergent transition rates in both subgroups (see methods section).

Neurocognitive Markers of Transition to Psychosis

Our second classification analysis demonstrated that neurocognition-based SVM classifiers may be capable of predicting whether a subclinical ARMS will evolve into frank psychosis at the single-subject level. The neurocognitive patterns extracted by the binary classifiers partly overlapped with the profiles identified in the HC vs ARMS-E vs ARMS-L analysis. More specifically, the classification of converters vs nonconverters achieved a BAC of 77.5% based on a discriminative pattern, which mainly involved premorbid verbal IQ, executive functions, and (verbal) learning abilities. Furthermore, the multigroup classifier (table 6) separated the converters/nonconverters from the other study groups with a BAC of 89%/90.8%, suggesting that larger training samples and additional neurocognitive information provided by the HC group facilitated the modeling of the complex separating boundary between the conversion and nonconversion samples.

Linear classification models combining clinical with neurocognitive data have previously shown to improve disease prediction beyond the levels of purely clinical recognition strategies. In the study of Lencz et al,5 the authors found that regression models using verbal memory performance and positive symptom severity achieved an accuracy of 80% in distinguishing subsequent converters from the nonconverters. Similarly, Riecher-Rössler et al16 found that logistic regression integrating suspiciousness and anhedonia scores with executive functioning correctly predicted transition in 80% of the cases. However, none of these studies estimated the generalizability of prediction using a rigorous CV procedure that separated the training from the validation data at all steps of the model creation process. Moreover, a recent multicenter study,17 which involved a total of 269 clinical high-risk subjects with follow-up information and 193 HC, did not find neurocognitive data to provide predictive power beyond clinical prediction models.18

In the context of these conflicting findings, our results suggest that multivariate machine learning methods may be capable of extracting those discriminative patterns from cross-domain neuropsychological test batteries that indeed facilitate a reliable prediction of psychosis in unseen ARMS individuals. It is of note that our multigroup classification system outperformed (89%) the PACE-based UHR criteria in predicting psychosis as 14 of 21 followed ARMS-L subjects (66.7%), which were recruited using these criteria, subsequently developed the disease. However, we have to point out that our classification results were obtained in a clinically defined high-risk population and hence cannot be generalized to asymptomatic individuals at-risk for psychosis due to eg, genetic reasons. Furthermore, it is unknown how the SVM method would perform in ARMS samples with lower conversion rates, as reported recently.61

In keeping with Lencz et al5 and Riecher-Rössler et al,16 our findings support that neurocognitive pattern recognition may substantially enhance the diagnostic reliability of existing clinical early recognition strategies.18,28,38 Furthermore, the higher PANSS negative scores observed in the conversion vs the nonconversion group (table 3) as well as the correlation between PANSS negative scores and RAVLT-DR (|r|=.56) as well as RAVLT-Ret (|r|=.56) suggests an association between prodromal negative symptoms and cognitive deficits. In this regard, multimodal diagnostic applications combining psychopathological and neurocognitive data may outperform classifiers relying only on neurocognitive information.5,16 However, these multimodal diagnostic systems would essentially depend on skilled clinicians trained to reliably detect subtle prodromal symptoms across individuals and over time, which may confine such systems to highly specialized clinical centers.

Our current and previous results22 suggest that neurocognitive and neuroanatomical pattern classification may achieve equivalent prediction results. This is important because the applicability and availability of MRI-based procedures in clinical real-world scenarios may be limited due to concomitant psychiatric, medical, and economic conditions. Furthermore, the similar classification accuracies obtained by neurocognition- and MRI-based pattern classification may point to a link between structural brain alterations22,23,30,31,36,6264 and neurocognitive abnormalities in the ARMS. To date, these potential associations have been assessed only by 2 studies that found correlations between hippocampal volume and verbal learning deficits34 as well as between prefronto-callosal volumes and executive impairment.42 Therefore, further research is needed to evaluate the performance of multimodal classifiers that integrate both neuroanatomical and neurocognitive data.

Finally, 2 further limitations have to be considered. Although the repeated double CV framework provided a reliable estimate of classification generalizability, we cannot rule out that the classification accuracies observed in our analyses may be due an accidental recruitment of an easy to categorize ARMS population. This possibility could be precluded in future multicenter studies that recruit and follow significantly larger samples of ARMS individuals. Furthermore, the encouraging results of our classification analyses do not imply that similar levels of specificity and sensitivity in predicting psychosis could be achieved in individuals presenting with ARMS-like symptoms in a normal clinical setting. This is because the classifiers were not trained to detect other psychiatric conditions like depression, bipolar disorder, or borderline personality disorder that may present with overlapping clinical and neurocognitive abnormalities. In this regard, the ARMS-NT group's clinical outcome involved affective spectrum diagnoses, such as major depression, bipolar, and adjustment disorder, possibly pointing to a specificity of the neurocognitive patterns in separating schizophrenia spectrum psychosis from affective disorders. This important issue should be further examined in transnosological ARMS cohorts that cover ARMS for different psychiatric disorders.

Funding

Funding was provided by Ludwig-Maximilian-University to Nikolaos Koutsouleris for the recruitment and examination of the ARMS subjects. The funding source had no further role in the study design, the collection, analysis and interpretation of data, the writing of the report and in the decision to submit the article for publication.

Supplementary Material

Supplementary material is available at http://schizophreniabulletin.oxfordjournals.org.

Supplementary Data

Acknowledgments

We would like to thank Dr Reinhold Bader, Linux Cluster Systems for the Munich and Bavarian Universities, for his support in integrating the LIBSVM software into the batch system of the Linux cluster. Conflicts of Interest: None to declare

References

  • 1.Heinrichs RW, Zakzanis KK. Neurocognitive deficit in schizophrenia: a quantitative review of the evidence. Neuropsychology. 1998;12:426–445. doi: 10.1037//0894-4105.12.3.426. [DOI] [PubMed] [Google Scholar]
  • 2.Hawkins KA, McGlashan TH, Quinlan D, et al. Factorial structure of the Scale of Prodromal Symptoms. Schizophr Res. 2004;68:339–347. doi: 10.1016/S0920-9964(03)00053-7. [DOI] [PubMed] [Google Scholar]
  • 3.Brewer WJ, Wood SJ, McGorry PD, et al. Impairment of olfactory identification ability in individuals at ultra-high risk for psychosis who later develop schizophrenia. Am J Psychiatry. 2003;160:1790–1794. doi: 10.1176/appi.ajp.160.10.1790. [DOI] [PubMed] [Google Scholar]
  • 4.Francey SM, Jackson HJ, Phillips LJ, Wood SJ, Yung AR, McGorry PD. Sustained attention in young people at high risk of psychosis does not predict transition to psychosis. Schizophr Res. 2005;79:127–136. doi: 10.1016/j.schres.2005.06.023. [DOI] [PubMed] [Google Scholar]
  • 5.Lencz T, Smith CW, McLaughlin D, et al. Generalized and specific neurocognitive deficits in prodromal schizophrenia. Biol Psychiatry. 2006;59:863–871. doi: 10.1016/j.biopsych.2005.09.005. [DOI] [PubMed] [Google Scholar]
  • 6.Niendam TA, Bearden CE, Johnson JK, et al. Neurocognitive performance and functional disability in the psychosis prodrome. Schizophr Res. 2006;84:100–111. doi: 10.1016/j.schres.2006.02.005. [DOI] [PubMed] [Google Scholar]
  • 7.Pukrop R, Schultze-Lutter F, Ruhrmann S, et al. Neurocognitive functioning in subjects at risk for a first episode of psychosis compared with first- and multiple-episode schizophrenia. J Clin Exp Neuropsychol. 2006;28:1388–1407. doi: 10.1080/13803390500434425. [DOI] [PubMed] [Google Scholar]
  • 8.Wood SJ, Brewer WJ, Koutsouradis P, et al. Cognitive decline following psychosis onset: data from the PACE clinic. Br J Psychiatry Suppl. 2007;51:s52–s57. doi: 10.1192/bjp.191.51.s52. [DOI] [PubMed] [Google Scholar]
  • 9.Frommann I, Pukrop R, Brinkmeyer J, et al. Neuropsychological profiles in different at-risk states of psychosis: executive control impairment in the early—and additional memory dysfunction in the late—prodromal state. Schizophr Bull. doi: 10.1093/schbul/sbp155. January 6, 2010; doi:10.1093/schbul/sbp155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Huber G. Reine Defektsyndrome und Basisstadien endogener Psychosen. Fortschr Neurol Psychiatr. 1966;34:409–426. [Google Scholar]
  • 11.Klosterkötter J, Hellmich M, Steinmeyer EM, Schultze-Lutter F. Diagnosing schizophrenia in the initial prodromal phase. Arch Gen Psychiatry. 2001;58:158–164. doi: 10.1001/archpsyc.58.2.158. [DOI] [PubMed] [Google Scholar]
  • 12.Amminger GP, Schäfer MR, Papageorgiou K, et al. Long-chain omega-3 fatty acids for indicated prevention of psychotic disorders: a randomized, placebo-controlled trial. Arch Gen Psychiatry. 2010;67:146–154. doi: 10.1001/archgenpsychiatry.2009.192. [DOI] [PubMed] [Google Scholar]
  • 13.Haroun N, Dunn L, Haroun A, Cadenhead KS. Risk and protection in prodromal schizophrenia: ethical implications for clinical practice and future research. Schizophr Bull. 2006;32:166–178. doi: 10.1093/schbul/sbj007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pukrop R, Ruhrmann S, Schultze-Lutter F, Bechdolf A, Brockhaus-Dumke A, Klosterkötter J. Neurocognitive indicators for a conversion to psychosis: comparison of patients in a potentially initial prodromal state who did or did not convert to a psychosis. Schizophr Res. 2007;92:116–125. doi: 10.1016/j.schres.2007.01.020. [DOI] [PubMed] [Google Scholar]
  • 15.Woodberry KA, Seidman LJ, Giuliano AJ, Verdi MB, Cook WL, McFarlane WR. Neuropsychological profiles in individuals at clinical high risk for psychosis: relationship to psychosis and intelligence. Schizophr Res. 2010;123:188–198. doi: 10.1016/j.schres.2010.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Riecher-Rössler A, Pflüger MO, Aston J, et al. Efficacy of using cognitive status in predicting psychosis: a 7-year follow-up. Biol Psychiarty. 2009;66:1023–1030. doi: 10.1016/j.biopsych.2009.07.020. [DOI] [PubMed] [Google Scholar]
  • 17.Seidman LJ, Giuliano AJ, Meyer EC, et al. Neuropsychology of the prodrome to psychosis in the NAPLS consortium: relationship to family history and conversion to psychosis. Arch Gen Psychiatry. 2010;67:578–588. doi: 10.1001/archgenpsychiatry.2010.66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Cannon TD, Cadenhead K, Cornblatt B, et al. Prediction of psychosis in youth at high clinical risk: a multisite longitudinal study in North America. Arch Gen Psychiatry. 2008;65:28–37. doi: 10.1001/archgenpsychiatry.2007.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Noble WS. What is a support vector machine? Nat Biotechnol. 2006;24:1565–1567. doi: 10.1038/nbt1206-1565. [DOI] [PubMed] [Google Scholar]
  • 20.Davatzikos C, Shen D, Gur RC, et al. Whole-brain morphometric study of schizophrenia revealing a spatially complex set of focal abnormalities. Arch Gen Psychiatry. 2005;62:1218–1227. doi: 10.1001/archpsyc.62.11.1218. [DOI] [PubMed] [Google Scholar]
  • 21.Fan Y, Resnick SM, Wu X, Davatzikos C. Structural and functional biomarkers of prodromal Alzheimer's disease: a high-dimensional pattern classification study. Neuroimage. 2008;41:277–285. doi: 10.1016/j.neuroimage.2008.02.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Koutsouleris N, Meisenzahl E, Davatzikos C, et al. Neuroanatomical pattern classification identifies subjects in at-risk mental states of psychosis and predicts disease transition. Arch Gen Psychiatry. 2009;66:700–712. doi: 10.1001/archgenpsychiatry.2009.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Koutsouleris N, Gaser C, Bottlender R, et al. Use of neuroanatomical pattern regression to predict the structural brain dynamics of vulnerability and transition to psychosis. Schizophr Res. 2010;123:175–187. doi: 10.1016/j.schres.2010.08.032. [DOI] [PubMed] [Google Scholar]
  • 24.Vapnik VN. An overview of statistical learning theory. IEEE Trans Neural Netw. 1999;10:988–999. doi: 10.1109/72.788640. [DOI] [PubMed] [Google Scholar]
  • 25.Polikar R. Ensemble based systems in decision making. IEEE Circuits and Systems. 2006;6:21–45. [Google Scholar]
  • 26.Wang Y, Fan Y, Bhatt P, Davatzikos C. High-dimensional pattern regression using machine learning: from medical images to continuous clinical variables. Neuroimage. 2010;50:1519–1535. doi: 10.1016/j.neuroimage.2009.12.092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ruhrmann S, Schultze-Lutter F, Klosterkötter J. Early detection and intervention in the initial prodromal phase of schizophrenia. Pharmacopsychiatry. 2003;36(suppl 3):S162–S167. doi: 10.1055/s-2003-45125. [DOI] [PubMed] [Google Scholar]
  • 28.Ruhrmann S, Schultze-Lutter F, Salokangas RKR, et al. Prediction of psychosis in adolescents and young adults at high risk: results from the prospective European prediction of psychosis study. Arch Gen Psychiatry. 2010;67:241–251. doi: 10.1001/archgenpsychiatry.2009.206. [DOI] [PubMed] [Google Scholar]
  • 29.Schultze-Lutter F, Ruhrmann S, Picker H, von Reventlow HG, Brockhaus-Dumke A, Klosterkötter J. Basic symptoms in early psychotic and depressive disorders. Br J Psychiatry Suppl. 2007;51:s31–s37. doi: 10.1192/bjp.191.51.s31. [DOI] [PubMed] [Google Scholar]
  • 30.Meisenzahl EM, Koutsouleris N, Gaser C, et al. Structural brain alterations in subjects at high-risk of psychosis: a voxel-based morphometric study. Schizophr Res. 2008;102:150–162. doi: 10.1016/j.schres.2008.02.023. [DOI] [PubMed] [Google Scholar]
  • 31.Koutsouleris N, Schmitt G, Gaser C, et al. Neuroanatomical correlates of different vulnerability states of psychosis in relation to clinical outcome. Br J Psychiatry. 2009;195:218–226. doi: 10.1192/bjp.bp.108.052068. [DOI] [PubMed] [Google Scholar]
  • 32.Frommann I, Brinkmeyer J, Ruhrmann S, et al. Auditory P300 in individuals clinically at risk for psychosis. Int J Psychophysiol. 2008;70:192–205. doi: 10.1016/j.ijpsycho.2008.07.003. [DOI] [PubMed] [Google Scholar]
  • 33.Quednow BB, Frommann I, Berning J, Kühn KU, Maier W, Wagner M. Impaired sensorimotor gating of the acoustic startle response in the prodrome of schizophrenia. Biol Psychiatry. 2008;64:766–773. doi: 10.1016/j.biopsych.2008.04.019. [DOI] [PubMed] [Google Scholar]
  • 34.Hurlemann R, Jessen F, Wagner M, et al. Interrelated neuropsychological and anatomical evidence of hippocampal pathology in the at-risk mental state. Psychol Med. 2008;38:843–851. doi: 10.1017/S0033291708003279. [DOI] [PubMed] [Google Scholar]
  • 35.Yung AR, Phillips LJ, McGorry PD, et al. Prediction of psychosis. A step towards indicated prevention of schizophrenia. Br J Psychiatry Suppl. 1998;172:14–20. [PubMed] [Google Scholar]
  • 36.Sun D, van Erp TGM, Thompson PM, et al. Elucidating a magnetic resonance imaging-based neuroanatomic biomarker for psychosis: classification analysis using probabilistic brain atlas and machine learning algorithms. Biol Psychiatry. 2009;66:1055–1060. doi: 10.1016/j.biopsych.2009.07.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Yung AR, Phillips LJ, Yuen HP, et al. Psychosis prediction: 12-month follow up of a high-risk("prodromal") group. Schizophr Res. 2003;60:21–32. doi: 10.1016/s0920-9964(02)00167-6. [DOI] [PubMed] [Google Scholar]
  • 38.Yung AR, Phillips LJ, Yuen HP, McGorry PD. Risk factors for psychosis in an ultra high-risk group: psychopathology and clinical features. Schizophr Res. 2004;67:131–142. doi: 10.1016/S0920-9964(03)00192-0. [DOI] [PubMed] [Google Scholar]
  • 39.American Psychiatric Association. Diagnostic and Statistical Manual for Mental Disorders. 4th ed. Washington, DC: American Psychiatric Association; 1994. [Google Scholar]
  • 40.Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. 1987;13:261–276. doi: 10.1093/schbul/13.2.261. [DOI] [PubMed] [Google Scholar]
  • 41.Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. 1979;134:382–389. doi: 10.1192/bjp.134.4.382. [DOI] [PubMed] [Google Scholar]
  • 42.Koutsouleris N, Patschurek-Kliche K, Scheuerecker J, et al. Neuroanatomical correlates of executive dysfunction in the at-risk mental state for psychosis. Schizophr Res. 2010;123:160–174. doi: 10.1016/j.schres.2010.08.026. [DOI] [PubMed] [Google Scholar]
  • 43.Häfner H, Maurer K, Ruhrmann S, et al. Early detection and secondary prevention of psychosis: facts and visions. Eur Arch Psychiatry Clin Neurosci. 2004;254:117–128. doi: 10.1007/s00406-004-0508-z. [DOI] [PubMed] [Google Scholar]
  • 44.Lehrl S. Mehrfachwahl-Wortschatz-Intelligenztest MWT-B. 5th ed. Balingen, Germany: Spitta Verlag; 2005. [Google Scholar]
  • 45.Reitan RM. TMT, Trail Making Test A & B. South Tucson, AR: Reitan Neuropsychology Laboratory; 1992. [Google Scholar]
  • 46.Wechsler D. Wechsler Adult Intelligence Scale. 3rd ed. San Antonio, TX: Psychological Coorperation; 1997. [Google Scholar]
  • 47.Gold JM, Carpenter C, Randolph C, Goldberg TE, Weinberger DR. Auditory working memory and Wisconsin Card Sorting Test performance in schizophrenia. Arch Gen Psychiatry. 1997;54:159–165. doi: 10.1001/archpsyc.1997.01830140071013. [DOI] [PubMed] [Google Scholar]
  • 48.Petrides M. Impairments on nonspatial self-ordered and externally ordered working memory tasks after lesions of the mid-dorsal part of the lateral frontal cortex in the monkey. J Neurosci. 1995;15:359–375. doi: 10.1523/JNEUROSCI.15-01-00359.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lezak MD. Neuropsychological Assessment. 3rd ed. New York, NY: Oxford University Press; 1995. [Google Scholar]
  • 50.Schultze-Lutter F, Ruhrmann S, Picker H, et al. Relationship between subjective and objective cognitive function in the early and late prodrome. Br J Psychiatry Suppl. 2007;51:s43–s51. doi: 10.1192/bjp.191.51.s43. [DOI] [PubMed] [Google Scholar]
  • 51.Holm S. A simple sequentially rejective multiple test procedure. Scand J Stat. 1979;6:65–70. [Google Scholar]
  • 52.Vapnik V. Statistical Learning Theory. New York, NY: Wiley Interscience; 1998. [Google Scholar]
  • 53.Burges CJC. A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov. 1998;2:121–167. [Google Scholar]
  • 54.Filzmoser P, Liebmann B, Varmuza K. Repeated double cross validation. J Chemometrics. 2009;23:160–171. [Google Scholar]
  • 55.Simon AE, Cattapan-Ludewig K, Zmilacher S, et al. Cognitive functioning in the schizophrenia prodrome. Schizophr Bull. 2007;33:761–771. doi: 10.1093/schbul/sbm018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Larsen TK. The transition from premorbid period to psychosis: how can it be described? Acta Psychiatr Scand. 2002;106:10–11. [Google Scholar]
  • 57.Miller TJ, McGlashan TH, Rosen JL, et al. Prospective diagnosis of the initial prodrome for schizophrenia based on the Structured Interview for Prodromal Syndromes: preliminary evidence of interrater reliability and predictive validity. Am J Psychiatry. 2002;159:863–865. doi: 10.1176/appi.ajp.159.5.863. [DOI] [PubMed] [Google Scholar]
  • 58.Pukrop R, Klosterkötter J. Neurocognitive indicators of clinical high-risk states for psychosis: a critical review of the evidence. Neurotox Res. 2010;18:272–286. doi: 10.1007/s12640-010-9191-1. [DOI] [PubMed] [Google Scholar]
  • 59.Fis NP, Cetin FC, Erturk M, Erdogan E, Dedeoglu C, Yazgan Y. Executive dysfunction in Turkish children at high risk for schizophrenia. Eur Child Adolesc Psychiatry. 2008;17:424–431. doi: 10.1007/s00787-008-0684-x. [DOI] [PubMed] [Google Scholar]
  • 60.Blanchard MM, Jacobson S, Clarke MC, et al. Language, motor and speed of processing deficits in adolescents with subclinical psychotic symptoms. Schizophr Res. 2010;123:71–76. doi: 10.1016/j.schres.2010.05.028. [DOI] [PubMed] [Google Scholar]
  • 61.Yung AR, Yuen HP, Berger G, et al. Declining transition rate in ultra high risk (prodromal) services: dilution or reduction of risk? Schizophr Bull. 2007;33:673–681. doi: 10.1093/schbul/sbm015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Pantelis C, Velakoulis D, McGorry PD, et al. Neuroanatomical abnormalities before and after onset of psychosis: a cross-sectional and longitudinal MRI comparison. Lancet. 2003;361:281–288. doi: 10.1016/S0140-6736(03)12323-9. [DOI] [PubMed] [Google Scholar]
  • 63.Borgwardt SJ, Riecher-Rössler A, Dazzan P, et al. Regional gray matter volume abnormalities in the at risk mental state. Biol Psychiatry. 2007;61:1148–1156. doi: 10.1016/j.biopsych.2006.08.009. [DOI] [PubMed] [Google Scholar]
  • 64.Borgwardt SJ, McGuire PK, Aston J, et al. Reductions in frontal, temporal and parietal volume associated with the onset of psychosis. Schizophr Res. 2008;106:108–114. doi: 10.1016/j.schres.2008.08.007. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Schizophrenia Bulletin are provided here courtesy of Oxford University Press

RESOURCES