Skip to main content
NeuroImage : Clinical logoLink to NeuroImage : Clinical
. 2020 Feb 17;26:102213. doi: 10.1016/j.nicl.2020.102213

Predicting individual clinical trajectories of depression with generative embedding

Stefan Frässle a,, Andre F Marquand b,c, Lianne Schmaal d,e, Richard Dinga f, Dick J Veltman f, Nic JA van der Wee g, Marie-José van Tol h, Dario Schöbi a, Brenda WJH Penninx f,i, Klaas E Stephan a,j,k
PMCID: PMC7082217  PMID: 32197140

Highlights

  • Patients with major depressive disorder (MDD) show variable clinical trajectories.

  • Generative embedding (GE) is used to predict clinical trajectories in MDD patients.

  • GE classifies patients with chronic depression vs. fast remission with 79% accuracy.

  • GE provides mechanistic interpretability and outperforms conventional measures.

  • Proof-of-concept that illustrates the potential of GE for clinical prediction.

Abstract

Patients with major depressive disorder (MDD) show heterogeneous treatment response and highly variable clinical trajectories: while some patients experience swift recovery, others show relapsing-remitting or chronic courses. Predicting individual clinical trajectories at an early stage is a key challenge for psychiatry and might facilitate individually tailored interventions. So far, however, reliable predictors at the single-patient level are absent. Here, we evaluated the utility of a machine learning strategy – generative embedding (GE) – which combines interpretable generative models with discriminative classifiers. Specifically, we used functional magnetic resonance imaging (fMRI) data of emotional face perception in 85 MDD patients from the NEtherlands Study of Depression and Anxiety (NESDA) who had been followed up over two years and classified into three subgroups with distinct clinical trajectories. Combining a generative model of effective (directed) connectivity with support vector machines (SVMs), we could predict whether a given patient would experience chronic depression vs. fast remission with a balanced accuracy of 79%. Gradual improvement vs. fast remission could still be predicted above-chance, but less convincingly, with a balanced accuracy of 61%. Generative embedding outperformed classification based on conventional (descriptive) features, such as functional connectivity or local activation estimates, which were obtained from the same data and did not allow for above-chance classification accuracy. Furthermore, predictive performance of GE could be assigned to a specific network property: the trial-by-trial modulation of connections by emotional content. Given the limited sample size of our study, the present results are preliminary but may serve as proof-of-concept, illustrating the potential of GE for obtaining clinical predictions that are interpretable in terms of network mechanisms. Our findings suggest that abnormal dynamic changes of connections involved in emotional face processing might be associated with higher risk of developing a less favorable clinical course.

1. Introduction

Major depressive disorder (MDD) is one of the most burdening mental disorders with a lifetime prevalence of 10–30% (Andrade et al., 2003; de Graaf et al., 2012). Up to a fourth of MDD patients are at risk of developing a chronic disease (Penninx et al., 2011), characterized by severe negative impact on quality of life and high rates of psychiatric comorbidities (Kohler et al., 2019). The diagnostic criteria of MDD in ICD and DSM-5 (American Psychiatric Association, 2013) are not grounded in pathophysiology, but refer to symptoms and signs (e.g., depressed mood, anhedonia, fatigue) that could have various causes. The diagnostic label MDD likely subsumes patients with different disease mechanisms and has limited predictive validity: MDD patients show highly variable clinical trajectories over time (Gueorguieva et al., 2011; Musliner et al., 2016; Muthen et al., 2011), and the absence of mechanistically interpretable predictors turns therapy into a trial-and-error procedure (Cuthbert and Insel, 2013; Kapur et al., 2012; Rush et al., 2006). This is not only costly and frustrating for patients, but also bears the risk of long-term adverse events (McMahon and Insel, 2012) and reduced treatment adherence (Velligan et al., 2010).

This emphasizes the need for novel prognostic approaches to depression that furnish predictors for clinical trajectories and treatment outcomes. Predicting symptom trajectories in MDD at an early stage is of high clinical relevance because identifying patients at risk of chronic disease might guide the deployment of intensified early interventions (MacQueen, 2009). To achieve this, successful tools may benefit from being grounded in biology to enable a mechanistically relevant stratification of the heterogeneous MDD spectrum (Stephan et al., 2017). Using neuroimaging, some studies demonstrated that disease onset and short-term treatment response prediction may be possible (Dunlop et al., 2017; Mayberg et al., 1997; Pan et al., 2017; Phillips et al., 2015). By contrast, it has proven more challenging to predict long-term clinical outcome, such as symptom trajectories over several years.

Schmaal et al. (2015) assessed the prognostic value of structural and functional magnetic resonance imaging (fMRI) to classify disease trajectories in MDD patients from the NEtherlands Study of Depression and Anxiety (NESDA; Penninx et al., 2008), a multi-site longitudinal study in a large naturalistic cohort. The authors demonstrated that fMRI data from an emotional face perception paradigm allowed discriminating patients who, over the course of two years, showed a chronic disease trajectory from patients showing rapid remission of depressive symptoms, with up to 73% accuracy. This result was obtained by applying a supervised machine learning (ML) method, Gaussian Process Classifiers (GPCs; Rasmussen and Williams, 2005), to contrast images.

While an encouraging initial result, developing this approach further with conventional ML techniques and towards clinically required levels of accuracy faces several challenges (Brodersen et al., 2011). First, achieving high classification accuracy robustly from whole-brain fMRI data can be difficult, given the high dimensionality of the data relative to the small sample sizes. Second, the results from “black-box” ML operating on descriptive features (e.g., contrast images) do not easily allow for mechanistic interpretations. The latter, however, is increasingly recognized as critical for clinical applications of ML (Itani et al., 2019; Woo et al., 2017), both to derive novel treatment ideas from successful predictions but also to detect cases when ML goes awry, e.g., predictions that derive from artefacts in the data.

Generative embedding (GE) represents a potentially attractive alternative to “classical” ML (Shawe-Taylor and Cristianini, 2004). The idea is simple but powerful: instead of selecting features from the original data, one applies a generative model to the data and uses the ensuing model parameter estimates as features. Generative models describe how observed data may have been “generated” from latent (hidden) system states and thus often embody some degree of mechanistic interpretability. For example, in neuroimaging, GE uses model-based estimates of physiological or cognitive parameters, such as connection strengths (Brodersen et al., 2014, 2011), ion channel conductances (Symmonds et al., 2018), prediction errors (Paulus, 2015), or response inhibition (Wiecki et al., 2016). More technically, GE views a generative model as a theory-driven dimensionality reduction device that projects high-dimensional data onto neurobiologically meaningful parameters that define a low-dimensional and interpretable space for classification. Provided a plausible model exists, GE frequently yields more accurate results than conventional ML (Brodersen et al., 2014, 2011), likely because the generative model separates signal (reflecting the process of interest) from (measurement) noise.

Model-based estimates of brain connectivity might be particularly informative for predicting clinical trajectories in MDD, given that dysconnectivity has been postulated as a hallmark of depression (Greicius et al., 2007; Mayberg, 1997; Wang et al., 2012). Here, we used a generative model of fMRI data, dynamic causal modeling (DCM; Friston et al., 2003), to infer effective (directed) connectivity and test the utility of GE for predicting individual clinical trajectories in MDD patients from the NESDA study. For this purpose, we combined DCMs of the emotional face perception network with linear support vector machines (SVMs). We then compared the cross-validated predictive accuracy of GE with more conventional approaches, such as classification based on functional connectivity and local BOLD activity, testing whether a biologically plausible generative model would be superior for predicting naturalistic disease courses from fMRI data.

We emphasize that the present work is not meant to provide an ultimate prognostic tool for outcomes in MDD since this may require a substantially larger dataset than currently available. Instead, the present study provides proof-of-concept that illustrates the potential benefits of GE relative to conventional approaches in neuroimaging studies of MDD, with regard to predictive power and interpretability.

2. Materials and methods

2.1. Participants

The data used here were acquired in the NESDA study (Penninx et al., 2008), a multi-site longitudinal study on the long-term course of depression and anxiety disorders in a large naturalistic cohort. In total, 2981 participants (18–65 years) were recruited from community, primary care and specialized mental health organizations. From this cohort, 301 participants (156 with MDD diagnosis) were included in the MRI experiment. For detailed descriptions of the full sample, see van Tol et al. (2010). For the current study, only those participants were included that had: (i) a DSM-IV diagnosis of MDD, as established using the structured Composite International Diagnostic Interview (CIDI; Robins et al., 1988) in the 6 months prior to baseline, (ii) reported symptoms in the month before baseline as confirmed by either the CIDI or the Life Chart Interview (LCI; Lyketsos et al., 1994), (iii) availability of 2-year follow-up of depressive symptoms from the LCI, and (iv) no other exclusion criteria related to, e.g., poor data quality, non-compliance with task instructions, or deficient performance. For details, see Schmaal et al. (2015). This yielded a final sample of 85 participants (for an overview of the demographic and clinical characteristics, see Table 1).

Table 1.

Demographic and clinical characteristics of participants included in the generative embedding analyses.

Characteristic REM (n = 39) IMP (n = 31) CHR (n = 15) Statistic p-value
Age, Years 35.90 (11.50) 35.03 (10.00) 44.00 (10.01) F = 3.92 0.02
Gender, n (%)
 Female 28 (72) 20 (65) 9 (60) χ2=0.83 0.66
 Male 22 (28) 11 (35) 6 (40)
Education, Years 11.74 (3.30) 12.26 (3.00) 12.00 (2.36) F = 0.25 0.78
Scan Location, n (%)
 AMC Amsterdam 5 (13) 5 (16) 4 (27) χ2=3.81 0.43
 LUMC Leiden 13 (33) 15 (48) 5 (33)
 UMCG Groningen 21 (54) 11 (35) 6 (40)
IDS Total T1 31.44 (10.76) 32.77 (8.35) 33.72 (7.91) F = 0.37 0.69
IDS Total T2 14.72 (9.06) 22.29 (9.93) 29.60 (7.17) F = 15.87 < 0.001
IDS Change (T2-T1) −16.72 (11.33) −10.48 (10.56) −4.13 (8.10) F = 8.35 < 0.001
BAI Total T1 14.90 (9.25) 15.77 (9.42) 14.27 (6.93) F = 0.16 0.85
Antidepressant Use T1, n (%)
 No 27 (69) 22 (71) 8 (53) χ2=1.58 0.45
 Yes 12 (31) 9 (29) 7 (47)
Antidepressant Use T2, n (%)
 No 25 (64) 22 (71) 9 (60) χ2=0.64 0.73
 Yes 14 (36) 9 (29) 6 (40)

Based on the two-year follow-up clinical trajectories derived from CIDI and LCI information, MDD patients were divided into different categories with distinct courses of symptom severity. This division was informed by a latent class growth analysis (Rhebergen et al., 2012), as reported by Schmaal et al. (2015). The three classes were: (i) MDD-remitted, showing a rapid remission of symptoms (REM: n = 39), (ii) MDD-improved, showing a slow but gradual improvement of symptoms from baseline to follow-up (IMP: n = 31), and (iii) MDD-chronic, showing no improvement of symptoms from baseline to follow-up (CHR: n = 15).

2.2. Experimental procedure

For fMRI, an event-related emotional face perception paradigm was used. Participants viewed color images of angry, fearful, sad, happy, and neutral facial expressions, as well as scrambled faces. Stimuli were shown for 2.5 s, with an inter-stimulus interval varying between 0.5–1.5 s. Participants were instructed to indicate the gender of the presented face via button press. For scrambled images, participants had to press buttons in accordance with an arrow pointing to the left or right. Stimuli were presented using E-prime (Psychological Software Tools, Pittsburgh, PA; https://pstnet.com/products/e-prime/). For details, see Demenescu et al. (2011).

2.3. Functional magnetic resonance imaging

2.3.1. Image acquisition

For NESDA, structural and functional MRI data were acquired at the University Medical Center Groningen (UMCG), Amsterdam Medical Center (AMC), and Leiden University Medical Center (LUMC). Participants were scanned on 3-Tesla MR scanners (Philips Healthcare, Best, The Netherlands) with SENSE 8-channel (LUMC, UMCG) or 6-channel (AMC) receiver head coils. For details, see Supplementary Material S1.

2.3.2. Image data processing

Some of the functional images were affected by a “column” or “pencil beam” artifact caused by imperfect fat suppression pulses. The artifact was most apparent in temporal signal-to-noise ratio (tSNR) maps and manifested as vertical stripes, primarily in frontal gyrus and anterior temporal lobe (see https://github.com/dinga92/stripe_cleaning_scripts). This was corrected by regressing out artifact-related independent components prior to the routine preprocessing steps. Artifact correction was done within FMRIB's Software Library (FSL; http://www.fmrib.ox.ac.uk/fsl) as follows: First, the Multivariate Exploratory Linear Optimized Decomposition into Independent Components (melodic) algorithm was used to identify independent components associated with the artifact, and second, regfilt was used to regress out the artifact-related components.

After artifact correction, functional images were analyzed using SPM12 (version R7487, Wellcome center for Human Neuroimaging, London, UK, http://www.fil.ion.ucl.ac.uk) and Matlab (Mathworks, Natick, MA, USA). Individual images were realigned to the mean image, coregistered with the high-resolution anatomical image, and normalized to the Montreal Neurological Institute (MNI) standard space using the unified segmentation-normalization approach. During spatial normalization, functional images were resampled to a voxel size of 2 × 2 × 2 mm3. Finally, normalized functional images were spatially smoothed using an 8 mm FWHM Gaussian kernel.

Preprocessed and artifact-corrected functional images from each participant entered first-level General Linear Model (GLM) analyses. Each condition (i.e., angry, fearful, happy, sad, neutral, and scrambled faces) was modeled as an individual regressor, consisting of a train of stimulus onsets convolved with a canonical hemodynamic response function (HRF). Additionally, temporal and dispersion derivatives of the canonical HRF were included to account for variability in shape and timing of hemodynamic responses (Friston et al., 1998). Realignment parameters were included as nuisance regressors to control for movement-related artifacts. Importantly, we assessed whether groups differed in the amount of head movements as this could potentially confound subsequent classification analyses. However, using a one-way between-subject analysis of variance (ANOVA) with the factor group, we found that the MDD groups did not significantly differ in their mean framewise displacement (FD; Power et al., 2012) computed from the six realignment parameters (F(2,82) = 0.68, p = 0.51). Additionally, low-frequency fluctuations in the data were removed using a high-pass filter (cut-off 1/128 Hz).

2.3.3. Time series extraction

We selected six regions of interest (ROIs) that represent key components of the extended face perception network (Haxby et al., 2000), bilateral occipital face area (OFA; Puce et al., 1996), fusiform face area (FFA; Kanwisher et al., 1997), and amygdala (Breiter et al., 1996). To account for inter-subject variability in their exact location, center coordinates were defined for each participant individually: First, we identified the most likely MNI coordinates of these regions from a meta-analysis of 720 studies using Neurosynth (Yarkoni et al., 2011) with the search criterion “face”. Relying on this external information from Neurosynth helped ensure independence of feature selection (i.e., definition of ROI coordinates) and subsequent prediction. Generally, we prevented any cross-talk between training and test samples which might otherwise positively bias classification accuracy (see Supplementary Material S2). Second, individual peak activation coordinates were defined as the subject-specific local maximum closest to the Neurosynth coordinates within a 12 mm sphere for the linear contrast comparing faces (regardless of emotional valence) against scrambled images. Individual coordinates are illustrated in Supplementary Figure S1. While no group information was used to define ROI coordinates, in principle, some bias could still exist if the identification of individual peak activation coordinates had been influenced by systematic group differences. We examined this potential issue for the present analysis, using a multivariate analysis of variance (MANOVA), but did not find any significant group differences in the subject-specific peak activation coordinates for any of the regions (all p > 0.05). Third, BOLD signal time series were extracted from the subject-specific ROIs as the first eigenvariate of all voxels within an 8 mm sphere centered around the individual coordinates. Time series were mean-centered and movement-related variance was removed (by regression using the realignment parameters).

2.4. Dynamic causal modeling

Dynamic causal modeling (DCM; Friston et al., 2003) is a generative model that enables inference on hidden (latent) neuronal states from measured neuroimaging data. For fMRI, dynamics of neuronal activity are described as a function of the effective (directed) connectivity among neuronal populations:

dxdt=(A+jB(j)uj)x+Cu (1)

where x represents neuronal states, A encodes endogenous connectivity among brain regions, B(j) represents the modulatory influence that input uj exerts on endogenous connections, and C quantifies the strength of experimentally controlled inputs (perturbations) on brain regions. Integrating Eq. (1) yields a predicted neuronal time course which is then passed through a nonlinear hemodynamic model that translates neuronal signal into predicted BOLD signal (Buxton et al., 1998; Friston et al., 2000; Stephan et al., 2007). This yields a complete forward mapping from hidden neuronal states to observable fMRI data and, under Gaussian assumptions about the measurement noise, specifies the likelihood function. By specifying prior distributions over model parameters (neuronal, hemodynamic) and hyperparameters (measurement noise), DCM becomes a fully generative model. Model inversion then proceeds with approximate Bayesian schemes, most commonly variational Bayes under the Laplace approximation (VBL; Friston et al., 2007).

2.4.1. Definition of model space

Inference on effective connectivity is conditional on the underlying model (e.g., assumptions about the network architecture). However, there typically exist several a priori hypotheses about the likely network structure. This model uncertainty leads to defining a model space, a set of alternative plausible candidate models. Here, a total of seven models were constructed, representing different possible connectivity structures in the above-mentioned emotional face perception network. For all models, endogenous connectivity and driving inputs were identical. Driving inputs were set to elicit face-sensitive activation (i.e., containing all face stimuli regardless of whether an emotional or neutral face was presented, but excluding scrambled faces) in left and right OFA, consistent with their proposed role as the first stage in the face perception network (Haxby et al., 2000; Pitcher et al., 2011). The stimulus-evoked activity then propagated through the network via intra- and interhemispheric connections. We assumed forward and backward intrahemispheric connections between OFA and FFA, and between FFA and amygdala, but not between OFA and amygdala – consistent with the notion of a hierarchy in the face perception network (Fairhall and Ishai, 2007; Haxby et al., 2000). Additionally, reciprocal interhemispheric connections were set between homotopic regions (Catani and Thiebaut de Schotten, 2008; Clarke and Miklossy, 1990; Van Essen et al., 1982; Zeki, 1970; Zilles and Clarke, 1997), but omitted between heterotopic regions (Catani and Thiebaut de Schotten, 2008; Hofer and Frahm, 2006).

For this basic structure, seven different modulatory input patterns were defined (Fig. 1), representing distinct hypotheses of how emotion processing could modulate intra- and interhemispheric connections in the extended face perception network (Fairhall and Ishai, 2007; Frässle et al., 2016). Emotion processing could modulate either (i) forward, (ii) backward, or (iii) forward and backward intrahemispheric connections. Similarly, emotion processing could modulate interhemispheric connections or not. This yielded six different models, representing all possible combinations of the above effects. Furthermore, we included a “null” model (model 7) where none of the connections were modulated.

Fig. 1.

Fig. 1

Different plausible hypotheses of the effective connectivity pattern in the network mediating emotional face perception. Forward and backward intrahemispheric endogenous connections were set between OFA and FFA, and between FFA and amygdala (Amy). Additional, reciprocal interhemispheric connections were set between bilateral OFA, bilateral FFA and bilateral amygdala. Driving inputs comprised all faces, regardless of the emotional valence, and were allowed to drive neuronal activity in the left and right OFA. While endogenous connectivity and driving inputs were identical for all models, they differed in the assumed modulatory influences of emotion processing. Emotion processing could either modulate (i) forward (models 1&4), (ii) backward (models 2&5), or (iii) forward and backward intrahemispheric connections (models 3&6). Additionally, emotion processing (i) modulated (models 1–3) or (ii) did not modulate interhemispheric connections among homotopic brain regions (models 4–6). Systematically varying all combinations resulted in six distinct models. Finally, we also included a “null” model (i.e., model 7, not shown) where none of the intra- and interhemispheric connections was modulate by emotion processing.

Driving and modulatory inputs were not mean-centered. Model inversion was performed using DCM12 (SPM12, version R7487). For details, see Supplementary Material S3.

2.4.2. Bayesian model averaging

We computed individual parameter estimates by means of Bayesian model averaging (BMA; Penny et al., 2010) across all models in our model space within a pre-specified Occam's window (πocc=0.05). For details, see Supplementary Material S4. BMA parameter estimates represent a weighted average across the models considered, where each model contributes according to its posterior model probability. In order to prevent any cross-talk between training and test sample, BMA parameters were computed for each participant individually.

2.5. Generative embedding

The posterior means of BMA parameter estimates (78 in total) from each participant were used to create a generative score space for a discriminative classification method. Within this space, a linear kernel representing the inner product k(xi,xj)=<xi,xj> was used to compare two instances (participants). A support vector machine (SVM) was applied for binary classification of pairwise combinations of the three MDD groups (i.e., REM, IMP, and CHR). Specifically, we used the fitcsvm routine in Matlab. Estimates of classification performance were obtained by leave-one-out cross-validation. Here, in each fold, the classifier is trained on n1 participants (the training set) and tested on the left-out participant. Using the training set only, the hyperparameters of the SVM (box constraint and kernel scale; see Supplementary Material S5) were optimized using in-built routines of fitcsvm. This computes Bayes-optimal hyperparameters using the expected improvement acquisition function (Frazier, 2018) based on (inner) five-fold cross validation. This approach is known as nested cross-validation (Cawley and Talbot, 2010; Stone, 1974). By default, fitcsvm solves SVMs using the Sequential Minimal Optimization algorithm (SMO; Fan et al., 2005). Significance of the classification result was assessed using permutation tests. Here, an empirical null distribution of the balanced accuracy is computed by randomly permuting the participant labels and re-fitting the entire classification model (i.e., training and testing) based on these new labels (Good, 2000; Ojala and Garriga, 2010). For each permutation, the balanced accuracy is re-evaluated. Here, we used 1000 permutations. The p-value is then computed as the rank of the original balanced accuracy in the distribution of permutation-based balanced accuracies, divided by the total number of permutations.

3. Results

3.1. Group differences in effective connectivity

Effective connectivity among the regions of the extended face perception network (i.e., OFA, FFA, and amygdala, each in both hemispheres) was assessed using DCM for fMRI (Friston et al., 2003). First, we provide a brief summary of the group differences in DCM parameter estimates as assessed using classical statistics (a comprehensive description of the classical analyses is provided in Supplementary Material S7).

Random-effects Bayesian model selection (Rigoux et al., 2014; Stephan et al., 2007) suggested model 3 to be the winning model at the group level with an expected posterior probability of 0.49 and a protected exceedance probability close to 1 (Supplementary Figure S4B). Nevertheless, at the single-subject level, other models received non-negligible posterior probabilities as well. To account for this variability, individual connectivity parameters were estimated using BMA (Penny et al., 2010) over all seven models in the model space within the default Occam's window (πocc=0.05).

When testing for group differences in the BMA parameter estimates (two-sample t-tests), no significant differences were found when correcting for multiple comparisons based on the false discovery rate (Benjamini and Yekutieli, 2001). At a more liberal threshold (p < 0.05, uncorrected), some differences between the patient subgroups were observed (see Supplementary Figure S6). In brief, effective connectivity patterns differed mainly in how emotions modulated functional integration among the face-processing (i.e., bilateral OFA and FFA) and emotion-sensitive regions (i.e., bilateral amygdala). Overall, emotions tended to exert stronger modulatory influences in patients showing fast remission as compared to patients with a chronic disease trajectory (Supplementary Figure S6, left) and patients with gradual improvement of symptoms (Supplementary Figure S6, right). For the comparison between CHR and IMP patients, the pattern was more ambiguous (Supplementary Figure S6, middle).

While – as highlighted above – none of the group differences survived multiple comparisons correction, these findings suggest that small alterations of different effective connectivity strengths exist between CHR, IMP and REM patients, which enable classification when jointly considered as features.

3.2. Classification of clinical trajectories

3.2.1. Predictive accuracy of effective connectivity parameters

Our results suggest that DCM parameter estimates discriminated patients with chronic disease trajectory from patients showing fast remission, with a balanced accuracy of 79% (p < 0.001; Fig. 2, blue). We then evaluated the underlying receiver-operating characteristic (ROC) and precision-recall (PR) curves (Fig. 3A+B). From the ROC curve, the area under the curve (AUC) for discriminating CHR from REM patients evaluated to 0.87 (blue curve). Notably, high recall (sensitivity) might come at the expense of low positive predictive value (PPV; also known as “precision”) – particularly, in the presence of class imbalances. This is problematic for clinical applications where one strives to maximize sensitivity while at the same time keeping PPV high. Hence, we also inspected the PR curves and found that our classifier achieved 97% sensitivity at a PPV of 86% when discriminating CHR from REM patients.

Fig. 2.

Fig. 2

Balanced accuracy for the binary classifiers as assessed using leave-one-out cross validation for the three different subgroup comparisons – that is, CHR vs. REM (left), CHR vs. IMP (middle), and IMP vs. REM (right). Balanced accuracies are shown for the different features – namely, effective connectivity parameters (DCM; blue), functional connectivity (FC; light grey), and local BOLD activity (CI, dark grey). Asterisks above the bars indicate significant classification performance as assessed by means of permutation tests where an empirical null distribution of the balanced accuracy is computed by randomly permuting the participant labels and re-evaluating the classifier based on these new labels. Additionally, asterisks above the lines connecting two bars indicate significant differences in classification performance between different data features as assessed using the asymptotic McNemar test.

Fig. 3.

Fig. 3

Performance curves for the two binary classifiers that achieved above-chance balanced accuracies – that is, CHR vs. REM (blue curve) and IMP vs. REM (red curve). (A) receiver-operating characteristic (ROC) curves, illustrating the trade-off between the true positive rate (sensitivity) and the false positive rate (1-specificity) across the entire range of detection thresholds, (B) precision-recall (PR) curves, illustrating the trade-off between the precision (positive predictive value) and recall (true positive rate) for different thresholds, and (C) accuracy-reject curves, representing the accuracy of a classifier as a function of the rejection rate (Nadeem et al., 2010). For a comprehensive summary of all classification results, see Table 2. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Effective connectivity parameter estimates also discriminated between IMP and REM patients, with a balanced accuracy of 61% (p = 0.03; Fig. 2), although this did not reach significance when correcting for multiple comparisons (αBonf = 0.0056). The AUC was 0.63 (Fig. 3A+B; red curve). In contrast, CHR patients could not be differentiated from IMP patients above-chance level (balanced accuracy: 47%, p = 0.92; Fig. 2), corresponding to an AUC of 0.35. Table 2 provides a comprehensive summary of all classification results.

Table 2.

Classification results for the generative embedding procedure. Shown are key performance measures of the classification algorithm, including: balanced accuracy, area under the curve, sensitivity (recall), specificity, positive predictive value (precision), and negative predictive value. Performance measures are shown for the three different binary classifications (i.e., CHR vs. REM, CHR vs. IMP, and IMP vs. REM).

Classification CHR (n = 15)
vs. REM (n = 39)
CHR (n = 15)
vs. IMP (n = 31)
IMP (n = 31) vs.
REM (n = 39)
Accuracy 0.87 0.63 0.63
Balanced accuracy 0.79 0.47 0.61
Area under the curve (AUC) 0.87 0.35 0.63
Sensitivity (recall) 0.97 0.94 0.77
Specificity 0.60 0 0.45
Positive predictive value (Precision) 0.86 0.66 0.64
Negative predictive value 0.90 0 0.61

Since groups differed significantly in age (but no other variable; Table 1), we repeated the analysis after regressing out age as a confound from the DCM parameter estimates. We found results to be highly consistent (although with slightly decreased accuracies), suggesting that our results are not confounded by age (Supplementary Material S8).

3.2.2. Comparison to functional connectivity and fMRI activity

We compared the GE results to a conventional approach in which the classifier operates on estimates of functional connectivity (FC). Following standard practice, FC was computed in terms of Pearson's correlations among the same BOLD signal time series that had previously been used for the DCMs. However, FC measures did not discriminate between the different clinical trajectories above chance, with balanced accuracies of 50% (p = 0.77) for CHR vs. REM patients, 42% (p = 0.996) for CHR vs. IMP patients, and 50% (p = 0.37) for IMP vs. REM patients (Fig. 2, light grey). Importantly, for discriminating CHR from REM patients, GE significantly outperformed FC estimates (p = 0.01; asymptotic McNemar test1 as implemented in MATLAB's testcholdout function).

In addition, we tested whether the different clinical trajectories could be distinguished based on measures of BOLD activity (CI) from the same ROIs as utilized for the connectivity-based analyses. Local BOLD activity was quantified in terms of the mean and standard deviation of the contrast estimates within the 8 mm spheres for all face-related contrasts (i.e., contrasts representing the individual regressors of angry, fearful, happy, sad, and neutral faces; see Methods). Under this approach, the different clinical trajectories were indistinguishable, with balanced accuracies of 50% (p = 0.72) for CHR vs. REM patients, 48% (p = 0.81) for CHR vs. IMP patients, and 48% (p = 0.51) for IMP vs. REM patients (Fig. 2, dark grey). As for FC, GE significantly outperformed local BOLD activity for distinguishing CHR from REM patients (p = 0.01).

Notably, these analyses are not meant to represent an optimal prediction approach based on FC or CI measures. Higher accuracies for classification based on FC/CI might be achieved by taking into account the whole-brain information (e.g., Crowther et al., 2015; Fu et al., 2008; Schmaal et al., 2015). Furthermore, it is worth pointing out that the number of features that enter classification are different for GE, FC and CI. However, the purpose of the above analysis was to compare predictions based on different fMRI-based features derived from the exact same data (i.e., the BOLD activity from the ROIs of the emotional face processing network).

3.3. Assessment of predictive confidence

Next, to assess the predictive confidence of our GE approach, we computed accuracy-reject curves for the two binary classifiers that achieved above-chance balanced accuracies (i.e., CHR vs. REM, IMP vs. REM). Accuracy-reject curves illustrate a classifier's accuracy when only predictions greater than a certain (relative) confidence threshold are considered (Nadeem et al., 2010). Hence, this resembles classification with a reject option (Bishop, 2006), where cases that do not meet a certain confidence criterion can be deferred to a clinician. We found that for distinguishing CHR from REM patients, the classifier yielded perfect classification accuracy at a rejection threshold of 60% of participants (Fig. 3C; blue curve). Furthermore, the accuracy-reject curve overall increased as function of rejection rate, suggesting that participants further away from the decision hyperplane were more likely to be assigned correctly to their respective class. In contrast, for distinguishing IMP from REM patients no such cut-off could be identified, and the curve did not reveal a steady increase as a function of rejection rate (Fig. 3C; red curve).

3.4. Inspection of the generative score space

One benefit of GE is that features represent model parameter estimates, which, depending on the model, may be neurobiologically interpretable. Hence, in a next step, we interrogated our generative score space to illustrate which features contributed most to the classification performance.

3.4.1. Inspection of individual predictive features

In a first step, we aimed to pinpoint the individual contribution of each feature (i.e., DCM parameter estimate) separately for the two significant classifiers (i.e., CHR vs. REM, IMP vs. REM). Importantly, individual feature weights of linear classifiers are not directly interpretable because high magnitudes of feature weights might either indicate an association with the label or a “suppressor” variable that cancels out noise or mismatch in other colinear variables (Haufe et al., 2014; Naselaris et al., 2011). Therefore, we followed the procedure described by Haufe et al. (2014) and first transformed all feature weights into patterns based on a corresponding forward mapping.

For both classifiers, the features with the highest average (across cross-validation folds) scores were situated along the dimension of modulatory (emotional) influences (Fig. 4, top; for an alternative visualization, see Supplementary Figure S8), whereas the endogenous connectivity and driving input parameters did not distinguish strongly between the groups. Importantly, since averaging over cross-validation folds might artificially smooth the weights due to correlations among folds, we inspected the variability of the observed results across the individual cross-validation folds. This suggested that the observed pattern was highly consistent for both classifiers (Fig. 4, bottom).

Fig. 4.

Fig. 4

Illustration of the relevance of individual features. First, feature weights were transformed into feature patterns to allow for interpretability (Haufe et al., 2014). The respective score of each individual feature (DCM parameter) is then shown as a polar plot for the classifier distinguishing (A) patients with a chronic disease trajectory from patients that showed fast remission (CHR vs. REM), and (B) patients with gradual improvement of symptom severity from patients that showed fast remission (IMP vs. REM). (Top) Magnitude of scores computed as the average across all cross-validation folds, (bottom) magnitude of scores for each cross-validation fold individually, normalized to the maximum score within each fold for displaying purposes. The grey area represents endogenous connectivity and driving input parameters, showing less pronounced scores as the modulatory parameters. Endogenous connectivity is colored in blue, modulatory influences of happy faces in red, modulatory influences of angry faces in yellow, modulatory influences of fearful faces in violet, modulatory influences of sad faces in green, and driving inputs (related to all faces regardless of the emotional valence) in cyan. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

In brief, for distinguishing CHR from REM patients, the modulatory influence of happy faces on the connection from right amygdala to right FFA received the highest score (Fig. 4A and Supplementary Figure S8A). Furthermore, scores were high for modulatory influences of negative emotions (i.e., fear, anger, and sadness) on connections among face-processing and emotion-sensitive regions. For instance, modulatory influences by angry faces on the connection from right amygdala to right FFA and left amygdala, and on the connection from left OFA to left FFA showed high loads. Similarly, the modulation of connections from right OFA and left FFA to right FFA, as well as the connection from right FFA to right amygdala by fearful faces received high scores.

For distinguishing IMP from REM patients, the modulatory influence of happy faces on the connection from right FFA to right OFA received the highest score (Fig. 4B and Supplementary Figure S8B). Modulatory influences by angry and sad faces on this connection also showed high loads. Similarly, modulation of connections among FFA and amygdala in both hemispheres by angry faces scored highly, as well as modulation of various endogenous connections (e.g., left amygdala to left FFA, left to right OFA, left FFA to left OFA) by happy faces.

Overall, the most consistently important connections – that is, the connections for which modulations received high scores for all emotional valences and both classifiers – are the connections from left to right OFA, right FFA to right OFA, right amygdala to right FFA, as well as the reciprocal interhemispheric connections between bilateral FFA.

In summary, the model-based distinction between CHR/IMP patients from REM patients relied on the expression of trial-by-trial modulation of connections (by emotional contents) within the face perception network. Put simply, our results suggest that abnormal dynamic changes of connections involved in processing emotional faces are associated with higher risk of developing a less favorable clinical course (see also Supplementary Figure S6).

4. Discussion

This paper examines the utility of GE for predicting individual clinical trajectories of MDD patients over a two-year period. Using fMRI data from the NESDA study and DCM to infer effective connectivity within the emotional face perception network, model parameter estimates served as features for supervised learning. This GE approach enabled the prediction of whether a given patient would show a chronic disease course or fast remission, with a balanced accuracy of 79%. Additionally, patients with gradual improvement in symptom severity could be distinguished from those who remitted quickly with a balanced accuracy of 61%. GE outperformed SVM-based classification based on more conventional (descriptive) features, i.e., FC or local activation estimates derived from the same data within the network of interest. Similar to previous studies (Brodersen et al., 2014, 2011), these findings demonstrate that using a plausible generative model as the basis for classification can enhance classification accuracy significantly.

Apart from the superior classification accuracy, another advantage of GE is that results can be interpreted in terms of the mechanisms represented by the underlying generative model. To this end, one can interrogate the generative score space to identify the features that are most discriminative between the different classes (Brodersen et al., 2011). Here, we addressed this by first transforming the feature weights of the linear SVM into patterns, following previous recommendations (Haufe et al., 2014). Inspecting these scores then allowed to pinpoint those features contributing most to the classification between the different naturalistic courses (Fig. 4 and Supplementary Figure S8). This analysis suggested that groups differed primarily along the dimension encoded by the modulatory parameters, which represent trial-by-trial changes in endogenous connections by emotional valence of faces. Put differently, it is the dynamic modulation of connections by emotional contents of faces that allows for predicting the clinical trajectory of an individual patient – not the average connectivity across all trials. In conclusion, our analysis implies that it is the reactivity of the face processing network to emotional stimuli – in terms of reconfiguring its connection strengths trial-by-trial – which enables predicting future clinical trajectories of individual patients.

These results are consistent with conventional group comparisons of the connectivity patterns (see Results and Supplementary Material S7), which, however, do not allow for single-subject predictions. Our results are also consistent with previous work suggesting aberrant processing and regulation of emotions as a key pathomechanism in MDD (Harmer et al., 2009; Rive et al., 2013). For instance, an fMRI meta-analysis demonstrated valence-dependent effects of emotional stimuli on amygdala and fusiform gyrus in depression, with hyperactivation for negative and hypoactivation for positive stimuli (Groenewold et al., 2013; but see Muller et al., 2017). Reduced amygdala activity to positive emotional stimuli has also been associated with anhedonia (Stuhrmann et al., 2013). Similarly, functional integration of the emotion processing network is altered in MDD (Almeida et al., 2009; Greicius et al., 2007; Mayberg, 1997). Alterations in emotion processing have also been suggested to have some clinical utility. For instance, implicit processing of affective facial expressions related to a diagnosis of MDD (Fu et al., 2007), and longitudinal neuroimaging studies reported normalization of activations by emotion processing under pharmacotherapy (Ai et al., 2019; Anand et al., 2007; Fu et al., 2004; Godlewska et al., 2012; Murphy et al., 2009; Robertson et al., 2007; Sheline et al., 2001).

Our classification accuracies are comparable to the results obtained by Schmaal et al. (2015). For CHR vs. REM patients, GE yielded a higher predictive accuracy than the best result reported by Schmaal and colleagues (balanced accuracy: 73%) when not accounting for age. When age was regressed out as a confound, the predictive accuracy of GE (balanced accuracy: 74%) was on par with the result reported in Schmaal et al. (2015). Otherwise, our procedure yielded somewhat complementary results: while Schmaal and colleagues could distinguish CHR from IMP patients but not IMP from REM patients, the opposite held for GE. This may be due to differences in classification procedure and features: Schmaal et al. used GPCs on whole-brain contrast images, whereas we applied linear SVMs to DCM parameter estimates from a small (six-region) network. Furthermore, their analysis used the artifact-confounded MR data (see Methods); hence, it remains to be tested whether classification accuracy would change when the artifact-corrected images are used.

Previous attempts to obtain single-patient predictions in MDD have almost exclusively concerned short-term treatment responses to specific interventions. For instance, seminal PET work demonstrated that cingulate metabolism differentiated distinct treatment responses (Mayberg et al., 1997). For fMRI, brain activity during the processing of sad faces allowed predicting treatment outcome to antidepressant medication in individual (Fu et al., 2008). Graph-theoretical measures based on FC in the default mode network at baseline was associated with changes in symptom severity after two weeks of medication (Shen et al., 2015). Furthermore, activation (Siegle et al., 2012) and FC (Crowther et al., 2015; Walsh et al., 2017) were predictive of psychotherapy outcome. Similarly, FC of subcallosal cingulate cortex with insula, dorsal midbrain and ventromedial prefrontal cortex was differentially associated with remission and treatment failure to cognitive-behavioral therapy and antidepressant medication (Dunlop et al., 2017). Finally, clinical responses to transcranial direct current stimulation of left prefrontal cortex could be predicted in unmedicated MDD patients (Nord et al., 2019).

Arguably, the attempt to predict outcome after two years in a naturalistic setting, as in NESDA, represents a greater challenge than predicting short-term response to a particular treatment. NESDA (1) recruited patients from a wide spectrum, including community, primary care and specialized mental health organizations, (2) encompassed a wide range of depressive phenotypes from very mild to severe, and (3) did not standardize treatments or occurrence of life events over the 2-year follow-up period (Penninx et al., 2008). This represents a strength of the NESDA dataset since it allows testing the course of MDD in a realistic setting which reflects the clinical heterogeneity that physicians face on a daily basis.

Existing attempts to predict MDD trajectories have focused on clinical or cognitive features (Gueorguieva et al., 2017; Kessler et al., 2016; Vogelzangs et al., 2014; Vreeburg et al., 2013). Recently, Dinga et al. (2018) systematically assessed the predictive value of non-imaging data, using clinical, psychological and biological measures from the NESDA study. They found that clinical measures performed best with balanced accuracies around 66%, while endocrine and immunological measures (e.g., cortisol, inflammatory markers, metabolic syndrome markers) did not distinguish between clinical trajectories. Interestingly, consistent with our GE results, they could primarily discriminate REM patients from the other two groups.

Our study is subject to several limitations. First, while our sample size (n = 85) does not fare badly compared to previous imaging-based prediction studies on MDD, the sample size is too modest for establishing predictions that can be expected to generalize robustly; particularly, the chronic group is small, comprising only 15 patients. Unfortunately, we do not have access to a separate validation set at the present time. Hence, our results should be understood as a proof-of-concept regarding the potential benefits of GE for clinical predictions, not as providing a mature prognostic tool for MDD. Second, the NESDA sample utilized in the present work was acquired at multiple sites. While the proportion of patients in the three clinical trajectory groups did not significantly differ across sites (see Table 1) – rendering any potential bias on the reported classification results unlikely – prediction might still benefit from a more thorough data harmonization (Fortin et al., 2017; Yu et al., 2018). Third, we here adopted the classical view on hierarchical processing in the face perception network (Fairhall and Ishai, 2007; Haxby et al., 2000). This view could be extended, given that recent work suggested a direct subcortical pathway, from the superior colliculus to the amygdala via the pulvinar, for rapid threat detection during emotional face perception (McFadyen et al., 2019). This could be accounted for by expanding the present model space and allowing for additional driving inputs into the amygdala. Fourth, the classical DCM approach employed here is restricted to small networks to keep model inversion computationally feasible (Daunizeau et al., 2011; Frässle et al., 2018b). Consequently, we focused on a six-region network comprising only core regions of the face perception network. However, depression is characterized by more widely distributed network organization (Greicius et al., 2007; Mayberg, 1997; Wang et al., 2012) and, hence, inferring whole-brain effective connectivity represents a promising next step. This could be achieved by exploiting recent advances in generative models that are computationally highly efficient (Frässle et al., 2018a, 2017).

Furthermore, in line with Schmaal et al. (2015), we used binary classifiers which can only distinguish between two disease trajectories. However, this approach does not allow for single-class predictions, which rests on multi-class classification (Bishop, 2006). Extending our classification scheme beyond binary classification is likely to be of clinical relevance, as multi-class prediction more faithfully resembles the decision process that physicians routinely engage in. In addition, an attractive alternative for future analyses is to predict continuous measures, such as time-to-recovery, rather than the discrete classes defined by the latent class growth analysis (Rhebergen et al., 2012). On a similar note, we anticipate that predicting the entire disease trajectories rather than discrete classes or continuous outcome measures will constitute an important future test of the predictive utility of effective connectivity patterns in MDD.

Finally, on a more general note, any biomarker in psychiatry will always yield imperfect predictions. This is because the course of psychiatric disorders is affected by a plethora of environmental factors which cannot be foreseen from physiological data, including the occurrence of stressful life events like loss, bereavement, or trauma (Horesh et al., 2008). Such external perturbations likely upper-bound the predictive accuracy of any biomarker, whether derived from neuroimaging or genetics.

Despite these limitations, the present study demonstrates the potential of GE for predicting clinical outcomes of MDD in a way that combines enhanced accuracy with biological interpretability of predictions. More generally, as illustrated by recent successful clinical applications (Symmonds et al., 2018), generative models offer an attractive strategy for establishing computational assays that could inform clinical decision-making in psychiatry (Frässle et al., 2018b). A critical condition for the future success (or failure) of this strategy will be the availability of large prospective patient datasets that, like NESDA, offer clinically relevant outcome data and allow for testing the generalizability and robustness of model-based clinical predictions in real-world settings.

Data availability

The data used in this study can be obtained via the standard NESDA data access procedure (see https://www.nesda.nl/nesda-english/). This requires researchers to file a data request (analysis plan) which has to be approved by the NESDA consortium in order to be granted access to the data. NESDA fully adheres to the FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles. Furthermore, we will make all analysis code publicly available on the GIT repository of ETH Zurich (https://gitlab.ethz.ch/tnu/code/generative-embedding-nesda).

CRediT authorship contribution statement

Stefan Frässle: Conceptualization, Formal analysis, Funding acquisition, Methodology, Software, Visualization, Writing - original draft, Writing - review & editing. Andre F. Marquand: Conceptualization, Funding acquisition, Data curation, Resources, Writing - review & editing. Lianne Schmaal: Conceptualization, Funding acquisition, Data curation, Resources, Writing - review & editing. Richard Dinga: Data curation, Writing - review & editing. Dick J. Veltman: Funding acquisition, Resources, Writing - review & editing. Nic J.A. van der Wee: Funding acquisition, Resources, Writing - review & editing. Marie-José van Tol: Funding acquisition, Resources, Writing - review & editing. Dario Schöbi: Formal analysis, Validation, Writing - review & editing. Brenda W.J.H. Penninx: Conceptualization, Funding acquisition, Project administration, Resources, Writing - review & editing. Klaas E. Stephan: Conceptualization, Funding acquisition, Resources, Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors report no conflict of interest.

Acknowledgements

The authors would like to thank all NESDA participants for taking part in the study. Furthermore, we would like to thank Jakob Heinzle for valuable advice. This work was supported by the ETH Zurich Postdoctoral Fellowship Program (SF), the Marie Curie Actions for People COFUND Program (SF), and the University of Zurich Forschungskredit Postdoc (SF), by a NHMRC Career Development Fellowship (1140764; LS), as well as by the René and Susanne Braginsky Foundation (KES) and the University of Zurich (KES). Furthermore, the infrastructure for the NESDA study (www.nesda.nl) is funded through the Geestkracht program of the Netherlands organization for Health Research and Development (ZonMw, grant number 10-000-1002) and financial contributions by participating universities and mental health care organizations (VU University Medical Center, GGZ inGeest, Leiden University Medical Center, Leiden University, GGZ Rivierduinen, University Medical Center Groningen, University of Groningen, Lentis, GGZ Friesland, GGZ Drenthe, Rob Giel Onderzoekscentrum).

Footnotes

1

The McNemar test is, strictly speaking, only valid when applied to a completely independent test set. Hence, comparing cross-validated classifiers might yield somewhat optimistic results. Having said this, for scenaria like ours, it is unclear what a statistically rigorous way would be to compare classifiers. For a discussion on this issue, see Dietterich (1998)

Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.nicl.2020.102213.

Appendix. Supplementary materials

mmc1.docx (65.1MB, docx)

References

  1. Ai H., Opmeer E.M., Marsman J.C., Veltman D.J., van der Wee N.J.A., Aleman A., van Tol M.J. Longitudinal brain changes in MDD during emotional encoding: effects of presence and persistence of symptomatology. Psychol. Med. 2019:1–11. doi: 10.1017/S0033291719001259. [DOI] [PubMed] [Google Scholar]
  2. Almeida J.R., Versace A., Mechelli A., Hassel S., Quevedo K., Kupfer D.J., Phillips M.L. Abnormal amygdala-prefrontal effective connectivity to happy faces differentiates bipolar from major depression. Biol. Psychiatry. 2009;66:451–459. doi: 10.1016/j.biopsych.2009.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Anand A., Li Y., Wang Y., Gardner K., Lowe M.J. Reciprocal effects of antidepressant treatment on activity and connectivity of the mood regulating circuit: an FMRI study. J. Neuropsychiatry Clin. Neurosci. 2007;19:274–282. doi: 10.1176/appi.neuropsych.19.3.274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Andrade L., Caraveo-Anduaga J., Berglund P., Bijl R., De Graaf R., Vollebergh W., Dragomirecka E., Kohn R., Keller M., Kessler R., Kawakami N., Kilic C., Offord D., Ustun T., Wittchen H. The epidemiology of major depressive episodes: results from the international consortium of psychiatric epidemiology (ICPE) surveys. Int. J. Methods Psychiatr. Res. 2003;12:3–21. doi: 10.1002/mpr.138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. American Psychiatric Association, 2013. Diagnostic and statistical manual of mental disorders (DSM-5 R). Am. Psychiatric Publ.
  6. Benjamini Y., Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 2001;29:1165–1188. [Google Scholar]
  7. Bishop C.M. Vol. 12. Springer; New York: 2006. p. 105. (Pattern Recognition and Machine Learning). 47. [Google Scholar]
  8. Breiter H.C., Etcoff N.L., Whalen P.J., Kennedy W.A., Rauch S.L., Buckner R.L., Strauss M.M., Hyman S.E., Rosen B.R. Response and habituation of the human amygdala during visual processing of facial expression. Neuron. 1996;17:875–887. doi: 10.1016/s0896-6273(00)80219-6. [DOI] [PubMed] [Google Scholar]
  9. Brodersen K.H., Deserno L., Schlagenhauf F., Lin Z., Penny W.D., Buhmann J.M., Stephan K.E. Dissecting psychiatric spectrum disorders by generative embedding. Neuroimage Clin. 2014;4:98–111. doi: 10.1016/j.nicl.2013.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Brodersen K.H., Schofield T.M., Leff A.P., Ong C.S., Lomakina E.I., Buhmann J.M., Stephan K.E. Generative embedding for model-based classification of fMRI data. PLoS Comput. Biol. 2011;7 doi: 10.1371/journal.pcbi.1002079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Buxton R., Wong E., Frank L. Dynamics of blood flow and oxygenation changes during brain activation: the balloon model. Magn. Reson. Med. 1998;39:855–864. doi: 10.1002/mrm.1910390602. [DOI] [PubMed] [Google Scholar]
  12. Catani M., Thiebaut de Schotten M. A diffusion tensor imaging tractography atlas for virtual in vivo dissections. Cortex. 2008;44:1105–1132. doi: 10.1016/j.cortex.2008.05.004. [DOI] [PubMed] [Google Scholar]
  13. Cawley G.C., Talbot N.L.C. On over-fitting in model selection and subsequent selection bias in performance evaluation. J. Mach. Learn. Res. 2010;11:2079–2107. [Google Scholar]
  14. Clarke S., Miklossy J. Occipital cortex in man - Organization of callosal connections, related myeloarchitecture and cytoarchitecture, and putative boundaries of functional visual areas. J. Comp. Neurol. 1990;298:188–214. doi: 10.1002/cne.902980205. [DOI] [PubMed] [Google Scholar]
  15. Crowther A., Smoski M.J., Minkel J., Moore T., Gibbs D., Petty C., Bizzell J., Schiller C.E., Sideris J., Carl H., Dichter G.S. Resting-State connectivity predictors of response to psychotherapy in major depressive disorder. Neuropsychopharmacology. 2015;40:1659–1673. doi: 10.1038/npp.2015.12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cuthbert B.N., Insel T.R. Toward the future of psychiatric diagnosis: the seven pillars of RDOC. BMC Med. 2013;11:126. doi: 10.1186/1741-7015-11-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Daunizeau J., David O., Stephan K. Dynamic causal modelling: a critical review of the biophysical and statistical foundations. Neuroimage. 2011;58:312–322. doi: 10.1016/j.neuroimage.2009.11.062. [DOI] [PubMed] [Google Scholar]
  18. de Graaf R., ten Have M., van Gool C., van Dorsselaer S. Prevalence of mental disorders and trends from 1996 to 2009. results from the Netherlands mental health survey and incidence study-2. Soc. Psychiatry Psychiatr. Epidemiol. 2012;47:203–213. doi: 10.1007/s00127-010-0334-8. [DOI] [PubMed] [Google Scholar]
  19. Demenescu L.R., Renken R., Kortekaas R., van Tol M.J., Marsman J.B., van Buchem M.A., van der Wee N.J., Veltman D.J., den Boer J.A., Aleman A. Neural correlates of perception of emotional facial expressions in out-patients with mild-to-moderate depression and anxiety. A multicenter fMRI study. Psychol. Med. 2011;41:2253–2264. doi: 10.1017/S0033291711000596. [DOI] [PubMed] [Google Scholar]
  20. Dietterich T.G. Approximate statistical tests for comparing supervised classification learning algorithms. Neural. Comput. 1998;10:1895–1923. doi: 10.1162/089976698300017197. [DOI] [PubMed] [Google Scholar]
  21. Dinga R., Marquand A.F., Veltman D.J., Beekman A.T.F., Schoevers R.A., van Hemert A.M., Penninx B., Schmaal L. Predicting the naturalistic course of depression from a wide range of clinical, psychological, and biological data: a machine learning approach. Transl. Psychiatry. 2018;8:241. doi: 10.1038/s41398-018-0289-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dunlop B.W., Rajendra J.K., Craighead W.E., Kelley M.E., McGrath C.L., Choi K.S., Kinkead B., Nemeroff C.B., Mayberg H.S. Functional connectivity of the subcallosal cingulate cortex and differential outcomes to treatment with cognitive-behavioral therapy or antidepressant medication for major depressive disorder. Am. J. Psychiatry. 2017;174:533–545. doi: 10.1176/appi.ajp.2016.16050518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fairhall S.L., Ishai A. Effective connectivity within the distributed cortical network for face perception. Cereb. Cortex. 2007;17:2400–2406. doi: 10.1093/cercor/bhl148. [DOI] [PubMed] [Google Scholar]
  24. Fan R.E., Chen P.H., Lin C.J. Working set selection using second order information for training support vector machines. J. Mach. Learning Res. 2005;6:1889–1918. [Google Scholar]
  25. Fortin J.P., Parker D., Tunc B., Watanabe T., Elliott M.A., Ruparel K., Roalf D.R., Satterthwaite T.D., Gur R.C., Gur R.E., Schultz R.T., Verma R., Shinohara R.T. Harmonization of multi-site diffusion tensor imaging data. Neuroimage. 2017;161:149–170. doi: 10.1016/j.neuroimage.2017.08.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Frässle S., Lomakina E.I., Kasper L., Manjaly Z.M., Leff A., Pruessmann K.P., Buhmann J.M., Stephan K.E. A generative model of whole-brain effective connectivity. Neuroimage. 2018;179:505–529. doi: 10.1016/j.neuroimage.2018.05.058. [DOI] [PubMed] [Google Scholar]
  27. Frässle S., Lomakina E.I., Razi A., Friston K.J., Buhmann J.M., Stephan K.E. Regression DCM for fMRI. Neuroimage. 2017;155:406–421. doi: 10.1016/j.neuroimage.2017.02.090. [DOI] [PubMed] [Google Scholar]
  28. Frässle S., Paulus F.M., Krach S., Schweinberger S.R., Stephan K.E., Jansen A. Mechanisms of hemispheric lateralization: asymmetric interhemispheric recruitment in the face perception network. Neuroimage. 2016;124:977–988. doi: 10.1016/j.neuroimage.2015.09.055. [DOI] [PubMed] [Google Scholar]
  29. Frässle S., Yao Y., Schobi D., Aponte E.A., Heinzle J., Stephan K.E. Generative models for clinical applications in computational psychiatry. Wiley Interdiscip. Rev. Cogn. Sci. 2018;9:e1460. doi: 10.1002/wcs.1460. [DOI] [PubMed] [Google Scholar]
  30. Frazier, P.I., 2018. A tutorial on bayesian optimization. arXiv e-prints.
  31. Friston K., Harrison L., Penny W. Dynamic causal modelling. Neuroimage. 2003;19:1273–1302. doi: 10.1016/s1053-8119(03)00202-7. [DOI] [PubMed] [Google Scholar]
  32. Friston K., Mattout J., Trujillo-Barreto N., Ashburner J., Penny W. Variational free energy and the Laplace approximation. Neuroimage. 2007;34:220–234. doi: 10.1016/j.neuroimage.2006.08.035. [DOI] [PubMed] [Google Scholar]
  33. Friston K.J., Fletcher P., Josephs O., Holmes A., Rugg M.D., Turner R. Event-related fMRI: characterizing differential responses. Neuroimage. 1998;7:30–40. doi: 10.1006/nimg.1997.0306. [DOI] [PubMed] [Google Scholar]
  34. Friston K.J., Mechelli A., Turner R., Price C.J. Nonlinear responses in fMRI: the balloon model, volterra kernels, and other hemodynamics. Neuroimage. 2000;12:466–477. doi: 10.1006/nimg.2000.0630. [DOI] [PubMed] [Google Scholar]
  35. Fu C.H., Williams S.C., Brammer M.J., Suckling J., Kim J., Cleare A.J., Walsh N.D., Mitterschiffthaler M.T., Andrew C.M., Pich E.M., Bullmore E.T. Neural responses to happy facial expressions in major depression following antidepressant treatment. Am J Psychiatry. 2007;164:599–607. doi: 10.1176/ajp.2007.164.4.599. [DOI] [PubMed] [Google Scholar]
  36. Fu C.H., Williams S.C., Cleare A.J., Brammer M.J., Walsh N.D., Kim J., Andrew C.M., Pich E.M., Williams P.M., Reed L.J., Mitterschiffthaler M.T., Suckling J., Bullmore E.T. Attenuation of the neural response to sad faces in major depression by antidepressant treatment: a prospective, event-related functional magnetic resonance imaging study. Arch. Gen. Psychiatry. 2004;61:877–889. doi: 10.1001/archpsyc.61.9.877. [DOI] [PubMed] [Google Scholar]
  37. Fu C.H.Y., Mourao-Miranda J., Costafrecla S.G., Khanna A., Marquand A.F., Williams S.C.R., Brammer M.J. Pattern classification of sad facial processing: toward the development of neurobiological markers in depression. Biol. Psychiatry. 2008;63:656–662. doi: 10.1016/j.biopsych.2007.08.020. [DOI] [PubMed] [Google Scholar]
  38. Godlewska B.R., Norbury R., Selvaraj S., Cowen P.J., Harmer C.J. Short-term SSRI treatment normalises amygdala hyperactivity in depressed patients. Psychol. Med. 2012;42:2609–2617. doi: 10.1017/S0033291712000591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Good P.I. Springer; New York: 2000. Permutation tests: a Practical Guide to Resampling Methods For Testing Hypotheses, 2nd ed. [Google Scholar]
  40. Greicius M.D., Flores B.H., Menon V., Glover G.H., Solvason H.B., Kenna H., Reiss A.L., Schatzberg A.F. Resting-state functional connectivity in major depression: abnormally increased contributions from subgenual cingulate cortex and thalamus. Biol. Psychiatry. 2007;62:429–437. doi: 10.1016/j.biopsych.2006.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Groenewold N.A., Opmeer E.M., de Jonge P., Aleman A., Costafreda S.G. Emotional valence modulates brain functional abnormalities in depression: evidence from a meta-analysis of fMRI studies. Neurosci. Biobehav. Rev. 2013;37:152–163. doi: 10.1016/j.neubiorev.2012.11.015. [DOI] [PubMed] [Google Scholar]
  42. Gueorguieva R., Chekroud A.M., Krystal J.H. Trajectories of relapse in randomised, placebo-controlled trials of treatment discontinuation in major depressive disorder: an individual patient-level data meta-analysis. Lancet Psychiatry. 2017;4:230–237. doi: 10.1016/S2215-0366(17)30038-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Gueorguieva R., Mallinckrodt C., Krystal J.H. Trajectories of depression severity in clinical trials of duloxetine: insights into antidepressant and placebo responses. Arch. Gen. Psychiatry. 2011;68:1227–1237. doi: 10.1001/archgenpsychiatry.2011.132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Harmer C.J., Goodwin G.M., Cowen P.J. Why do antidepressants take so long to work? A cognitive neuropsychological model of antidepressant drug action. Br. J. Psychiatry. 2009;195:102–108. doi: 10.1192/bjp.bp.108.051193. [DOI] [PubMed] [Google Scholar]
  45. Haufe S., Meinecke F., Gorgen K., Dahne S., Haynes J.D., Blankertz B., Biessmann F. On the interpretation of weight vectors of linear models in multivariate neuroimaging. Neuroimage. 2014;87:96–110. doi: 10.1016/j.neuroimage.2013.10.067. [DOI] [PubMed] [Google Scholar]
  46. Haxby J., Hoffman E., Gobbini M. The distributed human neural system for face perception. Trends Cogn. Sci. 2000;4:223–233. doi: 10.1016/s1364-6613(00)01482-0. [DOI] [PubMed] [Google Scholar]
  47. Hofer S., Frahm J. Topography of the human corpus callosum revisited - Comprehensive fiber tractography using diffusion tensor magnetic resonance imaging. Neuroimage. 2006;32:989–994. doi: 10.1016/j.neuroimage.2006.05.044. [DOI] [PubMed] [Google Scholar]
  48. Horesh N., Klomek A.B., Apter A. Stressful life events and major depressive disorders. Psychiatry Res. 2008;160:192–199. doi: 10.1016/j.psychres.2007.06.008. [DOI] [PubMed] [Google Scholar]
  49. Itani S., Rossignol M., Lecron F., Fortemps P. Towards interpretable machine learning models for diagnosis aid: a case study on attention deficit/hyperactivity disorder. PLoS ONE. 2019;14 doi: 10.1371/journal.pone.0215720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kanwisher N., McDermott J., Chun M. The fusiform face area: a module in human extrastriate cortex specialized for face perception. J. Neurosci. 1997;17:4302–4311. doi: 10.1523/JNEUROSCI.17-11-04302.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kapur S., Phillips A.G., Insel T.R. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol. Psychiatry. 2012;17:1174–1179. doi: 10.1038/mp.2012.105. [DOI] [PubMed] [Google Scholar]
  52. Kessler R.C., van Loo H.M., Wardenaar K.J., Bossarte R.M., Brenner L.A., Cai T., Ebert D.D., Hwang I., Li J., de Jonge P., Nierenberg A.A., Petukhova M.V., Rosellini A.J., Sampson N.A., Schoevers R.A., Wilcox M.A., Zaslavsky A.M. Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Mol. Psychiatry. 2016;21:1366–1371. doi: 10.1038/mp.2015.198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kohler S., Chrysanthou S., Guhn A., Sterzer P. Differences between chronic and nonchronic depression: systematic review and implications for treatment. Depress.Anxiety. 2019;36:18–30. doi: 10.1002/da.22835. [DOI] [PubMed] [Google Scholar]
  54. Lyketsos C., Nestadt G., Cwi J., Heithoff K., Eaton W. The life chart interview - A Standardized method to describe the course of psychopathology. Int. J. Methods Psychiatr. Res. 1994;4:143–155. [Google Scholar]
  55. MacQueen G.M. Magnetic resonance imaging and prediction of outcome in patients with major depressive disorder. J. Psychiatry Neurosci. 2009;34:343–349. [PMC free article] [PubMed] [Google Scholar]
  56. Mayberg H.S. Limbic-cortical dysregulation: a proposed model of depression. J. Neuropsychiatry Clin. Neurosci. 1997;9:471–481. doi: 10.1176/jnp.9.3.471. [DOI] [PubMed] [Google Scholar]
  57. Mayberg H.S., Brannan S.K., Mahurin R.K., Jerabek P.A., Brickman J.S., Tekell J.L., Silva J.A., McGinnis S., Glass T.G., Martin C.C., Fox P.T. Cingulate function in depression: a potential predictor of treatment response. Neuroreport. 1997;8:1057–1061. doi: 10.1097/00001756-199703030-00048. [DOI] [PubMed] [Google Scholar]
  58. McFadyen J., Mattingley J.B., Garrido M.I. An afferent white matter pathway from the pulvinar to the amygdala facilitates fear recognition. Elife. 2019;8 doi: 10.7554/eLife.40766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. McMahon F.J., Insel T.R. Pharmacogenomics and personalized medicine in neuropsychiatry. Neuron. 2012;74:773–776. doi: 10.1016/j.neuron.2012.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Muller V.I., Cieslik E.C., Serbanescu I., Laird A.R., Fox P.T., Eickhoff S.B. Altered brain activity in unipolar depression revisited: meta-analyses of neuroimaging studies. JAMA Psychiatry. 2017;74:47–55. doi: 10.1001/jamapsychiatry.2016.2783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Murphy S.E., Norbury R., O'Sullivan U., Cowen P.J., Harmer C.J. Effect of a single dose of citalopram on amygdala response to emotional faces. Br. J. Psychiatry. 2009;194:535–540. doi: 10.1192/bjp.bp.108.056093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Musliner K.L., Munk-Olsen T., Laursen T.M., Eaton W.W., Zandi P.P., Mortensen P.B. Heterogeneity in 10-Year course trajectories of moderate to severe major depressive disorder: a danish national register-based study. JAMA Psychiatry. 2016;73:346–353. doi: 10.1001/jamapsychiatry.2015.3365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Muthen B., Asparouhov T., Hunter A.M., Leuchter A.F. Growth modeling with nonignorable dropout: alternative analyses of the star*d antidepressant trial. Psychol. Methods. 2011;16:17–33. doi: 10.1037/a0022634. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Nadeem M.S.A., Zucker J.D., Hanczar B. Accuracy-Rejection curves (ARCs) for comparing classification methods with a reject option. Proc. Third Int. Workshop Mach Learn Syst. Biol. 2010;8:65–81. [Google Scholar]
  65. Naselaris T., Kay K.N., Nishimoto S., Gallant J.L. Encoding and decoding in fMRI. Neuroimage. 2011;56:400–410. doi: 10.1016/j.neuroimage.2010.07.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Nord C.L., Halahakoon D.C., Limbachya T., Charpentier C., Lally N., Walsh V., Leibowitz J., Pilling S., Roiser J.P. Neural predictors of treatment response to brain stimulation and psychological therapy in depression: a double-blind randomized controlled trial. Neuropsychopharmacology. 2019;44:1613–1622. doi: 10.1038/s41386-019-0401-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Ojala M., Garriga G.C. Permutation tests for studying classifier performance. J. Mach. Learn. Res. 2010;11:1833–1863. [Google Scholar]
  68. Pan P.M., Sato J.R., Salum G.A., Rohde L.A., Gadelha A., Zugman A., Mari J., Jackowski A., Picon F., Miguel E.C., Pine D.S., Leibenluft E., Bressan R.A., Stringaris A. Ventral striatum functional connectivity as a predictor of adolescent depressive disorder in a longitudinal community-based sample. Am. J. Psychiatry. 2017;174:1112–1119. doi: 10.1176/appi.ajp.2017.17040430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Paulus M. Bayesian neural adjustment of inhibitory control predicts emergence of problem stimulant use. Neuropsychopharmacology. 2015;40:S32. doi: 10.1093/brain/awv246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Penninx B., Nolen W., Lamers F., Zitman F., Smit J., Spinhoven P., Cuijpers P., de Jong P., van Marwijk H., van der Meer K., Verhaak P., Laurant M., de Graaf R., Hoogendijk W., van der Wee N., Ormel J., van Dyck R., Beekman A. Two-year course of depressive and anxiety disorders: results from the Netherlands study of depression and anxiety (NESDA) J. Affect. Disord. 2011;133:76–85. doi: 10.1016/j.jad.2011.03.027. [DOI] [PubMed] [Google Scholar]
  71. Penninx B.W., Beekman A.T., Smit J.H., Zitman F.G., Nolen W.A., Spinhoven P., Cuijpers P., De Jong P.J., Van Marwijk H.W., Assendelft W.J., Van Der Meer K., Verhaak P., Wensing M., De Graaf R., Hoogendijk W.J., Ormel J., Van Dyck R., Consortium N.R. The Netherlands study of depression and anxiety (NESDA): rationale, objectives and methods. Int. J. Methods Psychiatr. Res. 2008;17:121–140. doi: 10.1002/mpr.256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Penny W., Stephan K., Daunizeau J., Rosa M., Friston K., Schofield T., Leff A. Comparing families of dynamic causal models. PLoS Comput. Biol. 2010;6 doi: 10.1371/journal.pcbi.1000709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Phillips M.L., Chase H.W., Sheline Y.I., Etkin A., Almeida J.R., Deckersbach T., Trivedi M.H. Identifying predictors, moderators, and mediators of antidepressant response in major depressive disorder: neuroimaging approaches. Am. J. Psychiatry. 2015;172:124–138. doi: 10.1176/appi.ajp.2014.14010076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Pitcher D., Walsh V., Duchaine B. The role of the occipital face area in the cortical face perception network. Exp. Brain Res. 2011;209:481–493. doi: 10.1007/s00221-011-2579-1. [DOI] [PubMed] [Google Scholar]
  75. Power J.D., Barnes K.A., Snyder A.Z., Schlaggar B.L., Petersen S.E. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion. Neuroimage. 2012;59:2142–2154. doi: 10.1016/j.neuroimage.2011.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Puce A., Allison T., Asgari M., Gore J., McCarthy G. Differential sensitivity of human visual cortex to faces, letterstrings, and textures: a functional magnetic resonance imaging study. J. Neurosci. 1996;16:5205–5215. doi: 10.1523/JNEUROSCI.16-16-05205.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Rasmussen, C.E., Williams, C.K.I., 2005. Gaussian processes for machine learning. Gaussian Processes for Machine Learning, 1–247.
  78. Rhebergen D., Lamers F., Spijker J., de Graaf R., Beekman A.T., Penninx B.W. Course trajectories of unipolar depressive disorders identified by latent class growth analysis. Psychol. Med. 2012;42:1383–1396. doi: 10.1017/S0033291711002509. [DOI] [PubMed] [Google Scholar]
  79. Rigoux L., Stephan K.E., Friston K.J., Daunizeau J. Bayesian model selection for group studies - revisited. Neuroimage. 2014;84:971–985. doi: 10.1016/j.neuroimage.2013.08.065. [DOI] [PubMed] [Google Scholar]
  80. Rive M.M., van Rooijen G., Veltman D.J., Phillips M.L., Schene A.H., Ruhe H.G. Neural correlates of dysfunctional emotion regulation in major depressive disorder. A systematic review of neuroimaging studies. Neurosci. Biobehav. Rev. 2013;37:2529–2553. doi: 10.1016/j.neubiorev.2013.07.018. [DOI] [PubMed] [Google Scholar]
  81. Robertson B., Wang L.H., Diaz M.T., Aiello M., Gersing K., Beyer J., Mukundan S., McCarthy G., Doraiswamy P.M. Effect of bupropion extended release on negative emotion processing in major depressive disorder: a pilot functional magnetic resonance imaging study. J. Clin. Psychiatry. 2007;68:261–267. doi: 10.4088/jcp.v68n0212. [DOI] [PubMed] [Google Scholar]
  82. Robins L., Wing J., Wittchen H., Helzer J., Babor T., Burke J., Farmer A., Jablenski A., Pickens R., Regier D., Sartotius N., Towle L. The composite international diagnostic interview - An epidemiological instrument suitable for use in conjunction with different diagnostic systems and in different cultures. Arch. Gen. Psychiatry. 1988;45:1069–1077. doi: 10.1001/archpsyc.1988.01800360017003. [DOI] [PubMed] [Google Scholar]
  83. Rush A., Trivedi M., Wisniewski S., Nierenberg A., Stewart J., Warden D., Niederehe G., Thase M., Lavori P., Lebowitz B., McGrath P., Rosenbaum J., Sackeim H., Kupfer D., Luther J., Fava M. Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a star*d report. Am. J. Psychiatry. 2006;163:1905–1917. doi: 10.1176/ajp.2006.163.11.1905. [DOI] [PubMed] [Google Scholar]
  84. Schmaal L., Marquand A.F., Rhebergen D., van Tol M.J., Ruhé H.G., van der Wee N.J., Veltman D.J., Penninx B.W. Predicting the naturalistic course of major depressive disorder using clinical and multimodal neuroimaging information: a multivariate pattern recognition study. Biol. Psychiatry. 2015;78:278–286. doi: 10.1016/j.biopsych.2014.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Shawe-Taylor, J., Cristianini, N., 2004. Kernel methods for pattern analysis. cambridge University Press.
  86. Sheline Y.I., Barch D.M., Donnelly J.M., Ollinger J.M., Snyder A.Z., Mintun M.A. Increased amygdala response to masked emotional faces in depressed subjects resolves with antidepressant treatment: an fMRI study. Biol. Psychiatry. 2001;50:651–658. doi: 10.1016/s0006-3223(01)01263-x. [DOI] [PubMed] [Google Scholar]
  87. Shen Y.D., Yao J.S., Jiang X.Y., Zhang L., Xu L.Y., Feng R., Cai L.Q., Liu J., Wang J.H., Chen W. Sub-hubs of baseline functional brain networks are related to early improvement following two-week pharmacological therapy for major depressive disorder. Hum. Brain Mapp. 2015;36:2915–2927. doi: 10.1002/hbm.22817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Siegle G.J., Thompson W.K., Collier A., Berman S.R., Feldmiller J., Thase M.E., Friedman E.S. Toward clinically useful neuroimaging in depression treatment: prognostic utility of subgenual cingulate activity for determining depression outcome in cognitive therapy across studies, scanners, and patient characteristics. Arch. Gen. Psychiatry. 2012;69:913–924. doi: 10.1001/archgenpsychiatry.2012.65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Stephan K.E., Schlagenhauf F., Huys Q.J., Raman S., Aponte E.A., Brodersen K.H., Rigoux L., Moran R.J., Daunizeau J., Dolan R.J., Friston K.J., Heinz A. Computational neuroimaging strategies for single patient predictions. Neuroimage. 2017;145:180–199. doi: 10.1016/j.neuroimage.2016.06.038. [DOI] [PubMed] [Google Scholar]
  90. Stephan K.E., Weiskopf N., Drysdale P.M., Robinson P.A., Friston K.J. Comparing hemodynamic models with DCM. Neuroimage. 2007;38:387–401. doi: 10.1016/j.neuroimage.2007.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Stone M. Cross-Validatory choice and assessment of statistical predictions. J. R. Stat. Soc. Ser. B-Stat. Methodology. 1974;36:111–147. [Google Scholar]
  92. Stuhrmann A., Dohm K., Kugel H., Zwanzger P., Redlich R., Grotegerd D., Rauch A.V., Arolt V., Heindel W., Suslow T., Zwitserlood P., Dannlowski U. Mood-congruent amygdala responses to subliminally presented facial expressions in major depression: associations with anhedonia. J. Psychiatry Neurosci. 2013;38:249–258. doi: 10.1503/jpn.120060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Symmonds M., Moran C.H., Leite M.I., Buckley C., Irani S.R., Stephan K.E., Friston K.J., Moran R.J. Ion channels in EEG: isolating channel dysfunction in NMDA receptor antibody encephalitis. Brain. 2018;141:1691–1702. doi: 10.1093/brain/awy107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Van Essen D.C., Newsome W.T., Bixby J.L. The pattern of interhemispheric connections and its relationship to extrastriate visual areas in the macaque monkey. J. Neurosci. 1982;2:265–283. doi: 10.1523/JNEUROSCI.02-03-00265.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. van Tol M.J., van der Wee N.J., van den Heuvel O.A., Nielen M.M., Demenescu L.R., Aleman A., Renken R., van Buchem M.A., Zitman F.G., Veltman D.J. Regional brain volume in depression and anxiety disorders. Arch. Gen. Psychiatry. 2010;67:1002–1011. doi: 10.1001/archgenpsychiatry.2010.121. [DOI] [PubMed] [Google Scholar]
  96. Velligan D.I., Weiden P.J., Sajatovic M., Scott J., Carpenter D., Ross R., Docherty J.P. Strategies for addressing adherence problems in patients with serious and persistent mental illness: recommendations from the expert consensus guidelines. J. Psychiatr. Pract. 2010;16:306–324. doi: 10.1097/01.pra.0000388626.98662.a0. [DOI] [PubMed] [Google Scholar]
  97. Vogelzangs N., Beekman A.T., van Reedt Dortland A.K., Schoevers R.A., Giltay E.J., de Jonge P., Penninx B.W. Inflammatory and metabolic dysregulation and the 2-year course of depressive disorders in antidepressant users. Neuropsychopharmacology. 2014;39:1624–1634. doi: 10.1038/npp.2014.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Vreeburg S.A., Hoogendijk W.J., DeRijk R.H., van Dyck R., Smit J.H., Zitman F.G., Penninx B.W. Salivary cortisol levels and the 2-year course of depressive and anxiety disorders. Psychoneuroendocrinology. 2013;38:1494–1502. doi: 10.1016/j.psyneuen.2012.12.017. [DOI] [PubMed] [Google Scholar]
  99. Walsh E., Carl H., Eisenlohr-Moul T., Minkel J., Crowther A., Moore T., Gibbs D., Petty C., Bizzel J., Smoski M.J., Dichter G.S. Attenuation of frontostriatal connectivity during reward processing predicts response to psychotherapy in major depressive disorder. Neuropsychopharmacology. 2017;42:831–843. doi: 10.1038/npp.2016.179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Wang L., Hermens D.F., Hickie I.B., Lagopoulos J. A systematic review of resting-state functional-MRI studies in major depression. J. Affect. Disord. 2012;142:6–12. doi: 10.1016/j.jad.2012.04.013. [DOI] [PubMed] [Google Scholar]
  101. Wiecki T.V., Antoniades C.A., Stevenson A., Kennard C., Borowsky B., Owen G., Leavitt B., Roos R., Durr A., Tabrizi S.J., Frank M.J. A computational cognitive biomarker for early-stage huntington's disease. PLoS ONE. 2016;11 doi: 10.1371/journal.pone.0148409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Woo C.W., Chang L.J., Lindquist M.A., Wager T.D. Building better biomarkers: brain models in translational neuroimaging. Nat. Neurosci. 2017;20:365–377. doi: 10.1038/nn.4478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Yarkoni T., Poldrack R.A., Nichols T.E., Van Essen D.C., Wager T.D. Large-scale automated synthesis of human functional neuroimaging data. Nat. Methods. 2011;8:665–670. doi: 10.1038/nmeth.1635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Yu M., Linn K.A., Cook P.A., Phillips M.L., McInnis M., Fava M., Trivedi M.H., Weissman M.M., Shinohara R.T., Sheline Y.I. Statistical harmonization corrects site effects in functional connectivity measurements from multi-site fMRI data. Hum. Brain Mapp. 2018;39:4213–4227. doi: 10.1002/hbm.24241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Zeki S. Interhemispheric connections of prestriate cortex in monkey. Brain Res. 1970;19 doi: 10.1016/0006-8993(70)90237-4. 63–&. [DOI] [PubMed] [Google Scholar]
  106. Zilles K., Clarke S. Comparison With Non-Human primates. Cerebral Cortex: Extrastriate Cortex in Primates. Plenum Press; 1997. Architecture, connectivity and transmitter receptors of human extrastriate cortex; pp. 673–742. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (65.1MB, docx)

Data Availability Statement

The data used in this study can be obtained via the standard NESDA data access procedure (see https://www.nesda.nl/nesda-english/). This requires researchers to file a data request (analysis plan) which has to be approved by the NESDA consortium in order to be granted access to the data. NESDA fully adheres to the FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles. Furthermore, we will make all analysis code publicly available on the GIT repository of ETH Zurich (https://gitlab.ethz.ch/tnu/code/generative-embedding-nesda).


Articles from NeuroImage : Clinical are provided here courtesy of Elsevier

RESOURCES