The network approach to psychopathology has recently generated enthusiasm in the research community. This is likely due in large part to network methods being promoted with the promise of improving clinical prevention and intervention strategies by explicating the dynamic causal architecture of mental illness1. As a result, studies using network methods have proliferated with the aim of understanding causal interactions between psychiatric symptoms through empirical data.
As one example, there has been a substantial number of studies on the network structure of post‐traumatic stress disorder (PTSD) wherein each network typically includes estimation of centrality indices for 16‐20 symptoms, as well as the presence and weight of 120‐190 edges. Few guidelines inform how to parse the multitudes of exploratory results in each symptom network. Confirmation bias is consequently hard to avoid, and the validity of a network is easily rationalized by the identification of intuitive findings2. By contrast, a variety of post‐hoc explanations are available to dismiss unintuitive findings.
Estimated edges may represent a direct association between two symptoms (e.g., A→B or A←B), a reciprocal effect (A←→B), the common effect of an unmodelled variable (A←X→B), shared item content or method variance, or simply error (noise) in the data. Absent edges may represent conditional independence of two symptoms, or be the result of the specificity in the regularization method used. Central symptoms may cause other symptoms in the network and represent important targets for clinical intervention, or may be the consequence of those other symptoms and thus not useful targets for clinical intervention. Alternatively, as for estimated edges, high symptom centrality may summarize reciprocal relationships among symptoms, relationships with unmodelled variables, shared item content, method variance, or error. There are no methods for disentangling these different explanations of the focal parameters in cross‐sectional symptom networks, which severely limits their utility. In other words, the results are equivocal.
The fundamental reason for this undermining ambiguity is that, with few exceptions, the data used to investigate the network approach to psychopathology are ill‐suited to do so. Network theory is alluring because it describes dynamic causal processes that play out within individuals. However, no statistical procedure can extract this information from the type of cross‐sectional between‐subject data that dominate the literature3. Indeed, networks estimated on these data are not expected to accurately reflect individuals’ experiences or underlying causal processes, by network theorists' own arguments4. As such, the current state‐of‐the‐art networks lack the capacity to provide the very insights they have been promoted to offer.
The unreliability of edges further complicates the interpretation of symptom networks, which change based on the specific set of symptoms in the network, the measures used to assess the symptoms, the use of a clinical or community sample, the sample size, and the type of network analysis adopted5. Remarkably, even when these characteristics are all held equal, key details of the model often do not replicate within or between samples5, 6. This unreliability is predictable, given the intercorrelated nature of psychopathology symptoms, the limited reliability of single self‐ or clinician‐report items, and a focus on the fully partialled relationship between each pair of symptoms (e.g., edge A–B represents what symptoms A and B share with each other, but not with any other symptoms in the network). Together, these common features of current network methods result in edges that are prone to substantial measurement error, leading to spurious associations and high sensitivity to minor variations in study methods and samples. It is therefore difficult to identify generalizable insights in the symptom network literature that advance our understanding of psychopathology.
In contrast, proponents of the network approach recently stated7 that network structures replicate and generalize well, citing examples including “nearly identical” major depression and generalized anxiety disorder symptom networks, and a comparison of four PTSD networks. A closer look at these examples reveals, however, that almost a quarter (23%) of the total estimated edges were unreplicated between the two depression and anxiety networks5, and that well over half (64%) of the edges were inconsistently estimated – as present or absent, or positive or negative – among the four PTSD networks6.
The broader PTSD symptom network literature enables comparisons between additional studies that further highlight substantial inconsistencies8. Among eight of the studies in this literature that have used “state‐of‐the‐art” network methods9 in samples of people who have experienced trauma, the large majority (88%) of symptoms have been reported to have particularly high centrality – many in only a single paper, and none in a majority of the papers. Further, among these studies, all but three (98%) of the 120 possible edges among the PTSD symptoms in DSM‐IV and DSM‐5 have been estimated, and vary between studies in their presence, strength, sign, and hypothesized importance in the network.
There are not yet any methods that can indicate a priori whether or not a specific edge is likely to replicate. While it may be naive to expect exact replication, observed levels of inconsistency between networks seem particularly problematic in the context of a theory that emphasizes interpretation of the presence, absence, strength and sign of each individual edge and the corresponding centrality of individual symptoms. Importantly, optimistic perspectives on the reliability and replicability of symptom networks are often based on methods (e.g., bootnet, the omnibus NetworkComparisonTest, and correlations between lists of edges) that shift the focus away from these detailed features, and towards global network patterns that do not correspond with the basis of network theory or the insights that symptom networks have been promoted to provide6. The result is that these popular methods create an impression of reliability and replicability that fails to translate to the level at which networks are interpreted.
Our concerns surrounding the equivocal, stationary and ungeneralizable nature of current symptom network results contrast with the rhetoric in much of the network literature promising meaningful clinical insights from these methods. Alternative modeling methods and research designs – for example, collecting experimental data with reliable measurement of symptoms over time – are needed to make causal inferences about relationships between symptoms, and thus to achieve the aims of the network approach to psychopathology. Ultimately, it remains unclear what can be meaningfully concluded from the extant network literature with respect to the onset, maintenance or treatment of psychopathology.
This work was supported in part by the US National Institutes of Health (grants L30 MH101760 R01AG053217, AA025689 and U19AG051426), the Templeton Foundation and a Macquarie University Research Fellowship.
References
- 1. Borsboom D. World Psychiatry 2017;16:5‐13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Diaconis P. In: Hoaglin D, Mosteller F, Tukey J. (eds). Exploring data tables, trends, and shapes. New York: Wiley, 1985. [Google Scholar]
- 3. Molenaar PCM. Measurement 2004;2:201‐8. [Google Scholar]
- 4. Borsboom D, Cramer AO. Annu Rev Clin Psychol 2013;9:91‐121. [DOI] [PubMed] [Google Scholar]
- 5. Forbes MK, Wright AGC, Markon KE et al. J Abnorm Psychol 2017;126:969‐8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Forbes MK, Wright AGC, Markon KE et al. Multivariate Behav Res (in press). [Google Scholar]
- 7. Borsboom D, Robinaugh DJ, The Psychosystems Group et al. World Psychiatry 2018;17:143‐4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Forbes MK, Wright AGC, Markon KE et al. J Abnorm Psychol 2017;126:1011‐6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Borsboom D, Fried EI, Epskamp S et al. J Abnorm Psychol 2017;126:989‐99. [DOI] [PubMed] [Google Scholar]
