Skip to main content
Human Brain Mapping logoLink to Human Brain Mapping
. 2025 Jul 19;46(11):e70281. doi: 10.1002/hbm.70281

Using Extreme Value Statistics to Reconceptualize Psychopathology as Extreme Deviations From a Normative Reference Model

Charlotte Fraza 1,2, Mariam Zabihi 1,2,3, Christian F Beckmann 1,2,4, Andre F Marquand 1,2,5,
PMCID: PMC12275014  PMID: 40682494

ABSTRACT

For many problems in neuroimaging, the most informative features occur in the tail of the distribution. For example, when considering psychiatric disorders as deviations from a ‘norm’, the tails of the distribution are considerably more informative than the bulk of the distribution for understanding risk, stratifying and predicting such disorders, and for anomaly detection. Yet, most statistical methods used in neuroimaging focus on modeling the bulk and fail to adequately capture extreme values occurring in the tails. To address this, we propose a framework that combines normative models with multivariate extreme value statistics to accurately model extreme deviations of a reference cohort for individual participants. Normative models are now widely used in clinical neuroscience and similar to the employment of normative growth charts in pediatric medicine to track a child's weight in relation to their age; normative models can be used with neuroimaging measurements to quantify individual neurophenotypic deviations from a reference cohort. However, formal statistical treatment of how to model the extreme deviations from these models has been lacking until now. In this article, we provide such an approach inspired by applications of extreme value statistics in meteorology. Since the presentation of extreme value statistics is quite technical, we begin with a non‐technical introduction to the fundamental principles of extreme value statistics to accurately map the tails of the normative distribution for biological markers, including mapping multivariate tail dependence across multiple markers. Next, we give a demonstration of this approach to the UK Biobank dataset and demonstrate how extreme values can be used to accurately estimate risk and detect atypicality. This framework provides a valuable tool for the statistical modeling of extreme deviations in neurobiological data, which could provide us with more accurate and effective diagnostic tools for neurological and psychiatric disorders.


Overview of our approach for extreme value statistics. First, we fit a normative model to imaging phenotypes, before employing a ‘peaks‐over‐threshold’ approach widely used in meteorology and finance. This allows us to model tail dependencies and provides a multivariate measure of extremely dependence through a tail pairwise dependency matrix (TPDM).

graphic file with name HBM-46-e70281-g006.jpg

1. Introduction

In many domains, understanding the extreme values of a given variable is of great importance since they have the most significant impact on outcomes of interest (Zhao et al. 2010; Poon et al. 2004; Marsh and Dale 2002). In neuroscience, we are often similarly interested in the extreme values of a distribution. There are several reasons for this. First, many neurological and psychiatric disorders are characterized by patterns of deviations in specific regions or circuits of the brain (Zabihi et al. 2019; Wolfers et al. 2018; Rutherford, Fraza, et al. 2022; Bu et al. 2024; Loreto et al. 2024). These deviations may be difficult to detect using traditional statistical methods that focus on the mean or median values of the data. However, they may be more readily apparent in the extreme values of the data, which reflect significant deviations from the norm. For example, we know that atypical brain development underlies many brain disorders (Insel 2014; Marquand et al. 2019). If we understand early deviations in brain development better, we could provide risk factors for developing several mental disorders. For example, early extreme deviations in frontal cortex development from the norm could be a risk factor for attention deficit disorder (Shaw et al. 2007). Alternatively, an extreme sudden decrease of dopamine cells in the Substantia nigra could be an early biomarker for Parkinson's disease (Bloem et al. 2021; Berg et al. 2021). Small variations in brain development are generally not cause for concern, as they are within the normal range of individual differences. However, extreme deviations from normal brain development in multiple brain regions can be indicative of underlying neurological or psychiatric conditions that can affect a person's ability to function in their daily life. By accurately mapping these extreme deviations, researchers can gain a better understanding of the brain's structure and function and identify potential risk factors for psychiatric disorders such as depression, schizophrenia, and attention deficit hyperactivity disorder (ADHD). Second, many neurological and psychiatric disorders are complex and heterogeneous, with multiple subtypes and comorbidities (Allsopp et al. 2019; Feczko and Fair 2020; Hannon et al. 2022). By focusing on the tails of the distribution across multiple variables, researchers can identify patterns of tail dependence that might inform about distinct subtypes based on brain activity or structure. Finally, the tails of the distribution might provide particularly useful information for gauging prognosis and treatment response. For instance, people with more extreme values in particular brain regions may respond better to certain types of treatment or may have worse outcomes overall.

Reliable estimation of the tails of a distribution poses multiple challenges. Firstly, data in the tails of a distribution are, by definition, sparse, which makes it difficult to acquire a sufficient number of observations to draw meaningful conclusions (Goix et al. 2017). Secondly, classical approaches in statistics are designed to focus on the central part (i.e., bulk) of the distribution rather than the tails. This makes it hard to capture the subtle nuances of the tail behavior, leading to inaccurate estimation of tail probabilities (Davison and Huser 2015). This is crucial because, for many applications, underestimating or overestimating the likelihood of extreme events can lead to large errors in risk assessment and treatment decisions. Accurate estimation of tail probabilities is, therefore, critical for obtaining reliable estimates of risk and optimal clinical decision‐making. An additional problem is that extreme values usually do not occur in isolation; therefore, modeling dependencies in the tails of multiple variables is also important for accurate inference (Goix et al. 2017). In this respect, it is crucial to recognize that tail dependence does not in general correspond to correlation in the traditional sense. For instance, two variables may be highly correlated, yet show low tail dependence and vice versa. We illustrate this problem didactically using a watershed model inspired by hydrology in the next section.

In this paper, we propose a neuroimaging framework for understanding and exploring patterns in extreme brain deviations for the individual based on extreme value statistics and normative modeling. Normative modeling provides statistical estimates of the distribution of a given brain measure across the population and can be used to estimate individual risk or deviation scores (Marquand et al. 2019; Marquand, Rezek, et al. 2016) and has become a well‐established tool in clinical neuroscience. Multiple large‐scale normative models have been brought online (Rutherford, Fraza, et al. 2022; Bethlehem et al. 2022; Rutherford, Barkema, et al. 2023). It has also been applied to study the individual differences in multiple psychiatric and neurological disorders, finding deviation scores related to schizophrenia (Wolfers et al. 2018; Lv et al. 2021) ADHD (Bu et al. 2024; Kia et al. 2020; Wolfers et al. 2020; Tung et al. 2021; Ebadi et al. 2024), Autism Spectrum Disorder (ASD) (Tung et al. 2021; Bethlehem et al. 2020; Nunes et al. 2020; Zabihi et al. 2020; Looden et al. 2022; Meijer et al. 2023; Floris et al. 2024), and Alzheimer's disease (Loreto et al. 2024; Kumar et al. 2023; Verdi et al. 2023). See (Marquand et al. 2019) for a review. Combining the strength of normative modeling with extreme value statistics will allow us to accurately estimate extreme deviation risk scores for the individual. The combination of these methods enables us to establish brain‐behavior mappings that are more sensitive to the multivariate structure underlying the extreme deviations from an expected pattern (e.g., normative brain development). We predict that the accurate understanding of small probabilities of extreme events is crucial for understanding several types of behavior and psychiatric disorders.

The main goals of this paper are: first, to provide a didactic introduction to extreme value statistics along with guidelines for implementing them in practice, which is important because extreme value statistics is quite a technical topic. Second, we provide a proof of principle to demonstrate the use of multivariate extreme value statistics for neuroimaging data. Whilst we anticipate that extreme value statistics is widely useful in neuroimaging, it is particularly naturally suited to normative modeling, because normative models provide a flexible method to map multiple variables that might have different non‐Gaussian distributions (e.g., brain volume and cortical thickness) to z statistics, which have the same asymptotically Gaussian distribution. This is particularly important for calibrating and fitting these models.

In summary, our main contributions are: (i) Developing a comprehensive analytical framework for multivariate extreme value statistics for neuroimaging data; (ii) applying and demonstrating it on the UK Biobank dataset; (iii) establishing clear underlying tail pairwise dependence between several brain phenotypes; (iv) summarizing the tail pairwise dependence using principal components based on the extreme values, henceforth called extreme principal components. Finally, (v) we relate these extreme principal components to several behavioral phenotypes.

2. Theoretical Background

2.1. Normative Models

Normative modeling is a method for understanding and parsing the heterogeneity in the population. Similar to pediatric growth charts, which are used to model the distribution of height or weight of children according to their age and sex, normative models can be used to model neuroimaging data according to a set of clinical or demographic covariates (e.g., age, sex, and site) (Marquand, Rezek, et al. 2016; Marquand, Wolfers, et al. 2016). The outputs of this analysis are the median and centiles of variation of the data as a function of the covariates. Using normative models, we shift away from the case–control setting and are able to make inferences of deviation from a reference cohort for each individual. Normative models have been used extensively to create large reference cohorts, establish individual deviation scores from these reference cohorts, and link the deviation scores to diverse psychiatric problems (Zabihi et al. 2019; Wolfers et al. 2018; Rutherford, Fraza, et al. 2022; Lv et al. 2021; Wolfers et al. 2020). We typically aim to use the deviation scores from various structural and functional brain measures as features that are then associated with clinical or behavioral scores. Further mathematical details about the normative modeling approach employed here are provided in the supplement.

2.2. Conceptualizing Multivariate Deviations Using a Watershed Model

Most models of mental disorders conceptualize them as the emerging end product of aggregated causes, from disruptions in genetic pathways and brain networks to the environment (Cannon and Keller 2006). For example, multiple neural system dysfunctions have been recorded that can lead to disruptions in attention, memory, and emotion regulation (Cannon et al. 2001). Small individual disturbances to certain nodes in the brain network can cause combined effects that surpass a liability threshold for a specific or multitude of mental disorders. A complex psychiatric state such as schizophrenia, major depression, or anxiety disorder is then thought to be caused by combined heterogeneous deviations from typical network functioning or by one local disruption in the network. To illustrate this point, we give an analogy inspired by hydrology, where extreme value statistics are commonly used; see Figure 1. Under this analogy, in order to understand the effects of a flooding event (e.g., at the river delta), predicting the frequency and intensity of extreme values at different nodes on the river becomes much more important than predicting the change in the mean, because the combined extreme events are what cause the most damage (Asadi et al. 2015). We illustrate this in Figure 1, where we give a visual representation of the different types of extreme data that can exist at individual gauging stations that together cause a flood. It could be that one river in the network has local water elevation due to, for example, snow melting (Figure 1A). This local water elevation then causes flooding downstream at the river basin. Or it could be that the water elevation is raised at multiple locations due to a more generalized effect for example, a storm across the land (Figure 1B,C). Under this watershed model, it becomes clear that it is important that we model the extremal dependence structure between the different gauging stations and water flow correctly, as it is not only the local extreme events but also a co‐occurrence of multivariate extremes that determines how severe the flooding will be (Chiapino and Sabourin 2017; Engelke and Ivanovs 2020).

FIGURE 1.

FIGURE 1

Visual example of different types of extreme river elevations that can cause local flooding at location 1. (A) Local elevation along one water bank gives rise to the flooding of the river basin at location 1. (B,C) The multivariate extreme elevation of the rivers in combination that gives rise to the flooding at location 1. Likewise, extremes in behavior can be understood as being the end product of several large deviations in certain brain regions or networks.

This analogy can be transferred to psychopathology, as proposed in Cannon's watershed model (Cannon and Keller 2006). Under this model, pathophysiological brain states can manifest either due to local brain disruptions in certain brain regions or nodes in the network or via a more widespread disruption in connected brain systems. The widespread disruption can cause psychopathology by a ‘downstream’ aggregation of different effects. Thus, there are many ways that small causal effects (the tributaries) can be aggregated to yield the complete expression of a disorder (the river delta). The key points are that: (i) neural correlates of psychopathology can be understood as manifesting at the extreme end of continuous axes of variation across the population, (ii) these extreme deviations may be spread across multiple brain systems and may not necessarily overlap across individuals and (iii) these deviations may not be reflected in only one data modality. We now show how this model can be operationalized within the context of multivariate extreme value statistics.

2.3. Introduction to Extreme Value Statistics

We will rely on extreme value statistics to accurately model probabilities of the tails of the distribution (Coles et al. 2001), that is, extreme events. We start with exploring a univariate extreme value model fitting method, which serves as a building block for the multivariate extreme value statistics approach we introduce later. There are two main motivations for this. First, depending on the question of the user, the univariate approach could be sufficient. For example, if one suspects that the extremes in one brain region are most predictive of behavior, such as the amygdala in generalized anxiety disorder (Martin et al. 2009). Second, threshold selection for multivariate extreme value statistics is most easily done by looking at individual model fit in univariate space, as we will discuss below.

For the multivariate extreme values, this work employs a framework that is specifically created for exploring extreme tail dependence when the dimensions become large, such as when we look at multiple brain regions or voxels simultaneously. We employ the tail pairwise dependence matrix (TPDM) (Cooley and Thibaud 2018) to understand the dependence structure in the extremes, which is similar to the correlation or covariance variance matrix, but specified for the tails. We then compute an eigendecomposition of this matrix, similar to what is done in principal components analysis (PCA). This is motivated by the fact that, whilst standard covariance PCA estimates variation about the mean, it is not well‐suited to modeling multivariate tail dependency for the extremes, because dependence in the bulk of a multivariate distribution may not be equivalent to dependence in the tails (Cooley and Thibaud 2018). The TPDM is similar to a covariance matrix, but instead of describing the dependence structure around the mean of the distribution, the TPDM captures the dependence structure in the joint tails. An eigendecomposition gives us an ordered basis of eigenvectors based on the extremes, values that capture, in order, the most important vectors for explaining the extreme dependence structure (Cooley and Thibaud 2018; Jiang et al. 2020). We call the method ‘TPDM PCA’ or ‘extreme PCA’ and the associated principal components (PCs) ‘TPDM PCs’ or ‘extreme PCs’. Similar to standard PCs, the TPDM PCs can be interpreted by looking at the individual contributions of each variable to the TPDM PCs, for example, certain regions‐of‐interest (ROIs) in neuroimaging data. This gives an indication of which variables are most important in understanding the extreme deviations.

3. Materials and Methods

3.1. Data Sample

The data used in the paper came from the UK Biobank imaging data set (Miller et al. 2016; Sudlow et al. 2015). We used the image‐derived phenotypes (IDPs), which are pre‐processed imaging data, generated by the image‐processing pipeline developed and run on behalf of the UK Biobank (Alfaro‐Almagro et al. 2018). These are summary measures of several imaging modalities, including structural, functional, and diffusion modalities, making them ideally suited for evaluating new methods and theories. For the full details on the pre‐processing steps and the design of the study, the following document can be read ‘UK Biobank Brain Imaging Documentation’ (Smith et al. 2020). We used data from 43,893 participants between 40 and 80 years old, with 53% females. For the exact distribution of the age range and the number of participants currently in our dataset, see the supplementary material. For this study, we pre‐processed the IDPs using the FUNPACK software (McCarthy 2020). We used 2006 IDPs in total, including structural, functional, and diffusion MRI IDPs processed by the UKB consortium. We split the available IDP data into a training set and a test set with a 10–90 split, giving us 4405 participants for training and 39,488 participants for testing. This was a deliberate choice as we aimed to retain a large number of testing subjects to evaluate the new multivariate extreme value framework. For the UK Biobank, the dataset is very large. Thus, the number of training subjects is still acceptable; however, with a smaller dataset, we would recommend a more conservative split (e.g., split‐half). Certain IDPs were not fitted optimally and were excluded from further analysis; see Figure S8 for an example.

For the behavioral analysis, the UK Biobank dataset provided an extensive number of behavioral, clinical, lifestyle, and cognitive variables (Miller et al. 2016). These non‐imaging derived phenotypes (nIDPs) are categorized within FUNPACK into seven different categories: cognitive phenotypes (199 variables), lifestyle environmental alcohol (9 variables), lifestyle environment tobacco (4 variables), lifestyle environment general (44 variables), lifestyle environment exercise work (37 variables), mental health (29 variables), neurodegenerative disorders (6 variables). For simplicity, we follow this categorization here.

3.2. Overview of Analytic Workflow

In Figure 2, we present a high‐level overview of all the steps in the multivariate extreme value framework we propose. In brief, we first estimated a normative model for all the IDPs using age, sex, and site as our covariates. We then applied a warped Bayesian Linear Regression (wBLR) model, which helped us to predict the neuroimaging data from the covariates whilst modeling non‐Gaussianity and providing us with posterior predictive intervals for each prediction. This essentially follows an approach we have validated previously (Rutherford, Fraza, et al. 2022; Fraza et al. 2021). We then evaluated the performance of the wBLR model and the likelihood warping by examining multiple error metrics, namely explained variance, mean standardized log loss (MSLL), kurtosis, and skewness (see the supplement for details). Then, we collected the deviation or z‐scores for each participant and all the IDPs into one z‐scores matrix, including all the IDPs we modeled. Next, we transformed the marginals and converted them to polar coordinates, which are standard operations in extreme value statistics and will be explained in more detail below. Then, we thresholded the data at the 95‐quantile in polar coordinates. Finally, the TPDM was calculated, based on the extreme quantiles of the data. On this matrix, we performed an eigenvalue decomposition, which gave us an eigenbasis that explains most of the extremal structure in the first few eigenvectors. Finally, we correlated the first few extreme PCs back to behavior to establish a brain‐behavior relationship. These steps are explained in more detail in the next sections.

FIGURE 2.

FIGURE 2

Overview of the proposed extreme normative modeling approach showing all the steps in the pipeline. (1) Estimate a normative model with likelihood‐warped Bayesian Linear regression for each (neuro) image‐derived phenotype (IDP). This allows us to create deviation or z scores from the mean trend for each participant. (2) Combine the z scores of all the participants and each IDP into one matrix. (3) Transform the marginals of the data such that the marginal distribution is varying with tail index = 2 and threshold the data in polar coordinates at the 95‐quantile. (4) Calculate the tail pairwise dependence matrix (TPDM) based on the outer 95‐quantile of the z scores. This matrix captures the extreme structure in the outer tails. (5) Perform an eigenvalue decomposition on the TPDM, creating an independent basis on which the first few eigenvectors contain most of the information of the extremal structure. (6) Correlate the first few extreme principal components to behavior to indicate how multivariate extreme values influence behavior. See the text for full details.

3.3. Extreme Value Statistics

If we consider the entire distribution of deviation scores, we can approximate it as being composed of two components: the bulk of the distribution and the tails. Usually, we fit a parametric model to the bulk of the distribution, such as a Gaussian distribution. However, if we are interested in the far upper and lower tails or the extreme quantiles of a distribution, we have to apply extreme value statistics, which allows us to model the tail probabilities accurately (Coles et al. 2001; De Haan et al. 2006). The extreme events in the tails have a small probability of occurring but usually exercise a large influence on relevant outcomes. However, the small occurrence or sparseness of measurements in the tails makes it difficult to model accurately. In the univariate case, two methods can be employed to address this that is, the ‘block maxima’ or the ‘exceedance’ (a.k.a. ‘peaks over threshold’) approach. We focus on the ‘peaks over threshold’ approach in this paper, which is summarized in Figure 3.

FIGURE 3.

FIGURE 3

Showing the different types of extreme value statistics that can be used with normative modeling. (A) Normative model created for one IDP (27012) modeled as a function of age using a Bayesian Linear Regression (BLR) model. (1) The normative model fit with the centiles of variation. (2) The histogram of the deviation or z scores. (3) The quantile‐quantile (QQ) plot of the z scores. (B) Extreme value statistics using the peaks over threshold method on the z scores. In general, the bulk of the distribution is modeled by a normal distribution and the tails converge to a Generalized Pareto Distribution (GDP). (C) The extreme value bivariate distribution for two dimensions. (D) Modeling multivariate extremes for more than two variables. Here, we summarize the tail dependence with a so‐called tail pairwise dependence matrix (TPDM), similar to a PCA for the bulk of the distribution.

However, for didactic purposes, we start by describing the block maxima approach. This involves dividing the data into blocks of a certain size and then finding the maximum value in each block. For instance, in hydrology, blocks are usually taken to be years, and the annual maximum is measured across each block. The goal is then to fit a statistical model to estimate the probability of these maxima.

More formally, we consider the deviation scores derived from the normative model. These are denoted z, which are realizations of the random variable Z. We use uppercase letters for random quantities and lowercase letters for actual measured values to differentiate between the two. We then consider a set of n independent variables Zn, having a common distribution function F and aim to model the statistical behavior of the maxima:

Mn=maxZ1Zn (1)

Usually, under the block‐maxima approach Zn are grouped according to a logical block unit or time frame, such as a time‐block in fMRI, one full scanning session for MRI, or another repeated time measure in hours, months, and or years. Alternatively, one can also define a block in the spatial domain such as ROI in the brain (or in our case one IDP). We then aim to determine an appropriate probability distribution function F and fit this to observed data. It has been shown (Fisher and Tippett 1928) that the probability distribution of maxima Mn converges to a generalized extreme value distribution (GEVD), which rescaled with parameters bn and an is defined by:

PMnbnanzGz,n (2)

For constants an>0,bnϵR and z the upper end of the tails, we have

Gz=exp1+ξzμσ1ξ (3)

This probability distribution Gz is defined by the shape ξ, location μ and scale σ parameters. The GEVD can be further separated into the Gumbel, Fréchet, and Weibull families, depending on the shape of the data‐generating distribution. Further details are provided in the supplementary material and in (Coles et al. 2001). What is remarkable about this result is that the GEVD is the only possible asymptotic limit for the distributions of Mn, regardless of the form of the common data‐generating distribution F. Therefore, the theorem serves as an extreme value version of the central limit theorem (Coles et al. 2001).

The block maxima approach, although easy to employ, is not very efficient because it ignores large parts of the data that could be relevant (e.g., the second and third‐highest values in each block, which are potentially also informative about extreme behavior). Furthermore, this approach is mainly specified for data that have spatial or temporal block structure. Therefore, in this work, we employ the peaks over threshold approach. Instead of looking at the maximum value in each block, this method focuses on modeling values that exceed a certain threshold. A statistical model is then fit to these “peak” or extreme values. Mathematically, the peaks over threshold approach can be stated as:

Mn=kthlargest ofZ1Zn (4)

With k chosen to be an appropriately large threshold. In a similar manner to the convergence of the GEVD, the probability distribution of this model converges to a generalized Pareto distribution (GPD). This is derived by building on the results above that is by assuming that PMnzGz. Then, by specifying that for a large enough threshold k, the distribution function of Zk, conditional on Z>k, is approximately:

Hy=11+ξyσ~1ξ (5)

which is defined by the shape ξ and scale σ~=σ+ξkμ parameters. For more details see the supplement and (Coles et al. 2001).

3.4. Thresholding Procedures

Prior to computing the TPDM to capture the extremal dependence in the deviations from the normative model (i.e., the z scores), it is necessary to transform the data and determine a suitable threshold, k, above which a sample is considered extreme. The threshold should not be too stringent, as then there will be too few values to model the tails of the distribution accurately, but also not too low, as then the shape and scale parameters will be incorrect. One way to approach this problem is by using graphical tools with several thresholds and deciding on the best threshold based on their outcomes. This is the approach pursued here. In more detail, we follow the method proposed by (Cooley and Thibaud 2018) and transform the marginal distribution for each IDP, so that it fits the framework that requires z‐scores to be regularly varying. Regular variation is a theoretical requirement underlying the statistics of extremes. In essence, if a random vector is multivariate regularly varying, it means that its joint tail decays according to a power function. This requirement is better satisfied after transforming the data. To achieve this, we transformed the marginal distribution of each variable using a rank transform. The rank order transformation smoothens effects of non‐normality, outliers, unequal variances, and so forth, but other transformations can be considered, see (Resnick 2007). Then, we applied the nonparametric marginal transformation proposed by (Cooley and Thibaud 2018) Gz=logF^z, with F^ the original marginal distribution of the z scores. This essentially ensures the marginal distributions of each variable have a distribution that is close to Fréchet. Next, to be able to threshold in multiple dimensions, we transformed the variables into polar coordinates rω and selected the threshold using visual procedures outlined in the supplement.

One graphical tool that can be employed for the selection of an accurate threshold is the mean residual life plot (MRL‐plot) (Davison and Huser 2015), which involves plotting the value of a threshold, k, against the “mean excess” that is the exceedances at k, minus k. Another graphical tool that can be employed is the parameter or threshold stability plot (Scarrott and MacDonald 2012); (Fraza et al. 2021), which involves plotting model parameters as a function of the different thresholds. We look at different estimates of the parameters for the GDP and decide on an optimal threshold based on the lowest value where the plots are approximately stable. Examples of these plots are shown in Section 3.2 below.

Choosing the optimal (multivariate) threshold for the TPDM estimation includes all the same difficulties as selecting a threshold for univariate extreme value statistics. Commonly, it is selected with a visual inspection of all the graphs presented in the univariate result section. However, when the dimensions become large, this becomes infeasible. Therefore, we follow the suggestions given in (Scarrott and MacDonald 2012) we advocate selecting several variables manually. This is the approach we follow in this manuscript, and several examples are shown in the results and supplement.

3.5. Framework for the Tail Pairwise Dependence Structure

With univariate extreme value statistics, we can model the tails in one variable. However, in many circumstances, we are interested in multiple variables simultaneously, which calls for a multivariate approach. For example, when there is a strong tail dependence between certain locations in the brain or between different imaging phenotypes derived from the data. This can be done using multivariate extremes, where our objective is to characterize and understand the dependence structure in the far upper joint tails, see Figure 3C,D. There are several methods to model the multivariate joint tails; for a full overview see (Engelke and Ivanovs 2020). In the bivariate case, we can fit a bivariate extreme value distribution, which tries to estimate the probability of the variable being large while the other is also large, see Figure 3C; however, this approach is very difficult to scale up to higher dimensions. In our case, the chosen multivariate extreme value method is based on PCA (Cooley and Thibaud 2018), called the tail pairwise dependence matrix (TPDM), see Figure 3D.

As for the univariate method, the method used here for understanding the tail pairwise dependence structure is based on the framework of regular variation. We therefore assume Z is a regularly varying random vector. In general, for multivariate regular variation, it is convenient to consider the variables in polar coordinates:

ZZZZ=rω (6)

With ωϵN a location on the unit sphere S and r is the length of the vector (Resnick 2007). Then for a given A, a set of large values away from the origin, we can calculate the probability of Z as:

PZA~rωϵAαrα1drdHω (7)

With ~ signifying “approximately proportional to”, α>0, r the magnitude or radial component of the location, ω the location on the unit sphere S, and H the angular measure on the unit sphere S, see Figure 4. In the integrand, r follows a power‐law distribution according to α the reciprocal of the shape parameter of the GPD (Cooley and Thibaud 2018; Jiang et al. 2020).

FIGURE 4.

FIGURE 4

Illustration of the applied polar transformation of the data. the value indicated by the radial component r becomes independent from the angular component ω. Adapted from (Jiang et al. 2020).

3.6. Tail Pairwise Dependence Matrix

The angular measure H in lower dimensions can be modeled parametrically. However, as the dimension size increases, there is insufficient information in the extremes to fully capture H. Instead of fully modelling the dependence structure contained in H an approximation can be made with the tail pairwise dependency matrix (TPDM). If we assume Z to be a regularly varying random vector with a common tail index α and angular measure H. Then each element in the TPDM, z, is specified according to:

σi,j=SωiωjdHω,i,j=1,,d (8)

The TPDM has similar interpretation properties as a standard covariance matrix only specified for the extremes (Cooley and Thibaud 2018). Thus, if two variables are independent in the outer tails σi,j=0 and the diagonal element σi,i=b2, with b the scale. The TPDM is positive definite and symmetric and therefore the eigenvalues are positive and real (Cooley and Thibaud 2018; Jiang et al. 2020).

3.7. Estimation of the TPDM

We estimated the dependence in the tails via the TPDM for z=(znd)Rnd, which is a matrix containing all data for D IDPs and N subjects. For this framework, the variables should be heavy‐tailed with a common tail index α=2. The framework for regular variation outlined earlier necessitates that all variables exhibit heavy‐tailed behavior, with a shared tail index denoted by α. To ensure this assumption holds, we first transformed the marginals, which is a common procedure in extreme value statistics (Resnick 2007). We first used a rank order transformation to obtain F^, the estimated marginal cumulative distribution function (CDF) from the data. The rank order transformation smoothens effects of non‐normality, outliers, unequal variances and so forth, but other transformations can be considered, see (Resnick 2007). Then we transformed our F^ with Gz=logF^z the CDF of a Fréchet random variable (Cooley and Thibaud 2018). Afterward, a polar coordinate transformation was applied and a threshold of the 95th quantile was chosen for the kth largest radius, see Figure 4. Finally, we estimated the TPDM according to:

σ^ij=2nexc1n=1Nωniωnjlrn>r0 (9)

Where r0 the threshold for the radial component and nexc the number of exceedances (i.e., observations that pass the threshold).

3.8. Extreme Principal Components Analysis

Next, we perform an eigendecomposition (Cooley and Thibaud 2018) of the TPDM or ^z to obtain the eigenvectors (i.e., extreme PCs). This yields an orthonormal basis U^D^U^T=^Z, with D the diagonal matrix of eigenvalues λ1λl and U=u1ul the matrix of eigenvectors.

Two of the goals of standard covariance PCA are to extract the most important information and to analyze the structure of this information in terms of the original variables (Abdi and Williams 2010). To achieve the first goal, we select the top eigenvectors by ordering the eigenvalues that explain the largest ‘scale’, which is the extremal equivalent of variance in conventional (covariance) PCA. This is therefore similar to ordering standard eigenvalues which explain the most variance for covariance PCA. To achieve the second goal that is to understand the contribution of each variable to the extreme PCs, we compute loadings to understand the basis we derived in terms of its original variables by estimating the individual contributions. We define the loadings for each component l:fl=ul×λl, with f the loading, u the eigenvector and λ the eigenvalue. The contribution of each IDP d to the component l is then calculated by: cd,l=fd,l2λl, with λl the eigenvalue of the lth component (Abdi and Williams 2010).

3.9. Evaluation

Finally, we correlate the extreme PCs with all the behavioral phenotypes according to the FUNPACK categorization described in Section 2.1 (McCarthy 2020). We achieve this using pointwise biserial correlation or Spearman correlation depending on whether the phenotype is continuous or categorical for the leading principal components, followed by Bonferroni correction for multiple comparisons across nIDPs and PCs. We also perform a standard correlation PCA approach as a comparison. However, we emphasize that these two approaches are complementary as one is specified for the bulk of the distribution (PCA) and the other is specified for the tails of the distribution (extreme PCA). Therefore, they yield different information, as we will show in the results.

4. Results

4.1. Univariate Extreme Value Results

In this section, we demonstrate the peak‐over‐threshold method for one IDP and show how an optimal threshold can be decided and the tails accurately modeled. In Figure 5, we show the different possible thresholds for one IDP, volume of white matter hypointensities (WMH; UKB field ID 26528). This is a suitable choice because it is the most highly non‐Gaussian phenotype we model and is therefore the most challenging variable for our pipeline (however, please note that after fitting the normative model, the z‐scores should be closer to Gaussian). This IDP is shown in Figure 5.

FIGURE 5.

FIGURE 5

The univariate peak over threshold method for one IDP. (A) Warped BLR normative model result for the white matter hyperintensity IDP, demonstrating the mean and centiles of variation. (B) Histogram of the z scores showing different thresholds that could be employed in the peaks over threshold method (90th, 95th, and 99th percentiles). (C) Scatterplot for the peaks over threshold method demonstrating three possible thresholds of the 90th, 95th, and 99th percentiles. We see that the sparsity of the points increases with higher threshold values.

To decide between the different thresholds, we used graphical tools such as the mean residual life plot and shape and scale plots. In Figure 6, we present the mean residual life plot for the WMH phenotype. To help interpret this plot, it is assumed that the threshold should be placed where the plot starts diverging from the linear trend. In this case, we can observe that the plot is relatively linear until the 0.95 mark.

FIGURE 6.

FIGURE 6

The mean residual life plot of the WMH IDP data. We can observe that the plot is relatively linear up to the 0.95 threshold, indicating that this could be a good threshold.

In Figure 7, we show the parameter threshold stability plot of the modified scale and shape parameters. The dots demonstrate the scale and shape parameters for the GPD at different thresholds. The lines show the 95% confidence interval. Again, this makes it clear that the threshold should be around the 0.9–0.95 mark.

FIGURE 7.

FIGURE 7

The parameter threshold stability plot of the modified scale and shape parameters. The dots demonstrate the scale and shape parameters for the generalized Pareto distribution (GPD) at different thresholds. The lines show the 95% confidence interval. An optimal threshold is chosen at the point where the parameters are both relatively stable in—this case, around the 95th percentile.

On the basis of these techniques, we set the threshold as the 95th quantile that is, r0=247. This yields nexc=1960 values exceeding this threshold, which is low enough to give us sufficient data and high enough to estimate the GPD parameters accurately. Once the threshold is decided, we can estimate the parameters using maximum likelihood optimization. We then optimized the profile log‐likelihood to decide the scale and shape parameters for the generalized Pareto distribution. The more peaked the log‐likelihood is the higher our confidence in our estimates. In the supplement, we show several model diagnostic plots, such as the QQ‐plot and return level plot, yielding straight plots. The straight line indicates that the fitted function is appropriate for the data (Resnick 2007). The final scale and shape parameter estimates are: threshold: 1.9613, exceedances: 397, scale estimate 0.425 ± 0.03 (standardized error (SE)), shape estimate −0.065 ± 0.05 (SE).

4.2. Multivariate Extreme Value Results

4.2.1. Thresholding

As discussed in the methods, to be able to threshold in multiple dimensions, we transformed the variables into polar coordinates rω, then use multiple diagnostic plots to select the threshold, r0. Several examples are shown in Figures S3–S5. On the basis of these plots, we decided the 0.95 quantile was a good common threshold above which the examined parameters seemed relatively constant.

4.2.2. Extreme Principal Components Analysis

After computing the TPDM and its eigendecomposition, we show the scree plot of the scale in Figure 8 below. Briefly, the first extreme PC explains 17.64% of the scale, and the second extreme PC explains 3.71% of the scale. We, therefore, selected the first two extreme PCs as our informative basis.

FIGURE 8.

FIGURE 8

The scree plot and table of scale values of the eigenvalue decomposition for the TPDM matrix. We can see that a large percentage of the scale is explained by the first extreme component.

In Figure 9, we plot the highest contributions (i.e., loadings) of the IDPs to the first two extreme PCs. A line is added to denote the average contribution (1/l). The first extreme PC is dominated by regional and tissue volume IDPs and cortical area IDPs and the second extreme PC is dominated by white matter tract diffusivity IDPs, showing that in the extreme tails these variables vary the most together and explain the largest amount of scale for all the IDP data. This shows that many phenotypes have very strong tail dependence, which underscores the need for a multivariate approach.

FIGURE 9.

FIGURE 9

List of the top 20 contributions of the different IDPs to the first and second extreme PCs. We can see that the first extreme PC is largely dominated by IDPs relating to whole brain tissue volume and area. The second extreme PC is largely dominated by IDPs relating to white matter tract diffusivity.

For comparison, we show the comparable scree plots and loadings for standard PCA in Figures S9 and S10. While it is difficult to compare these approaches since they measure different things, we select the top two components in each case for subsequent analysis, which explain approximately 20% of scale or variance, respectively, for extreme or standard PCA. Furthermore, we also show the results in the supplement of a standard count score analysis of deviation scores above a threshold (|Z| > 1.96), which is comparable to the extreme deviation analysis performed in previous works (Marquand et al. 2019; Wolfers et al. 2020). Whilst many previously identified patterns are still observed with this simpler analysis, we consider this to be suboptimal for three reasons. First, this largely results in a smaller effect size (see Figures S14–S16). Second, it does not provide a statistical model to quantify the magnitude of the deviation and therefore cannot be used for statistical anomaly detection or statistical risk quantification. Third, it is more challenging to determine which specific IDP primarily drives the multivariate effect because this method relies on a basic count score.

4.2.3. Extreme PCs Correlation With Behavior

We next aim to correlate the extreme PCs to behavioral variables to ensure that the model can be used for behavioral predictions in clinical samples. In Figure 10, we present Manhattan plots of the p values for the univariate correlation between the nIDPs and the first two extreme PCs. Across both PCs we detected 137 significant Bonferroni corrected associations. For standard PCA, we detected 158 Bonferroni corrected associations across the first two components, see Figure S11. However, we emphasize that the number of detected associations is not the most important point of comparison, particularly recalling that extreme PCA is estimated on the basis of 20 times fewer data points (i.e., only samples exceeding the 95% threshold). Rather, we wish to emphasize that the associations detected are different between extreme and standard PCA, indicating that they capture different information. This can be seen by comparing Table 1 (extreme PCA) with supplementary Table 1 (standard PCA), both of which show the aggregate effects across the first two components. More specifically, extreme PCA detects associations with phenotypes that could arguably be expected to have a more extreme effect on brain structure or function. For example, associations are detected between extreme PCs and multiple sclerosis and alcohol use, but not with standard PCs. In contrast, standard PCA detects stronger associations with general environmental factors. Note also that this is not just a thresholding effect and the pattern of effects remains even at a more lenient threshold (not shown).

FIGURE 10.

FIGURE 10

Two Manhattan plots of the log p values for the Spearman correlation between the nIDPs and the first two extreme PCs. The black line demonstrates the Bonferroni‐corrected p value threshold. Thus, values passing this line indicate a significant PC‐nIDP correlation. We can see that mostly cognitive measures and general lifestyle nIDPs correlate with the first two extreme PCs.

TABLE 1.

Top 10 associations between non‐imaging derived phenotypes (nIDPs) and extreme principal components, grouped per category.

UKB nIDP ID −log10(p) Description
Neurodegenerative disorders
20.002‐0.1261 112.2 Multiple sclerosis
Cognition
20,016‐2.0 68.8 Fluid intelligence score
20,128‐2.0 41.7 Number of fluid intelligence questions attempted within time limit
6373‐2.0 33.5 Number of puzzles correctly solved
6350‐2.0 32.5 Duration to complete alphanumeric path (trail #2)
630‐2.0 31.6 Touchscreen duration
4282‐2.0 26.2 Maximum digits remembered correctly
23,323‐2.0 25.7 Number of symbol digit matches attempted
23,324‐2.0 24.0 Number of symbol digit matches made correctly
6348‐2.0 23.3 Duration to complete numeric path (trail #1)
20,023‐2.0 20.5 Mean time to correctly identify matches
Lifestyle/environment alcohol
1588‐2.0 4.5 Average weekly beer plus cider intake
Lifestyle/environment tobacco
20,116‐2.0 6.2 Smoking status
Lifestyle/environment general
3‐2.0 28.9 Verbal interview duration
738‐2.0 26.2 Average total household income before tax
6138‐2.0 25.7 Qualifications
2139‐2.0 24.8 Age first had sexual intercourse
3526‐2.0 13.4 Mother's age at death
1835‐2.0 10.1 Mother still alive
680‐2.0 9.8 Own or rent accommodation lived in
728‐2.0 8.3 Number of vehicles in household
1807‐2.0 8.2 Father's age at death
6142‐2.0 7.6 Current employment status
Lifestyle/environment exercise/work
1070‐2.0 18.2 Time spent watching television (TV)
924‐2.0 16.3 Usual walking pace
1190‐2.0 12.0 Nap during day
1100‐2.0 9.7 Drive faster than motorway speed limit
1090‐2.0 7.5 Time spent driving
1120‐2.0 5.7 Weekly usage of mobile phone in last 3 months
1170‐2.0 5.2 Getting up in morning
1130‐2.0 4.2 Hands‐free device/speakerphone use with mobile phone in last 3 month
904‐2.0 4.2 Number of days/week of vigorous physical activity 10+ minutes
Mental health
4570‐2.0 11.3 Friendships satisfaction
4653‐2.0 10.9 Ever highly irritable/argumentative for 2 days
4548‐2.0 9.3 Health satisfaction
4581‐2.0 7.3 Financial situation satisfaction
1970‐2.0 5.6 Nervous feelings
1950‐2.0 5.3 Sensitivity/hurt feelings
4537‐2.0 4.8 Work/job satisfaction
2080‐2.0 4.7 Frequency of tiredness/lethargy in last 2 weeks
2010‐2.0 4.4 Suffer from ‘nerves’
4631‐2.0 4.2 Ever unenthusiastic/disinterested for a whole week

Note: The p values reported in the table are the maximum p value across the first two components. Only associations surviving Bonferroni correction across nIDPs and components are reported.

For interested readers, we have included in the supplement a comparison of effect sizes with the standard method of counting deviation scores above a specific threshold (|Z| > 1.96) and their correlation with the nIDPs. The results of this analysis are, in this specific case, largely comparable to the standard PCA approach. However, we argue that both the standard PCA and extreme PCA methods better capture distributed multivariate signals.

5. Discussion

In this paper, we present a framework to characterize multivariate extreme tail behavior of brain imaging data. This is predicated on re‐conceptualizing psychopathology as deviations from a normative pattern of functioning, where such deviations may be spread across multiple measures and have a complex correlation structure, although we consider that this framework will also be useful for many other neuroimaging applications where understanding tail behavior is of interest. For example, our approach is naturally suited to anomaly detection applications where quantifying the statistical ‘surprise’ of a potential outlier is important, and indeed, our approach is useful in any application where it is important to accurately quantify risk in the tails of a given distribution. We introduced the key notions underlying extreme value statistics, and outlined approaches suitable for fitting these models to neuroimaging data, including heuristics for univariate and multivariate threshold selection. In order to accommodate tail dependency across multiple variables, we propose to apply an eigendecomposition to the TPDM, yielding an extremal variant of PCA that can provide a good basis to further understand multivariate tail dependence. We show that this approach is useful to identify participants deviating from the normative model in multivariate space and that this meaningfully relates to clinical and demographic variables. Finally, we show that this extreme PCA approach detects fundamentally different (and complementary) information to the standard covariance PCA approach used in the field.

We applied this approach to normative models, which are now widely used to capture and understand the heterogeneity in the population by modeling individual deviations from a reference cohort. Although many applications have considered extreme deviations from a normative model as their primary endpoints (Zabihi et al. 2019; Wolfers et al. 2018; Rutherford, Fraza, et al. 2022; Lv et al. 2021; Wolfers et al. 2020), a principled treatment of how to study extreme deviations from normative models has been lacking, which we now provide. In this study, we show that fitting an extreme value distribution to these deviations allows us to statistically quantify the magnitude of extremes (e.g., as quantiles of an assumed extreme value distribution) and explore extreme tail behavior across multiple variables by summarizing extremal dependence structure in a TPDM. Previous research in other domains (e.g., meteorology, hydrology, climate, and finance) has attempted to measure tail dependence between variables, as seen in (Engelke and Ivanovs 2020). Our results show that the TPDM method is an effective tool for this purpose in neuroimaging data. The TPDM is similar to a covariance matrix (Cooley and Thibaud 2018), a well‐established tool in neuroimaging research for understanding the relationships between different brain areas and modalities. However, the covariance matrix only captures variance around the mean, which does not fully correspond to tail dependence.

We demonstrated how to implement multivariate extreme value analysis on a large dataset from the UK Biobank, using IDPs. In this application, we showed that the scale of the TPDM can be summarized by a low number of extreme principal components and that these can be meaningfully related to behavioral phenotypes, providing complementary information to a standard PCA. Our analysis showed that the extreme showed larger effect sizes for nIDPs that may a priori be expected to have a more severe impact on neurobiology, most notably neurodegenerative disorders, and weekly consumption of alcoholic beverages consumed on a weekly basis. These associations were not detected using the standard PCA approach. In contrast, the standard PCA method exhibited a larger effect size for factors related to ‘lifestyle and environment’, which (arguably) : may be expected to have more mild effects on the neurobiology. The multivariate extreme method presented in this paper could therefore offer an alternative to methods mostly focusing on mean deviation scores. Although we focus on normative models here, the extreme value statistics approach is widely useful in neuroimaging, although we point out that fitting imaging data to normative models first is a natural preliminary step and provides important benefits. For example, it is easier to find a common threshold for multivariate analysis if data are first transformed via normative models to a common reference scale (i.e., Z scores). In contrast, it may be more difficult to do this if the raw data values are used, which may have different units and scaling (e.g., combining cortical thickness estimates with subcortical volumes).

As with any statistical method, for extreme value statistics, one major consideration is that users should carefully consider the questions they want the model to answer. The extreme value statistics approach assumes a priori that there is more information of interest in the tails than in the bulk of the distribution. If users have a question pertaining to the general population and ‘healthy’ (or ‘average’) functioning of the brain, an extreme value approach would probably not be the most optimal. For these questions, classical approaches that focus on central tendency could be more appropriate. Therefore, we recommend users carefully consider the nature of their question and assess if it is focused on the mean or the tails of the distribution. The extreme value method is designed to give a more sensitive analysis of the multivariate tails of the distribution and is especially beneficial in applications where accurate quantification of risk is essential. This is, to a certain degree, shown in the analysis we conducted within UK Biobank, where for example more severe clinical phenotypes (such as multiple sclerosis) were more strongly associated with the extreme PCs than standard PCs.

In this paper, we have considered the extreme value framework from the standpoint of considering mental disorders as falling at the extremum of the normal multivariate range, which we consider to be both natural and principled. This offers a powerful tool for uncovering new insights into the underlying relationships between different neuroimaging modalities and for discovering new biological markers of disease. We have also aimed to provide guidance specifically oriented toward neuroimaging for the many considerations that are important in implementing extreme value statistics in practice; for example, the choice of approach (block maximum, peaks over threshold), the choice of threshold parameter, how to transform variables to meet the assumptions underlying extreme value theory, and how to model dependence between variables. However, there are many avenues for future work: first, it would be interesting to model tail dependence in a more structured way, for example using graphical models (Engelke and Ivanovs 2020) or using stronger constraints such as modeling statistical independence in tail behavior, as is done classically using independent components analysis. Another future avenue could be to design appropriate mixture models to combine a classical model to fit for the bulk of the distribution and an extreme value fit to the tails. This would allow accurate modeling of both the bulk and tail. However, in this case, giving adequate weight to the extremes and the bulk of the distribution is important. Finally, other applications could consider assessing extremal dependence in the spatial domain rather than across subjects (e.g., comparing extreme deviations of different clusters within brain images) or combining extremes across both space and subjects.

In conclusion, we presented a comprehensive framework for understanding and visualizing the dependency structure in the tails of different neuroimaging variable distributions. The method presented here is general and can be applied to several types of neuroimaging data, including combining across modalities. Therefore, we expect that this method would be an excellent choice for many applications where extreme brain deviations and their underlying dependence could have a significant effect on extreme behavior, ultimately taking us further in the quest for individualized and precision medicine.

Author Contributions

C.F.: Conceptualization, Formal analysis, Methodology, Writing – Original Draft, Writing – Review and Editing. M.Z.: Methodology, Writing – Review and Editing. C.F.B.: Funding Acquisition, Supervision, Writing – Review and Editing. A.F.M.: Conceptualization, Funding Acquisition, Supervision, Writing – Original Draft, Writing – Review and Editing. All authors: drafting and revising the manuscript.

Ethics Statement

Ethical approval for the acquisition of the UK Biobank data was provided by the UK Biobank consortium, and the data were accessed under application number 23668 (Marquand).

Supporting information

Appendix S1. Supplementary Information.

HBM-46-e70281-s001.docx (8.6MB, docx)

Fraza, C. , Zabihi M., Beckmann C. F., and Marquand A. F.. 2025. “Using Extreme Value Statistics to Reconceptualize Psychopathology as Extreme Deviations From a Normative Reference Model.” Human Brain Mapping 46, no. 11: e70281. 10.1002/hbm.70281.

Funding: This work was supported by Nederlandse Organisatie voor Wetenschappelijk Onderzoek (016.156.415) and European Research Council (10100118). This research has been conducted using the UK Biobank resource under application number (23668).

Data Availability Statement

Data are available under request from UK Biobank. Analysis code for the present manuscript is available at https://github.com/CharFraza/extremes_normative_modelling and the PCNtoolkit software toolbox used for normative modelling is available at https://github.com/amarquand/PCNtoolkit.

References

  1. Abdi, H. , and Williams L. J.. 2010. “Principal Component Analysis: Principal Component Analysis.” Wiley Interdisciplinary Reviews: Computational Statistics 2, no. 4: 433–459. [Google Scholar]
  2. Alfaro‐Almagro, F. , Jenkinson M., Bangerter N. K., et al. 2018. “Image Processing and Quality Control for the First 10,000 Brain Imaging Datasets From UK Biobank.” NeuroImage 166: 400–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Allsopp, K. , Read J., Corcoran R., and Kinderman P.. 2019. “Heterogeneity in Psychiatric Diagnostic Classification.” Psychiatry Research 279: 15–22. [DOI] [PubMed] [Google Scholar]
  4. Asadi, P. , Davison A. C., and Engelke S.. 2015. “Extremes on River Networks.” Annals of Applied Statistics 9, no. 4: 2023–2050. [Google Scholar]
  5. Berg, D. , Borghammer P., Fereshtehnejad S. M., et al. 2021. “Prodromal Parkinson Disease Subtypes—Key to Understanding Heterogeneity.” Nature Reviews. Neurology 17, no. 6: 349–361. [DOI] [PubMed] [Google Scholar]
  6. Bethlehem, R. A. I. , Seidlitz J., Romero‐Garcia R., Trakoshis S., Dumas G., and Lombardo M. V.. 2020. “A Normative Modelling Approach Reveals Age‐Atypical Cortical Thickness in a Subgroup of Males With Autism Spectrum Disorder.” Communications Biology 3, no. 1: 486. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bethlehem, R. a. I. , Seidlitz J., White S. R., et al. 2022. “Brain Charts for the Human Lifespan.” Nature 604, no. 7906: 525–533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bloem, B. R. , Okun M. S., and Klein C.. 2021. “Parkinson's Disease.” Lancet 397, no. 10291: 2284–2303. [DOI] [PubMed] [Google Scholar]
  9. Bu, X. , Zhao Y., Zheng X., et al. 2024. Normative Growth Modeling of Brain Morphology Reveals Neuroanatomical Heterogeneity and Biological Subtypes in Children With ADHD [Internet]. bioRxiv. Accessed March 21, 2024. 10.1101/2024.03.16.582202v1. [DOI]
  10. Cannon, T. D. , Gasperoni T. L., van Erp T. G., and Rosso I. M.. 2001. “Quantitative Neural Indicators of Liability to Schizophrenia: Implications for Molecular Genetic Studies.” American Journal of Medical Genetics 105, no. 1: 16–19. [PubMed] [Google Scholar]
  11. Cannon, T. D. , and Keller M. C.. 2006. “Endophenotypes in the Genetic Analyses of Mental Disorders.” Annual Review of Clinical Psychology 2: 267–290. [DOI] [PubMed] [Google Scholar]
  12. Chiapino, M. , and Sabourin A.. 2017. “Feature Clustering for Extreme Events Analysis, With Application to Extreme Stream‐Flow Data.” In New Frontiers in Mining Complex Patterns, edited by Appice A., Ceci M., Loglisci C., Masciari E., and Raś Z. W., 132–147. Springer International Publishing; (Lecture Notes in Computer Science). [Google Scholar]
  13. Coles, S. , Bawa J., Trenner L., and Dorazio P.. 2001. An Introduction to Statistical Modeling of Extreme Values. Springer. [Google Scholar]
  14. Cooley, D. , and Thibaud E.. 2018. Decompositions of Dependence for High‐Dimensional Extremes. arXiv:161207190. Accessed March 1, 2022. http://arxiv.org/abs/1612.07190.
  15. Davison, A. C. , and Huser R.. 2015. Statistics of Extremes. https://papers.ssrn.com/abstract=2598025.
  16. De Haan, L. , Ferreira A., and Ferreira A.. 2006. Extreme Value Theory: An Introduction. Springer. [Google Scholar]
  17. Ebadi, A. , Allouch S., Mheich A., et al. 2024. Beyond Homogeneity: Charting the Landscape of Heterogeneity in Psychiatric Electroencephalography. bioRxiv. 10.1101/2024.03.04.583393v1. [DOI] [PMC free article] [PubMed]
  18. Engelke, S. , and Ivanovs J.. 2020. Sparse Structures for Multivariate Extremes. arXiv:200412182. Accessed February 15, 2022. http://arxiv.org/abs/2004.12182.
  19. Feczko, E. , and Fair D. A.. 2020. “Methods and Challenges for Assessing Heterogeneity.” Biological Psychiatry 88, no. 1: 9–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fisher, R. A. , and Tippett L. H. C.. 1928. “Limiting Forms of the Frequency Distribution of the Largest or Smallest Member of a Sample.” Mathematical Proceedings of the Cambridge Philosophical Society 24, no. 2: 180–190. [Google Scholar]
  21. Floris, D. L. , Llera A., Zabihi M., et al. 2024. A Multimodal Neural Signature of Face Processing in Autism Within the Fusiform Gyrus [Internet]. medRxiv. Accessed March 15, 2024. 10.1101/2024.01.04.23300134v2. [DOI] [PMC free article] [PubMed]
  22. Fraza, C. J. , Dinga R., Beckmann C. F., and Marquand A. F.. 2021. “Warped Bayesian Linear Regression for Normative Modelling of Big Data.” NeuroImage 245: 118715. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Goix, N. , Sabourin A., and Clémençon S.. 2017. “Sparse Representation of Multivariate Extremes With Applications to Anomaly Detection.” Journal of Multivariate Analysis 161: 12–31. [Google Scholar]
  24. Hannon, K. , Easley T., Zhang W., et al. 2022. Heterogeneity in Depression: Evidence for Distinct Clinical and Neurobiological Profiles. medRxiv. Accessed March 21, 2023. 10.1101/2022.12.07.22283225v1. [DOI]
  25. Insel, T. R. 2014. “Mental Disorders in Childhood: Shifting the Focus From Behavioral Symptoms to Neurodevelopmental Trajectories.” Journal of the American Medical Association 311, no. 17: 1727–1728. [DOI] [PubMed] [Google Scholar]
  26. Jiang, Y. , Cooley D., and Wehner M. F.. 2020. “Principal Component Analysis for Extremes and Application to U.S. Precipitation.” Journal of Climate 33, no. 15: 6441–6451. [Google Scholar]
  27. Kia, S. M. , Huijsdens H., Dinga R., Wolfers T., Mennes M., and Andreassen O. A.. 2020. “Hierarchical Bayesian Regression for Multi‐Site Normative Modeling of Neuroimaging Data.” In Medical Image Computing and Computer Assisted Intervention–MICCAI 2020, edited by Martel A. L., Abolmaesumi P., Stoyanov D., et al., 699–709. Springer International Publishing; (Lecture Notes in Computer Science). [Google Scholar]
  28. Kumar, S. , Earnest T., Payne P. R. O., and Sotiras A.. 2023. Analyse Patient‐Level Heterogeneity in Alzheimer's Disease Using Multimodal Normative Modelling. bioRxiv.
  29. Looden, T. , Floris D. L., Llera A., et al. 2022. “Patterns of Connectome Variability in Autism Across Five Functional Activation Tasks: Findings From the LEAP Project.” Molecular Autism 13, no. 1: 53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Loreto, F. , Verdi S., Kia S. M., et al. 2024. “Alzheimer's Disease Heterogeneity Revealed by Neuroanatomical Normative Modeling.” Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring 16, no. 1: e12559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lv, J. , Di Biase M., Cash R. F. H., et al. 2021. “Individual Deviations From Normative Models of Brain Structure in a Large Cross‐Sectional Schizophrenia Cohort.” Molecular Psychiatry 26, no. 7: 3512–3523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Marquand, A. F. , Kia S. M., Zabihi M., Wolfers T., Buitelaar J. K., and Beckmann C. F.. 2019. “Conceptualizing Mental Disorders as Deviations From Normative Functioning.” Molecular Psychiatry 24, no. 10: 1415–1424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Marquand, A. F. , Rezek I., Buitelaar J., and Beckmann C. F.. 2016. “Understanding Heterogeneity in Clinical Cohorts Using Normative Models: Beyond Case‐Control Studies.” Biological Psychiatry 80, no. 7: 552–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Marquand, A. F. , Wolfers T., Mennes M., Buitelaar J., and Beckmann C. F.. 2016. “Beyond Lumping and Splitting: A Review of Computational Approaches for Stratifying Psychiatric Disorders.” Biological Psychiatry: Cognitive Neuroscience and Neuroimaging 1, no. 5: 433–447. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Marsh, T. J. , and Dale M.. 2002. “The UK Floods of 2000–2001: A Hydrometeorological Appraisal.” Water Environment Journal 16, no. 3: 180–188. [Google Scholar]
  36. Martin, E. I. , Ressler K. J., Binder E., and Nemeroff C. B.. 2009. “The Neurobiology of Anxiety Disorders: Brain Imaging, Genetics, and Psychoneuroendocrinology.” Psychiatric Clinics of North America 32, no. 3: 549–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. McCarthy, P. 2020. Funpack.
  38. Meijer, J. , Vieira B. H., Elleaume C., Baranczuk‐Turska Z., Langer N., and Floris D. L.. 2023. Towards Understanding Autism Heterogeneity: Identifying Clinical Subgroups and Neuroanatomical Deviations [Internet]. medRxiv. Accessed March 15, 2024. 10.1101/2023.10.21.23297299v1. [DOI] [PubMed]
  39. Miller, K. L. , Alfaro‐Almagro F., Bangerter N. K., et al. 2016. “Multimodal Population Brain Imaging in the UK Biobank Prospective Epidemiological Study.” Nature Neuroscience 19, no. 11: 1523–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Nunes, A. , Trappenberg T., and Alda M.. 2020. “Measuring Heterogeneity in Normative Models as the Effective Number of Deviation Patterns.” PLoS One 15, no. 11: e0242320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Poon, S. H. , Rockinger M., and Tawn J.. 2004. “Extreme Value Dependence in Financial Markets: Diagnostics, Models, and Financial Implications.” Review of Financial Studies 17, no. 2: 581–610. [Google Scholar]
  42. Resnick, S. I. 2007. Heavy‐Tail Phenomena: Probabilistic and Statistical Modeling. Springer Science & Business Media. [Google Scholar]
  43. Rutherford, S. , Barkema P., Tso I. F., et al. 2023. “Evidence for Embracing Normative Modeling.” elife 12: e85082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Rutherford, S. , Fraza C., Dinga R., et al. 2022. “Charting Brain Growth and Aging at High Spatial Precision. Baker CI, Taschler B, Esteban O, Constable T, Editors.” eLife 11: e72904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Scarrott, C. , and MacDonald A.. 2012. “A Review of Extreme Value Threshold Estimation and Uncertainty Quantification.” Revstat Statistical Journal 10, no. 1: 33–60. [Google Scholar]
  46. Shaw, P. , Eckstrand K., Sharp W., et al. 2007. “Attention‐Deficit/Hyperactivity Disorder Is Characterized by a Delay in Cortical Maturation.” Proceedings of the National Academy of Sciences of the United States of America 104, no. 49: 19649–19654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Smith, S. , Alfro A F., and Miller K.. 2020. UK Biobank Brain Imaging Documentation. https://biobank.ctsu.ox.ac.uk/crystal/docs/brain_mri.pdf.
  48. Sudlow, C. , Gallacher J., Allen N., et al. 2015. “UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age.” PLoS Medicine 12, no. 3: e1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Tung, Y. H. , Lin H. Y., Chen C. L., et al. 2021. “Whole Brain White Matter Tract Deviation and Idiosyncrasy From Normative Development in Autism and ADHD and Unaffected Siblings Link With Dimensions of Psychopathology and Cognition.” American Journal of Psychiatry 178, no. 8: 730–743. [DOI] [PubMed] [Google Scholar]
  50. Verdi, S. , Rutherford S., Fraza C., et al. 2023. Personalising Alzheimer's Disease Progression Using Brain Atrophy Markers. medRxiv. Accessed June 28, 2023. 10.1101/2023.06.15.23291418v1. [DOI]
  51. Wolfers, T. , Beckmann C. F., Hoogman M., Buitelaar J. K., Franke B., and Marquand A. F.. 2020. “Individual Differences v. the Average Patient: Mapping the Heterogeneity in ADHD Using Normative Models.” Psychological Medicine 50, no. 2: 314–323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Wolfers, T. , Doan N. T., Kaufmann T., et al. 2018. “Mapping the Heterogeneous Phenotype of Schizophrenia and Bipolar Disorder Using Normative Models.” JAMA Psychiatry 75, no. 11: 1146–1155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zabihi, M. , Floris D. L., Kia S. M., et al. 2020. “Fractionating Autism Based on Neuroanatomical Normative Modeling.” Translational Psychiatry 10, no. 1: 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Zabihi, M. , Oldehinkel M., Wolfers T., et al. 2019. “Dissecting the Heterogeneous Cortical Anatomy of Autism Spectrum Disorder Using Normative Models.” Biological Psychiatry: Cognitive Neuroscience and Neuroimaging 4, no. 6: 567–578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Zhao, X. , Scarrott C., Oxley L., and Reale M.. 2010. “Extreme Value Modelling for Forecasting Market Crisis Impacts.” Applied Financial Economics 20: 63–72. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix S1. Supplementary Information.

HBM-46-e70281-s001.docx (8.6MB, docx)

Data Availability Statement

Data are available under request from UK Biobank. Analysis code for the present manuscript is available at https://github.com/CharFraza/extremes_normative_modelling and the PCNtoolkit software toolbox used for normative modelling is available at https://github.com/amarquand/PCNtoolkit.


Articles from Human Brain Mapping are provided here courtesy of Wiley

RESOURCES