ABSTRACT
Common measures of brain functional connectivity (FC) including covariance and correlation matrices are symmetry‐positive definite (SPD) matrices residing on a cone‐shaped Riemannian manifold. Despite its remarkable success for Euclidean‐valued data generation, the use of standard generative adversarial networks (GANs) to generate manifold‐valued FC data neglects its inherent SPD structure and hence the inter‐relatedness of edges in real FC. We propose a novel graph‐regularized manifold‐aware conditional Wasserstein GAN (GR‐SPD‐GAN) for FC data generation on the SPD manifold that can preserve the global FC structure. Specifically, we optimize a generalized Wasserstein distance between the real and generated SPD data under adversarial training, conditioned on the class labels. The resulting generator can synthesize new SPD‐valued FC matrices associated with different classes of brain networks, for example, brain disorder or healthy control. Furthermore, we introduce additional population graph‐based regularization terms on both the SPD manifold and its tangent space to encourage the generator to respect the inter‐subject similarity of FC patterns in the real data. This also helps in avoiding mode collapse and produces more stable GAN training. Evaluated on resting‐state functional magnetic resonance imaging (fMRI) data of major depressive disorder (MDD), qualitative and quantitative results show that the proposed GR‐SPD‐GAN clearly outperforms several state‐of‐the‐art GANs in generating more realistic fMRI‐based FC samples. When applied to FC data augmentation for MDD identification, classification models trained on augmented data generated by our approach achieved the largest margin of improvement in classification accuracy among the competing GANs over baselines without data augmentation.
Keywords: brain disorder, classification, data augmentation, fMRI, functional connectivity, generative adversarial network, Riemannian geometry
A manifold‐award generative adversarial network with specialized architecture and population‐graph regularization for brain functional connectivity (FC) generation. The proposed network can generate high quality synthetic FC data in close resemblance to the true data distribution. Incorporating the synthetic FC can greatly improve the downstream classification performance for brain disorders diagnosis.

1. Introduction
Functional connectivity (FC) networks composed of interactions among spatially distinct brain regions are often characterized by cross‐correlations of blood‐oxygen dependent level signals between different brain regions, measured using functional magnetic resonance imaging (fMRI) (Cassidy et al. 2014; Ting et al. 2018). Correlation‐based analysis of FC has formed a basis for diverse areas of neuroimaging research, including investigation of neurodevelopmental, psychiatric, and neurological diseases (Greicius 2008; Zhang et al. 2021), dynamic connectivity analysis (Hutchison et al. 2013), identification of individuals (Finn et al. 2015), machine learning‐based prediction of behavior (Tian and Zalesky 2021) and brain disorders (Du et al. 2018). Deep learning (DL) methods have also been introduced recently for identifying brain disorders using fMRI FC patterns, and showed promising improvements over traditional classifiers. These include classification of autism spectrum disorder using autoencoders (Heinsfeld et al. 2018), mild cognitive impairment using convolutional neural networks (CNNs) (Kam et al. 2019) and deep Boltzmann machine (Suk et al. 2014) to name a few. While most of these works focused on functional connectomic classification, the problem of FC data generation has been less investigated in the state of the art.
FC generation by synthesizing realistic functional connectome profiles associated with healthy and disease groups can have important applications, including data augmentation to improve the performance of FC classifiers. In particular, DL models require a large amount of training data to achieve reasonable performance for brain FC classification. This may not be feasible for small‐sample settings of fMRI in clinical practice, where classification inevitably encounters problems of overfitting and difficulty generalizing to unseen samples. Data augmentation has become a powerful technique for classification tasks with limited data by synthesizing realistic fake data to increase training sample sizes. Generative adversarial networks (GANs) (Goodfellow et al. 2014), a deep generative model for synthetic data generation, offer a novel method for data augmentation in both natural and medical image classifications (Antoniou et al. 2017; Frid‐Adar et al. 2018). GANs have been recently used for neuroimaging data augmentation mainly on raw image synthesis, such as raw MRI (Nguyen et al. 2020; Meyer et al. 2021) and 3D fMRI brain images (Zhuang et al. 2019). While a few recent works applied GANs to brain structural connectivity augmentation for brain disease classification (Barile et al. 2021; Li, Wei, et al. 2021), synthetic FC augmentation is largely unexplored.
Diffusion probabilistic models (Ho et al. 2020) have emerged as a powerful deep generative model, showing remarkable performance for natural and medical imaging. While diffusion models have been used for generating raw functional data such as EEG (Zhou et al. 2023) and fMRI (Yuan and Qiao 2024), their applications to brain connectivity generation are scarce. A graph‐based diffusion was proposed for structural network synthesis (Zong et al. 2024). A diffusion transformer was recently applied for fMRI‐based FC generation and augmentation for autism spectrum disorder diagnosis (Zhao et al. 2025), which, however, has not taken into account the SPD geometry of FC matrices in the generation process. Moreover, diffusion models are more computationally expensive in both model training and sampling compared to GANs.
Synthesizing FC data is a more challenging problem, as common measures of FC networks including covariance, cross‐correlation, and precision matrices are symmetric positive definite (SPD) matrices forming a special geometric structure of a cone‐shaped Riemannian manifold (You and Park 2021; Minh and Murino 2017). Since FC matrices lie on a SPD manifold, their elements which represent connecting edges are inherently inter‐related. Operations of FC matrices as manifold‐valued objects can be better performed based on their corresponding geometric structure of SPD manifold rather than the Euclidean geometry.
Despite its remarkable success for Euclidean data generation (arrays and grid matrices, e.g., natural images), GANs have been rarely applied for generating manifold‐valued data. Distance measures in existing GANs are inappropriate to approximate distance between the true and generated data distributions on manifolds. Direct application of conventional GANs for Euclidean data to generate manifold‐valued FC data such as in (Yao and Lu 2019) disregards the inherent SPD geometry, and thus fails to preserve the global network structure in the generated FC matrices.
In this paper, we target the problem of generating FC network data on the Riemannian manifold of the SPD space that can take into account all pairwise FC edges as a whole instead of treating edges independently from each other as in the conventional approaches. Relatively few but an increasing number of studies have conducted FC analysis using correlation matrices on the SPD manifold, including computing group‐level average and variability (Varoquaux et al. 2010), regression analysis (Deligianni et al. 2011), dynamic FC (Dai et al. 2019), and dimension reduction for machine learning (Qiu et al. 2015; Dan et al. 2022). For a review of SPD‐based FC analyses see (You and Park 2021). However, no prior work has studied brain FC data generation in the SPD space.
We propose a novel variant of GAN called SPD‐GAN for SPD manifold‐valued data generation to augment fMRI FC data for brain disorder classification. Our method is inspired by manifold‐aware Wasserstein GANs (WGAN) recently introduced in (Huang et al. 2019) for manifold‐valued image generation. Specifically, it generalizes the Wasserstein distance between the true and generated data distributions of existing WGAN to the Riemannian manifold of SPD matrices and incorporates logarithm and exponential maps in the adversarial loss for data mapping between the manifold and its tangent space. Reconstruction losses are also added to promote similarity between the true and generated data in both the manifold and tangent space. We utilize the affine‐invariant Riemannian metric (AIRM), a popular metric for SPD space, to define the distance between two SPD objects. Drawing ideas from MotionGAN (Otberdout et al. 2020) for motion generation on a hypersphere manifold, we further develop a conditional version of the SPD‐GAN that uses class labels to guide the generation of SPD‐valued data.
Furthermore, we build a graph to encode the inter‐dependency structure in the SPD data based on geodesic distance on the manifold. Additional graph regularization terms on the SPD manifold and tangent space are then incorporated into the conditional SPD‐GAN objective function to enforce the generator to generate SDP data that respects the unique dependency structure of the real data. The main contributions of this work are as follows:
To our best knowledge, this work is the first to explore a manifold‐aware GAN with specialized architecture and new cost functions that operate on SPD manifolds for synthesizing realistic brain FC matrices that can preserve the global dependency structure.
We propose a graph‐regularized conditional SPD‐GAN (GR‐SPD‐GAN), a novel extension of SPD‐valued WGAN by incorporating conditioned generation and graph regularization. (i) The conditional model enables class‐supervised generation of SPD data, which facilitates the generation of correlation‐based FC matrices according to different experimental groups (diseased and control). (ii) The graph regularization terms can regularize the SPD‐GAN generator to avoid mode collapse and produce more stable GAN training. We show empirically that the GR‐SPD‐GAN can generate better quality synthetic FC data in close resemblance to the true data distribution, and with a significant gain in terms of geometry scores compared to its unregularized counterparts. When applied to a population graph where each node represents a subject with an associated SPD‐valued FC matrix, the proposed graph‐regularization approach allows the generation of FC matrices that can reflect inter‐subject relationships, depending on the similarity and dissimilarity of connectivity structure between subjects.
We demonstrate the usefulness of the generated FC data by using them for data augmentation to enhance brain connectome‐based classification. Experimental results on a large resting‐state fMRI dataset of major depressive disorder (MDD) show that data augmentation using our GR‐SPD‐GAN leads to substantial improvement in MDD identification tasks with different downstream FC classifiers; it significantly outperforms several other state‐of‐the‐art GAN‐based generators that neglect the SPD geometry in FC data.
2. Background
2.1. Generative Adversarial Networks
The family of GAN techniques (Goodfellow et al. 2014) has been very successful for synthesizing natural images. The original GAN training is formulated as a min‐max game between two competing networks: the generator and the discriminator . The maps the random noise into synthetic data approximating the real data, while the learns to discriminate between the real and the generated data. Theoretically, this framework minimizes the Jensen–Shannon divergence between the distributions of true data and generated samples. The state‐of‐the‐art GANs like Wasserstein GAN (WGAN) (Arjovsky et al. 2017) minimizes the Wasserstein‐1 distance between the real and synthetic data distributions. An improved WGAN with gradient penalty (WGAN‐GP) (Gulrajani et al. 2017) was proposed to produce more stable GAN training by penalizing the norm of the discriminator with respect to its input. Consider a dataset of examples sampled from a real data distribution . In WGAN‐GP, and are trained by solving the following minimax problem
where is random noise, is the distribution of generated samples, and is random sample following distribution that is sampled along straight lines between pairs of points from and , is the gradient with respect to , and is the weighted coefficient of the gradient penalty.
2.2. The Geometry of SPD Manifolds
Let be an real SPD matrix which satisfies the property that for all non‐zero . The space of SPD matrices, denoted by is a Riemannian manifold . A Riemannian manifold is a smooth manifold equipped with a Riemannian metric for defining distances on the manifold. Geometrically, a tangent vector is a vector that is tangent to the manifold at a given point . Let denote the tangent space of at , that is, the set of all tangent vectors at which constitutes an Euclidean space. A Riemannian metric on is a family of inner product defined on each that varies smoothly with the base point . SPD matrices are most widely studied when endowed with the affine‐invariant Riemannian metric (AIRM) defined as
![]() |
(1) |
where are two tangent vectors at point . For SPD manifold , where is the vector space of symmetric matrices. The AIRM has several useful properties such as invariance to affine transformation and matrix inversion.
There are two main operations that connect the manifold and the tangent plane : (1) Riemannian exponential map at point , , which projects a tangent vector to a point in
| (2) |
where denotes the matrix exponential. (2) Riemannian logarithmic map at , , which maps any SPD matrix to its tangent space
| (3) |
where denotes the principal matrix logarithm. The geometrical interpretation of the exponential and logarithm maps on is shown in Figure 1. Under AIRM, the geodesic distance between two SPD matrices follows
| (4) |
where denotes the Frobenius norm.
FIGURE 1.

Exponential and logarithm maps on a Riemannian manifold (Minh and Murino 2017).
3. Methods
Figure 2 shows an overview of the proposed graph‐regularized manifold‐aware conditional Wasserstein GAN (GR‐SPD‐GAN) for SPD‐matrix‐valued data generation with an application to synthetic FC data augmentation for brain disorder classification. The proposed framework consists of three stages. (a) Model Training: FC matrices estimated from fMRI are represented as compact points on the SPD manifold. The GR‐SPD‐GAN is trained to learn the distribution of the SPD‐valued FC data associated with each class (brain disorder or healthy control group). Riemannian logarithm and exponential maps are exploited to minimize the Wasserstein distance between the distributions of real and generated data under adversarial training with a generator and a discriminator. A population graph that encodes inter‐subject similarity in FC structure is constructed based on geodesic distance on the SPD space. It is then used as a regularizer at the generator to force the generated FC data to respect the inter‐subject dependency in the real data. (b) Data augmentation: After training, we use the resulting generator to generate synthetic FC data as new points on the SPD manifold conditioned on the experimental class. The generative performance is measured by the geometric score that compares the geometrical properties of the underlying data manifold and the generated one. (c) FC‐based classification: The generated FC data is used to augment training data for subsequent brain disorder classification.
FIGURE 2.

Overview of the proposed framework: (A) GR‐SPD‐GAN is trained on a Riemannian manifold using and operations. Population graph regularizer is added to encourage inter‐subject similarities during the training. (B) Trained generator of GR‐SPD‐GAN is used to generate synthetic data. Geometry score is used to evaluate the synthetic dataset. (C) Synthetic data is used in data augmentation for FC classification.
3.1. SPD‐GAN Network
Consider a set of training samples , where represents the FC matrix of subject , and indicates the corresponding class label of the experimental group (i.e., : healthy control; : a certain brain disorder). The FC matrix can be estimated by cross‐correlations between fMRI time series extracted over voxels and brain regions of interest (ROIs). Our aim is to design a mapping function that converts random noise to synthetic FCs given the class . FC matrices (e.g., covariance and correlation matrices) are SPD data residing on an SPD manifold . Inspired by the manifold‐aware GANs (Huang et al. 2019; Otberdout et al. 2020) for image and motion generation, we propose a SPD manifold‐aware WGAN (SPD‐GAN) to exploit the geometry of the SPD manifold to learn the generative distribution of FCs associated with each experimental group. Analogouus to MotionGAN (Otberdout et al. 2020) for motion generation on a hypersphere manifold, our SPD‐GAN is a conditional version of manifold‐aware WGAN by (Huang et al. 2019), but generates new points in the SDP manifold.
The proposed SDP‐GAN architecture is shown in Figure 2A. It consists of two adversarial networks: (1) A generative model that maps an n‐dimensional noise vector sampled from a prior distribution to a synthetic SDP matrix from a manifold‐valued generating distribution . (2) A discriminative model that estimates the probability of a given input being sampled from the real data distribution (from the training data) rather than the generating distribution (from the ). Both and are conditioned on the class label . This conditioning is performed by incorporating as the input to the and . To generate valid SPD data from the generative network , we employ the Riemannian exponential map (2) at a reference point to transfer the output of , which is a symmetric matrix , to the SPD manifold. The logarithm map (3) is used to project the SPD matrices (the real data and the generated data ) to the tangent space at , before being presented as inputs to the discriminator network . Since the tangent space is a vector space, any regular neural networks designed for Euclidean data can be used as .
3.2. Loss Function
To train the and of the SPD‐GAN, we propose the following objective function, which is a weighted sum of the adversarial loss , the reconstruction loss in the SPD manifold , and the reconstruction loss in the tangent space
| (5) |
The adversarial loss generalizes the objective function of WGAN‐GP (Gulrajani et al. 2017) to the SPD‐manifold valued data, and is a conditional version of the objective function in (Huang et al. 2019).
| (6) |
where and are the Riemannian logarithm and exponential maps defined for the SPD manifold at a particular point , in (2) and (3), respectively. The first two terms denote the estimate of the Wasserstein distance between the real and generated SPD data that the generator learns to minimize. The last term represents the gradient penalty as in (Gulrajani et al. 2017) defined for the manifold‐valued data, where is sampled uniformly along straight lines between pairs of points sampled from the real data distribution and the generating distribution
| (7) |
with , and is the gradient with respect to .
We incorporate two reconstruction losses and to further encourage the generator to synthesize data approximating the real one on both the SPD manifold and its tangent space, respectively. measures the distances between the generated samples and their corresponding ground‐truth real data on the manifold
| (8) |
where is the squared geodesic distance under AIRM defined in (4). quantifies the similarities between the generated tangent vectors and their associated ground‐truth vectors projected on
| (9) |
where denotes the . The generator is trained to minimize the two reconstruction losses (8) and (9). To define the tangent space used in the training of our SPD‐GAN, we exploit the Fréchet mean (Fiori 2009; Karcher 1977) of the training data to define the reference point as
| (10) |
which is the minimizer of average squared geodesic distances over the training FC data.
3.3. Graph‐Regularized SPD‐GAN
Brain FC matrices tend to exhibit similar or highly‐correlated patterns between subjects sharing similar demographics, phenotypes or clinical conditions. To incorporate the relationships between subjects, we introduce a novel graph‐based regularizer in SPD manifold at the generator of the SPD‐GAN to enforce the generated data to respect the inter‐subject FC similarity in the real data .
3.4. Graph Construction
Graphs provide a natural way of representing a population of subjects and their relationships. Let be an undirected weighted population graph, where is the node set (each node represents a subject), is the edge set specified by , representing connections between subjects, and is the weighted adjacency matrix. The weight of an edge provides a measure of similarity between subjects and . The set of SPD‐matrix valued FC data can be considered as signals defined on the graph .
The graph‐based inter‐subject relationships can be computed using the Euclidean distance between vectorized FC matrices of pairs of subjects, given by
| (11) |
where the Gaussian heat kernel (Li, Kailkhura, et al. 2021) is adopted, and is the kernel scalar parameter. However, (11) neglects the geometric properties of SPD manifold. We exploit the geodesic distance between FCs to define the similarity between subjects
| (12) |
Note that the weights decay with distance. This can capture both the similarity and variations between subjects. Specifically, assumes a high value if and are similar, and a small value if they are different.
3.5. Graph‐Based Regularization in SPD Manifold
To encourage the SPD‐GAN generator to generate FC matrices that respect the inter‐subject similarity in the real data, we further introduce two additional graph regularization terms, on both the SPD manifold and its tangent space at , defined respectively by
| (13) |
| (14) |
where and are the generated SPD data and associated tangent vectors, respectively. Note that in , geodesic distance is used to define the pairwise differences in the generated SPD matrices and over connected subjects and . By minimizing the objective functions (13) and (14), it enforces the generator to synthesize similar (or smooth) SPD matrices for subjects with high similarity in brain FC structure, as defined over the population graph of the real data.
Incorporating the graph regularizers (13) and (14) into the objective function (5) of SPD‐GAN gives a new objective of the graph‐regularized SPD‐GAN (GR‐SPD‐GAN) as follows
| (15) |
During the training, the generator generates a matrix , where and operations are applied to enforce the generated output to be projected on the desired manifold and tangent space based on . The discriminator takes the tangent vector , and generated tangent vector as input; to score the probability of a data belonging to the true data distribution . Population graph is applied to impose regularization on generator to generate close FCs resemblance on both manifold and tangent space . As such, the generator learns the real data distribution and synthesizes similar data. The proposed graph regularization of the generator can improve the GAN training with more stable training. This can result in significantly better quality of the generated SPD data compared to the unregularized SDP‐GAN, as measured in terms of geometric score. Algorithm 1 shows the training steps of GR‐SPD‐GAN. Both the generator and discriminator are implemented via neural networks, parametrized with network parameters and , respectively. The details of the network architecture for and are described in Section 4.2. During the training process, the network parameters of and are updated alternately by fixing one of them.
ALGORITHM 1. Training of the proposed GR‐SPD‐GAN.
Require: , training data with their corresponding labels ; , initial generator parameters; , initial discriminator parameters; , batch size; , learning rate, , weights of constructed population‐graph; , the discriminator iterations per generation iteration; , generator update iterations.
1: for n = 0, …, do
2: for t = 0, …, do
3: Sample a batch of examples from real data distribution
4: Sample a batch of noise samples from noise prior
5: Compute stochastic gradient of Equation (15) with respect to
6: Update ()
7: end for
8: Sample a batch of examples from real data distribution
9: Sample a batch of noise samples from noise prior
10: Compute stochastic gradient of Equation (15) with respect to
11: Update ()
12: end for
3.6. Data Augmentation for FC Classification
After training, we apply the learned generator of GR‐SPD‐GAN to generate synthetic FC matrices for data augmentation in brain disorder classification. The generation steps are outlined in Algorithm 2. The generator receives input from the noise prior to generate a point on the tangent space conditioned on each class of the brain networks, which is then transformed by using the exponential map to its corresponding point on the SPD manifold, which represents a FC matrix of a subject for a particular class. The generated FC data is combined with the original training set, and the augmented data is used to train a downstream FC‐based classifier to discriminate between brain disorder and healthy control groups. We consider three types of widely‐used FC classifiers: support vector machine (SVM), convolutional neural network (CNN) and BrainNetCNN (Kawahara et al. 2017).
ALGORITHM 2. SPD data generation by GR‐SPD‐GAN.
Require: , Generator trained with Algorithm 1; , training data; , FC class condition.
1: Sample random noise samples from noise prior
2: Generate , points on the tangent space of the manifold
3: Generate , by mapping the generated points to the manifold at reference point using Equation (2)
4: Concatenate and as augmented training dataset
4. An Application to fMRI FC Data Augmentation for Major Depression Identification
In this section, we present an experimental evaluation of the proposed GR‐SPD‐GAN model for generating realistic FC data and its usefulness for data augmentation for MDD classification on a large MDD fMRI dataset.
4.1. Data Acquisition and Pre‐Processing
We used a resting‐state fMRI dataset of 684 healthy controls (HC) and 747 Major Depressive Disorder (MDD) patients from the open‐access REST‐meta‐MDD Consortium database (Yan et al. 2019). The MDD patients had a Hamilton Depression Rating Scale (HAMD) scores of . The data were acquired using a Siemens (Tim Trio 3T) scanner (TR/TE = 2000/30 ms, 3 mm slice thickness). We also employed the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset for an ablation study. The dataset included 38 normal controls (CN), 34 individuals with mild cognitive impairment (MCI), and 35 Alzheimer's disease (AD) patients. For both datasets, the data were pre‐processed using Data Processing Assistant for Resting‐State fMRI (DPARSF) (Yan and Zang 2010). The first 10 time points were discarded. Slice timing correction and head motion correction were then applied. The data were normalized to MNI space using unified segmentation of the T1 image and resampled to 3‐mm isotropic voxels. A 4 mm FWHM Gaussian kernel was applied for smoothing (except for ReHo). The linear trend of the time courses was removed, followed by temporal band‐pass filtering (0.01–0.08 Hz), with the exception of fALFF. We used the Automated Anatomical Labeling (AAL) atlas to obtain brain parcellation into 116 region‐of‐interests (ROIs), including cortical and subcortical areas, and extracted mean time series of time points for each ROI. We finally estimated a FC matrix for each subject, based on Pearson's correlations between ROIs. We computed the Fréchet mean of the brain FCs of each class (HC and MDD) by (10). Additionally, we examine the model performance on the REST‐meta‐MDD Consortium database using the CC200 atlas (Craddock et al. 2012) to investigate our models applicability to different atlases.
4.2. Model Architecture and Training
The and of the proposed GR‐SPD‐GAN are convolutional neural networks with multiple upsampling and downsampling blocks. The generator takes the concatenation of a 100‐dimensional random vector sampled from a standard Gaussian distribution, and a 2‐dimensional one‐hot vector that encodes the class label of the brain networks (MDD or HC) as inputs. This vector is mapped to one fully‐connected (dense) layer of 61,952 outputs, and four convolutional upsampling blocks with 256, 128, 64, and 1 output channels. Each upsampling block consists of the nearest‐neighbor upsampling followed by a stride convolution (Conv) with LeakyReLU activation function. We have a final dense layer with hyperbolic tangent activation to produce a vector, which is then reshaped to a symmetric matrix as the final output of the GR‐SPD‐GAN generator. The discriminator takes a FC matrix as input. It consists of three downsampling blocks with 256, 128, 64 output channels, where each block is a stride Conv layer followed by batch normalization and LeakyReLU activation. The last dense layer takes both the outputs of the downsampling layers and the class label, and uses the sigmoid activation function to output the probability of a FC sample being sampled from the true data distribution. We trained the GR‐SPD‐GAN with learning rate of 0.0001 using the Adam optimizer (Kingma and Ba 2014), 100 training epochs, and a mini‐batch size of 32. The hyper‐parameters in (15) are empirically determined as from a range of parameters, which gave the optimal performance.
4.3. Methods for Comparison
We benchmark the performance of our proposed method with several standard and state‐of‐the‐art GAN‐based generative models, including the 1‐dimensional deep convolutional GAN (1D‐DCGAN), 2‐dimensional DCGAN (2D‐DCGAN) (Radford et al. 2015), WGAN (Arjovsky et al. 2017), and WGAN‐GP (Gulrajani et al. 2017). The generators of all competing models are composed of three stacked upsampling blocks, including convolutional transpose and batch normalization layers. All of the GANs take brain FC matrices as inputs except the Vanilla‐GAN (Goodfellow et al. 2014) and 1D‐DCGAN, which take the vectorized FC for training. The discriminator/critic in the GANs is composed of two stacked downsampling convolutional blocks where the activation function of the last fully connected layer is defined as sigmoid/linear, and batch normalization layers are excluded in WGAN‐GP. The discriminator of Vanilla‐GAN uses fully connected layers instead of convolutional blocks. Table 1 shows the trainable parameters for the generator and discriminator/critic for each GAN.
TABLE 1.
Trainable model parameters of different GANs.
| GAN | Generator | Discriminator/Ctiric |
|---|---|---|
| Vanilla‐GAN | 176,495,118 | 121,856,001 |
| 1D‐DCGAN | 240,795 | 279,515 |
| 2D‐DCGAN | 2,447,553 | 742,369 |
| WGAN | 2,447,553 | 742,369 |
| WGAN‐GP | 2,447,553 | 742,369 |
| GR‐SPD‐GAN | 2,447,553 | 742,369 |
4.4. Performance Measures
We use the recently proposed geometry score (GS) (Khrulkov and Oseledets 2018) which provides both the qualitative and quantitative means to assess the quality of generated samples by GANs. GS compares the topological properties of the underlying real data manifold and the generated one, with a lower value indicating a better match. It is agnostic to the type of data and is shown to be more expressive in capturing various failure modes of GANs compared to the conventional inception score and Fréchet inception distance. GS is computed based on mean relative living times (MRLT)
can be interpreted as a probability distribution (over non‐negative integers ) that defines the certainty about the correct number of k‐dimensional holes (or connected components) in the underlying manifold of a dataset on average. As in (Khrulkov and Oseledets 2018), we used to study the first homology of datasets.
Additionally, for each generated FC matrix, we computed the highest Pearson correlation with all training FCs. This process is repeated for 100 generated samples, and we report the mean and variance of the resulting maximum correlations. The same procedure was applied to the graph‐based measures. Additionally, we evaluate the synthesized FC by comparing its graph‐theoretical properties to the ground truth, using Pearson correlation for the network, node strength, and edge betweenness centrality. Specifically, node strength refers to the sum of the weights of the links connected to a node, while edge betweenness centrality represents the proportion of all shortest paths in the network that pass through a given edge.
4.5. FC Augmentation and Classification
We assess the usefulness of the generated FC data from GR‐SPD‐GAN and the baseline generative methods via data augmentation for classifying MDD/healthy control. We trained the downstream FC classifiers on the augmented training data with different amounts of generated data, that is, , , and multiple of the size of the original training set. To measure classification performance, we used accuracy, recall, precision, F1‐score, and receiver operating characteristic (ROC). We applied the nested‐stratified 5‐fold cross‐validation (CV) (Pereira et al. 2009) data partitioning scheme to evaluate the FC classification performance. Outer‐folds consist of training and testing sets, where the training set was further split into 5‐inner‐folds training and validation sets. For each outer‐fold, a GAN model was trained using the real training set to generate a fold‐dependent synthetic dataset. Synthetic data were then combined with the real training data of inner‐folds for the classifier training and hyper‐parameters tuning. We used the Optuna library (Akiba et al. 2019) to determine the best hyper‐parameters of each classifier. The test set performance is evaluated based on the selected best models using accuracy. We conduct a paired t‐test to statistically evaluate the classification improvement achieved with augmented dataset over the baseline methods. Specifically, we recorded the classification performance metrics for each fold. The t‐test was then conducted by comparing the fold‐wise performance metric of the GANs against those of the baseline across the five folds. Corrections for multiple testings across performance metrics were performed using false discovery rate (FDR).
The hyper‐parameters search range, model configuration, and the training of the three FC classifiers considered are:
1. SVM: SVM has been used to classify brain FC with reasonably good accuracy. We trained SVM on flattened (vectorized) brain FCs. The ranges considered in the hyper‐parameter tuning: kernel function in [“rbf”, “linear”], regularization parameter from to , and gamma value from to .
2. CNN: We adopt CNN with stacked convolutional blocks, each block consists of Conv2D, BatchNorm, and max‐pooling (MaxPool) layers. Here, the correlation‐based FC matrices were used directly as inputs to the CNN. The convolutional blocks learn high‐level spatial FC which are then fed into the fully‐connected and Softmax layers for classification. Hyper‐parameter search space: batch size from to , number of convolutional layers from to , kernel size in , number of fully‐connected layers from to .
3. BrainNetCNN: BrainNetCNN is a specially‐designed CNN for brain connectivity with special convolutional kernels to preserve the brain network structure. It consists of three types of layers: edge‐to‐edge, edge‐to‐node and node‐to‐graph convolutional layers to capture topological relationships between brain network edges. Hyper‐parameter search space: L2 regularization weight decay from to , scheduler learning rate reduce factor from to , batch size from to .
We implemented all DL models for FC data generation and classification based on the Tensorflow (Abadi et al. 2016) framework on a PC with a single GPU‐NVIDIA Quadro P5000 16 GB.
5. Results
5.1. Quality of Synthetic FC Data
Table 2 shows the geometry scores of the FC samples generated by different GANs. The proposed SPD‐GANs obtained significantly better performance than classical GANs, with GR‐SPD‐GAN achieving the lowest value of geometry score for HC, MDD, and all classes, suggesting the highest similarity between the true and synthetic data distributions and its ability to avoid mode collapse. This is further confirmed by the MRLTs in Figure 3 that provides a visual quality of generated results. The synthetic data distributions by GR‐SPD‐GAN have the closest resemblance to the real data distributions for HC, MDD, and all classes, in contrast to the obvious deviations from the ground‐truth for the 1D‐DCGAN and WGAN. Table 3 shows similarity in FC edges and network measures between the generated and real functional networks quantified in terms of Person correlation. The FC generated using GR‐SPD‐GAN has shown higher Pearson correlation with the real data than other GANs, indicating more realistic synthesis of FC networks in both connectivity edges and network topology. These results imply that generation of synthetic FC is better performed on SPD manifold. The effectiveness of GR‐SPD‐GAN in synthesizing more realistic FC is owing to its generation mechanism that leverages on the right underlying geometry (the Riemannian manifold) of the correlation‐based FC matrices as SPD objects, and thus preserving the interrelatedness of edges in the generated FC matrices. On the other hand, GANs traditionally designed for Euclidean‐valued data fail to capture the correlated nature of edges in real FC matrices. Among the SPD‐GANs, inclusion of the reconstruction losses and graph regularizers in GR‐SPD‐GAN produce a better match to the real data distributions. In Figure 4, we compare qualitatively synthetic FC matrices generated by different GANs. Results shown are group averages for the MDD and HC groups. Again, we observe that our model generated more realistic samples that can preserve the overall connectivity patterns and fine details of edges in the real FC networks. We further compared the distributions of the highest correlations of the generated FC samples with that of real test‐set samples against the real training FCs (see Section A in Supporting Information). In Figure S1, we can see that distribution for the synthetic FC of GR‐SPD‐GAN more closely resembles that for the test‐set compared to other GANs, suggesting better performance in generating variations of samples beyond the training set.
TABLE 2.
Synthetic FC data evaluation with geometry score , where a lower score indicates better performance.
| Methods | HC | MDD | All | SPD | |
|---|---|---|---|---|---|
| Competing | Vanilla‐GAN | 729.8 | 647.2 | 843.8 | X |
| 1D‐DCGAN | 38.1 | 59.2 | 46.9 | X | |
| 2D‐DCGAN | 601.1 | 70.7 | 54.0 | X | |
| WGAN | 59.8 | 65.0 | 23.1 | X | |
| WGAN‐GP | 447.0 | 535.5 | 912.8 | X | |
| Ours | SPD‐GAN (w/o ) | 160.5 | 93.3 | 423.3 | ✓ |
| SPD‐GAN (w/ ) | 35.3 | 44.0 | 20.7 | ✓ | |
| GR‐SPD‐GAN | 29.3 | 52.9 | 5.4 | ✓ |
Note: Bold values indicate the best performance for each metric.
FIGURE 3.

Comparison of MRLT on the real and synthetic FC data distributions generated by 1D‐DCGAN, WGAN, and SPD‐GAN variants.
TABLE 3.
Similarity in FC edges and complex network measures between the synthetic and real FC as measured by Person's correlation coefficients. Higher values indicate closer resemblance of synthetic FCs to real ones.
| Methods | FC edges | Node strength | Edge betweeness centrality | ||
|---|---|---|---|---|---|
| Competing | Vanilla‐GAN | 0.53 0.06 | 0.67 0.08 | 0.11 0.02 | |
| 1D‐DCGAN | 0.10 0.11 | 0.30 0.16 | 0.06 0.02 | ||
| 2D‐DCGAN | 0.54 0.49 | 0.53 0.08 | 0.14 0.02 | ||
| WGAN | 0.51 0.47 | 0.64 0.09 | 0.14 0.02 | ||
| WGAN‐GP | 0.52 0.17 | 0.62 0.03 | 0.15 0.02 | ||
| Ours | SPD‐GAN (w/o ) |
0.75
|
0.84 0.02 | 0.19 0.03 | |
| SPD‐GAN (w/ ) | 0.68 0.35 | 0.63 0.07 |
0.29
|
||
| GR‐SPD‐GAN | 0.66 0.26 |
0.84
|
0.18 0.03 |
Note: Bold values indicate the best performance for each metric.
FIGURE 4.

Ground‐truth fMRI‐derived FC matrices for MDD and HC (left). Generated samples by 1D‐DCGAN, WGAN, and our GR‐SPD‐GAN.
5.2. Results for FC Classification
Table 4 shows the FC classification results for data augmentation using different GANs. Data augmentation using GANs is generally beneficial, giving significant gains ( FDR‐corrected) in classification performance over the baseline models without data augmentation. Among the GAN generators, using generated data from our GR‐SPD‐GAN is clearly the most conducive to improving the classification performance across all classifiers, followed by WGAN and DCGANs. This is evident from the substantially larger margin of increase in all performance measures compared to the other GANs. The better classification performance with data augmentation using GR‐SPD‐GAN is due to the high quality of the synthetic FC data with SPD structure intact as shown in results in Figures 3 and 4; Table 2.
TABLE 4.
MDD classification performance of different FC classifiers trained on the original training set (real) and augmented datasets with an increasing amount of synthesized FC data using the proposed GR‐SPD‐GAN and other GAN‐based generative models.
| Classifier | GAN type | Train set | Accuracy | Recall | Precision | F1‐score | ROC |
|---|---|---|---|---|---|---|---|
| SVM | — | Real | 90.97 ± 1.65 | 91.11 ± 2.45 | 91.59 ± 1.02 | 91.33 ± 1.60 | 90.98 ± 1.62 |
| Vanilla‐GAN | Real + Synth. 1× | 91.14 ± 1.86 | 91.28 ± 2.61 | 91.76 ± 1.29 | 91.50 ± 1.78 | 91.14 ± 1.84 | |
| Real + Synth. 2× | 91.49 ± 1.61 | 91.49 ± 1.61 | 91.52 ± 1.61 | 91.49 ± 1.61 | 91.46 ± 1.61 | ||
| Real + Synth. 3× | 91.50 ± 1.81 | 92.17 ± 2.02 | 91.64 ± 1.68 | 91.89 ± 1.74 | 91.47 ± 1.81 | ||
| 1D‐DCGAN | Real + Synth. 1× | 91.00 ± 1.57 | 90.89 ± 2.14 | 91.82 ± 1.31 | 91.34 ± 1.51 | 91.01 ± 1.55 | |
| Real + Synth. 2× | 91.03 ± 1.53 | 90.94 ± 2.10 | 91.83 ± 1.30 | 91.36 ± 1.47 | 91.03 ± 1.51 | ||
| Real + Synth. 3× | 91.07 ± 1.70 | 91.38 ± 2.39 | 91.53 ± 1.31 | 91.43 ± 1.65 | 91.07 ± 1.68 | ||
| 2D‐DCGAN | Real + Synth. 1× | 90.79 ± 1.58 | 90.64 ± 2.38 | 91.66 ± 1.36 | 91.12 ± 1.55 | 90.80 ± 1.56 | |
| Real + Synth. 2× | 91.17 ± 1.63 | 91.42 ± 2.14 | 91.68 ± 1.56 | 91.53 ± 1.59 | 91.15 ± 1.62 | ||
| Real + Synth. 3× | 91.60 ± 1.57* | 92.13 ± 1.69 | 91.84 ± 1.68 | 91.97 ± 1.50* | 91.57 ± 1.57 | ||
| WGAN | Real + Synth. 1× | 91.52 ± 1.56* | 91.82 ± 2.20 | 91.94 ± 1.20 | 91.87 ± 1.53* | 91.51 ± 1.55 | |
| Real + Synth. 2× | 91.60 ± 1.60* | 92.07 ± 2.13 | 91.90 ± 1.27 | 91.96 ± 1.53* | 91.58 ± 1.59 | ||
| Real + Synth. 3× | 91.54 ± 1.55* | 91.98 ± 2.11 | 91.86 ± 1.20 | 91.91 ± 1.50 | 91.53 ± 1.54 | ||
| WGAN‐GP | Real + Synth. 1× | 91.38 ± 1.74 | 91.90 ± 2.01 | 91.64 ± 1.65 | 91.75 ± 1.69 | 91.35 ± 1.73 | |
| Real + Synth. 2× | 91.42 ± 1.74 | 92.08 ± 1.95 | 91.56 ± 1.68 | 91.81 ± 1.68 | 91.39 ± 1.73 | ||
| Real + Synth. 3× | 91.40 ± 1.74 | 91.89 ± 1.99 | 91.69 ± 1.62 | 91.78 ± 1.69 | 91.38 ± 1.73 | ||
| GR‐SPD‐GAN | Real + Synth. 1× | 91.04 ± 0.71 | 90.65 ± 1.39 | 92.10 ± 0.94 | 91.32 ± 0.74 | 91.04 ± 0.74 | |
| Real + Synth. 2× | 91.15 ± 2.06 | 91.34 ± 2.37 | 91.73 ± 2.11 | 91.52 ± 1.98 | 91.14 ± 2.05 | ||
| Real + Synth. 3× | 92.03 ± 2.36* | 93.05 ± 2.46* | 91.81 ± 2.30 | 92.42 ± 2.28* | 91.99 ± 2.36 | ||
| CNN | — | Real | 87.42 ± 2.41 | 88.90 ± 4.45 | 87.37 ± 2.54 | 87.86 ± 2.04 | 87.33 ± 2.42 |
| Vanilla‐GAN | Real + Synth. 1× | 88.75 ± 2.30 | 92.11 ± 2.39 | 87.51 ± 4.87* | 89.60 ± 1.62 | 88.59 ± 2.44 | |
| Real + Synth. 2× | 88.68 ± 1.38 | 87.83 ± 0.93 | 90.26 ± 2.03 | 89.02 ± 1.25 | 88.73 ± 1.39 | ||
| Real + Synth. 3× | 88.61 ± 2.04 | 91.08 ± 3.13 | 87.77 ± 3.14 | 89.32 ± 1.81 | 88.53 ± 2.08 | ||
| 1D‐DCGAN | Real + Synth. 1× | 89.03 ± 0.66 | 90.34 ± 2.71 | 88.89 ± 1.44 | 89.56 ± 0.81 | 88.94 ± 0.68 | |
| Real + Synth. 2× | 89.59 ± 2.28* | 90.16 ± 2.65 | 90.05 ± 2.85 | 90.07 ± 2.08* | 89.60 ± 2.28* | ||
| Real + Synth. 3× | 88.47 ± 1.34 | 87.97 ± 2.90 | 89.83 ± 1.80 | 88.84 ± 1.31 | 88.49 ± 1.31 | ||
| 2D‐DCGAN | Real + Synth. 1× | 88.33 ± 1.03 | 88.62 ± 1.78 | 89.01 ± 1.67 | 88.79 ± 1.02 | 88.31 ± 1.02 | |
| Real + Synth. 2× | 88.40 ± 1.70 | 90.50 ± 1.97 | 87.80 ± 2.25 | 89.10 ± 1.26 | 88.26 ± 1.86 | ||
| Real + Synth. 3× | 89.10 ± 1.43 | 90.39 ± 3.29 | 88.95 ± 1.46 | 89.62 ± 1.51* | 89.06 ± 1.38 | ||
| WGAN | Real + Synth. 1× | 89.31 ± 1.80 | 90.23 ± 1.71 | 89.42 ± 2.38 | 89.81 ± 1.69*† | 89.28 ± 1.84 | |
| Real + Synth. 2× | 89.45 ± 1.51 | 88.99 ± 3.40 | 90.66 ± 1.26* † | 89.77 ± 1.67 | 89.42 ± 1.46 | ||
| Real + Synth. 1× | 88.05 ± 1.50 | 88.16 ± 6.70 | 89.74 ± 6.23 | 88.49 ± 1.39 | 88.00 ± 1.70 | ||
| WGAN‐GP | Real + Synth. 1× | 89.24 ± 1.59 | 90.92 ± 1.22 | 88.85 ± 2.84* | 89.84 ± 1.32* | 89.18 ± 1.64 | |
| Real + Synth. 2× | 88.47 ± 1.15 | 87.25 ± 2.34 | 90.35 ± 1.81 | 88.74 ± 1.31 | 88.50 ± 1.15 | ||
| Real + Synth. 3× | 89.03 ± 2.05 | 88.46 ± 2.91 | 90.31 ± 1.86 | 89.36 ± 2.09 | 89.03 ± 1.98 | ||
| GR‐SPD‐GAN | Real + Synth. 1× | 89.66 ± 1.57 | 91.17 ± 2.25 | 89.36 ± 2.42 | 90.22 ± 1.35 | 89.57 ± 1.63 | |
| Real + Synth. 2× | 88.82 ± 1.81 | 87.78 ± 2.04 | 90.58 ± 2.92 | 89.12 ± 1.78 | 88.84 ± 1.82 | ||
| Real + Synth. 3× | 89.73 ± 1.14* † | 89.95 ± 1.41 | 90.40 ± 2.51 | 90.14 ± 1.06* † | 89.72 ± 1.19* † | ||
| BrainNetCNN | — | Real | 88.19 ± 1.45 | 88.73 ± 3.08 | 88.74 ± 2.14 | 88.68 ± 1.48 | 88.13 ± 1.46 |
| Vanilla‐GAN | Real + Synth. 1× | 88.40 ± 2.83 | 89.12 ± 4.20 | 88.91 ± 3.04 | 88.94 ± 2.58 | 88.42 ± 2.82 | |
| Real + Synth. 2× | 88.96 ± 1.82 | 87.35 ± 5.41 | 91.32 ± 1.93 | 89.15 ± 2.00 | 89.08 ± 1.66 | ||
| Real + Synth. 3× | 88.89 ± 1.50 | 89.46 ± 2.30 | 89.36 ± 1.72 | 89.38 ± 1.26 | 88.87 ± 1.53 | ||
| 1D‐DCGAN | Real + Synth. 1× | 88.89 ± 2.12 | 89.05 ± 2.24 | 89.74 ± 2.97 | 89.35 ± 1.83 | 88.89 ± 2.19 | |
| Real + Synth. 2× | 89.59 ± 1.95 | 90.79 ± 3.36 | 89.44 ± 0.66 | 90.09 ± 1.89 | 89.53 ± 1.95 | ||
| Real + Synth. 3× | 89.73 ± 1.04 | 89.93 ± 2.66 | 90.42 ± 1.76 | 90.13 ± 1.05 | 89.68 ± 1.06 | ||
| 2D‐DCGAN | Real + Synth. 1× | 88.33 ± 1.59 | 90.13 ± 1.83 | 87.89 ± 1.27 | 88.99 ± 1.31 | 88.25 ± 1.64 | |
| Real + Synth. 2× | 89.59 ± 1.44*† | 90.13 ± 3.09 | 90.08 ± 2.49 | 90.03 ± 1.34*† | 89.58 ± 1.46*† | ||
| Real + Synth. 3× | 90.29 ± 2.04 | 90.16 ± 3.95 | 91.21 ± 0.64 | 90.64 ± 1.95 | 90.32 ± 1.99 | ||
| WGAN | Real + Synth. 1× | 89.80 ± 2.17* | 91.21 ± 3.55 | 89.55 ± 2.12 | 90.32 ± 2.05* | 89.74 ± 2.18* | |
| Real + Synth. 2× | 89.38 ± 1.89 | 90.32 ± 3.59 | 89.46 ± 1.63 | 89.85 ± 1.93* | 89.28 ± 1.84 | ||
| Real + Synth. 3× | 89.24 ± 2.09 | 88.62 ± 2.06 | 90.62 ± 2.30 | 89.60 ± 1.97 | 89.24 ± 2.09 | ||
| WGAN‐GP | Real + Synth. 1× | 89.59 ± 2.45 | 90.41 ± 3.40 | 89.84 ± 2.58 | 90.08 ± 2.28 | 89.56 ± 2.46 | |
| Real + Synth. 2× | 89.24 ± 2.52 | 89.60 ± 2.96 | 89.84 ± 2.07 | 89.70 ± 2.29 | 89.21 ± 2.54 | ||
| Real + Synth. 3× | 89.31 ± 2.53 | 87.76 ± 4.41 | 91.48 ± 1.50* | 89.53 ± 2.52 | 89.43 ± 2.45 | ||
| GR‐SPD‐GAN | Real + Synth. 1× | 89.66 ± 1.57*† | 91.17 ± 2.25* | 89.36 ± 2.42 | 90.22 ± 1.35*† | 89.57 ± 1.63*† | |
| Real + Synth. 2× | 90.36 ± 1.55* † | 93.06 ± 2.03 | 89.04 ± 2.06 | 90.98 ± 1.36* † | 90.24 ± 1.59* † | ||
| Real + Synth. 3× | 90.01 ± 1.52* | 88.91 ± 3.20 | 91.75 ± 1.27* † | 90.27 ± 1.59 | 90.06 ± 1.47* |
Note: Results are averages (standard deviations) of performance measures over 5‐fold cross‐validation. Higher scores indicate better performance. Statistical significance over baseline without augmentation is indicated by *p < 0.05, uncorrected and † p < 0.05, FDR‐corrected across performance metrics. Bold values indicate the best performance for each metric.
Additionally, experiments conducted on the atlas CC‐200 also demonstrated improved performance as shown in Table 5, further affirming the applicability of GR‐SPD‐GAN to other atlases. Additional evaluation based on site‐specific FC data augmentation and classification shows a larger percentage of performance improvement compared to models trained on combined data from all different acquisition sites of the REST‐meta MDD database (see Section B in Supporting Information). In contrast, the standard GAN architectures that neglect the SPD geometry are limited by the problem of mode collapse, producing only a small variety of FC samples with many duplicates (modes). When used for data augmentation, this will give little improvements or even degrade the classification performance due to overfitting of classification models on duplicated samples. We also note that results saturate when augmenting with synthetic data by more than three times the original training set. Moreover, the variances of classification performance measures obtained by data augmentation using GR‐SPD‐GAN are substantially lower in all classifiers compared to other GANs. This indicates that FC classifiers trained on data generated by GR‐SPD‐GAN achieved consistently good performance across different experimental folds, again implying better generalizability of the classifiers due to augmentation with better quality and higher diversity of the synthesized data (Table 7). We tested the performance of the FC classifier trained using only the synthesized FC data from GANs. Results are shown in Table 6. As expected, the classification performance of models trained on synthetic data alone is lower than that trained with real data. However, our GR‐SPD‐GAN is able to achieve reasonably good performance (e.g., accuracy of 62.06F1‐score of 66.03) and outperformed other GANs. Additional evaluation on the ADNI dataset also shows improved FC classification with GR‐SPD‐GAN data augmentation (see Section C in Suporting Information).
TABLE 5.
MDD classification performance of different FC classifiers trained on CC200 atlas.
| Classifier | GAN type | Train set | Accuracy | Recall | Precision | F1‐score | ROC |
|---|---|---|---|---|---|---|---|
| BrainNetCNN | — | Real | 40.59 ± 1.51 | 61.16 ± 35.61 | 53.09 ± 3.31 | 50.13 ± 17.74 | 50.20 ± 0.78 |
| 2D‐DCGAN | Synth. 1× | 53.22 ± 2.34 | 85.02 ± 24.40 | 55.580 ± 7.05 | 63.86 ± 7.87 | 52.32 ± 3.41 | |
| WGAN | Synth. 1× | 55.68 ± 2.11 | 61.89 ± 19.76 | 58.12 ± 4.29 | 57.93 ± 7.59 | 54.61 ± 3.19 | |
| WGAN‐GP | Synth. 1× | 52.67 ± 1.29 | 80.84 ± 22.71 | 53.83 ± 2.96 | 62.75 ± 7.12 | 51.61 ± 1.60 | |
| GR‐SPD‐GAN | Synth. 1× | 60.27 ± 5.48 | 53.79 ± 22.04 | 66.91 ± 9.02 | 56.77 ± 9.87 | 60.88 ± 5.29 |
Note: Results are averages (standard deviations) of performance measures over 5‐fold cross‐validation. Higher scores indicate better performance.
TABLE 7.
Effect of incorporating different component losses in the training of GR‐SPD‐GAN to augment data for FC classification.
| Ours |
|
|
|
Classifiers | Accuracy | Recall | Precision | F1‐score | ROC | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SPD‐GAN (w/o ) | ✓ | X | X | SVM | 89.95 ± 1.62 | 89.95 ± 1.62 | 90.08 ± 1.56 | 89.95 ± 1.62 | 90.01 ± 1.60 | |||
| CNN | 89.73 ± 1.78 | 91.09 ± 2.71 | 89.61 ± 3.01 | 90.28 ± 1.52 | 89.70 ± 1.77 | |||||||
| BrainNetCNN | 89.80 ± 1.89 | 91.18 ± 2.17 | 89.62 ± 2.75 | 90.35 ± 1.61 | 89.71 ± 1.97 | |||||||
| SPD‐GAN (w/ ) | ✓ | ✓ | X | SVM | 90.89 ± 1.56 | 90.89 ± 1.56 | 90.94 ± 1.56 | 90.88 ± 1.56* | 90.85 ± 1.57 | |||
| CNN | 89.17 ± 1.64 | 89.29 ± 1.88 | 90.05 ± 3.21 | 89.61 ± 1.40 | 89.15 ± 1.67 | |||||||
| BrainNetCNN | 89.45 ± 1.76 | 89.59 ± 2.1 | 90.18 ± 1.32 | 89.87 ± 1.59 | 89.44 ± 1.75 | |||||||
| GR‐SPD‐GAN | ✓ | ✓ | ✓ | SVM | 92.03 ± 2.36 | 93.05 ± 2.46 | 91.81 ± 2.30 | 92.42 ± 2.28 | 91.99 ± 2.36 | |||
| CNN | 89.73 ± 1.14 | 89.95 ± 1.41 | 90.40 ± 2.51 | 90.14 ± 1.06 | 89.72 ± 1.19 | |||||||
| BrainNetCNN | 90.36 ± 1.55 | 93.06 ± 2.03 | 89.04 ± 2.06 | 90.98 ± 1.36 | 90.24 ± 1.59 |
Note: Higher scores indicate better performance. Bold values indicate the best performance for each metric.
TABLE 6.
MDD classification performance of different FC classifiers trained on augmented datasets only.
| Classifier | GAN type | Train set | Accuracy | Recall | Precision | F1‐score | ROC |
|---|---|---|---|---|---|---|---|
| BrainNetCNN | — | Real | 88.19 ± 1.45 | 88.73 ± 3.08 | 88.74 ± 2.14 | 88.68 ± 1.48 | 88.13 ± 1.46 |
| 1D‐DCGAN | Synth. 1× | 59.39 ± 3.71 | 70.95 ± 24.19 | 62.08 ± 7.38 | 62.97 ± 7.73 | 58.63 ± 5.29 | |
| 2D‐DCGAN | Synth. 1× | 58.70 ± 3.01 | 77.45 ± 4.12 | 57.80 ± 2.99 | 66.15 ± 2.95 | 57.84 ± 2.59 | |
| WGAN | Synth. 1× | 58.07 ± 1.84 | 60.92 ± 5.78 | 59.46 ± 1.89 | 60.12 ± 3.75 | 57.83 ± 1.61 | |
| WGAN‐GP | Synth. 1× | 60.73 ± 7.00 | 66.96 ± 10.48 | 61.67 ± 6.40 | 63.83 ± 7.16 | 60.49 ± 7.04 | |
| GR‐SPD‐GAN | Synth. 1× | 62.06 ± 6.83 | 70.50 ± 7.93 | 62.80 ± 7.56 | 66.03 ± 5.34 | 61.49 ± 7.08 |
Note: Results are averages (standard deviations) of performance measures over 5‐fold cross‐validation. Higher scores indicate better performance. Bold values indicate the best performance for each metric.
5.3. Ablation Study
We conducted an ablation study to evaluate the effect of each loss component used in training of our GR‐SPD‐GAN on FC classification performance. We consider three variants of GR‐SPD‐GAN to augment FC data for MDD classification. Each model was trained by discarding one loss component of the full GR‐SPD‐GAN model: (i) SPD‐GAN (w/o ) with only the adversarial loss ; (ii) SPD‐GAN (w/ ) with added reconstruction losses on manifold and tangent space, (iii) GR‐SPD‐GAN with , as well as the population graph regularizers on manifold and tangent space. Table 5 shows the classification results of using augmented FC data generated from these SPD‐GANs. The SPD‐GAN models trained with and without the reconstruction losses yield comparable results. However, adding the population graph regularizers to guide the GR‐SPD‐GAN training leads to noticeable improvement in classification performance. This suggests the usefulness of this loss incorporating the inter‐subject relationships to generate more realistic FC data, that can further enhance FC classification when used for data augmentation.
6. Conclusion
We introduced a new manifold‐aware deep generative model, GR‐SPD‐GAN, for SPD‐matrix valued data generation by exploiting the unique manifold geometry of SPD matrices in the adversarial training. It uses a generalization of distribution distance in Wasserstein GAN for manifold‐valued data. We demonstrated its usefulness for synthesizing correlation matrices of neuroimaging data, a common representation of functional brain networks that can preserve its SPD structure, and thus takes into account the inter‐related nature of all FC edges as a whole. By devising a conditional mechanism in this GAN that uses class labels to supervise generation, it allows us to generate FC samples residing on the SPD manifold and according to different classes of brain networks (with brain disorder or healthy control). We incorporated additional population graph‐based regularization terms that can improve the GAN training by forcing the generator to respect the inter‐subject similarity of the FC structure in the real data in order to generate high quality data. Qualitative and quantitative results on a MDD fMRI dataset show the superiority of our method in generating more realistic FC samples, significantly outperforming state‐of‐the‐art GAN‐based generators in terms of geometric score. When applied to augment FC data for connectome‐based MDD identification, our method also provided the largest improvements in classification performance. To conclude, the proposed GR‐SPD‐GAN approach is the first to show the advantages of FC data generation on the SPD space rather than Euclidean geometry, which can find a wide range of other applications in FC analyses besides data augmentation for brain disorder classification.
In addition to FC synthesis, our method can be extended to generate diffusion tensors in diffusion tensor imaging, which captures the directionality and strength of water molecule diffusion in the brain and enables the reconstruction of white matter tracts for structural connectivity. Diffusion tensors are three‐by‐three symmetric positive definite matrices that lie on the SPD manifold. By applying spectral decomposition and adapting the logarithm and exponential maps used for two‐dimensional SPD matrices, our model can handle tensor‐valued data. Our method is also applicable to generating synthetic streamlines that approximate white matter tracts. Streamlines can be modeled as three‐dimensional curves mapped to points on the hypersphere manifold. Since the hypersphere is a Riemannian manifold, the geometry of its curved space is described by a Riemannian metric. Nevertheless, our current framework focuses on generating static brain networks. Future work could explore extensions for generating dynamic FC (Hutchison et al. 2013; Ting et al. 2021, 2022), potentially by mapping the dynamic FC sequences as trajectories on the space of SPD matrices, as in Dai et al. (2020).
Supporting information
Data S1: Supporting information.
Acknowledgments
This work was supported by the Monash University Malaysia and the Ministry of Higher Education, Malaysia under Fundamental Research Grant Scheme FRGS/1/2023/ICT02/MUSM/02/1, King Abdullah University of Science and Technology under Grant CRG11‐5025.3. Open access publishing facilitated by Monash University, as part of the Wiley ‐ Monash University agreement via the Council of Australian University Librarians.
Tan, Y.‐F. , Noman F., Phan R. C.‐W., Ombao H., and Ting C.‐M.. 2025. “Graph‐Regularized Manifold‐Aware Conditional Wasserstein GAN for Brain Functional Connectivity Generation.” Human Brain Mapping 46, no. 12: e70322. 10.1002/hbm.70322.
Funding: This work was supported by Ministry of Higher Education, Malaysia, FRGS/1/2023/ICT02/MUSM/02/1; Monash University Malaysia. King Abdullah University of Science and Technology under Grant CRG11‐5025.3.
Data Availability Statement
The data that support the findings of this study are openly available in REST‐meta‐MDD at https://rfmri.org/REST‐meta‐MDD.
References
- Abadi, M. , Barham P., Chen J., et al. 2016. “TensorFlow: A System for Large‐Scale Machine Learning.” USENIX Symposium on Operating Systems Design and Implementation. 265–283.
- Akiba, T. , Sano S., Yanase T., Ohta T., and Koyama M.. 2019. “Optuna: A Next‐Generation Hyperparameter Optimization Framework.” Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). 2623–2631.
- Antoniou, A. , Storkey A., and Edwards H.. 2017. “Data Augmentation Generative Adversarial Networks.” arXiv preprint arXiv:1711.04340.
- Arjovsky, M. , Chintala S., and Bottou L.. 2017. “Wasserstein Generative Adversarial Networks.” International Conference on Machine Learning. 214–223.
- Barile, B. , Marzullo A., Stamile C., Durand‐Dubief F., and Sappey‐Marinier D.. 2021. “Data Augmentation Using Generative Adversarial Neural Networks on Brain Structural Connectivity in Multiple Sclerosis.” Computer Methods and Programs in Biomedicine 206: 106113. [DOI] [PubMed] [Google Scholar]
- Cassidy, B. , Rae C., and Solo V.. 2014. “Brain Activity: Connectivity, Sparsity, and Mutual Information.” IEEE Transactions on Medical Imaging 34: 846–860. [DOI] [PubMed] [Google Scholar]
- Craddock, R. C. , James G. A., Holtzheimer P. E. III, Hu X. P., and Mayberg H. S.. 2012. “A Whole Brain Fmri Atlas Generated via Spatially Constrained Spectral Clustering.” Human Brain Mapping 33: 1914–1928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai, M. , Zhang Z., and Srivastava A.. 2019. “Analyzing Dynamical Brain Functional Connectivity as Trajectories on Space of Covariance Matrices.” IEEE Transactions on Medical Imaging 39: 611–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai, M. , Zhang Z., and Srivastava A.. 2020. “Analyzing Dynamical Brain Functional Connectivity as Trajectories on Space of Covariance Matrices.” IEEE Transactions on Medical Imaging 39: 611–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dan, T. , Huang Z., Cai H., Laurienti P. J., and Wu G.. 2022. “Learning Brain Dynamics of Evolving Manifold Functional MRI Data Using Geometric‐Attention Neural Network.” IEEE Transactions on Medical Imaging 41: 2752–2763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deligianni, F. , Varoquaux G., Thirion B., et al. 2011. “A Probabilistic Framework to Infer Brain Functional Connectivity From Anatomical Connections.” Biennial International conference on Information Processing in Medical Imaging. 296–307. [DOI] [PubMed]
- Du, Y. , Fu Z., and Calhoun V. D.. 2018. “Classification and Prediction of Brain Disorders Using Functional Connectivity: Promising but Challenging.” Frontiers in Neuroscience 12: 525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finn, E. S. , Shen X., Scheinost D., et al. 2015. “Functional Connectome Fingerprinting: Identifying Individuals Using Patterns of Brain Connectivity.” Nature Neuroscience 18: 1664–1671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiori, S. 2009. “Learning the Fr'echet Mean Over the Manifold of Symmetric Positive‐Definite Matrices.” Cognitive Computation 1: 279–291. [Google Scholar]
- Frid‐Adar, M. , Diamant I., Klang E., Amitai M., Goldberger J., and Greenspan H.. 2018. “GAN‐Based Synthetic Medical Image Augmentation for Increased CNN Performance in Liver Lesion Classification.” Neurocomputing 321: 321–331. [Google Scholar]
- Goodfellow, I. , Pouget‐Abadie J., Mirza M., et al. 2014. “Generative Adversarial Nets.” Advances in Neural Information Processing Systems 27: 2672–2680. [Google Scholar]
- Greicius, M. 2008. “Resting‐State Functional Connectivity in Neuropsychiatric Disorders.” Current Opinion in Neurology 21: 424–430. [DOI] [PubMed] [Google Scholar]
- Gulrajani, I. , Ahmed F., Arjovsky M., Dumoulin V., and Courville A. C.. 2017. “Improved Training of Wasserstein Gans.” Advances in Neural Information Processing Systems 30: 5769–5779. [Google Scholar]
- Heinsfeld, A. S. , Franco A. R., Craddock R. C., Buchweitz A., and Meneguzzi F.. 2018. “Identification of Autism Spectrum Disorder Using Deep Learning and the ABIDE Dataset.” NeuroImage: Clinical 17: 16–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ho, J. , Jain A., and Abbeel P.. 2020. “Denoising Diffusion Probabilistic Models.” Advances in Neural Information Processing Systems 33: 6840–6851. [Google Scholar]
- Huang, Z. , Wu J., and Van Gool L.. 2019. “Manifold‐Valued Image Generation With Wasserstein Generative Adversarial Nets.” Proceedings of the AAAI Conference on Artificial Intelligence 33: 3886–3893. [Google Scholar]
- Hutchison, R. M. , Womelsdorf T., Allen E. A., et al. 2013. “Dynamic Functional Connectivity: Promise, Issues, and Interpretations.” NeuroImage 80: 360–378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kam, T.‐E. , Zhang H., Jiao Z., and Shen D.. 2019. “Deep Learning of Static and Dynamic Brain Functional Networks for Early MCI Detection.” IEEE Transactions on Medical Imaging 39: 478–487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karcher, H. 1977. “Riemannian Center of Mass and Mollifier Smoothing.” Communications on Pure and Applied Mathematics 30: 509–541. [Google Scholar]
- Kawahara, J. , Brown C. J., Miller S. P., et al. 2017. “BrainNetCNN: Convolutional Neural Networks for Brain Networks; Towards Predicting Neurodevelopment.” NeuroImage 146: 1038–1049. [DOI] [PubMed] [Google Scholar]
- Khrulkov, V. , and Oseledets I.. 2018. “Geometry Score: A Method for Comparing Generative Adversarial Networks.” International Conference on Machine Learning. 2621–2629.
- Kingma, D. P. , and Ba J.. 2014. “Adam: A Method for Stochastic Optimization.” arXiv preprint arXiv:1412.6980.
- Li, C. , Wei Y., Chen X., and Sch¨onlieb C.‐B.. 2021. “BrainNetGAN: Data Augmentation of Brain Connectivity Using Generative Adversarial Network for Dementia Classification.” In Deep Generative Models, Data Augmentation, Labelling, Imperfections, 103–111. Springer International Publishing. [Google Scholar]
- Li, Q. , Kailkhura B., Anirudh R., et al. 2021. “Mr‐Gan: Manifold Regularized Generative Adversarial Networks for Scientific Data.” SIAM Journal on Mathematics of Data Science 3: 1197–1222. [Google Scholar]
- Meyer, M. I. , de la Rosa E., de Barros N. P., et al. 2021. “A Contrast Augmentation Approach to Improve Multi‐Scanner Generalization in MRI.” Frontiers in Neuroscience 1048: 708196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minh, H. Q. , and Murino V.. 2017. “Covariances in Computer Vision and Machine Learning.” In Synthesis Lectures on Computer Vision, vol. 7, 1–170. Morgan & Claypool Publishers. [Google Scholar]
- Nguyen, K. P. , Fatt C. C., Treacher A., et al. 2020. “Anatomically Informed Data Augmentation for Functional MRI With Applications to Deep Learning.” Proceedings of SPIE—The International Society for Optical Engineering 11313: 172–177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otberdout, N. , Daoudi M., Kacem A., Ballihi L., and Berretti S.. 2020. “Dynamic Facial Expression Generation on Hilbert Hypersphere With Conditional Wasserstein Generative Adversarial Nets.” IEEE Transactions on Pattern Analysis and Machine Intelligence 44, no. 2: 848–863. [DOI] [PubMed] [Google Scholar]
- Pereira, F. , Mitchell T., and Botvinick M.. 2009. “Machine Learning Classifiers and fMRI: A Tutorial Overview.” NeuroImage 45: S199–S209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qiu, A. , Lee A., Tan M., and Chung M. K.. 2015. “Manifold Learning on Brain Functional Networks in Aging.” Medical Image Analysis 20: 52–60. [DOI] [PubMed] [Google Scholar]
- Radford, A. , Metz L., and Chintala S.. 2015. “Unsupervised Representation Learning With Deep Convolutional Generative Adversarial Networks.” arXiv preprint arXiv:1511.06434.
- Suk, H.‐I. , Lee S.‐W., Shen D., et al. 2014. “Hierarchical Feature Representation and Multimodal Fusion With Deep Learning for AD/MCI Diagnosis.” NeuroImage 101: 569–582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tian, Y. , and Zalesky A.. 2021. “Machine Learning Prediction of Cognition From Functional Connectivity: Are Feature Weights Reliable?” NeuroImage 245: 118648. [DOI] [PubMed] [Google Scholar]
- Ting, C.‐M. , Ombao H., Salleh S.‐H., and Abd Latif A. Z.. 2018. “Multi‐Scale Factor Analysis of High‐Dimensional Functional Connectivity in Brain Networks.” IEEE Transactions on Network Science and Engineering 7: 449–465. [Google Scholar]
- Ting, C.‐M. , Samdin S. B., Tang M., and Ombao H.. 2021. “Detecting Dynamic Community Structure in Functional Brain Networks Across Individuals: A Multilayer Approach.” IEEE Transactions on Medical Imaging 40: 468–480. [DOI] [PubMed] [Google Scholar]
- Ting, C.‐M. , Skipper J. I., Noman F., Small S. L., and Ombao H.. 2022. “Separating Stimulus‐Induced and Background Components of Dynamic Functional Connectivity in Naturalistic fMRI.” IEEE Transactions on Medical Imaging 41: 1431–1442. [DOI] [PubMed] [Google Scholar]
- Varoquaux, G. , Baronnet F., Kleinschmidt A., Fillard P., and Thirion B.. 2010. “Detection of Brain Functional‐Connectivity Difference in Post‐Stroke Patients Using Group‐Level Covariance Modeling.” The International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI). 200–208. [DOI] [PubMed]
- Yan, C. , and Zang Y.. 2010. “DPARSF: A Matlab Toolbox for Pipeline Data Analysis of Resting‐State fMRI.” Frontiers in Systems Neuroscience 4: 1377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yan, C.‐G. , Chen X., Li L., et al. 2019. “Reduced Default Mode Network Functional Connectivity in Patients With Recurrent Major Depressive Disorder.” Proceedings of the National Academy of Sciences of The United States of America 116: 9078–9083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao, Q. , and Lu H.. 2019. “Brain Functional Connectivity Augmentation Method for Mental Disease Classification With Generative Adversarial Network.” In Chinese Conference on Pattern Recognition and Computer Vision, 444–455. Springer International Publishing. [Google Scholar]
- You, K. , and Park H.‐J.. 2021. “Re‐Visiting Riemannian Geometry of Symmetric Positive Definite Matrices for the Analysis of Functional Connectivity.” NeuroImage 225: 117464. [DOI] [PubMed] [Google Scholar]
- Yuan, X. , and Qiao Y.. 2024. “Diffusion‐TS: Interpretable Diffusion for General Time Series Generation.” The Twelfth International Conference on Learning Representations. https://openreview.net/forum?id=4h1apFjO99.
- Zhang, J. , Kucyi A., Raya J., et al. 2021. “What Have We Really Learned From Functional Connectivity in Clinical Populations?” NeuroImage 242: 118466. [DOI] [PubMed] [Google Scholar]
- Zhao, H. , Lou H., Yao L., and Zhang Y.. 2025. “Diffusion Transformer‐Augmented Fmri Functional Connectivity for Enhanced Autism Spectrum Disorder Diagnosis.” Journal of Neural Engineering 22: 016044. [DOI] [PubMed] [Google Scholar]
- Zhou, T. , Chen X., Shen Y., et al. 2023. “Generative Ai Enables Eeg Data Augmentation for Alzheimer's Disease Detection via Diffusion Model.” In 2023 IEEE International Symposium on Product Compliance Engineering‐Asia (ISPCE‐ASIA), 1–6. IEEE. [Google Scholar]
- Zhuang, P. , Schwing A. G., and Koyejo O.. 2019. “FMRI Data Augmentation via Synthesis.” International Symposium on Biomedical Imaging: 1783–1787. [Google Scholar]
- Zong, Y. , Zuo Q., Ng M. K.‐P., Lei B., and Wang S.. 2024. “A New Brain Network Construction Paradigm for Brain Disorder via Diffusion‐Based Graph Contrastive Learning.” IEEE Transactions on Pattern Analysis and Machine Intelligence 46: 10389–10403. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data S1: Supporting information.
Data Availability Statement
The data that support the findings of this study are openly available in REST‐meta‐MDD at https://rfmri.org/REST‐meta‐MDD.

