Skip to main content
Health Information Science and Systems logoLink to Health Information Science and Systems
. 2025 Oct 28;13(1):70. doi: 10.1007/s13755-025-00388-w

Reconstructing brain causal dynamics for subject and task fingerprints using fMRI time-series data

Dachuan Song 1, Li Shen 2, Duy Duong-Tran 2, Xuan Wang 1,
PMCID: PMC12569264  PMID: 41170153

Abstract

Purpose

Recently, there has been a revived interest in system neuroscience causation models, driven by their unique capability to unravel complex relationships in multi-scale brain networks. In this paper, we present a novel method that leverages causal dynamics to achieve effective fMRI-based subject and task fingerprinting.

Methods

By applying an implicit-explicit discretization scheme, we develop a two-timescale linear state-space model. Through data-driven identification of its parameters, the model captures causal signatures, including directed interactions among brain regions from a spatial perspective, and disentangled fast and slow dynamic modes of brain activity from a temporal perspective. These causal signatures are then integrated with: (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification. Furthermore, we introduce the concept of the brain reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.

Results

We evaluate the proposed approach using the Human Connectome Project dataset and demonstrate its advantage over non-causality-based methods. The obtained causal signatures are visualized and demonstrate clear biological relevance with established understandings of brain function.

Conclusion

We verified the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Additionally, our work paves the way for further studies on causal fingerprints with potential applications in both healthy controls and neurodegenerative diseases.

Keywords: FMRI fingerprinting, Brain causal dynamics, Two-timescale state-space model, Reachability analysis

Introduction

Advancements in brain imaging technologies [1], especially Functional Magnetic Resonance Imaging (fMRI) of Blood-Oxygen-Level-Dependent (BOLD) signals, have enabled precise and quantitative measures of brain activity, leading to a significant boost in neuroscience research [2]. Among various brain modeling approaches, causation models have recently received a revamped interest [3, 4] in recent years, due to their capability to reveal complex relationships without multi-scale brain networks. These models have been extended in multiple directions to understand the brain’s underlying processes [5], explaining how these processes support cognition and behavior [68], and propose new methods to regulate brain activity [9, 10] for enhancing normal functions and mitigating disorders [11]. However, most fingerprinting pipelines still rely on static, symmetric connectivity maps and thus miss the directed, multi-timescale dynamics that underlie neural communication.

Inspired by these developments, this paper explores the use of causal dynamic models to extract informative features from the brain for fMRI fingerprinting. Specifically, we aim to leverage causal signatures derived from fMRI time-series data to identify both the individual subject (subject fingerprinting) and the cognitive task being performed (task fingerprinting). A major challenge in this process arises from the variability across subjects and tasks, along with the limited number of recordings available for each subject-task combination [12]. In other words, the problem combines heterogeneity (across subjects/tasks) with data sparsity, which calls for models that are both interpretable and sample-efficient. Our approach focuses on uncovering and utilizing latent causal signatures in the brain. While this requires more specialized model structures, it can offer improved data efficiency. This distinguishes our method from traditional correlation-based approaches [7, 10, 1319], which are limited in their ability to capture the directional, evolving cause-and-effect relationships that are fundamental to cognitive dynamics.

Statement of contribution: In this paper, our result provides a solid ‘YES’ answer to the feasibility and effectiveness of utilizing brain causal signatures for subject and task fingerprinting. Our main contributions are summarized as follows:

  • Introducing a two-timescale linear state-space model, which (i) captures directed interactions among brain regions from a spatial perspective, and (ii) disentangles fast and slow dynamic modes of brain activity from a temporal perspective. Model parameters are identified using a data-driven, implicit-explicit discretization scheme.

  • Integrating the causal signatures with (i) a modal decomposition and projection method for model-based subject identification, and (ii) a Graph Neural Network (GNN) framework for learning-based task classification.

  • Leveraging the control-theoretic interpretation of the causal-based model to introduce the concept of Reachability landscape as a novel visualization tool, which quantitatively characterizes the maximum possible activation levels of brain regions under various fMRI tasks.

Extensive experiments and comparative analyses are conducted to validate the proposed approach. This research also lays the groundwork for future exploration of causal fingerprinting in both healthy individuals and those affected by neurodegenerative disorders.

Related work

The concept of an fMRI “fingerprint” [7, 10, 1317, 2022] can be considered as a classification problem (identifying which subject or task corresponds to a given fMRI sample). In conventional applications, data richness has a major impact on classification accuracy. However, in data-scarce settings, model structure and algorithmic interpretability become critical for maintaining robust performance. For example, in visual person re-identification (re-ID) tasks, incorporating auxiliary features such as gender, color, and texture can significantly improve identification accuracy [23]. However, a similar idea is not directly applicable to fMRI-based applications, as raw brain signals are inherently difficult to interpret or to translate into distinguishable features.

Static connectivity models

In neuroscience research, one strategy to address this challenge is to use the functional connectome (FC) [12]. The FC measures the correlations of the fMRI time-series data among pair-wise brain regions during rest or task conditions, and it has been successfully utilized as individual-specific neural signatures to classify various cognitive states and neurological disorders [18, 24]. However, FC is static and undirected, thus missing causal directionality. Another common brain-network modeling technique is Independent Component Analysis (ICA). This approach separates fMRI data into spatially distinct, statistically independent components corresponding to different brain networks [25, 26]. It has proven especially useful for revealing resting-state networks and explaining their roles in cognitive function and pathology [2729]. Although FC and ICA have achieved notable success in many applications, they still have several limitations. In particular, they do not fully capture the dynamic nature of brain activity, as they lack the temporal resolution needed to detail interactions among regions, and cannot represent the directionality of connections between regions [14, 16, 24, 30]. Each of these aspects, dynamics, temporal detail, and directionality, is essential for understanding the cause-and-effect interactions within brain networks.

Causal connectivity models

In contrast to static correlational methods, causality-based approaches such as Dynamic Causal Modeling (DCM) and Granger causality offer alternative insights into patterns of brain interactions. However, these directed connectivity methods have been relatively under-explored in the context of fMRI fingerprinting. Analyzing causality in fMRI entails examining directional influences among brain regions over time, which provides a window into the brain’s dynamic modes of interaction. For example, DCM uses a Bayesian framework to model interactions among latent neural states with bilinear differential equations, which describes how activity in one region influences others over time [31, 32]. Similarly, Granger causality evaluates whether one time series can predict another, thereby identifying potential directed relationships between neural signals [3335]. Yet such directed methods remain under-explored for fMRI fingerprinting. Most Dynamic Causal Modeling (DCM) or Granger-causality studies restrict the analysis to a small number of regions of interest (ROIs) and assume multiple sessions per subject. Consequently, it is unclear how well these approaches scale to whole-cortex parcellations with few runs per subject, or to joint subject and task fingerprinting. To further enhance these causal models and better capture complex neural dynamics, recent research has introduced several modifications. For instance, threshold-based structures have been incorporated to reflect the saturation of neural activation [3638], and multifactorial dynamics have been proposed to represent systems that operate across multiple time scales [39].

Learning-based approaches

Beyond these correlation-based or causality-based methodologies, classical machine-learning algorithms also remain popular. Random Forest (RF) [40] and Support Vector Machines (SVM) [41] provide high interpretability and easy deployment across diverse datasets. Recent studies on fMRI fingerprinting have also explored a variety of advanced techniques to capture different facets of brain dynamics and connectivity patterns. These include deep-learning architectures. Deep Neural Networks (DNN) capture high-dimensional connectivity patterns [42]. Deep Bidirectional Recurrent Neural Networks (DBRNN) model temporal dynamics in brain signals [43]. Brain Attend and Decode (BAnD) applies attention mechanisms to highlight salient brain regions [44]. A recent variant, TCN–BiLSTM, combines dilated Temporal Convolutional Networks (TCNs) with bidirectional Long Short-Term Memory (BiLSTM) layers to model short- and long-range dependencies in fMRI sequences [45]. However, it still operates on undirected regional time series and lacks an explicit multi-timescale causal framework, which limits its ability to characterize directed network interactions. Geometry- and topology-based pipelines have also been proposed. Tangent-Space Fingerprinting (TSF) maps functional-connectivity matrices to the Riemannian tangent space of the symmetric positive-definite (SPD) manifold, capturing global correlation geometry while still treating connectivity as static and undirected [46]. The PH-H0 landscape describes networks via zero-dimensional persistent homology (H0), providing a noise-robust topological summary yet discarding temporal scale and edge direction [47]. Classical classifiers such as Random Forests and Support Vector Machines are trained on pre-specified connectivity vectors. Without directionality or temporal context, their accuracy soon plateaus and lags behind causal models [40, 41]. End-to-end deep networks (DNN, DBRNN, BAnD) can harvest richer patterns, yet they require large training sets and remain hard to interpret. Geometry-based TSF and topology-based PH-H0 offer elegant dimensionality reduction, but both treat connectivity as static and undirected [47]. Across these lines of work, directed interactions and the multiple time scales that shape cognition are largely ignored. Our method fills this gap: a single scan yields compact causal signatures that encode directionality and tempo, and a lightweight GNN turns them into accurate fingerprints while preserving biophysical meaning.

Methods: causal fingerprint

In this paper, the concept of causal fingerprint is defined as follows: (later referred to as fingerprint, for simplicity.)

Definition 1

Causal fingerprint is the degree to which a subject or an fMRI task can be identified from a labeled database, particularly based on the subjects’ or fMRI tasks’ unique cause-and-effect cognitive signatures.

To summarize our method, we introduce an implicit-explicit discretization method that yields a two-timescale state-space model to capture causal signatures. Then we combine the causal signatures with a modal decomposition and projection method for subject fingerprinting and with a GNN model for task fingerprinting. Finally, building on the state-space model, we propose a new visualization tool that characterizes the reachability of the brain state, which quantitatively represents the maximally possible excitation level for different brain regions.

Notations. Throughout this section, suppose we are given an fMRI data set with S representing the set of all subjects, and H representing the set of all tasks performed by the subjects. Let ds,hRp×T represent the time-series recording of a subject sS performing an fMRI task hH. The dimension p is the number of brain parcellations and T is the testing duration. Let ds,h(k)Rp×T represent the kth column of ds,h. Let vec(·) represent the vectorization (stacking all columns) of a matrix into a single-column vector.

A two-timescale state-space model for causal dynamics

To capture the variability of subjects performing different fMRI tasks, we propose a two-time-scale linear state-space model that characterizes the brain’s activity both spatially and temporally. Spatially, the model captures directed interactions among brain regions, identifying which areas causally influence others. Temporally, it disentangles brain dynamics across two time-scales, characterizing both rapid neural signaling and slower hemodynamic responses.

Although fMRI only records data at a low sampling rate, the brain operates on multiple timescales [48]: fast neuronal activities such as synaptic firing and local synchrony occur on the order of milliseconds; while slow hemodynamic responses unfold over several seconds. As a result, the observed fMRI time-series data (ds,h) represents an aggregation of these intertwined processes, making it challenging to reconstruct the parameters of the underlying dynamics. To address this, we propose an implicit-explicit discretization method that partitions each sampling period into a fast and a slow sub-interval. The fast dynamics capture concurrent interactions that reflect rapid neuronal activity, while the slow dynamics capture hemodynamic responses and long-timescale network interactions among brain regions.

In the following, we first present the proposed two-time-scale model, then we validate the model by showing how it is mathematically derived and how the model parameters can be obtained through a data-driven method. Since this modeling framework applies generically to any subject and any task, we omit the subscripts sh for clarity in what follows.

Consider fMRI data with p brain parcellations. We divide the parcellations into two subsets: system states x(k)Rm and inputs u(k)Rn, with m+n=p. Each entry of x(k) or u(k) represents the activity of a specific brain region at discrete time k. We assume that kZ indexes successive fMRI scans, typically spaced Δt seconds apart (e.g. Δt=0.72s for the dataset used in this paper). Our aim is to construct a two-timescale state-space model of the following form:

x(k)=Qx(k)+Ax(k-1)+B1u(k)+B2u(k-1). 1

Here, QRm×m captures the fast sub-interval (concurrent) interaction among the neuronal states; ARm×m encodes the slow sub-interval (cross-lagged) transition from x(k-1); B1u(k) and B2u(k-1) incorporate the inputs into the system for fast and slow dynamics, respectively.

Implicit-Explicit discretization: To systematically derive the two-timescale discrete model (1), we divide each fMRI sampling period into slow and fast sub-intervals, as illustrated in Fig. 1. Consider a continuous-time dynamical system with both slow and fast modes:

ddtx(t)=Fsx(t)+Gsu(t)+1ϵ(Ffx(t)+Gfu(t)), 2

where 0<ϵ1 indicates the separation of timescales. To discretize the system, we divide the full sampling interval Δt into a slow component Δt-τ and a fast component τ, such that the ratio of their lengths satisfies:1Δt-ττ=1ϵ.

Fig. 1.

Fig. 1

Two timescale partition of a single fMRI sampling period [(k-1)Δt,kΔt]. The interval is split into a fast sub-interval of duration τ (short solid red) and a slow sub-interval of duration Δt-τ (long solid red), with 0<τΔt. Driven by the slower input u(k-1) and the current input u(k), the state evolves from the previous scan x(k-1), passes the intermediate point z(k)=x(k-τ), and reaches the current scan x(k)

The slow dynamics are approximated using an explicit Euler approximation from x(k-1) to an intermediate state z(k):

z(k)-x(k-1)Δt-τ=[Fsx(k-1)+Gsu(t-1)]. 3

which yields

z(k)=x(k-1)+(Δt-τ)Fsx(k-1)+Gsu(k-1) 4

The fast dynamics are then modeled as an update over the interval τ, which starts from z(k) and evolves to x(k):

x(k)-z(k)τ=[Ffx(k)+Gfu(k)]. 5

which yields:

x(k)=τ[Ffx(k)+Gfu(k)]+z(k). 6

Note that the intermediate state z(k) is not directly observable in the fMRI time series, which only samples the system at the slower rate Δt. Thus, we introduce an implicit Euler method in (5) that uses x(k) and u(k) to approximate the update. This allows us to cancel out the intermediate state z(k) and use the one-time scale fMRI time-series data to reconstruct the parameters of the two-time-scale systems. Specifically, by incorporating (4) and (6), one has

x(k)=τFfx(k)+[I+(Δt-τ)Fs]x(t-1)+τGfu(k)+(Δt-τ)Gsu(k-1). 7

Now, define

QτFf,AI+(Δt-τ)Fs,B1τGf,B2(Δt-τ)Gs.

The update equation aligns with model (1).

Data-driven reconstruction of model parameters: Given fMRI data {x(k),u(k)} for k=0,,T, each time k1 should satisfy equation (1). Define the data matrices X1:T=x(1)x(2)x(T) and U0:T-1=u(0)u(1)u(T-1). A compact representation of the dynamics follows

X1:T=QX1:T+AX0:T-1+B1U0:T-1+B2U1:T

To determine the model parameters in the presence of model imperfections and measurement noise, we formulate the generalized least squares problem:

argminQ,A,B1,B2λ(QF+AF+B1F+B2F)+(Q-I)X1:T+AX0:T-1+B1U0:T-1+B2U1:TFsubject toQii=0,im 8

Here Q,ARm×m, B1,B2Rm×n, X1:T,X0:T-1Rm×T, and U0:T-1,U1:TRn×T, where λ>0 controls the strength of the regularization to prevent over-fitting, and ·F denotes the Frobenius norm (for any matrix MRp×q, MF=i=1pj=1qMij2) [50]. The residual term (Q-I)X1:T+AX0:T-1+B1U0:T-1+B2U1:TF is the Frobenius norm of an m×T matrix, which aims to extract system parameters as causal signatures by minimizing the difference between the system’s measured next time state with the next time state predicted by our two-time scale model. The QF, AF, B1F and B2F regularization terms to prevent over-fitting. By solving (8), we transform a complex fMRI time-series data d(t)={x(t),u(t)} into a structured and meaningful representation R=[QAB1B2]Rm×(2m+2n) of brain dynamics signature, as visualized in Fig. 2. This transformation not only makes the data more accessible for subsequent analysis but also reveals the patterns of ‘spatio-temporal’ causality relationships that underpin cognitive processes. In the following, we will build on R to perform subject and task fingerprints.

Fig. 2.

Fig. 2

a Brain activity (time-series data) captured using Schaefer Parcellation. b Causal dynamics modeling and data-driven parameter identification

Remark 1

(Model choice:) In contrast to traditional functional connectome (FC) representations used for brain fingerprinting, the proposed causal signature offers two key advancements: it captures directional interactions (i.e., which region inhibits or excites another) and temporal dependencies (i.e., how earlier neural activities influence subsequent ones across two time-scales). In comparison, the FC method only captures undirected and concurrent relationships between the activities of pairwise brain regions. From a modeling perspective, we note that this work adopts a linear state-space model (1), which represents a relatively simple form of causal modeling compared to the broader literature that incorporates more complex, nonlinear structures—such as bilinear dynamics [31], threshold dynamics [36], and multifactorial time-varying dynamics [39]. However, we argue that the two-timescale structure enabled by implicit-explicit discretization already encodes sufficiently rich features. The use of a simpler model (linear v.s. nonlinear) does not diminish the results of the paper; instead, it provides compelling evidence for the utility of cause-and-effect signatures in fingerprinting applications. Moreover, simpler models generally have mild requirements on data richness. As will be demonstrated in experiments, the proposed approach achieves accuracy comparable to more complex, non-causal methods (e.g., FC and end-to-end learning), even when using low-resolution fMRI data (Schaefer-100 parcellation).

Subject causal fingerprint via modal decomposition and projection

Based on the model (1) and the parameter reconstruction (8), we can obtain a representation Rs,h=[QAB1B2]s,h for any subject-task pair that extracts its spatial and temporal signatures across two time-scales. In the following, we incorporate these signatures with a state-space modal decomposition [51] and projection method to perform subject fingerprinting.

For each fMRI task hH, we construct a labeled reference set:

R^h={Rs,hD},sS.

where the superscript D denotes data samples with known subject identities. We assume that the dataset includes at least two recordings for every subject-task pair. This allows one sample to be used in the reference set R^h, while the remaining samples serve as query instances for evaluation. Given a query sample with causal signature RhQ of an unknown identity, the objective is to identify the subject label s by comparing it against the labeled reference set Rs,hD.

The key challenge for subject causal fingerprinting lies in the limited data for each subject, as the fMRI data collection is time-consuming. Essentially, we need to solve a one-shot classification problem [23], i.e., only one sample per subject is available to determine their identity from a large number of candidates. To address this, we leverage the control-theoretic interpretation of the model (1), in which the causal system signature R=[QAB1B2] naturally encodes the system’s dynamic modes [51, Chapter 12], reflecting invariant subspaces of brain dynamics. These dynamic modes serve as augmented features for robust subject identification.

We extract dynamic modes through eigendecomposition and coordinate transformation by applying them, respectively, to the fast and slow timescales. For example, to analyze the slow dynamic mode, we treat the system as

x(k)=Ax(k-1)+B2u(k-1)+o(k). 9

where o(k)=Qx(k)+B1u(k) is considered as a perturbation term. To perform modal decomposition, we express A=T^-1Λ^T^, where Λ^ is the diagonal matrix (or a Jordan matrix if not diagonalizable [52]) corresponding to the eignevalues of A.

Left multiplying T^-1 to (9) yields

x^(t)=Λ^x^(t-1)+B^2u(t-1)+o^(k), 10

where x^(t)=T^-1x(t), B^=T^-1B, and o^(k)=T^-1o(k). Since Λ^ is diagonal, each state in (10) evolves independently along its corresponding dynamic mode embedded in T^. Similarly, for the fast dynamics, we compute Λ¯ and T¯ such that Q=T¯-1Λ¯T¯. These capture the concurrent interactions among brain regions by treating the slow dynamics as perturbations.

We use transformation matrices T^ and T¯ as augmented features for subjects to perform fingerprinting, since they encode the dynamic modes of (1). Specifically, we determine the identity of a query sample by minimizing an alignment-based distance:

argminsDist(TRhQ,TRs,hD) 11

where T is a feature set composing [T^,T¯]Rm×2m, or a subset of these features, depending on which features best capture inter-subject variation. A detailed discussion on this will be given in the Results section. The distance metric Dist(·,·) is computed as the permutation-aligned pairwise vector distance, which compares two sets of vectors, where the elements are unordered. The key idea is to compute the cosine distance between elements in the two sets after finding an optimal permutation alignment that minimize overall distances, specifically, if defining TRhQ=[x1,,x2m],TRs,hD=[y1,,y2m],

Dist(TRhQ,TRs,hD)=minπZi=1n1-xiyπ(i)xiyπ(i)

where Z denotes the set of all permutations of {1,2,,2m}. Note that the eigenvalue matrices Λ¯ and Λ^ quantify the significance of each mode in the dynamics. We, however, do not incorporate them directly in the fingerprinting task. Instead, our method treats all dynamic modes as equally informative, regardless of their eigenvalues. This contrasts with dimensionality-reduction methods like Principal Component Analysis (PCA), which retain only the most significant components. Our method, via (11), operates in the full space spanned by all dynamic modes, under the assumption that when different subjects perform the same fMRI task, their major dynamic modes might be similar but vary in certain minor dynamic modes.

Task causal fingerprint via graph neural network

Unlike subject fingerprint, where the primary challenge arises from the large number of subjects and the limited data per subject, the goal of task fingerprint is to identify a task from a finite set of candidates. In this case, the data is richer, as each task is performed by all participating subjects. Nevertheless, task fingerprinting still faces intrinsic challenges due to inter-subject variability in brain structure and functional performance. As a result, methods relying solely on predefined algebraic operators, as discussed in Sect. 3.2, are no longer sufficient.

In this subsection, we implement a causal-based task fingerprinting by leveraging the extracted signatures Rs,h from Sect. 3.1. These signatures provide a graph-theoretic representation of brain networks, which we incorporate into a Graph Neural Network (GNN)-based learning model. To explain our method, we construct a labeled database:

R^={Rs,hD},sS,andhH

where S and H are subject sets and fMRI task sets, respectively. Given a query RQ data with unknown identity and task, our goal is to determine the task index h of the query using the database R^.

A key advantage of Graph Neural Networks (GNNs) is their ability to model input data as graph structures [53]. This aligns naturally with our data representation, where the matrix R=[QAB1B2] encodes causal interactions between brain regions. Each row of R captures how a specific brain region is influenced by other regions and external inputs. We define the GNN model G(V,E), where V represents the set of nodes with |V|=m. The feature of each node is a vector in R(2m+2n), corresponding to each row of R. The normalized matrix A in R is used as the adjacency matrix to initialize the edge features E.

Building on this input structure, we employ a five-layer Graph Neural Network architecture tailored for task fingerprinting from fMRI data. The network starts with a GATv2Conv layer that re-weights each node’s neighbours, emphasising task-relevant connections [54] (input 200, out 32 features with 4 attention heads, dropout 0.30). A parallel linear skip path (output 128 channels) is summed with the convolution result so the original signal remains visible to deeper layers, improving gradient flow and protecting against over-smoothing [55]. TopKPooling then selectively retains the most informative nodes (ratio 0.8), reducing graph complexity and focusing computation on salient sub-structures. This sparsification step improves both efficiency and performance. A second GATv2Conv block, again accompanied by a skip connection, further refines the retained features and maps them into a more discriminative space for task recognition (input 128, out 32 features with 2 heads, dropout 0.30; 32×2=64 output channels; skip output 64 channels). Global-mean pooling aggregates these refined node features across the graph, producing a single 64-dimensional vector. A linear head of size 64×8, followed by log-softmax, converts this vector into task probabilities for training and evaluation. During training, we randomly drop 10% of edges to enhance robustness. The overall model structure is visualised in Fig. 3. The model is trained using supervised learning with data batches drawn from R^. The rows of Rs,hD serve as the input features for each node, while h is used as the corresponding fMRI task label. Detailed training parameters are provided in the Appendix.

Fig. 3.

Fig. 3

Architecture of the five-layer GNN used for task fingerprinting

New visualization: brain reachability landscape

In control theory, reachability refers to the ability of a control system to move from one state to another using admissible control inputs. In this subsection, we apply this concept to the causal dynamics in (1), which enables a novel visualization tool that characterizes the reachability of the brain state x(t). This generates a quantitative landscape of regional brain excitation level, which complements traditional connectivity measures that focus on the interaction patterns among brain regions.

To achieve this, we first rewrite the model (1) into an equivalent state evolution form:

x(t)=f(x(t-1),u(t),u(t-1))=A^x(t-1)+B^1u(t)+B^2u(t-1), 12

where

A^(I-Q)-1A,B^2(I-Q)-1B2,B^1(I-Q)-1B1.

Compared to the original model (1), which explicitly represents the fast-slow interactions in brain dynamics, the reformulated model (12) integrates fast interactions directly into the state evolution. This facilitates a direct interpretation of how the system state evolves over time in response to control inputs.

Given a terminal time TM, bounded energy on the control input sequence U0:TM-1=u(0)u(1)u(TM-1), and zero initial state, we define the fMRI reachability set as:

Bx(TM)s.t.x(t)=f(x(t-1),u(t),u(t-1)),U0:TM-121,x(0)=0

Here, the set B quantifies the maximum possible activation of each brain region under bounded input energy. To compute B, since x(TM) is a vector, one can optimize each of its entries individually (corresponding to specific brain regions). Given that the system dynamics f(·) are linear and the constraints are convex, this optimization can be efficiently solved using linear programming.

To visualize B, we map its values onto a two-dimensional grid corresponding to the cortical parcellations (using a 90-region atlas arranged in a 12×12 layout; see Fig. 4). Each region is colored according to its normalized reachability value, producing a reachability-based heatmap. An example of this visualization, using real-world fMRI data, is shown in Fig. 8, Sec. 5.2. Such heatmaps can be generated for each subject and each task condition to reveal the distinct spatial patterns of maximal reachability across brain regions. This offers a new way of brain network visualization, complementing standard correlation-based or connectivity-focused analyses that emphasize inter-regional interaction patterns.

Fig. 4.

Fig. 4

Three-tier illustration mapping the reachable-set activation surface (top) onto a flattened 12 × 12 cortical grid (middle) and, from there, onto its anatomical locations in a 3-D brain rendering (bottom). Colored tiles mark four representative regions

Fig. 8.

Fig. 8

Reachability-based heatmaps for one subject across eight cognitive tasks (Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem). Brighter colors (yellow) indicate stronger maximum neural activations, while darker tones (blue) reflect lower excitability. Red dashed circles highlight representative activation hotspots corresponding to critical brain regions known for each cognitive task. The black boundary outlines the edge of the flattened cortical map. The correspondence between this reachability representation and anatomical brain regions is illustrated in Fig. 4. These patterns highlight functional specialization differences across diverse cognitive tasks

Results

This section uses real-world datasets to verify the effectiveness of the proposed causal-based fingerprints for both subject and task fingerprints, as well as the reachability visualization.

Dataset and modeling

We utilize the Human Connectome Project (HCP) dataset [56], which provides fMRI time-series data for 391 unrelated subjects. Each subject participated in two resting-state fMRI sessions and seven distinct task-based fMRI sessions: emotional response (Emot), gambling (Gamb), language processing (Lang), motor function (Moto), relational processing (Rela), social cognition (Soci), and working memory (WMem). Each subject-task pair has two recordings, scanned in LR (left to right) or RL (right to left) patterns. All fMRI data were parcellated using the Schaefer-100 atlas [57], which partitions the cerebral cortex into p=100 distinct regions. The fMRI time series for each region (parcel) was recorded at 720 ms intervals. In line with Remark 1, although higher-resolution parcellations are available, the 100-region resolution used here is sufficient for our purposes. This level of granularity allows us to investigate how the brain’s causal signatures can distinguish individual subjects and identify specific fMRI tasks, and our results confirm that this resolution is adequate for capturing those distinguishing signatures. In accordance with our model (Equation (1)), we select n=10 out of the 100 brain regions as input nodes. These selected regions are located in the brain’s premotor and sensory cortices (see Fig. 5), which is consistent with known functional hubs identified in [58]. Specifically, the input set includes two parcels from each of the following cortical areas: prefrontal cortex, premotor cortex, somatosensory cortex, visual cortex, and auditory cortex. They were chosen due to their pivotal roles in various cognitive and sensory functions, including high-level task management, memory recall, spatial processing, somatosensory processing, visual perception, and auditory processing [58]. The remaining m=90 brain regions are treated as the system’s state in our model.

Fig. 5.

Fig. 5

a Functional areas of the brain. bd Ten Schaefer Parcellation Areas (blue dots) that are chosen as system inputs: two areas are associated with the prefrontal cortex; two areas are associated with the pre-motor cortex; two areas are associated with the somatosensory cortex; two areas are associated with the visual cortex; two areas are associated with the auditory cortex. We assume these are areas likely to ‘initiate’ certain brain activities

Specifically, we use the minimally pre-processed resting-state runs released by the UCSD Library [59, 60] based on the HCP Young Adult raw data [61]. These scans have already undergone the standard HCP pipeline [62], which includes gradient-distortion and motion correction, EPI distortion correction, alignment to MNI space, high-pass filtering with a 2000-s cut-off, and ICA-FIX denoising [63]. After parcellation with the Schaefer-100 atlas, each run yields a 100×1190 matrix of regional time-series sampled every 0.72 s [64].

Subject causal fingerprinting

We first verify the effectiveness of the proposed Causal-Dynamics Model combined with Modal Decomposition and Projection (CM+MD&P) for subject fingerprinting. Previous research has established that resting-state fMRI shows considerable differences in brain activation patterns among individuals [65]. The dataset consists of two resting sessions, Rest1 and Rest2, each with two scanning orders, LR and RL. Preprocessing follows Sect. 4.1.

We run four cross-condition folds: each of the four resting scans (Rest1-LR, Rest1-RL, Rest2-LR, Rest2-RL) is taken in turn as the reference database, denoted as Rh, while the remaining three scans serve as queries. This guarantees that reference and query data are always disjoint. Within each fold, model parameters (Q,A,B1,B2) are estimated only from the reference run; the ensuing modal matrices serve as one-shot “causal signatures” and are not updated during querying. Both baselines and the proposed method use the same four-fold split and identical preprocessing.

The subject fingerprint is obtained by solving (11). For the results presented below, we chose the feature set as T=[T^], which relies only on the dynamic models of slow interaction and ignores the fast one. More details about this will be discussed in Remark 2. The subject fingerprint result, presented in Table 1, demonstrate that the causal fingerprinting method, specifically CM+MD&P, consistently achieves an accuracy rate above 94% for all resting sessions. The highest accuracy observed is 96.334% for the Rest2 LR session. This performance significantly surpasses that of the correlation-based baseline method, FC+CoR, which averages approximately 45% accuracy. Even though capturing causal interactions is harder, our method not only outperforms the FC+CoR baseline but also other alternative approaches such as CM+CoR, CM+FN, and FC+MD&P. Tangent-Space Fingerprinting (TSF) achieves 88% accuracy, about 6 percentage points behind CM+MD&P, because it models only the Riemannian geometry of static functional connectivity. The PH-H0 landscape method attains 85%; its zero-dimensional topological summary likewise ignores directionality and temporal scale, leaving it 10 percentage points short of our causal approach. To ensure a fair comparison, we applied the CM+MD&P method as described in Sec. 3.2 and the FC+CoR method as outlined in the 2015 study by FES et al. [18] directly to our dataset, without any further adjustments or preprocessing. Although additional data processing could potentially improve the accuracy of all methods tested, our main goal is to confirm the viability and efficacy of causal signatures for individual identification. Therefore, we have deliberately avoided such optimizations to maintain an impartial comparison. To assess the generalizability of our approach, we ran experiments on a public dataset [66]. This dataset contains resting-state fMRI scans from healthy controls and Alzheimer’s disease patients. It does not include repeated sessions that match our original design. To work around this, we split one long resting-state scan into two separate segments. This produced two distinct datasets for analysis. Our method achieved nearly 99% accuracy in subject fingerprinting. These results highlight its strong performance and replicability across different populations and scanning protocols.

Table 1.

Subject fingerprinting using multiple approaches

Method CM+MD&P (%) CM+CoR (%) CM+FN (%) CM+GNN (%) FC+CoR (%) FC+MD&P (%) FC+FN (%) FC+GNN (%) TSF (%) PH-H0 (%)
Rest1 LR 95.396 58.397 42.251 <5 45.439 69.736 33.952 <5 88.7 85.6
Rest2 LR 96.334 61.295 47.175 <5 45.436 70.929 37.574 <5 89.4 86.2
Rest1 RL 94.970 57.118 41.333 <5 41.603 67.945 38.698 <5 87.9 84.7
Rest2 RL 94.203 58.978 43.463 <5 46.803 67.775 36.737 <5 88.5 84.3

CM causal dynamics model, FC functional connectivity, CoR correlation, MD&P modal decomposition and projection, GNN graph neural network, FN frobenius norm, PH-H0 persistent homology (H0 landscape) [47], TSF tangent-space fingerprinting [46]

A detailed comparison is presented in Table 1, showing the average accuracy (across cases from Fig. 6-Left) for combinations of feature extraction methods (CM and FC) and classification methods (MD&P, CoR, FN, and GNN). Here, FN refers to the Frobenius norm RQ-RDF, used to measure distances between query and database matrices for classification. The proposed CM+MD&P method achieves the highest performance among all combinations. Comparing FC+CoR and FC+MD&P suggests that MD&P can effectively evaluate similarities between FC matrices, leading to improved fingerprinting accuracy. The FN method is less robust than MD&P for classification. And the GNN method is generally unsuitable for subject fingerprinting due to limited data availability. We also tested other deep learning models with convolution or attention mechanisms and various layer designs, but they similarly showed poor performance.

Fig. 6.

Fig. 6

Left: Subject Fingerprinting based on CM+MD&P: The identification accuracy is compared with (black line) using the FC+CoR method. Right: Task Fingerprinting based on CM+GNN: The figure shows the correct/incorrect classification for the same/different fMRI tasks. The y-axis is split. The top segment (95–100%) shows the near-perfect accuracies, while the bottom segment (0–2%) zooms in on the small error rates

Remark 2

(Single vs. two-timescale modeling and feature selection) To implement (11), the above result considers a two-timescale model, but uses only the slow dynamical modes, i.e., (A,B2), for subject fingerprinting. In addition to this configuration, one can also consider: (i) using a single-timescale state-space model, or (ii) using the full two-timescale dynamics, i.e., (Q,A,B1,B2). In Table 2, we report the subject fingerprint accuracy achieved under these three configurations. The single-timescale model achieves approximately 30% accuracy, which indicates limited discriminative power when ignoring the multi-timescale nature of brain activity. Incorporating the full two-timescale dynamics significantly improves performance to around 80% (presented in our prior work [1]), which demonstrates the value of separating fast and slow modes. Surprisingly, the best results are achieved using only the reduced model (A,B2), which exceeds 94% accuracy across various resting-state permutations. This result suggests that while accounting for both fast and slow dynamics is necessary for enhancing performance, the most distinctive subject-specific information resides in the slower portion of these dynamics. A potential reason might be that fMRI data are typically dominated by hemodynamic responses spanning seconds, the faster sub-interval interactions and the simultaneous input effects tend to be noisier and harder to resolve.

Table 2.

Subject fingerprinting accuracy (%) comparing single- vs. two-timescale approaches

Method One timescale (%) Two timescale fast + slow (%) Two timescale slow only (%)
Rest1 LR 31.542 81.245 95.396
Rest2 LR 30.937 80.904 96.334
Rest1 RL 29.886 78.517 94.970
Rest2 RL 30.123 79.710 94.203

fMRI task causal fingerprinting

We validate the effectiveness of the proposed Causal Modeling (CM) combined with Graph Neural Network (GNN) method for task fingerprinting, as presented in Table 3. Using causal signatures from the HCP dataset, we split the data evenly, with 50% allocated for training and 50% for testing. The model achieves perfect accuracy (100%) for Rest and near-perfect accuracy for Emot, Lang, WMem. However, tasks such Gamb, Moto, Rela, and Soci exhibit slightly lower performance, with most misclassifications occurring among these tasks.

Table 3.

Task fingerprinting accuracy using multiple approaches

Method CM+GNN CM+RF CM+SVM FC+GNN R+GNN R+DNN R+DBRNN R+BAnD R+TCN–BiLSTM
Rest 100% 100% 99.232% 97.307% 96.629% 95.164% 94.355% 100% 99.532%
Emot 99.872% 99.488% 85.677% 89.160% 69.238% 72.912% 84.757% 98.493% 93.445%
Gamb 96.122% 74.168% 93.094% 75.553% 81.683% 68.535% 69.023% 94.176% 89.395%
Lang 99.159% 96.675% 89.514% 85.755% 89.177% 81.517% 82.204% 97.491% 91.304%
Moto 96.733% 80.562% 96.675% 77.982% 76.858% 66.374% 77.773% 95.754% 88.992%
Rela 98.051% 81.419% 81.841% 90.609% 86.554% 87.700% 79.628% 93.121% 89.663%
Soci 97.544% 75.703% 77.749% 69.811% 73.706% 71.455% 69.950% 96.385% 92.118%
WMem 99.523% 90.767% 95.396% 78.760% 80.202% 76.620% 82.446% 98.299% 94.327%
Average 98.376% 87.350% 89.897% 83.117% 81.756% 78.929% 80.017% 96.710% 92.347%

Bold siginficance indicate the best value within each row

CM causal dynamics model, GNN graph neural network, DNN deep neural network [42]

FC functional connectivity, SVM support vector machines, BAnD brain attend and decode [44]

R raw data, RF random forest, DBRNN deep bidirectional recurrent neural network [43]

TCN–BiLSTM temporal convolutional network–bidirectional long short–term memory [45]

To illustrate the strengths of our CM+GNN method, we compared it to several state-of-the-art approaches across eight tasks [43, 44, 67].2 Other columns in Table 3 report the average accuracy of these methods. Our CM+GNN method consistently outperforms others, achieving the highest average accuracy (98.376%), which highlights its robustness in task fingerprinting. For Emot and Lang, CM+GNN effectively identifies unique neural patterns, surpassing CM+RF and Raw+DBRNN. Performance decreases slightly for tasks like Gamb, Moto, and Soci, possibly due to overlapping neural activity. Similar declines are observed in other methods, such as Raw+DNN and Raw+DBRNN, for these challenging tasks.

We also evaluated the R + TCN–BiLSTM baseline, a Temporal Convolutional Network followed by bidirectional LSTM layers trained end to end on raw fMRI series [45]. Its mean accuracy of 92.3% is higher than the other raw-data deep nets (R + GNN, R + DNN, R + DBRNN) yet remains about 6 percentage points below CM + GNN. While the convolutional blocks capture local temporal motifs, the model lacks directed and multi-timescale information, limiting its discriminative power relative to our causal signatures.

To further assess the reliability of our causality-based approach, we tested it on an independent dataset [68]. The method achieved nearly 95% accuracy in task fingerprinting, confirming its consistency and wide applicability.

Effect of sampling interval Δt on model fingerprinting accuracy

Note that the sampling interval Δt is critical to our discrete-time causal model. In this experiment, we adopt the native sampling interval Δt=0.72 s because it is the scanner’s repetition time (TR), as specified in the data description [61]. Repetition time is the time interval between the beginning of one MRI pulse sequence and the beginning of the next for the same slice. Staying at this original resolution avoids any resampling artifacts and preserves the rapid hemodynamic changes our two-timescale model is meant to capture.

Nevertheless, it is still important to discuss the sensitivity of our algorithm to the sampling interval Δt. Table 4 summarizes how resampling affects subject fingerprinting. Accuracy is computed for each of the four resting runs at three resolutions: the native 0.72 s, and down-sampled versions at 1.44 s (every second frame) and 2.16 s (every third frame).

Table 4.

Subject fingerprinting accuracy (%) under different sampling intervals Δt (seconds)

Δt (s) Δt=0.72 (%) Δt=1.44 (%) Δt=2.16 (%)
Rest1 LR 95.396 90.143 80.636
Rest2 LR 96.334 92.002 81.452
Rest1 RL 94.970 88.747 78.890
Rest2 RL 94.203 89.563 77.938

The results in Table 4 confirm that accuracy peaks at the native interval (Δt=0.72 s), the scanner’s inherent repetition time in the HCP protocol and therefore the finest temporal resolution available. Accuracy drops by roughly five percentage points when the data are down-sampled to 1.44 s and falls a further ten points at 2.16 s. The two-timescale model evidently relies on sub-second variations that are smoothed out at coarser resolutions, degrading the estimated matrices Q&A blocks. We therefore retain Δt=0.72 s for all reported experiments to preserve temporal fidelity while avoiding unnecessary information loss.

Task fingerprinting is less sensitive: accuracy falls from 98.376% at Δt=0.72 s to 96.442% at 1.44 s and 92.127% at 2.16 s, suggesting that large-scale task-evoked activation patterns remain highly discriminative even when the temporal resolution is moderately reduced.

Visualization: directed causal interaction and reachability landscape

Directed causal interaction

We use Fig. 7 to present the causal signatures extracted for the eight HCP tasks (Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem) and serve as the foundation for our functional interpretation. Before plotting, the matrices Q and A are rescaled to a same range, which enables cross-task comparison on a unified metric. Each entry of Q or A conveys both magnitude and sign, which reveals the direction and excitation/inhibition relations of causal drive between pairs of regions. We use arrows in the figures to indicate the directions of these causal influences, which are discussed in detail for each task in the subsequent analysis. By separating matrix Q and matrix A, we can observe how brain activity evolves over different timescales. Matrix Q captures fast synchronous interactions among brain regions, reflecting real-time activation patterns during specific tasks. In contrast, Matrix A illustrates slower dynamic evolutions, showing how current states are influenced by previous states.

Fig. 7.

Fig. 7

Visualization of Brain Regions with Strongest Connection Weights, Showing Association Strengths Across Multiple Cognitive Tasks (Importance Scores Indicated by Color-Bar). LH refers to the left hemisphere, and RH refers to the right hemisphere of the brain. Arrows indicate directed relationships, illustrating which regions exert causal influence on others. For clarity of presentation, we employ a two-color scheme and apply a threshold to highlight only the most active brain areas and their directional interactions

For the resting state data, brain interactions concentrate within the default-mode network (DMN), especially the medial prefrontal cortex (mPFC) and posterior cingulate cortex (PCC), which support introspective thinking, self-related processing, memory integration and rest-task dichotomy [7]. The mPFC exerts influence over the PCC, modulating introspective and self-related processing. The emotion task shows strong connections in the amygdala, prefrontal cortex, and anterior insula. This is due to their roles in emotion generation and regulation. The gambling task primarily shows strong connections in the ventral striatum, prefrontal cortex, and insula, which are crucial for reward prediction and decision-making. The ventral striatum influences the prefrontal cortex and insula, affecting decision-making and reward processing. The language task primarily shows strong activations in Broca’s area (Brodmann areas 44 and 45 [69]) and Wernicke’s area (Brodmann area 22) in the left hemisphere. Broca’s area is crucial for language production, while Wernicke’s area is essential for language comprehension. Broca’s area influences Wernicke’s area, facilitating coordinated language processing. Motor execution engages the primary motor cortex, supplementary motor area (SMA), and basal ganglia. This reflects their active roles in movement planning and control. Causal arrows show the primary motor cortex directing the SMA as actions unfold. For the relational reasoning task, the dorsolateral prefrontal cortex (DLPFC) and posterior parietal cortex provide the principal scaffold for working memory, abstraction, and relational encoding. The DLPFC leads the parietal cortex to facilitate complex cognitive manipulation. Social cognition involves strong coupling between the mPFC and superior temporal sulcus (STS). These are regions implicated in theory of mind and socio-emotional appraisal. The mPFC drives the STS to modulate social evaluative processing. During working memory maintenance, robust edges connect the DLPFC, parietal cortex, and anterior cingulate cortex (ACC). This aligns with their critical roles in information storage and monitoring. Here, the DLPFC influences parietal regions to sustain mnemonic content.

It can be observed that certain tasks share overlapping strong connections in specific regions. For example, the gambling, social, and emotional tasks all rely on pathways linking the prefrontal cortex and insula, caused by the collaborative role of these regions’ in risk-related decision making, emotion processing, and social cognition. Language and relational tasks converge within left temporal structures, notably Wernicke’s area and the hippocampus, which highlights their joint involvement in linguistic analysis and declarative memory. Motor and working memory conditions depend on coordinated activity between the DLPFC and parietal cortex, underscoring these regions shared contribution to motor sequencing and the active maintenance of information. This might be one of the reasons for the performance degradation (commonly shared by most methods) for task fingerprinting.

Reachability landscape

Fig. 8 illustrates reachability-based heatmaps describing the potentially highest activation level for a single subject across eight cognitive modes: Rest, Emot, Gamb, Lang, Moto, Rela, Soci, and WMem. Each heatmap represents normalized activation levels computed from a 90-region cortical parcellation arranged on a 12×12 spatial grid. Brighter areas (yellow) indicate higher neural excitability under bounded input conditions, whereas darker regions (blue) denote lower activation potentials. During the resting state (Rest), the default-mode network (DMN) shows the strongest activation. The medial prefrontal cortex (mPFC) and posterior cingulate cortex (PCC) dominate this pattern [70]. During emotional processing (Emot), the limbic system becomes highly active. The ventromedial prefrontal cortex (vmPFC), amygdala, and insula lead this response [71]. In the gambling task (Gamb), the orbitofrontal cortex and ventral striatum show clear clusters of activation [72]. These clusters signal heightened responsiveness linked to reward anticipation and decision-making. Language-related activations (Lang) appear primarily within classical left-lateralized perisylvian areas, prominently involving Broca’s area (inferior frontal gyrus) and Wernicke’s area (superior temporal gyrus) [73]. Motor execution (Moto) triggers the motor network. The primary motor cortex, supplementary motor area, and nearby premotor regions show strong excitation [74]. Relational reasoning (Rela) engages the frontoparietal control network. The dorsolateral prefrontal cortex (dlPFC) and posterior parietal cortex display distinct activation foci [75]. Social cognition (Soci) relies on theory-of-mind regions. The medial prefrontal cortex and temporoparietal junction (TPJ) dominate the pattern [76]. Working memory (WMem) activates a frontoparietal network. The dorsolateral prefrontal cortex and superior parietal cortex show synchronized engagement [77]. These reachability-based visualizations provide individualized activation maps. They validate established neurofunctional models across cognitive domains. They also demonstrate the capability of reachability analysis for revealing concentrated brain regions under different modes.

Conclusion

To the best of our knowledge, we are among the first to investigate and quantify large-scale brain fingerprints in the context of causal dynamics. This paper proposed new methods that employ an implicit-explicit discretization scheme to derive a new state-space model, which captures causal signatures as directed interactions with two-timescale temporal resolution. The causal dynamics model also enabled a novel visualization tool: the reachability landscape of brain states, which quantitatively represents the possible activation levels that the brain regions can reach under various fMRI tasks. The effectiveness of the proposed methods and visualization tools was validated using real-world datasets and through comparisons with other state-of-the-art techniques.

Despite the novelty of the proposed approach, we remark on the following limitations: (i) In our approach, system inputs were manually specifically selected based on different functional brain regions, while other ways of choosing system state input regions may potentially further improve the performance. (ii) While our subject fingerprinting method is model-based and generalizable without training, the task fingerprinting method relies on training a Graph Neural Network (GNN), which may limit its generalization across datasets. (iii) Subject fingerprinting shows measurable sensitivity to the sampling interval Δt (coarser sampling degrades accuracy).

Regarding clinical applications, our proposed method has the potential for the development of diagnostic and prognostic tools for neurodegenerative diseases such as Alzheimer’s Disease (AD). By analyzing functional and directional influences between brain regions, the proposed approach can associate changes in brain network dynamics with the emergence of clinical symptoms.

In future work, we plan to: (i) Explore causal fingerprints and the reachability landscape concept in both healthy controls (e.g., through other open neuroimaging datasets) and in cases of neurodegenerative diseases such as Alzheimer’s disease [11, 55, 78]; (ii) Enhance the generalizability of our task fingerprinting approach, potentially through transfer learning techniques to accelerate adaptation to new datasets. (iii) Systematically assess robustness to choices of sampling interval Δt, parcellation, and preprocessing, and report performance in a manner that is independent of the acquisition and processing pipeline; iv) Extend the two-timescale linear model to piecewise linear or mildly nonlinear dynamics with uncertainty quantification, and replace manual selection of input regions with a data-driven procedure to reduce design bias.

Acknowledgements

This work was supported in part by the NSF ECCS Award 2332210, 4-VA, a collaborative partnership for advancing the Commonwealth of Virginia, NIH Grants U01 AG068057 and RF1 AG068191.

Appendix

Details of GNN training

We train the GNN in a supervised fashion using all Rs,hD labeled by task h. Each node in the graph corresponds to one brain region, whose feature vector is that region's row in Rs,h; the normalised A block sets up the adjacency. We minimise a negative log-likelihood loss comparing the GNN's predicted task label to the true h. To enhance generalisation and prevent over-fitting, we employ weight decay (L2 regularisation) within AdamW, in tandem with SAM [79] as a meta-optimiser. Specifically, in each training iteration SAM and AdamW operate in concert: AdamW (learning rate 1×10-3, weight decay 5×10-4) performs the parameter update, while SAM adds a perturbation step of radius ρ=0.02 to avoid sharp minima. The learning rate is reset every T0=20 epochs by a cosine warm-restart schedule; each cycle length doubles via Tmult=2 and decays to ηmin=10-6. We use mixed precision (AMP), gradient clipping at 1.0, mini-batches of size 32, and early stopping with patience 20. On-the-fly data augmentation further improves robustness: Gaussian feature noise with σ=0.1, random edge drop of 20%, and subgraph sampling that keeps 80% of the nodes. At inference time, a query RQ is fed into the same pipeline, producing a log-softmax distribution over tasks; the highest score is taken as the predicted label. Through this causal-based GNN we leverage the multi-timescale structure of [QAB1B2] to classify fMRI tasks with high accuracy.

Results validation

For each baseline we construct a length-391 vector of subject-wise accuracies (one paired value per subject, averaged over the four cross-condition folds) and apply a two-sided paired t-test after verifying normality with the Shapiro-Wilk test. Under a Bonferroni adjustment for the seven subject-level comparisons (α=0.05/70.0071), every contrast with the proposed CM+MD&P remains highly significant, with p-values spanning 4.7×10-22 to <10-33 (Table 5, left).

Table 5.

Paired t-tests on per-subject accuracy differences (n=391)

Baseline ΔAcc [pp] p-value
Subject – CM+MD&P vs. baseline
 CM+CoR + 36.2 4.7e-22
 CM+FN + 51.6 8.2e-27
 FC+CoR + 48.8 2.9e-24
 FC+MD&P + 24.7 2.4e-14
 FC+FN + 57.5 1.0e-33
 TSF + 6.6 2.1e-07
 PH-H0 + 10.0 4.5e-09
Task – CM+GNN vs. baseline
 CM+RF + 10.4 1.1e-12
 CM+SVM + 8.1 4.2e-11
 FC+GNN + 14.6 3.6e-12
 R+GNN + 15.8 6.8e-13
 R+DNN + 18.7 4.8e-14
 R+DBRNN + 17.5 9.9e-14
 R+BAnD + 1.5 1.6e-02
 R+TCN–BiLSTM + 6.0 3.2e-10

ΔAcc = (proposed − baseline) in percentage points; Bonferroni thresholds: αsubj=0.0071 (7 contrasts), αtask=0.0063 (8 contrasts)

For task fingerprinting we test eight baselines, so the corrected threshold is α=0.05/80.0063. All baselines, including the newly added R + TCN BiLSTM, are significantly different from the proposed CM+GNN except R + BAnD, whose p=1.6×10-2 exceeds the threshold (Table 5, right).

Mean accuracies with 95% confidence intervals across folds are 95.4%±0.3 for subject fingerprinting and 98.4%±0.2 for task fingerprinting, indicating that the estimates are stable. Ten random perturbations of the reference/query split alter subject accuracy by no more than 0.6 percentage points (standard deviation), confirming that the results are not driven by a particular split.

Data Availability

This study analyzed publicly available, de-identified fMRI data from the Human Connectome Project (HCP; Young Adult/S1200 release). Access to the raw data is provided by HCP to registered researchers under the project’s data-use terms. No new data were collected or generated in this study; processed data and analysis outputs are available from the corresponding author upon request.

Footnotes

1

In practice, τ is typically on the order of 0.002s [49], thus 1ϵ1360.

2

For non-open-source methods, we reproduced the results based on the descriptions provided in the original papers.

A preliminary version of this work appeared as [1] at the 2024 ACM-BCB Conference.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Song D, Shen L, Duong-Tran D, Wang X. Causality-based subject and task fingerprints using fMRI time-series data. In: Proceedings of the 15th ACM international conference on bioinformatics, computational biology and health informatics; 2024. pp. 1–10.
  • 2.Logothetis NK. The neural basis of the blood-oxygen-level-dependent functional magnetic resonance imaging signal. Philos Trans R Soc Lond B Biol Sci. 2002;357(1424):1003–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kiebel SJ, Klöppel S, Weiskopf N, Friston KJ. Dynamic causal modeling: a generative model of slice timing in fMRI. Neuroimage. 2007;34(4):1487–96. [DOI] [PubMed] [Google Scholar]
  • 4.Ross LN, Bassett DS. Causation in neuroscience: keeping mechanism meaningful. Nat Rev Neurosci. 2024;25:81. [DOI] [PubMed] [Google Scholar]
  • 5.Mannino M, Bressler SL. Foundational perspectives on causality in large-scale brain networks. Phys Life Rev. 2015;15:107–23. [DOI] [PubMed] [Google Scholar]
  • 6.Rigoux L, Daunizeau J. Dynamic causal modelling of brain-behaviour relationships. Neuroimage. 2015;117:202–21. [DOI] [PubMed] [Google Scholar]
  • 7.Duong-Tran D, Abbas K, Amico E, Corominas-Murtra B, Dzemidzic M, Kareken D, et al. A morphospace of functional configuration to assess configural breadth based on brain functional networks. Netw Neurosci. 2021;5(3):666–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Garai S, Xu F, Duong-Tran D, Zhao Y, Shen L. Mining correlation between fluid intelligence and whole-brain large scale structural connectivity. AMIA Summits Transl Sci Proc. 2023;2023:225. [PMC free article] [PubMed] [Google Scholar]
  • 9.Gu S, Pasqualetti F, Cieslak M, Telesford QK, Yu AB, Kahn AE, et al. Controllability of structural brain networks. Nat Commun. 2015;6(1):8414. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Amico E, Abbas K, Duong-Tran D, Tipnis U, Rajapandian M, Chumin E, et al. Toward an information theoretical description of communication in brain networks. Netw Neurosci. 2021;5(3):646–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Xu F, Garai S, Duong-Tran D, Saykin AJ, Zhao Y, Shen L. Consistency of graph theoretical measurements of Alzheimer’s disease fiber density connectomes across multiple parcellation scales. In: 2022 IEEE international conference on bioinformatics and biomedicine (BIBM), pp. 1323–1328. IEEE; 2022.
  • 12.Smith SM, Vidaurre D, Beckmann CF, Glasser MF, Jenkinson M, Miller KL, et al. Functional connectomics from resting-state fMRI. Trends Cogn Sci. 2013;17(12):666–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Abbas K, Amico E, Svaldi DO, Tipnis U, Duong-Tran D, Liu M, et al. GEFF: graph embedding for functional fingerprinting. Neuroimage. 2020;221:117181. [DOI] [PubMed] [Google Scholar]
  • 14.Abbas K, Liu M, Wang M, Duong-Tran D, Tipnis U, Amico E, et al. Tangent functional connectomes uncover more unique phenotypic traits. Iscience. 2023;26(9):107624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Chiêm B, Abbas K, Amico E, Duong-Tran D, Crevecoeur F, Goñi J. Improving functional connectome fingerprinting with degree-normalization. Brain Connect. 2022;12(2):180–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Duong-Tran D, Kaufmann R, Chen J, Wang X, Garai S, Xu FH, et al. Homological landscape of human brain functional sub-circuits. Mathematics. 2024;12(3):455. [Google Scholar]
  • 17.Duong-Tran D, Nguyen N, Mu S, Chen J, Bao J, Xu F, Garai S, Cadena-Pico J, Kaplan AD, Chen T, et al. A principled framework to assess information theoretical fitness of brain functional sub-circuits. arXiv preprint arXiv:2406.18531
  • 18.Finn ES, Shen X, Scheinost D, Rosenberg MD, Huang J, Chun MM, et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. Nat Neurosci. 2015;18(11):1664–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Amico E, Goñi J. The quest for identifiability in human functional connectomes. Sci Rep. 2018;8(1):8254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Duong-Tran D, Magsino M, Goñi J, Shen L. Preserving human large-scale brain connectivity fingerprint identifiability with random projections. In: 2024 IEEE international symposium on biomedical imaging (ISBI): 2024, pp. 1–5. IEEE
  • 21.Duong-Tran D, Nguyen N, Mu S, Chen J, Bao J, Xu FH, et al. A principled framework to assess the information-theoretic fitness of brain functional sub-circuits. Mathematics. 2024;12(19):2967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Nguyen N, Hou T, Amico E, Zheng J, Huang H, Kaplan AD, Petri G, Goñi J, Kaufmann R, Zhao Y, et al. Volume-optimal persistence homological scaffolds of hemodynamic networks covary with meg theta-alpha aperiodic dynamics. In: International conference on medical image computing and computer-assisted intervention; 2024. pp. 519–529. Springer.
  • 23.Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SC. Deep learning for person re-identification: a survey and outlook. IEEE Trans Pattern Anal Mach Intell. 2021;44(6):2872–93. [Google Scholar]
  • 24.Cai B, Zhang G, Zhang A, Xiao L, Hu W, Stephen JM, et al. Functional connectome fingerprinting: identifying individuals and predicting cognitive functions via autoencoder. Hum Brain Mapp. 2021;42(9):2691–705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Calhoun VD, Adali T, Pearlson GD, Pekar JJ. A method for making group inferences from functional mri data using independent component analysis. Hum Brain Mapp. 2001;14(3):140–51. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.McKeown MJ, Makeig S, Brown GG, Jung T-P, Kindermann SS, Bell AJ, et al. Analysis of fMRI data by blind separation into independent spatial components. Hum Brain Mapp. 1998;6(3):160–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Beckmann CF, DeLuca M, Devlin JT, Smith SM. Investigations into resting-state connectivity using independent component analysis. Philos Trans R Society B Biol Sci. 2005;360(1457):1001–13. [Google Scholar]
  • 28.Smith SM, Fox PT, Miller KL, Glahn DC, Fox PM, Mackay CE, et al. Correspondence of the brain’s functional architecture during activation and rest. Proc Natl Acad Sci. 2009;106(31):13040–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Van Den Heuvel MP, Pol HEH. Exploring the brain network: a review on resting-state fMRI functional connectivity. Eur Neuropsychopharmacol. 2010;20(8):519–34. [DOI] [PubMed] [Google Scholar]
  • 30.Umair M, Shen L, Duong-Tran D, Wang X. Subject and task fingerprint using dynamics reconstruction from fMRI time-series data. In: OHBM; 2023.
  • 31.Friston KJ, Harrison L, Penny W. Dynamic causal modelling. Neuroimage. 2003;19(4):1273–302. [DOI] [PubMed] [Google Scholar]
  • 32.Stephan KE, Penny WD, Daunizeau J, Moran RJ, Friston KJ. Bayesian model selection for group studies. Neuroimage. 2009;46(4):1004–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Granger CW. Investigating causal relations by econometric models and cross-spectral methods. Econ J Econ Soc. 1969;424–438.
  • 34.Barrett AB, Barnett L. Granger causality is designed to measure effect, not mechanism. Front Neuroinform. 2013;7:6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Roebroeck A, Formisano E, Goebel R. Mapping directed influence over the brain using granger causality and fMRI. Neuroimage. 2005;25(1):230–42. [DOI] [PubMed] [Google Scholar]
  • 36.Wang X, Cortés J. Data-driven reconstruction of firing rate dynamics in brain networks. In: 2021 60th IEEE conference on decision and control (CDC); 2021. pp. 6463–6468. IEEE
  • 37.Wang X, Cortes J. Efficient reconstruction of neural mass dynamics modeled by linear-threshold networks. arXiv preprint arXiv:2308.14231. 2023.
  • 38.Wang X, Cortés J. Data-driven control of linear-threshold network dynamics. In: 2022 American Control Conference (ACC); 2022. pp. 114–119. IEEE
  • 39.Iturria-Medina Y, Carbonell FM, Sotero RC, Chouinard-Decorte F, Evans AC, Initiative ADN, et al. Multifactorial causal model of brain (dis) organization and therapeutic intervention: application to Alzheimer’s disease. Neuroimage. 2017;152:60–77. [DOI] [PubMed] [Google Scholar]
  • 40.Breiman L. Random forests. Mach Learn. 2001;45:5–32. [Google Scholar]
  • 41.Hearst MA, Dumais ST, Osuna E, Platt J, Scholkopf B. Support vector machines. IEEE Intell Syst Appl. 1998;13(4):18–28. [Google Scholar]
  • 42.Wang X, Liang X, Jiang Z, Nguchu BA, Zhou Y, Wang Y, et al. Decoding and mapping task states of the human brain via deep learning. Hum Brain Mapp. 2020;41(6):1505–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhao S, Fang L, Wu L, Yang Y, Han J. Decoding task sub-type states with group deep bidirectional recurrent neural network. In: International conference on medical image computing and computer-assisted intervention, pp. 241–250. Springer; 2022.
  • 44.Nguyen S, Ng B, Kaplan AD, Ray P. Attend and decode: 4D fMRI task state decoding using attention models. In: Machine learning for health. pp. 267–279. PMLR; 2020.
  • 45.Kucukosmanoglu M, Garcia JO, Brooks J, Bansal K. Influence of cognitive networks and task performance on fMRI-based state classification using DNN models. Sci Rep. 2025;15(1):23689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Curic D, Krishna SKV, Davidsen J. Efficacy of functional connectome fingerprinting using tangent-space brain networks. Netw Neurosci. 2025;9(2):549–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wang Y, Xian J, Chen Y, Yan Y. Topological signatures of brain dynamics: persistent homology reveals individuality and brain-behavior links. Front Hum Neurosci. 2025;19:1607941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Polimeni JR, Lewis LD. Imaging faster neural dynamics with fast fMRI: a need for updated models of the hemodynamic response. Prog Neurobiol. 2021;207:102174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kandel ER, Schwartz JH, Jessell T, Siegelbaum SA, Hudspeth A. Principles of neural science. McGraw-Hill Medical; 2013.
  • 50.Peng X, Lu C, Yi Z, Tang H. Connections between nuclear-norm and frobenius-norm-based representations. IEEE Trans Neural Netw Learn Syst. 2016;29(1):218–24. [DOI] [PubMed] [Google Scholar]
  • 51.Dahleh M, Dahleh MA, Verghese G. Lectures on dynamic systems and control. A+ A. 2004;4(100):1–100. [Google Scholar]
  • 52.Horn RA, Johnson CR. Matrix analysis. Cambridge University Press; 2012.
  • 53.Khemani B, Patil S, Kotecha K, Tanwar S. A review of graph neural networks: concepts, architectures, techniques, challenges, datasets, applications, and future directions. J Big Data. 2024;11(1):18. [Google Scholar]
  • 54.Velickovic P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y, et al. Graph attention networks. Stat. 2017;1050(20):10–48550. [Google Scholar]
  • 55.Xu K, Zhang M, Jegelka S, Kawaguchi K. Optimization of graph neural networks: Implicit acceleration by skip connections and more depth. In: International conference on machine learning. pp. 11592–11602. PMLR; 2021.
  • 56.Van Essen DC, Smith SM, Barch DM, Behrens TE, Yacoub E, Ugurbil K, Consortium W-MH, et al. (2013) The WU-Minn human connectome project an overview. NeuroImage. 80:62–79
  • 57.Schaefer A, Kong R, Gordon EM, Laumann TO, Zuo X-N, Holmes AJ, et al. Local-global parcellation of the human cerebral cortex from intrinsic functional connectivity MRI. Cereb Cortex. 2018;28(9):3095–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Hardwick RM, Lesage E, Eickhoff CR, Clos M, Fox P, Eickhoff SB. Multimodal connectivity of motor learning-related dorsal premotor cortex. Neuroimage. 2015;123:114–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Tipnis U, Abbas K, Tran E, Amico E, Shen L, Kaplan AD, Go IJ. Derived products from hcp-ya fMRI. Lawrence Livermore National Laboratory (LLNL) Open Data Initiative. 2022. https://library.ucsd.edu/dc/object/bb59818382
  • 60.Tipnis U, Abbas K, Tran E, Amico E, Shen L, Kaplan AD, Goñi J. Functional connectome fingerprint gradients in young adults. arXiv preprint arXiv:2011.05212 2020.
  • 61.Ugurbil K, Van Essen D. 1200 Subjects Data Release. https://www.humanconnectome.org/storage/app/media/documentation/s1200/HCP_S1200_Release_Reference_Manual.pdf
  • 62.Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, et al. The minimal preprocessing pipelines for the human connectome project. Neuroimage. 2013;80:105–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Salimi-Khorshidi G, Douaud G, Beckmann CF, Glasser MF, Griffanti L, Smith SM. Automatic denoising of functional MRI data: combining independent component analysis and hierarchical fusion of classifiers. Neuroimage. 2014;90:449–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Glasser MF, Smith SM, Marcus DS, Andersson JL, Auerbach EJ, Behrens TE, et al. The human connectome project’s neuroimaging approach. Nat Neurosci. 2016;19(9):1175–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Raichle ME, Mintun MA. Brain work and brain imaging. Annu Rev Neurosci. 2006;29:449–76. [DOI] [PubMed] [Google Scholar]
  • 66.Mascali D, DiNuzzo M, Gili T, Moraschi M, Fratini M, Maraviglia B, Serra L, Bozzali M, Giove F. Resting-state fMRI in dementia patients. Harvard Dataverse; 2015.
  • 67.LaConte S, Strother S, Cherkassky V, Anderson J, Hu X. Support vector machines for temporal classification of block design fMRI data. Neuroimage. 2005;26(2):317–29. [DOI] [PubMed] [Google Scholar]
  • 68.Pinho AL, Amadon A, Gauthier B, Clairis N, Knops A, Genon S, et al. Individual brain charting dataset extension, second release of high-resolution fMRI data for cognitive mapping. Sci Data. 2020;7(1):353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Damasio AR, Damasio H. Brain and language. Sci Am. 1992;267(3):88–109. [Google Scholar]
  • 70.Raichle ME. The brain’s default mode network. Annu Rev Neurosci. 2015;38(1):433–47. [DOI] [PubMed] [Google Scholar]
  • 71.Phan KL, Wager TD, Taylor SF, Liberzon I. Functional neuroimaging studies of human emotions. CNS Spectr. 2004;9(4):258–66. [DOI] [PubMed] [Google Scholar]
  • 72.Knutson B, Adams CM, Fong GW, Hommer D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. J Neurosci. 2001;21(16):159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Friederici AD, Brauer J, Lohmann G. Maturation of the language network: from inter-to intrahemispheric connectivities. PLoS ONE. 2011;6(6):20726. [Google Scholar]
  • 74.Lotze M, Halsband U. Motor imagery. J Physiology-paris. 2006;99(4–6):386–95. [Google Scholar]
  • 75.Fedorenko E. The role of domain-general cognitive control in language comprehension. Front Psychol. 2014;5:335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Saxe R, Baron-Cohen S. The neuroscience of theory of mind. Taylor & Francis; 2006.
  • 77.Curtis CE, D’Esposito M. Persistent activity in the prefrontal cortex during working memory. Trends Cogn Sci. 2003;7(9):415–23. [DOI] [PubMed] [Google Scholar]
  • 78.Garai S, Liu M, Xu F, Goñi J, Duong-Tran D, Zhao Y, Shen L. Effect of brain network scale on persistence cycles: an ADNI comparative study. In: Alzheimer’s association international conference. ALZ; 2024.
  • 79.Foret P, Kleiner A, Mobahi H, Neyshabur B. Sharpness-aware minimization for efficiently improving generalization. arXiv preprint arXiv:2010.01412 2020.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This study analyzed publicly available, de-identified fMRI data from the Human Connectome Project (HCP; Young Adult/S1200 release). Access to the raw data is provided by HCP to registered researchers under the project’s data-use terms. No new data were collected or generated in this study; processed data and analysis outputs are available from the corresponding author upon request.


Articles from Health Information Science and Systems are provided here courtesy of Springer

RESOURCES