Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 30.
Published in final edited form as: Med Image Comput Comput Assist Interv. 2018 Sep 13;11072:145–153. doi: 10.1007/978-3-030-00931-1_17

A Riemannian Framework for Longitudinal Analysis of Resting-State Functional Connectivity

Qingyu Zhao 1, Dongjin Kwon 1,2, Kilian M Pohl 2
PMCID: PMC7526985  NIHMSID: NIHMS1630604  PMID: 33005907

Abstract

Even though the number of longitudinal resting-state-fMRI studies is increasing, accurately characterizing the changes in functional connectivity across visits is a largely unexplored topic. To improve characterization, we design a Riemannian framework that represents the functional connectivity pattern of a subject at a visit as a point on a Riemannian manifold. Geodesic regression across the ‘sample’ points of a subject on that manifold then defines the longitudinal trajectory of their connectivity pattern. To identify group differences specific to regions of interest (ROI), we map the resulting trajectories of all subjects to a common tangent space via the Lie group action. We account for the uncertainty in choosing the common tangent space by proposing a test procedure based on the theory of latent p-values. Unlike existing methods, our proposed approach identifies sex differences across 246 subjects, each of them being characterized by three rs-fMRI scans.

1. Introduction

Longitudinal resting-state(rs)-fMRI studies, in which participants are scanned at multiple visits, have been increasingly used for investigating functional connectivity changes and development in human brains [1,2]. However, current methods for rs-fMRI group analysis are mostly designed for cross-sectional studies [3,4]. In this paper, we propose a framework for performing group analysis on longitudinal rs-fMRI data.

Cross-sectional studies often encode the functional connectivity of a subject as an n × n covariance matrix C of BOLD (blood-oxygen-level dependent) time courses associated with n ROIs. To identify differences in functional connectivity between two groups, a univariate group test is typically applied to each connection (each element in the upper triangle of C). One problem in this practice is that the univariate tests neglect the strong statistical dependence among matrix elements: a covariance matrix C is confined by the positive-definite constraint C = CT, xCxT > 0 for all non-zero xn. One way to alleviate this problem is to leverage the fact that covariance matrices form a Riemannian manifold [5]. Previous rs-fMRI connectivity studies have reported improved detection sensitivity by performing group analysis directly on that manifold [4]. Based on this observation, we design a Riemannian framework to analyze the change in connectivity patterns captured by longitudinal studies.

In a longitudinal study, each subject is characterized by a series of covariance matrices representing connectivity patterns at multiple visits. Motivated by previous works [6], our longitudinal framework is composed of two parts: (a) fitting a longitudinal trajectory on the covariance matrices of each subject via geodesic regression on the manifold [7] (Sect. 2); and (b) comparing subject-specific trajectories across groups (Sect. 3). The challenge in (b) is that trajectories of different subjects are not directly comparable as they are essentially tangent vectors defined in different tangent spaces. Several methods have been explored to handle this problem, either by directly performing group analysis on the ‘tangent bundle space’ [6] or by designing mixed-effect models for manifolds [8]. These methods, however, consider the object of interest (in our case a covariance matrix) as a single manifold-valued variable, such that group difference can only be identified with respect to the entire matrix (the whole brain connectivity) instead of each matrix element (connectivity between two ROIs). In order to enable ROI-specific analysis, we map all tangent vectors to a common tangent space of a template point on the manifold. This enables univariate testing to each matrix element across all mapped tangent vectors.

In addition, we define the mapping function based on the Lie group action and briefly discuss its favorable properties to the popular parallel transport mechanism [9,10]. We define the common tangent space via the identity matrix [9] and the Fréchet mean [4], and we argue that the latter is preferred in the context of connectivity analysis. Finally, our group analysis accounts for the uncertainty in estimating the template via a robust test procedure based on the theory of latent p-values. We finally validate our proposed Riemannian framework using both synthetic and real rs-fMRI datasets.

2. Computing Subject-Specific Trajectory

Recall that the space of n × n covariance matrices forms a Riemannian manifold M [5]. Let A be a point on M, TAM the tangent space at A, and XTAM a tangent vector. There is a unique geodesic curve γ (a locally length-minimizing curve on the manifold) with γ(0) = A and γ(0)′ = X. The analytical equation of a geodesic is defined by the exponential map,

γ(t)=ExpA(tX):=A12expmA12(tX)A12A12, (1)

where expm is the matrix exponential operator. In other words, the exponential map ExpA(X) at an initial point A projects a tangent vector X to a point on the manifold at γ(1) along the geodesic γ defined by (A, X). The inverse mapping of ExpA(X) is called the log map. It projects a point BM back to a tangent vector at A via LogA(B):=A12logmA12BA12A12

Now let us consider M covariance matrices of a subject {C1, …, CM} measured at M visits. Let {t1, …, tM} be the time associated with those visits. Without loss of generality, we translate {ti} such that t1 = 0. We then use geodesic regression [7] to characterize the change of {Ci} over time (the relationship between ti and Ci). Specifically, to find a geodesic curve (A^, X^) that optimally fits the data (Fig. 1), we minimize the following objective function:

(A^,X^)=argminA,Xi=1MdExpAtiX,Ci2, (2)

where d(A, B) measures the geodesic distance between A, BM via the Riemannian metric d(A,B):=LogA(B)A:=trlogmA12BA12.

Fig. 1.

Fig. 1.

(a) A Riemannian framework for longitudinal connectivity analysis: the jth subject’s trajectory (red dashed curve) is fitted via geodesic regression on subject-specific covariance matrices (triangles on the left). All resulting subject-specific tangent vectors are mapped via function ϕ to the tangent space of C¯ to perform group analysis. (b) Top: 3 covariance matrices {C1, C2, C3} of a subject with t = 0, 1, 2. Bottom: The optimal geodesic (A^, X^) derived from geodesic regression.

In particular, the initial point A^ characterizes the connectivity pattern at baseline. The tangent vector X^ characterizes the longitudinal trajectory of that subject’s connectivity. Note that the tangent space TA^M is the vector space of n×n symmetric matrices. Therefore, unlike A^, the matrix elements in the upper triangle of X^ are mutually independent for univariate tests.

3. Group Analysis for Trajectories

Now let {(Aj, Xj)|j = 1, …, N} represent the geodesics of N different subjects. While cross-sectional analysis is interested in analyzing baseline matrices {Aj} [4], here we aim to identify group differences in {Xj}, i.e., the difference across subjects in their longitudinal changes. As mentioned, these tangent vectors are defined in different tangent spaces (e.g. the red and green vector in Fig. 1a), therefore not directly comparable. To handle this problem, we map all tangent vectors {Xj|j = 1, …, N} to a common tangent space TC¯M. This step essentially requires the definition of (a) a mapping function ϕAjC¯Xj=X¯j and (b) a template point C¯. Finally, we estimate the p-values of univariate testing for all n(n + 1)/2 matrix elements across X¯jj=1,,N.

Choosing the Mapping Function ϕ.

A popular choice of ϕ is the parallel transport [9,10], which transports tangent vectors on a manifold such that they stay parallel with respect to the affine connection. Despite its appealing geometrical meaning, the major drawback of parallel transport is its path-dependency: transporting a tangent vector along two different curves with the same start and end point generally results in two different ‘copies’ of the vector. This phenomenon can lead to ambiguity in choosing the template for group analysis. To show this effect, let ϕABp(X) denote the parallel transport of XTAM along the geodesic from A to B. Let X1TA1M and X2TA2M be two subject-specific tangent vectors. We further assume that they are equivalent when transported to C¯, i.e., X¯1=ϕA1C¯pX1=ϕA2C¯pX2=X¯2. Now if we perturb C¯ by ε via C¯*=ExpC¯(ε) and transport X1, X2 to C¯*, in general,

X¯1*=ϕA1C¯*pϕC¯A1pX¯1ϕA2C¯*pϕC¯A2pX¯2=X¯2* (3)

as ϕp is path-dependent. Contradictory to our previous assumption that the two subject-specific tangents are equivalent, Eq. 3 reveals that they are different at C¯*. In other words, (in)equality relationships among {Xj} are variant to the template selection, which can lead to serious ambiguity in the comparison of subject-specific trajectories.

To resolve this problem, we exploit the fact that M is equipped with an affine-invariant Riemannian metric [5]. To be specific, let GLn denote the Lie group of all n × n invertible matrices. This group acts on M via a smooth mapping function ψ:GLn×MM,ψG(A):=GAGT=B, where GGLn and A, BM. This group action can be naturally extended to tangent vectors via its derivative map G(X) := GXGT = Y, where XTAM and YTBM. In other words, G achieves the mapping of tangent vectors across different tangent spaces based on the aforementioned smooth group action. With this construction, we propose the following mapping function

ϕABg(X)=dψG(X),G=B12A12. (4)

Since the group action ψ is transitive [5], so is ϕg, i.e., ϕBCgϕABg(X)=ϕACg(X). This property avoids the path-dependent assumption as required in parallel transport, so that (in)equality relationships among X¯j are invariant to the choice of the template.

Choosing the Template C¯.

In the context of connectivity analysis, we argue that the Fréchet mean [4] is a more appropriate template compared to the identity matrix I [9]. Recall that X(u, v), the (u, v)th element in a subject-specific tangent vector X, encodes the longitudinal information about connectivity between the uth and vth ROI. Since ϕAC¯gor ϕAC¯p is a general linear transformation, X¯(u,v) is a linear combination of X(u, v) and other matrix elements. Consequently, X¯(u,v) no longer precisely relates the two ROIs. This reveals one critical trade-off: only X models true subject-specific ROI information, whereas only X¯ is geometrically comparable with other tangent vectors. To alleviate this issue, we realize that when C¯ is close to A, C¯12A12 is close to an identity transformation (X¯(u,v)X(u,v)). This motivates us to choose C¯ as the Fréchet mean, because it is the ‘closest’ matrix to all {Aj} so that true ROI information can be optimally preserved for all subjects.

3.

A Robust Test Procedure.

In the end, a univariate test is applied to each connection (upper triangular matrix element) across X¯j. However, with different choices of C¯, both the null distribution and observed values of X¯j(u,v) vary accordingly to the associated ϕAC¯g. Therefore, the uncertainty in estimating the Fréchet mean can lead to unstable p-values. To solve this problem, we resort to the theory of latent p-values [11]. Recall that the p-value is defined as p=prX¯0>X¯H0, i.e., the probability of obtaining a result larger (right-tailed) than the observed X¯ under the null hypothesis H0. Recently, statisticians also interpret p-values as random variables [11]. Specifically, the latent p-value considers both the strength of the evidence against the null hypothesis, and the uncertainty in the evidence and null distribution. Here we regard both X¯0 and X¯ as unobservable latent variables. As described in [11], the latent p-value can be approximated by generating Monte-Carlo realizations of X¯0. We then let X¯0 and X¯ be dependent on a latent template C¯, and we marginalize C¯ by p=C¯ prX¯0(C¯)>X¯(C¯)C¯,H0pr(C¯). Finally, we can approximate this integration by sampling C¯ via bootstrapping. Intuitively, instead of regarding X¯ and X¯0 as deterministic variables, we perform multiple test procedures based on templates sampled from pr(C¯) (Algorithm 1).

4. Experiments

Synthetic Data.

In this experiment, we simulated covariance matrices for 2 groups of subjects. Each group has N = 20 subjects, and each subject has M = 3 covariance matrices. We added longitudinal changes to the 3 matrices of each subject in Group B. We validated our framework based on the accuracy in identifying those effects between the two groups.

We started from simulating BOLD time series using SimTB [12]. We randomized simulation parameters for each subject and derived covariance matrices from simulated BOLD signals. To be specific, we defined n = 10 ROIs and randomly grouped them into 3 independent networks (Fig. 1b). Only ROIs of the same network activated simultaneously, thus having non-zero covariances. The activation amplitude of each ROI was sampled from the standard Gaussian N(0,1). The unique activation (functional noise) probability of each ROI was u = 0.35. We added Riccan noise (imaging noise) with a Contrast-to-Noise Ratio (CNR) of 1.5. The above simulation parameters were then shared across subjects. To synthesize subject variability, we perturbed the activation amplitudes of each subject with N(0,0.05). Then, for each subject in Group A, BOLD time courses at 3 time points (t = 0, 1, 2) were generated using those simulation parameters. The only difference in simulating Group B subjects was that the activation amplitudes of the 3rd network successively changed at {t1, t2, t3}, with a changing rate sampled from N(0,0.25). Consequently, only the covariances within that network could change longitudinally. Figure 1b shows one example, where the 9 elements within the red square (the 3rd network) are true positives to be identified by group analysis. Unlike the simulation in [4], our strategy relies only on the basic fMRI signal generation mechanism, so that the simulated matrices do not bias any particular covariance modeling technique.

Figure 2a, b, c show the identified group differences (yellow) derived from three longitudinal analysis methods: (a) our proposed Riemannian framework with the mapping function ϕg (Eq. 4); (b) Riemannian framework with parallel transport ϕp (defined in [13]); and (c) linear regression on each element across subject-specific covariance matrices {Ci} (Euclidean). The final univariate test was the two-sample t-test. The latent p-value map (Sect. 3) was used in Method a and b, whereas Method c only required a deterministic p-value map as it directly compared the trajectories (linear slopes) of different subjects. Significant matrix elements were identified at p ≤ 0.05 after correcting for multiple comparison via the Bonferroni procedure. Due to the influence of ROI unique activation and the Riccan noise, none of the three methods identified all true positives. However, only Method a yielded no false positive. To show the non-deterministic nature of the univariate tests in Method a and b, we identified group differences using each of the 500 p-value maps and counted the frequency of each element being significant. The spread of the two resulting histograms (Fig. 2d, e) indicates the latent p-value map is preferred over any particular deterministic p-value map. Moreover, our proposed mapping function ϕg (Fig. 2d) achieved more consistent group test results compared to the parallel transport (Fig. 2e). Next, to show the importance of using the Fréchet mean as the template, we sampled 1000 C¯ via Principal Geodesic Analysis, and generated a deterministic p-value map for each sampled C¯. We sorted and binned the 1000 test results according to the distance from the sampled C¯ to the Fréchet mean. Figure 3a indicates when C¯ was near the Fréchet mean, the test accuracy significantly increased. Finally, we tested the three methods’ robustness against noise (u and CNR). Under each noise level, we repeated the experiments (including re-grouping ROIs and re-randomizing activation amplitudes) 10 times. Figure 3b shows that the Riemannian framework always outperformed the traditional Euclidean method, and our proposed mapping function ϕg always achieved the best result.

Fig. 2.

Fig. 2.

(a)(b)(c) Group differences (yellow) identified by the three methods. (d)(e) Histogram of identified elements counted using the 500 p-value maps.

Fig. 3.

Fig. 3.

(a) Number of true (false) positives averaged over templates sampled within a certain distance range. (b) ROC of the three methods under three noise levels. (c) The application of our proposed framework to the NCANDA dataset revealed sex effects in two connections. The p-values were corrected for multiple comparison.

The NCANDA Dataset.

We applied our framework to rs-fMRI data of 246 normal adolescents (age 12–21; 117 boys and 129 girls) from the National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA) [14]. Each participant in this dataset was scanned three times (baseline, 1-year follow-up and 2-year follow-up). We adopted the same preprocessing procedure as described in [14]. The brain was parcelated into 100 ROIs using [15]. The final univariate test was a general linear model (GLM) accounting for sex, age, site and race. Using Method a, we identified sex effects in two connections (Fig. 3c). Specifically, the connectivity between Frontal_Suf_L and Vermis_1 increases faster in girls. The connectivity between Frontal_Inf_Oper_R and Cuneus_L increases faster in boys. Method b and c did not yield any significant finding.

5. Conclusion

Based on the Riemannian modeling of covariance matrices, we introduced a framework for performing group analysis on longitudinal rs-fMRI. Importantly, our proposed tangent vector mapping function and latent p-value strategy aim to best compromise between the geometry of data and clinical interpretation. Both synthetic and real-data experiments indicated the potential of our proposed framework in longitudinal connectivity analysis. Nevertheless, a theoretical combination of Riemannian geometry and consistent statistics still remains an open topic. In addition, we need to further explore the challenging issue of characterizing multi-level longitudinality, i.e., dynamics in the time courses of each single visit vs. development across visits.

Acknowledgement.

This research was supported in part by NIH grants U24AA021697-06, AA005965, AA013521, AA017168.

References

  • 1.Odish O, et al. : Longitudinal resting state fMRI analysis in healthy controls and premanifest Huntington’s disease gene carriers: a three-year follow-up study. Hum. Brain Mapp 36(1), 110–119 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.van der Horn H, et al. : The default mode network as a biomarker of persistent complaints after mild traumatic brain injury: a longitudinal functional magnetic resonance imaging study. J. Neurotrauma 34(23), 3262–3269 (2017) [DOI] [PubMed] [Google Scholar]
  • 3.Beckmann C, Mackay C, Filippini N, Smith S: Group comparison of resting-state FMRI data using multi-subject ICA and dual regression. OHBM (2009) [Google Scholar]
  • 4.Varoquaux G, Baronnet F, Kleinschmidt A, Fillard P, Thirion B: Detection of brain functional-connectivity difference in post-stroke patients using group-level covariance modeling In: Jiang T, Navab N, Pluim JPW, Viergever MA (eds.) MICCAI 2010. LNCS, vol. 6361, pp. 200–208. Springer, Heidelberg: (2010). 10.1007/978-3-642-15705-925 [DOI] [PubMed] [Google Scholar]
  • 5.Pennec X, Fillard P, Ayache N: A Riemannian framework for tensor computing. IJCV 66(1), 41–66 (2006) [Google Scholar]
  • 6.Hong Y, Singh N, Kwitt R, Niethammer M: Group testing for longitudinal data In: Ourselin S, Alexander DC, Westin C-F, Cardoso MJ (eds.) IPMI 2015. LNCS, vol. 9123, pp. 139–151. Springer, Cham: (2015). 10.1007/978-3-319-19992-411 [DOI] [PubMed] [Google Scholar]
  • 7.Fletcher PT: Geodesic regression and the theory of least squares on Riemannian manifolds. IJCV 105(2), 171–185 (2013) [Google Scholar]
  • 8.Kim HJ, Adluru N, Suri H, Vemuri BC, Johnson SC, Singh V: Riemannian nonlinear mixed effects models: analyzing longitudinal deformations in neuroimaging. In: CVPR, pp. 172–181 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ng B, Dressler M, Varoquaux G, Poline JB, Greicius M, Thirion B: Transport on Riemannian manifold for functional connectivity-based classification In: Golland P, Hata N, Barillot C, Hornegger J, Howe R (eds.) MICCAI 2014. LNCS, vol. 8674, pp. 405–412. Springer, Cham: (2014). 10.1007/978-3-319-10470-651 [DOI] [PubMed] [Google Scholar]
  • 10.Campbell KM, Fletcher PT: Efficient parallel transport in the group of diffeomorphisms via reduction to the lie algebra In: Cardoso MJ (ed.) GRAIL/MFCA/MICGen −2017. LNCS, vol. 10551, pp. 186–198. Springer, Cham: (2017). 10.1007/978-3-319-67675-317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Thompson EA, Geyer CJ: Fuzzy p-values in latent variable problems. Biometrika 94(1), 49–60 (2007) [Google Scholar]
  • 12.Erhardt E, Allen E, Wei Y, Eichele T, Calhoun V: SimTB, a simulation toolbox for fMRI data under a model of spatiotemporal separability. NeuroImage 59(4), 4160–4167 (2012) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ferreira R, Xavier J, Costeira JP, Barroso V: Newton method for Riemannian centroid computation in naturally reductive homogeneous spaces. In: ICASSP (2006) [Google Scholar]
  • 14.Müller-Oehring E, et al. : Influences of age, sex, and moderate alcohol drinking on the intrinsic functional architecture of adolescent brains. Cereb. Cortex 28(3), 1049–1063 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Craddock R, James G, Holtzheimer P, Hu X, Mayberg H: A whole brain fMRI atlas generated via spatially constrained spectral clustering. Hum. Brain Mapp 33(8), 1914–1928 (2012) [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES