Abstract
We present a continuous model for structural brain connectivity based on the Poisson point process. The model treats each streamline curve in a tractography as an observed event in connectome space, here the product space of the gray matter/white matter interfaces. We approximate the model parameter via kernel density estimation. To deal with the heavy computational burden, we develop a fast parameter estimation method by pre-computing associated Legendre products of the data, leveraging properties of the spherical heat kernel. We show how our approach can be used to assess the quality of cortical parcellations with respect to connectivity. We further present empirical results that suggest that “discrete” connectomes derived from our model have substantially higher test-retest reliability compared to standard methods. In this, the expanded form of this paper for journal publication, we also explore parcellation free analysis techniques that avoid the use of explicit partitions of the cortical surface altogether. We provide an analysis of sex effects on our proposed continuous representation, demonstrating the utility of this approach.
Keywords: Diffusion MRI, Connectivity Analysis, Point Process, Non-Parametric Estimation
Graphical Abstract
1. Introduction
In recent years the study of structural and functional brain connectivity has expanded rapidly. Following the rise of diffusion and functional MRI, connectomics has unlocked a wealth of knowledge to be explored. Almost synonymous with the connectome is the network-theory based representation of the brain [1]. In much of the recent literature the quantitative analysis of connectomes has focused on region-to-region connectivity. This paradigm equates physical brain regions with nodes in a graph, and uses observed structural measurements or functional correlations as a proxy for edge strengths between nodes.
Critical to this representation of connectivity is the delineation of brain regions, the parcellation. Multiple studies have shown that the choice of parcellation influences the graph statistics of both structural and functional networks [2, 3, 4, 5]. It remains an open question which of the proposed parcellations is the optimal representation, or even if such a parcellation exists [6].
It is thus useful to construct a more general framework for cortical connectivity, one in which any particular parcellation of the cortex may be expressed and its connectivity matrix derived, and one in which the variability of connectivity measures can be modeled and assessed statistically. It is also important that this framework allow comparisons between parcellations, and representations in this framework must be both analytically and computationally tractable. Since several brain parcellations at the macroscopic scale are plausible, a representation of connectivity that is independent of parcellation is particularly appealing.
In this paper, we develop such a general framework for a parcellation independent connectivity representation, building on the work of [7]. We describe a continuous point process model for the generation of observed tract1 (streamline) intersections with the cortical surface, from which we may recover a distribution of edge strengths for any pair of cortical regions, as measured by the inter-region tract count. Our model is an intensity function over the product space of the cortical surface with itself, assigning to every pair of points on the surface a connectivity density, opposed to the usual connectivity mass assigned in discrete models. We describe an efficient method to estimate the parameter of the model, as well as a method to recover the region-to-region edge strength. We then demonstrate the estimation of the model on a test-retest dataset. We provide reproducibility estimates for our method and the standard direct count method [8] for comparison. We also compare the representational power of common cortical parcellations with respect to a variety of measures.
In this, the extended journal form of the conference publication [9], we explore possible methods for direct analysis of the continuous connective object. We present an exemplar analysis of group differences in continuous summary measures (regressing a connectivity measure on sex, with age and ICV covariates), showing significant differences in regions also identified using parcellation-based representations. We use this analysis to demonstrate a practical analysis of the proposed model.
2. Continuous Connectivity Model
The key theoretical component of our work is the use of point process theory to describe estimated cortical tract projections. A point process is a random process where any realization consists of a collection of discrete points on a measurable space. The most basic of these processes is the Poisson process, in which events occur independently at a specific asymptotic intensity (rate) λ over the chosen domain [10]. λ completely characterizes each particular process, and is often defined as a non-negative function λ : Domain → ℝ+, which allows the process to vary in intensity by location. This is functionally similar to a probability density, except that realizations of the Poisson process can consist of zero, one, or many points, the points are independent by assumption, and λ need not integrate to one.
The expected count of any sub-region (subset) of the domain is its total intensity, the integral of λ over the sub-region. In this paper, our domain is the connectivity space of the cortex, the set of all pairs of points on the surface, and the events are estimated tract intersections with the cortical surface.
2.1. Model Definition and Properties
Let Ω be union of two disjoint subspaces each diffeomorphic to the 2-sphere representing the white matter boundaries in each hemisphere. Further consider the space Ω × Ω, which here represents all possible endpoint pairs for tracts that reach the white matter boundary. We denote the set of observed tract endpoint pairs as D. We treat the observation of such tracts as events generated by an inhomogeneous (symmetric) Poisson process on Ω × Ω; in our case, for every event (x, y) we have a symmetric event (y, x).
Assuming that each event is independent of all other events except for its symmetric event (i.e., each tract in D is recovered independently), we model connectivity as a intensity function λ : Ω × Ω → ℝ+, such that for any regions E1,E2 ⊂ Ω, the number of events is Poisson distributed with parameter
(1) |
Due to properties of the Poisson distribution, the expected number of tracts is exactly 𝒞(E1,E2). For any collection of regions , we can compute a weighted graph 𝒢(P, λ) by computing each 𝒞(Ei,Ej) for pairs (Ei,Ej) ∈ P × P. Each node in this graph is an element of P (a subset of Ω, a region of the cortical surface), and the edges between them are the rate at which we observe streamlines between the regions.
We call P a parcellation of Ω if ∪i Ei = Ω and ∩i Ei has measure zero ({Ei} is almost disjoint). If P is a parcellation, then 𝒢(P, λ) has Poisson rate parameters as edges. For any realization of endpoints, the count matrices that form traditional connectomes are independent draws from Poisson distributions with elements of 𝒢(P, λ) as parameters. The independence of the observations is conditional on λ and the fact that P is a parcellation, and does not imply an independence of the rates of the different regions—in other words, the observed counts are independent given the parameters, but this model does not speak to the generation of the parameters themselves.
It is immediately clear that λ is one such parcellation independent representation of connectivity that we desired in Section 1. λ is defined without reference to any particular parcellation; moreover, for any choice of parcellation P or even more general sets of subsets of Ω (e.g. overlapping sets) we can recover the parameters of a random network 𝒢(P, λ). While λ is a representation of cortical connectivity, we posit that λ itself is not a weighted graph as it no longer has a countable set of nodes. However, it does retain several graph-like constructions, namely a function analogous to weighted-degree (“strength”).
Define the marginal connectivity over a region E ⊂ Ω as M(·;E) : Ω → ℝ+ as:
(2) |
This is the aggregate connectivity to any point in region Ei from any point x–the pointwise intensity of observing a tract incident on x for which the other endpoint is contained in Ei. Further define
(3) |
This is the direct analogue of the sum of the edge weights for a given node x, i.e. the weighted degree. It is equal to the pointwise rate at which tracts are incident on x, connecting to any other point. If λ is continuous, then it can be shown that M(x) is also continuous.
2.2. Selection of a Parcellation
𝒢(P, λ) is a summary statistic for the intensity function λ, in that it summarizes information about the rate of tract observation into a finite set of scalars. It is clearly dependent on the parcellation P. Thus, given λ and two or more parcellations P1, P2, . . . , we would like to know which parcellation and associated summary statistic (graph) 𝒢(P, λ) best represents the underlying connectivity function. This requires a definition of the goodness of a representation; in practical terms, this means we need to choose a loss function in order to quantify how well 𝒢(P, λ) represents λ. There are at least two perspectives to consider, one in which 𝒢(P, λ) is viewed as an approximation to the function λ, and another in which 𝒢(P, λ) is viewed as an approximation to the parameter of the point process model.
L2 Approximation Error
Because each Pi covers Ω (and Pi×Pi = Ω×Ω), each 𝒢(P1, λ) can be viewed as a piece-wise function g : Ω × Ω → ℝ+, where such that x ∈ Ei and y ∈ Ej. In other words, g is the constant approximation to λ over every pair of regions. A natural measure of error is another form of Integrated Squared Error:
(4) |
This is analogous to squared loss (ℓ2-loss).
Likelihood
An alternative viewpoint leverages the point process model to measure the likelihood of the observed endpoint count in each region:
(5) |
This uses the Poisson assumption on the tract endpoints, that the number of endpoints in any region Ei ×Ej is Poisson distributed with rate parameter equal to the integral of the intensity function over the region 𝒞(Ei,Ej). Here, the independence assumption plays a critical role, allowing pairs of regions to be evaluated separately. Unfortunately this is biased toward parcellations with more, smaller regions, as the Poisson distribution has tied variance and mean in one parameter. A popular likelihood-based option that somewhat counterbalances this is Akaike’s Information Criterion (AIC) [11],
(6) |
AIC balances accuracy with parsimony, penalizing overly parameterized models— in our case, parcellations with too many regions.
2.3. Recovery of the Intensity Function
A sufficient statistic for Poisson process models is the intensity function λ(x, y). Estimates of this function, denoted λ̂, represent a non-trivial learning task, and have been the subject of much study in the spatial statistics community [12]. We choose to use non-parametric Kernel Density Estimation (KDE), and we present an efficient method for tuning the bandwidth parameter (up to a choice of desiderata). We first inflate each surface to a sphere and register them using a spherical registration (See Section 3); recovery of λ can be undertaken without group registration, but for later analysis such a registration is useful. We treat each hemisphere as disjoint from the other, allowing us to treat Ω×Ω as the product of spheres (S1 ∪ S2) × (S1 ∪ S2).
The unit normalized spherical heat kernel is a natural choice of kernel for 𝕊2. We use its truncated spherical harmonic representation [13], which is defined as the following for any two unit vectors p and q on the 2-sphere:
(7) |
Here, is the hth degree associated Legendre polynomial of order 0. Note that the non-zero order polynomials have coefficient zero due to the radial symmetry of the spherical heat kernel [13]. We extend this kernel to our context trivially: since Ω is actually the union of two spheres, S1 ∪ S2 (each the inflation of the anatomic hemispheres), we define Kσ(p, q) to be zero if (p, q) are not on both on S1 or both on S2. Since we are estimating a function on Ω × Ω, we use the product of two heat kernels as our KDE kernel κ. For any test point (p, q), the kernel value associated to an endpoint pair (xi, yi) is κσ((p, q)|(xi, yi)) = Kσ(xi, p)Kσ(yi, q). It is easy to show that ∫Ω×Ω Kσ(xi, p)Kσ(yi, q)dpdq = 1.
As is standard KDE practice, we evaluate
(8) |
on a discrete grid of Ω × Ω. The surface mesh itself provides a convenient choice for such a grid, though various remeshing schema can also be used. We rewrite the estimator into a more computationally efficient form for estimating σ in the next section.
2.3.1. Bandwidth Selection
The spherical heat kernel has a single shape parameter σ which corresponds to its bandwidth. The practitioner may either set this parameter manually or select a tuning criterion and then optimize the parameter for the given criterion. We provide an efficient method for the latter case; a discussion on the advantages and disadvantages of the former is provided in Section 5.2.
In general, automated tuning of kernel hyper-parameters requires the re-estimation of the density estimate λ̂ at every iteration; most procedures would, at each step, propose a parameter value, measure some criterion and/or its derivative with respect to the parameter value, and then propose a next parameter value. Assuming the chosen criterion requires numerical integration, this usually has of order O(NMS) operations, where N is the number of observations (tracts), M is the number of mesh points, and S is the number of tuning steps/sigma grid points. For our particular context we are able to reduce this to O(NM +MS) operations. This is achieved by rewriting our kernel in the following form:
(9) |
The right hand side is clearly separable into two functions, one of which is independent of the bandwidth and the other of which is independent of the data. This allows the memoization of part of the computation so that we only need to store the sum of the outer products of the harmonics. Evaluations of the kernel can then be done quickly computed for sequences of values of σ.
We then are left with the choice of loss function. Denoting the true intensity function λ, the estimated intensity λ̂, and the leave-one-out estimate λ̂i (leaving out observation i), Integrated Squared Error (ISE) is defined:
(10) |
(11) |
Hall and Marron [14] suggest tuning bandwidth parameters using estimated ISE. In practice, we find that replacing each leave-one-out estimate with its logarithm log λ̂i(xi, yi|σ) yields more consistent and stable results.
3. Preprocessing and Procedure
We demonstrate the use of our framework in two separate analyses. The first is a test-retest reliability analysis using a subset of the Consortium for Reliability and Reproducibility (CoRR) dataset [15]; we compare the reliability of our method versus the standard counting method. We also compare three parcellations using the criteria defined in Equations 4, 5, and 6. Our second analysis is a demonstration of groupwise tests of parcellation free measures. This was performed on data from the Human Connectome Project, and tests sex differences in marginal connectivity (Eq. 3), a derived statistic of the proposed representation. In the first analysis we use the self-tuning bandwidth parameter, while in the second we use a pre-specified bandwidth. This was done to avoid conditioning results on the bandwidth parameter itself (avoiding the possibility of a group-wise effect on bandwidth selection).
3.1. IPCAS (Test-Retest Reliability)
Our first dataset is comprised of 29 subjects from the Institute of Psychology, Chinese Academy of Sciences (IPCAS) sub-dataset of the larger Consortium for Reliability and Reproducibility (CoRR) dataset [15]. T1-weighted (T1w) and diffusion weighted (DWI) images were obtained on 3T Siemens TrioTim using an 8-channel head coil and 60 directions. Each subject was scanned twice, roughly two weeks apart. T1w acquisition parameters were as follows: flip angle: 7 degrees; TI: 1100 ms; TE: 2.51 ms; TR: 2530 ms; voxel: 1×1×1.3 mm3. DWI acquisition parameters were as follows: flip angle: 90 degrees; TE: 30; TR: 200; 60 directions; voxel: 1.8×1.8×2.5 mm3.
T1w images were processed with FreeSurfer’s [16] recon-all pipeline to obtain a triangle mesh of the gray-white matter boundary registered to a shared spherical space [17], as well as corresponding vertex labels per subject for three atlas-based cortical parcellations, the Destrieux atlas [18], the Desikan-Killiany (DK) atlas [19], and the Desikan-Killiany-Tourville (DKT31) atlas [20]. Probabilistic streamline tractography was conducted using the DWI in 2 mm isotropic MNI 152 space, using Dipy’s LocalTracking module [21], as well as its implementation of constrained spherical deconvolution (CSD) [22] with a harmonic order of 6. Tractography streamlines were seeded at 2 random locations in each voxel labeled as likely white matter via the segmentation maps generated by FMRIB’s Automated Segmentation Tool [23], also known as FSL FAST. Streamline tracking followed directions randomly in proportion to the orientation function at each sample point at 0.5 mm steps, starting bidirectionaly from each seed point. As per Dipy’s Anatomically Constrained Tractography (ACT) criteria [24], we retained only tracts longer than 5mm with endpoints in likely gray matter.
We estimate λ̂ (the continuous connectivity representation) on each scan using the self-tuning kernel given in Section 2, and the vertices of a equi-areal triangular mesh (geodesic grid) as our sample points. This, along with the convention of unit surface area instead of unit radius (thereby eliminating an unnecessary division), simplifies computation. We threshold each of the sample points for λ̂ at 10−5, which is approximately one half of one unit tract density. We then numerically integrate λ̂ to compute regional connectivity graphs as in Eq. 1, and compare these to traditional count-based connectivity graphs. As a measure of reliability we provide the mean Intraclass Correlation (ICC) score [25] computed both with and without entries that are zero for all subjects, for both count and integrated-intensity connectivity matrices, for each element (each edge of the connectome). We also compute three measures of parcellation representation accuracy, namely ISE, Negative Log Likelihood, and AIC scores.
3.2. HCP (Demonstration Analysis of Marginal Connectivity)
Our second dataset is comprised of 731 subjects from the Human Connectome Project2 S900 release [26]. We used the minimally preprocessed T1-weighted (T1w) and diffusion weighted (DWI) images rigidly aligned to MNI 152 space. T1w acquisition parameters were as follows: flip angle: 8 degrees; TI: 1000 ms; TE: 2.14 ms; TR: 2400 ms; voxel: 0.7×0.7×0.7 mm3. DWI acquisition parameters were as follows: flip angle: 78 degrees; TE: 89.5 ms; TR: 5520 ms; 90 directions at each b-value; b-values: 1000, 2000, and 3000 s/mm2; voxel: 1.25×1.25×1.25 mm3.
The preprocessing of these images included gradient nonlinearity correction (T1w, DWI), motion correction (DWI), eddy current correction (DWI), and linear alignment (T1w, DWI). We use the HCP Pipeline (version 3.13.1) FreeSurfer [17] protocol to run an optimized version of the recon-all pipeline in order to extract registered surfaces and region labels. We remesh subjects in the FreeSurfer shared spherical space in order to construct registered meshes with a dense correspondence between the vertices [27]. Tractography was conducted using the DWI in 1.25mm isotropic MNI 152 space. Probabilistic streamline tractography was again performed using Dipy’s implementation of constrained spherical deconvolution (CSD) [22], here using a harmonic order of 8. The same seeding and tracking procedures were used as the above IPCAS dataset processing, as well as pruning short tracts, or those that did not end in likely gray matter (again as specified by Dipy’s ACT).
We again estimate λ̂ (the continuous connectivity representation) on each scan. In the HCP data, we use a fixed kernel bandwidth of σ = 0.005. While using a self-tuning kernel on each subject does not invalidate the proceeding analysis, it may be advantageous to instead fix a kernel bandwidth across the group (See Section 5). We then numerically integrate each λ̂ function to form the marginal connectivity defined in Eq. 3, sampled at every mesh vertex. We then conduct a linear regression on the marginal connectivity values at each vertex, using HCP subject data for Sex, with covariates of Age (in years), and intercranial volume. We exclude any vertex identified as part of the medial wall in more than 95% of the subjects (~ 700 subjects), since labels are not fixed in the registered space. We correct the parametric p-values for each regression coefficient using the the Benjamini–Hochberg [28] False Discovery Rate correction for multiple comparisons.
4. Results
4.1. IPCAS Test–Retest
Table 1 and Figure 1 show that networks derived from the continuous connectivity estimates are preferable to regular count networks with respect to reliability as measured by ICC in the IPCAS dataset. Table 1 shows surprisingly low mean ICC scores for regular count matrices. This may be because ICC normalizes each measure by its s2 statistic, meaning that entries in the adjacency matrices that should be zero but that are subject to a small amount of noise—a few erroneous tracts—have very low ICC. Our method in effect smooths tract endpoints into a density; end points near the region boundaries are in effect shared with the adjacent regions. Thus, even without thresholding we dampen noise effects as measured by ICC. With thresholding, our method’s performance is further improved, handily beating the counting method with respect to ICC score. It is important to note that for many graph statistics, changing graph topology can greatly affect the measured value [3]. While it is important to have consistent non-zero measurements, the difference between zero and small but non-zero edge values in the graph context is also non-trivial [29]. The consistency of zero-valued measurements is thus very important in connectomics.
Table 1.
Atlas | DK | Destrieux | DKT31 |
---|---|---|---|
Number of Regions | 68 | 148 | 62 |
Count ICC | 0.2093 | 0.1722 | 0.2266 |
Intensity ICC (Full) | 0.4868 | 0.4535 | 0.4388 |
Intensity ICC (w/Threshold) | 0.5613 | 0.6481 | 0.4645 |
Table 2 suggests that all three measures, while clearly different, are consistent in their parcellation selection at least with respect to these three parcellations. It is somewhat surprising that the Destrieux atlas has quite low likelihood criteria, but this may be due to the (quadratically) larger number of region pairs. It should be noted that these results must be conditioned on the use of a probabilistic CSD tractography model. Different models may lead to different intensity functions and resulting matrices. The biases and merits the different models and methods (e.g. gray matter dilation for fiber counting vs streamline projection) remain important open questions.
Table 2.
Type | DK | Destrieux | DKT31 |
---|---|---|---|
ISE | 1.85 × 10−5 | 2.10 × 10−5 | 2.13 × 10−5 |
Negative LogLik | 85000 | 355000 | 88000 |
AIC Score | 174000 | 733000 | 185000 |
4.2. HCP Sex Differences
Figure 3 shows a map of the areas in which significant differences of marginal connectivity were discovered between males and females in the HCP dataset, given the pointwise correspondence of the surfaces and under the fixed σ parameter. The marginal connectivity M(x) defined in Eq. 3 is the rate at which we observe tracts which have one endpoint at x as estimated by λ̂. Thus, estimated differences in marginal connectivity have a simple interpretation: in regions of significant difference, observed tracts are more likely to be incident on those regions in one group over the other. That is, the relative tract density per unit area is higher.
There is a clear region of difference in the left pre-frontal cortex. We also identify another significant region as the left anterior cingulate. Outside of these regions, there are areas suggestive of significant changes, but no large contiguous groups of significant vertices. Even though the FDR correction is conservative in its adjustment of positively correlated p-values, we encourage researchers to take caution in the more scattered significant regions.
5. Discussion
In this paper we have proposed a continuous representation for cortical connectivity. While we believe our particular formulation to be novel, graph-like objects defined over continuums are by no means new. In particular, the continuum limits of graphs [30], also known as graphons, appear similar to the proposed framework, and have been used in the analysis of large random networks. Point process forms of graphons also exist [31], though in both these and the more general context the analysis is usually conducted with the assumption of exchangeability (i.e. the random function is invariant under measure preserving transformations of the domain). Our proposed structure is precisely the opposite, imposing the surface metric (or its proxy, a metric on S2); it is unreasonable to assume that phenomena of neurological interest on the cortical surface are exchangeable. Further, due to the surface metric the proposed object is not the graph limit of a parcellation-based graph.
Several dense connectome representations have been proposed [32, 26]. These representations use parcellations on the voxel or mesh-face level. Sampled continuous connectivity on mesh vertices and dense connectomes [26] have, at surface level, the same representation. They are both large arrays of connective values, with each element representing the connectivity to a very small physical feature; in the representation proposed here, however, each of these scalar values is a point density of an underlying function defined over the continuum of the cortical surface, with an imposed smoothness constraint. The former requires integration over an area to have matching units to traditional connectomes (tracts), while the latter is exactly that count, not a count per unit area. The former is equipped with a surface topology that is separate from the network topology; the latter abstracts away this surface. In the dense connectome case a joint analysis could surely be undertaken with the mesh vertices; indeed, we believe our proposed framework could be construed as one such analysis.
Another alternative to atlas-based parcellations are stochastic parcellations [33, 34], such as those generated by Poisson disk-sampling. While the resulting representations may have a large number regions approaching that of the dense connectomes, the regions are usually sampled at the time of processing, and may not have correspondence between studies. The normalization of network measures across these random network configurations is still possible [35] using a resampling scheme. This still ignores the underlying surface, and does not provide insight into the nature of parcellation differences, but appears empirically useful.
The proposed framework can also be put in the context of tractographic probability spaces. Ours is a reduction of a tract to its endpoints since traditional connectivity analyses are mostly concerted with the gray matter intersections, but more general spaces have been proposed. In particular O’Donnell and Westin [36] and the later Wassermann et al. [37] both propose implicit spaces via an inner product definition (a kernel trick). More recently, Parisot et al. [38] and Gallardo et al. [39] both propose implicit spaces based on similarity measures (scalar functions on Ω × Ω), both for the purpose of connectivity based parcellation. In their contexts the connective profiles (or transformations thereof) generate the similarity measures, whereas we seek to define connectivity as a (non-negative) scalar function.
5.1. HCP Analysis
The objective of our analysis of the HCP data is the demonstration of a simple analysis based on our proposed parcellation free model. The use of mass univariate regression with FDR correction is a compromise; in the best possible case we would prefer a spatial regression. However, accounting for spatial covariance is a non-trivial task (especially considering the manifold under consideration is not truly S2), and so we leave this challenge for later work. Though such a method is desirable, the results recovered from the naïve method remain promising.
The identification of changes in the left pre-frontal cortex and anterior cingulate is encouraging for simple analyses such as these. The left pre-frontal cortex was identified in Ingalhalikar et al. [40] as well as Duarte-Carvajalino et al. [41] as having significant connectivity differences between the sexes. The anterior cingulate was also identified in Duarte-Carvajalino et al. as a region of significant change. Both of these studies used other, more complex measures of network changes, and of course used parcellation based network representations. The qualitative result remains similar, that females have “higher connectivity” in left pre-frontal cortex, and in the second case, the anterior cingulate. It should be noted, however, that these regions are also quite large, and the effects documented need not be co-localized.
While in network analysis the degree statistic taken locally is somewhat heavy-handed, here its simplicity makes it easily interpretable. The lack of pre-defined parcels also removes the need for areal regularization. In a traditional connectome, it might be advisable to regularize by region size in order to make the units into tracts per unit area. This is implicit under the proposed representation.
We fail to identify other large contiguous regions outside of the left prefrontal cortex; this may, of course, be related to the underlying signal, but there are several confounding factors that should be considered. The identified small areas of significance may be artifacts of pre-processing or tractography biases. In particular, it may be the case that one sex or another is more easily registered (or that a correlated trait, such as head-size, adds or detracts from such a registration). A similar case may be made for tractography. It may also be the case that the use of a stationary kernel (i.e. a kernel that does not change by general cortical region) biases our analyses towards regions with matching scale as our chosen bandwidth.
Sex differences in the brain have been studied in a number of other papers, especially with respect to connectivity. Alongside Ingalhalikar et al. [40] and Duarte-Carvajalino et al. [41], Rymen et al. [42] also measure sex differences in structural connectomes, using a proxy measure of creativity to explore phenotypic correlations with estimated connectivity. Of the three, only Rymen et al. directly use the degree statistic (in their paper referred to as Connectivity, or Sweighted), while both of the other studies use more complex topological measures.
As with almost all group difference statistical analyses, we do not make the assertion that all males or females exhibit the characteristics we found to be significantly different between the groups. These differences are found on average, are necessarily a generalization and not the rule, and then only given our model assumptions and procedural choices. Some of these differences are probably due to latent factors such as the aforementioned registration bias.
5.2. Kernel Bandwidth Selection
ISE is a global measure, and bandwidth parameters chosen using this criterion will thus be set globally. Other forms of bandwidth estimation are actively being researched in the literature [43], including local methods which allow σ to vary as function of the location. It is not clear as of yet which method or loss function serves best the overall desiderata of brain connectivity analysis.
An alternative to the self-tuned kernel is one with specified bandwidth, as demonstrated in the previous analysis. While clearly any subsequent results must be conditioned on this choice, in our opinion this option simplifies interpretation across multiple subjects, and opens up a wealth of theoretical results. Furthermore, it has been suggested, albeit on Euclidean spaces, that the estimation of a smoothed form of the underlying density still retains its qualitative value.
A general kernel estimate of some general density p with the form
can be shown to be asymptotically consistent, unbiased, and has known variance in the limit as h → 0 (see [44]). Here Z(h) chosen so that λ̂ integrates to 1. More recent work suggests the use of fixed h may also be of use [45], i.e. eschewing the asymptotic result in favor of estimating a mollified (smoothed or blurred) form of λ. Fixing h, the estimate can be rewritten as the convolution of the empirical data function (a sum of delta functions) with the kernel Kh. This work was performed on ℝn for a density P estimated with a slightly different kernel than the one given above, but we may transport the general concept by using spherical convolution [46]. In more general settings outside of density estimation, the value of mollified signals has long been accepted [47].
6. Conclusion
We have presented a general framework for structural brain connectivity. This framework provides a representation for cortical connectivity that is independent of the choice of regions, and thus may be used to compare the accuracy of a given set of regions’ connectivity matrix. We provide one possible estimation method for this representation, leveraging spherical harmonics for fast parameter estimation. We have demonstrated this framework’s viability, as well as provided a preliminary comparison of regions using several measures of accuracy. We further have provided an example analysis of the continuous connective object.
The results presented here lead us to conjecture that our connectome estimates are more reliable compared to standard fiber counting, though we stress that a much larger study is required for strong conclusions to be made. However, we believe that it is important to explore connectomics beyond the confines of the graph-theoretic abstraction. This particular instantiation of continuous connectivity is fairly general, and could be made more nuanced with a variety of existing statistical technologies, including regressions accounting for spatial auto-correlations, marked processes, and inter-subject hierarchical models of tract generation.
Highlights.
Generalizes traditional connectome count matrices to spatial process of tracts
Provides fast estimator, with efficient hyper parameter tuning
Provides results showing improved reliability (as measured by ICC score)
Includes demonstration analysis using analogous “degree” function
Significant differences in example analysis between sexes
Acknowledgments
This work was supported by NIH Grant U54 EB020403, as well as the NSF Graduate Research Fellowships Program and the Rose Hills Fellowship at the University of Southern California. The authors would like to thank the original MICCAI reviewers, the MICCAI community at-large, as well as Greg Ver Steeg for multiple helpful comments.
Appendix: Implementation Notes
We optimize σ using grid search using a large linear grid. Using Equation 9 makes the marginal time cost of additional grid points low, so we can densely sample any reasonable interval quickly (σ is one dimensional). The boundary and spacing of the interval can be determined heuristically, and overcautious estimates are, again, relatively cheap.
Both the leave-one-out estimation and the quadrature can be done in parallel. In practice we find that caching calls to exp and the Legendre polynomials (i.e. computing a look-up table beforehand, or constructing one at run-time), reduces the computational costs of bandwidth estimation. The truncation error introduced by using a lookup table is minimal with a sufficiently dense table; since this is a one-time cost and can be completely computed in parallel, we choose a table optimal for our physical memory constraints. Note that we will only be computing exp and the Legendre polynomials for either fixed numbers known a priori, fixed grids, or (in the latter case) arguments in on the interval [−1, 1]. After σ has been computed or specified by the user, the estimation of λ̂ is relatively fast, as it is a sub-problem of the bandwidth estimation.
Because these operations are taking place on a mesh and significant mesh pre-processing steps are undertaken (e.g. re-meshing), we implemented our method in C++ using a half-edge structure. We take full advantage of compile time optimization. For repeated loops over constant sets (e.g. the spherical harmonic coefficients), loop unrolling and the appropriate use of const declared variables in tandem with standard optimizations speed up processing by an order of magnitude. (This is usually at the cost of code size, which is, practically speaking, almost free.)
Footnotes
It is critical to distinguish between white matter fibers (fascicles) and observed “tracts.” Here, “tracts” denotes the 3d-curves recovered from Diffusion Weighted Imaging via tractography algorithms.
Data were provided by the Human Connectome Project, WU-Minn Consortium (PIs: David Van Essen & Kamil Ugurbil) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Sporns O, Tononi G, Edelman GM. Theoretical neuroanatomy: relating anatomical and functional connectivity in graphs and cortical connection matrices. Cerebral Cortex. 2000;10(2):127–141. doi: 10.1093/cercor/10.2.127. [DOI] [PubMed] [Google Scholar]
- 2.Van Wijk BC, Stam CJ, Daffertshofer A. Comparing brain networks of different size and connectivity density using graph theory. PloS One. 2010;5(10):e13701. doi: 10.1371/journal.pone.0013701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zalesky A, et al. Whole-brain anatomical networks: does the choice of nodes matter? NeuroImage. 2010;50(3):970–983. doi: 10.1016/j.neuroimage.2009.12.027. [DOI] [PubMed] [Google Scholar]
- 4.Satterthwaite TD, Davatzikos C. Towards an individualized delineation of functional neuroanatomy. Neuron. 2015;87(3):471–473. doi: 10.1016/j.neuron.2015.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wang J, et al. Parcellation-dependent small-world brain functional networks: A resting-state fMRI study. Human Brain Mapping. 2009;30(5):1511–1523. doi: 10.1002/hbm.20623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.de Reus MA, Van den Heuvel MP. The parcellation-based connectome: limitations and extensions. NeuroImage. 2013;80:397–404. doi: 10.1016/j.neuroimage.2013.03.053. [DOI] [PubMed] [Google Scholar]
- 7.Gutman B, Leonardo C, Jahanshad N, Hibar D, Eschenburg K, Nir T, Villalon J, Thompson P. Registering cortical surfaces based on wholebrain structural connectivity and continuous connectivity analysis. International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer; 2014. pp. 161–168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jahanshad N, Rajagopalan P, Hua X, Hibar DP, Nir TM, Toga AW, Jack CR, Saykin AJ, Green RC, Weiner MW, et al. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity. Proceedings of the National Academy of Sciences. 2013;110(12):4768–4773. doi: 10.1073/pnas.1216206110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Moyer D, Gutman BA, Faskowitz J, Jahanshad N, Thompson PM. A continuous model of cortical connectivity. International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer; 2016. pp. 157–165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Moller J, Waagepetersen RP. Statistical inference and simulation for spatial point processes. CRC Press; 2003. [Google Scholar]
- 11.Akaike H. International Encyclopedia of Statistical Science. Springer; 2011. Akaike’s information criterion; pp. 25–25. [Google Scholar]
- 12.Diggle P. A kernel method for smoothing point process data. Applied Statistics. 1985:138–147. [Google Scholar]
- 13.Chung MK. Heat kernel smoothing on unit sphere. Biomedical Imaging: Nano to Macro, 2006. 3rd IEEE International Symposium on; IEEE; 2006. pp. 992–995. [Google Scholar]
- 14.Hall P, Marron JS. Extent to which least-squares cross-validation minimises integrated square error in nonparametric density estimation. Probability Theory and Related Fields. 1987;74(4):567–581. [Google Scholar]
- 15.Zuo X-N, et al. An open science resource for establishing reliability and reproducibility in functional connectomics. Scientific Data. :1. doi: 10.1038/sdata.2014.49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fischl B. Freesurfer. NeuroImage. 2012;2(62):774–781. doi: 10.1016/j.neuroimage.2012.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Fischl B, et al. High-resolution intersubject averaging and a coordinate system for the cortical surface. Human Brain Mapping. 1999;8(4):272–284. doi: 10.1002/(SICI)1097-0193(1999)8:4<272::AID-HBM10>3.0.CO;2-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Fischl B, et al. Automatically parcellating the human cerebral cortex. Cerebral Cortex. 2004;14(1):11–22. doi: 10.1093/cercor/bhg087. [DOI] [PubMed] [Google Scholar]
- 19.Desikan RS, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage. 2006;31(3):968–980. doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
- 20.Klein A, Tourville J, et al. 101 labeled brain images and a consistent human cortical labeling protocol. Front Neurosci. 2012;6(171):10–3389. doi: 10.3389/fnins.2012.00171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Garyfallidis E, et al. Dipy, a library for the analysis of diffusion MRI data. Front Neuroinform. 8(8) [Google Scholar]
- 22.Tournier JD, Yeh CH, Calamante F, Cho KH, Connelly A, Lin CP. Resolving crossing fibres using constrained spherical deconvolution: validation using diffusion-weighted imaging phantom data. NeuroImage. 2008;42(2):617–625. doi: 10.1016/j.neuroimage.2008.05.002. [DOI] [PubMed] [Google Scholar]
- 23.Zhang Y, Brady M, Smith S. Segmentation of brain MR images through a hidden markov random field model and the expectation-maximization algorithm, Medical Imaging. IEEE Transactions on. 2001;20(1):45–57. doi: 10.1109/42.906424. [DOI] [PubMed] [Google Scholar]
- 24.Smith RE, et al. Anatomically-constrained tractography: improved diffusion MRI streamlines tractography through effective use of anatomical information. NeuroImage. 2012;62(3):1924–1938. doi: 10.1016/j.neuroimage.2012.06.005. [DOI] [PubMed] [Google Scholar]
- 25.Portney LG, Watkins MP. Statistical measures of reliability. Foundations of Clinical Research: Applications to Practice. 2000;2:557–586. [Google Scholar]
- 26.Van Essen DC, Smith SM, Barch DM, Behrens TE, Yacoub E, Ugurbil K, et al. Consortium WH. The WU-Minn Human Connectome Project: an overview. NeuroImage. 2013;80:62–79. doi: 10.1016/j.neuroimage.2013.05.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, Xu J, Jbabdi S, Webster M, Polimeni JR, et al. The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage. 2013;80:105–124. doi: 10.1016/j.neuroimage.2013.04.127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological) 1995:289–300. [Google Scholar]
- 29.Zalesky A, Fornito A, Cocchi L, Gollo LL, van den Heuvel MP, Breakspear M. Connectome sensitivity or specificity: which is more important? NeuroImage. 2016;142:407–420. doi: 10.1016/j.neuroimage.2016.06.035. [DOI] [PubMed] [Google Scholar]
- 30.Lovász L. Large networks and graph limits. 60 [Google Scholar]
- 31.Caron F, Fox EB. Sparse graphs using exchangeable random measures. doi: 10.1111/rssb.12233. arXiv preprint arXiv:1401.1137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Baldassano C, Iordan MC, Beck DM, Fei-Fei L. Discovering voxel-level functional connectivity between cortical regions. 2nd NIPS Workshop on Machine Learning and Interpretation in Neuroimaging; 2012. [Google Scholar]
- 33.Hagmann P, Kurant M, Gigandet X, Thiran P, Wedeen VJ, Meuli R, Thiran JP. Mapping human whole-brain structural networks with diffusion MRI. PloS One. 2007;2(7):e597. doi: 10.1371/journal.pone.0000597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ball G, Boardman JP, Aljabar P, Pandit A, Arichi T, Merchant N, Rueckert D, Edwards AD, Counsell SJ. The influence of preterm birth on the developing thalamocortical connectome. Cortex. 2013;49(6):1711–1721. doi: 10.1016/j.cortex.2012.07.006. [DOI] [PubMed] [Google Scholar]
- 35.Schirmer M, Ball G, Counsell SJ, Edwards AD, Rueckert D, Hajnal JV, Aljabar P. Normalisation of neonatal brain network measures using stochastic approaches. International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer; 2013. pp. 574–581. [DOI] [PubMed] [Google Scholar]
- 36.O’Donnell LJ, Westin CF. Automatic tractography segmentation using a high-dimensional white matter atlas. IEEE transactions on medical imaging. 2007;26(11):1562–1575. doi: 10.1109/TMI.2007.906785. [DOI] [PubMed] [Google Scholar]
- 37.Wassermann D, Bloy L, Kanterakis E, Verma R, Deriche R. Unsupervised white matter fiber clustering and tract probability map generation: Applications of a gaussian process framework for white matter fibers. NeuroImage. 2010;51(1):228–241. doi: 10.1016/j.neuroimage.2010.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Parisot S, Arslan S, Passerat-Palmbach J, Wells WM, Rueckert D. Group-wise parcellation of the cortex through multi-scale spectral clustering. NeuroImage. 2016;136:68–83. doi: 10.1016/j.neuroimage.2016.05.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gallardo G, Wells W, Deriche R, Wassermann D. Groupwise structural parcellation of the whole cortex: A logistic random effects model based approach. NeuroImage. doi: 10.1016/j.neuroimage.2017.01.070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ingalhalikar M, Smith A, Parker D, Satterthwaite TD, Elliott MA, Ruparel K, Hakonarson H, Gur RE, Gur RC, Verma R. Sex differences in the structural connectome of the human brain. Proceedings of the National Academy of Sciences. 2014;111(2):823–828. doi: 10.1073/pnas.1316909110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Duarte-Carvajalino JM, Jahanshad N, Lenglet C, McMahon KL, de Zubicaray GI, Martin NG, Wright MJ, Thompson PM, Sapiro G. Hierarchical topological network analysis of anatomical human brain connectivity and differences related to sex and kinship. NeuroImage. 2012;59(4):3784–3804. doi: 10.1016/j.neuroimage.2011.10.096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ryman SG, van den Heuvel MP, Yeo RA, Caprihan A, Carrasco J, Vakhtin AA, Flores RA, Wertz C, Jung RE. Sex differences in the relationship between white matter connectivity and creativity. NeuroImage. 2014;101:380–389. doi: 10.1016/j.neuroimage.2014.07.027. [DOI] [PubMed] [Google Scholar]
- 43.Heidenreich NB, Schindler A, Sperlich S. Bandwidth selection for kernel density estimation: a review of fully automatic selectors. AStA Advances in Statistical Analysis. 2013;97(4):403–433. [Google Scholar]
- 44.Hall P, Watson G, Cabrera J. Kernel density estimation with spherical data. Biometrika. 1987;74(4):751–762. [Google Scholar]
- 45.Rinaldo A, Wasserman L. Generalized density clustering. The Annals of Statistics. 2010:2678–2722. [Google Scholar]
- 46.Driscoll JR, Healy DM. Computing Fourier transforms and convolutions on the 2-sphere. Advances in Applied Mathematics. 1994;15(2):202–250. [Google Scholar]
- 47.Lindeberg T. Scale-space theory in computer vision. Vol. 256. Springer Science & Business Media; 2013. [Google Scholar]