Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Sep 4.
Published in final edited form as: Med Image Comput Comput Assist Interv. 2013;16(0 2):534–541. doi: 10.1007/978-3-642-40763-5_66

Studying Cerebral Vasculature Using Structure Proximity and Graph Kernels

Roland Kwitt 1, Danielle Pace 1, Marc Niethammer 2, Stephen Aylward 1
PMCID: PMC4154378  NIHMSID: NIHMS623542  PMID: 24579182

Abstract

An approach to study population differences in cerebral vasculature is proposed. This is done by 1) extending the concept of encoding cerebral blood vessel networks as spatial graphs and 2) quantifying graph similarity in a kernel-based discriminant classifier setup. We argue that augmenting graph vertices with information about their proximity to selected brain structures adds discriminative information and consequently leads to a more expressive encoding. Using graph-kernels then allows us to quantify graph similarity in a principled way. To demonstrate our approach, we assess the hypothesis that gender differences manifest as variations in the architecture of cerebral blood vessels, an observation that previously had only been tested and confirmed for the Circle of Willis. Our results strongly support this hypothesis, i.e, we can demonstrate non-trivial, statistically significant deviations from random gender classification in a cross-validation setup on 40 healthy patients.

1 Motivation

The human circulatory system, with blood vessels transporting nutrients and waste from one location to another, appears to lend itself naturally to a graph-based representation. A question that immediately arises is whether we can leverage that graph structure to study population differences that manifest as topological changes. In this work, we focus on the cerebrovascular network. Several studies indicate an association between selected mental diseases and irregular cerebral blood vessel topology. Preliminary to our study of such associations, herein we assess the hypothesis that gender-associated differences exist in the architecture of cerebral vessel networks. A recent study [2] demonstrates gender-related differences in local geometry, e.g., radius; however, evidence in support of general architectural differences across genders has only been reported for the Circle of Willis [7]. Beyond serving as a preliminary study, identifying gender-related differences could also help to explain why certain vascular pathologies, such as aneurysms, have a higher incidence rate in women than in men [7].

Although it is straight-forward to compare individuals or populations in terms of local vessel network properties, e.g., tortuosity measures or radius estimates [2], a general framework for comparing and characterizing vascular topology at a global scale does not exist. Furthermore, vascular graph comparisons are typically performed using summary statistics in order to circumvent computational complexity issues inherent to many graph similarity metrics. Additionally, while progress has been made in representing cerebral vasculature as graphs [4] and even as graphs that span intracranial space [1], no prior work, to our knowledge, has augmented vascular graph encodings with brain structure information, e.g., hippocampus, thalamic nuclei and so forth.

Our contribution is two-fold: First, we augment the spatial graph representation of [1] with brain structure information at each vertex to enhance expressiveness. Second, we draw upon recent advances in machine learning with graph-structured data [8] to quantify gender-related differences in cerebrovascular architecture using a graph-kernel based discriminant classifier.

2 Functional Graphs

In this section, we introduce the concept of functional graphs, an extension of spatial graphs [1] with vertex labels that encode information about proximal brain structures. Our approach has the following requirements: For a population of N individuals, we require MRA and T1-weighted MRI images, as well as extracted cerebral vasculature in a centerline + radius representation. Fig. 1 shows some example images from the dataset we use throughout this work (see Sect. 4 for details). To study architectural differences of blood vessel networks, effects arising from translation, rotation, etc. need to be removed. This is typically done by aligning all images with respect to a common brain atlas and applying the corresponding transform(s) on the blood vessels to map them into the atlas space. We further require that the atlas has an associated voxel-wise labeling of brain structures (e.g., see [6]). For clarity of presentation, we briefly recapitulate spatial graph construction [1] and then discuss our contribution.

Fig. 1.

Fig. 1

Input data: example (mid-transversal) slices of skull-stripped T1-weighted MRI and MRA as well as extracted vasculature from different individuals. The right-hand side shows the atlas of brain structures [6] that we use as a reference space.

From Centerlines to Spatial Graphs

The representation of blood vessels as centerlines is difficult to use directly for comparing individuals, primarily due to extensive, naturally occurring spatial and topological variations among individuals. To study the global blood vessel network, we favor a more compact and less noisy representation.

To compute a spatial graph, the brain is parcellated into regions that are equally likely to contain a vessel; the vessel paths across those regions thereby form a compact graph representation of the individual, and those graphs can be summarized and compared within and across individuals and populations.

Regarding parcellation, first, an atlas of vessel density is constructed for a population of individuals. The atlas is established by computing an Euclidean distance map for each individual's vascular network3. Averaging the (inverted) distance maps {Di}i=1N, followed by a normalization of distances above the q-th percentile (e.g., q = 80) to [0, 1] (denoted by ϕq in Fig. 2), allows the resulting image D¯ to be interpreted as an atlas of spatial blood vessel density.

Fig. 2.

Fig. 2

Construction of the vascular density atlas, i.e., the spatial density image D¯ of blood-vessels, from N (inverted) distance maps D1,,DN . The centroidal Voronoi tessellation is then driven by the density information of the [0, 1]-normalized image D¯.

Second, [1] proposes a centroidal Voronoi tessellation (CVT) of the density atlas to identify regions of approximately equal probability. Specifically, CVT is applied to D¯ to generate C cells C1,,CC. The cells are computed by running Lloyd's algorithm on a subset of all voxels, selected with probability proportional to the vessel density in D¯. The algorithm is initialized by C randomly chosen cell centers which are then iteratively refined. This computation scheme results in a higher number of cells in regions of high spatial density and essentially approximates an equiprobable (in terms of expected probability of observing a blood vessel in a cell) tessellation of D¯, illustrated in Fig. 2.

The graph representation of an individual's blood vessel network is then encoded in an adjacency matrix A = [aij]C×C, where aij = 1 signifies that a blood vessel traverses from cell Ci to Cj cell undirected. This encoding leads to an (unlabeled) graph G=(V,E) with vertex set V={v1,,vC} and edge set E={(vi,vj):aij=1}. While G=(V,E) captures blood vessel topology, no information about brain anatomy is included. We hypothesize that adding information regarding adjacent brain structures will produce graphs with more discriminative power.

Graph Augmentation by Vertex Labels

A label function γ:VE0{l1,,lL} assigns one of L discrete labels from an alphabet E0 to a vertex vV. Several variations of E0 and γ are possible. In fact, the identifier iN of a CVT cell Ci is already a valid label, since it uniquely identifies the spatial position of the graph vertices in atlas space. Another strategy is to use the vertex degree, i.e., the number of incident edges, as a vertex label. We refer to those two strategies as the default labeling strategies (a) and (b) with E0a{1,,C}, E0b{di:dideg(vi)} and corresponding label functions γa(vi) = i, b(vi) = deg(vi) = di.

To augment the graph with more informative labels, we introduce the concept of label transfer. Given a set of S segmented brain structures S0,,SS1, represented by a voxel-wise label map, we propose to transfer those labels to the graph vertices in a way that abstractly encodes the function of a vessel in the brain, i.e., the structures it may influence. We refer to this extension of spatial graphs, i.e., augmentation with labels encoding information about proximal brain structures, as functional graphs.

Formally, structure Su={i:ξ(xi)=su} is given as the index set of voxels xi with segmentation label su from the brain atlas, where ξ denotes the mapping from voxels to segmentation labels. One possible strategy for label transfer, i.e., the assignment of a discrete label li ∈ {s0, . . . , sS−1}to vertex vi, is to compute the intersection of Ci with each segmentation region Sj and tally the occurrences of the corresponding segmentation labels. Given that hinCiSn denotes the cardinality of the index set for the intersection of Ci with Sn, we can define a label histogram for vertex vi as hi := [hi0, . . . , hiS−1]T. This label histogram can then be used extract one discrete label γ(vi) = su, e.g., by means of max-pooling u = arg maxs his. The strategy has one drawback, though. Since the label map is sparse in the sense that voxels which are not part of segmented structures are labeled as background s0, max-pooling can lead to a significant number of s0-labeled vertices and, in further consequence, to undesirable ambiguities. For instance, edges in two graphs with start/end vertex labeled as s0 are topologically equivalent, presumably leading to a loss in expressiveness. To avoid this effect, we propose to track vertices labeled as s0 and assign a new, unique label biB{bj}j=1B, B{s0,,sS1}= each time. We refer to label transfer with / without unique background labels as labeling strategies (c) and (d), resp., with E0c{s0,,sS1}, E0dE0c\{s0}B and label functions γ(vi) = s (u defined as above), and γd(vi) = {su, if u ≠ 0, bj, j → j + 1, otherwise}.

3 Measuring Graph Similarity via Graph Kernels

Quantifying similarity between graphs is an inherently difficult problem due to the fact that no polynomial-time algorithm is known to compare graphs in a strict graph-isomorphism sense. Other strategies, such as comparing graphs by means of topological descriptors, however, are often too restrictive or ignore important aspects. A promising alternative is to rely on graph-kernels, a concept that has recently gained considerable interest in the machine learning community, not least because kernels are the key ingredient of many learning algorithms.

The Weisfeiler-Lehman (WL) Subtree Kernel

In this work, we exploit the Weisfeiler-Lehman subtree kernel (WLS) [8], a recently introduced graph-kernel with an e cient computation scheme that relies on comparing subtree patterns. A subtree is a subgraph of a graph with a designated root and no cycles. The maximum distance between the root and any other vertex in the subtree is referred to as the subtree height h. A subtree pattern extends the notion of a subtree by allowing equal vertices. The key concept of the WLS kernel is to count all pairs of matching subtree patterns, rooted at the vertices of both graphs. For graphs with V=n, this can be done in O(hn).

Assuming graphs with discrete vertex labels, the algorithm proceeds as follows: For each vertex v in both graphs, a signature is created by 1) collecting the vertex labels {γ(u)uN(v)} of all neighbors of v, 2) sorting the resulting multiset and 3) adding the sorted multiset to the existing label (e.g., as a string). A new, compressed label is then assigned to all vertices, ensuring that vertices with the same signature get the same compressed label. This procedure is iterated until we reach the desired subtree height h. After h iterations, we tally the common multisets in both graphs. Fig. 4 illustrates this concept on the toy example of the two vessels in Fig. 3. By letting Ei denote the set of all vertex labels in iteration i with EiEi+1= and defining ci(G,lij),lijEi to be a function that counts the number of occurrences of the j-th vertex2label lij in graph G, we can define the WLS kernel between Gu and Gv as:

k(Gu,Gv)=[c0(Gu,l01),,ch(Gu,lhEh)],[c0(Gv,l01),,ch(Gv,lhEh)]. (1)

The inner product in Eq. (1) is taken over vectors that contain the concatenated counts of the original and compressed labels after h iterations. In our toy example of Fig. 4 for h = 1, k(G1,G2)=11 in case of γa and k(G1,G2)=19 in case of γd (see suppl. material). This demonstrates the benefit of label transfer (with unique background labels), since similarity increases due to the fact that both blood vessels traverse the same cells and feed into the same structure.

Fig. 4.

Fig. 4

Graph structure for both blood vessels of Fig. 3, with edge colors corresponding to blood vessel colors, i.e. G1, G2. The table illustrates multiset construction and label compression (numbers in parentheses) for h = 1 and γa, γd (i.e., sequential labels and label transfer with unique background labels).

Fig. 3.

Fig. 3

Exemplary CVT tessellation with two overlayed blood vessels. Without label transfer (γa), vertices are labeled sequentially. With label transfer using non-unique background labels (γc), vertices v5,6,7 are labeled as s5 := 5, vertices v1,2,3,4,8 as background s0 := 0; unique background labels (γd) allow to distinguish vertices v1,2,3,4,8.

4 Experiments

The goal of our experimental study is to assess, using graph-kernel based classification, whether there is evidence supporting the hypothesis of gender-related differences in cerebrovascular architecture. This is done by studying classifier predictions in terms of non-trivial deviations from random choice. Along those lines, we show that functional graphs demonstrate preferred behavior compared to spatial graphs with default vertex labelings.

Dataset

We use a dataset of 40 healthy patients , 18 male and 22 female. For all individuals, MRA and T1-weighted MRI scans were acquired by a Siemens Allegra head-only 3T MR system. Voxel spacing is 0.5 × 0.8 × 0.8mm for the MRA (448 × 448 × 128 voxel), and 1mm isotropic for the T1 images (176 × 256 × 176 voxel). Cerebral blood vessels were extracted by an expert (cf. [2]). All blood vessels are transformed into the space of the SPL 2012 brain atlas [6] (1mm isotropic, 256 × 256 × 256 voxel), using a combination of two transforms, t1 t0, where t0 denotes a rigid transform to align the MRA and T1 images and t1 denotes an affine transform to align the (skull-stripped) T1 image to the (skull-stripped) brain atlas4.

Evaluation Setup

In all experiments, we use a C-SVM classifier [3] with the cost parameter optimized by cross validation (CV) on the training data. All reported results are averaged over five CV runs. As a common measure of problem difficulty, we also report the average fraction of selected support vectors (listed next to the classification accuracies in parentheses). The ratio of training to testing samples is set to 0.7/0.3, resulting in 12 test examples in each CV fold. To avoid biasing the results, we run the full atlas formation pipeline + CVT in each CV run. Consequently, the graph representations change slightly due to differences in vessel density atlas formation and partitioning. We additionally perform a set of permutation tests (cf. [5]) to assess whether a classification result could have been obtained by chance. In other words, the classifier could have picked up spurious patterns in the data that correlate with the labels. Our null-hypothesis, H0, is that the data is independent from its labels. The setup is as follows: for each CV split, we choose P = 103 random permutations of the training labels, train the classifier and compute an estimate of the classification error using the testing portion of the data. This gives an empirical estimate ei of the error under H0 and allows the computation of a p-value estimate as = (#{ei < e*} + 1)/(P + 1), where e* is the prediction error without label permutation. We cannot reject H0 if > α, indicated by a next to the classification result (for α = 0.05). To study the impact of the vascular atlas tessellation granularity as well as the WLS kernel height h, we vary the number of CVT cells from 256 to 2048 and h from h = 2 to h = 6. Increasing h affects the kernel in that longer vessel paths are taken into account.

Evaluation of Default Labeling Strategies

To establish a baseline, Tables 1a and 1b list the classification results for spatial-graphs with nodes labeled using γa and γb. In case of γa, the label alphabet E0a grows linearly with the number of CVT cells, whereas in case of γb, E0b is limited by the maximum vertex degree. The results of Table 1a indicate that sequential vertex labeling leads to accuracies often close to random choice. Essentially, all samples are chosen as support vectors, indicating a lack of structure in the data, most likely due to uninformative subtree patterns. Labeling by vertex degree, cf. Table 1b, produces slightly better result in general, yet the limited variety of the label alphabet also seems to restrict discriminative power.

Table 1.

Gender classification accuracies (in %) without label transfer for our two default vertex labeling strategies: (a) sequential labels and (b) vertex degrees.

#Cells h = 2 h = 4 h = 6 #Cells h = 2 h = 4 h = 6
256E0a=256 53 (1.00) 55 (1.00) 53 (1.00) 256E0b=12 57 (0.65) 67 (0.79) 67 (0.80)
512E0a=512 63 (1.00) 58 (1.00) 58 (1.00) 512E0b=11 70 (0.56) 70 (0.69) 70 (0.73)
1024E0a=1024 58 (1.00) 57 (1.00) 57 (1.00) 1024E0b=11 62 (0.51) 67 (0.65) 70 (0.69)
2048E0a=2048 63 (1.00) 67 (1.00) 65 (1.00) 2048E0b=10 62 (0.50) 68 (0.59) 70 (0.63)

(a) γa:VE0a

(b) γb:VE0b

Evaluation of Label Transfer

Tables 2a and 2b list the classification accuracies for functional graphs, with and without unique background labels. Overall, we notice an increase in classification performance, with top rates reaching up to 77% and 72%, respectively. Table 2b indicates that, without unique background labels, increasing the number of CVT cells does not necessarily improve performance. In fact, we notice a slight overall decrease in accuracy as we move from 256 to 1024 cells, presumably due to the increase in non-informative subtree patterns caused by an increasing number of non-unique background labels. A comparison to Table 2a supports this conclusion, since background labels are unique and increasing the number of cells from 256 to 1024 leads to consistent improvements. As we further increase the number of cells from 1024 to 2048, the high number of unique, but nonetheless arbitrary labels (665 on avg., cf. Table 2a) seems to confound the positive effect of label transfer.

Table 2.

Gender classification accuracies (in %) using label transfer (a) with and (b) without unique background labels. Numbers in red show the cardinality of the set of added background labels, i.e., B and S = 155 (#segmented structures, see [6]).

#Cells h = 2 h = 4 h = 6 #Cells h = 2 h = 4 h = 6
256E0a=S+132 60 (1.00) 62 (1.00) 62 (1.00) 256E0c=S 68 (0.85) 72 (0.87) 72 (0.87)
512E0d=S+226 70 (0.93) 70 (0.94) 70 (0.93) 512E0c=S 72 (0.74) 70 (0.79) 72 (0.79)
1024E0d=S+386 75 (0.91) 77 (0.92) 77 (0.92) 1024E0c=S 63 (0.79) 68 (0.81) 70 (0.84)
2048E0d=S+665 73 (0.87) 73 (0.88) 73 (0.89) 2048E0c=S 68 (0.71) 68 (0.74) 72 (0.74)

(a) γd:VE0d

(b) γc:VE0c

Discussion

While our results indicate gender-related architectural differences in cerebral vasculature, consistent with [7], several points are worth noting. First, the resolution of MRA is limited to capture blood vessels of size (in diameter) equal to the voxel dimension. This essentially prevents us from quantifying capillaries and determining exact vessel function. Second, our approach is designed to quantify similarity of blood vessel graphs at a global level, to determine “if ” there are differences. Answering the question of “where” differences occur requires a different approach and is left for future research. Third, while the classification accuracies are lower than originally reported in [1], we remark that our results are obtained via cross-validation. In fact, some splits do lead to perfect classification. Finally, most permutation test outcomes actually support the rejection of H0 at the 5% significance level. At 1%, the overall picture remains the same, however, some results of the default labeling strategies (e.g., in Table 1a) are not statistically significant. In summary, the presented approach facilitates the study of vascular differences among populations of healthy individuals, and may facilitate the study of cerebrovascular differences between individuals and populations involving selected mental disorders.

Acknowledgments

This work was funded in part by the following grants: NIH/NCI 1R01CA138419, 1R43CA165621; NIH/NIMH 2P41EB002025-26A1, 1R01MH091645-01A1. The software is available as part of TubeTK (http://www.tubetk.org); the full dataset is available from http://midas3.kitware.com/midas/folder/8051.

Footnotes

3

i.e., an approximate voxel-wise distance to the closest blood vessel, e.g., computed using the N-d extension of Daniellson's algorithm, cf. [4]

4

Skull-stripping is done using bet2 (FMRIB); Registration is done using 3D Slicer.

References

  • 1.Aylward S, Jomier J, Vivert C, LeDigarcher V, Bullit E. Spatial graphs for intra-cranial vascular network characterization. In: Duncan J, Gerig G, editors. MICCAI 2005. Vol. 3749. Springer; Heidelberg: 2005. pp. 59–66. [DOI] [PubMed] [Google Scholar]
  • 2.Bullit E, Zeng D, Mortamet B, Gosh A, Aylward S, Lin W, Marks B, Smith K. The e ects of healthy aging on intracerebral blood vessels visualized by magnetic resonance angiography. Neurobiol Aging. 2010;31(2):290–300. doi: 10.1016/j.neurobiolaging.2008.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Chang CC, Lin CJ. LIBSVM: A library for support vector machines. ACM TIST. 2011;2(3):1–27. [Google Scholar]
  • 4.Gerig G, Koller T, Szekely G, Brechbühler C, Käbler O. Symbolic description of 3D structures applied to cerebral vessel tree obtained from MR angiography volume data. IPMI. 1993 [Google Scholar]
  • 5.Golland P, Fischl B. Permutation tests for classification: Towards statistical significance in image-based studies. IPMI. 2003 doi: 10.1007/978-3-540-45087-0_28. [DOI] [PubMed] [Google Scholar]
  • 6.Halle M, Talos IF, Jakab M, Makris N, Meier D, Wald L, Fischl B, Kikinis R. Multi-modality MRI-based atlas of the brain. 2012 [Google Scholar]
  • 7.Horikoshi T, Akiyama I, Yamagata Z, Sugita M, Nukuki H. Magnetic resonsance angiographic evidence of sex-linked variations in the Circle of Willis and the occurrence of cerebral aneurysms. J Neurosurg. 2002;96:697–703. doi: 10.3171/jns.2002.96.4.0697. [DOI] [PubMed] [Google Scholar]
  • 8.Shervashidze N, Schweitzer P, van Leeuwen E, Mehlhorn K, Borgwardt K. Weisfeiler-Lehman graph kernels. JMLR. 2011;12:2539–2561. [Google Scholar]

RESOURCES