Abstract
The volume of stroke lesion is the gold standard for predicting the clinical outcome of stroke patients. However, the presence of stroke lesion may cause neural disruptions to other brain regions, and these potentially damaged regions may affect the clinical outcome of stroke patients. In this paper, we introduce the tractographic feature to capture these potentially damaged regions and predict the modified Rankin Scale (mRS), which is a widely used outcome measure in stroke clinical trials. The tractographic feature is built from the stroke lesion and average connectome information from a group of normal subjects. The tractographic feature takes into account different functional regions that may be affected by the stroke, thus complementing the commonly used stroke volume features. The proposed tractographic feature is tested on a public stroke benchmark Ischemic Stroke Lesion Segmentation 2017 and achieves higher accuracy than the stroke volume and the state-of-the-art feature on predicting the mRS grades of stroke patients. Also, the tractographic feature yields a lower average absolute error than the commonly used stroke volume feature.
Keywords: modified Rankin Scale (mRS), Stroke, Clinical outcome prediction, Tractographic feature, Machine learning
1. Introduction
According to the World Health Organization, 15 million people suffer strokes each year, the second leading cause of death (5.8 million) and the third leading cause of disability worldwide [8,13]. Around 87% of strokes are ischemic strokes, which result from an obstruction within a blood vessel in the brain [18]. The corresponding lack of oxygen results in different degrees of disability of people, and the modified Rankin Scale (mRS) is commonly used to measure the degree of disability or dependence in the daily activities of stroke patients [2,5,22].
Several studies [2,12,14,17,21,23] demonstrate significant correlations between stroke volume and mRS grades, with larger lesions predicting more severe disability. However, only a few studies [3,4,15] extracted different features, including first-order features and deep features, other than the volume from stroke lesion to predict the mRS grades of stroke patients. The study of Maier and Handels [15] is most relevant to our work. They extracted 1650 image features and 12 shape characteristics from the stroke volume, the volume surrounding the stroke and the remaining brain volume, and they applied a random forest regressor with 200 trees on these 1662 features to predict the mRS grades of stroke patients. However, the presence of stroke lesion may disrupt other brain regions that may affect the clinical outcome of stroke patients.
The main contribution of this paper is the introduction of a new second-order feature, the tractographic feature, that couples the stroke lesion of a patient with the average connectome information from a group of normal subjects. The tractographic feature describes the potentially damaged brain regions due to the neural disruptions of the stroke lesion. Ideally one would like to use the diffusion images from the stroke patient, but this is not a realistic scenario. For instance, the patient with mental in their body is unsafe for getting an MRI scan. Instead, we use the “normal” subject data from the HCP project with the assumption that the parcellations and the associated tracts computed from that data are a reasonable approximation to extract the connectivity features. These tractographic features coupled with the stroke lesion information are used to predict the mRS grades of stroke patients. The concept of the tractographic feature was first proposed by Kao et al. [9] who used these to predict the overall survival of brain tumor patients. We modify their method to adapt to the size of the lesions and propose a new weighted vector of the tractographic feature. Our experimental results demonstrate that the proposed approach improves upon the state-of-the-art method and the gold standard in predicting the clinical outcome of stroke patients.
2. Materials and Methods
2.1. Dataset
Ischemic Stroke Lesion Segmentation (ISLES) 2017 [10,16] provides 43 subjects in the training dataset. Each subject has two diffusion maps (DWI, ADC), five perfusion maps (CBV, CBF, MTT, TTP, Tmax), one ground-truth lesion mask and clinical parameters. The ground-truth lesion mask is built in the follow-up anatomical sequence (T2w or FLAIR) and the corresponding mRS grade was given on the same day. The clinical parameters include mRS grade ranging from 0 to 4, time-to-mRS (88 to 389 days), TICI scale grade from 0 to 3, time-since-stroke (in minutes), and time-to-treatment (in minutes). Since TICI scale grade, time-since-stroke, and time-to-treatment were missing for some subjects, these three clinical parameters are not used in this work. The dimension and voxel spacing of MR images are different between each subject, but they are the same within each subject. We only focus on the subjects who obtain an mRS grade at 3 months (90 days) following hospital discharge since ascertainment of disability at 3-month post-stroke is an essential component of outcome assessment in stroke patients [5], and the tractographic data may change at a different time. Therefore, only 37 subjects are considered in this paper.
2.2. Tractographic Feature
The tractographic feature describes the potentially damaged region impacted by the presence of the stroke lesion through the average connectome information from 1021 Human Connectome Project (HCP) subjects [20]. For each HCP subject, q-space diffeomorphic reconstruction [24] is used to compute the diffusion orientation distribution function. Figure 1 shows the workflow of building a tractographic feature for a stroke patient.
Fig. 1.

The workflow for constructing a tractographic feature from a stroke region.
Given the stroke lesion in the subject space, we first map the stroke lesion to the Montreal Neurological Institute (MNI) space [6]. Second, we place one seed within each voxel of the brain region, and a deterministic diffusion fiber tracking method [25] is used to find all possible tracts passing through the stroke volume inside the brain from the average diffusion orientation distribution function of 1021 HCP subjects. Topology-informed pruning [26] is used to remove false-positive tracts. Third, an existing brain parcellation atlas is used to create a disruption matrix D, which describes the degree of disruption between different brain parcellation regions due to the presence of the stroke lesion.
| (1) |
dij notes the number of tracts starting from a region i and ending in a region j, and N is the total number of brain parcellation regions in the existing atlas. Then, this disruption matrix is normalized by its maximum value, i.e., where is the normalized disruption matrix, and dm is the maximum element of the disruption D. Afterward, we sum up each column in this normalized disruption matrix to form a row vector . From the stroke lesion, we build a weight vector γ = [s1, s2, …, sN], which is the distribution of the stroke volume in the different brain parcellation regions. si is the volume of the stroke lesion in the i-th brain parcellation region. In the end, the row vector L is multiplied by this weight vector γ element-wisely to form the tractographic feature T.
| (2) |
○ is the Hadamard-product. This vector T is the proposed tractographic feature extracted from stroke lesion without any diffusion information of a patient. In this paper, the Automated Anatomical Labeling (AAL) [19] template is used to define 116 brain regions so the dimension of the tractographic feature is 116. The reasons for choosing AAL rather than other existing atlases are (i) this atlas contains an optimal number of brain regions that could make each region large enough to compensate possible stroke-induced lesion effect or distortion, and (ii) this atlas contains cortical, subcortical and cerebellar regions, which could be equally important for mRS prediction. The source code is available on GitHub1.
Parameters of Fiber Tracking
DSI Studio2 is used to build the fiber tracts for each subject. Table 1 shows the tracking parameters3 we used in this paper. The type of stroke lesion is set to ROI (−roi=stroke_lesion) that found all possible tracts passing through the stroke lesion.
Table 1.
Tracking parameters of building the fiber tracts for stroke patients in this paper. More details of parameters can be found at http://dsi-studio.labsolver.org/Manual/Fiber-Tracking.
| Parameter | Value |
|---|---|
| Termination index | qa |
| Threshold | 0.15958 |
| Angular threshold | 90 |
| Step size (mm) | 0.50 |
| Smoothing | 0.50 |
| Min length (mm) | 3.0 |
| Max length (mm) | 500.0 |
| Topology-informed pruning (iteration) | 1 |
| Seed orientation | All orientations |
| Seed position | Voxel |
| Randomize seeding | Off |
| Check ending | Off |
| Direction interpolation | Tri-linear |
| Tracking algorithm | Streamline (Euler) |
| Terminate if | 2,235,858 Tracts |
| Default otus | 0.60 |
Parameters of Connectivity Matrix
DSI studio is used to create the connectivity matrix4 followed by fiber tracking. Automated Anatomical Labeling is chosen to form a 116 × 116 connectivity matrix. The type of the connectivity matrix is set to end, the value of each element in the connectivity matrix is the count of fiber tracts, and the threshold to remove the noise in the connectivity matrix is set to 0.
2.3. Evaluation Metrics
The employed evaluation metrics are (i) the accuracy, which is the percentage of the predicted mRS scores matching the corresponding ground-truth mRS scores, and (ii) the average absolute error between the predicted mRS scores and the corresponding ground-truth mRS scores.
3. Experimental Results
First Experiment:
In this experiment, we compare the mRS prediction performance of the tractographic feature with other first-order features extracted from the lesion mask. These first-order features include the volumetric feature, spatial feature, morphological feature and volumetric-spatial feature depicted in Table 2. The framework of the first experiment is shown in Fig. 2.
Table 2.
First-order features extracted from the stroke lesion.
| Type of feature | Descriptions |
|---|---|
| Volumetric feature | Volumetric feature is the volume of the lesion in the MNI space, and it only has one dimension |
| Spatial feature | Spatial feature describes the location of the lesion in the brain. The centroid of the lesion is extracted as the spatial feature for each subject, and the spatial feature has three dimensions |
| Morphological feature | Morphological feature describes shape information of the lesion. The length of the major axis and minor axis of the lesion, the ratio of the length of the major axis and minor axis of the lesion, the solidity and roundness of the lesion, and the surface of the lesion are extracted as the morphological feature. The morphological feature has six dimensions for each subject |
| Volumetric-spatial feature | Volumetric-spatial feature describes the distribution of the stroke lesion in different brain parcellation regions from an existing structural atlas. Automated Anatomical Labeling (AAL) [19] is used to build the volumetric-spatial feature so the dimension of the volumetric-spatial feature is 116 |
Fig. 2.

The framework of the first experiment. In the end, the predicted mRS grade is rounded to an integer.
We first register the stroke lesions from subject space to the MNI space to overcome the differences of the voxel spacing and image dimension between different subjects. The tractographic feature and other first-order features are extracted from these normalized stroke lesions. After feature extraction, we apply a standard feature normalization on the extracted features to ensure that each dimension of the features has the same scale. Then, we remove the dimensions of the features with zero variance between subjects and apply a recursive feature elimination with leave-one-out cross-validation to find the best subset of the feature that yields the lowest average mean absolute error. In the training phase, we train one random forest regressor for each type of feature, i.e., five random forest regressors are trained. Each random forest regressor has 300 trees of which maximum depth is 3. In the testing phase, we use different types of features with the corresponding trained random forest regressors to predict the mRS grades of stroke patients, and the predicted mRS grade is rounded to an integer. We evaluate the mRS prediction performance of different types of features with leave-one-out cross-validation on ISLES 2017 training dataset. The quantitative results are reported in Table 3. From Table 3, the tractographic feature has the highest accuracy and lowest average absolute error on predicting the mRS grades of stroke patients compared to other first-order features.
Table 3.
The mRS prediction performance of different types of features on ISLES 2017 training dataset with leave-one-out cross-validation. The bold numbers show the best performance. (The average absolute error is reported as mean ± std.)
| Type of feature | Accuracy | Average absolute error |
|---|---|---|
| Tractographic feature | 0.622 | 0.487 ± 0.683 |
| Volumetric feature | 0.514 | 0.595 ± 0.715 |
| Volumetric-spatial feature | 0.568 | 0.621 ± 0.817 |
| Morphological feature | 0.378 | 0.703 ± 0.609 |
| Spatial feature | 0.351 | 0.919 ± 0.882 |
| Maier and Handels [15] | 0.595 | 0.460 ± 0.597 |
Second Experiment:
We compare the mRS prediction performance of the tractographic feature with the state-of-the-art feature proposed by Maier and Handels [15]. We implement their feature extraction method on ISLES 2017 dataset. First, 1650 image features and 12 shape features are extracted from the lesion volume and the apparent diffusion coefficients (ADC) maps in the subject space. Thereafter, these two types of features are concatenated to build a 1662-dimension feature. Then, we apply the same feature normalization, feature selection, cross-validation, and random forest regressor as the first experiment to predict the mRS of stroke patients. The quantitative results of the state-of-the-art feature are also shown in Table 3. From Table 3, the tractographic feature also achieves higher accuracy and similar average absolute error (p = 0.81) compared to the state-of-the-art feature.
4. Discussion and Conclusion
From the first experiment, the tractographic feature has the best mRS prediction accuracy and the lowest average absolute error compared to other first-order features. The main reason is that the tractographic feature integrates volumetric-spatial information of the stroke lesion and the average diffusion information from a group of normal subjects that describes the potentially damaged regions impacted by the stroke lesion. These potentially damaged regions are formatted in the disruption matrix D from Eq. (1), and the weight vector γ from Eq. (2) carries spatial and volumetric information of the stroke lesion to the tractographic feature T. Also, it is worth noting that the volumetric-spatial feature is the same as the weight vector γ of the tractographic feature, and the mRS prediction performance of volumetric-spatial feature is improved by considering the average connectome information from a group of normal subjects.
The second experiment demonstrates that the tractographic feature also has better mRS prediction accuracy than the state-of-the-art feature [15]. It should be noted that their approach requires ADC maps that are not necessarily always available, and using only the lesion shape information degrades the overall performance significantly in their approach. We also note that the tractographic feature is of much lower dimensions (116) compared to the state-of-the-art feature (1662).
In both experiments, we apply the recursive feature selection with cross-validation on different types of features, and this procedure reduces one dimension of feature recursively until finding the best subset of the feature with the lowest mean absolute error. For the tractographic feature, this reduces the dimensionality from 116 to 8. This selected tractographic feature comes from eight AAL regions shown in Fig. 3 (left and right inferior temporal gyrus, right Rolandic operculum, left middle frontal gyrus, orbital part and triangular part of right inferior frontal gyrus, left angular gyrus and left putamen).
Fig. 3.

Selected tractographic feature from eight AAL regions including left (in red) and right (in pink) inferior temporal gyrus red, right Rolandic operculum (in orange), left middle frontal gyrus (in yellow), orbital part (in green) and triangular part (in blue) of right inferior frontal gyrus, left angular gyrus (in purple) and left putamen (in grey) after applying the recursive feature selection with cross-validation on the original tractographic features. These tractographic features are extracted from 37 ISLES 2017 training subjects.
After feature selection, we use a random forest regressor to predict the mRS grades of stroke patients. The random forest regressor gives the importance to each dimension within a given type of feature shown in Fig. 4.
Fig. 4.

Region importance of eight selected AAL brain parcellation regions given by a random forest regressor with 300 trees whose maximum depth is 3. The average values are marked in the green triangles. Left inferior temporal gyrus (LITG) yields a higher mean importance (0.26) than right Rolandic operculum (RRO, 0.14), left middle frontal gyrus (LMFG, 0.13), orbital part (ORIFG, 0.11) and triangular part (TRIFG, 0.10) of right inferior frontal gyrus, left angular gyrus (LAG, 0.09), left putamen (LP, 0.09) and right inferior temporal gyrus (RITG, 0.08) within 37 ISLES 2017 training subjects on the task of predicting the mRS grades of stroke patients.
For the selected tractographic feature, left inferior temporal gyrus yields the highest average importance compared to the other seven regions within 37 ISLES 2017 training subjects on the task of predicting the mRS grades. The reasons left inferior temporal gyrus has the greatest effect on the mRS of stroke patients are (i) this region is important for language processing and speech production [1], and (ii) a large number of fiber tracts, passing through this region, goes across the splenium of the corpus callosum which connects the visual, parietal and auditory cortices [7,11] (See Fig. 5).
Fig. 5.

The fiber tracts passing through the left inferior temporal gyrus from the average connectome information of 1024 HCP subjects. We place a seed in each voxel inside the whole brain to find all possible tracts passing through the left inferior temporal gyrus.
In conclusion, the paper presents for the first time the use of tractographic features for predicting the clinical outcome of stroke patients. The tractographic feature leads to promising mRS prediction results on ISLES 2017 dataset but needs to be further validated using a larger and representative independent dataset to rule out a potential methodical bias and over-fitting effects. The proposed tractographic feature has the potential to be improved if we build a disruption matrix from each HCP subject given the stroke lesion in MNI space and construct the average disruption matrix from these individual disruption matrices.
Limitation.
The proposed tractographic feature cannot be generated if the stroke lesion is not located in the brain parcellation regions.
Acknowledgement.
This research was partially supported by a National Institutes of Health (NIH) award # 5R01NS103774-03. We thank Oytun Ulutan for technical support and Dr. Robby Nadler for writing assistance and language editing.
Footnotes
parameter_id=F168233E9A99193F32318D24ba3Fba3Fb404b0FA43D21D22cb01ba02a01d.
References
- 1.Antonucci SM, et al. : Lexical retrieval and semantic knowledge in patients with left inferior temporal lobe lesions. Aphasiology 22(3), 281–304 (2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Banks JL, Marotta CA: Outcomes validity and reliability of the modified Rankin scale: implications for stroke clinical trials: a literature review and synthesis. Stroke 38(3), 1091–1096 (2007) [DOI] [PubMed] [Google Scholar]
- 3.Choi Y, Kwon Y, Lee H, Kim BJ, Paik MC, Won JH: Ensemble of deep convolutional neural networks for prognosis of ischemic stroke. In: Crimi A, Menze B, Maier O, Reyes M, Winzeck S, Handels H (eds.) BrainLes 2016. LNCS, vol. 10154, pp. 231–243. Springer, Cham: (2016). 10.1007/978-3-319-55524-922 [DOI] [Google Scholar]
- 4.Forkert ND, et al. : Multiclass support vector machine-based lesion mapping predicts functional outcome in ischemic stroke patients. PLoS One 10(6), e0129569 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Goyal M, et al. : Endovascular thrombectomy after large-vessel ischaemic stroke: a meta-analysis of individual patient data from five randomised trials. Lancet 387(10029), 1723–1731 (2016) [DOI] [PubMed] [Google Scholar]
- 6.Grabner G, Janke AL, Budge MM, Smith D, Pruessner J, Collins DL: Symmetric atlasing and model based segmentation: an application to the hippocampus in older adults. In: Larsen R, Nielsen M, Sporring J (eds.) MICCAI 2006. LNCS, vol. 4191, pp. 58–66. Springer, Heidelberg: (2006). 10.1007/118667638 [DOI] [PubMed] [Google Scholar]
- 7.Hofer S, Frahm J: Topography of the human corpus callosum revisited—comprehensive fiber tractography using diffusion tensor magnetic resonance imaging. Neuroimage 32(3), 989–994 (2006) [DOI] [PubMed] [Google Scholar]
- 8.Johnson W, et al. : Stroke: a global response is needed. Bull. World Health Organ. 94(9), 634 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kao P-Y, Ngo T, Zhang A, Chen JW, Manjunath BS: Brain tumor segmentation and tractographic feature extraction from structural MR images for overall survival prediction. In: Crimi A, Bakas S, Kuijf H, Keyvan F, Reyes M, van Walsum T (eds.) BrainLes 2018. LNCS, vol. 11384, pp. 128–141. Springer, Cham: (2019). 10.1007/978-3-030-11726-912 [DOI] [Google Scholar]
- 10.Kistler M, et al. : The virtual skeleton database: an open access repository for biomedical research and collaboration. J. Med. Internet Res 15(11), e245 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Knyazeva MG: Splenium of corpus callosum: patterns of interhemispheric interaction in children and adults. Neural Plast. 2013, 12 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lev MH, et al. : Utility of perfusion-weighted CT imaging in acute middle cerebral artery stroke treated with intra-arterial thrombolysis: prediction of final infarct volume and clinical outcome. Stroke 32(9), 2021–2028 (2001) [DOI] [PubMed] [Google Scholar]
- 13.Lindley RI: Stroke, 2nd edn. Oxford University Press, Oxford: (2017) [Google Scholar]
- 14.Löuvbld KO, et al. : Ischemic lesion volumes in acute stroke by diffusion-weighted magnetic resonance imaging correlate with clinical outcome. Ann. Neurol.: Off. J. Am. Neurol. Assoc. Child Neurol. Soc 42(2), 164–170 (1997) [DOI] [PubMed] [Google Scholar]
- 15.Maier O, Handels H: Predicting stroke lesion and clinical outcome with random forests. In: Crimi A, Menze B, Maier O, Reyes M, Winzeck S, Handels H (eds.) BrainLes 2016. LNCS, vol. 10154, pp. 219–230. Springer, Cham: (2016). 10.1007/978-3-319-55524-921 [DOI] [Google Scholar]
- 16.Maier O, et al. : ISLES 2015 - A public evaluation benchmark for ischemic stroke lesion segmentation from multispectral MRI. Med. Image Anal 35, 250–269 (2017) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Parsons M, et al. : Combined 1H MR spectroscopy and diffusion-weighted MRI improves the prediction of stroke outcome. Neurology 55(4), 498–506 (2000) [DOI] [PubMed] [Google Scholar]
- 18.Summers D, et al. : Comprehensive overview of nursing and interdisciplinary care of the acute ischemic stroke patient: a scientific statement from the American Heart Association. Stroke 40(8), 2911–2944 (2009) [DOI] [PubMed] [Google Scholar]
- 19.Tzourio-Mazoyer N, et al. : Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage 15(1), 273–289 (2002) [DOI] [PubMed] [Google Scholar]
- 20.Van Essen DC, et al. : The WU-Minn human connectome project: an overview. Neuroimage 80, 62–79 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Van Everdingen K, et al. : Diffusion-weighted magnetic resonance imaging in acute stroke. Stroke 29(9), 1783–1790 (1998) [DOI] [PubMed] [Google Scholar]
- 22.Van Swieten J, et al. : Interobserver agreement for the assessment of handicap in stroke patients. Stroke 19(5), 604–607 (1988) [DOI] [PubMed] [Google Scholar]
- 23.Vogt G, et al. : Initial lesion volume is an independent predictor of clinical stroke outcome at day 90: an analysis of the virtual international stroke trials archive (VISTA) database. Stroke 43(5), 1266–1272 (2012) [DOI] [PubMed] [Google Scholar]
- 24.Yeh FC, Tseng WYI: NTU-90: a high angular resolution brain atlas constructed by q-space diffeomorphic reconstruction. Neuroimage 58(1), 91–99 (2011) [DOI] [PubMed] [Google Scholar]
- 25.Yeh FC, et al. : Deterministic diffusion fiber tracking improved by quantitative anisotropy. PLoS One 8(11), e80713 (2013) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Yeh F-C, et al. : Automatic removal of false connections in diffusion MRI tractography using topology-informed pruning (TIP). Neurotherapeutics 16(1), 52–58 (2018). 10.1007/s13311-018-0663-y [DOI] [PMC free article] [PubMed] [Google Scholar]
