Skip to main content
Lippincott Open Access logoLink to Lippincott Open Access
. 2023 Sep 25;54(11):2895–2905. doi: 10.1161/STROKEAHA.123.043897

Prediction of Stroke Outcome in Mice Based on Noninvasive MRI and Behavioral Testing

Felix Knab 1,2, Stefan Paul Koch 1,2,3, Sebastian Major 1,2, Tracy D Farr 1,3,4, Susanne Mueller 1,2,3, Philipp Euskirchen 1, Moritz Eggers 1,2, Melanie TC Kuffner 1,2, Josefine Walter 1,2,5, Daniel Berchtold 1,2, Samuel Knauss 1,6, Jens P Dreier 1,2,7,8, Andreas Meisel 1,2, Matthias Endres 1,2,7,9,10,11, Ulrich Dirnagl 1,2,7,9,10,11, Nikolaus Wenger 1,2,6,7, Christian J Hoffmann 1,2,6, Philipp Boehm-Sturm 1,2,3, Christoph Harms 1,2,6,7,9,
PMCID: PMC10589430  PMID: 37746704

Abstract

BACKGROUND:

Prediction of poststroke outcome using the degree of subacute deficit or magnetic resonance imaging is well studied in humans. While mice are the most commonly used animals in preclinical stroke research, systematic analysis of outcome predictors is lacking.

METHODS:

We intended to incorporate heterogeneity into our retrospective study to broaden the applicability of our findings and prediction tools. We therefore analyzed the effect of 30, 45, and 60 minutes of arterial occlusion on the variance of stroke volumes. Next, we built a heterogeneous cohort of 215 mice using data from 15 studies that included 45 minutes of middle cerebral artery occlusion and various genotypes. Motor function was measured using a modified protocol for the staircase test of skilled reaching. Phases of subacute and residual deficit were defined. Magnetic resonance images of stroke lesions were coregistered on the Allen Mouse Brain Atlas to characterize stroke topology. Different random forest prediction models that either used motor-functional deficit or imaging parameters were generated for the subacute and residual deficits.

RESULTS:

Variance of stroke volumes was increased by 45 minutes of arterial occlusion compared with 60 minutes. The inclusion of various genotypes enhanced heterogeneity further. We detected both a subacute and residual motor-functional deficit after stroke in mice and different recovery trajectories could be observed. In mice with small cortical lesions, lesion volume was the best predictor of the subacute deficit. The residual deficit could be predicted most accurately by the degree of the subacute deficit. When using imaging parameters for the prediction of the residual deficit, including information about the lesion topology increased prediction accuracy. A subset of anatomic regions within the ischemic lesion had particular impact on the prediction of long-term outcomes. Prediction accuracy depended on the degree of functional impairment.

CONCLUSIONS:

For the first time, we developed and validated a robust tool for the prediction of functional outcomes after experimental stroke in mice using a large and genetically heterogeneous cohort. These results are discussed in light of study design and imaging limitations. In the future, using outcome prediction can improve the design of preclinical studies and guide intervention decisions.

Keywords: forecasting; infarction; middle cerebral artery; mice, Inbred C57BL; precision medicine; random forest; sensory motor performance; topographic brain mapping


Summary: Systematic analysis of stroke outcome predictors in mice is lacking. We developed a tool to predict late pellet reaching performance based on early stroke localization in a diverse cohort of 215 mice.


Stroke remains a major public health challenge, as its occurrence rises in aging societies.1 With decreasing mortality rates due to progress in acute stroke therapy, the need to develop treatments that improve functional outcomes has become more pressing.2 Understanding predictors of functional outcome after stroke both in humans and rodents is an essential step towards defining adequate treatment decisions and improving the design of preclinical studies.3 Prediction of expected motor-functional outcomes in mice could potentially guide researchers in their treatment decisions in preclinical stroke intervention studies and support outcome-dependent stratifications. In humans, magnetic resonance imaging (MRI) is used to assess stroke volume and topology.4 Lesion volume is most commonly used to analyze stroke severity and predict functional outcome, but the prognostic value has been shown to be better when including lesion topology.511 This was also shown in an exploratory preclinical study using a porcine stroke model.12 Additionally, the degree of the subacute deficit has been shown to be a reliable predictor of long-term outcomes, especially for mild functional deficits.13,14 In contrast to humans, little is known about predictors of functional outcomes after stroke in mice, although they are commonly used in preclinical research. Detecting motor-functional deficits in rodents following experimental stroke is difficult as they show impressive recovery and have subtle, if any, long-term motor deficits in common behavioral tests.15 The stroke preclinical assessment network recently completed the largest preclinical stroke study on mice and provided examples of how the middle cerebral artery occlusion (MCAO) model can be used to conduct studies that are similar to those performed in humans.16 Various efforts to standardize preclinical stroke models have been made, including standardization of operation procedures and monitoring of cerebral blood flow.17,18 Today, heterogeneity of the MCAO model has been reduced but still largely depends on factors such as the experience of the surgeon or filament choice.19 Additionally, other factors such as the natural microbiome, comorbidities, age, or sex are likely contributing to the observable heterogeneity in preclinical stroke studies. In their most recent publication, the stroke preclinical assessment network consortium has suggested to embrace and systematically integrate this heterogeneity of outcomes into preclinical research, to more accurately simulate the reality of clinical studies.19

In a previous study, we have investigated the accuracy of early perfusion/diffusion MRI for final infarct prediction.20 The main goal of the present retrospective study was to identify and compare different predictors of functional outcome following brain ischemia in mice and to provide researchers with a tool, which can help to cope with the heterogeneity of stroke volumes. Our work opens new avenues toward a better understanding of poststroke recovery and impairment in mice and paves the way to using the prediction of functional outcomes for the optimization of preclinical stroke studies. The provided prediction tool gives laboratories, in particular, those that are just beginning to gradually increase standardization of the MCAO model in an iterative process, the option to cope with heterogeneity of stroke volumes from various sources.

METHODS

Data are reported according to the ARRIVE guidelines (Animal Research: Reporting of In Vivo Experiments). Data are openly accessible on zenodo (https://doi.org/10.5281/zenodo.6534690). The repository contains all raw MRI T2w images, lesion masks, and the registered atlases, the input for machine learning algorithms (lesion volume, segmented MRI, behavioral scores), the trained classifiers, and their output (predicted behavioral scores).

Heterogeneity of Stroke Volumes

We compared the degree of stroke volume heterogeneity caused by different arterial occlusion times using imaging data from a total of 291 C57Bl/6 mice undergoing either 30 (n=114), 45 (n=56), or 60 (n=121) minutes of MCAO. Mean lesion volumes and deviation from group mean in percent were assessed (Figure 1). We additionally performed a covariate analysis using a multiple linear regression model assessing the effect of surgeon and occlusion time on stroke lesion volume. Subsequently, imaging and behavioral data from 15 studies performed between 2015 and 2019 applying an occlusion time of 45 minutes and 15 genotypes were collected (Figure S1). For this cohort, which was later divided into a prediction cohort and a replication cohort, we analyzed the effect of the covariates genotype and surgeon (n=2) on the heterogeneity of stroke lesion volumes.

Figure 1.

Figure 1.

Heterogeneity of stroke volumes by occlusion time. A, Lesion incidence maps of mice with 30, B, 45, or C, 60 min of middle cerebral artery occlusion (MCAO). Notably, an increasing percentage of the mice show affection of cortical regions and, therefore, of the entire MCA territory. D, Stroke volumes resulting from the 3 different arterial occlusion times. Lesion volumes became significantly larger with longer occlusion times. Error bars indicate 95% CI; black line indicates mean. E, Individual deviation from group mean was expressed in percent. Interestingly, the 60 min of MCAO provided significantly smaller deviations, indicating more homogeneity of stroke volumes at longer occlusion times.

Experimental Design and Selection Process

The prediction tools were developed using data from the prediction cohort and were tested again in the replication cohort. Skilled forepaw reaching was tested in the staircase test before and after MCAO surgery, followed by MRI after 24 hours (Figure 2A, details in Supplemental Material and Methods). Initially, 450 mice were considered for inclusion and 215 mice were included in this retrospective study (consort diagram, Figure S1B; Table S1). Basic characteristics of all animals are provided in Table S2 and Figure S2. A detailed report on the selection criteria and mortality can be found in the Supplemental Material and Methods.

Figure 2.

Figure 2.

Experimental design and assessment of functional subgroups. A, Mice were trained in pellet reaching for 21 d. Average percentage of retrieved pellets during the last week before middle cerebral artery occlusion (MCAO) was used to normalize performance (baseline). Mice received either 45 min of MCAO or sham surgery. Magnetic resonance imaging was performed on day 1 and mice were then tested for another 21 d. B, Chord diagram displaying the different trajectories of recovery. Groups on top display mice stratified by degree of subacute deficit, while groups on bottom display mice stratified by residual deficit. Numbers indicate group sizes. Interconnections visualize how many mice with a given subacute deficit recovered to have a given residual deficit, that is, roughly a third of the mice with a severe subacute deficit displayed no recovery and showed a severe residual deficit. C, Performance of the paretic right paw over time for each severity grade. Error bars indicate 95% CI. Severity grades here were based on the degree of the residual deficit. D, Overview over sizes of functional subgroups. Groups are either based on the degree of subacute or residual deficit. While the majority of mice displayed a severe subacute deficit, due to their impressive capability for spontaneous recovery, the majority of mice displayed only mild residual deficits. Created with BioRender.com.

Staircase Test of Skilled Reaching

Behavioral testers were blinded to the intervention and MRI results. Mice were housed, food-restricted, and familiarized with the staircase test (details in Supplemental Material and Methods).21 The percentage of collected pellets per side was chosen as the experimental variable. Performance was expressed as a percentage of average prestroke performance (baseline). Based on the mean performance of the entire cohort after the stroke, 2 phases were defined: a subacute deficit phase (days, 2–6), during which performance appeared dynamic and increasing, and a residual deficit phase (days, 12–21), throughout which performance appeared stable. Average performance was calculated for each mouse for the respective period. A gap of 5 days was chosen between the subacute and residual deficit to ensure the dynamic phase had ended.

Classification of Severity Grades for Subgroup Analysis

Alluding to the modified Rankin Scale, a score regularly used in patients with stroke, 5 different functional subgroups were defined based on the severity of deficit (Figure 2B through 2D; details in Supplemental Material and Methods).22

Selection of Predictors

Two factors were analyzed for their predictive value for the degree of the subacute deficit after MCAO: (1) stroke lesion volume measured within the coordinates of the Allen Mouse Brain Atlas, which makes it identical to the volume of remaining brain tissue23 and (2) MRI lesion percentages measured in the 736 Allen Mouse Brain Atlas regions. Throughout the article, this predictor is referred to as segmented MRI. The dimensionality of the segmented MRI (736 initial regions) was reduced by discarding all regions that did not show a lesion in at least 1 animal. This left the segmented MRI with 536 regions. In addition to those 2 imaging-based parameters, for the prediction of the residual deficit, the degree of subacute deficit was included as a third factor.

Development of Prediction Models With Machine Learning

For the development of machine learning–based prediction models, we divided the prediction cohort into training (two-thirds of the animals) and testing (one-third of the animals). For each predictor, we calculated the median absolute error and reported the average as prediction error (PE). Only PEs from the test cohort were used for accuracy analysis. PE measures the difference between predicted and actual performance. Small errors indicate high accuracy. To determine which anatomic regions affect MRI-based residual deficit prediction accuracy, the out-of-bag predictor importance for each anatomic region in the segmented MRI was calculated (details in Supplemental Material and Methods).24 The prediction models were uploaded and are available at github.com/major-s/mouse-mcao-outcome-predictor.

Statistics

Methods for statistics are described in the Supplemental Material and Methods.

Ethics Approval

All animal procedures were approved by and performed in concordance with local authorities (Landesamt für Gesundheit und Soziales) under license numbers G0197/12, G005/16, G0057/16, G0119/16, GG254/16, G0157/17, and G0343/17.

RESULTS

Occlusion Time, Surgeon, and Genotype Influence Heterogeneity of Stroke Volumes

Figure 1A through 1C show stroke incidence maps and edema-corrected lesion volumes for the 3 occlusion times. After 30 minutes, the mean lesion volume was 14.6 mm3 (95% CI, 11.6–17.6), 31.9 mm3 (26.0–37.9) after 45 minutes, and 50.2 mm3 (45.2–55.2) after 60 minutes (Figure 1D). Volumes differed across occlusion times (χ2=115.0; P<0.0001). The 30-minute MCAO group had a 72.7% (57.4–88.0) deviation from the group mean, the 45-minute group 58.1% (48.3–68.0), and the 60-minute group 44.7% (38.9–50.6; Figure 1E). Deviation from group mean differed significantly among the 3 groups (χ2=14.7; P=0.0006). The difference in deviation from group mean between the 30- and 60-minute MCAO groups (Z=3.5; P=0.002) and the 45- and 60-minute groups (Z=2.85; P=0.013) was statistically significant. Linear regression model demonstrated that the occlusion time (F=143.7; P<0.0001) but not the surgeon (F=0.8; P=0.5) affected stroke lesion volumes in 291 C57Bl/6 mice (Figure S3). We further examined the effects of genotype and surgeon on stroke lesion volumes in the prediction cohort. Linear regression model indicated that genotype (F=2.2; P=0.008) and surgeon (F=9.0; P=0.003) affected stroke volume.

MRI Characterization and Lesion Topology in Prediction and Replication Cohort

T2-weighted MR images were coregistered on the Allen Mouse Brain Atlas to evaluate lesion topology (segmented MRI) and incidence maps were generated (Figure S4A and S4B). Quantitative evaluation of the hyperintense lesion was performed on 536 regions and lesion volume in each region was expressed in percent of the total region (Table S3). Lesion volumes and topological characteristics varied slightly between the prediction and replication cohort (Figure S4C and S4D). Further, incidence maps and lesion volumes were calculated for the functional subgroups (Figure S5) and more details can be found in the Supplemental Material and Methods.

A Modified Staircase Test Can Assess Early and Long-Term Functional Outcome in Mice After Stroke

Performance in the prediction cohort peaked after ≈7 days (Figure 3A and 3B). Baseline performance of the prediction cohort is reported in the Supplemental Material and Methods. In the phase of the subacute deficit, the left (nonparetic) side showed 61.6% (56.1–67.1) performance, while the right (paretic) side showed 37.6% (31.9–43.3; Figure 3C). In sham animals, performance on nonparetic side was 78.4% (63.5–93.3) and 72.2 (57.3–87.2) on the paretic side. Mixed-effects analysis exhibited a side-group interaction (F[1164]=4.2; P=0.0423). MCAO mice performed significantly worse than sham on the paretic paw (least square mean difference [MD], 34.6pp; t=4.1; df=328; P=0.0001). Nonparetic sides did not differ between groups (least square MD, 16.8 pp; t=2.0; df=328; P=0.0937). To demonstrate side-specificity of the motor-functional deficit in MCAO mice, paired t tests were performed on paretic and nonparetic paws. The paretic paw performed significantly worse (MD, 24.0%; 18.3–29.7; t=8.4; df=147; P<0.0001; Figure S7A). In the period of residual deficit, the performance of nonparetic paw in MCAO animals was 95.4% (89.5–101.2) and 60.9% (54.9–66.8; Figure 3D) on the paretic side. Sham animals showed a performance of 92.4% (81.8–103.0) on the nonparetic side and 91.6% (78.5–104.7) on the paretic side. Mixed-effects analysis demonstrated side-group interaction (F[1164]=9.4; P=0.0026). MCAO animals showed worse paretic paw performance compared with sham (least square MD, 30.7 pp; t=3.5; df=328; P=0.0011). Nonparetic sides did not differ significantly (least square MD, 3.0 pp; t=0.4; df=328; P=0.9302). Finally, we analyzed the effects of the general well-being (neuroscore, weight loss on day 2) of the animals on functional outcome and found poor correlation to the degree of either subacute or residual deficit and worse prediction accuracies compared with imaging- or performance-based prediction (Figures S8 through S11).25

Figure 3.

Figure 3.

Assessing the sensorimotor deficit after stroke. A, Nonparetic left and B, paretic right paw performance over time, bars depict 95% CI. Performance on days −7 to −1 were averaged and taken as a baseline (first black rectangle). Data from days 2 to 6 were summarized and referred to as subacute deficit (second rectangle) while data from days 12 to 21 are referred to as residual deficit (third rectangle). C, Subacute and D, residual deficit of the nonparetic and paretic paw in sham (brown plot) and middle cerebral artery occlusion (MCAO; blue plot) animals. Dashed lines indicate medians, dotted line indicate quartiles. Mixed-effects model followed by Šidák post hoc tests for multiple comparison were used to test effect of group and side.

Prediction of Subacute Deficit From Acute MR Imaging

Using random forest and early noninvasive MRI data, subacute deficit prediction models were created. PE was 13.3 pp (13.0–13.7; Q1, 6.0; Q3, 31.5; interquartile range [IQR], 25.5) for the lesion volume and 16.2 pp (16.0–16.4; Q1, 8.0; Q3, 29.2; IQR, 21.1) for the segmented MRI (Figure 4A and 4B). Unpaired, two-tailed t test revealed that PE was significantly smaller when using the lesion volume as a predictor compared with the segmented MRI (MD, 2.9 pp [2.6–3.3]; t=16.6; df=98; P<0.0001).

Figure 4.

Figure 4.

Comparison of predictors of the subacute and residual deficit. A and B, Comparison of the prediction accuracy of the 2 predictors of the subacute deficit. Averaged Q1, median, Q3, and interquartile range (IQR) were calculated using the 50 independent models for each predictor. Dashed lines indicate median, dotted lines indicate quartiles. Unpaired t test was used to compare prediction error (PEs). Small PE indicates small deviation between predicted and achieved performance. C and D, Comparison of prediction accuracy of the 2 imaging-based parameters and the subacute deficit for the prediction of the residual deficit. Again, 1-way ANOVA followed by Tukey test for multiple comparison was used. MRI indicates magnetic resonance imaging.

Prediction of Residual Deficit From Subacute Deficit and MRI

PE was 18.90 pp (18.70–19.09; Q1, 8.04; Q3, 29.09; IQR, 21.05) for the subacute deficit, 22.68 pp (22.46–22.90; Q1, 14.07; Q3, 41.84; IQR, 27.77) for the segmented MRI and 25.95 pp (25.71–26.29; Q1, 12.21; Q3, 41.25; IQR, 29.04) for the lesion volume (Figure 4C and 4D). One-way ANOVA showed predictors differed significantly (F[2147]=1032; P<0.0001). After multiple comparisons were corrected using Tukey test, the PE of the subacute deficit was significantly smaller than the PE of the segmented MRI (MD, 3.79 pp [4.15–3.42]; P<0.0001) and the lesion volume (MD, 7.06 pp [7.43–6.69]; P<0.0001). Segmented MRI outperformed the prediction accuracy of the lesion volume (MD, 3.28 pp [3.65–2.91]; P<0.0001).

Identification of Anatomic Regions Associated With Long-Term Outcome

Using out-of-bags, each MRI region’s contribution to long-term outcomes was quantified. 81 locations had above-zero out-of-bag predictor importance, contributing more than random data (Figure 5A and 5B; Table S4). Among those were the caudoputamen, primary somatosensory area, external capsule, and corticospinal tract. Next, random forest models were trained using increasing numbers of MRI regions, starting with the most important. The models’ PE gradually reduced with more MRI regions included (Figure 5C). However, PE rapidly plateaued, and adding more regions only minimally increased accuracy. After the inclusion of the first 14 anatomic regions, PE displayed a relevant local minimum (22.9 pp).

Figure 5.

Figure 5.

Identification of anatomic regions that drive imaging-based prediction of functional outcome. A, Coronal slices depicting the importance of the 81 regions calculated from our random forest models (RF). Only regions with an importance level >0 are shown B, Visualization of the importance level of the 14 anatomic regions that were included in the prediction model that resulted in a local minimum of prediction error (PE). C, Prediction of residual deficit was performed using an increasing number of regions in the segmented magnetic resonance imaging. PE as well as maximum and minimum PE are depicted. Dashed lines at x=14 and y=21.5 denote the first local minimum of PE when including only 14 regions.

Validation in an Independent Cohort

Finally, we tested our prediction models on the independent replication cohort. Imaging and behavioral analyses are included in Supplemental Material and Methods. For prediction of subacute deficit using lesion volume, mean PE was 23.2 pp (22.9–23.5) and 20.0 pp (19.7–20.3) using segmented MRI (Figure 6A and 6B). In contrast to results in the prediction cohort, the segmented MRI showed significantly smaller PEs compared with lesion volume (MD, 3.2 pp [3.6–2.8]; t=14.9; df=98; P<0.0001). The PE for the residual deficit using the subacute deficit was 10.8 pp (10.3–11.2), 23.0 pp (22.2–23.8) for the segmented MRI, and 23.9 pp (23.4–24.5) for the lesion volume (Figure 6C and 6D). One-way ANOVA showed predictors differed significantly (F[2147]=19.0; P<0.0001). PE of the subacute deficit was significantly smaller than the PE of the segmented MRI (MD, 12.2 pp [13.2–11.2]; P<0.0001) and the lesion volume (MD, 13.2 pp [14.2–12.2]; P<0.0001). Segmented MRI provided slightly smaller PE than lesion volume (MD, 1.0 pp [2.0–0.1]; P=0.0668). Finally, we tested the prediction accuracy when including an increasing number of anatomic regions. Only regions we had identified to be relevant for the prediction of the residual deficit in our prediction cohort were included. Here, lesion percentage in the caudoputamen alone yielded lower PEs than the entire segmented MRI or lesion volume. When adding regions, PEs spiked and then slowly decreased. After incorporating 14 regions, we again found a local minimum of PE (Figure 6E), replicating our finding from the prediction cohort.

Figure 6.

Figure 6.

Validation of prediction models in an independent cohort. A and B, Comparison of the prediction accuracy of the 2 predictors of the subacute deficit. Dashed lines indicate median, and dotted lines indicate quartiles. Unpaired t test was used to compare prediction errors (PEs). While PEs were comparable to the ones seen in our prediction cohort, subacute deficit was best predicted when using the segmented magnetic resonance imaging (MRI). C and D, Comparison of prediction accuracy of the 2 imaging-based parameters and the subacute deficit for the prediction of the residual deficit. Again, the subacute deficit was the best predictor of the residual deficit, followed by the segmented MRI and the lesion volume. E, Analogous to the prediction cohort, prediction of residual deficit was performed using an increasing number of regions in the segmented MRI. Dashed lines at x=14 and y=21.5 denote the local minimum of PE that had previously been observed in the prediction cohort.

DISCUSSION

In this retrospective study, we systematically compared different predictors of subacute and long-term functional outcomes after stroke in a large cohort of mice. We worked with a cohort that underwent 45 minutes of MCAO and included different genotypes and 2 different surgeons, thereby embracing the resulting heterogeneity of stroke volumes and functional outcomes. This step widens the dynamic range of our prediction tool and integrates translationally relevant levels of variability into our experimental design. Although these levels are not inherent to the MCAO model if performed in young adult, male C57Bl/6 wildtype mice, the selected approach expands the applicability in studies with heterogeneity of various sources. We used MRI-derived parameters such as the lesion volume and lesion topology, as well as early functional testing to predict poststroke performance in a pellet reaching task. Studies with large sample sizes have transformed preclinical MRI research and corresponding open data repositories have proven extremely valuable, for example, in the context of robust functional networks measured by resting-state fMRI.26 Predictors of poststroke outcomes, such as imaging-derived parameters or functional data, have previously been extensively studied in humans.26,8 While 1 preclinical study on predictors of stroke outcome in a porcine MCAO model exists, evidence on prediction of stroke outcome in mice is scarce, despite rodents being used most commonly as model animals in stroke studies.12,27

A side-specific disability of the forepaw was detected over 3 weeks post-surgery. This indicates that performance was governed by the stroke and not by general sickness behavior. We observed distinct trajectories of functional recovery. The differentiation between recoverees and nonrecoverees is comparable to what can be observed in humans.14 The resemblance to the human recovery process supports the predictive value of stroke modeling in mice and the case for implementing sophisticated behavioral tests that involve a longer phase of training before stroke for assessing long-term impairments.28 One should consider that food restriction during staircase training and testing may affect mortality and lesion volume.29

The lesion volume was the best predictor for the subacute deficit after stroke in mice as measured by PE in our prediction cohort. In the replication cohort, it was outperformed by the segmented MRI, which included information about the lesion topology. While PEs remained low, this is clearly a difference between the 2 cohorts. However, we also observed differences in stroke size and topology between the 2 cohorts. In our replication cohort lesion volumes were bigger and cortical regions accounted for higher proportions of the lesion volume compared with the prediction cohort. While it has previously been stated that lesion volume is a good predictor for acute stroke outcome, our data suggest that in mice with greater cortical lesions taking into account topological information provides a more accurate prediction of early functional outcome.6 For the long-term outcome, in the prediction cohort, the segmented MRI provided significantly better prediction accuracies compared with the lesion volume. This indicates that for functional recovery, lesion topology is more relevant than lesion volume, which again is in line with clinical data.11,12 It should be noted that in the 45-minute MCAO model, lesion volume and lesion topology are not fully independent predictors. Larger ischemic lesions typically include cortical regions while smaller lesions are restricted to subcortical structures. This phenomenon can be observed in the results of our replication cohort, where the difference between the segmented MRI and the lesion volume was only marginal but trending toward significance.

Following up on the influence of lesion topology, we identified anatomic regions that drive imaging-based prediction of the residual deficit. We found that considering only a subset of anatomic regions can explain functional long-term outcomes. Prediction accuracy displayed a local maximum after 14 regions had been added to the prediction model. Among those regions were both, cortical and subcortical regions typically affected by a MCAO stroke, such as the caudoputamen and the primary somatosensory area. Evidently, the caudoputamen is part of the lesion core in the transient occlusion model but also serves as the structure that best predicts the performance. Further, white matter regions such as the corpus callosum and the fiber tracts were shown to be of particular importance for the prediction of long-term outcomes. This is in line with previous studies in humans suggesting that lesions of the corticospinal tract seem to be more important to both early and long-term functional outcome than lesion volume.30,31 These findings could be replicated in our independent replication cohort, where a local minimum of PE could be observed after including the same 14 regions. Again, lesions in the caudoputamen had a major impact on prediction accuracies, outperforming both cortical gray and white matter structures.

Imaging-based predictors offered good estimates of poststroke long-term outcome, but the best predictor of the residual deficit was the degree of the subacute deficit, again confirming studies on predictors of poststroke outcome in humans.32 Last, we found that independent of the chosen model, the prediction accuracy for both the subacute and residual deficits depended on the degree of the deficit. In line with clinical data, we found that severe residual deficits are more difficult to predict than moderate-to-mild deficits.13,14

Study Limitations

We have restricted our prediction tools to 45 minutes of MCAO. Because mice from each study were only operated on by 1 surgeon, the fact that the surgeon was identified as an additional covariate must be considered with caution, as we cannot rule out experiment-specific factors like genetic drifts, time of year, or stress levels. We did not replicate our results using different behavioral readouts. MR imaging data acquired at a later time point might have improved prediction accuracy. However, since mice have a small window of opportunity for interventions, late MR imaging cannot be utilized to guide treatment decisions.33 Manually delineating lesion masks is a further limitation. Although manual lesion segmentation is still the gold standard, well-documented, and publically accessible stroke T1-weighted MRI datasets enable completely automated algorithms for MRI stroke lesion segmentation in humans.3436 T2-weighted MRI in rodent stroke models shows good agreement between manual segmentation by multiple raters and fully automated techniques.37 These investigations advanced lesion segmentation automation.37,38 FLAIR (fluid attenuated inversion recovery) MRI fluid signal nulling might have enhanced CSF-ischemic lesion discrimination (cerebrospinal fluid). However, 24 hours after MCAO, CSF has a stronger RARE (rapid acquisition with relaxation enhancement) signal than the lesion and the lateral ventricles can be accurately identified.23 During the subacute deficit phase, animals’ unspecific sickness behavior may have affected performance. However, our behavioral paradigm revealed stroke-specific motor-functional impairments since we observed side-specific deficits between sham and MCAO, as well as paretic and nonparetic paw. There were no video recordings of the testing sessions to account for changes in kinematics, but the whole behavioral data set has been posted to the repository connected to this study. Finally, while larger data sets usually perform better when training machine learning models, the random forest method works well with sample sizes comparable to our data set, as shown when the method was first introduced and in later comparative studies.24,39

Conclusions

We found and described mouse poststroke outcome predictors. The concordance with clinical data strengthens the usability of mice as model animals in preclinical studies. Prediction of outcomes in intervention studies using noninvasive MRI measurements or motor-functional data should be utilized to compare expected and achieved motor-function outcomes, select mice with particularly poor predicted outcomes, and guide treatment or intervention options. This will assist in bridging the translational gap and increase preclinical rodent stroke research reproducibility.

ARTICLE INFORMATION

Acknowledgments

Janet Lips coordinated experiments and contributed with magnetic resonance imaging (MRI), handling, and behavioral evaluation; Larissa Mosch (behavior, histology), Monika Dopatka (surgeries), and Marco Foddis (surgeries, behavioral assessment, and MRI) all provided excellent technical assistance. Figures were created with biorender.com.

Sources of Funding

Funding was provided by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) to Drs Hoffmann and Harms (project number, 417284923), to Dr Boehm-Sturm (project number, 428869206), to Drs Wenger, Endres, and Harms (project number 424778381–TRR 295) and NeuroCure (EXC−2049 to —390688087) and the German Federal Ministry of Education and Research (BMBF, Center for Stroke Research Berlin 01EO1301) to Drs Dirnagl, Boehm-Sturm, and Harms, and to Dr Boehm-Sturm by the BMBF under the ERA-NET NEURON scheme (01EW1811). Dr Kuffner received funding from the DFG Graduate School 203. This work was supported by Charité3R| Replace–Reduce–Refine and partly by the Fondation Leducq to Drs Endres and Harms. Drs Knab and Eggers received a scholarship from the Berlin Institute of Health, Berlin. Dr Euskirchen, Wenger, and Hoffmann are participants in the Charité Clinical Scientist Program funded by the Charité – Universitätsmedizin Berlin and the Berlin Institute of Health and Dr Wenger is a Freigeist Fellow with support from the Volkswagen Foundation. In addition, this work was supported by DFG (Project number 73500270 and 413848220) and ERA-NET NEURON EBio2, with funds from BMBF 01EW2004 to Dr Dreier. Dr Endres received funding from DFG under Germany´s Excellence Strategy—EXC-2049—390688087, Collaborative Research Center ReTune TRR 295- 424778381, BMBF, DZNE, DZHK, EU, Corona Foundation, and Fondation Leducq. am received funding from DFG and Fondation Leducq.

Disclosures

Dr Endres reports grants from Bayer and fees paid to the Charité from Abbot, Amgen, AstraZeneca, Bayer Healthcare, Boehringer Ingelheim, BMS, Daiichi Sankyo, Sanofi, Novartis, Pfizer, all outside the submitted work. Dr Wenger holds patents on neuromodulation technologies that are unrelated to this work. All remaining authors certify that they have no affiliations with or involvement in any organization or entity with any financial interest or nonfinancial interest in the subject matter or materials discussed in this article. No funding was received to assist with the preparation of this article.

Supplemental Material

ARRIVE Checklist

Expanded Materials and Methods

Supplemental Data

Figures S1–S15

Tables S1–S9

Reference 40

Supplementary Material

str-54-2895-s001.pdf (2.4MB, pdf)
str-54-2895-s002.pdf (113.5KB, pdf)

Nonstandard Abbreviations and Acronyms

IQR
interquartile range
MCAO
middle cerebral artery occlusion
MD
mean difference
MRI
magnetic resonance imaging
PE
prediction error
*

F. Knab and S.P. Koch contributed equally.

P. Boehm-Sturm and C. Harms contributed equally.

Preprint posted on bioRxiv May 15, 2022. doi: https://doi.org/10.1101/2022.05.13.491869.

For Sources of Funding and Disclosures, see page 2903 & 2904.

Contributor Information

Felix Knab, Email: felix.knab@uni-tuebingen.de.

Stefan Paul Koch, Email: stefan.koch@charite.de.

Sebastian Major, Email: sebastian.major@charite.de.

Tracy D. Farr, Email: tracy.farr@nottingham.ac.uk.

Susanne Mueller, Email: susanne.mueller1@charite.de.

Philipp Euskirchen, Email: philipp.boehm-sturm@charite.de.

Moritz Eggers, Email: moritz.eggers@charite.de.

Melanie T.C. Kuffner, Email: melanie.kuffner@charite.de.

Daniel Berchtold, Email: daniel.berchtold@charite.de.

Samuel Knauss, Email: Samuel.Knauss@charite.de.

Jens P. Dreier, Email: jens.dreier@charite.de.

Andreas Meisel, Email: andreas.meisel@charite.de.

Matthias Endres, Email: matthias.endres@charite.de.

Ulrich Dirnagl, Email: ulrich.dirnagl@charite.de.

Nikolaus Wenger, Email: nikolaus.wenger@charite.de.

Christian J. Hoffmann, Email: christian.hoffmann@charite.de.

Philipp Boehm-Sturm, Email: philipp.boehm-sturm@charite.de.

REFERENCES

  • 1.Béjot Y, Bailly H, Durier J, Giroud M. Epidemiology of stroke in Europe and trends for the 21st century. Press Medicale. 2016;45:e391–e398. doi: 10.1016/j.lpm.2016.10.003 [DOI] [PubMed] [Google Scholar]
  • 2.Eriksson M, Norrving B, Terént A, Stegmayr B. Functional outcome 3 months after stroke predicts long-term survival. Cerebrovasc Dis. 2008;25:423–429. doi: 10.1159/000121343 [DOI] [PubMed] [Google Scholar]
  • 3.Farr TD, Wegener S. Use of magnetic resonance imaging to predict outcome after stroke: a review of experimental and clinical evidence. J Cereb Blood Flow Metab. 2010;30:703–717. doi: 10.1038/jcbfm.2010.5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Albers GW, Thijs VN, Wechsler L, Kemp S, Schlaug G, Skalabrin E, Bammer R, Kakuda W, Lansberg MG, Shuaib A, et al. Magnetic resonance imaging profiles predict clinical response to early reperfusion: the Diffusion and Perfusion Imaging Evaluation for Understanding Stroke Evolution (DEFUSE) Study. Ann Neurol. 2006;60:508–517. doi: 10.1002/ana.20976 [DOI] [PubMed] [Google Scholar]
  • 5.Schiemanck SK, Post MWM, Kwakkel G, Witkamp TD, Kappelle LJ, Prevo AJH. Ischemic lesion volume correlates with long-term functional outcome and quality of life of middle cerebral artery stroke survivors. Restor Neurol Neurosci. 2005;23:257–263. [PubMed] [Google Scholar]
  • 6.Lövblad KO, Baird AE, Schlaug G, Benfield A, Siewert B, Voetsch B, Connor A, Burzynski C, Edelman RR, Warach S. Ischemic lesion volumes in acute stroke by diffusion-weighted magnetic resonance imaging correlate with clinical outcome. Ann Neurol. 1997;42:164–170. doi: 10.1002/ana.410420206 [DOI] [PubMed] [Google Scholar]
  • 7.Pinto A, McKinley R, Alves V, Wiest R, Silva CA, Reyes M. Stroke lesion outcome prediction based on MRI imaging combined with clinical information. Front Neurol. 2018;9:1–10. doi: 10.3389/fneur.2018.01060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cheng B, Forkert ND, Zavaglia M, Hilgetag CC, Golsari A, Siemonsen S, Fiehler J, Pedraza S, Puig J, Cho TH, et al. Influence of stroke infarct location on functional outcome measured by the modified rankin scale. Stroke. 2014;45:1695–1702. doi: 10.1161/STROKEAHA.114.005152 [DOI] [PubMed] [Google Scholar]
  • 9.Borich MR, Brodie SM, Gray WA, Ionta S, Boyd LA. Understanding the role of the primary somatosensory cortex: opportunities for rehabilitation. Neuropsychologia. 2015;79:246–255. doi: 10.1016/j.neuropsychologia.2015.07.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Munsch F, Sagnier S, Asselineau J, Bigourdan A, Guttmann CR, Debruxelles S, Poli M, Renou P, Perez P, Dousset V, et al. Stroke location is an independent predictor of cognitive outcome. Stroke. 2016;47:66–73. doi: 10.1161/STROKEAHA.115.011242 [DOI] [PubMed] [Google Scholar]
  • 11.Menezes NM, Ay H, Zhu MW, Lopez CJ, Singhal AB, Karonen JO, Aronen HJ, Liu Y, Nuutinen J, Koroshetz WJ, et al. The real estate factor: quantifying the impact of infarct location on stroke severity. Stroke. 2007;38:194–197. doi: 10.1161/01.STR.0000251792.76080.45 [DOI] [PubMed] [Google Scholar]
  • 12.Scheulin KM, Jurgielewicz BJ, Spellicy SE, Waters ES, Baker EW, Kinder HA, Simchick GA, Sneed SE, Grimes JA, Zhao Q, et al. Exploring the predictive value of lesion topology on motor function outcomes in a porcine ischemic stroke model. Sci Rep. 2021;11:1–15. doi: 10.1038/s41598-021-83432-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zarahn E, Alon L, Ryan SL, Lazar RM, Vry MS, Weiller C, Marshall RS, Krakauer JW. Prediction of motor recovery using initial impairment and fMRI 48 h poststroke. Cereb Cortex. 2011;21:2712–2721. doi: 10.1093/cercor/bhr047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Vliet R, Selles RW, Andrinopoulou E, Nijland R, Ribbers GM, Frens MA, Meskers C, Kwakkel G. Predicting upper limb motor impairment recovery after stroke: a mixture model. Ann Neurol. 2020;87:383–393. doi: 10.1002/ana.25679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Weber RZ, Mulders G, Kaiser J, Tackenberg C, Rust R. Deep learning-based behavioral profiling of rodent stroke recovery. BMC Biol. 2022;20:232. doi: 10.1186/s12915-022-01434-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lyden PD, Bosetti F, Diniz MA, Rogatko A, Koenig JI, Lamb J, Nagarkatti KA, Cabeen RP, Hess DC, Kamat PK, et al. ; SPAN Investigators. The stroke preclinical assessment network: rationale, design, feasibility, and stage 1 results. Stroke. 2022;53:1802–1812. doi: 10.1161/STROKEAHA.121.038047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dirnagl U, Group Members of the MCAO-SOP. Standard operating procedures (SOP) in experimental stroke research: SOP for middle cerebral artery occlusion in the mouse. Nat Preced. 2010;202213:1–14. doi: 10.1038/npre.2010.3492.2 [Google Scholar]
  • 18.Taninishi H, Jung JY, Izutsu M, Wang Z, Sheng H, Warner DS. A blinded randomized assessment of laser Doppler flowmetry efficacy in standardizing outcome from intraluminal filament MCAO in the rat. J Neurosci Methods. 2015;241:111–120. doi: 10.1016/j.jneumeth.2014.12.006 [DOI] [PubMed] [Google Scholar]
  • 19.Morais A, Locascio JJ, Sansing LH, Lamb J, Nagarkatti K, Imai T, Van Leyen K, Aronowski J, Koenig JI, Bosetti F, et al. ; SPAN Investigators. Embracing heterogeneity in the multicenter Stroke Preclinical Assessment Network (SPAN) trial. Stroke. 2023;54:620–631. doi: 10.1161/STROKEAHA.122.040638 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Leithner C, Füchtemeier M, Jorks D, Mueller S, Dirnagl U, Royl G. Infarct volume prediction by early magnetic resonance imaging in a murine stroke model depends on ischemia duration and time of imaging. Stroke. 2015;46:3249–3259. doi: 10.1161/STROKEAHA.114.007832 [DOI] [PubMed] [Google Scholar]
  • 21.Baird AL, Meldrum A, Dunnett SB. The staircase test of skilled reaching in mice. Brain Res Bull. 2001;54:243–250. doi: 10.1016/s0361-9230(00)00457-3 [DOI] [PubMed] [Google Scholar]
  • 22.Bamford JM, Sandercock PAG, Wariow CP, Slattery J. Interobserver agreement for the assessment of handicap in stroke patients: to the editor. Stroke. 1989;20:828. doi: 10.1161/01.STR.20.6.828 [DOI] [PubMed] [Google Scholar]
  • 23.Koch S, Mueller S, Foddis M, Bienert T, von Elverfeldt D, Knab F, Farr TD, Bernard R, Dopatka M, Rex A, et al. Atlas registration for edema-corrected MRI lesion volume in mouse stroke models. J Cereb Blood Flow Metab. 2017;39:313–323. doi: 10.1177/0271678X17726635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Breiman L. Random forests. Mach Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324 [Google Scholar]
  • 25.Donath S, An J, Lee SLL, Gertz K, Datwyler AL, Harms U, Müller S, Farr TD, Füchtemeier M, Lättig-Tünnemann G, et al. Interaction of ARC and DAXX: a novel endogenous target to preserve motor function and cell loss after focal brain ischemia in mice. J Neurosci. 2016;36:8132–8148. doi: 10.1523/JNEUROSCI.4428-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Grandjean J, Canella C, Anckaerts C, Ayranci G, Bougacha S, Bienert T, Buehlmann D, Coletta L, Gallino D, Gass N, et al. Common functional networks in the mouse brain revealed by multi-centre resting-state fMRI analysis. Neuroimage. 2020;205:116278. doi: 10.1016/j.neuroimage.2019.116278 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Casals JB, Pieri NCG, Feitosa MLT, Ercolin ACM, Roballo KCS, Barreto RSN, Bressan FF, Martins DS, Miglino MA, Ambrósio CE. The use of animal models for stroke research: a review. Comp Med. 2011;61:305–313. doi: 10.1109/TAES.1968.5408692 [PMC free article] [PubMed] [Google Scholar]
  • 28.Corbett D, Carmichael ST, Murphy TH, Jones TA, Schwab ME, Jolkkonen J, Clarkson AN, Dancause N, Wieloch T, Johansen-Berg H, et al. Enhancing the alignment of the preclinical and clinical stroke recovery research pipeline: consensus-based core recommendations from the stroke recovery and rehabilitation roundtable translational working group *. Neurorehabil Neural Repair. 2017;31:699–707. doi: 10.1177/1545968317724285 [DOI] [PubMed] [Google Scholar]
  • 29.Manzanero S, Gelderblom M, Magnus T, Arumugam TV. Calorie restriction and stroke. Exp Transl Stroke Med. 2011;3:1–13. doi: 10.1186/2040-7378-3-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lindenberg R, Renga V, Zhu LL, Betzler F, Alsop D, Schlaug G. Structural integrity of corticospinal motor fibers predicts motor impairment in chronic stroke. Neurology. 2010;74:280–287. doi: 10.1212/WNL.0b013e3181ccc6d9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Feng W, Wang J, Chhatbar PY, Doughty C, Landsittel D, Lioutas V, Kautz SA, Schlaug G. Corticospinal tract lesion load: an imaging biomarker for stroke motor outcomes. Ann Neurol. 2015;78:860–870. doi: 10.1002/ana.24510 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Frankel MR, Morgenstern LB, Kwiatkowski T, Lu M, Tilley BC, Broderick JP, Libman R, Levine SR, Brott T. Predicting prognosis after stroke: a placebo group analysis from the National Institute of Neurological Disorders and Stroke rt-PA stroke trial. Neurology. 2000;55:952–959. doi: 10.1212/wnl.55.7.952 [DOI] [PubMed] [Google Scholar]
  • 33.Zeynalov E, Jones SM, Elliott JP. Therapeutic time window for conivaptan treatment against stroke-evoked brain edema and blood-brain barrier disruption in mice. PLoS One. 2017;12:e0183985–e0183988. doi: 10.1371/journal.pone.0183985 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Liew SL, Lo BP, Donnelly MR, Zavaliangos-Petropulu A, Jeong JN, Barisano G, Hutton A, Simon JP, Juliano JM, Suri A, et al. A large, curated, open-source stroke neuroimaging dataset to improve lesion segmentation algorithms. Sci Data. 2022;9:1–12. doi: 10.1038/s41597-022-01401-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ito KL, Kim H, Liew SL. A comparison of automated lesion segmentation approaches for chronic stroke T1-weighted MRI data. Hum Brain Mapp. 2019;40:4669–4685. doi: 10.1002/hbm.24729 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Verma K, Kumar S, Paydarfar D. Automatic segmentation and quantitative assessment of stroke lesions on MR images. Diagnostics. 2022;12:2055. doi: 10.3390/diagnostics12092055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mulder IA, Khmelinskii A, Dzyubachyk O, de Jong S, Rieff N, Wermer MJH, Hoehn M, Lelieveldt BPF, van den Maagdenberg AMJM. Automated ischemic lesion segmentation in MRI mouse brain data after transient middle cerebral artery occlusion. Front Neuroinform. 2017;11:1–15. doi: 10.3389/fninf.2017.00003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.An J, Wendt L, Wiese G, Herold T, Rzepka N, Mueller S, Koch SP, Hoffmann CJ, Harms C, Boehm-Sturm P. Deep learning-based automated lesion segmentation on mouse stroke magnetic resonance images. Sci Rep. 2023;13:13341. doi: 10.1038/s41598-023-39826-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Fernández-Delgado M, Cernadas E, Barro S, Amorim D. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 2014;15:3133–3181. https://jmlr.csail.mit.edu/papers/volume15/delgado14a/delgado14a.pdf [Google Scholar]
  • 40.Shvedova M, Islam MR, Armoundas AA, Anfinogenova ND, Wrann CD, Atochin DN. Modified middle cerebral artery occlusion model provides detailed intraoperative cerebral blood flow registration and improves neurobehavioral evaluation. J Neurosci Methods. 2021;358:109179. doi: 10.1016/j.jneumeth.2021.109179 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

str-54-2895-s001.pdf (2.4MB, pdf)
str-54-2895-s002.pdf (113.5KB, pdf)

Articles from Stroke are provided here courtesy of Wolters Kluwer Health

RESOURCES