Skip to main content
JAMA Network logoLink to JAMA Network
. 2023 Mar 1;6(3):e231070. doi: 10.1001/jamanetworkopen.2023.1070

Development and Validation of a Scoring System for Hemorrhage Risk in Brain Arteriovenous Malformations

Yu Chen 1,2, Heze Han 1,2, Xiangyu Meng 3, Hengwei Jin 4, Dezhi Gao 5, Li Ma 1,2, Ruinan Li 1,2, Zhipeng Li 1,2, Debin Yan 6, Haibin Zhang 1,2, Kexin Yuan 1,2, Ke Wang 1,2, Yukun Zhang 7, Yang Zhao 7, Weitao Jin 7, Runting Li 1,2, Fa Lin 1,2, Xiaofeng Chao 8, Zhengfeng Lin 9, Qiang Hao 1,2, Hao Wang 1,2, Xun Ye 1,2, Shuai Kang 1,2, Youxiang Li 4, Shibin Sun 5, Ali Liu 5, Shuo Wang 1,2, Yuanli Zhao 1,2, Xiaolin Chen 1,2,
PMCID: PMC9978947  PMID: 36857052

Key Points

Question

Can imaging data be used to derive a scoring system to estimate long-term risk of hemorrhage of unruptured brain arteriovenous malformations (AVMs)?

Findings

In this prognostic study of 3962 patients with AVMs, 4 factors (ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage) were identified to build a scoring system to predict the rupture risk of AVMs. The scoring system had excellent performance in both the derivation and multicenter external validation cohorts and high discrimination in risk stratification of patients in the prospective follow-up cohort receiving conservative treatment management.

Meaning

These findings suggest that the scoring system is a reliable tool that can be used to reduce unnecessary interventions or unexpected AVM ruptures.

Abstract

Importance

The dilemma between natural rupture risk and adverse outcomes of intervention is of major concern for patients with unruptured arteriovenous malformations (AVMs). The existing risk score for AVM rupture includes factors that are controversial and lacks prospective validation.

Objective

To develop and robustly validate a reliable scoring system to predict the rupture risk of AVMs.

Design, Setting, and Participants

This prognostic study developed a prediction model derived from a single-center cohort (derivation cohort) and validated in a multicenter external cohort (multicenter external validation cohort) and a cohort of patients receiving conservative treatment management (conservative treatment validation cohort). Patients were recruited from a nationwide multicenter prospective collaboration registry in China. A total of 4135 patients were enrolled in the registry between August 1, 2011, and September 1, 2021. Of those, 3962 patients were included in the study (3585 in the derivation cohort and 377 in the multicenter external validation cohort); 1028 patients from the derivation cohort who had time-to-event data and prerupture imaging results were included in the conservative treatment validation cohort. Data were analyzed from March 10 to June 21, 2022.

Main Outcomes and Measures

A scoring system was developed based on risk factors identified from a literature review and a robust selection process. Patients were stratified into different risk groups based on scores to calculate hemorrhage-free probability in future years, and Kaplan-Meier curves were plotted to visualize risk stratification. Receiver operating characteristic curves were used to assess the discrimination of models. Univariable analyses (logistic regression analysis for descriptive data and Cox regression analysis for survival data) were used to compare baseline information and assess bias.

Results

Among 3962 patients (2311 men [58.3%]; median [IQR] age, 26.1 [14.6-35.5] years), 3585 patients (2100 men [58.6%]; median [IQR] age, 25.9 [14.6-35.0] years) were included in the derivation cohort, and 377 patients (211 men [56.0%]; median [IQR] age, 26.4 [14.5-39.2] years) were included in the multicenter external validation cohort. Thirty-six hemorrhages occurred over a median (IQR) follow-up of 4.2 (0.3-6.0) years among 1028 patients in the conservative treatment validation cohort. Four risk factors were used to develop the scoring system: ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage (VALE). The VALE scoring system performed well in all 3 cohorts, with areas under the receiver operating characteristic curve of 0.77 (95% CI, 0.75-0.78) in the derivation cohort, 0.85 (95% CI, 0.81-0.89) in the multicenter external validation cohort, and 0.73 (95% CI, 0.65-0.81) in the conservative treatment validation cohort. The 10-year hemorrhage-free rate was 95.5% (95% CI, 87.1%-100%) in the low-risk group, 92.8% (95% CI, 88.8%-97.0%) in the moderate-risk group, and 75.8% (95% CI, 65.1%-88.3%) in the high-risk group; the model discrimination was significant when comparing these rates between the high-risk group and the low- and moderate-risk groups (P < .001 for both comparisons).

Conclusions and Relevance

In this prognostic study, the VALE scoring system was developed to distinguish rupture risk among patients with AVMs. The stratification of unruptured AVMs may enable patients with low risk of rupture to avoid unnecessary interventions. These findings suggest that the scoring system is a reliable and applicable tool that can be used to facilitate patient and physician decision-making and reduce unnecessary interventions or unexpected AVM ruptures.


This prognostic study uses data from a nationwide multicenter prospective registry in China to develop a scoring system that predicts the risk of hemorrhage among patients with brain arteriovenous malformations and validates the score in external multicenter and clinical practice cohorts.

Introduction

Brain arteriovenous malformations (AVMs) are tangles of abnormally dilated vessels without intervening capillaries, which represent high-flow and low-resistance hemodynamic features due to direct arteriovenous shunting.1,2 Brain AVMs can lead to intracranial hemorrhage, with an annual rupture risk of 1% to 3% per year if left untreated.3,4 Although several grading scales have been proposed and are widely used to evaluate operability or treatment outcomes,5,6,7,8 it remains debatable whether interventional therapy can benefit patients with AVMs.9,10,11,12 Therefore, these options should be weighed against the patient’s natural hemorrhagic risks, which may vary widely across AVMs with different features.13

The R2eD score developed by Feghali et al14 was the first scale to estimate the rupture risk of AVMs and was externally validated by Bird et al.15 This scoring system identified 5 risk factors from statistical models (race [non-White], exclusive deep location, small AVM size, exclusive deep drainage, and monoarterial feeding). However, this scale should be applied and interpreted with caution due to its cross-sectional data–based selection of risk factors and its lack of prospective validation. Thus, clinicians and patients need a more reliable prediction and risk stratification tool for AVM rupture to inform expectations of hemorrhage-free outcomes and avoid unnecessary interventional treatment. Because the necessary imaging data were easily accessible for diagnosing AVMs, this study aimed to develop and validate a new scoring system combining evidence-based and statistically significant imaging features for the prediction of long-term hemorrhagic risk in patients with unruptured AVMs.

Methods

Study Design and Patients

In this prognostic study, we developed a prediction model for AVM rupture risk using prospectively collected data from a single-center cohort (derivation cohort) between August 1, 2011, and September 1, 2021. Data were analyzed from March 10 to June 21, 2022. Patients in this study were recruited from the MATCH (Multimodality Treatment for Brain Arteriovenous Malformation in Mainland China)16 nationwide multicenter prospective collaboration registry. The institutional review board of Beijing Tiantan Hospital approved this study, and patients who participated in the MATCH registry provided written informed consent at hospital admission. This study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline for prognostic studies.17

Validation of this model consisted of 2 steps. A multicenter cohort involving patients from 8 provinces in mainland China was used to validate the model externally (multicenter external validation cohort). To further validate the generalizability of the model in a clinical practice scenario, we used data from patients with unruptured AVMs who were receiving conservative treatment management for more than 1 month (conservative treatment validation cohort). Only those with prerupture or pretreatment imaging data were eligible for inclusion in the study. The follow-up duration was censored at the date of hemorrhage or first treatment. The diagnosis of AVM was confirmed with digital subtraction angiography and/or magnetic resonance imaging (MRI). Those without essential imaging data for feature extraction and those diagnosed with dural AVMs, extracranial malformations, cavernous malformations, or hereditary hemorrhagic telangiectasia were excluded. The patient inclusion flowchart is shown in Figure 1.

Figure 1. Flowchart of Patient Selection for Scoring System Development and Validation.

Figure 1.

MATCH indicates Multimodality Treatment for Brain Arteriovenous Malformation in Mainland China.

aPatients were excluded if arteriovenous malformations (AVMs) were ruptured at admission unless they had received conservative treatment management more than 1 month after diagnosis.

bAmong patients receiving conservative treatment management, 793 were retrospectively chosen for inclusion (ie, patients were diagnosed elsewhere and received interventional treatment in our institution more than 1 month after diagnosis), and 235 were prospectively followed up (ie, patients chose conservative treatment management after they were admitted to our institution).

Data Collection and Variables of Interest

The comprehensive protocol for data quality management is available in eMethods 1 in Supplement 1. Demographic information, including sex, age, and modified Rankin Scale Score (which measures degree of disability or dependence after a stroke) at hospital admission, was recorded. Radiographic variables describing AVM morphological characteristics, including nidus location, size, diffuseness, venous drainage (drainage patterns, stenosis, and venous aneurysms), feeding arteries (number, dilation, multiple sources, and perforating arteries), associated aneurysm, and hemorrhagic presentation, were collected. Radiological information was determined via digital subtraction angiography and MRI.

The nidus location was regarded as deep if the lesion exclusively involved the brain stem, cerebellum, basal ganglia, thalamus, corpus callosum, or insular lobe. The definition of eloquent regions (ie, sensory, motor, language, or visual cortex; hypothalamus or thalamus; internal capsule; brain stem; cerebellar peduncles [superior, middle, or inferior]; and deep cerebellar nuclei) was based on the Spetzler-Martin Grading Scale.5 The size of AVMs was dichotomized into small and large based on whether the maximum nidal diameter was less than 3 cm or 3 cm or greater. Ventricular system involvement was determined via MRI based on whether the nidal border was adjacent to the cerebral ventricular system. Feeding arteries were considered dilated when their diameter was at least twice that of the same blood vessel segments. Venous aneurysm was defined as the focal aneurysmal dilation of the proximal drainage vein.18 Hemorrhagic presentation was defined as hemorrhage that could be ascribed to AVM rupture before or at admission.

Model Performance and Validation

We used receiver operating characteristic curves (ROCs) to assess the discrimination of models in the derivation cohort and the 2 validation cohorts. An area under the receiver operating characteristic curve (AUROC) higher than 0.70 indicated a good prediction of rupture. The new model derived from the derivation cohort was evaluated by performing ROC analyses separately in these 3 cohorts. A calibration curve was plotted to investigate the calibration ability of the new model. A perfect prediction was defined as a line with a slope of 1 and an intercept of 0 in the calibration plot. Kaplan-Meier curves were plotted for the conservative treatment validation cohort to visualize risk stratification according to score. Survival analyses were conducted using the rms, version 6.3.0,19 and survival, version 3.4.0,20 packages for R software (R Foundation for Statistical Computing). Parameters in the R2eD score (except for race and ethnicity because the study population included only Chinese patients) were used to fit the model in our training set (data from the derivation cohort), and the DeLong test14 was conducted to compare the ROCs between the R2eD-based model and our model. Data from the 3 cohorts were also used for external validation of the R2eD score to investigate its generalizability beyond its majority White population.

Statistical Analysis

We used R software, version 4.0.3 (R Foundation for Statistical Computing), to perform statistical data analyses. Statistical significance was set at 2-tailed P < .05. All data were divided into 2 groups according to rupture presentation. For continuous values, means with SDs or medians with IQRs were reported for normal and nonnormal distribution data. The proportion of each categorical variable was also recorded. We compared the baseline information of the derivation cohort between groups using univariable analysis, with a t test or Mann-Whitney U test used for continuous data and a χ2 test used for categorical data, as appropriate. Missing data were assumed to be missing at random. Missing values were imputed using the missForest package for R software, version 1.5.21 All available data from the derivation database were used to meet the 20 events per variable rule to maximize the power of our results.

We identified potential predictors based on background knowledge and findings from systematically reviewed literature that investigated these factors using a prospective follow-up design. Odds ratios (ORs) and hazard ratios (HRs) and 95% CIs for these variables were calculated. The most frequently mentioned variables in the published studies were considered central predictors and were fixed in models during the variable selection. We first fitted a global model with all variables, then used backward stepwise regression with the Akaike information criterion as the stopping criterion. We repeated the process in 1000 bootstrap resamples to investigate the stability of selected variables.22 The number of predictors was limited to 4 to develop a scoring scale that could be easily applied in clinical practice. The predictors in the final model were the combination of 2 clinically acknowledged factors and 2 statistically significant and stable factors. Multicollinearity was evaluated using the variance inflation factor (with >3 indicating high collinearity). We also used β coefficients to assign points for these predictors to establish a scoring system that could be conveniently used in clinical practice. We plotted the predicted probabilities of rupture risk using the new model (y-axis) and the total scores from the new grading system (x-axis). A description of the probability calculation is provided in eMethods 2 in Supplement 1.

Results

Patients and AVM Characteristics

A total of 4135 patients with AVMs were enrolled in the MATCH multicenter prospective registry. After careful screening, the final analysis included 3962 patients (2311 men [58.3%] and 1651 women [41.7%]; median [IQR] age, 26.1 [14.6-35.5] years); of those, 3585 patients (2100 men [58.6%] and 1485 women [41.4%]; median [IQR] age, 25.9 [14.6-35.0] years) from Beijing Tiantan Hospital (the MATCH registry sponsor) were in the derivation cohort, and 377 patients (211 men [56.0%] and 166 women [44.0%]; median [IQR] age, 26.4 [14.5-39.2] years) were in the multicenter external validation cohort. Hemorrhage occurred in 2189 patients (61.1%) in the derivation cohort and 225 patients (59.7%) in the multicenter external validation cohort. Missing values in the derivation cohort were less than 3% (eTable 1 in Supplement 1). Other baseline characteristics are shown in Table 1. The univariable analysis of the derivation cohort revealed that most of the investigated angioarchitectural features significantly differed between patients with AVM rupture (n = 2189) vs those without AVM rupture (n = 1396) (eTable 2 in Supplement 1). For example, AVMs with exclusively deep location and ventricular system involvement were associated with more hemorrhagic presentations (exclusively deep location: 693 patients [31.7%] with rupture vs 200 patients [14.3%] without rupture; P < .001; ventricular system involvement: 1477 patients [67.5%] with rupture vs 468 patients [33.5%] without rupture; P < .001). Feeding and draining patterns were also associated with AVM rupture (eg, single feeding artery: 889 patients [40.6%] patients with rupture vs 225 patients [16.1%] without rupture; P < .001; exclusively deep drainage: 721 patients [32.9%] with rupture vs 137 patients [9.8%] without rupture; P < .001).

Table 1. Cohort Characteristics.

Characteristic Patients, No./total No. (%)
Derivation cohort (n = 3585)a Validation cohorts
Multicenter external (n = 377) Conservative treatment management (n = 1028)
Sex
Female 1485/3585 (41.4) 166/377 (44.0) 401/1028 (39.0)
Male 2100/3585 (58.6) 211/377 (56.0) 627/1028 (61.0)
Age at diagnosis, median (IQR), y 25.9 (14.6-35.0) 26.4 (14.5-39.2) 27.2 (17.1-37.5)
Hemorrhagic presentation 2189/3585 (61.1) 225/377 (59.7) 36/1028 (3.5)
Seizure 853/3585 (23.8) 81/377 (21.5) 415/1028 (40.4)
AVM features
Size, cm
<3 1778/3585 (49.6) 201/377 (53.3) 343/1028 (33.4)
3-6 1456/3585 (40.6) 157/377 (41.6) 517/1028 (50.3)
>6 351/3585 (9.8) 19/377 (5.0) 168/1028 (16.3)
Nidus location
Frontal lobe 912/3585 (25.4) 55/377 (14.6) 341/1028 (33.2)
Temporal lobe 981/3585 (27.4) 85/377 (22.5) 286/1028 (27.8)
Parietal lobe 944/3585 (26.3) 84/377 (22.3) 306/1028 (29.8)
Occipital lobe 733/3585 (20.4) 57/377 (15.1) 226/1028 (22.0)
Cerebellum 339/3585 (9.5) 58/377 (15.4) 61/1028 (5.9)
Brain stem 117/3585 (3.3) 17/377 (4.5) 24/1028 (2.3)
Basal ganglia 390/3585 (10.9) 69/377 (18.3) 55/1028 (5.4)
Thalamus 208/3585 (5.8) 34/377 (9.0) 35/1028 (3.4)
Intraventricular area 154/3585 (4.3) 27/377 (7.2) 29/1028 (2.8)
Insula 72/3585 (2.0) 26/377 (6.9) 18/1028 (1.8)
Exclusively deep location 893/3585 (24.9) 131/377 (34.7) 143/1028 (13.9)
Ventricular system involvement 1945/3585 (54.3) 187/377 (49.6) 330/1028 (32.1)
Eloquent region 2000/3585 (55.8) 252/377 (66.8) 535/1028 (52.0)
Feeding artery
Single feeder 1081/3491 (31.0) 120/377 (31.8) 179/1028 (17.4)
Dilation 1641/3518 (46.6) 224/377 (59.4) 685/1028 (66.6)
Multiple sources 976/3525 (27.7) 98/377 (26.0) 375/1028 (36.5)
Perforating artery 1333/3519 (37.9) 157/377 (41.6) 308/1028 (30.0)
Diffuse nidus 1261/3508 (35.9) 124/377 (32.9) 222/1028 (21.6)
Venous draining
Stenosis 551/3527 (15.6) 89/377 (23.6) 108/1028 (10.5)
Any deep drainage 1415/3528 (40.1) 170/377 (45.1) 305/1028 (29.7)
Exclusively deep drainage 850/3528 (24.1) 107/377 (28.4) 92/1028 (8.9)
Venous aneurysm 565/3519 (16.1) 97/377 (25.7) 320/1028 (31.1)
Aneurysm 592/3510 (16.9) 80/377 (21.2) 157/1028 (15.3)
Spetzler-Martin gradeb
I 562/3528 (15.9) 56/377 (14.9) 157/1028 (15.3)
II 1180/3528 (33.4) 114/377 (30.2) 312/1028 (30.4)
III 1162/3528 (32.9) 126/377 (33.4) 352/1028 (34.2)
IV 494/3528 (14.0) 73/377 (19.4) 151/1028 (14.7)
V 130/3528 (3.7) 8/377 (2.1) 56/1028 (5.4)
Spetzler-Martin combined gradec
II 81/3495 (2.3) 5/377 (1.3) 1/1028 (0.1)
III 306/3495 (8.8) 30/377 (8.0) 37/1028 (3.6)
IV 705/3495 (20.2) 66/377 (17.5) 119/1028 (11.6)
V 946/3495 (27.1) 106/377 (28.1) 250/1028 (24.3)
VI 807/3495 (23.1) 98/377 (26.0) 297/1028 (28.9)
VII 462/3495 (13.2) 55/377 (14.6) 207/1028 (20.1)
VIII 147/3495 (4.2) 14/377 (3.7) 93/1028 (9.0)
IX 37/3495 (1.1) 2/377 (0.5) 24/1028 (2.3)
X 4/3495 (0.1) 1/377 (0.3) 0

Abbreviation: AVM, arteriovenous malformation.

a

The breakdown of missing values per parameter in the derivation cohort is provided in eTable 1 in Supplement 1.

b

Range, I-V, with higher grades indicating greater risk of surgical morbidity and death.

c

Spetzler-Martin grade combined with supplemental Spetzler-Martin grade (which includes only statistically significant variables not already included in the original Spetzler-Martin Grading Scale); range, II-X, with higher grades indicating greater risk of surgical morbidity and death.

A total of 1028 patients from the derivation cohort who had time-to-event data and prerupture imaging results were included in the conservative treatment validation cohort. Among those, 793 patients receiving conservative treatment management (77.1%) were retrospectively chosen for inclusion (retrospective group, comprising patients who were diagnosed elsewhere and received interventional treatment in our institution more than 1 month after diagnosis), and 235 patients were prospectively followed up (prospective follow-up group, comprising patients who chose to receive conservative treatment management after admission to our institution). Thirty-six hemorrhages occurred in the conservative treatment validation cohort (22.9%), with a median (IQR) follow-up duration of 4.2 (0.3-6.0) years. The annual hemorrhage rate for the total cohort was 0.84% (0.81% for the retrospective group and 0.87% for the prospective follow-up group). The overall and subgroup Kaplan-Meier curves are shown in eFigure 1 in Supplement 1. No significant differences were observed.

Development of the New Scoring System

Based on the results of the literature review, we summarized factors associated with AVM rupture that were reported in studies with prospective designs13,23,24,25,26,27,28,29,30,31,32,33,34,35 (eTable 3 in Supplement 1). In addition to previous hemorrhagic history, deep location and deep drainage were the most frequently mentioned factors. These 2 variables were therefore deemed central to our score. Backward stepwise regression analysis was used to select 10 of 12 variables with corresponding coefficients, and a bootstrapping process was used to investigate the stability of these factors. A summary of the findings from this analysis is provided in eTable 4 in Supplement 1. Because we considered 4 variables to be the most appropriate number to include in a practical prediction model, only the top 2 factors (ventricular system involvement and venous aneurysm) were added to the final model along with deep location and exclusively deep drainage. The univariable analyses (logistic regression analysis for descriptive data and Cox regression analysis for survival data) in the 3 study cohorts revealed that the effect sizes of the factors could have heterogeneity in different analyses. Univariable analyses (logistic regression analysis for descriptive data and Cox regression analysis for survival data) revealed that ventricular system involvement (OR, 4.11 [95% CI, 3.57-4.74] for the derivation cohort; OR, 8.82 [95% CI, 5.43-14.34] for the multicenter external validation cohort; and HR, 4.04 [95% CI, 2.02-8.08] for the conservative treatment validation cohort), venous aneurysm (OR, 0.15 [95% CI, 0.13-0.19] for the derivation cohort; OR, 0.09 [95% CI, 0.05-0.15] for the multicenter external validation cohort; and HR, 0.42 [95% CI, 0.17-1.00] for the conservative treatment validation cohort), deep location (OR, 2.77 [95% CI, 2.33-3.30] for the derivation cohort; OR, 2.63 [95% CI, 1.65-4.18] for the multicenter external validation cohort; and HR, 4.39 [95% CI, 2.22-8.70] for the conservative treatment validation cohort), and exclusively deep drainage (OR, 4.51 [95% CI, 3.70-5.50] for the derivation cohort; OR, 6.32 [95% CI, 3.48-11.46] for the multicenter external validation cohort; and HR, 3.60 [95% CI, 1.63-7.96] for the conservative treatment validation cohort) were significant in all 3 cohorts (eTable 6 in Supplement 1).

Information on the newly developed scoring system (VALE, an acronym based on the 4 included risk factors: ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage), with corresponding estimated β coefficients and SEs from the final regression model, is shown in Table 2. One point was assigned to AVMs with deep location (OR, 1.45; 95% CI, 1.17-1.78), and 2 points each were assigned to AVMs with ventricular system involvement (OR, 3.27; 95% CI, 2.79-3.83) and exclusively deep drainage (OR, 2.30; 95% CI, 1.83-2.90). Venous aneurysm was a protective factor for AVM rupture in the study cohort (OR, 0.16; 95% CI, 0.13-0.20); therefore, −4 points were assigned if venous aneurysm presented. An analysis of variance inflation factors revealed no significant collinearity between these variables. The total VALE score ranged from −4 to 5 points. As shown in eFigure 2 in Supplement 1, the predicted probability of rupture increased as the VALE score increased. According to the hemorrhagic probability, we further categorized the VALE score into 3 groups for clinical practicality: low risk (score of less than −2), moderate risk (score of −2 to 1), and high risk (score of greater than 1).

Table 2. VALE Score and Corresponding Estimated Coefficients and SEs in the Final Model.

Acronym Predictor β Coefficient (SE) Points assigneda
V Ventricular system involvement 1.185 (0.081) 2
A Venous aneurysm −1.854 (0.112) −4
L Deep location 0.371 (0.107) 1
E Exclusively deep drainage 0.833 (0.117) 2

Abbreviation: VALE, ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage.

a

The points were originally determined by rounding the logistic regression model coefficients (1 for V, −2 for A, 0.5 for L, and 1 for E). To develop a scoring system with integral values, these points were doubled to form the score on a scale of 10.

Model Performance and Validation

The performance of the VALE score for predicting AVM rupture in the 3 data sets is shown in Figure 2A. The AUROCs were 0.77 (95% CI, 0.75-0.78) in the derivation cohort, 0.85 (95% CI, 0.81-0.89) in the multicenter external validation cohort, and 0.73 (95% CI, 0.65-0.81) in the conservative treatment validation cohort. The calibration curve (Figure 2B) suggested good agreement between observation and prediction. We also performed a subgroup analysis of the conservative treatment validation cohort based on whether patients were in the prospective follow-up group or the retrospective group. Similar high discrimination was observed for both the prospective follow-up group (AUROC, 0.75; specificity, 0.74; sensitivity, 0.61) and the retrospective group (AUROC, 0.75; specificity, 0.77; sensitivity, 0.63) (eTable 5 in Supplement 1). A risk stratification analysis was also conducted for the conservative treatment validation data set, and the Kaplan-Meier curves are shown in Figure 2D. The groups with low and moderate risk shared similar rupture risk in the long-term follow-up (eg, 10-year hemorrhage-free probability: 95.5% [95% CI, 87.1%-100%] for low-risk group vs 92.8% [95% CI, 88.8%-97.0%] for moderate-risk group; P = .12), while the differences between these 2 groups and the high-risk group were significant (eg, 10-year hemorrhage-free probability for high-risk group: 75.8% [95% CI, 65.1%-88.3%]; P < .001 for comparisons with both low- and moderate-risk groups). Hemorrhage-free probability among patients in the conservative treatment validation cohort at 3 years, 5 years, and 10 years is shown in Table 3.

Figure 2. Performance of the VALE Scoring System and Comparison With the R2eD Scoring System.

Figure 2.

In panel C, the area under the receiver operating characteristic curve was 0.579. In panel D, P values were .12 for low risk vs moderate risk, <.001 for low risk vs high risk, and <.001 for moderate risk vs high risk. No hemorrhagic events occurred after 20 years of follow-up; therefore, the survival plot was cut off at 20 years for better data presentation. R2eD indicates race (non-White), exclusive deep location, small arteriovenous malformation, exclusive deep drainage, and monoarterial feeding, and monoarterial feeding (lesion with a single feeding artery); and VALE, ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage.

Table 3. Hemorrhage-Free Probability Stratified by Risk Group in the Conservative Treatment Validation Cohort.

Risk groupa Hemorrhage-free probability by years of follow-up, % (95% CI)
3 5 10
Low 100 (ND) 100 (ND) 95.5 (87.1-100)
Moderate 99.6 (98.8-100) 96.7 (94.3-99.1) 92.8 (88.8-97.0)
High 95.2 (91.5-99.0) 92.6 (87.6-97.8) 75.8 (65.1-88.3)

Abbreviation: ND, not defined.

a

Low risk was defined as a VALE score of less than −2, moderate risk as a VALE score of −2 to 1, and high risk as a VALE score of greater than 1.

We also investigated the performance of a previously published risk score for AVM rupture (the R2eD score) as the largest external validation set for our VALE score.14 The R2eD-based model fitted to the derivation cohort of the present study had good discrimination (AUROC, 0.73; 95% CI, 0.71-0.74), but the VALE model performed better (AUROC, 0.77; 95% CI, 0.76-0.79; DeLong test P < .001). The ROC of the R2eD score fitted to the 3 study cohorts, plotted separately, is shown in Figure 2C. The R2eD score performed well in the derivation and multicenter external validation cohorts, with AUROCs of 0.72 (95% CI, 0.70-0.74) and 0.66 (95% CI, 0.61-0.71), respectively. However, in the conservative treatment validation cohort with survival data, the discrimination of the R2eD score was worse than that of the R2eD score in the other 2 cohorts (AUROC, 0.58; 95% CI, 0.49-0.67). The VALE score better predicted AVM rupture in all cohorts, especially in the cohort simulating a clinical practice scenario (ie, the conservative treatment validation cohort). A description of how to use the VALE score in clinical practice, with detailed examples, is available in eFigure 3 in Supplement 1.

Discussion

This prognostic study identified 4 risk factors associated with AVM rupture (ventricular system involvement, venous aneurysm, deep location, and exclusively deep drainage) and established the first prospectively validated scoring system (the VALE score). The VALE score had good performance in the derivation cohort and was well validated in the multicenter external cohort and the conservative treatment cohort, with AUROCs of 0.85 and 0.73, respectively. Long-term follow-up of the conservative treatment validation cohort enabled the VALE scoring system to predict survival over different periods across risk groups, facilitating decision-making for both clinicians and patients.

The R2eD scoring system includes 5 risk factors identified from statistical models (non-White race, exclusive deep location, small AVM sizes, exclusive deep drainage, and monoarterial feeding).14 The VALE score shares similar purposes and 2 variables with the R2eD score: deep location and exclusively deep drainage. In our model, we selected these 2 acknowledged risk factors based on literature review,13,23,24,25,26,27,28,29,30,31,32,33,34,35 with the aim of avoiding results that may have been misleading because they were statistically significant but lacked clinical relevance. This issue is a limitation of other variables included in the R2eD score; for example, the risk associated with race may instead (or also) be associated with socioeconomic status,14 and the inclusion of small AVM size in risk scores has long been debated, with the prevailing view in opposition to its inclusion.23 The discrepancies caused by selection bias are inevitable when using cross-sectional data. We conducted univariable analyses (logistic regression analysis for descriptive data and Cox regression analysis for survival data) in the 3 study cohorts to further investigate biases. Deep location and exclusively deep drainage remained significant in all 3 cohorts, which was consistent with the findings of previous studies.13,24,25,26,27,28,36 Among other predictors included in the VALE score, ventricular system involvement also had high predictive power in all groups, and venous aneurysm was nonsignificant in the prospective follow-up group within the conservative treatment validation cohort. The heterogeneity of the effect size suggested that selecting variables simply by using descriptive data analysis was unstable and unreliable. A study assessing the performance of AVM rupture prediction models in terms of methods used37 also found unstable results in models derived from data sets with different sample sizes and sampling times.

A better variable selection process can be derived by using prospective cohorts with time-to-event data. However, an international multicenter prospective study,29 which was limited by small sample size and low incidence of rupture, did not identify any angioarchitectural features. Therefore, the present study conducted a standardized variable selection procedure based on bootstrap resampling,22 combining evidence-based risk factors with statistically significant predictors to develop an easily applicable score that was then validated in a conservative treatment cohort with survival data. The good performance in the validation cohort enhanced the scoring system’s applicability in a clinical practice scenario.

When considering potential mechanisms of risk for the angioarchitectural features included in the VALE score, ventricular system involvement may lead to abnormal transmural pressure that makes it difficult to maintain the structural stability of the vessel wall due to lack of support from brain tissue. Venous aneurysms have been found to have a highly protective benefit for rupture despite the inconsistency of findings across studies.26,38 One possible reason may be that the dilated cavity could serve as a buffer zone to reduce impedance of the outflow.

Results from the Randomized Trial of Unruptured Brain Arteriovenous Malformations9 suggested that conservative treatment management was superior to intervention for the prevention of stroke. This conclusion has been controversial because of study design aspects.12,39 Treatment of AVMs is complicated, and individual differences in clinical decision-making should be considered. Many scoring scales have been developed in the AVM field, and they should be applied to different elements of the treatment process. In our study, the VALE score was developed for risk stratification of patients with different features of unruptured AVMs. Patients in different rupture risk groups are typically recommended to receive individualized therapeutic strategies based on assessment by different scales: the Spetzler-Martin and supplementary Spetzler-Martin grades for surgical operability,5,6 the Buffalo and Arteriovenous Malformation Embocure scores for endovascular treatment,8,40 and the Virginia Radiosurgery AVM Scale for radiosurgical procedures.7 For patients in different risk groups, the natural history of AVM rupture and the benefit of different treatment strategies must be weighed separately. Multicenter prospective validation with long-term follow-up using these scoring scales comprehensively is also required to better guide clinical practice.

Limitations

This study has several limitations. First, to serve as an applicable tool in clinical practice, our model only included 4 variables derived from imaging data despite the identification of other significant factors in the selection process and literature review.29,41,42 This restriction to 4 risk factors may, to some extent, reduce the accuracy of the score while increasing its convenience. With deeper investigation of genetic information and radiomic data, a larger number of variables associated with AVM rupture could be uncovered, at which point artificial intelligence could be used to develop a more comprehensive and precise model. Second, the relatively low number of hemorrhagic events in the conservative treatment validation cohort produced a lower annual rupture rate than that of other studies.23,28,29 This difference can partly be ascribed to the exclusion of patients with ruptured AVMs but without prerupture imaging data in the retrospective group within the conservative treatment validation cohort. Selection bias can result in variations in incidence rates but does not change the association between these factors and rupture risk. That said, the hemorrhage-free survival probability derived from this cohort should be interpreted with caution because it might be overestimated. Third, development and validation of the VALE scoring system were performed in a predominantly Chinese population, so its generalizability to other populations remains to be assessed by other international collaborations, such as the Multicenter AVM Research Study.29

Conclusions

In this prognostic study, the VALE scoring system, consisting of 4 central imaging features, was developed to stratify the risk of rupture among patients with AVM and provide estimated hemorrhage-free probability for future years. The stratification of unruptured AVMs may enable patients with low risk of rupture to avoid unnecessary interventions. These findings suggest the VALE scoring system is a reliable and applicable tool that may be used to help identify those who can benefit from early intervention, reduce unnecessary interventions or unexpected AVM ruptures, facilitate decision-making for both clinicians and patients, and guide future clinical research toward more personalized design.

Supplement 1.

eMethods 1. Protocol for Data Quality Management

eMethods 2. Calculation of Predicted Hemorrhage Probability According to VALE Score

eTable 1. Breakdown of Missing Data in the Derivation Cohort

eTable 2. Univariable Analysis of the Derivation Cohort

eFigure 1. Kaplan-Meier Curves of Overall and Subgroup Hemorrhage-Free Probability in the Conservative Treatment Validation Cohort

eTable 3. Literature Review of Factors Associated With Arteriovenous Malformation Rupture

eTable 4. Variable Selection and Bootstrap-Derived Quantities Useful for Assessing Stability

eFigure 2. Hemorrhage Probability in the Derivation Cohort

eTable 5. Subgroup Analysis of VALE Score in the Conservative Treatment Validation Cohort According to the Retrospective and Prospective Nature of the Survival Data

eFigure 3. Illustration Cases for the VALE Score

eTable 6. Univariable Analysis of Angiographic Features in Different Cohorts

eReferences

Supplement 2.

Data Sharing Statement

References

  • 1.Solomon RA, Connolly ES Jr. Arteriovenous malformations of the brain. N Engl J Med. 2017;377(5):498. [DOI] [PubMed] [Google Scholar]
  • 2.Lawton MT, Rutledge WC, Kim H, et al. Brain arteriovenous malformations. Nat Rev Dis Primers. 2015;1:15008. doi: 10.1038/nrdp.2015.8 [DOI] [PubMed] [Google Scholar]
  • 3.Derdeyn CP, Zipfel GJ, Albuquerque FC, et al. ; American Heart Association Stroke Council . Management of brain arteriovenous malformations: a scientific statement for healthcare professionals from the American Heart Association/American Stroke Association. Stroke. 2017;48(8):e200-e224. doi: 10.1161/STR.0000000000000134 [DOI] [PubMed] [Google Scholar]
  • 4.Chen CJ, Ding D, Derdeyn CP, et al. Brain arteriovenous malformations: a review of natural history, pathobiology, and interventions. Neurology. 2020;95(20):917-927. doi: 10.1212/WNL.0000000000010968 [DOI] [PubMed] [Google Scholar]
  • 5.Spetzler RF, Martin NA. A proposed grading system for arteriovenous malformations. J Neurosurg. 1986;65(4):476-483. doi: 10.3171/jns.1986.65.4.0476 [DOI] [PubMed] [Google Scholar]
  • 6.Lawton MT, Kim H, McCulloch CE, Mikhak B, Young WL. A supplementary grading scale for selecting patients with brain arteriovenous malformations for surgery. Neurosurgery. 2010;66(4):702-713. doi: 10.1227/01.NEU.0000367555.16733.E1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Starke RM, Yen CP, Ding D, Sheehan JP. A practical grading scale for predicting outcome after radiosurgery for arteriovenous malformations: analysis of 1012 treated patients. J Neurosurg. 2013;119(4):981-987. doi: 10.3171/2013.5.JNS1311 [DOI] [PubMed] [Google Scholar]
  • 8.Lopes DK, Moftakhar R, Straus D, Munich SA, Chaus F, Kaszuba MC. Arteriovenous malformation embocure score: AVMES. J Neurointerv Surg. 2016;8(7):685-691. doi: 10.1136/neurintsurg-2015-011779 [DOI] [PubMed] [Google Scholar]
  • 9.Mohr JP, Parides MK, Stapf C, et al. ; International ARUBA Investigators . Medical management with or without interventional therapy for unruptured brain arteriovenous malformations (ARUBA): a multicentre, non-blinded, randomised trial. Lancet. 2014;383(9917):614-621. doi: 10.1016/S0140-6736(13)62302-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Al-Shahi Salman R, White PM, Counsell CE, et al. ; Scottish Audit of Intracranial Vascular Malformations Collaborators . Outcome after conservative management or intervention for unruptured brain arteriovenous malformations. JAMA. 2014;311(16):1661-1669. doi: 10.1001/jama.2014.3200 [DOI] [PubMed] [Google Scholar]
  • 11.Mohr JP, Overbey JR, von Kummer R, et al. ; International ARUBA Investigators . Functional impairments for outcomes in a randomized trial of unruptured brain AVMs. Neurology. 2017;89(14):1499-1506. doi: 10.1212/WNL.0000000000004532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Volovici V, Schouten JW, Vajkoczy P, Dammers R, Meling TR. Unruptured arteriovenous malformations: do we have an answer after the final follow-up of ARUBA? a bayesian viewpoint. Stroke. 2021;52(3):1143-1146. doi: 10.1161/STROKEAHA.120.032429 [DOI] [PubMed] [Google Scholar]
  • 13.Stapf C, Mast H, Sciacca RR, et al. Predictors of hemorrhage in patients with untreated brain arteriovenous malformation. Neurology. 2006;66(9):1350-1355. doi: 10.1212/01.wnl.0000210524.68507.87 [DOI] [PubMed] [Google Scholar]
  • 14.Feghali J, Yang W, Xu R, et al. R2eD AVM score. Stroke. 2019;50(7):1703-1710. doi: 10.1161/STROKEAHA.119.025054 [DOI] [PubMed] [Google Scholar]
  • 15.Bird WA, Hendrix P, Bohan C, Goren O, Schirmer CM, Griessenauer CJ. External validation of the R2eD AVM score to predict the likelihood of rupture presentation of brain arteriovenous malformations. Neurosurgery. 2021;89(2):220-226. doi: 10.1093/neuros/nyab108 [DOI] [PubMed] [Google Scholar]
  • 16.Registry of Multimodality Treatment for Brain Arteriorvenous Malformation in Mainland China (MATCH). ClinicalTrials.gov identifier: NCT04572568. Updated May 23, 2022. Accessed January 23, 2023. https://clinicaltrials.gov/ct2/show/NCT04572568
  • 17.Moons KGM, Altman DG, Reitsma JB, et al. Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1-W73. doi: 10.7326/M14-0698 [DOI] [PubMed] [Google Scholar]
  • 18.Chen X, Cooke DL, Saloner D, et al. Higher flow is present in unruptured arteriovenous malformations with silent intralesional microhemorrhages. Stroke. 2017;48(10):2881-2884. doi: 10.1161/STROKEAHA.117.017785 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Harrell FE Jr. rms: regression modeling strategies. Version 6.4-1. R Project. April 22, 2022. Updated January 23, 2023. Accessed November 11, 2022. https://cran.r-project.org/web/packages/rms/index.html
  • 20.Therneau TM, Lumley T, Atkinson E, Crowson C. survival: Survival analysis. Version 3.5-0. R Project. August 9, 2022. Updated January 9, 2023. Accessed November 11, 2022. https://cran.r-project.org/web/packages/survival/index.html
  • 21.Stekhoven DJ, Bühlmann P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112-118. doi: 10.1093/bioinformatics/btr597 [DOI] [PubMed] [Google Scholar]
  • 22.Heinze G, Wallisch C, Dunkler D. Variable selection—a review and recommendations for the practicing statistician. Biom J. 2018;60(3):431-449. doi: 10.1002/bimj.201700067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hernesniemi JA, Dashti R, Juvela S, Väärt K, Niemelä M, Laakso A. Natural history of brain arteriovenous malformations: a long-term follow-up study of risk of hemorrhage in 238 patients. Neurosurgery. 2008;63(5):823-829. doi: 10.1227/01.NEU.0000330401.82582.5E [DOI] [PubMed] [Google Scholar]
  • 24.Itoyama Y, Uemura S, Ushio Y, et al. Natural course of unoperated intracranial arteriovenous malformations: study of 50 cases. J Neurosurg. 1989;71(6):805-809. doi: 10.3171/jns.1989.71.6.0805 [DOI] [PubMed] [Google Scholar]
  • 25.Mine S, Hirai S, Ono J, Yamaura A. Risk factors for poor outcome of untreated arteriovenous malformation. J Clin Neurosci. 2000;7(6):503-506. doi: 10.1054/jocn.2000.0743 [DOI] [PubMed] [Google Scholar]
  • 26.Yamada S, Takagi Y, Nozaki K, Kikuta K, Hashimoto N. Risk factors for subsequent hemorrhage in patients with cerebral arteriovenous malformations. J Neurosurg. 2007;107(5):965-972. doi: 10.3171/JNS-07/11/0965 [DOI] [PubMed] [Google Scholar]
  • 27.Mast H, Young WL, Koennecke HC, et al. Risk of spontaneous haemorrhage after diagnosis of cerebral arteriovenous malformation. Lancet. 1997;350(9084):1065-1068. doi: 10.1016/S0140-6736(97)05390-7 [DOI] [PubMed] [Google Scholar]
  • 28.da Costa L, Wallace MC, Ter Brugge KG, O’Kelly C, Willinsky RA, Tymianski M. The natural history and predictive features of hemorrhage from brain arteriovenous malformations. Stroke. 2009;40(1):100-105. doi: 10.1161/STROKEAHA.108.524678 [DOI] [PubMed] [Google Scholar]
  • 29.Kim H, Al-Shahi Salman R, McCulloch CE, Stapf C, Young WL; MARS Coinvestigators . Untreated brain arteriovenous malformation: patient-level meta-analysis of hemorrhage predictors. Neurology. 2014;83(7):590-597. doi: 10.1212/WNL.0000000000000688 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Crawford PM, West CR, Chadwick DW, Shaw MD. Arteriovenous malformations of the brain: natural history in unoperated patients. J Neurol Neurosurg Psychiatry. 1986;49(1):1-10. doi: 10.1136/jnnp.49.1.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kim H, Sidney S, McCulloch CE, et al; UCSF BAVM Study Project. Racial/ethnic differences in longitudinal risk of intracranial hemorrhage in brain arteriovenous malformation patients. Stroke. 2007;38(9):2430-2437. doi: 10.1161/STROKEAHA.107.485573 [DOI] [PubMed] [Google Scholar]
  • 32.Stefani MA, Porter PJ, Ter Brugge KG, Montanera W, Willinsky RA, Wallace MC. Large and deep brain arteriovenous malformations are associated with risk of future hemorrhage. Stroke. 2002;33(5):1220-1224. doi: 10.1161/01.STR.0000013738.53113.33 [DOI] [PubMed] [Google Scholar]
  • 33.Graf CJ, Perret GE, Torner JC. Bleeding from cerebral arteriovenous malformations as part of their natural history. J Neurosurg. 1983;58(3):331-337. doi: 10.3171/jns.1983.58.3.0331 [DOI] [PubMed] [Google Scholar]
  • 34.Spetzler RF, Hargraves RW, McCormick PW, Zabramski JM, Flom RA, Zimmerman RS. Relationship of perfusion pressure and size to risk of hemorrhage from arteriovenous malformations. J Neurosurg. 1992;76(6):918-923. doi: 10.3171/jns.1992.76.6.0918 [DOI] [PubMed] [Google Scholar]
  • 35.Fults D, Kelly DL Jr. Natural history of arteriovenous malformations of the brain: a clinical study. Neurosurgery. 1984;15(5):658-662. doi: 10.1227/00006123-198411000-00003 [DOI] [PubMed] [Google Scholar]
  • 36.Stefani MA, Porter PJ, Ter Brugge KG, Montanera W, Willinsky RA, Wallace MC. Angioarchitectural factors present in brain arteriovenous malformations associated with hemorrhagic presentation. Stroke. 2002;33(4):920-924. doi: 10.1161/01.STR.0000014582.03429.F7 [DOI] [PubMed] [Google Scholar]
  • 37.Tao W, Yan L, Zeng M, Chen F. Factors affecting the performance of brain arteriovenous malformation rupture prediction models. BMC Med Inform Decis Mak. 2021;21(1):142. doi: 10.1186/s12911-021-01511-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Shakur SF, Liesse K, Amin-Hanjani S, et al. Relationship of cerebral arteriovenous malformation hemodynamics to clinical presentation, angioarchitectural features, and hemorrhage. Neurosurgery. 2016;63(suppl 1):136-140. doi: 10.1227/NEU.0000000000001285 [DOI] [PubMed] [Google Scholar]
  • 39.Amin-Hanjani S. ARUBA results are not applicable to all patients with arteriovenous malformation. Stroke. 2014;45(5):1539-1540. doi: 10.1161/STROKEAHA.113.002696 [DOI] [PubMed] [Google Scholar]
  • 40.Dumont TM, Kan P, Snyder KV, Hopkins LN, Siddiqui AH, Levy EI. A proposed grading system for endovascular treatment of cerebral arteriovenous malformations: Buffalo score. Surg Neurol Int. 2015;6:3. doi: 10.4103/2152-7806.148847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Achrol AS, Pawlikowska L, McCulloch CE, et al. ; UCSF BAVM Study Project . Tumor necrosis factor–alpha-238G>A promoter polymorphism is associated with increased risk of new hemorrhage in the natural course of patients with brain arteriovenous malformations. Stroke. 2006;37(1):231-234. doi: 10.1161/01.STR.0000195133.98378.4b [DOI] [PubMed] [Google Scholar]
  • 42.Weinsheimer S, Kim H, Pawlikowska L, et al. EPHB4 gene polymorphisms and risk of intracranial hemorrhage in patients with brain arteriovenous malformations. Circ Cardiovasc Genet. 2009;2(5):476-482. doi: 10.1161/CIRCGENETICS.109.883595 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1.

eMethods 1. Protocol for Data Quality Management

eMethods 2. Calculation of Predicted Hemorrhage Probability According to VALE Score

eTable 1. Breakdown of Missing Data in the Derivation Cohort

eTable 2. Univariable Analysis of the Derivation Cohort

eFigure 1. Kaplan-Meier Curves of Overall and Subgroup Hemorrhage-Free Probability in the Conservative Treatment Validation Cohort

eTable 3. Literature Review of Factors Associated With Arteriovenous Malformation Rupture

eTable 4. Variable Selection and Bootstrap-Derived Quantities Useful for Assessing Stability

eFigure 2. Hemorrhage Probability in the Derivation Cohort

eTable 5. Subgroup Analysis of VALE Score in the Conservative Treatment Validation Cohort According to the Retrospective and Prospective Nature of the Survival Data

eFigure 3. Illustration Cases for the VALE Score

eTable 6. Univariable Analysis of Angiographic Features in Different Cohorts

eReferences

Supplement 2.

Data Sharing Statement


Articles from JAMA Network Open are provided here courtesy of American Medical Association

RESOURCES