Skip to main content
Springer logoLink to Springer
. 2025 Jan 24;67(3):563–573. doi: 10.1007/s00234-025-03551-y

Detecting B-cell lymphoma-6 overexpression status in primary central nervous system lymphoma using multiparametric MRI-based machine learning

Mingxiao Wang 1,2,#, Guoli Liu 2,#, Nan Zhang 3, Yanhua Li 2,4, Shuo Sun 1,2, Yahong Tan 1,2, Lin Ma 1,2,
PMCID: PMC12003451  PMID: 39853344

Abstract

Purpose

In primary central nervous system lymphoma (PCNSL), B-cell lymphoma-6 (BCL-6) is an unfavorable prognostic biomarker. We aim to non-invasively detect BCL-6 overexpression in PCNSL patients using multiparametric MRI and machine learning techniques.

Methods

65 patients (101 lesions) with primary central nervous system lymphoma (PCNSL) diagnosed from January 2013 to July 2023, and all patients were randomly divided into a training set and a validation set according to a ratio of 8 to 2. ADC map derived from DWI (b = 0/1000 s/mm2), fast spin echo T2WI, T2FLAIR, were collected at 3.0 T. A total of 2234 radiomics features from the tumor segmentation area were extracted and LASSO were used to select features. Logistic regression (LR), Naive bayes (NB), Support vector machine (SVM), K-nearest Neighbor, (KNN) and Multilayer Perceptron (MLP), were used for machine learning, and sensitivity, specificity, accuracy F1-score, and area under the curve (AUC) was used to evaluate the detection performance of five classifiers, 6 groups with combinations of different sequences were fitted to 5 classifiers, and optimal classifier was obtained.

Results

BCL-6 status could be identified to varying degrees with 30 models based on radiomics, and model performance could be improved by combining different sequences and classifiers. Support vector machine (SVM) combined with three sequence group had the largest AUC (0.95) in training set and satisfactory AUC (0.87) in validation set.

Conclusion

Multiparametric MRI based machine learning is promising in detecting BCL-6 overexpression.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00234-025-03551-y.

Keywords: PCNSL, BCL-6, Overexpression, MRI, Machine learning

Introuduction

Primary central nervous system lymphoma (PCNSL) is a rare and aggressive form of extra-nodal non-Hodgkin lymphoma confined to the central nervous system (CNS), including the brain, spine, meninges, and eyes, with more than 90% of cases histologically classified as diffuse large B-cell lymphoma (DLBCL) [12]. In contrast to other high-grade non-Hodgkin lymphoma outside the CNS, PCNSL is particularly aggressive and the overall survival time of untreated PCNSL patients is limited to 1.5 months [3, 4]. The observed variations in clinical characteristics and treatment outcomes in DLBCL patients can be attributed to the significant genetic and molecular diversity that drives the disease aggressiveness and tumor advancement [5].

The major molecular markers of PCNSL, including B-cell lymphoma-2 (BCL-2), MYC, B-cell lymphoma-6 (BCL-6), Ki-67, etc., are related to the prognosis. Overexpression of both BCL-2 and MYC proteins, known as double-expression lymphomas (DEL), has been well investigated. Previous studies have identified that BCL-6 expression is also an useful biomarker for predicting poor prognosis and reduced survival rates [58]. Therefore, the prognostic value of BCL-6 may aid in determining the relevant prognosis of PCNSL and guide the therapeutic decisions. In PCNSL, pathologic diagnosis is primarily made based on fine-needle aspiration, but biopsies are invasive and have the risk of undersampling [9]. Patients with PCNSL typically undergo brain MRI examinations prior to treatment, making MRI a possible non-invasive method for detecting the BCL-6 status [10].

The potentials of radiomics analysis in predicting molecular characteristics in brain tumors have been reported [1112]. Liu et al. has demonstrated that DEL and non-DEL status can be effectively differentiated using multiparametric MRI based machine learning radiomics in immunocompetent PCNSL [13]. Another study showed a significant correlation between whole tumor ADC histogram profiling and Ki-67 expression, a marker of active proliferation in PCNSL [3]. Bathla et al. reported that radiomics combined with machine learning model can distinguish PCNSL and glioblastoma excellently [14]. Additionally, previous studies have demonstrated the utility of radiomics in distinguishing among PCNSL, glioma, and metastasis [1518]. In this study, our purpose was to determine whether multiparametric MRI radiomics features could help detect BCL-6 expression status in immunocompetent PCNSL.

Materials and methods

Patient cohort

This retrospective study was approved by the hospital’s ethics committee (No. S2022-671-01), and the requirement for written informed consent was waived. Between January 2013 and July 2023, 69 pathologically proved PCNSL patients were retrieved from the PACS of our hospital. Four of them were excluded due to the usage of steroid treatment before MR scan or blurred MR images. Finally, 65 patients (101 lesions) were included in this study, among whom 40 were BCL-6 overexpression (23 males and 17 females, mean age, 58.11 ± 13.65 years) and 25 were BCL-6 underexpression (16 males and 9 females, mean age, 56.59 ± 12.40 years). Patients were included according to the following criteria: (1) pathologically confirmed DLBCL; (2) immunohistochemical staining results of BCL-6 (> 30% was defined as BCL-6 overexpression [BCL-6 (+)], ≤ 30% was defined as BCL-6 underexpression [BCL-6 (-)]); (3) patients were immunocompetent (determined by past history and laboratory tests including HIV, lymphocyte ratio, IgG, etc.); and (4) over 18 years of age. The exclusion criteria included: (1) incomplete MRI protocol or obvious artifacts in MR images; (2) MR scans after chemotherapy, radiotherapy, or steroid usage; (3) MR scans after tumor resection; and (4) incomplete histological results. All the clinical data, MRI protocol, and MR image artifacts were collected and assessed by A (6-year work experience), B (6-year work experience), and C (3-year work experience). The flowchart of patient inclusion was shown in Fig. 1. Risk score was assessed using three categories (low, intermediate, and high) based on the International Extranodal Lymphoma Study Group (IELSG) [19] and the Memorial Sloan-Kettering Cancer Center prognostic model (MSKCC) [20], respectively.

Fig. 1.

Fig. 1

Flowchart of the study cohort. PCNSL = primary centeral nervous system lymphoma

MRI protocol

MR images were acquired on a 3T MRI system (Discovery 750, GE Healthcare, Milwaukee, WI) with a 32-channel head coil. MR imaging protocol included axial fast spin echo T2 weighted imaging (T2WI, repetition time [TR]/echo time [TE] = 5642/93 msec, field of view [FOV] = 24 × 24 cm, matrix = 512 × 512, number of excitation [NEX] = 1.50), coronal T2 fluid-attenuated inversion recovery (T2FLAIR) (TR/TE/inversion time [TI] = 8527/162/2100 msec, FOV = 24 × 24 cm, matrix = 288 × 224, NEX = 1.00), axial diffusion-weighted imaging (DWI, TR/TE = 3000/65.5 msec, FOV = 24 × 24 cm, b = 0/1000 sec/mm2, matrix = 160 × 160, and NEX = 2.00), with a slice thickness of 5.0 mm and gap of 1.5 mm for all the axial and coronal images.

Tumor segmentation

DICOM images were converted to the NIfTI format, and the image structure was segmented using ITK-SNAP [21]. The areas of interest were comprised of tumor solid components and necrosis or cystic changes, except the hemorrhage. Meanwhile, lesions with a minimum diameter of more than 5 mm were included so as to ensure a reliable measurement. Three dimension Segmentations were delineated on ADC, T2WI, and T2FLAIR, respectively, and intergroup consistency by intraclass correlation coefficient (ICC) was conducted. Regions of interest were drawn by A (6-year work experience) and B (6-year work experience). Tumor segmentation on different MRI sequences was demonstrated in Fig. 2.

Fig. 2.

Fig. 2

An example of tumor segmentation. The red label represented the solid part of the tumor

Feature extraction

Radiomics features were extracted by a customized version of Pyradiomics from Python (Version 3.8.8). A total of 2234 radiomics characteristics were extracted from each tumor segment in each set of images used in the machine learning models. The interpretation and calculation formulas, as well as all radiomics features, were presented on Pyradiomics website (https://pyradiomics.readthedocs.io/en/latest/features.html). The extracted features contained 14 shape features, 36 first-order histogram features, and 75 s-order features (also called texture features).

Filters such as the original image, 8 wavelet transforms, 5 log-sigma transforms, square, square root, logarithmic, exponential, gradient, and LBP-2D transform images were used to select the features.

Feature selection

Feature reduction was performed through the following steps.1) The extracted features with ICC ≥ 0.80 were selected. 2) Levene assay was used to analyze the normal distribution of test features. The correlation between features and BCL-6 expression was then assessed by t-test, and features with P < 0.05 were considered as potential predictors. 3) Least absolute shrinkage and selection operator (LASSO) regression was used to select features. 4) The final subset of features was filtered based on this number and the ranking of the feature weights.

Classifier modeling

Based on the 3 MRI sequences, 6 sequence groups (ADC, T2WI, T2FLAIR, ADC + T2WI, ADC + T2FLAIR, and ADC + T2WI + T2FLAIR) were selected. One sequence group (T2 + T2FLAIR) was deleted since no features were presented. Subsequently, five possible classifiers built on the machine learning tool kit of scikit-learn (https://scikit-learn.org) were used to adapt to these combinations. The following classifiers were used: logistic regression (LR), Naive Bayes (NB), support vector machine (SVM), K-nearest neighbor (KNN), and Multilayer Perception (MLP). The best features were combined with five different machine learning algorithms to filter the optimal model. Five-fold cross-validation was performed in the training set. Finally, the validity of 30 different models were evaluated. Default parameters were used for all machine learning models in this study.

Tumor Morphologic Assessment

Based on previous studies, [2223] tumor morphology was selected and evaluated: (1) Tumor location was determined by the lesion center and divided into two groups: supratentorial or infratentorial, peripheral or median parenchyma (basal ganglia, thalamus, brainstem, and cerebellum). (2) The maximum and minimum diameters of the maximum layer of tumor were measured on T2WI. (3) ADC values, including ADCmean, ADCmax, ADCmin, ADC percentile, and ADC uniformity were calculated (4) Necrosis or cystic changes were defined as areas showing hyperintensity on T2WI.5) Edema (Table S1 in the Supplemental Material) and mass effect levels (Table S2 in the Supplemental Material). These tumor morphologic features were assessed by A. (6-year work experience), B (6-year work experience), and C (3-year work experience). Representative case images in BCL-6(+) and BCL-6(-) were shown in Fig. 3.

Fig. 3.

Fig. 3

MRI in a 47-year-old male PCNSL patient with BCL-6(-). The tumor shows isointensity on axial T2WI (a), restriction on axial ADC map (b), and slight hyperintensity on coronal T2FLAIR (c). MRI in a 76-year-old female PCNSL patient with BCL-6(+). The tumor shows slight hyperintensity on axial T2WI (d), restriction on axial ADC map (e), and isointensity on coronal T2FLAIR (f)

Statistical analysis

The statistical product service solutions (version 26.0) were utilized to analyze statistical differences in clinical data and MR imaging features of lesion between BCL-6 (+) and BCL-6 (-) groups. Continuous variables were assessed by the t-test, while categorical variables were assessed by the nonparametric test. P value < 0.05 were considered statistically significant. The inter group consistency was tested by ICC, and ICC ≥ 0.80 was considered to be a strong correlation. Sensitivity, specificity, accuracy, F1-score, and area under curve (AUC) for each machine learning model were calculated.

Results

Clinical and lesion characteristics

In clinical characteristics, no significant difference was found in age (P = 0.36), gender (P = 0.60), MSKCC (P = 0.42), number of lesions (P = 1.00), or pathologic measure (P = 0.52) between BCL-6 (+) and BCL-6 (-) (Table 1). IELSG (P = 0.02) shows significant difference between BCL-6 (+) and BCL-6 (-). As for MR imaging features, location (P > 0.05), maximum diameter (P = 0.15), minimum diameter (P = 0.35), ADCmax (P = 0.76), ADCmin (P = 0.10), ADC 5 percentile (P = 0.18), ADC uniformity (P = 0.26), necrosis or cystic change (P = 0.13), edema (P = 0.56), or mass effect (P = 0.81) did not show significant difference between BCL-6 (+) and BCL-6 (-). ADCmean (P = 0.00) showed significant difference between BCL-6 (+) and BCL-6 (-). (Table 2).

Table 1.

Clinical data between BCL-6 (+) and BCL6 (-)

ALL BCL-6 (+) BCL-6 (-) P value
Number of patients 65 40 25
Age (mean ± SD, year) 57.52 ± 13.14 58.11 ± 13.65 56.59 ± 12.40 0.36
Gender (%) 0.60
Male 39 (60.00) 23 (57.50) 16 (64.00)
Female 26 (40.00) 17 (42.50) 9 (36.00)
IELSG (%) 0.02
Low 17 (26.15) 8 (20.00) 9 (36.00)
Intermediate 45 (69.23) 32 (80.00) 13 (52.00)
High 3 (4.62) 0 (0) 3 (12.00)
MSKCC (%) 0.42
Low 16 (24.62) 11 (27.50) 5 (20.00)
Intermediate 36 (55.38) 23 (57.50) 13 (52.00)
High 13 (20.00) 6 (15.00) 7 (28.00)
Number of lesions(%) 1.00
Single 39 (60.00) 24 (60.00) 15 (60.00)
Mutiple 26 (40.00) 16 (40.00) 10 (40.00)
Pathologic measure(%) 0.52
Stereotactic biopsy 38 (58.46) 23 (57.50) 15 (60.00)
Partial excision 9 (13.85) 7 (17.50) 2 (8.00)
Whole excision 18 (27.69) 10 (25.00) 8 (32.00)

Distribution of BCL-6 expression status was reported as absolute counts (%). Age was reported as mean ± SD. BCL-6 (+), BCL-6 overexpression; BCL-6 (-), BCL-6 underexpression; IELSG, International Extranodal Lymphoma Study Group; MSKCC, Memorial Sloan-Kettering Cancer Center score for PCNSL outcome; SD, standard deviation

Table 2.

MR Imaging features of the lesions

ALL BCL-6 (+) BCL-6 (-) P value
Number of lesions 101 62 39
Location (%)
Supratentorial 86 (85.15) 56 (90.32) 30 (76.90) 0.07
Infratentorial 15 (14.85) 6 (9.68) 9 (23.10)
Midline 56 (55.45) 38 (61.29) 18 (46.15) 0.14
Peripheral 45 (44.55) 24 (38.71) 21 (53.85)
Maximum diameter (mean ± SD, mm) 29.71 ± 13.92 28.82 ± 14.93 31.18 ± 12.18 0.15
Minimum diameter (mean ± SD, mm) 16.71 ± 9.72 15.69 ± 8.75 18.33 ± 11.01 0.35
ADC
ADCmean 0.91 ± 0.31 0.83 ± 0.38 0.95 ± 0.26 0.00
ADCmax 2.47 ± 0.99 2.47 ± 1.02 2.48 ± 0.94 0.76
ADCmin 0.14 ± 0.41 0.19 ± 0.40 0.06 ± 0.43 0.10
ADC 5 percentile 0.56 ± 0.28 0.59 ± 0.25 0.51 ± 0.32 0.18
ADCuniformity 0.27 ± 0.10 0.26 ± 0.10 0.30 ± 10 0.26
Necrosis or cystic change (%) 0.13
Yes 35 (34.65) 25 (40.32) 10 (25.64)
No 66 (65.35) 37 (59.68) 29 (74.36)
Edema (%) 0.56
1 53 (52.48) 35 (56.45) 18 (46.15)
2 33 (32.67) 18 (29.03) 15 (38.46)
3 15 (14.85) 9 (14.52) 6 (15.38)
Mass effect (%) 0.81
0 18 (17.82) 12 (19.35) 6 (15.38)
1 40 (39.60) 24 (38.71) 16 (41.03)
2 11 (10.89) 8 (12.90) 3 (7.69)
3 23 (22.77) 12 (19.35) 11 (28.21)
4 9 (8.91) 6 (9.68) 3 (7.69)

Distribution of BCL-6 expression status was reported as absolute counts(%). Maximum diameter and minimum diameter were reported as mean ± SD. BCL-6 (+), BCL-6 overexpression; BCL-6 (-), BCL-6 underexpression; SD: standard deviation

Performance of the classification models

After feature selection, the remaining features (sequence, feature filter, feature type, and feature name) for each sequence group were as follows: ADC (number of features [nfeature] = 10), T2WI (nfeature = 4), T2FLAIR (nfeature = 5), ADC + T2WI (nfeature = 15), T2WI + T2FLAIR (nfeature = 0), ADC + T2FLAIR (nfeature = 11), and ADC + T2WI + T2FLAIR (nfeature = 17) (Table S3 in the Supplemental Material). The pipeline for BCL-6 expression detection was depicted in Fig. 4.

Fig. 4.

Fig. 4

The pipeline for BCL-6 overexpression detection

Radiomics model construction

Of the 101 lesions, 81 lesions, including 49 BCL-6 (+) and 32 BCL-6 (-), were in the training set, and 20 lesions, including 13 BCL-6 (+) and 7 BCL-6 (-), were in the validation set.

In the training set, the diagnostic performance of five machine learning with single and three sequences was shown in Table 3. Among the five machine learning methods, SVM with three sequence group achieved better classification performance (AUC = 0.95) than the others, with a sensitivity of 92.0%, specificity of 88.0%, accuracy of 93.0%.

Table 3.

Model performance of single sequence groups and three sequence group in training set

Sequence Model Sensitivity Specificity Accuracy F1 score AUC
ADC LR 0.85 0.75 0.76 0.82 0.78
NB 0.92 0.50 0.78 0.83 0.75
SVM 0.92 0.75 0.88 0.90 0.85
KNN 0.77 0.88 0.83 0.87 0.86
MLP 0.77 0.88 0.81 0.86 0.82
T2WI LR 0.73 0.86 0.63 0.65 0.70
NB 0.64 1.00 0.72 0.77 0.73
SVM 0.91 0.86 0.72 0.72 0.80
KNN 0.46 1.00 0.78 0.81 0.84
MLP 0.82 0.86 0.74 0.80 0.74
T2FLAIR LR 0.67 1.00 0.69 0.75 0.71
NB 0.83 0.86 0.68 0.75 0.64
SVM 0.67 0.88 0.79 0.84 0.80
KNN 0.83 0.38 0.74 0.79 0.76
MLP 0.75 1.00 0.68 0.69 0.76
Three sequence LR 0.75 0.75 0.90 0.89 0.86
NB 0.83 0.63 0.78 0.82 0.82
SVM 0.92 0.88 0.93 0.94 0.95
KNN 0.83 1.00 0.80 0.82 0.85
MLP 0.75 0.88 0.86 0.89 0.93

AUC, area under curve; LR, logistic regression; NB, Naive Bayes; SVM, support vector machine; KNN, K-nearest neighbor; MLP, Multilayer Perception

In the validation set, the diagnostic performance of five machine learning with single and three sequences was shown in Table 4. SVM with three sequence group also achieved satisfactory classification performance (AUC = 0.87) with a sensitivity of 98.0%, specificity of 84.0%, accuracy of 90.0%. The diagnostic performance of five machine learning with two sequence groups could be seen in Table S4 and S5 in the Supplemental Material.

Table 4.

Model performance of single sequence groups and three sequence group in validation set

Sequence Model Sensitivity Specificity Accuracy F1 score AUC
ADC LR 0.88 0.58 0.81 0.85 0.84
NB 0.88 0.61 0.76 0.83 0.73
SVM 0.92 0.83 0.86 0.89 0.89
KNN 0.94 0.65 0.81 0.83 0.87
MLP 0.92 0.65 0.81 0.83 0.86
T2WI LR 0.51 0.79 0.78 0.63 0.81
NB 0.77 0.66 0.78 0.78 0.86
SVM 0.61 0.90 0.89 0.91 0.83
KNN 0.79 0.76 0.67 0.63 0.76
MLP 0.88 0.52 0.83 0.86 0.84
T2FLAIR LR 0.74 0.61 0.80 0.80 0.85
NB 0.76 0.55 0.80 0.83 0.72
SVM 0.90 0.61 0.75 0.76 0.72
KNN 0.80 0.65 0.65 0.74 0.61
MLP 0.58 0.84 0.85 0.86 0.87
Three sequence LR 0.92 0.77 0.75 0.78 0.77
NB 0.84 0.68 0.75 0.82 0.69
SVM 0.98 0.84 0.90 0.91 0.87
KNN 0.72 0.94 0.90 0.91 0.91
MLP 0.86 0.87 0.80 0.82 0.80

AUC, area under curve; LR, logistic regression; NB, Naive Bayes; SVM, support vector machine; KNN, K-nearest neighbor; MLP, Multilayer Perception

Dicussion

In the present study, the BCL-6 status was determined using multiparametric MRI based machine learning, which involved the selection of an optimal model from a total of 30 models comprising 6 distinct sequence groups and 5 classifiers. This approach offers a promising and non-invasive means for identifying the molecular subtypes, specifically the expression of BCL-6, in PCNSL. The multiparametric MRI radiomics model exhibited a high level of diagnostic accuracy in identifying BCL-6 status, as evidenced by an AUC of 0.95 (95% CI 0.883–1.000), in the training set. Furthermore, the model demonstrated consistent performance in the validation set, achieving an AUC of 0.87 (95% CI: 0.703–1.000), indicating its reliability.

DLBCL mainly presents in two distinct molecular subtypes, with gene expression reflecting varying stages of B-cell differentiation. One subtype exhibited gene expression patterns typical of germinal center B cells, referred to as “germinal center B-like DLBCL”, while the other subtype displayed gene expression signatures consistent with in vitro activation referred to as “activated B-like DLBCL” [24]. Notably, the germinal center B-like DLBCL signature prominently features the BCL-6 gene, a widely recognized germinal center marker and the most commonly translocated gene in DLBCL [25]. The specific expression of BCL-6 protein in normal germinal center B lymphocytes, along with its essential role in germinal center development, suggests a complex interplay between the functions of BCL-6 in both normal and malignant B cells. The BCL-6 protein is a transcriptional repressor with the potential to regulate lymphocyte differentiation and promote lymphomas through the modulation of key downstream target genes [25]. As such, the impact of BCL-6 on the prognosis of patients with PCNSL remains a subject of inquiry. A prospective multicenter trial (CALGB 50202) conducted in 2013 sought to address this question, and ultimately demonstrating a significant association between BCL-6 expression and decreased survival rates[6]. Other studies also indicated that BCL-6 assume clinical relevance as an unfavorable prognostic biomarker in PCNSL [7, 26].

IELSG scoring system is frequently utilized to stratify the risk levels of patients with PCNSL. IELSG added age, performance status, lactate dehydrogenase, cerebrospinal fluid protein, and deep brain lesions involvement as risk factors for PCNSL. In this study, IELSG score showed marginally significant difference between BCL-6 (+) and BCL-6 (-) (P = 0.02). Further research with larger sample size is needed to investigate the correlation between the IELSG scoring system and BCL-6 expression status.

In immunocompetent patients, PCNSL typically shows hypo- or isointense on T1-weighted imaging (T1WI) and iso- to slight hyperintense on T2WI. Most lesions show evident homogeneous enhancement on TICE, and slight low to iso- perfusion on perfusion-weighted imaging (PWI). In clinical practice, diffuse gliomas, including glioblastoma, are more commonly encountered. IDH-wildtype glioblastomas usually demonstrate hypointense on T1WI and heterogeneous hyperintense on T2WI. Hemorrhage and necrosis are more commonly revealed, and heterogeneous enhancement and hyperperfusion are seen in glioblastomas. Therefore, the differentiation between PCNSL and glioblastoma is relatively easy to implement.

In our study, conventional MR imaging findings did not demonstrate marked differences between BCL-6(+) and BCL-6(-) in PCNSL. MR imaging findings, such as location, necrosis or cystic change, maximum diameter, minimum diameter, edema, or mass effect, could not be used to detect BCL-6 overexpression. Besides, relevant quantitative ADC parameters were calculated, and only ADCmean values showed significant difference between BCL-6 (+) and BCL-6 (-). ADCmean value in BCL-6(+) group was lower than that in BCL-6(-) group. ADC values, a quantitative index of diffusion characteristics, could provide information about cellular density and the tortuosity of the extracellular space, and tend to decrease in areas with restricted diffusion [27]. Low ADC values were associated with poorer clinical outcomes in various cancers [2829], and our findings were consistent with the result that low ADC value was regarded as an independent unfavorable prognostic factor in PCNSL patients [30].

Radiomics is a new translational field in which a range of attributes, such as geometry, strength, and texture [3132], are determined from radiological images to allow for capturing different imaging patterns [3334]. These patterns have the potential to aid in tumor classification, grading, and staging when integrated with clinical, histopathological, molecular, and traditional imaging parameters [14, 3233, 35]. In our study, a combined conventional sequence (ADC + T2WI + T2FLAIR) based radiomics and machine learning approach was used to successfully identify BCL-6 expression status. The most effective model in this study was found to be SVM with an AUC of 0.95 and a Confidence Interval (CI) of 0.883–1.000. SVM, grounded in the principles of VC dimension and minimum structural risk theory in statistics [36], has been widely applied in various medical fields including preoperative diagnosis, prognostic prediction, disease differentiation, and clinical grading [3738]. It was noteworthy that we only used three non-contrast MR sequences to achieve the satisfactory performance, and the result suggests that BCL-6 could still be detected in patients who are not suitable for contrast-enhanced MR scan.

Limitation

Firstly, this was a single-center study conducted on single scanner, and the sample size was relatively small, leading to a possible selection bias. Secondly, post-contrast MR scan or MR perfusion techniques were not used in our study. Thirdly, only BCL-6 was investigated in this study, neglecting the detection of other biomarkers such as Ki-67, BCL-2, and MYC. Lastly, default parameters were applied to all machine learning models without delving into the specific settings of individual models. Further analysis to determine the optimal parameters for each model will be pursued in future studies.

Conclusion

In this study, multiparametric MRI based machine learning combining three conventional sequences (ADC, T2WI, and T2FLAIR) with SVM had the best performance in detecting BCL-6 status, providing a potential and promising tool in therapeutic planning and prognostic assessment in PCNSL.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (29.5KB, docx)

Acknowledgements

We thank all the subjects for participating in the study.

Abbreviations

PCNSL

Primary central nervous system lymphoma

BCL-6

B-cell lymphoma-6

BCL-2

B-cell lymphoma-2

IELSG

International Extranodal Lymphoma Study Group

MSKCC

Memorial Sloan-Kettering Cancer Center

ML

Machine learning

LR

Logistic regression

NB

Naive bayes

SVM

Support vector machine

KNN

K-nearest neighbor

MLP

Multilayer perception

AUC

Area under the receiver operating characteristic curve

CI

Confidence interval

ICC

Intraclass correlation coefficients

LASSO

Least absolute shrinkage and selection operator

Funding

The authors declare that no funding support was received for this work.

Declarations

Ethical approval

Compliance with ethical standards: We declare that all human and animal studies have been approved by the ethics committee of Chinese PLA General Hospital (Registration Number: S2022-671-01) and have therefore been performed in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. The requirement for written informed consent was waived.

Conflict of interest

We declare that we have no conflict of interest.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Mingxiao Wang and Guoli Liu contributed equally to this work.

References

  • 1.Löw S, Han CH, Batchelor TT (2018) Primary central nervous system lymphoma. Ther Adv Neurol Disord 11:1756286418793562. 10.1177/1756286418793562 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Barajas RF, Politi LS, Anzalone N et al (2021) Consensus recommendations for MRI and PET imaging of primary central nervous system lymphoma: guideline statement from the international primary CNS lymphoma collaborative group (IPCG). Neuro-Oncol 23(7):1056–1071. 10.1093/neuonc/noab020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Schob S, Münch B, Dieckow J et al (2018) Whole tumor histogram-profiling of diffusion-weighted magnetic resonance images reflects tumorbiological features of primary central nervous system lymphoma. Transl Oncol 11(2):504510. 10.1016/j.tranon.2018.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Feng Y, Liu Y, Zhong M et al (2021) Complete blood count score model predicts inferior prognosis in primary central nervous system lymphoma. Front Oncol 11:618694. 10.3389/fonc.2021.618694 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lossos IS, Jones CD, Warnke R et al (2001) Expression of a single gene, BCL-6, strongly predicts survival in patients with diffuse large B-cell lymphoma. Blood 98(4):945–951. 10.1182/blood.v98.4.945 [DOI] [PubMed] [Google Scholar]
  • 6.Rubenstein JL, Hsi ED, Johnson JL et al (2013) Intensive chemotherapy and immunotherapy in patients with newly diagnosed primary CNS lymphoma: CALGB 50202 (Alliance 50202). J Clin Oncol 31(25):3061–3068. 10.1200/JCO.2012.46.9957 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kreher S, Jöhrens K, Strehlow F et al (2015) Prognostic impact of B-cell lymphoma 6 in primary CNS lymphoma. Neuro-Oncol 17(7):1016–1021. 10.1093/neuonc/nov046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lossos C, Bayraktar S, Weinzierl E et al (2014) LMO2 and BCL6 are associated with improved survival in primary central nervous system lymphoma. Br J Haematol 165(5):640–648. 10.1111/bjh.12801 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chitalia RD, Kontos D (2019) Role of texture analysis in breast MRI as a cancer biomarker: A review. J Magn Reson Imaging 2019;49(4):927–938. 10.1002/jmri.26556 [DOI] [PMC free article] [PubMed]
  • 10.Hatzoglou V, Oh JH, Buck O et al (2018) Pretreatment dynamic contrast-enhanced MRI biomarkers correlate with progression-free survival in primary central nervous system lymphoma. J Neurooncol 140(2):351–358. 10.1007/s11060-018-2960-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kim M, Jung SY, Park JE et al (2020) Diffusion- and perfusion-weighted MRI radiomics model may predict isocitrate dehydrogenase (IDH) mutation and tumor aggressiveness in diffuse lower grade glioma. Eur Radiol 30(4):2142–2151. 10.1007/s00330-019-06548-3 [DOI] [PubMed] [Google Scholar]
  • 12.Guha A, Goda JS, Dasgupta A et al (2022) Classifying primary central nervous system lymphoma from glioblastoma using deep learning and radiomics based machine learning approach - a systematic review and meta-analysis. Front Oncol 12:884173. 10.3389/fonc.2022.884173 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Liu G, Zhang X, Zhang N et al (2024) Detecting double expression status in primary central nervous system lymphoma using multiparametric MRI based machine learning. J Magn Reson Imaging 59(1):231–239. 10.1002/jmri.28782 [DOI] [PubMed] [Google Scholar]
  • 14.Bathla G, Priya S, Liu Y et al (2021) Radiomics-based differentiation between glioblastoma and primary central nervous system lymphoma: a comparison of diagnostic performance across different MRI sequences and machine learning techniques. Eur Radiol 31(11):8703–8713. 10.1007/s00330-021-07845-6 [DOI] [PubMed] [Google Scholar]
  • 15.Nguyen AV, Blears EE, Ross E et al (2018) Machine learning applications for the differentiation of primary central nervous system lymphoma from glioblastoma on imaging: a systematic review and meta-analysis. Neurosurg Focus 45(5):E5. 10.3171/2018.8.FOCUS18325 [DOI] [PubMed] [Google Scholar]
  • 16.Kunimatsu A, Kunimatsu N, Yasaka K et al (2019) Machine learning-based texture analysis of contrast-enhanced MR imaging to differentiate between glioblastoma and primary central nervous system lymphoma. Magn Reson Med Sci 18(1):44–52. 10.2463/mrms.mp.2017-0178 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Cindil E, Sendur HN, Cerit MN et al (2021) Validation of combined use of DWI and percentage signal recovery-optimized protocol of DSC-MRI in differentiation of high-grade glioma, metastasis, and lymphoma. Neuroradiology 63(3):331–342. 10.1007/s00234-020-02522-9 [DOI] [PubMed] [Google Scholar]
  • 18.Kim Y, Cho Hho, Kim ST et al (2018) Radiomics features to distinguish glioblastoma from primary central nervous system lymphoma on multi-parametric MRI. Neuroradiology 60(12):1297–1305. 10.1007/s00234-018-2091-4 [DOI] [PubMed] [Google Scholar]
  • 19.Abrey LE, Batchelor TT, Ferreri AJM, etal et al (2005) Report of an international workshop to standardize baseline evaluation and response criteria for primary CNS lymphoma. J Clin Oncol 23(22):5034–5043. 10.1200/JCO.2005.13.524 [DOI] [PubMed]
  • 20.Abrey LE, Ben-Porat L, Panageas KS et al (2006) Primary central nervous system lymphoma: the Memorial Sloan-Kettering Cancer Center Prognostic Model. J Clin Oncol 24(36):5711–5715. 10.1200/JCO.2006.08.2941 [DOI] [PubMed] [Google Scholar]
  • 21.Yushkevich PA, Piven J, Hazlett HC et al (2006) User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. NeuroImage 31(3):1116–1128. 10.1016/j.neuroimage.2006.01.015 [DOI] [PubMed] [Google Scholar]
  • 22.Coulon A, Lafitte F, Hoang-Xuan K et al (2002) Radiographic findings in 37 cases of primary CNS lymphoma in immunocompetent patients. Eur Radiol 12(2):329–340. 10.1007/s003300101037 [DOI] [PubMed] [Google Scholar]
  • 23.Küker W, Nägele T, Korfel A et al (2005) Primary central nervous system lymphomas (PCNSL): MRI features at presentation in 100 patients. J Neurooncol 72(2):169–177. 10.1007/s11060-004-3390-7 [DOI] [PubMed] [Google Scholar]
  • 24.Alizadeh AA, Eisen MB, Davis RE et al (2000) Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403(6769):503–511. 10.1038/35000501 [DOI] [PubMed] [Google Scholar]
  • 25.Staudt LM, Dent AL, Shaffer AL et al (1999) Regulation of lymphocyte cell fate decisions and lymphomagenesis by BCL-6. Int Rev Immunol 18(4):381–403. 10.3109/08830189909088490 [DOI] [PubMed] [Google Scholar]
  • 26.Chang CC, Kampalath B, Schultz C et al (2003) Expression of p53, c-Myc, or Bcl-6 suggests a poor prognosis in primary central nervous system diffuse large B-cell lymphoma among immunocompetent individuals. Arch Pathol Lab Med 127(2):208–212. 10.5858/2003-127-208-EOPMOB [DOI] [PubMed] [Google Scholar]
  • 27.Gourtsoyianni S, Santinha J, Matos C (2020) Diffusion weighted imaging and texture analysis:current role for diffuse liver disease[J]. Abdom Radiol 45(11):3523–3531. 10.1007/s00261-020-02772-4 [DOI] [PubMed] [Google Scholar]
  • 28.Chikarmane SA, Gombos EC, Jagadeesan J, Raut C (2015) MRI findings of radiation-associated angiosarcoma of the breast (RAS). J Magn Reson Imaging 42:763–770. 10.1002/jmri.24822 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Yamada S, Morine Y, Imura S (2020) Prognostic prediction of apparent diffusion coefficient obtained by diffusion-weighted MRI in mass-forming intrahepatic cholangiocarcinoma. J Hepatobiliary Pancreat Sci 27(7):388–395. 10.1002/jhbp.732 [DOI] [PubMed] [Google Scholar]
  • 30.Baek DW, Cho HJ, Bae JH (2020) Apparent diffusion coefficient as a valuable quantitative parameter for predicting clinical outcomes in patients with newly diagnosed primary CNS lymphoma. Blood Res 55(2):99–106. 10.5045/br.2020.2020032 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278(2):563–577. 10.1148/radiol.2015151169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lohmann P, Galldiks N, Kocher M et al (2021) Radiomics in neuro-oncology: basics, workflow, and applications. Methods 188:112–121. 10.1007/s00330-021-08146-8 [DOI] [PubMed] [Google Scholar]
  • 33.Abdel Razek AAK, Alksas A, Shehata M et al (2021) Clinical applications of artificial intelligence and radiomics in neuro-oncology imaging. Insights Imaging 12(1):152. 10.1186/s13244-021-01102-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lee JY, Lee KS, Seo BK et al (2022) Radiomic machine learning for predicting prognostic biomarkers and molecular subtypes of breast cancer using tumor heterogeneity and angiogenesis properties on MRI. Eur Radiol 32(1):650–660. 10.1007/s00330-021-08146-8 [DOI] [PubMed] [Google Scholar]
  • 35.Liu J, Tu J, Yao L et al (2024)MRI-based radiomics virtual biopsy for BCL6 in primary central nervous system lymphoma. Clin Radiol Published Online November 8. 10.1016/j.crad.2024.106746 [DOI] [PubMed]
  • 36.Wang X, Zhai M, Ren Z et al (2021) Exploratory study on classification of diabetes mellitus through a combined random forest classifier. BMC Med Inf Decis Mak 21(1):105. 10.1186/s12911-021-01471-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Zhang L, Liu X, Xu X et al (2023) An integrative non-invasive malignant brain tumors classification and Ki-67 labeling index prediction pipeline with radiomics approach. Eur J Radiol 158:110639. 10.1016/j.ejrad.2022.110639 [DOI] [PubMed] [Google Scholar]
  • 38.Xia W, Hu B, Li H et Al(2021)Multiparametric-MRI-based radiomics model for differentiating primary central nervous system lymphoma from glioblastoma: development and cross-vendor validation. J Magn Reson Imaging 53(1):242–250. 10.1002/jmri.27344 [DOI] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (29.5KB, docx)

Articles from Neuroradiology are provided here courtesy of Springer

RESOURCES