Skip to main content
Neuro-Oncology logoLink to Neuro-Oncology
. 2019 Jan 17;21(4):527–536. doi: 10.1093/neuonc/noz004

Guiding the first biopsy in glioma patients using estimated Ki-67 maps derived from MRI: conventional versus advanced imaging

Evan D H Gates 1,2,#, Jonathan S Lin 1,3,4,#, Jeffrey S Weinberg 5, Jackson Hamilton 6,7, Sujit S Prabhu 5, John D Hazle 1, Gregory N Fuller 8, Veera Baladandayuthapani 9, David Fuentes 1, Dawid Schellingerhout 6,10,
PMCID: PMC6422438  PMID: 30657997

Abstract

Background

Undersampling of gliomas at first biopsy is a major clinical problem, as accurate grading determines all subsequent treatment. We submit a technological solution to reduce the problem of undersampling by estimating a marker of tumor proliferation (Ki-67) using MR imaging data as inputs, against a stereotactic histopathology gold standard.

Methods

MR imaging was performed with anatomic, diffusion, permeability, and perfusion sequences, in untreated glioma patients in a prospective clinical trial. Stereotactic biopsies were harvested from each patient immediately prior to surgical resection. For each biopsy, an imaging description (23 parameters) was developed, and the Ki-67 index was recorded. Machine learning models were built to estimate Ki-67 from imaging inputs, and cross validation was undertaken to determine the error in estimates. The best model was used to generate graphical maps of Ki-67 estimates across the whole brain.

Results

Fifty-two image-guided biopsies were collected from 23 evaluable patients. The random forest algorithm best modeled Ki-67 with 4 imaging inputs (T2-weighted, fractional anisotropy, cerebral blood flow, Ktrans). It predicted the Ki-67 expression levels with a root mean square (RMS) error of 3.5% (R2 = 0.75). A less accurate predictive result (RMS error 5.4%, R2 = 0.50) was found using conventional imaging only.

Conclusion

Ki-67 can be predicted to clinically useful accuracies using clinical imaging data. Advanced imaging (diffusion, perfusion, and permeability) improves predictive accuracy over conventional imaging alone. Ki-67 predictions, displayed as graphical maps, could be used to guide biopsy, resection, and/or radiation in the care of glioma patients.

Keywords: glioma, machine learning, magnetic resonance imaging


Key Points.

  1. Proliferative activity in gliomas can be predicted with MRI to guide biopsy and therapy.

  2. Machine learning of clinical imaging data can be used to predict quantitative pathological markers.

Importance of the Study.

At the time this study was conceived there were very few brain tumor imaging studies that attempted to make quantitative correlations to a hard histopathologic gold standard, and it was considered unlikely that such correlations could be successfully made. Most practitioners still do no more than qualitative imaging interpretation, believing that the level of prognostic information that histopathology yields is out of reach for imaging. The published evidence to the contrary is still sparse and incomplete. Our study shows proof-of-concept that modern imaging and machine learning techniques can make clinically useful quantitative predictions of important histopathologic findings such as proliferation index. These methods can dramatically increase the (already high) value of imaging in the care of brain tumor patients, and point the way for similar advances to be made in multiple other tumors and conditions also.

Ki-67 is a nuclear antigen expressed only by cells that have left the resting (G0) state and are in the mid G1, S, G2, or the M phase of the cell cycle.1 The background levels of Ki-67 expression in the normal brain are very low.2,3 Ki-67 levels correlate with higher tumor grade and poor prognosis,4 although molecular alterations such as isocitrate dehydrogenase (IDH) mutation, 1p/19q codeletion, and telomerase reverse transcriptase promoter mutation are thought to be more fundamental predictors of prognosis, as illustrated by their emphasis in the 2016 revision of the World Health Organization (WHO) CNS tumor classification.5,6 The accuracy of tumor grade, the final synthesis of prognostic information, depends critically on accurate biopsy sampling of the highest grade disease present, and is vulnerable to undersampling.7 Targeting highly proliferative areas should increase the likelihood of sampling highly malignant tumor areas, which will improve the accuracy of prognosis.

Conventional MRI, the mainstay of imaging care for brain tumor patients, has been augmented by multiple advanced sequences: diffusion, perfusion, and permeability imaging. These functional imaging methods provide additional information about tissue pathology.8

There is a gap in knowledge regarding the quantitative strength of radiologic-pathologic correlations for Ki-67 expression. We show how, and with what degree of confidence, predictive hypotheses about Ki-67 might be made from imaging data inputs. Specifically, in this study we use: (i) tight correlation between imaging and pathological data at the resolution of modern stereotactic biopsy technology, (ii) an extensive set of modern imaging techniques (including anatomic, diffusion weighted, permeability/dynamic contrast enhanced [DCE], and perfusion/dynamic susceptibility contrast [DSC] imaging techniques) matched to (iii) quantitative histopathology for proliferation/Ki-67, in (iv) a reasonable number of glioma patients/biopsies comprising most relevant tumors and grades encountered in adult practice. We also made use of the latest machine learning techniques to analyze our data into clinically applicable models.

We hypothesized that MR contains sufficient information to quantitatively estimate the Ki-67 expression levels of biopsies. We demonstrate that Ki-67 levels can be estimated to clinically useful levels of certainty using a relatively small set of non-redundant imaging inputs, displayed as clinically informative graphical maps, and can indicate which imaging sequences are best suited for such predictions.

Materials and Methods

Patients

This was a prospective, institutional review board–approved clinical trial compliant with the Health Insurance Portability and Accountability Act and requiring signed consent. Adult patients with treatment-naïve glioma were consecutively recruited based on specific criteria for inclusion/exclusion (see Supplementary Table 1 for full list of criteria).

Biopsy Sites—Selection and Pathology Analysis

Biopsy samples were collected using either a Nashold-type needle prior to the opening of the dura or surgical forceps after dural opening but before any resection or debulking. The surgical technique specifically focused on minimizing brain shift, which was negligible for needle biopsies and minimal for open forceps biopsies. In all cases, cranial neuronavigation software (Brainlab) was used to guide the biopsy and record the spatial coordinates as the sample was collected.

Up to 5 biopsy targets were specified per patient, based on neuroradiological and surgical assessments of either the anatomic imaging findings or areas of relatively high cerebral blood volume (CBV), high Ktrans, and/or low apparent diffusion coefficient (ADC). The specific procedure is as follows: biopsy sites were first targeted using conventional imaging findings followed by the addition of advanced imaging data from diffusion, perfusion, and DCE sequences. The neurosurgeon/radiologist first selected a biopsy target using standard clinical procedures while blinded to advanced imaging data. These “conventional” targets were placed in areas of contrast enhancement or T2 hyperintensity closest to the surface of the tumor in the planned operative bed. After being unblinded to the imaging data, “advanced” targets were placed in areas of high CBV, high Ktrans, or low ADC (restricted diffusion) (in order of importance). The conventional and advanced targets were considered the same if they were less than 3 mm apart. Additionally, samples were collected along the needle tract as it traversed other tissue fated for resection, with tumor border and normal brain samples obtained where possible. These selection procedures are close to regular clinical practice at our institution, and ensure a realistic range of values for Ki-67 in our samples.

A board-certified neuropathologist performed pathological assessment on a per-sample basis while blinded to imaging data. Biopsies were immunostained for cell proliferation (Ki-67; MIB-1 antibody, Dako Denmark) and reported as the Ki-67 index (% positive nuclei).

For each biopsy, a control volume of interest (VOI) was placed in normal-appearing white matter contralateral to the lesion by an oncologic neuroradiologist. These “virtual biopsies” balanced the high number of tumor biopsies with data from normal tissues. For virtual biopsy sites, Ki-67 expression was assumed to be absent (0%).3,9–12

Imaging Sequences and Maps

Four categories of imaging were obtained for each patient: anatomic/conventional, diffusion, perfusion, and permeability.13–15 We chose these categories of imaging because metrics such as decreased diffusivity and increased blood flow have been linked to tumor presence and malignancy.16,17 As malignant tumors are generally more actively proliferating, we suspected these metrics will also be associated with Ki-67 expression. In order to identify the strongest specific predictors, we acquired a wide range of imaging sequences and parametric maps and removed redundant and non-informative variables with aggressive variable selection.

Patients were imaged for surgical planning on clinical GE 3.0T MR systems as detailed in the the Supplementary Material (section S1.1, Supplementary Tables 2 and 3). Specific imaging parameters are detailed in Supplementary Table 4 and Supplementary Table 5). Conventional images included T1, T1 post-gadolinium (T1C), T2, T2*/susceptibility, fluid attenuation inversion recovery (FLAIR), and susceptibility-weighted angiography (SWAN) sequences.18 These sequences are routinely used in the clinic and provide structural anatomic information and localize the tumor. Advanced imaging included: diffusion, perfusion, and permeability imaging. Diffusion images were processed to maps of ADC, exponential ADC, and fractional anisotropy (FA).19–21 Permeability (DCE) images were processed using NordicICE (Nordic Neuro Lab) to parameter maps using the extended Tofts pharmacokinetic model: Ktrans, kep, vp, and ve.22–24 Also calculated were the parameters of: wash-in (upslope of the time-signal curve), wash-out (downslope), time to peak, area under the time-signal curve, and peak enhancement value. The DCE sequence serves as a preload for the perfusion (DSC) sequence, which was processed to yield maps of leakage-corrected relative CBV, relative cerebral blood flow (CBF), mean transit time, delay, and K2 (leakage parameter, cutoff of 0.1).25–27

Image Registration, Normalization, and VOI Extraction of Imaging Values

Brain-extracted images were registered to the T2 reference image.28 Image registration was performed using the open-source program Advanced Normalization Tools (ANTs21), which first used a rigid, 6 degrees-of-freedom (DOF), mutual-information (MI) based registration for each image volume, followed by an affine, 12 DOF, MI-based registration. Note that the registration performance of the ANTs package has been previously evaluated for brain applications.22 Anatomic image intensities were independently normalized by a linear transformation so that the average intensity of internal reference tissues (gray matter, white matter, cerebrospinal fluid [CSF]) was consistent between patients. To do this, regions of interest (ROIs) were placed in the ventricles (for CSF), the putamen in the lentiform nucleus (for gray matter), and normal-appearing white matter to construct internal intensity reference standards for quantitative analysis. Each image was independently linearly transformed such that the average CSF, gray matter, and/or white matter voxel intensities inside the ROI had mean 0 or 1, respectively, with the specific pair of tissues used depending on the sequence being normalized. This process enforces a consistent intensity normalization across the study population (Supplementary Fig. 1) and reduced interpatient and interscanner variability.29 The Supplementary Material (section S1.2, Supplementary Fig. 1) provides details.

Five-millimeter diameter spherical VOIs were centered on the biopsy coordinates, and mean imaging values from both real and virtual biopsy locations were recorded (Supplementary Fig. 2), meaning that each biopsy site had 23 imaging parameters associated with it (Supplementary Tables 4 and 5). The 5 mm diameter was chosen based on the physical size of the tissue sample. Needle biopsies produce a tissue sample approximately 10 mm long, of which half is used for pathological analysis. So, a 5 mm diameter VOI contains the whole sample. For consistency, the same size 5 mm VOI was used for forceps biopsies as well. The level of spatial uncertainty due to registration is small, less than about 1.5 mm.28

Variable Reduction and Modeling of Image Features to Estimate Ki-67

We evaluated our modeling performance using 5-fold cross validation by randomly dividing our data pool into 5 roughly equal parts (folds). We divided the set of real biopsy samples equally into 5 random folds. Then, the virtual biopsies were assigned to the same folds as their corresponding real biopsies, so each fold was composed of half real and half (matched) virtual biopsies. We performed variable selection and trained a random forest model on the combined data from 4 of the 5 folds. This model was then used to estimate the Ki-67 expression in the remaining, fifth, fold. We repeated this process 5 times: once for each combination of 4 training folds. Cross validation allowed us to estimate the generalization error and the sensitivity of the model to data partitioning. We implemented variable selection and modeling using R.30

Variable reduction serves to (i) identify the most informative predictive variables and (ii) simplify the resulting models by reducing collinearity, making them more generalizable. Based on clinical knowledge, we suspected that images or maps from the same imaging family (anatomic, diffusion, perfusion, and permeability) contained substantial redundant information. We used only the dominant predictor, as defined by the random forest algorithm (see Supplementary section S1.3), from each imaging family in order to minimize and reduce this overlap in information to a total of 4 predictors. Finally, to assess the added benefit of advanced imaging, we repeated the variable selection process, but using only conventional imaging sequences (T1, T1C, T2, FLAIR, T2*, SWAN), and selected the most important 4 inputs from this set.

We chose to use the random forest algorithm based on past success in medical imaging applications.31 The random forest is an ensemble classifier that uses many voting decision trees to categorize new data. Each decision tree is trained on a subset of the input variables and training data to learn different relationships between the training data. We measured the model performance using the coefficient of determination (R2) between observed and estimated values in the testing fold32 (see Supplementary Fig. 3 for a schematic of this process and Supplementary section S1.3 for a description of R2), the root mean square error (RMSE), and the maximum error. Additionally, we tested other popular machine learning algorithms, including single decision trees, support vector machines, neural networks, and elastic net linear regression,33,34 but did not find any that outperformed the random forest.

After evaluating our best models based on all available imaging (conventional plus advanced) or just conventional sequences, we combined the results of the 5 separate variable selections into a single variable set based on the consensus of the 5 selections. This allows us to define a final model with patient-independent variable selections. We repeated the cross validation with these fixed variable sets to estimate final accuracy.

Mapping of Ki-67 Predictions

Based on the fixed variable selections for our best performing approach (Supplementary Table 6), we chose a single 4-variable set (T2, FA, CBF, Ktrans) to generate quantitative maps of estimated Ki-67. We estimated the Ki-67 for each voxel in a patient’s brain using a leave-one-patient-out scheme. This represents the closest approximation to a clinical implementation of the algorithm and uses all the available data while maintaining independence in the resulting maps. Each map was smoothed using a 2 mm Gaussian filter.

Results

Patients

Thirty-one patients were recruited from January 2013 to May 2016 (demographics and clinical data are given in Tables 1 and 2). Exclusions were due to various technical and surgical difficulties (Fig. 1), leaving 23 evaluable patients with 2–4 biopsies each, for a total of 52 real biopsies (average of 2.3 per patient) and 52 virtual biopsies. Thirty-four biopsies were collected with the Nashold-type needle and 18 were collected with forceps. Summary statistics of the 23 evaluable patients are provided in Table 1.

Table 1.

Patient demographic information

N = 23 Patients
Age, mean 43.9 ± 16.9 y; range 21–80
Sex 14 Females (42.3 ± 16.0 y; range 21–75)
9 Males (46.4 ± 18.8 y; range 28–80)
Ethnicity (white/black/Hispanic/Asian) 17/2/4/0
WHO grade (I/II/III/IV) 0/7/9/7
Biopsy samples/patient 2.26 ± 0.54
No. of biopsy samples in final analysis 52

Patients were prospectively recruited between January 2013 and May 2016. A summary of the 23 evaluable patients is listed here and a summary of exclusions is given in Fig. 1.

Table 2.

Pathology description

Patient WHO Grade # Patients # Real Samples Ki-67 (% Pos. Nuclei)
II 7 13 2.36 ± 1.07
III 9 21 6.04 ± 3.93
IV 7 18 12.59 ± 13.17
Total 23 52 7.39 ± 9.00

Number of real biopsy samples in each WHO grade, along with quantitative pathology values (mean ± SD) for both real and virtual biopsies.

Fig. 1.

Fig. 1

Patient recruitment. Among the 26 patients with successful tissue harvest of a total of 64 biopsies, additional patient exclusion occurred due to missing DCE imaging (1 patient, 3 biopsies) and missing histological values due to lack of analyzable tissue (2 patients, 4 total biopsies). Further exclusion of biopsies occurred due to poor VOI placement (n = 3 biopsies), insufficient quality of tissue for pathology analysis (n = 1), and missing grade (n = 1). When a biopsy site was excluded, its corresponding imputed virtual biopsy site was also excluded. Note that 5 biopsies had SWAN data imputed as median values. In the final analysis, the 52 evaluable samples were repeatedly split into 80% training and 20% validation datasets for cross validation of variable selection and model building.

Biopsy Sites—Selection and Pathology Analysis

Patients were fairly evenly distributed between low-, mid-, and high-grade tumors. We had 13 biopsies from 7 grade II gliomas, 21 biopsies from 9 grade III gliomas, and 18 biopsies from 7 grade IV gliomas. The lack of grade I disease is expected in a non-pediatric population. The average Ki-67 measured was 7.39% and correlated with tumor grade (Table 2).

Image Sequences, Processing, Registration, Normalization, and VOI Extraction of Imaging Values

SWAN image coverage was lacking for 5 biopsies in 2 patients and was imputed as median values. The composite spatial distribution of real and virtual biopsies is shown in Supplementary Fig. 2.

Variable Reduction and Modeling Results

There were strong correlations between images and derivatives from the same family (Supplementary Fig. 4A). Variable reduction using the random forest variable importance ranking reduced the variability in R2 values across folds compared with using all variables, which suggests increased generalizability (Table 3). The variable selections were also highly consistent between folds of cross validation (Supplementary Table 6). Principal component analysis (Supplementary section S2.0 and Supplementary Table 7) was attempted but did not improve model performance.

Table 3.

Predictive modeling performance

Metric Variables (#) Conventional Conventional Plus Advanced
R2 All variables 0.508 ± 0.093 (6) 0.660 ± 0.201 (23)
RF rank 0.504 ± 0.065 (4) 0.749 ± 0.137 (4)
RF (fixed variables) 0.496 ± 0.083 (T2, T1C, FLAIR, T1) 0.747 ± 0.168 (T2, FA, CBF, Ktrans)
RMSE (%) All variables 5.34 ± 2.25 4.19 ± 1.78
RF rank 5.47 ± 2.33 3.85 ± 1.82
RF (fixed variables) 5.40 ± 2.34 3.46 ± 1.39
Max error across all folds (%) All variables 30.50 27.03
RF rank 28.94 24.93
RF (fixed variables) 28.75 20.49

RF = random forest. Summary of the performance of the random forest model using variables selected by random forest rankings. Accuracy was estimated using 5-fold cross validation. Conventional = conventional MR only (T1, T2, T1 contrast enhanced, FLAIR, SWAN, T2*). Conventional plus advanced = conventional imaging plus diffusion, permeability and perfusion imaging. Numbers in parentheses indicate the number of variables included to obtain the predictive performance stated (see text).

As a baseline we used all 23 imaging inputs and found an average predictive performance of R2 = 0.660 ± 0.201, RMSE = 4.19% ± 1.78%. In order to reduce the number of input variables, we started with all of the conventional plus advanced imaging and selected the dominant predictor from each family of sequences. For conventional, diffusion imaging, and perfusion imaging T2, FA, and CBF were selected in all 5 folds of cross validation. For permeability (DCE) imaging, Ktrans was selected in 3 of 5 folds, making it the dominant predictor. The predicted and observed values using a random forest trained on these single dominant predictors from each family within each fold were significantly correlated (P < 0.0001) with R2 = 0.749 ± 0.137. The RMSE was 3.85% ± 1.82% and the maximum error in any fold was 24.93%, summarized in Table 3.

To assess the benefit of the advanced imaging in predicting Ki-67, we repeated the cross validation using conventional imaging data only. The top 4 predictors were T2, T1C, FLAIR, and T1 in 3 folds, 1 fold selected T2* instead of T1C, and 1 fold selected T2* and SWAN instead of T1 and T1C. This model performed slightly worse than the model based on conventional plus advanced imaging with R2 = 0.504 ± 0.065 and a slightly higher RMSE of 5.47% ± 2.33%. The correlation between predicted and observed values was still significant (P < 0.001).

Finally, we aggregated the results of the repeated variable selection from each of the 5 folds of cross validation to create a single set of input parameters for clinical use. From conventional, diffusion, and perfusion imaging, T2-weighted, FA, and CBF, respectively, were obvious choices. For permeability imaging we chose Ktrans because it was selected in 60% of folds. We repeated the cross validation using these 4 variables (T2, FA, CBF, Ktrans; Supplementary Fig. 4B) on every fold. With these fixed variable selections, the model performance increased to R2 = 0.747 and RMSE of 3.46%. Similarly, when we considered conventional sequences only, the variable selections were highly consistent and we selected T2, T1C, FLAIR, and T1 as the conventional fixed variable set (Supplementary Fig. 4C). However, using only these variables left the model performance unchanged at R2 = 0.496 and RMSE of 5.4%. The predicted and observed Ki-67 for the final models are shown in Fig. 2, demonstrating the superiority of conventional plus advanced over conventional imaging only, using these fixed variable sets.

Fig. 2.

Fig. 2

Model predictions. Plots show the observed Ki-67 at each biopsy location compared with predicted Ki-67 from a random forest using 5-fold cross validation. Each point represents a prediction made on a member of a fold not used for training during cross validation. The dashed lines indicate perfect agreement (predicted equals observed) and the solid line is a best-fit. In both plots the correlation between predicted and observed values is significant (P < 0.0001). (A) The final model using 4 imaging inputs (T2, FA, CBF, Ktrans) based on conventional plus advanced imaging gives good estimates of Ki-67 compared with the ground truth immunohistochemistry (best fit line: predicted = 0.680 * observed + 1.10, R2 = 0.771). (B) Using conventional imaging data, only the model performance decreases, as shown by the weakened correlation between predicted and observed values (best fit line: predicted = 0.426 * observed + 2.09, R2 = 0.397). There are also considerably more outliers using conventional imaging only. The R2 values for these plots are comparable to the average R2 values from cross validation listed in Table 3, which calculates the correlations between predicted and observed values for each fold separately and averages.

Maps of Predicted Ki-67 Expression

Ki-67 estimate maps were generated for all patients (representative examples are shown in Fig. 3) with observations: first, the highest estimated Ki-67 was almost always inside the contrast enhancing volume for enhancing tumors (panel B). This is consistent with clinically known proliferation-leakage associations. Second, for some non-enhancing tumors, areas of high estimated Ki-67 provided a clear biopsy target, often different from the closest-to-brain-surface rule usually employed (panel A). Finally, for some low-grade (II) tumors, estimated Ki-67 was homogeneous and low with no local maximum, reflective of their low actual Ki-67 values and quiescent biology (panel C).

Fig. 3.

Fig. 3

Quantitative maps. Ki-67 map generated using the random forest trained on the 4 imaging inputs shown in axial view: T2-weighted image, FA, Ktrans transfer constant, and CBF. Areas of increased estimated Ki-67 (white arrow) likely contain more actively dividing tumor cells and provide a better target for biopsy. In tumors like (A), a grade III anaplastic astrocytoma, sampling the most malignant area of the tumor is important for correctly grading the tumor. This tumor did not demonstrate contrast enhancement, which is clinically used for biopsy target selection, when present. (B) for enhancing tumors (glioblastoma multiforme shown), the highest estimated Ki-67 generally corresponds to enhancement (the currently used clinical target for biopsy selection). (C) For low-grade tumors (shown grade II diffuse glioma) the estimated (and measured) Ki-67 is generally low. Seemingly raised Ki-67 values in the sulci and cisterns are artefactual, but are unlikely to confuse an experienced clinician.

Discussion

Predictive models using conventional plus advanced imaging and a random forest model can estimate the Ki-67 expression of biopsies across all grades with an R2 of 0.75 and an average error of about 3.5%. This error is much smaller than the range of Ki-67 values found in our samples (0.13% to 39.9%). Therefore, it is reasonable to expect that extrapolated maps like Fig. 3 should accurately localize regions of high and low Ki-67 expression prospectively in new patients using only the dominant 4 input variables (T2, FA, Ktrans, and CBF). When only conventional imaging data are used for predictions, R2 is 0.50 and the error is 5.4%. These results indicate: (i) imaging data can be used to make clinically useful predictions of a quantitative pathological parameter, Ki-67, (ii) advanced imaging (diffusion, perfusion, and permeability) adds considerable value to the predictive accuracies obtained over conventional imaging alone, and (iii) graphical maps of Ki-67 expression can be generated to aid image-guided therapies.

Our results generally agree with other studies comparing Ki-67 measurements with imaging. Studies by Price and Barajas found that perfusion imaging (CBV) was significantly correlated with Ki-67 in stereotactic biopsies of glioblastoma patients.35,36 We also found this correlation, but in our final model we selected CBF, a near relation of CBV, since it appeared more often in our variable selection procedure. Barajas also found T1 contrast enhancement to be associated with proliferation, similar to our findings (Fig. 3B). However, in our study we found T2-weighted image intensity to be a better predictor of Ki-67. This might be because we combined values from a variety of grades and normal brain, as opposed to sampling from the enhancing volumes of glioblastomas.

Ki-67 estimation accuracy improved when advanced imaging data were added to conventional imaging data. The informative nature of advanced imaging agrees with clinical practice. However, the quantification of this improvement (R2 improved from 0.496 to 0.747 and error from 5.4% to 3.5%) has to our knowledge never been published previously. This provides quantitative evidence for the usefulness of advanced imaging in addition to routine conventional imaging.

Ki-67 predictions, presented as graphical maps, are clinically useful to plan image-guided therapies, and can help overcome the pitfall of undersampling.37 For this purpose, it stands independent of other prognostic factors like IDH1,6,38 and correlates well with tumor grade.39 We are not aware of previous work presenting Ki-67 predictive models in a graphical format, although nuclear density maps and tumor infiltration maps have been presented in work by Durst,40 Akbari,41 and Chang.42 However, Akbari and colleagues had no histopathologic ground truth in their modeling and the study by Chang et al included only T1 post-contrast, FLAIR, and ADC measurements. Both Durst and Chang report correlations between predicted and observed values around R2 ~ 0.55, considerably lower than our result of R2 = 0.75. In the same way that clinical imaging is used to inform and guide treatment, we propose maps like the ones presented currently as a tool to improve the ability for clinicians to deliver effective therapy.

Limitations

Our study has a relatively small dataset (n = 23 patients, 52 biopsy sites) but still did yield statistically significant results and correlations. Stereotactic biopsy collection is expensive and time-consuming. The use of contralateral normal VOIs could be challenged due to the fact they are not real biopsies. However, this practice is justified based on well-documented pathology values for normal tissue and the ethical constraints of removing of tissue from normal brain.

We had fewer high Ki-67 expressing biopsies than preferred, even though the clinical tumor grades were well distributed. This deficit reflects the clinical problem of sampling error and false undergrading in the face of tumor heterogeneity, as clinical tumor behavior follows the highest grade present. This reduces our predictive power for predicting very high Ki-67 and indicates the need for larger datasets.

Future Work

While we are able to achieve accurate estimation of Ki-67, there are still many challenges to overcome before a “virtual” biopsy from a mathematical model can compete with gold standard tissue biopsy. Perhaps one of the greatest is overcoming the substantial variability in MR image data across vendor platforms, sequence parameters, and institutions. Dynamic contrast imaging is further complicated by sensitivity to post-processing techniques. Training data for our models were carefully collected and curated to minimize these sources of uncertainty, which allows better results but limits the immediate generalizability. The next logical step is a validation trial where prospective predictions are compared with the results of tissue biopsy for patients outside the training cohort.

We foresee that much better predictive models could be constructed in future with more data of similar quality. Further future goals will link our imaging information to genomic information, 1p/19q codeletion status, and methylation status of O6-methylguanine-DNA methyltransferase, as well as clinical information, such as survival43 and recurrence.41 Another common target for imaging-genomic correlation is IDH1 mutation status. However, IDH mutations are known to be very homogeneous within tumors and thus predictions of IDH status are likely better suited to whole-tumor analysis rather than to spatially precise tissue samples.44

In conclusion, a few well-chosen clinical MR sequences with machine learning modeling can be used to quantitatively estimate the histopathologic proliferation marker, Ki-67, to clinically useful accuracies in glioma patients. Ki-67 predictions can be presented as graphical maps for image-guided therapies such as biopsy, surgery, and radiation, and have great potential to improve the care of glioma patients. Further clinical trials are justified to verify and build on these findings.

Funding

This work was partially supported by the National Institutes of Health and National Cancer Institute grants R01-CA194391, R01-CA160736 and Cancer Center Support grant P30 CA016672 (Biostatistics Shared Resource) to V.B. The Baylor College of Medicine Medical Scientist Training Program and the Cullen Trust for Higher Education Physician/Scientist Fellowship Programs supported J.S.L. during the duration of this research. The John S. Dunn Sr. Distinguished Chair in Diagnostic Imaging (to Dr William A Murphy) and M D Anderson Cancer Center Internal Research Grant and Clinical Research Support mechanisms contributed funding to support data collection.

Conflict of interest statement.

The authors disclose no conflicts of interest related to this work.

Authorship statement.

Study design: EDHG, JSL, JW, SP, JH, DS. Figures: EDHG, JSL, GNF, DF, DS. Data collection, analysis, interpretation: EDHG, JSL, JW, SP, JDH, JH, GNF, VB, DF, DS. Manuscript writing: EDHG, JSL, JDH, JH, GNF, DF, DS.

Supplementary Material

Supplementary Figures
Supplementary Material

References

  • 1. Burger PC, Shibata T, Kleihues P. The use of the monoclonal antibody Ki-67 in the identification of proliferating cells: application to surgical neuropathology. Am J Surg Pathol. 1986;10(9):611–617. [DOI] [PubMed] [Google Scholar]
  • 2. Parkins CS, Darling JL, Gill SS, Revesz T, Thomas DG. Cell proliferation in serial biopsies through human malignant brain tumours: measurement using Ki-67 antibody labelling. Br J Neurosurg. 1991;5(3):289–298. [DOI] [PubMed] [Google Scholar]
  • 3. Skjulsvik AJ, Mørk JN, Torp MO, Torp SH. Ki-67/MIB-1 immunostaining in a cohort of human gliomas. Int J Clin Exp Pathol. 2014;7(12):8905–8910. [PMC free article] [PubMed] [Google Scholar]
  • 4. Torp SH. Diagnostic and prognostic role of Ki-67 immunostaining in human astrocytomas using four different antibodies. Clin Neuropathol. 2002;21(6):252–257. [PubMed] [Google Scholar]
  • 5. Louis DN, Perry A, Reifenberger G, et al. . The 2016 World Health Organization classification of tumors of the central nervous system: a summary. Acta Neuropathol. 2016;131(6):803–820. [DOI] [PubMed] [Google Scholar]
  • 6. Eckel-Passow JE, Lachance DH, Molinaro AM, et al. . Glioma groups based on 1p/19q, IDH, and TERT promoter mutations in tumors. N Engl J Med. 2015;372(26):2499–2508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Jackson RJ, Fuller GN, Abi-Said D, et al. . Limitations of stereotactic biopsy in the initial management of gliomas. Neuro Oncol. 2001;3(3):193–200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Villanueva-Meyer JE, Mabray MC, Cha S. Current clinical brain tumor imaging. Neurosurgery. 2017;81(3):397–415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Cattoretti G, Becker MH, Key G, et al. . Monoclonal antibodies against recombinant parts of the Ki-67 antigen (MIB 1 and MIB 3) detect proliferating cells in microwave-processed formalin-fixed paraffin sections. J Pathol. 1992;168(4):357–363. [DOI] [PubMed] [Google Scholar]
  • 10. Katsuzo K, Nobuya M, Takashi M, et al. . Immunohistochemical demonstration of DNA polymerase α in human brain-tumor cells. J Neurosurgery. 1990;72(2):268–272. [DOI] [PubMed] [Google Scholar]
  • 11. Park SH, Suh YL. Expression of cyclin A and topoisomerase IIalpha of oligodendrogliomas is correlated with tumour grade, MIB-1 labelling index and survival. Histopathology. 2003;42(4):395–402. [DOI] [PubMed] [Google Scholar]
  • 12. Gerdes J, Schwab U, Lemke H, Stein H. Production of a mouse monoclonal antibody reactive with a human nuclear antigen associated with cell proliferation. Int J Cancer. 1983;31(1):13–20. [DOI] [PubMed] [Google Scholar]
  • 13. Aronen HJ, Gazit IE, Louis DN, et al. . Cerebral blood volume maps of gliomas: comparison with tumor grade and histologic findings. Radiology. 1994;191(1):41–51. [DOI] [PubMed] [Google Scholar]
  • 14. Brix G, Semmler W, Port R, Schad LR, Layer G, Lorenz WJ. Pharmacokinetic parameters in CNS Gd-DTPA enhanced MR imaging. J Comput Assist Tomogr. 1991;15(4):621–628. [DOI] [PubMed] [Google Scholar]
  • 15. Tofts PS. Modeling tracer kinetics in dynamic Gd-DTPA MR imaging. J Magn Reson Imaging. 1997;7(1):91–101. [DOI] [PubMed] [Google Scholar]
  • 16. Zhao M, Guo LL, Huang N, et al. . Quantitative analysis of permeability for glioma grading using dynamic contrast-enhanced magnetic resonance imaging. Oncol Lett. 2017;14(5):5418–5426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Prah MA, Al-Gizawiy MM, Mueller WM, et al. . Spatial discrimination of glioblastoma and treatment effect with histologically-validated perfusion and diffusion magnetic resonance imaging metrics. J Neurooncol. 2018;136(1):13–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Brown RW, Haacke EM, Cheng Y-CN, Thompson MR, Venkatesan R.. Magnetic Resonance Imaging: Physical Principles and Sequence Design. Hoboken, NJ: John Wiley & Sons; 2014. [Google Scholar]
  • 19. Arlinghaus L, Yankeelov TE. Diffusion-weighted MRI. In: Yankeelov TE, Pickens DR, Price RR, eds. Quantitative MRI in Cancer. Boca Raton, FL: Taylor & Francis; 2011:81–98. [Google Scholar]
  • 20. Basser PJ, Pierpaoli C. Microstructural and physiological features of tissues elucidated by quantitative-diffusion-tensor MRI. J Magn Reson B. 1996;111(3):209–219. [DOI] [PubMed] [Google Scholar]
  • 21. Moseley ME, Cohen Y, Kucharczyk J, et al. . Diffusion-weighted MR imaging of anisotropic water diffusion in cat central nervous system. Radiology. 1990;176(2):439–445. [DOI] [PubMed] [Google Scholar]
  • 22. Tofts PS, Brix G, Buckley DL, et al. . Estimating kinetic parameters from dynamic contrast-enhanced T(1)-weighted MRI of a diffusable tracer: standardized quantities and symbols. J Magn Reson Imaging. 1999;10(3):223–232. [DOI] [PubMed] [Google Scholar]
  • 23. Sung YS, Park B, Choi Y, et al. . Dynamic contrast-enhanced MRI for oncology drug development. J Magn Reson Imaging. 2016;44(2):251–264. [DOI] [PubMed] [Google Scholar]
  • 24. Sourbron SP, Buckley DL. Classic models for dynamic contrast-enhanced MRI. NMR Biomed. 2013;26(8):1004–1027. [DOI] [PubMed] [Google Scholar]
  • 25. Ostergaard L, Weisskoff RM, Chesler DA, Gyldensted C, Rosen BR. High resolution measurement of cerebral blood flow using intravascular tracer bolus passages. Part I: Mathematical approach and statistical analysis. Magn Reson Med. 1996;36(5):715–725. [DOI] [PubMed] [Google Scholar]
  • 26. Ostergaard L, Sorensen AG, Kwong KK, Weisskoff RM, Gyldensted C, Rosen BR. High resolution measurement of cerebral blood flow using intravascular tracer bolus passages. Part II: experimental comparison and preliminary results. Magn Reson Med. 1996;36(5):726–736. [DOI] [PubMed] [Google Scholar]
  • 27. Boxerman JL, Schmainda KM, Weisskoff RM. Relative cerebral blood volume maps corrected for contrast agent extravasation significantly correlate with glioma tumor grade, whereas uncorrected maps do not. AJNR Am J Neuroradiol. 2006;27(4):859–867. [PMC free article] [PubMed] [Google Scholar]
  • 28. Lin JS, Fuentes DT, Chandler A, et al. . Performance assessment for brain MR imaging registration methods. AJNR Am J Neuroradiol. 2017;38(5):973–980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Leung KK, Clarkson MJ, Bartlett JW, et al. ; Alzheimer’s Disease Neuroimaging Initiative Robust atrophy rate measurement in Alzheimer’s disease using multi-site serial MRI: tissue-specific intensity normalization and parameter selection. Neuroimage. 2010;50(2):516–523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. R Core Team R: A Language and Environment for Statistical Computing [computer program]. Vienna, Austria: R Foundation for Statistical Computing; 2013. [Google Scholar]
  • 31. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts HJ. Machine learning methods for quantitative radiomic biomarkers. Sci Rep. 2015; 5:13087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. McFadden D. Conditional logit analysis of qualitative choice behavior. Frontiers in Econometrics. New York: Academic Press; 1973:105–142. [Google Scholar]
  • 33. Cortes C, Vapnik V. Support-vector networks. Machine Learning. 1995;20(3):273–297. [Google Scholar]
  • 34. Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Series B Stat Methodol. 1996;58(1):267–288. [Google Scholar]
  • 35. Price SJ, Green HA, Dean AF, Joseph J, Hutchinson PJ, Gillard JH. Correlation of MR relative cerebral blood volume measurements with cellular density and proliferation in high-grade gliomas: an image-guided biopsy study. AJNR Am J Neuroradiol. 2011;32(3):501–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Barajas RF Jr, Phillips JJ, Parvataneni R, et al. . Regional variation in histopathologic features of tumor specimens from treatment-naive glioblastoma correlates with anatomic and physiologic MR imaging. Neuro Oncol. 2012;14(7):942–954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Parker NR, Khong P, Parkinson JF, Howell VM, Wheeler HR. Molecular heterogeneity in glioblastoma: potential clinical implications. Front Oncol. 2015;5:55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Ichimura K, Pearson DM, Kocialkowski S, et al. . IDH1 mutations are present in the majority of common adult gliomas but rare in primary glioblastomas. Neuro Oncol. 2009;11(4):341–347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Coons SW, Johnson PC. Regional heterogeneity in the proliferative activity of human gliomas as measured by the Ki-67 labeling index. J Neuropathol Exp Neurol. 1993;52(6):609–618. [DOI] [PubMed] [Google Scholar]
  • 40. Durst CR, Raghavan P, Shaffrey ME, et al. . Multimodal MR imaging model to predict tumor infiltration in patients with gliomas. Neuroradiology. 2014;56(2):107–115. [DOI] [PubMed] [Google Scholar]
  • 41. Akbari H, Macyszyn L, Da X, et al. . Imaging surrogates of infiltration obtained via multiparametric imaging pattern analysis predict subsequent location of recurrence of glioblastoma. Neurosurgery. 2016;78(4):572–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Chang PD, Malone HR, Bowden SG, et al. . A multiparametric model for mapping cellularity in glioblastoma using radiographically localized biopsies. AJNR Am J Neuroradiol. 2017;38(5):890–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Kickingereder P, Burth S, Wick A, et al. . Radiomic profiling of glioblastoma: identifying an imaging predictor of patient survival with improved performance over established clinical and radiologic risk models. Radiology. 2016;280(3):880–9. [DOI] [PubMed] [Google Scholar]
  • 44. Zhang X, Yan LF, Hu YC, et al. . Optimizing a machine learning based glioma grading system using multi-parametric MRI histogram and texture features. Oncotarget. 2017;8(29):47816–47830. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Figures
Supplementary Material

Articles from Neuro-Oncology are provided here courtesy of Society for Neuro-Oncology and Oxford University Press

RESOURCES